GeoDict: an integrated gazetteer

Fize Jacques, Shrivastava Gaurav. 2017. GeoDict: an integrated gazetteer. In: Proceedings of Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017). Francesca Frontini (ed.), Larisa Grčić Simeunović (ed.), Špela Vintar (ed.), Fahad Khan (ed.), Artemis Parvisi (ed.). Montpellier: Association for Computational Linguistics, pp. 31-41. Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017), Montpellier, France.

Paper with proceedings; Research article
Published version - English
Access restricted to CIRAD agents
Use under authorization by the author or CIRAD.

Abstract: Spatial analysis of text is now widely considered important by both researchers and users. In certain fields, such as epidemiology, extracting spatial information from text is crucial, and both resources and methods are needed. A gazetteer is a commonly used resource in most spatial analysis processes. It is a data source in which toponyms (place names) are associated with concepts and their geographic footprints. Unfortunately, most publicly available gazetteers are incomplete because of their initial purpose. Hence, we propose GeoDict, a customizable integrated gazetteer that contains basic yet precise information (multilingual labels, administrative boundary polygons, etc.). We show its utility for geoparsing (the extraction of spatial entities from text). An early evaluation on toponym resolution shows promising results. (Author's abstract)

Free keywords: Gazetteer, Spatial entities, Geonames, DBpedia, Wikidata

Agris classification: C30 - Documentation and information
B10 - Geography

Authors and affiliations

  • Fize Jacques, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-1783-934X
  • Shrivastava Gaurav, Birla Institute of Science and Technology (IND)

Source : Cirad-Agritrop (

