Fize Jacques, Shrivastava Gaurav.
2017. GeoDict: an integrated gazetteer.
In: Proceedings of the Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017). Francesca Frontini (ed.), Larisa Grčić Simeunović (ed.), Špela Vintar (ed.), Fahad Khan (ed.), Artemis Parvisi (ed.)
Published version
- English
Access restricted to Cirad staff. Use subject to authorization by the author or Cirad. geodict2017.pdf (323kB)
Dataset URL (Cirad Dataverse): https://doi.org/10.18167/DVN1/MWQQOQ
Abstract: Spatial analysis of text is now widely considered important by both researchers and users. In certain fields, such as epidemiology, extracting spatial information from text is crucial, and both resources and methods are needed. Most spatial analysis processes rely on a gazetteer: a data source in which toponyms (place names) are associated with concepts and their geographic footprints. Unfortunately, most publicly available gazetteers are incomplete because of their initial purpose. Hence, we propose GeoDict, a customizable integrated gazetteer that contains basic yet precise information (multilingual labels, administrative boundary polygons, etc.). We show its utility for geoparsing (the extraction of spatial entities from text). An early evaluation on toponym resolution shows promising results.
Free keywords: Gazetteer, Spatial entities, Geonames, DBpedia, Wikidata
Agris classification: C30 - Documentation and information
B10 - Geography
Authors and affiliations
- Fize Jacques, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-1783-934X
- Shrivastava Gaurav, Birla Institute of Science and Technology (IND)
Source: Cirad-Agritrop (https://agritrop.cirad.fr/586514/)