Syed Mehtab alam, Arsevska Elena, Roche Mathieu, Teisseire Maguelonne.
2024. GeospaCy: A tool for extraction and geographical referencing of spatial expressions in textual data.
In : Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: System demonstrations. EACL
|
Version publiée
- Anglais
Sous licence CC0 1.0 Sans restriction de droits pour le monde entier. 609245.pdf Télécharger (2MB) | Prévisualisation |
Résumé : Spatial information in text enables to understand the geographical context and relationships within text for better decision-making across various domains such as disease surveillance, disaster management and other locationbased services. Therefore, it is crucial to understand the precise geographical context for location-sensitive applications. In response to this necessity, we introduce the GeospaCy software tool, designed for the extraction and georeferencing of spatial information present in textual data. GeospaCy fulfils two primary objectives: 1) Geoparsing, which involves extracting spatial expressions, encompassing place names and associated spatial relations within the text data, and 2) Geocoding, which facilitates the assignment of geographical coordinates to the spatial expressions extracted during the Geoparsing task. Geoparsing is evaluated with a disease news article dataset consisting of event information, whereas a qualitative evaluation of geographical coordinates (polygons/geometries) of spatial expressions is performed by end-users for Geocoding task.
Mots-clés libres : Text Mining, GeoParsing, Geocoding, Spatial entities, Natural Language Processing, One Health, Epidemic intelligence, PADI‐web, Event-based surveillance
Agences de financement européennes : European Commission
Programme de financement européen : H2020
Projets sur financement : (EU) MOnitoring Outbreak events for Disease surveillance in a data science context
Auteurs et affiliations
- Syed Mehtab alam, CIRAD-ES-UMR TETIS (FRA)
- Arsevska Elena, CIRAD-BIOS-UMR ASTRE (FRA) ORCID: 0000-0002-6693-2316
- Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568
- Teisseire Maguelonne, INRAE (FRA)
Source : Cirad-Agritrop (https://agritrop.cirad.fr/609245/)
[ Page générée et mise en cache le 2024-04-19 ]