Fize Jacques, Roche Mathieu, Teisseire Maguelonne.
2018. Gemedoc: A text similarity annotation platform.
In : Natural language processing and information systems: 23rd International Conference on Applications of Natural Language to Information Systems, NLDB 2018, Paris, France, June 13-15, 2018, Proceedings. Silberztein Max (ed.), Atigui Faten (ed.), Kornyshova Elena (ed.), Métais Elisabeth (ed.), Meziane Farid (ed.). CNAM
Version publiée
- Anglais
Accès réservé aux personnels Cirad Utilisation soumise à autorisation de l'auteur ou du Cirad. Fize_et_al_NLDB_2018.pdf Télécharger (832kB) | Demander une copie |
Résumé : We present Gemedoc, a platform for text similarity annotation based on the spatial and the thematic dimension. To this end, a two-step annotation protocol was designed to assess the similarity between two documents: (1) identification of salient features according to the two analysis dimensions; (2) similarity assessment according to a 4-degree scale. Ultimately, the labeled data retrieved from different corpora could be used as benchmark for text-mining applications.
Mots-clés libres : Spatial data mining, Heterogeneous data, Text mining
Auteurs et affiliations
- Fize Jacques, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-1783-934X
- Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568
- Teisseire Maguelonne, IRSTEA (FRA)
Autres liens de la publication
Source : Cirad-Agritrop (https://agritrop.cirad.fr/588240/)
[ Page générée et mise en cache le 2024-11-21 ]