Agritrop
Accueil

Xart system: discovering and extracting correlated arguments of n-ary relations from text

Berrahou Soumia Lilia, Buche Patrice, Dibie-Barthélemy Juliette, Roche Mathieu. 2016. Xart system: discovering and extracting correlated arguments of n-ary relations from text. In : Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics. Akerkar Rajendra (ed.), Plantié Michel (ed.), Ranwez Sylvie (ed.), Harispe Sébastein (ed.), Laurent Anne (ed.), Bellot Patrice (ed.), Montmain Jacky (ed.), Trousset François (ed.). Ecole des mines Alès. New-York : ACM, 12 p. ISBN 978-1-4503-4056-4 International Conference on Web Intelligence, Mining and Semantics (WIMS 2016). 6, Nîmes, France, 13 Septembre 2016/15 Septembre 2016.

Communication avec actes
[img] Version publiée - Anglais
Accès réservé aux agents Cirad
Utilisation soumise à autorisation de l'auteur ou du Cirad.
WIMS16.pdf

Télécharger (792kB) | Demander une copie

Url - éditeur : http://dl.acm.org/citation.cfm?id=2912855&dl=ACM&coll=DL&CFID=808170115&CFTOKEN=73638767

Résumé : In this paper, we present Xart system based on a hybrid method using data mining approaches and syntactic analysis to automatically discover and extract relevant information modeled as n-ary relations from text. A n-ary relation links a studied object with its features considered as several arguments. Our work focuses on extracting quantitative arguments associated with their attributes, i.e. a numerical value and a unit of measure, in order to populate a domain Ontological and Terminological Resource (OTR) with new instances. The proposed method relies on the discovery of correlated arguments in text using sequential pattern mining and the OTR. Then, those Ontological Sequential Patterns (OSP) are enriched with specific syntactic relations in order to construct Ontological Linguistic Sequential Patterns (OLSP) where the arguments are expressed according to different levels of term abstraction (term, grammatical cate- gory and concept). We have made concluding experiments on a corpus from food packaging domain where relevant data to be extracted are experimental results on packagings. We have been able to extract up to 4 correlated arguments with a F-measure from 0.6 to 0.8.

Mots-clés libres : Text mining, Data mining, Sequential patterns, Syntactic parsing, Natural language processing

Classification Agris : C30 - Documentation et information
000 - Autres thèmes
U10 - Informatique, mathématiques et statistiques
U30 - Méthodes de recherche

Auteurs et affiliations

  • Berrahou Soumia Lilia, CIRAD-PERSYST-UMR IATE (FRA)
  • Buche Patrice, Montpellier SupAgro (FRA)
  • Dibie-Barthélemy Juliette, INRA (FRA)
  • Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568

Source : Cirad-Agritrop (https://agritrop.cirad.fr/580893/)

Voir la notice (accès réservé à la Dist) Voir la notice (accès réservé à la Dist)

[ Page générée et mise en cache le 2022-04-16 ]