Monitoring disease outbreak events on the web using text-mining approach and domain expert knowledge

Arsevska Elena, Roche Mathieu, Falala Sylvain, Lancelot Renaud, Chavernac David, Hendrikx Pascal, Dufour Barbara. 2016. Monitoring disease outbreak events on the web using text-mining approach and domain expert knowledge. In : LREC 2016 Proceedings. Calzolari Nicoletta (ed.), Choukri Khalid (ed.) , Declerck Thierry (ed.) , Grobelnik Marko (ed.), Maegaard Bente (ed.), Mariani Joseph (ed.) , Moreno Asuncion (ed.), Odijk Jan (ed.), Piperidis Stelios (ed.). Paris : ELRA, pp. 3407-3411. ISBN 978-2-9517408-9-1 International Conference on Language Resources and Evaluation. 10, Portoroz, Slovénie, 23 May 2016/28 May 2016.

Paper with proceedings
Published version - Anglais
License Licence Creative Commons.

Télécharger (446kB) | Preview

Abstract : Timeliness and precision for detection of infectious animal disease outbreaks from the information published on the web is crucial for prevention against their spread. We propose a generic method to enrich and extend the use of different expressions as queries in order to improve the acquisition of relevant disease related pages on the web. Our method combines a text mining approach to extract terms from corpora of relevant disease outbreak documents, and domain expert elicitation (Delphi method) to propose expressions and to select relevant combinations between terms obtained with text mining. In this paper we evaluated the performance as queries of a number of expressions obtained with text mining and validated by a domain expert and expressions proposed by a panel of 21 domain experts. We used African swine fever as an infectious animal disease model. The expressions obtained with text mining outperformed as queries the expressions proposed by domain experts. However, domain experts proposed expressions not extracted automatically. Our method is simple to conduct and flexible to adapt to any other animal infectious disease and even in the public health domain. (Résumé d'auteur)

Mots-clés libres : Digital disease detection, Text mining, Delphi method

Classification Agris : L73 - Animal diseases
C30 - Documentation and information
U30 - Research methods

Auteurs et affiliations

  • Arsevska Elena, CIRAD-BIOS-UMR CMAEE (FRA)
  • Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568
  • Falala Sylvain, INRA (FRA)
  • Lancelot Renaud, CIRAD-BIOS-UMR CMAEE (FRA)
  • Chavernac David, CIRAD-BIOS-UMR CMAEE (FRA)
  • Hendrikx Pascal, ANSES (FRA)
  • Dufour Barbara, ENVA (FRA)

Source : Cirad-Agritrop (

View Item (staff only) View Item (staff only)

[ Page générée et mise en cache le 2019-09-30 ]