Disease outbreak documents as a source of queries for detection of signals of disease emergence on the Internet. [391]

Arsevska Elena, Lefrançois Thierry, Lancelot Renaud, Roche Mathieu, Chavernac David, Falala Sylvain, Hendrikx Pascal, Dufour Barbara. 2015. Disease outbreak documents as a source of queries for detection of signals of disease emergence on the Internet. [391]. In : 14th Conference of the International Society for Veterinary Epidemiology and Economics: planning our future. Mérida : ISVEE, Résumé, 1 p. ISVEE : Veterinary epidemiology and economics: Planning our future. 14, Mérida, Mexique, 3 November 2015/7 November 2015.

Paper without proceedings
Published version - Anglais
Use under authorization by the author or CIRAD.

Télécharger (69kB) | Preview
[img] Published version - Anglais
Access restricted to CIRAD agents
Use under authorization by the author or CIRAD.

Télécharger (68kB) | Request a copy

Abstract : Timeliness and precision in detecting exotic animal infectious disease outbreaks is crucial for preventing their spread. In 2013, the French national platform for animal disease surveillance has set up an international epidemiological intelligence team (so-called VSI team) aiming at detecting, verifying and monitoring signals of disease emergence from different sources of information, including the Internet. We propose an innovative method for monitoring disease emergence on the Internet. It is based on 3sequential steps:1) web crawling,2) automatic classification of disease outbreak documents by machine learning approaches,3) extraction of information from documents(e.g., disease, number of cases, location, etc.).To query the web, the choice of relevant terms is crucial. For this purpose, we used text mining together with a collective domain expertise following a Delphi method. This approach allowed highlighting the relevant terms to detect signals of disease emergence on the Internet. We have applied it to detect documents addressing African swine fever (ASF) outbreaks(i.e. 123 dispatches from Google, and 45 from PubMed) written in English language, obtained for the period 2011-2014 with the baseline query “African swine fever outbreak”. Based on 2400 terms extracted with the text-mining approach, our automatic search system associated with the collective domain expertise (i.e. evaluation of 20 groups of terms by 21 specialists) identified 3 groups of highly specific terms to detect signals of ASF emergence:1) haemorrhagic fever in Suidae, 2) mortality in Suidae and 3) swine fever. Implemented as complex queries, these groups of terms allowed finding previously undetected ASF outbreak articles with the baseline query (period 2011-14):3for each of groups 1 and 2, vs.54 for group 3.Monitoring disease emergence on the Internet is a promising method towards improved disease introduction risk assessment. Nevertheless, domain experts still play a central role. Our method is generic: we intend to evaluate it on data from other exotic infectious diseases and with real-time data stream. Should this evaluation be successful, the method might be routinely used by the VSI team. (Texte intégral)

Classification Agris : C30 - Documentation and information
L73 - Animal diseases
000 - Autres thèmes

Auteurs et affiliations

  • Arsevska Elena, CIRAD-BIOS-UMR CMAEE (FRA)
  • Lefrançois Thierry, CIRAD-BIOS-UMR CMAEE (FRA) ORCID: 0000-0001-8793-5228
  • Lancelot Renaud, CIRAD-BIOS-UMR CMAEE (FRA)
  • Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568
  • Chavernac David, CIRAD-BIOS-UMR CMAEE (FRA)
  • Falala Sylvain, INRA (FRA)
  • Hendrikx Pascal, ANSES (FRA)
  • Dufour Barbara, ENVA (FRA)

Source : Cirad-Agritrop (

View Item (staff only) View Item (staff only)

[ Page générée et mise en cache le 2019-10-06 ]