Valentin Sarah, Arsevska Elena, Mercier Alize, Falala Sylvain, Rabatel Julien, Lancelot Renaud, Roche Mathieu.
2020. PADI-web: An event-based surveillance system for detecting, classifying and processing online news.
In : Human language technology. Challenges for computer science and linguistics. Vetulani Zygmunt (ed.), Paroubek Patrick (ed.), Kubis Marek (ed.)
![]() |
Version publiée
- Anglais
Accès réservé aux personnels Cirad Utilisation soumise à autorisation de l'auteur ou du Cirad. Valentin_PADI-Web_LTC_LNCS_2020.pdf Télécharger (1MB) | Demander une copie |
Résumé : The Platform for Automated Extraction of Animal Disease Information from the Web (PADI-web) is a multilingual text mining tool for automatic detection, classification, and extraction of disease outbreak information from online news articles. PADI-web currently monitors the Web for nine animal infectious diseases and eight syndromes in five animal hosts. The classification module is based on a supervised machine learning approach to filter the relevant news with an overall accuracy of 0.94. The classification of relevant news between 5 topic categories (confirmed, suspected or unknown outbreak, preparedness and impact) obtained an overall accuracy of 0.75. In the first six months of its implementation (January–June 2016), PADI-web detected 73% of the outbreaks of African swine fever; 20% of foot-and-mouth disease; 13% of bluetongue, and 62% of highly pathogenic avian influenza. The information extraction module of PADI-web obtained F-scores of 0.80 for locations, 0.85 for dates, 0.95 for diseases, 0.95 for hosts, and 0.85 for case numbers. PADI-web allows complementary disease surveillance in the domain of animal health.
Mots-clés libres : Epidemic intelligence, Animal health, Web monitoring, Text mining, Classification, Information extraction
Auteurs et affiliations
-
Valentin Sarah, CIRAD-BIOS-UMR ASTRE (FRA)
ORCID: 0000-0002-9028-681X
-
Arsevska Elena, CIRAD-BIOS-UMR ASTRE (FRA)
ORCID: 0000-0002-6693-2316
- Mercier Alize, CIRAD-BIOS-UMR ASTRE (FRA)
- Falala Sylvain, INRA (FRA)
- Rabatel Julien
-
Lancelot Renaud, CIRAD-BIOS-UMR ASTRE (FRA)
ORCID: 0000-0002-5826-5242
-
Roche Mathieu, CIRAD-ES-UMR TETIS (FRA)
ORCID: 0000-0003-3272-8568
Autres liens de la publication
Source : Cirad-Agritrop (https://agritrop.cirad.fr/597344/)
[ Page générée et mise en cache le 2025-02-03 ]