Syed Mehtab Alam, Arsevska Elena, Roche Mathieu, Teissere Maguelonne.
2023. A metadata approach to classify domain-specific documents for Event-based Surveillance Systems.
In : 2023 International Conference on Communication, Computing and Digital Systems (C-CODE 2023). IEEE
Version publiée
- Anglais
Accès réservé aux personnels Cirad Sous licence . Metadata_classification_camera.pdf Télécharger (295kB) | Demander une copie |
Résumé : Digital news sources are the primary source of information for health officials and stakeholders to stay informed about potential health risks. However, with the abundance of news sources available, it can be challenging to distinguish relevant news articles from irrelevant ones. To address this issue, we propose a metadata-based approach for classifying news articles containing information on health events. The first step involves extracting metadata from each news article in the dataset. We then use a machine learning model to classify news articles as relevant or irrelevant. The proposed approach was validated using two different datasets with varying combinations of relevant and irrelevant news articles. The experiments were conducted using a 70%-30% train-test split. The results of the experiments show that the proposed approach is highly effective in classifying relevant news articles for Event-based Surveillance System (EBS). Additionally, several metadata features were identified as being important for the classification task.
Mots-clés libres : Natural Language Processing, Metadata, Feature selection, Machine Learning, Event-based Surveillance System
Agences de financement européennes : European Commission
Projets sur financement : (EU) MOnitoring Outbreak events for Disease surveillance in a data science context
Auteurs et affiliations
- Syed Mehtab Alam, CIRAD-ES-UMR TETIS (FRA)
- Arsevska Elena, CIRAD-BIOS-UMR ASTRE (FRA) ORCID: 0000-0002-6693-2316
- Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568
- Teissere Maguelonne, INRAE (FRA)
Source : Cirad-Agritrop (https://agritrop.cirad.fr/607316/)
[ Page générée et mise en cache le 2024-02-09 ]