Agritrop
Accueil

Labeled entities from social media data related to avian influenza disease

Schaeffer Camille, Interdonato Roberto, Lancelot Renaud, Roche Mathieu, Teisseire Maguelonne. 2022. Labeled entities from social media data related to avian influenza disease. Data in Brief, 43:108317, 7 p.

Article de revue ; Data paper ; Article de revue à facteur d'impact Revue en libre accès total
[img]
Prévisualisation
Version publiée - Anglais
Sous licence Licence Creative Commons.
Camille_Schaeffer_DIB.pdf

Télécharger (288kB) | Prévisualisation

Url - jeu de données - Entrepôt autre : https://doi.org/10.15454/GR5EFS

Résumé : This dataset is composed by spatial (e.g. location) and thematic (e.g. diseases, symptoms, virus) entities concerning avian influenza in social media (textual) data in English. It was created from three corpora: the first one includes 10 transcriptions of YouTube videos and 70 tweets manually annotated. The second corpus is composed by the same textual data but automatically annotated with Named Entity Recognition (NER) tools. These two corpora have been built to evaluate NER tools and apply them to a bigger corpus. The third corpus is composed of 100 YouTube transcriptions automatically annotated with NER tools. The aim of the annotation task is to recognize spatial information such as the names of the cities and epidemiological information such as the names of the diseases. An annotation guideline is provided in order to ensure a unified annotation and to help the annotators. This dataset can be used to train or evaluate Natural Language Processing (NLP) approaches such as specialized entity recognition.

Mots-clés Agrovoc : fouille de textes, médias sociaux, données spatiales, analyse de données, grippe aviaire, épidémiologie

Mots-clés libres : Text Mining, Named entity recognition, Social network, Epidemic intelligence, Twitter, Youtube, Avian influenza

Classification Agris : U10 - Informatique, mathématiques et statistiques
C30 - Documentation et information
L73 - Maladies des animaux

Champ stratégique Cirad : CTS 4 (2019-) - Santé des plantes, des animaux et des écosystèmes

Auteurs et affiliations

  • Schaeffer Camille, INRAE (FRA)
  • Interdonato Roberto, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0002-0536-6277
  • Lancelot Renaud, CIRAD-BIOS-UMR ASTRE (REU)
  • Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568 - auteur correspondant
  • Teisseire Maguelonne, INRAE (FRA)

Source : Cirad-Agritrop (https://agritrop.cirad.fr/601106/)

Voir la notice (accès réservé à Agritrop) Voir la notice (accès réservé à Agritrop)

[ Page générée et mise en cache le 2024-03-12 ]