Lentschat Martin, Buche Patrice, Menut Luc, Guari Romane, Roche Mathieu. 2022. Partial n-Ary relation instances on food packaging composition and permeability extracted from scientific publication tables. Data in Brief, 41:108000, 9 p.
![]()
|
Version publiée
- Anglais
Sous licence ![]() Lentschat_DIB2022.pdf Télécharger (1MB) | Prévisualisation |
Url - jeu de données - Dataverse Cirad : https://doi.org/10.18167/DVN1/GCZBC9
Résumé : This dataset is dedicated to text mining and is composed of partial n-Ary relation instances concerning food packaging composition and gas permeability. It was created from 31 tables derived from 10 English-language scientific articles in html format from several international journals hosted on the ScienceDirect website. This dataset includes two sets of data: manual table annotation results and automatic data extraction results. The tables were first annotated by one annotator and cross-curated by three different annotators. The annotation task aimed to identify all table data dealing with packaging permeability measurements and compositions. An Ontological and Terminological Resource (OTR) was used for the annotation process. The annotation guidelines were drawn up through a collective iterative approach involving the annotators, and they may be accessed alongside the data. This dataset of n-Ary relations can be used in natural language processing (NLP) approaches implemented in experimental fields, especially for n-Ary relation extraction research. It can also be useful for training or evaluation of methods for the extraction of experimental data from tables and text in scientific documents, especially in experimental domains such as food packaging.
Mots-clés libres : Text Mining, Natural language processing, Information extraction, Table extraction, Ontological and Terminological Resource, Food packaging, Permeability, Component, Quantity
Classification Agris : J10 - Manutention, transport, stockage et conservation des produits agricoles
Q80 - Conditionnement
Champ stratégique Cirad : CTS 7 (2019-) - Hors champs stratégiques
Auteurs et affiliations
- Lentschat Martin, CIRAD-ES-UMR TETIS (FRA) - auteur correspondant
- Buche Patrice, INRAE (FRA)
- Menut Luc, Université de Montpellier (FRA)
- Guari Romane, CIRAD-ES-UMR TETIS (FRA)
-
Roche Mathieu, CIRAD-ES-UMR TETIS (FRA)
ORCID: 0000-0003-3272-8568
Source : Cirad-Agritrop (https://agritrop.cirad.fr/600464/)
[ Page générée et mise en cache le 2025-04-30 ]