Automatic Identification of Research Fields in Scientific Papers

Kergosien Eric, Farvardin Mohammad Amin, Teisseire Maguelonne, Bessagnet Marie-Noëlle, Schöpfel Joachim, Chaudiron Stephane, Jacquemin Bernard, Lacayrelle Annig, Roche Mathieu, Sallaberry Christian, Tonneau Jean-Philippe. 2018. Automatic Identification of Research Fields in Scientific Papers. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Calzolari Nicoletta (ed.), Choukri Khalid (ed.), Cieri Christopher (ed.), Declerck Thierry (ed.), Goggi Sara (ed.), Hasida Koiti (ed.), Isahara Hitoshi (ed.), Maegaard Bente (ed.), Mariani Joseph (ed.), Mazo Hélène (ed.), Moreno Asuncion (ed.), Odijk Jan. ELRA. Miyazaki: ELRA, pp. 1902-1907. ISBN 979-10-95546-00-9 International Conference on Language Resources and Evaluation. 11, Miyazaki, Japan, 7 May 2018/12 May 2018.

Paper with proceedings
Published version - English
License: Creative Commons.


Abstract: The TERRE-ISTEX project aims to identify scientific research dealing with specific geographical areas, based on heterogeneous digital content available in scientific papers. The project is divided into three main work packages: (1) identification of the periods and places of the empirical studies reflected in the publications from which the analyzed text samples come, (2) identification of the themes that appear in these documents, and (3) development of a web-based geographical information retrieval (GIR) tool. The first two actions combine Natural Language Processing patterns with text mining methods. The integration of the spatial, thematic and temporal dimensions in a GIR tool contributes to a better understanding of what kind of research has been carried out, of its topics, and of its geographical and historical coverage. Another original feature of the TERRE-ISTEX project is the heterogeneous character of the corpus, which includes PhD theses and scientific articles from the ISTEX digital libraries and the CIRAD research center.

Free keywords: Named entity recognition, Information extraction, Information retrieval, Text mining

Authors and affiliations

  • Kergosien Eric, Université de Lille (FRA)
  • Farvardin Mohammad Amin, Université de Lille (FRA)
  • Teisseire Maguelonne, IRSTEA (FRA)
  • Bessagnet Marie-Noëlle, UPPA (FRA)
  • Schöpfel Joachim, Université de Lille (FRA)
  • Chaudiron Stephane, Université de Lille (FRA)
  • Jacquemin Bernard, Université de Lille (FRA)
  • Lacayrelle Annig, UPPA (FRA)
  • Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568
  • Sallaberry Christian, UPPA (FRA)
  • Tonneau Jean-Philippe, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-4331-7238

Source : Cirad-Agritrop (

