Agritrop
Home

Combining C-value and keyword extraction methods for biomedical terms extraction

Lossio Ventura Juan Antonio, Jonquet Clément, Roche Mathieu, Teisseire Maguelonne. 2013. Combining C-value and keyword extraction methods for biomedical terms extraction. In : 5th International Symposium on Languages in Biology and Medicine (LBM 2013), 12th and 13th December, 2013, Tokyo, Japan. s.l. : s.n., 5 p. International Symposium on Languages in Biology and Medicine. 5, Tokyo, Japon, 12 December 2013/13 December 2013.

Paper with proceedings
[img] Published version - Anglais
Access restricted to CIRAD agents
Use under authorization by the author or CIRAD.
document_572345.pdf

Télécharger (216kB)

Abstract : The objective of this work is to extract and to rank biomedical terms from free text. We present new extraction methods that use linguistic patterns specialized for the biomedical field, and use term extraction measures, such as C-value, and keyword extraction measures, such as Okapi BM25, and TFIDF. We propose several combinations of these measures to improve the extraction and ranking process. Our experiments show that an appropriate harmonic mean of C-value used with keyword extraction measures offers better precision results than used alone, either for the extraction of single-word and multi-words terms. We illustrate our results on the extraction of English and French biomedical terms from a corpus of laboratory tests. The results are validated by using UMLS (in English) and only MeSH (in French) as reference dictionary. (Résumé d'auteur)

Classification Agris : C30 - Documentation and information
000 - Autres thèmes

Auteurs et affiliations

  • Lossio Ventura Juan Antonio, LIRMM (FRA)
  • Jonquet Clément, LIRMM (FRA)
  • Roche Mathieu, CIRAD-ES-UMR TETIS (FRA) ORCID: 0000-0003-3272-8568
  • Teisseire Maguelonne, LIRMM (FRA)

Source : Cirad - Agritrop (https://agritrop.cirad.fr/572345/)

View Item (staff only) View Item (staff only)

[ Page générée et mise en cache le 2019-10-04 ]