Plant identification based on noisy web data: the amazing performance of deep learning (LifeCLEF 2017)

Plant identification based on noisy web data: the amazing performance of deep learning (LifeCLEF 2017). Goeau Hervé, Bonnet Pierre, Joly Alexis. 2017. In: CLEF 2017 Working Notes. Cappellato Linda (ed.), Ferro Nicola (ed.), Goeuriot Lorraine (ed.), Mandl Thomas (ed.). s.l.: CEUR Workshop Proceedings, 13 p. (CEUR Workshop Proceedings, 1866)

International Conference of the CLEF Association. 8, Dublin, Ireland, 11 September 2017/24 September 2017.
Published version - English
Creative Commons - No rights restrictions worldwide.


Conference paper with proceedings

Abstract: The 2017 fifth edition of the LifeCLEF plant identification challenge is an important milestone towards automated plant identification systems working at the scale of continental floras, with 10,000 plant species living mainly in Europe and North America illustrated by a total of 1.1M images. Nowadays, such ambitious systems are enabled thanks to the conjunction of the dazzling recent progress in image classification with deep learning and several outstanding international initiatives, such as the Encyclopedia of Life (EOL), aggregating the visual knowledge on plant species coming from the main national botany institutes. However, despite all these efforts, the majority of plant species still remain without pictures or are poorly illustrated. Outside the institutional channels, a much larger number of plant pictures are available and spread on the web through botanist blogs, plant lovers' web pages, image hosting websites and online plant retailers. The LifeCLEF 2017 plant challenge presented in this paper aimed at evaluating to what extent a large noisy training dataset collected through the web and containing a lot of labelling errors can compete with a smaller but trusted training dataset checked by experts. To fairly compare both training strategies, the test dataset was created from a third data source, i.e. the Pl@ntNet mobile application, which collects millions of plant image queries all over the world. This paper presents more precisely the resources and assessments of the challenge, summarizes the approaches and systems employed by the participating research groups, and provides an analysis of the main outcomes. (Author's abstract)

Agris classification: F70 - Plant taxonomy and geography
C30 - Documentation and information
U10 - Mathematical and statistical methods

Authors and affiliations

  • Goeau Hervé, CIRAD-BIOS-UMR AMAP (FRA)
  • Bonnet Pierre, CIRAD-BIOS-UMR AMAP (FRA)
  • Joly Alexis, INRIA (FRA)

Source: Cirad-Agritrop
