Agritrop
Home

Cluster analysis for two-level data sets: Classifying tree species from individual measurements

Picard Nicolas, Bar-Hen Avner. 2013. Cluster analysis for two-level data sets: Classifying tree species from individual measurements. Ecological Informatics, 15 : pp. 1-7.

Journal article ; Article de revue à facteur d'impact
[img] Published version - Anglais
Access restricted to CIRAD agents
Use under authorization by the author or CIRAD.
document_568059.pdf

Télécharger (339kB)

Quartile : Q2, Sujet : ECOLOGY

Abstract : Two-level data sets consist of higher level (say population) traits computed from lower level (say individual) observations. Cluster analysis for two-level data sets aims at classifying populations using individual observations. Most existing techniques to classify populations in two-level data sets actually operate on population traits (e.g. the k-means algorithm), thus disregarding the within-population individual variability. In this study, the k-means algorithm was compared with a recently developed classification method that accounts for within-population variability. Populations were tree species in a tropical rain forest in French Guiana, and individual observations were tree diameters and diameter growth rates. Tree species were classified according to either their diameter and growth rate, or to their asymptotic diameter distribution as predicted by an Usher matrix population model. In both cases, the k-means algorithm and the two-level classification method defined species clusters that were significantly related according to the Rand index. Nevertheless, clusters showed increasing differences between the two methods as the within-population individual variability increased. Whereas the k-means algorithm produced equally-sized spherical clusters, the two-level classification method adapted the size and shape of clusters to the individual within-population variability. Taking account of individual variability to classify populations in ecology may thus be important, albeit rarely done. (Résumé d'auteur)

Mots-clés Agrovoc : Forêt tropicale humide, Cecropia obtusifolia, Inga, Modèle mathématique, Méthode statistique, Classification, Diamètre, Croissance, Biométrie, Écologie, Distribution géographique, Analyse de variance

Mots-clés géographiques Agrovoc : Guyane française

Mots-clés complémentaires : Algorithme, Variabilite, Dicorynia guianensis, Vouacapoua americana, Swartzia, Vismia guianensis, Jacaranda copaia, Qualea rosea

Classification Agris : F40 - Plant ecology
U10 - Computer science, mathematics and statistics
F70 - Plant taxonomy and geography
K01 - Forestry - General aspects

Champ stratégique Cirad : Axe 6 (2005-2013) - Agriculture, environnement, nature et sociétés

Auteurs et affiliations

  • Picard Nicolas, CIRAD-ES-UPR BSef (GAB)
  • Bar-Hen Avner, Université René Descartes (FRA)

Source : Cirad - Agritrop (https://agritrop.cirad.fr/568059/)

View Item (staff only) View Item (staff only)

[ Page générée et mise en cache le 2021-02-24 ]