Lesnoff Matthieu, Metz Maxime, Roger Jean-Michel. 2020. Comparison of locally weighted PLS strategies for regression and discrimination on agronomic NIR data. Journal of Chemometrics, 34 (5):e3209, 13 p.
Version Online first
- Anglais
Accès réservé aux personnels Cirad Utilisation soumise à autorisation de l'auteur ou du Cirad. Lesnoff_et_Journal_of_Chemometrics 2020.pdf Télécharger (991kB) | Demander une copie |
|
Version publiée
- Anglais
Accès réservé aux personnels Cirad Utilisation soumise à autorisation de l'auteur ou du Cirad. 595227.pdf Télécharger (1MB) | Demander une copie |
Quartile : Q1, Sujet : STATISTICS & PROBABILITY / Quartile : Q2, Sujet : INSTRUMENTS & INSTRUMENTATION / Quartile : Q2, Sujet : MATHEMATICS, INTERDISCIPLINARY APPLICATIONS / Quartile : Q3, Sujet : AUTOMATION & CONTROL SYSTEMS / Quartile : Q3, Sujet : CHEMISTRY, ANALYTICAL / Quartile : Q3, Sujet : COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Résumé : In multivariate calibrations, locally weighted partial least squared regression (LWPLSR) is an efficient prediction method when heterogeneity of data generates nonlinear relations (curvatures and clustering) between the response and the explicative variables. This is frequent in agronomic data sets that gather materials of different natures or origins. LWPLSR is a particular case of weighted PLSR (WPLSR; ie, a statistical weight different from the standard 1/n is given to each of the n calibration observations for calculating the PLS scores/loadings and the predictions). In LWPLSR, the weights depend from the dissimilarity (which has to be defined and calculated) to the new observation to predict. This article compares two strategies of LWPLSR: (a) “LW”: the usual strategy where, for each new observation to predict, a WPLSR is applied to the n calibration observations (ie, entire calibration set) vs (b) “KNN‐LW”: a number of k nearest neighbors to the observation to predict are preliminary selected in the training set and WPLSR is applied only to this selected KNN set. On three illustrating agronomic data sets (quantitative and discrimination predictions), both strategies overpassed the standard PLSR. LW and KNN‐LW had close prediction performances, but KNN‐LW was much faster in computation time. KNN‐LW strategy is therefore recommended for large data sets. The article also presents a new algorithm for WPLSR, on the basis of the “improved kernel #1” algorithm, which is competitor and in general faster to the already published weighted PLS nonlinear iterative partial least squares (NIPALS).
Mots-clés Agrovoc : analyse de données, spectroscopie infrarouge
Mots-clés libres : Discrimination, Locally weighted calibration, Nearinfrared spectroscopy, Partial least squares, Regression
Classification Agris : U10 - Informatique, mathématiques et statistiques
U30 - Méthodes de recherche
A50 - Recherche agronomique
Champ stratégique Cirad : CTS 7 (2019-) - Hors champs stratégiques
Auteurs et affiliations
- Lesnoff Matthieu, CIRAD-ES-UMR SELMET (FRA) ORCID: 0000-0002-5205-9763 - auteur correspondant
- Metz Maxime, IRSTEA (FRA)
- Roger Jean-Michel, IRSTEA (FRA)
Source : Cirad-Agritrop (https://agritrop.cirad.fr/595227/)
[ Page générée et mise en cache le 2024-12-18 ]