Agritrop
Accueil

RoBoost-PLSR: Robust PLS regression method inspired from boosting principles

Metz Maxime, Abdelghafour Florent, Roger Jean-Michel, Lesnoff Matthieu. 2021. RoBoost-PLSR: Robust PLS regression method inspired from boosting principles. In : e-Chimiométrie 2021, 2-3 février 2021. Programme, book of abstracts. GFC. s.l. : GFC, Résumé, 11-12. e-Chimiometrie 2021, France, 2 Février 2021/3 Février 2021.

Communication avec actes
[img]
Prévisualisation
Version publiée - Anglais
Utilisation soumise à autorisation de l'auteur ou du Cirad.
ID597664.pdf

Télécharger (1MB) | Prévisualisation
[img]
Prévisualisation
Version publiée - Anglais
Utilisation soumise à autorisation de l'auteur ou du Cirad.
ID597664.Diaporama.pdf

Télécharger (712kB) | Prévisualisation

Matériel d'accompagnement : 1 diaporama (16 vues)

Note générale : Le congrès s'est tenu en ligne

Résumé : Introduction - The calibration of Partial Least Square regression (PLSR) models can be disturbed by outlying samples in the data. In these cases, the models can be unstable and their predictive potential can be depreciated. To address this issue, a new method and algorithm to better apprehend the downweighting of outliers in a context of high dimensional data processing is proposed. This novel robust PLSR algorithm is inspired from the principles of boosting and is called RoBoost-PLSR. Theory - RoBoost-PLSR consists in achieving a series of K unidimensional (1 LV) iteratively reweighted PLSR[1] models. The weigthed PLSR algorithm used is the weighted NIPALS[2]. The model (k+1) is calibrated with the X and Y residuals of the previous k models. Within each model, weights are computed according to a combination of X-residuals, Y-residuals and leverages. The more the samples deviate from the model, the lower the weights. Iteratively, the model is updated according to the weights previously attributed until convergence to a stable solution. Material and methods - RoBoost-PLSR was compared with the PLSR algorithm calibrated with and without outliers (i.e. the reference) and with Partial Robust M-regression (PRM), a reference robust method. This evaluation was conducted on the basis of a simulated dataset and a real dataset. The simulated dataset was generated with the framework proposed in [4]. The simulation objective is to reproduce a contamination in the samples leading to inconsistent spectral measurements. The real dataset is an example of one animal nutrition application: the prediction of the protein content of feed materials and of the presence of incorrectly categorised samples. In this database the samples resulting from animal bonemeal (noted ANF) represent the outliers polluting the regular soyabean cakes (noted TTS). Conclusion - Roboost-PLSR proves to be resilient to the tested outliers, and can achieve the performances of the reference PLSR calibrated without any of these outliers.

Mots-clés libres : Chimiométrie, PLS regression, Méthode robustes, SPIR, NIRS

Auteurs et affiliations

  • Metz Maxime
  • Abdelghafour Florent
  • Roger Jean-Michel, INRAE (FRA)
  • Lesnoff Matthieu, CIRAD-ES-UMR SELMET (FRA) ORCID: 0000-0002-5205-9763

Source : Cirad-Agritrop (https://agritrop.cirad.fr/597664/)

Voir la notice (accès réservé à Agritrop) Voir la notice (accès réservé à Agritrop)

[ Page générée et mise en cache le 2024-06-30 ]