Component-based regularization of a multivariate GLM with a thematic partitioning of the explanatory variables

Bry Xavier, Trottier Catherine, Mortier Frédéric, Cornu Guillaume. 2020. Component-based regularization of a multivariate GLM with a thematic partitioning of the explanatory variables. Statistical Modelling, 20 (1) : pp. 96-119.

Journal article ; Article de recherche ; Article de revue à facteur d'impact
[img] Version Online first - Anglais
Access restricted to CIRAD agents
Use under authorization by the author or CIRAD.
Bry et al. - 2018 - Component-based regularization of a multivariate G.pdf

Télécharger (2MB) | Request a copy

Abstract : We address component-based regularization of a multivariate generalized linear model (GLM). A vector of random responses Y is assumed to depend, through a GLM, on a set X of explanatory variables, as well as on a set A of additional covariates. X is partitioned into R conceptually homogenous variable groups X1,…,XR, viewed as explanatory themes. Variables in each Xr are assumed many and redundant. Thus, generalized linear regression demands dimension reduction and regularization with respect to each Xr. By contrast, variables in A are assumed few and selected so as to demand no regularization. Regularization is performed searching each Xr for an appropriate number of orthogonal components that both contribute to model Y and capture relevant structural information in Xr. To estimate a single-theme model, we first propose an enhanced version of Supervised Component Generalized Linear Regression (SCGLR), based on a flexible measure of structural relevance of components, and able to deal with mixed-type explanatory variables. Then, to estimate the multiple-theme model, we develop an algorithm encapsulating this enhanced SCGLR: THEME-SCGLR. The method is tested on simulated data and then applied to rainforest data in order to model the abundance of tree species.

Mots-clés géographiques Agrovoc : Angola, Burundi, Cameroun, Gabon, République centrafricaine, Congo, République démocratique du Congo, Rwanda, République-Unie de Tanzanie, Zambie

Classification Agris : U10 - Computer science, mathematics and statistics
F40 - Plant ecology
K01 - Forestry - General aspects

Champ stratégique Cirad : CTS 7 (2019-) - Hors champs stratégiques

Auteurs et affiliations

  • Bry Xavier, UM2 (FRA)
  • Trottier Catherine, UM2 (FRA) - auteur correspondant
  • Mortier Frédéric, CIRAD-ES-UPR BSef (FRA)
  • Cornu Guillaume, CIRAD-ES-UPR BSef (FRA) ORCID: 0000-0002-7523-5176

Source : Cirad-Agritrop (

View Item (staff only) View Item (staff only)

[ Page générée et mise en cache le 2021-03-01 ]