Variable Clustering in High-Dimensional Linear Regression: The R Package clere

Abstract:

Dimension reduction is one of the biggest challenges in high-dimensional regression models. We recently introduced a new methodology based on variable clustering as a means to reduce dimen sionality. We present here the R package clere that implements some refinements of this methodology. An overview of the package functionalities as well as examples to run an analysis are described. Numerical experiments on real data were performed to illustrate the good predictive performance of our parsimonious method compared to standard dimension reduction approaches.

Cite PDF Tweet

Published

April 2, 2016

Received

Mar 26, 2015

DOI

10.32614/RJ-2016-006

Volume

Pages

8/1

92 - 106

CRAN packages used

glmnet, spikeslab, clere, Rcpp, RcppEigen, lasso2, flare

CRAN Task Views implied by cited packages

MachineLearning, NumericalMathematics, Bayesian, HighPerformanceComputing, Survival

Footnotes

    Reuse

    Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

    Citation

    For attribution, please cite this work as

    Yengo, et al., "The R Journal: Variable Clustering in High-Dimensional Linear Regression: The R Package clere", The R Journal, 2016

    BibTeX citation

    @article{RJ-2016-006,
      author = {Yengo, Loïc and Jacques, Julien and Biernacki, Christophe and Canouil, Mickael},
      title = {The R Journal: Variable Clustering in High-Dimensional Linear Regression: The R Package clere},
      journal = {The R Journal},
      year = {2016},
      note = {https://doi.org/10.32614/RJ-2016-006},
      doi = {10.32614/RJ-2016-006},
      volume = {8},
      issue = {1},
      issn = {2073-4859},
      pages = {92-106}
    }