dad: an R Package for Visualisation, Classification and Discrimination of Multivariate Groups Modelled by their Densities

Abstract:

Multidimensional scaling (MDS), hierarchical cluster analysis (HCA), and discriminant analysis (DA) are classical techniques that deal with data consisting of n individuals and p variables. When the individuals are divided into T groups, the R package dad associates a multivariate probability density function with each group and then carries out these techniques on the densities, which are estimated from the data under consideration. These techniques are based on distance measures between densities: chi-square, Hellinger, Jeffreys, Jensen-Shannon, and Lp for discrete densities; Hellinger, Jeffreys, L2, and 2-Wasserstein for Gaussian densities; and L2 for numeric non-Gaussian densities estimated by the Gaussian kernel method. Practical methods help the user to give meaning to the outputs in the context of MDS and HCA, and to look for an optimal prediction in the context of DA based on the leave-one-out misclassification ratio. Some functions for data management and for basic statistical calculations on groups are also included.
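To make the idea of "carrying out MDS and HCA on densities" concrete, here is a minimal sketch in base R. It does not use the dad package or reproduce its API: the helper `hellinger_gauss` and the choice of the `iris` data are assumptions made for illustration only. Each group is modelled by a Gaussian density whose parameters are estimated from the data, the Hellinger distance between each pair of group densities is computed in closed form, and the resulting distance matrix is passed to `cmdscale` and `hclust` from the stats package.

```r
# Illustrative sketch only: base R, no functions from the 'dad' package.
# Assumption: each group is modelled by a Gaussian density.
data(iris)
groups <- split(iris[, 1:4], iris$Species)

# Estimate the Gaussian parameters (mean vector, covariance matrix) per group
params <- lapply(groups, function(g) list(mean = colMeans(g), var = cov(g)))

# Hypothetical helper: Hellinger distance between two Gaussian densities.
# Convention assumed here: H^2 = 1 - BC, where BC is the Bhattacharyya
# coefficient; other conventions differ by a constant factor sqrt(2).
hellinger_gauss <- function(p1, p2) {
  s  <- (p1$var + p2$var) / 2
  d  <- p1$mean - p2$mean
  bc <- det(p1$var)^0.25 * det(p2$var)^0.25 / sqrt(det(s)) *
    exp(-0.125 * drop(t(d) %*% solve(s, d)))
  sqrt(max(0, 1 - bc))  # guard against tiny negative values from rounding
}

# Matrix of pairwise distances between the T group densities
n <- length(params)
D <- matrix(0, n, n, dimnames = list(names(params), names(params)))
for (i in seq_len(n)) {
  for (j in seq_len(n)) {
    D[i, j] <- hellinger_gauss(params[[i]], params[[j]])
  }
}

# MDS and HCA applied to the groups (densities) rather than to individuals
mds <- cmdscale(as.dist(D), k = 2)
hca <- hclust(as.dist(D), method = "complete")
```

Each row of `mds` gives the coordinates of one group in the plane, and `plot(hca)` would draw the dendrogram of the groups. Replacing `hellinger_gauss` with another distance between densities changes only the construction of `D`.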

Published

Aug. 16, 2021

Received

Oct. 13, 2020

DOI

10.32614/RJ-2021-071

Volume

13/2

Pages

179-207

CRAN packages used

stats, MASS, ade4, FactoMineR, cluster, dad, fda, fda.usc, fdadensity, compositions, Compositional, robCompositions

CRAN Task Views implied by cited packages

Multivariate, Distributions, Environmetrics, FunctionalData, Psychometrics, Robust, ChemPhys, Cluster, Econometrics, MissingData, NumericalMathematics, SocialSciences, Spatial, TeachingStatistics

Footnotes

    Reuse

    Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

    Citation

    For attribution, please cite this work as

    Boumaza, et al., "The R Journal: dad: an R Package for Visualisation, Classification and Discrimination of Multivariate Groups Modelled by their Densities", The R Journal, 2021

    BibTeX citation

    @article{RJ-2021-071,
      author = {Boumaza, Rachid and Santagostini, Pierre and Yousfi, Smail and Demotes-Mainard, Sabine},
      title = {The R Journal: dad: an R Package for Visualisation, Classification and Discrimination of Multivariate Groups Modelled by their Densities},
      journal = {The R Journal},
      year = {2021},
      note = {https://doi.org/10.32614/RJ-2021-071},
      doi = {10.32614/RJ-2021-071},
      volume = {13},
      issue = {2},
      issn = {2073-4859},
      pages = {179-207}
    }