treeClust: An R Package for Tree-Based Clustering Dissimilarities

Abstract:

This paper describes treeClust, an R package that produces dissimilarities useful for cluster ing. These dissimilarities arise from a set of classification or regression trees, one with each variable in the data acting in turn as a the response, and all others as predictors. This use of trees produces dissim ilarities that are insensitive to scaling, benefit from automatic variable selection, and appear to perform well. The software allows a number of options to be set, affecting the set of objects returned in the call; the user can also specify a clustering algorithm and, optionally, return only the clustering vector. The package can also generate a numeric data set whose inter-point distances relate to the treeClust ones; such a numeric data set can be much smaller than the vector of inter-point dissimilarities, a useful feature in big data sets.

Cite PDF Tweet

Authors

Affiliations

Samuel E. Buttrey

 

Lyn R. Whitaker

 

Published

Sept. 15, 2015

Received

Apr 7, 2015

DOI

10.32614/RJ-2015-032

Volume

Pages

7/2

227 - 236

CRAN packages used

treeClust, cluster, rpart, tree

CRAN Task Views implied by cited packages

Cluster, Environmetrics, MachineLearning, Multivariate, Survival

Footnotes

    Reuse

    Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

    Citation

    For attribution, please cite this work as

    Buttrey & Whitaker, "The R Journal: treeClust: An R Package for Tree-Based Clustering Dissimilarities", The R Journal, 2015

    BibTeX citation

    @article{RJ-2015-032,
      author = {Buttrey, Samuel E. and Whitaker, Lyn R.},
      title = {The R Journal: treeClust: An R Package for Tree-Based Clustering Dissimilarities},
      journal = {The R Journal},
      year = {2015},
      note = {https://doi.org/10.32614/RJ-2015-032},
      doi = {10.32614/RJ-2015-032},
      volume = {7},
      issue = {2},
      issn = {2073-4859},
      pages = {227-236}
    }