progenyClust: an R package for Progeny Clustering

Abstract:

Identifying the optimal number of clusters is a common problem for data scientists across research fields and industry applications. Although many clustering evaluation techniques have been developed to address this problem, the recently developed Progeny Clustering algorithm is a much faster alternative and one that is relevant to biomedical applications. In this paper, we introduce the R package progenyClust, which implements and extends the original Progeny Clustering algorithm for evaluating clustering stability and identifying the optimal cluster number. We illustrate its applicability with two examples: a simulated test dataset as a proof of concept, and a cell imaging dataset that demonstrates its potential in biomedical research. The progenyClust package is versatile, offering considerable flexibility in choosing methods and tuning parameters. In addition, the default parameter settings, together with the plot and summary methods provided in the package, make applying Progeny Clustering straightforward and coherent.


Authors

Chenyue W. Hu

Amina A. Qutub

Published

April 30, 2016

Received

Jan 1, 2016

DOI

10.32614/RJ-2016-023

Volume

8/1

Pages

328 - 338

CRAN packages used

cclust, clusterSim, cluster, NbClust, fpc, progenyClust, stats, Hmisc

CRAN Task Views implied by cited packages

Cluster, Multivariate, Bayesian, ClinicalTrials, Econometrics, Environmetrics, OfficialStatistics, ReproducibleResearch, SocialSciences

Footnotes

    Reuse

    Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

    Citation

    For attribution, please cite this work as

    Hu & Qutub, "progenyClust: an R package for Progeny Clustering", The R Journal, 2016

    BibTeX citation

    @article{RJ-2016-023,
      author = {Hu, Chenyue W. and Qutub, Amina A.},
      title = {progenyClust: an R package for Progeny Clustering},
      journal = {The R Journal},
      year = {2016},
      note = {https://doi.org/10.32614/RJ-2016-023},
      doi = {10.32614/RJ-2016-023},
      volume = {8},
      issue = {1},
      issn = {2073-4859},
      pages = {328-338}
    }