scmamp: Statistical Comparison of Multiple Algorithms in Multiple Problems

Abstract:

Comparing the results obtained by two or more algorithms in a set of problems is a central task in areas such as machine learning or optimization. Drawing conclusions from these comparisons may require the use of statistical tools such as hypothesis testing. There are some interesting papers that cover this topic. In this manuscript we present scmamp, an R package aimed at being a tool that simplifies the whole process of analyzing the results obtained when comparing algorithms, from loading the data to the production of plots and tables. Comparing the performance of different algorithms is an essential step in many research and practical computational works. When new algorithms are proposed, they have to be compared with the state of the art. Similarly, when an algorithm is used for a particular problem, its performance with different sets of parameters has to be compared, in order to tune them for the best results. When the differences are very clear (e.g., when an algorithm is the best in all the problems used in the comparison), the direct comparison of the results may be enough. However, this is an unusual situation and, thus, in most situations a direct comparison may be misleading and not enough to draw sound conclusions; in those cases, the statistical assessment of the results is advisable. The statistical comparison of algorithms in the context of machine learning has been covered in several papers. In particular, the tools implemented in this package are those presented in Demšar (2006); García and Herrera (2008); García et al. (2010). Another good review that covers, among other aspects, the statistical assessment of the results in the context of supervised classification can be found in Santafé et al. (2015).

Cite PDF Tweet

Authors

Affiliations

Borja Calvo

 

Guzmán Santafé

 

Published

Nov. 25, 2015

Received

Sep 28, 2015

DOI

10.32614/RJ-2016-017

Volume

Pages

8/1

248 - 256

CRAN packages used

scmamp

CRAN Task Views implied by cited packages

Footnotes

    Reuse

    Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

    Citation

    For attribution, please cite this work as

    Calvo & Santafé, "The R Journal: scmamp: Statistical Comparison of Multiple Algorithms in Multiple Problems", The R Journal, 2015

    BibTeX citation

    @article{RJ-2016-017,
      author = {Calvo, Borja and Santafé, Guzmán},
      title = {The R Journal: scmamp: Statistical Comparison of Multiple Algorithms in Multiple Problems},
      journal = {The R Journal},
      year = {2015},
      note = {https://doi.org/10.32614/RJ-2016-017},
      doi = {10.32614/RJ-2016-017},
      volume = {8},
      issue = {1},
      issn = {2073-4859},
      pages = {248-256}
    }