SchemaOnRead: A Package for Schema-on-Read in R

Abstract:

SchemaOnRead is a CRAN package that provides an extensible mechanism for importing a wide range of file types into R, as well as support for the emerging schema-on-read paradigm in R. The schema-on-read tools within the package include a single function call, e.g., schemaOnRead("folder"), that recursively reads folders containing text, comma-separated value, raster image, R data, HDF5, NetCDF, spreadsheet, Weka, Epi Info, Pajek network, R network, HTML, SPSS, Systat, and Stata files and returns a nested list of the contained elements. The provided tools can be used as-is or easily customized to implement tool chains in R. This paper introduces and describes the SchemaOnRead package and compares it to related R packages.
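To make the idea of a recursive folder read that returns a nested list more concrete, here is a minimal base-R sketch. It is not the SchemaOnRead implementation: the names read_folder() and default_readers(), the extension-to-reader table, and the fallback of returning the file path for unknown types are all assumptions made for illustration. Only three file types are handled.

```r
# Illustrative sketch only: read_folder() and default_readers() are
# hypothetical names, not functions from the SchemaOnRead package.

# Map lower-case file extensions to reader functions (assumed, minimal set).
default_readers <- function() {
  list(
    csv = function(p) utils::read.csv(p, stringsAsFactors = FALSE),
    txt = function(p) readLines(p),
    rds = function(p) readRDS(p)
  )
}

# Recursively read `path`. Directories become named lists of their entries;
# files are dispatched to a reader chosen by extension.
read_folder <- function(path, readers = default_readers()) {
  if (dir.exists(path)) {
    entries <- sort(list.files(path, full.names = TRUE))
    result <- lapply(entries, read_folder, readers = readers)
    names(result) <- basename(entries)
    return(result)
  }
  ext <- tolower(tools::file_ext(path))
  if (!(ext %in% names(readers))) {
    return(path)  # unknown type: fall back to returning the path itself
  }
  readers[[ext]](path)
}

# Build a small demo folder in the session's temporary directory.
root <- file.path(tempdir(), "schema_on_read_demo")
dir.create(file.path(root, "sub"), recursive = TRUE, showWarnings = FALSE)
write.csv(data.frame(x = 1:3, y = c("a", "b", "c")),
          file.path(root, "table.csv"), row.names = FALSE)
writeLines(c("first line", "second line"),
           file.path(root, "sub", "notes.txt"))

# The nested list mirrors the folder layout.
contents <- read_folder(root)
str(contents)
```

Passing a different list as the readers argument is one way such a reader could be customized, for example by adding an entry for another extension. With the package itself, the corresponding entry point named in the abstract is schemaOnRead("folder").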

Author

Michael J. North


Published

April 2, 2016

Received

Oct 27, 2015

DOI

10.32614/RJ-2016-019

Volume

8/1

Pages

269 - 275

CRAN packages used

SchemaOnRead, rio, readbitmap, foreign, testthat

CRAN Task Views implied by cited packages

OfficialStatistics, WebTechnologies

Footnotes

    Reuse

    Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

    Citation

    For attribution, please cite this work as

    North, "SchemaOnRead: A Package for Schema-on-Read in R", The R Journal, 2016

    BibTeX citation

    @article{RJ-2016-019,
      author = {North, Michael J.},
      title = {SchemaOnRead: A Package for Schema-on-Read in R},
      journal = {The R Journal},
      year = {2016},
      note = {https://doi.org/10.32614/RJ-2016-019},
      doi = {10.32614/RJ-2016-019},
      volume = {8},
      issue = {1},
      issn = {2073-4859},
      pages = {269-275}
    }