The R Journal: Towards a Grammar for Processing Clinical Trial Data

Michael J. Kane

doi:10.32614/RJ-2021-052

Supplementary materials are available in addition to this article. It can be downloaded at RJ-2021-052.zip

References

Clinical data interchange standards consortium. 2020. Accessed: 2020-10-25.

P. Higgins. Ggconsort: Creates CONSORT diagrams for RCTs. 2020. URL https://github.com/higgi13425/ggconsort. R package version 0.0.0.9000.

Immport: Bioinformatics for the future of immunology. 2020. Accessed: 2020-10-25.

M. J. Kane. Forceps: A grammar for manipulating clinical trial data. 2020. URL https://github.com/kaneplusplus/forceps. R package version 0.0.1.

Project data sphere: Convener, collaborator, catalyst in the fight against cancer. 2020. Accessed: 2020-10-25.

R Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing, 2012. URL http://www.R-project.org/. ISBN 3-900051-07-0.

SAS Institute. Base SAS 9.4 procedures guide. SAS Institute, 2020.

M. Shotwell. sas7bdat: SAS database reader (experimental). 2014. URL https://CRAN.R-project.org/package=sas7bdat. R package version 0.5.

M. S. Shotwell, C. Cummins, S. Header, S. Pages, S. Subheaders and S. P. B. Data. SAS7BDAT database binary format. 2013.

M. van der Loo. Monitoring data in r with the lumberjack package. Journal of Statistical Software, Accepted for publication, 2020. URL https://CRAN.R-project.org/package=lumberjack.

H. Wickham et al. Tidy data. Journal of Statistical Software, 59(10): 1–23, 2014.

H. Wickham and E. Miller. Haven: Import and export ’SPSS’, ’stata’ and ’SAS’ files. 2020. URL https://CRAN.R-project.org/package=haven. R package version 2.3.1.

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Towards a Grammar for Processing Clinical Trial Data

Author

Affiliation

Published

Received

DOI

Volume

Pages

Introduction: On the use of historical clinical data

Clinical trial analysis data sets

Challenges to analyzing these data sets

A tidy representation for a consolidated analysis data set

Processing ADaM data to reach the tidy representation

Creating the data dictionary

Cohorting

Identifying conflicts and redundancies

Consolidating

Direction: An integrated approach to processing clinical data

Supplementary materials

Footnotes

References

Reuse

Citation