Generating and evaluating synthetic data in R

Author
Affiliations

Thom Benjamin Volker, MSc.

Utrecht University

Statistics Netherlands

Published

July 8, 2026

Contributors

This site was authored by Thom Volker, and edits/contributions were made by Malika Ihle.

Licenses

The overall project is available under the CC BY-SA 4.0 license found at LICENSE; all code without any narrative text is also (at your option) available under the CC0 1.0 Universal license found at LICENSE-CODE.

Why two licenses? The CC BY-SA 4.0 license is for the content of the website, while the CC0 1.0 Universal license is for the code and configuration files. This is a common practice for websites that include code snippets and other content that may be reused in other projects, particularly because the CC BY-SA 4.0 license is not intended to be used with software.

Notes

This tutorial was primarily written in 2025 using R version 4.5.0. Major changes to R can be found at the R website for the most up-to-date information, and to confirm that our tutorial is not out-of-date.

Back to top

Citation

BibTeX citation:
@online{benjamin_volker2026,
  author = {Benjamin Volker, Thom},
  title = {Generating and Evaluating Synthetic Data in {R}},
  date = {2026-07-08},
  url = {https://lmu-osc.github.io/synthetic-data-tutorial/},
  doi = {TODO: DOI HERE},
  langid = {en}
}
For attribution, please cite this work as:
Benjamin Volker, Thom. 2026. “Generating and Evaluating Synthetic Data in R.” LMU Open Science Center. July 8, 2026. https://doi.org/TODO: DOI HERE.