Generating and evaluating synthetic data in R
Contributors
This site was authored by Thom Volker, and edits/contributions were made by Malika Ihle.
Licenses
The overall project is available under the CC BY-SA 4.0 license found at LICENSE; all code without any narrative text is also (at your option) available under the CC0 1.0 Universal license found at LICENSE-CODE.
Why two licenses? The CC BY-SA 4.0 license is for the content of the website, while the CC0 1.0 Universal license is for the code and configuration files. This is a common practice for websites that include code snippets and other content that may be reused in other projects, particularly because the CC BY-SA 4.0 license is not intended to be used with software.
Notes
This tutorial was primarily written in 2025 using R
version 4.5.0
. Major changes to R
can be found at the R website for the most up-to-date information, and to confirm that our tutorial is not out-of-date.
Citation
@online{benjamin_volker2026,
author = {Benjamin Volker, Thom},
title = {Generating and Evaluating Synthetic Data in {R}},
date = {2026-07-08},
url = {https://lmu-osc.github.io/synthetic-data-tutorial/},
doi = {TODO: DOI HERE},
langid = {en}
}