Choosing the Right Technique

Determine more relevant risk factors for making that decision

Create some kind of decision tree/checklist to choose techniques

Exercise

Below are three fictional datasets. For each, decide which anonymization technique(s) you would apply and why.

1. A national survey (n = 10,000) on voting behavior. Variables: age group, gender, federal state, party preference, income bracket. The data will be published as a fully open dataset.

2. A study on workplace bullying at a mid-sized company (n = 120). Variables: department, job level, years at company, bullying score, mental health score. The data will be shared with other researchers under a data use agreement.

3. A clinical trial dataset (n = 500) with diagnosis codes, treatment group, age, sex, and a rare genetic marker. The data must be deposited in a public repository as required by the funder.

Add solutions

Learning Objective

  • After completing this part of the tutorial, you will be able to choose a suitable technique based on your data.

Exercises

  • Give short examples of datasets from various contexts and ask for the best anonymization strategy
Back to top

References

Benschop, Thijs, and Matthew Welch. n.d. Statistical Disclosure Control for Microdata: A Practice Guide for sdcMicro. https://sdcpractice.readthedocs.io/en/latest/index.html.
Guo, Wentao, Paige Pepitone, Adam J Aviv, and Michelle L Mazurek. 2025. “USENIX Security Symposium.” (Seattle, WA, USA). https://www.usenix.org/system/files/usenixsecurity25-guo-wentao.pdf.