Overview Anonymization Techniques

Explain the goal of anonymization further

Data anonymization is the process of transforming sensitive data to protect individuals’ privacy (El Emam and Arbuckle 2013). Its primary goal is to remove the association between identifying data and the data subject, making it impossible or very difficult to trace the data back to an individual (El Emam and Arbuckle 2013)

In the next sections of the tutorial, I will present you with several groups of techniques for anonymization, following the taxonomy by Carvalho et al. (2023).

Non-perturbative techniques are those that do not change data values but rather mask them.
Perturbative techniques involve modifying data values to anonymize the data.
De-associative techniques separate specific parts of the data from other parts.
Synthesizing data means creating new data with the same statistical properties as the original data. I will not discuss this method in detail in this tutorial.

Insert graphics that explain the underlying principles

Add info on which risks these techniques mitigate

For each anonymization technique, the variable’s scale level needs to be taken into account. Add relevant scale levels for all anonymization techniques

Explain the structure of the following parts (exercises based on the same data)

Mention to what data one can apply these techniqes

After learning more about these techniques in the following chapters, I will explain how to decide which technique is the best for your data in the Chapter “Choosing the Right Technique”.

Learning Objective

After completing this part of the tutorial, you will understand the fundamental principles of anonymization techniques and how they relate to one another.

Exercise

none

Resources, Links, Examples

Carvalho et al. (2023) for detailed information

References

Carvalho, Tânia, Nuno Moniz, Pedro Faria, and Luís Antunes. 2023. “Survey on Privacy-Preserving Techniques for Microdata Publication.” ACM Computing Surveys 55 (14s): 1–42. https://doi.org/10.1145/3588765.

El Emam, Khaled, and Luk Arbuckle. 2013. Anonymizing Health Data: Case Studies and Methods to Get You Started. " O’Reilly Media, Inc.".