Volltext-Downloads (blau) und Frontdoor-Views (grau)

Pseudonymisation of speech data as an alternative approach to GDPR compliance

  • The debate on the use of personal data in language resources usually focuses — and rightfully so — on anonymisation. However, this very same debate usually ends quickly with the conclusion that proper anonymisation would necessarily cause loss of linguistically valuable information. This paper discusses an alternative approach — pseudonymisation. While pseudonymisation does not solve all the problems (inasmuch as pseudonymised data are still to be regarded as personal data and therefore their processing should still comply with the GDPR principles), it does provide a significant relief, especially — but not only — for those who process personal data for research purposes. This paper describes pseudonymisation as a measure to safeguard rights and interests of data subjects under the GDPR (with a special focus on the right to be informed). It also provides a concrete example of pseudonymisation carried out within a research project at the Institute of Information Technology and Communications of the Otto von Guericke University Magdeburg.

Export metadata

Additional Services

Share in Twitter Search Google Scholar


Author:Paweł KamockiORCiDGND, Ingo SiegertORCiD
Parent Title (English):Proceedings of the LREC 2022 Joint Workshop on Legal and Ethical Issues in Human Language Technologies and Multilingual De-Identification of Sensitive Language Resources (LEGAL - MDLR 2022). Marseille, 20 June 2022
Publisher:European Language Resources Association (ELRA)
Place of publication:Paris
Editor:Mickaël Rigault, Victoria Arranz, Ingo Siegert
Document Type:Conference Proceeding
Year of first Publication:2022
Date of Publication (online):2022/07/01
Publishing Institution:Leibniz-Institut für Deutsche Sprache (IDS)
Tag:GDPR; personal data; pseudonymisation; speech data
GND Keyword:Anonymisierung; Auskunftsanspruch; Datenschutz; Europäische Union : Datenschutz-Grundverordnung; Forschungsdaten; Personenbezogene Daten; Pseudonymisierung; Recht; Sprachdaten
First Page:17
Last Page:21
DDC classes:300 Sozialwissenschaften / 340 Recht
DDC classes:400 Sprache / 400 Sprache, Linguistik
Open Access?:ja
Leibniz-Classification:Sprache, Linguistik
Program areas:S2: Forschungskoordination und –infrastrukturen
Licence (English):License LogoCreative Commons - Attribution-NonCommercial 4.0 International