Open Science and language data: Expectations vs. reality. The role of research data infrastructures
Language data are essential for any scientific endeavor. However, unlike numerical data, language data are often protected by copyright, as they easily meet the threshold of originality. The role of research infrastructures (such as CLARIN, DARIAH, and Text+) is to bridge the gap between uses allowed by statutory exceptions and the requirements of Open Science. This is achieved on the one hand by sharing language data produced by research organisations with the widest possible circle of persons, and on the other by mutualizing efforts towards copyright clearance and appropriate licensing of datasets.
Author: | Paweł Kamocki, Erhard Hinrichs, Peter Leinen, Sabine Springer, Andreas Witt, Dorothea Zechmann |
---|---|
URN: | urn:nbn:de:bsz:mh39-121653 |
DOI: | https://doi.org/10.52825/cordi.v1i.301 |
ISSN: | 2941-296X |
Parent Title (English): | 1st Conference on Research Data Infrastructure (CoRDI) - Connecting Communities |
Publisher: | Technische Informationsbibliothek |
Place of publication: | Hannover |
Editor: | York Sure-Vetter, Carole Goble |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2023 |
Date of Publication (online): | 2023/10/11 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publication state: | Published version |
Review state: | Peer review |
Tag: | Research infrastructures; Text data |
GND Keyword: | Data Mining; Forschungsdaten; Infrastruktur; Open Science; Sprachdaten; Urheberrecht |
Volume: | 1 |
First Page: | 1 |
Last Page: | 3 |
DDC classes: | 400 Language / 400 Language, Linguistics |
Open Access?: | yes |
Leibniz-Classification: | Language, Linguistics |
Linguistics-Classification: | Computational linguistics |
Program areas: | S2: Research coordination and infrastructures |