Providing Digital Infrastructure for Audio-Visual Linguistic Research Data with Diverse Usage Scenarios: Lessons Learnt

  • This article describes the development, over the past ten years, of the digital infrastructure at a research data centre for audio-visual linguistic research data, the Hamburg Centre for Language Corpora (HZSK) at the University of Hamburg in Germany. The typical resource hosted in the HZSK Repository, the core component of the infrastructure, is a spoken language corpus: a collection of recordings with time-aligned transcripts and additional contextual data. Since the centre has a thematic focus on multilingualism and linguistic diversity and provides its services to researchers within linguistics and other disciplines, the development of the infrastructure was driven by diverse usage scenarios and user needs on the one hand, and by the common technical requirements for certified service centres of the CLARIN infrastructure on the other. Beyond the technical details, the article also aims to contribute to the discussion on responsibilities and services within emerging digital research data infrastructures and on the fundamental issues in the sustainability of research software engineering. It concludes that, in order to truly cater to user needs across the research data lifecycle, we still need to bridge the gap between discipline-specific research methods in the process of digitalisation and generic digital research data management approaches.


Author:Hanna Hedeland
Parent Title (English):Publications
Place of publication:Basel
Document Type:Article
Year of first Publication:2020
Date of Publication (online):2020/07/07
Tag:audio-visual data; data quality; domain-specific solutions; linguistic data; research data
GND Keyword:Audiovisuelles Material; Datenmanagement; Forschungsdaten; Gesprochene Sprache; Korpus <Linguistik>
Page Number:13
First Page:33 (article number)
Note on citation:
The journal or publisher uses article numbers instead of continuous pagination; this article is number 33.
DDC classes:400 Language / 400 Language, Linguistics
Open Access?:yes
Leibniz-Classification:Language, Linguistics
Program areas:P2: Oral Corpora
Licence (English):Creative Commons - Attribution 4.0 International