Volltext-Downloads (blau) und Frontdoor-Views (grau)

The German Reference Corpus: New developments building on almost 50 years of experience

  • This paper describes the efforts in the field of sustainability of the Institut für Deutsche Sprache (IDS) in Mannheim with respect to DEREKO (Deutsches Referenzkorpus) the Archive of General Reference Corpora of Contemporary Written German. With focus on re-usability and sustainability, we discuss its history and our future plans. We describe legal challenges related to the creation of a large and sustainable resource; sketch out the pipeline used to convert raw texts to the final corpus format and outline migration plans to TEI P5. Due to the fact, that the current version of the corpus management and query system is pushed towards its limits, we discuss the requirements for a new version which will be able to handle current and future DEREKO releases. Furthermore, we outline the institute’s plans in the field of digital preservation.

Export metadata

Additional Services

Share in Twitter Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Marc KupietzGND, Oliver Schonefeld, Andreas WittORCiDGND
URN:urn:nbn:de:bsz:mh39-45002
URL:http://lrec-conf.org/proceedings/lrec2010/index.html
Parent Title (English):Language Resources: From Storyboard to Sustainability and LR Lifecycle Management, Workshop held at the seventh conference on International Language Resources and Evaluation (LREC). Malta, May 2010
Publisher:European Language Resources Association (ELRA)
Place of publication:Paris
Editor:Victoria Arranz, Laura van Eerten
Document Type:Conference Proceeding
Language:English
Year of first Publication:2010
Date of Publication (online):2015/12/18
Publicationstate:Veröffentlichungsversion
Reviewstate:(Verlags)-Lektorat
Tag:Deutsches Referenzkorpus (DeReKo); Institut für Deutsche Sprache <Mannheim>
GND Keyword:Korpus <Linguistik>; Langzeitarchivierung
First Page:39
Last Page:43
Dewey Decimal Classification:400 Sprache / 410 Linguistik
Leibniz-Classification:Sprache, Linguistik
Linguistics-Classification:Korpuslinguistik
Open Access?:Ja
Licence (German):Es gilt das UrhG