Volltext-Downloads (blau) und Frontdoor-Views (grau)

Sustainability of Linguistic Resources

  • This paper describes a new research initiative addressing the issue of sustainability of linguistic resources. This initiative is a cooperation between three linguistic collaborative research centres in Germany, which comprise more than 40 individual research projects altogether. These projects are involved in creating manifold language resources, especially corpora, tailored to their particular needs. The aim of the project described here is to ensure an effective and sustainable access of these data by third-party researchers beyond the termination of these projects. This goal involves a number of measures, such as the definition of a common data format to completely capture the heterogeneous information encoded in the individual corpora, the development of user-friendly and sustainably usable tools for processing (e.g. querying) the data, and the specification of common inventories of metadata and terminology. Moreover, the project aims at formulating general rules of best practice for creating, accessing, and archiving linguistic resources.

Download full text files

Export metadata

Statistics

frontdoor_oas
Metadaten
Author:Stefanie Dipper, Erhard Hinrichs, Thomas SchmidtORCiDGND, Andreas Wagner, Andreas WittORCiDGND
URN:urn:nbn:de:bsz:mh39-22718
Parent Title (English):Proceedings of the LREC 2006 Workshop on Merging and Layering Linguistic Information, Genoa, Italy.
Place of publication:Genoa, Italy
Editor:James Pustejovsky, Martha Palmer, Nancy Ide, Erhard Hinrichs
Document Type:Conference Proceeding
Language:English
Year of first Publication:2006
Date of Publication (online):2014/05/08
GND Keyword:Computerlinguistik; Forschungsdaten; Langzeitarchivierung; Linguistik
First Page:48
Last Page:54
DDC classes:400 Sprache / 410 Linguistik / 410 Linguistik
Open Access?:ja
Linguistics-Classification:Computerlinguistik
Licence (German):License LogoUrheberrechtlich geschützt