TY  - CHAP
U1  - Konferenzveröffentlichung
A1  - Rehm, Georg
A1  - Schonefeld, Oliver
A1  - Witt, Andreas
A1  - Lehmberg, Timm
A1  - Chiarcos, Christian
A1  - Béchara, Hannan
A1  - Eishold, Florian
A1  - Evang, Kilian
A1  - Leshtanska, Magdalena
A1  - Savkov, Alexandar
A1  - Stark, Matthias
ED  - Calzolari, Nicoletta
ED  - Choukri, Khalid
ED  - Maegaard, Bente
ED  - Mariani, Joseph
ED  - Odijk, Jan
ED  - Piperidis, Stelios
ED  - Tapias, Daniel
T1  - The Meta-data-Database of a Next Generation Sustainability Web-Platform for Language Resources
T2  - Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
N2  - Our goal is to provide a web-based platform for the long-term preservation and distribution of a heterogeneous collection of linguistic resources. We discuss the corpus preprocessing and normalisation phase that results in sets of multi-rooted trees. At the same time we transform the original metadata records, just like the corpora annotated using different annotation approaches and exhibiting different levels of granularity, into the all-encompassing and highly flexible format eTEI for which we present editing and parsing tools. We also discuss the architecture of the sustainability platform. Its primary components are an XML database that contains corpus and metadata files and an SQL database that contains user accounts and access control lists. A staging area, whose structure, contents, and consistency can be checked using tools, is used to make sure that new resources about to be imported into the platform have the correct structure.
KW  - Sprachdaten
KW  - Metadaten
KW  - Korpus
Y1  - 2008
U6  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-45081
UN  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-45081
UR  - http://www.lrec-conf.org/proceedings/lrec2008/
SN  - 2-9517408-4-0
SB  - 2-9517408-4-0
SP  - 371
EP  - 378
PB  - European Language Resources Association (ELRA)
CY  - Paris
ER  -