Trailblazing through forests of resources in linguistics
- Linguistics is facing the challenge of many other sciences as it continues to grow into increasingly complex subfields, each with its own separate or overarching branches. While linguists are certainly aware of the overall structure of the research field, they cannot follow all developments other than those of their subfields. It is thus important to help specialists but also newcomers alike to bushwhack through evolved or unknown territory of linguistic data. A considerable amount of research data in linguistics is described with metadata. While studies described and published in archived journals and conference proceedings receive a quite homogeneous set of metadata tags — e.g., author, title, publisher —, this does not hold for the empirical data and analyses that underlie such studies. Moreover, lexicons, grammars, experimental data, and other types of resources come in different forms; and to make things worse, their description in terms of metadata is also not uniform, if existing at all. These problems are well-known and there are now a number of international initiatives — e.g., CLARIN, FlareNet, MetaNet, DARIAH — to build infrastructures for managing linguistic resources. The NaLiDa project, funded by the German Research Foundation, aims at facilitating the management and access to linguistic resources originating from German research institutions. In cooperation with the German SFB 833 research center, we are developing a combination of faceted and full-text search to give integrated access through heterogeneous metadata sets. Our approach is supported by a central registry for metadata field descriptors, and a component repository for structured groups of data categories as larger building blocks.
Author: | Reinhild Barkey, Erhard HinrichsGND, Christina HoppermannORCiD, Thorsten TrippelORCiDGND, Claus ZinnORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-109046 |
URL: | https://web.stanford.edu/group/dh2011/cgi-bin/wordpress/wp-content/uploads/2011/05/DH2011_BookOfAbs.pdf |
ISBN: | 978-0-911221-47-3 |
Parent Title (English): | Digital Humanities 2011. Stanford University, Stanford, CA, USA, June 19-22, 2011. Conference Abstracts. |
Publisher: | Stanford University Library |
Place of publication: | Stanford |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2011 |
Date of Publication (online): | 2022/02/04 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
GND Keyword: | Computerlinguistik; Datenmanagement; Digital Humanities; Forschungsdaten; Metadaten |
First Page: | 88 |
Last Page: | 90 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Computerlinguistik |
Licence (German): | Urheberrechtlich geschützt |