Trailblazing through forests of resources in linguistics

Barkey, Reinhild; Hinrichs, Erhard; Hoppermann, Christina; Trippel, Thorsten; Zinn, Claus

Linguistics faces a challenge shared by many other sciences: it continues to grow into increasingly complex subfields, each with its own separate or overarching branches. While linguists are certainly aware of the overall structure of the research field, they cannot follow all developments beyond those of their own subfields. It is thus important to help specialists and newcomers alike to bushwhack through evolved or unknown territory of linguistic data. A considerable amount of research data in linguistics is described with metadata. While studies published in archived journals and conference proceedings receive a fairly homogeneous set of metadata tags (e.g., author, title, publisher), this does not hold for the empirical data and analyses that underlie such studies. Moreover, lexicons, grammars, experimental data, and other types of resources come in different forms; to make things worse, their metadata descriptions are not uniform either, if they exist at all. These problems are well known, and a number of international initiatives (e.g., CLARIN, FlareNet, MetaNet, DARIAH) now aim to build infrastructures for managing linguistic resources. The NaLiDa project, funded by the German Research Foundation, aims to facilitate the management of, and access to, linguistic resources originating from German research institutions. In cooperation with the German SFB 833 research center, we are developing a combination of faceted and full-text search to give integrated access across heterogeneous metadata sets. Our approach is supported by a central registry for metadata field descriptors and a component repository for structured groups of data categories as larger building blocks.
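The combination of faceted and full-text search over heterogeneous metadata can be illustrated with a minimal sketch. It is not the NaLiDa implementation: the registry contents, the field names, the sample records, and the functions `normalize`, `search`, and `facet_counts` are all hypothetical and chosen only to show how a shared registry of field descriptors lets differently structured records be filtered together.

```python
from collections import Counter

# Hypothetical registry: maps schema-specific field names to shared data categories.
REGISTRY = {
    "creator": "author", "author": "author",
    "lang": "language", "language": "language",
    "resourceType": "type", "type": "type",
    "name": "title", "title": "title",
}

# Invented sample records that use two different metadata schemas.
RECORDS = [
    {"title": "Spoken German corpus", "creator": "A. Example",
     "lang": "German", "resourceType": "corpus"},
    {"name": "German verb lexicon", "author": "B. Example",
     "language": "German", "type": "lexicon"},
    {"name": "English treebank", "author": "C. Example",
     "language": "English", "type": "corpus"},
]


def normalize(record):
    """Rename fields to shared categories; unknown field names are kept as-is."""
    return {REGISTRY.get(key, key): value for key, value in record.items()}


def search(records, query="", facets=None):
    """Return normalized records that match every facet value exactly and
    contain every query term as a case-insensitive substring of any field."""
    facets = facets or {}
    terms = query.lower().split()
    hits = []
    for rec in map(normalize, records):
        if any(rec.get(name) != value for name, value in facets.items()):
            continue
        text = " ".join(str(value) for value in rec.values()).lower()
        if all(term in text for term in terms):
            hits.append(rec)
    return hits


def facet_counts(records, facet):
    """Count how many records carry each value of the given shared category."""
    return Counter(rec[facet] for rec in map(normalize, records) if facet in rec)
```

Under these assumptions, `search(RECORDS, "german", {"type": "corpus"})` narrows the collection by a facet and a free-text term at once, while `facet_counts(RECORDS, "type")` supplies the per-value counts a faceted interface would display next to each filter.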

Metadata
Author:	Reinhild Barkey, Erhard Hinrichs, Christina Hoppermann, Thorsten Trippel, Claus Zinn
URN:	urn:nbn:de:bsz:mh39-109046
URL:	https://web.stanford.edu/group/dh2011/cgi-bin/wordpress/wp-content/uploads/2011/05/DH2011_BookOfAbs.pdf
ISBN:	978-0-911221-47-3
Parent Title (English):	Digital Humanities 2011. Stanford University, Stanford, CA, USA, June 19-22, 2011. Conference Abstracts.
Publisher:	Stanford University Library
Place of publication:	Stanford
Document Type:	Conference Proceeding
Language:	English
Year of first Publication:	2011
Date of Publication (online):	2022/02/04
Publishing Institution:	Leibniz-Institut für Deutsche Sprache (IDS)
Publication state:	Published version
Review state:	Peer-reviewed
GND Keyword:	Computational linguistics; Data management; Digital Humanities; Research data; Metadata
First Page:	88
Last Page:	90
DDC classes:	400 Language / 400 Language, Linguistics
Open Access?:	yes
Linguistics-Classification:	Computational linguistics
Licence (German):	Urheberrechtlich geschützt (protected by copyright)
