Federated content search for Lexical Resources (LexFCS): Specification
- The landscape of digital lexical resources is often characterized by dedicated local portals and proprietary interfaces as primary access points for scholars and the interested public. In addition, legal and technical restrictions are potential issues that can make it difficult to efficiently query and use these valuable resources. As part of the research data consortium Text+, solutions for the storage and provision of digital language resources are being developed and provided in the context of the unified cross-domain German research data infrastructure NFDI. The specific topic of accessing lexical resources in a diverse and heterogenous landscape with a variety of participating institutions and established technical solutions is met with the development of the federated search and query framework LexFCS. The LexFCS extends the established CLARIN Federated Content Search that already allows accessing spatially distributed text corpora using a common specification of technical interfaces, data formats, and query languages. This paper describes the current state of development of the LexFCS, gives an insight into its technical details, and provides an outlook on its future development.
Author: | Erik KörnerORCiD, Thomas EckartORCiD, Axel Herold, Frank Wiegand, Frank MichaelisORCiDGND, Matthias Bremm, Louis CotgroveORCiDGND, Thorsten TrippelORCiDGND, Felix RauORCiD |
---|---|
URN: | urn:nbn:de:bsz:mh39-121122 |
DOI: | https://doi.org/10.5281/zenodo.7986303 |
Publisher: | Zenodo |
Place of publication: | Potsdam |
Document Type: | Working Paper |
Language: | English |
Year of first Publication: | 2023 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | (Verlags)-Lektorat |
GND Keyword: | Forschungsdaten; Information Retrieval; Infrastruktur; Korpus <Linguistik>; Lexikalische Analyse |
First Page: | 1 |
Last Page: | 9 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik / 400 Sprache |
Open Access?: | ja |
Leibniz-Classification: | Sprache, Linguistik |
Linguistics-Classification: | Korpuslinguistik |
Program areas: | L3: Lexik empirisch und digital |
Program areas: | S2: Forschungskoordination und –infrastrukturen |
Licence (English): | Creative Commons - Attribution 4.0 International |