OPUS 4 | Search

6 search hits

1 to 6

Sort by

Relevancy
Year
Year
Title
Title
Author
Author

Annotating Modality Interdependencies (2015)

Reimer, Eva ; Trevisan, Bianka ; Eraßme, Denise ; Schmidt, Thomas ; Jakobs, Eva-Maria

This paper discusses computational linguistic methods for the semi-automatic analysis of modality interdependencies (the combination of complex resources such as speaking, writing, and visualizing; MID) in professional crosssituational interaction settings. The overall purpose of the approach is to develop models, methods, and a framework for the description and analysis of MID forms and functions. The paper describes work in progress—the development of an annotation framework that allows annotating different data and file formats at various levels, to relate annotation levels and entries independently of the given file format, and to visualize patterns.

Recent Initiatives towards New Standards for Language Resources (2015)

Herzog, Gottfried ; Heid, Ulrich ; Trippel, Thorsten ; Bański, Piotr ; Romary, Laurent ; Schmidt, Thomas ; Witt, Andreas ; Eckart, Kerstin

CLARIN Web Services for TEI-annotated Transcripts of Spoken Language (2020)

Fisseni, Bernhard ; Schmidt, Thomas

We present web services which implement a workflow for transcripts of spoken language following the TEI guidelines, in particular ISO 24624:2016 “Language resource management – Transcription of spoken language”. The web services are available at our website and will be available via the CLARIN infrastructure, including the Virtual Language Observatory and WebLicht.

Addressing Cha(lle)nges in Long-Term Archiving of Large Corpora (2020)

Arnold, Denis ; Fisseni, Bernhard ; Kamocki, Paweł ; Schonefeld, Oliver ; Kupietz, Marc ; Schmidt, Thomas

This paper addresses long-term archival for large corpora. Three aspects specific to language resources are focused, namely (1) the removal of resources for legal reasons, (2) versioning of (unchanged) objects in constantly growing resources, especially where objects can be part of multiple releases but also part of different collections, and (3) the conversion of data to new formats for digital preservation. It is motivated why language resources may have to be changed, and why formats may need to be converted. As a solution, the use of an intermediate proxy object called a signpost is suggested. The approach will be exemplified with respect to the corpora of the Leibniz Institute for the German Language in Mannheim, namely the German Reference Corpus (DeReKo) and the Archive for Spoken German (AGD).

Datenübernahmerichtlinien des Leibniz-Instituts für Deutsche Sprache (2019)

Arnold, Denis ; Fankhauser, Peter ; Fisseni, Bernhard ; Kupietz, Marc ; Lüngen, Harald ; Schmidt, Thomas ; Witt, Andreas

CLARIN Web Services for TEI-annotated Transcripts of Spoken Language (2019)

Fisseni, Bernhard ; Schmidt, Thomas

We present web services implementing a workflow for transcripts of spoken language following TEI guidelines, in particular ISO 24624:2016 "Language resource management - Transcription of spoken language". The web services are available at our website and will be available via the CLARIN infrastructure, including the Virtual Language Observatory and WebLicht.

1 to 6

Person(s)
Title
Subject
Abstract
Fulltext
Year(s)

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

6 search hits