Refine
Year of publication
- 2017 (3) (remove)
Document Type
- Part of a Book (2)
- Conference Proceeding (1)
Language
- English (3) (remove)
Has Fulltext
- yes (3)
Is part of the Bibliography
- yes (3)
Keywords
- Annotation (2)
- Gesprochene Sprache (2)
- Korpus <Linguistik> (2)
- annotation (2)
- Anonymisierung (1)
- Multimodalität (1)
- Standardisierung (1)
- Transkription (1)
- Videoaufzeichnung (1)
- anonymization (1)
Publicationstate
Reviewstate
- Peer-Review (2)
- Peer-review (1)
Publisher
The paper reports on the results of a scientific colloquium dedicated to the creation of standards and best practices which are needed to facilitate the integration of language resources for CMC stemming from different origins and the linguistic analysis of CMC phenomena in different languages and genres. The key issue to be solved is that of interoperability – with respect to the structural representation of CMC genres, linguistic annotations metadata, and anonymization/pseudonymization schemas. The objective of the paper is to convince more projects to partake in a discussion about standards for CMC corpora and for the creation of a CMC corpus infrastructure across languages and genres. In view of the broad range of corpus projects which are currently underway all over Europe, there is a great window of opportunity for the creation of standards in a bottom-up approach.
We present an approach to making existing CLARIN web services usable for spoken language transcriptions. Our approach is based on a new TEI-based ISO standard for such transcriptions. We show how existing tool formats can be transformed to this standard, how an encoder/decoder pair for the TCF format enables users to feed this type of data through a WebLicht tool chain, and why and how web services operating directly on the standard format would be useful.
Researchers interested in the sounds of speech or the physical gestures of Speakers make use of audio and video recordings in their work. Annotating these recordings presents a different set of requirements to the annotation of text. Special purpose tools have been developed to display video and audio Signals and to allow the creation of time-aligned annotations. This chapter reviews the most widely used of these tools for both manual and automatic generation of annotations on multimodal data.