OPUS 4 | Search

97 search hits

1 to 10

Sort by

Korpus "Skandinavische Semikommunikation" - ein mehrsprachiges Diskurskorpus auf XML-Basis (2003)

EXMARaLDA - ein System zur computergestützten Diskurstranskription (2004)

Der Aufsatz beschreibt EXMARaLDA, ein XML-basiertes System zur computergestutzten Diskurstranskription, das am Sonderforschungsbereich „Mehrsprachigkeit“ an der Universität Hamburg entwickelt wurde.

Transcribing and annotating spoken language with EXMARaLDA (2004)

Schmidt, Thomas

This paper describes EXMARaLDA, an XML-based framework for the construction, dissemination and analysis of corpora of spoken language transcriptions. Departing from a prototypical example of a “partitur” (musical score) transcription, the EXMARaLDA “single timeline, multiple tiers” data model and format is presented alongside with the EXMARaLDA Partitur-Editor, a tool for inputting and visualizing such data. This is followed by a discussion of the interaction of EXMARaLDA with other frameworks and tools that work with similar data models. Finally, this paper presents an extension of the “single timeline, multiple tiers” data model and describes its application within the EXMARaLDA system.

Datenarchive für die Gesprächsforschung : Perspektiven, Probleme und Lösungsansätze (2005)

Schmidt, Thomas

Dieser Aufsatz befasst sich mit Fragen, die sich im Zusammenhang mit der Archivierung und öffentlichen Bereitstellungen von gesprächsanalytischen Daten (Audio- bzw. Videoaufnahmen und deren Transkriptionen) stellen. Er gibt zunächst einen Überblick über die Forschungsperspektiven, die eine verbesserte Praxis der Datenm•chivierung flir die Gesprächsforschung bieten würde, und nennt dann einige der wesentlichen Probleme, die in der derzeitigen Praxis der Schaffung solcher Archive im Wege stehen können. Anschließend werden vorhandene Lösungsansätze vorgestellt, die helfen können, diese Probleme zu überwinden.

Comparison of multimodal annotation tools (2006)

Rohlfing, Katharina ; Loehr, Daniel ; Duncan, Susan ; Brown, Amanda ; Franklin, Amy ; Kimbara, Irene ; Milde, Jan-Torsten ; Parrill, Fey ; Rose, Travis ; Schmidt, Thomas ; Sloetjes, Han ; Thies, Alexandra ; Wellinghoff, Sandra

Interfacing Lexical and Ontological Information in a Multilingual Soccer FrameNet (2006)

Schmidt, Thomas

This paper presents ongoing work on a multilingual (English, French, German) lexical resource of soccer language. The first part describes how lexicographic descriptions based on frame-semantic principles are derived from a partially aligned multilingual corpus of soccer match reports. The remainder of the paper then discusses how different types of ontological knowledge are linked to this resource in order to provide an access structure to the resulting dictionary. It is argued that linking lexical resources and ontologies in such a way provides novel ways to a dictionary user of navigating a domain vocabulary

The Kicktionary: A Multilingual Resource of the Language of Football (2007)

Schmidt, Thomas

This paper presents the Kicktionary, a multilingual (English — German - French) electronic lexical resource of the language of football. It explains how a corpus of football match reports was analysed according to the FrameNet and WordNet approaches and how the result of this analysis is presented to a dictionary user via a website

Transkriptionskonventionen für die computergestützte gesprächsanalytische Transkription (2007)

Schmidt, Thomas

The Kicktionary : Combining corpus linguistics and lexical semantics for a multilingual football dictionary (2008)

Schmidt, Thomas

This paper presents the Kicktionary, a multilingual (English - German - French) electronic lexical resource of the language of football. In the Kicktionary, methods from corpus linguistics and two approaches to lexical semantics - the theory of frame semantics and the concept of semantic relations - are combined to construct a lexical resource in which the user can explore relationships between lexical units in various ways. This paper explains the theoretical background of the Kicktionary, sketches the data and methods which were used in its construction, and describes how the resulting resource is presented to users via a set of hyperlinked webpages.

Rescuing Legacy Data (2008)

Schmidt, Thomas ; Bennöhr, Jasmine

This paper discusses issues that arise in the transformation of electronic language data from outdated to modern, sustainable formats. We first describe the problem and then present four different cases in which corpora of spoken language were converted from legacy formats to an XML-based representation. For each of the four cases, we describe the conversion workflow and discuss the difficulties that we had to overcome. Based on this experience, we formulate some more general observations about transforming legacy data and conclude with a set of best practice recommendations for a more sustainable handling of language corpora.

1 to 10

Person(s)
Title
Subject
Abstract
Fulltext
Year(s)

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

97 search hits