OPUS 4 | Search

Refine

Has Fulltext

yes (6)

6 search hits

1 to 6

Sort by

EXMARaLDA - Creating, Analysing and Sharing Spoken Language Corpora for Pragmatic Research (2009)

This paper presents EXMARaLDA, a system for the computer-assisted creation and analysis of spoken language corpora. The first part contains some general observations about technological and methodological requirements for doing corpus-based pragmatics. The second part explains the systems architecture and gives an overview of its most important software components a transcription editor, a corpus management tool and a corpus query tool. The last part presents some corpora which have been or are currently being compiled with the help of EXMARaLDA.

Introduction: putting practices in spoken corpora into focus (2014)

Ruhi, Şükriye ; Haugh, Michael ; Schmidt, Thomas ; Wörner, Kai

Modelling Linguistic Data Structures (2006)

Wörner, Kai ; Witt, Andreas ; Rehm, Georg ; Dipper, Stefanie

Linguistic corpora have been annotated by means of SGML-based markup languages for almost 20 years. We can, very roughly, differentiate between three distinct evolutionary stages of markup technologies. (1)Originally, single SGML tree-based document instances were deemed sufficient for the representation of linguistic structures. (2) Linguists began to realize that alternatives and extensions to the traditional model are needed. Formalisms such as, for example, NITE were proposed: the NITE Object Model (NOM) consists of multi-rooted trees. (3) We are now on the threshold of the third evolutionary stage: even NITE's very flexible approach is not suited for all linguistic purposes. As some structures, such as these, cannot be modeled by multi-rooted trees, an even more flexible approach is needed in order to provide a generic annotation format that is able to represent genuinely arbitrary linguistic data structures.

Multilingual corpora at the Hamburg centre for language corpora (2014)

Hedeland, Hanna ; Lehmberg, Timm ; Schmidt, Thomas ; Wörner, Kai

Multilingual Corpora at the Hamburg Centre for Language Corpora (2011)

Hedeland, Hanna ; Lehmberg, Timm ; Schmidt, Thomas ; Wörner, Kai

We give an overview of the content and the technical background of a number of corpora which were developed in various projects of the Research Centre on Multilingualism (SFB 538) between 1999 and 2011 and which are now made available to the scientific community via the Hamburg Centre for Language Corpora.

New and future developments in EXMARaLDA (2011)

Schmidt, Thomas ; Wörner, Kai ; Hedeland, Hanna ; Lehmberg, Timm

We present some recent and planned future developments in EXMARaLDA, a system for creating, managing, analysing and publishing spoken language corpora. The new functionality concerns the areas of transcription and annotation, corpus management, query mechanisms, interoperability and corpus deployment. Future work is planned in the areas of automatic annotation, standardisation and workflow management.

1 to 6

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Publisher

6 search hits