This paper provides insights into the ongoing international research project "Unserdeutsch (Rabaul Creole German): Documentation of a highly endangered creole language in Papua New Guinea", based at the University of Augsburg, Germany. It describes the stages of the project, from fieldwork to corpus development, and outlines the methods and software used at each stage. We also present approaches to specific problems that have arisen in the practical work so far.
This paper describes the TEI-based ISO standard 24624:2016 ‘Transcription of spoken language’ and other formats used within CLARIN for spoken language resources. It assesses the current state of support for the standard and the interoperability between these formats and with relevant tools and services. The main idea behind the paper is that a digital infrastructure providing language resources and services to researchers should also allow the combined use of resources and/or services from different contexts. This requires syntactic and semantic interoperability. We propose a solution based on the ISO/TEI format and describe the necessary steps for this format to work as an exchange format with basic semantic interoperability for spoken language resources across the CLARIN infrastructure and beyond.
In this chapter, we outline what is specific to comparisons made from the perspective of Conversation Analysis (CA), and we position them in relation to other fields. We introduce the analytical mentality, methodology, and procedures of CA, and we show how we used them for the analysis of OKAY in this volume.
This paper discusses the technological and methodological challenges in creating and sharing HAMATAC, the Hamburg Map Task Corpus. The first version of the corpus, consisting of 24 recordings with orthographic transcriptions and metadata, is publicly available. A second version featuring different types of linguistic annotation is in progress. I will describe how the various software tools and data formats of the EXMARaLDA system were used for transcription and multi-level annotation, to compile recordings and transcriptions into a corpus and manage metadata, to publish the corpus, and how they can be used for carrying out corpus queries (KWIC) and analyses. Some recurrent issues in corpus building and sharing and the interaction of technological and methodological aspects will be illustrated using HAMATAC.
This paper presents an updated version of the conversation-analytic transcription system GAT (Gesprächsanalytisches Transkriptionssystem). GAT has been widely used in conversation research since it was first presented in 1998, and it was time to revise it carefully in light of the experience gained so far and of new demands on transcriptions. The text presents the updated GAT 2 transcription system with all its old and new conventions, and it attempts to clarify known cases of doubt and to remedy known weaknesses of the first version. GAT 2 gives detailed instructions for producing conversation-analytic transcripts at three levels of detail (the minimal, basic, and fine transcript), as well as new proposals for representing more complex phenomena in special additional lines. In addition, some supplementary aids were developed for GAT 2, which are briefly introduced in the appendix: the online tutorial GAT-TO and the transcription editor FOLKER.
This paper presents two toolsets for transcribing and annotating spoken language: the EXMARaLDA system, developed at the University of Hamburg, and the FOLK tools, developed at the Institute for the German Language in Mannheim. Both systems are targeted at users interested in the analysis of spontaneous, multi-party discourse. Their main user community is situated in conversation analysis, pragmatics, sociolinguistics and related fields. The paper gives an overview of the individual tools of the two systems – the Partitur-Editor, a tool for multi-level annotation of audio or video recordings, the Corpus Manager, a tool for creating and administering corpus metadata, EXAKT, a query and analysis tool for spoken language corpora, FOLKER, a transcription editor optimized for speed and efficiency of transcription, and OrthoNormal, a tool for orthographical normalization of transcription data. It concludes with some thoughts about the integration of these tools into the larger tool landscape.