TY - CHAP U1 - Buchbeitrag A1 - Frick, Elena A1 - Helmer, Henrike A1 - Schmidt, Thomas T1 - Querying Interaction Structure: Approaches to Overlap in Spoken Language Corpora T2 - Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022). Marseille, 20-25 June 2022 N2 - In this paper, we address two problems in indexing and querying spoken language corpora with overlapping speaker contributions. First, we look into how token distance and token precedence can be measured when multiple primary data streams are available and when transcriptions happen to be tokenized, but are not synchronized with the sound at the level of individual tokens. We propose and experiment with a speaker based search mode that enables any speaker’s transcription tier to be the basic tokenization layer whereby the contributions of other speakers are mapped to this given tier. Secondly, we address two distinct methods of how speaker overlaps can be captured in the TEI based ISO Standard for Spoken Language Transcriptions (ISO 24624:2016) and how they can be queried by MTAS – an open source Lucene-based search engine for querying text with multilevel annotations. We illustrate the problems, introduce possible solutions and discuss their benefits and drawbacks. KW - Deutsch KW - Korpus KW - Gesprochene Sprache KW - Sprecherwechsel KW - Token KW - Abfragesprache KW - spoken language corpora KW - multi-turn conversations KW - corpus search engine KW - query language KW - MTAS KW - oral corpora KW - spoken language data KW - Suchmaschine Y1 - 2022 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-111054 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-111054 UR - http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.75.pdf SP - 715 EP - 722 PB - European Language Resources Association (ELRA) CY - Paris ER -