A cross-database comparison of two large German speech databases

  • Ph@ttSessionz and Deutsch heute are two large German speech databases. They were created for different purposes: Ph@ttSessionz to test Internet-based recordings and to adapt speech recognizers to the voices of adolescent speakers, Deutsch heute to document regional variation of German. The databases differ in their recording technique, the selection of recording locations and speakers, elicitation mode, and data processing. In this paper, we outline how the recordings were performed, how the data was processed and annotated, and how the two databases were imported into a single relational database system. We present acoustical measurements on the digit items of both databases. Our results confirm that the elicitation technique affects the speech produced, that f0 is quite comparable despite different recording procedures, and that large speech technology databases with suitable metadata may well be used for the analysis of regional variation of speech.

Metadata
Author:Christoph Draxler, Stefan Kleiner
URN:urn:nbn:de:bsz:mh39-59983
URL:https://www.internationalphoneticassociation.org/icphs-proceedings/ICPhS2015/Papers/ICPHS0631.pdf
ISBN:978-0-85261-941-4
ISSN:0241-0669
Parent Title (English):Proceedings of the 18th International Congress of Phonetic Sciences. Glasgow, UK. 10-15 Aug, 2015
Publisher:International Phonetic Association (IPA)
Place of publication:London
Editor:Maria Wolters, Judy Livingstone, Bernie Beattie, Rachel Smith, Mike MacMahon, Jane Stuart-Smith, Jim Scobbie
Document Type:Conference Proceeding
Language:English
Year of first Publication:2015
Date of Publication (online):2017/03/21
Publicationstate:Published version
Reviewstate:Peer-Review
Tag:acoustic analysis; metadata; regional variation; speech database; speech technology
GND Keyword:Akustische Phonetik; Annotation; Deutsch; Gesprochene Sprache; Korpus <Linguistik>; Metadaten; Sprachvariante
Pagenumber:5
Dewey Decimal Classification:400 Language / 430 German
BDSL-Classification:Grammar
Leibniz-Classification:Language, Linguistics
Linguistics-Classification:Phonetics / Phonology
Open Access?:Yes
Licence (English):Creative Commons - Attribution-NonCommercial-NoDerivs 3.0 Unported