The IFCASL Corpus of French and German Non-native and Native Read Speech

  • The IFCASL corpus is a French-German bilingual phonetic learner corpus designed, recorded and annotated in a project on individualized feedback in computer-assisted spoken language learning. The motivation for setting up this corpus was that there is no phonetically annotated and segmented corpus for this language pair of comparable size and coverage. In contrast to most learner corpora, the IFCASL corpus incorporates data for a language pair in both directions, i.e. in our case French learners of German, and German learners of French. In addition, the corpus is complemented by two sub-corpora of native speech by the same speakers. The corpus provides spoken data by about 100 speakers with comparable productions, annotated and segmented on the word and the phone level, with more than 50% manually corrected data. The paper reports on inter-annotator agreement and the optimization of the acoustic models for forced speech-text alignment in exercises for computer-assisted pronunciation training. Example studies based on the corpus data with a phonetic focus include topics such as the realization of /h/ and glottal stop, final devoicing of obstruents, vowel quantity and quality, pitch range, and tempo.

Metadata
Author: Jürgen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin Jügler, Yves Laprie, Odile Mella, Bernd Möbius, Frank Zimmerer
URN: urn:nbn:de:bsz:mh39-59057
URL: http://www.lrec-conf.org/proceedings/lrec2016/index.html
ISBN: 978-2-9517408-9-1
Parent Title (English): Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016). May 23-28, 2016. Portorož, Slovenia
Publisher: European Language Resources Association
Place of publication: Paris
Editor: Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis
Document Type: Conference Proceeding
Language: English
Year of first Publication: 2016
Date of Publication (online): 2017/02/22
Publication state: Published version
Tag: French; German; learner corpus; multilinguality; native speech; non-native speech; phonetic databases; phonetics
GND Keyword: Deutsch als Fremdsprache; Französisch; Gesprochene Sprache; Korpus <Linguistik>; Phonetik
First Page: 1333
Last Page: 1338
Dewey Decimal Classification: 400 Language / 400 Language, Linguistics
Linguistics-Classification: Corpus linguistics
Linguistics-Classification: Phonetics / Phonology
Open Access?: Yes
Licence (English): Creative Commons - Attribution-NonCommercial 4.0 International