Volltext-Downloads (blau) und Frontdoor-Views (grau)

Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features, bypassing the phoneme as recognition unit

  • Sound units play a pivotal role in cognitive models of auditory comprehension. The general consensus is that during perception listeners break down speech into auditory words and subsequently phones. Indeed, cognitive speech recognition is typically taken to be computationally intractable without phones. Here we present a computational model trained on 20 hours of conversational speech that recognizes word meanings within the range of human performance (model 25%, native speakers 20–44%), without making use of phone or word form representations. Our model also generates successfully predictions about the speed and accuracy of human auditory comprehension. At the heart of the model is a ‘wide’ yet sparse two-layer artificial neural network with some hundred thousand input units representing summaries of changes in acoustic frequency bands, and proxies for lexical meanings as output units. We believe that our model holds promise for resolving longstanding theoretical problems surrounding the notion of the phone in linguistic theory.

Export metadata

Additional Services

Share in Twitter Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Denis Arnold, Fabian Tomaschek, Konstantin Sering, Florence Lopez, R. Harald BaayenGND
URN:urn:nbn:de:bsz:mh39-60616
DOI:https://doi.org/10.1371/journal.pone.0174623
Parent Title (English):PLoS ONE
Editor:Hedderik van Rijn
Document Type:Article
Language:English
Year of first Publication:2017
Date of Publication (online):2017/04/19
Publicationstate:Veröffentlichungsversion
Reviewstate:Peer-Review
Tag:human learning; learning; speech; speech signal processing; word recognition
Volume:12
Issue:4
First Page:1
Last Page:16
Note:
Verwendete R-Pakete
Dewey Decimal Classification:400 Sprache
Leibniz-Classification:Sprache, Linguistik
Linguistics-Classification:Quantitative Linguistik
Open Access?:Ja
Licence (English):License LogoCreative Commons - Attribution 4.0 International