In order to determine priorities for the improvement of timing in synthetic speech, this study looks at the role of segmental duration prediction and the role of phonological symbolic representation in listeners' preferences. In perception experiments using German speech synthesis, two standard duration models (Klatt rules and CART) were tested. The input to these models consisted of symbolic strings derived either from a database or from a text-to-speech system. Results of the perception experiments show that different duration models can only be distinguished when the symbolic string is appropriate. Considering the relative importance of the symbolic representation, "post-lexical" segmental rules were investigated, with the outcome that listeners differ in their preferences regarding the degree of segmental reduction. In conclusion, before fine-tuning the duration prediction, it is important to calculate an appropriate phonological symbolic representation in order to improve timing in synthetic speech.
In order to determine priorities for the improvement of timing in synthetic speech, this study looks at the role of segmental duration prediction and the role of phonological symbolic representation in the perceptual quality of a text-to-speech system. In perception experiments using German speech synthesis, two standard duration models (Klatt rules and CART) were tested. The input to these models consisted of a symbolic representation derived either from a database or from a text-to-speech system. Results of the perception experiments show that different duration models can only be distinguished when the symbolic representation is appropriate. Considering the relative importance of the symbolic representation, post-lexical segmental rules were investigated, with the outcome that listeners differ in their preferences regarding the degree of segmental reduction. In conclusion, before fine-tuning the duration prediction, it is important to derive an appropriate phonological symbolic representation in order to improve timing in synthetic speech.
The paper reports on experiments with acoustic recordings of a self-built replica of the historic speaking machine of Wolfgang von Kempelen. Several possibilities of the reed as the glottal excitation mechanism were tested. Perception tests with naïve listeners revealed that the machine-generated words 'mama' and 'papa' were partially recognised as an authentic child voice, as was also the case in von Kempelen's demonstrations in the late 18th century.
Scholarly work on von Kempelen's speaking machine is mostly motivated by an interest in the history of science. This paper addresses the question of what significance the speaking machine has today. Besides possible explanations of why the speaking machine fascinates scientists and non-scientists alike, we describe the use of replicas as an instrument for demonstrating, and also for investigating, the production of speech sound.
Scientific interest in von Kempelen's 'speaking machine' stems mainly from a general interest in the history of science. This study, however, is devoted to the question of what relevance the 'speaking machine' has today. Apart from discussing why it fascinates researchers and non-researchers alike, we describe the potential of replicas as an instrument for demonstration and for researching speech generation.
Scientific interest in von Kempelen's 'speaking machine' stems mainly from a general interest in the history of science. This study, however, is devoted to the question of what relevance the 'speaking machine' has today. Apart from discussing why it fascinates researchers and non-researchers alike, we describe the construction of a replica and its potential as an instrument for demonstration and for researching speech generation.
This paper is devoted to several notable individual historical achievements in the field of mechanical speech synthesis that remain fascinating today but are mostly known only in broad outline. The selection presented here demonstrates both the captivating power of a concept of voice excitation, once it had been recognised as practicable in principle, and the resulting originality of ever new approaches to helping this synthesis principle achieve a technological breakthrough.
In mechanical speech synthesis from the 18th up to the 20th century, reed pipes were mainly used for the generation of the voice and the organ stop vox humana was central in this process. This has been described in different historical documents which report that the vox humana in some organs sounded like human vowels. In this study, tones of four different voces humanae were recorded to investigate their similarity to human vowels. The acoustical and perceptual analysis revealed that some, though not all, tones show a high similarity to selected vowels.
In mechanical speech synthesis, reed pipes were mainly used for the generation of the voice. The organ stop "vox humana" played a central role in this concept. Historical documents report that the "vox humana" sounded like human vowels. In this study, tones of four different "voces humanae" were recorded to investigate their similarity to human vowels. The acoustical and perceptual analysis revealed that some, though not all, tones show a high similarity to selected vowels.