A BLARK extension for temporal annotation mining
- The Basic Language Resource Kit (BLARK) proposed by Krauwer is designed for the creation of initial textual resources. There are a number of toolkits for the development of spoken language resources and systems, but tools for second level resources, that is, resources which are the result of processing primary level speech resources such as speech recordings. Typically, processing of this kind in phonetics is done manually, with the aid of spreadsheets multi-purpose statistics software. We propose a Basic Language and Speech Kit (BLAST) as an extension to BLARK and suggest a strategy for integrating the kit into the Natural Language Toolkit (NLTK). The prototype kit is evaluated in an application to examining temporal properties of spoken Brazilian Portuguese.
Author: | Dafydd GibbonORCiDGND, Flaviane Romani Fernandes, Thorsten TrippelORCiDGND |
---|---|
URN: | urn:nbn:de:bsz:mh39-126357 |
URL: | http://www.lrec-conf.org/proceedings/lrec2006/pdf/735_pdf.pdf |
URL: | https://aclanthology.org/L06-1457/ |
Parent Title (English): | Proceedings of the fifth international conference on language resources and evaluation (LREC’06). 22 May - 28 May 2006, Genoa, Italy |
Publisher: | European Language Resources Association (ELRA) |
Place of publication: | Paris |
Editor: | Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, Daniel Tapias |
Document Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2006 |
Date of Publication (online): | 2024/04/16 |
Publishing Institution: | Leibniz-Institut für Deutsche Sprache (IDS) |
Publicationstate: | Veröffentlichungsversion |
Reviewstate: | Peer-Review |
Tag: | BLARK; BLAST; annotation mining; rhythm; speech timing |
GND Keyword: | Annotation; Data Mining; Gesprochene Sprache; Phonetik; Rhythmus |
First Page: | 1600 |
Last Page: | 1605 |
DDC classes: | 400 Sprache / 400 Sprache, Linguistik |
Open Access?: | ja |
Linguistics-Classification: | Computerlinguistik |
Licence (English): | Creative Commons - Attribution-NonCommercial-ShareAlike 3.0 Unported |