Conversion and annotation web services for spoken language data in CLARIN
- We present an approach to making existing CLARIN web services usable for spoken language transcriptions. Our approach is based on a new TEI-based ISO standard for such transcriptions. We show how existing tool formats can be transformed to this standard, how an encoder/decoder pair for the TCF format enables users to feed this type of data through a WebLicht tool chain, and why and how web services operating directly on the standard format would be useful.
| Author: | Thomas SchmidtORCiDGND, Hanna HedelandORCiD, Daniel Jettka |
|---|---|
| URN: | urn:nbn:de:bsz:mh39-62167 |
| URL: | http://www.ep.liu.se/ecp/article.asp?issue=136&article=009&volume |
| ISBN: | 978-91-7685-499-0 |
| Parent Title (English): | Linköping Electronic Conference Proceedings; Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26–28 October 2016, CLARIN Common Language Resources and Technology Infrastructure |
| Publisher: | Linköping University Electronic Press |
| Place of publication: | Linköping |
| Editor: | Lars Borin |
| Document Type: | Conference Proceeding |
| Language: | English |
| Year of first Publication: | 2017 |
| Date of Publication (online): | 2017/06/22 |
| Publicationstate: | Veröffentlichungsversion |
| Reviewstate: | Peer-Review |
| GND Keyword: | Gesprochene Sprache; Korpus <Linguistik>; Standardisierung; Transkription |
| Volume: | 2017 |
| Issue: | 136 |
| Page Number: | 18 |
| First Page: | 113 |
| Last Page: | 130 |
| DDC classes: | 400 Sprache / 430 Deutsch |
| Open Access?: | ja |
| Leibniz-Classification: | Sprache, Linguistik |
| Program areas: | Pragmatik |
| Licence (German): | Creative Commons - Namensnennung 4.0 International |


