Volltext-Downloads (blau) und Frontdoor-Views (grau)

POS tagset refinement for linguistic analysis and the impact on statistical parsing

  • The annotation of parts of speech (POS) in linguistically annotated corpora is a fundamental annotation layer which provides the basis for further syntactic analyses, and many NLP tools rely on POS information as input. However, most POS annotation schemes have been developed with written (newspaper) text in mind and thus do not carry over well to text from other domains and genres. Recent discussions have concentrated on the shortcomings of present POS annotation schemes with regard to their applicability to data from domains other than newspaper text.

Export metadata

Additional Services

Share in Twitter Search Google Scholar

Statistics

frontdoor_oas
Metadaten
Author:Ines Rehbein, Hagen Hirschmann
URN:urn:nbn:de:bsz:mh39-80368
URL:http://tlt13.sfs.uni-tuebingen.de/tlt13-proceedings.pdf
ISBN:978-3-9809183-9-8
Parent Title (English):Proceedings of the Thirteenth International Workshop on Treebanks and Linguistic Theories (TLT13). December 12-13, 2014, Tübingen, Germany
Publisher:University of Tübingen
Place of publication:Tübingen
Editor:Verena Henrich, Erhard Hinrichs, Daniël de Kok, Petya Osenova, Adam Przepiórkowski
Document Type:Conference Proceeding
Language:English
Year of first Publication:2014
Date of Publication (online):2018/10/04
Publicationstate:Veröffentlichungsversion
Reviewstate:Peer-Review
GND Keyword:Annotation; Korpus <Linguistik>; Parts of speech; Syntaktische Analyse
First Page:172
Last Page:183
Dewey Decimal Classification:400 Sprache / 430 Deutsch
Leibniz-Classification:Sprache, Linguistik
Linguistics-Classification:Computerlinguistik
Open Access?:Ja
Licence (German):Es gilt das UrhG