TY  - JOUR
U1  - Journal article, scholarly - peer-reviewed (reviewed)
A1  - Horbach, Andrea
A1  - Thater, Stefan
A1  - Steffen, Diana
A1  - Fischer, Peter M.
A1  - Witt, Andreas
A1  - Pinkal, Manfred
T1  - Internet Corpora: A Challenge for Linguistic Processing
JF  - Datenbank-Spektrum
N2  - Natural language processing tools are mostly developed for and optimized on newspaper texts, and often show a substantial performance drop when applied to other types of texts such as Twitter feeds, chat data or Internet forum posts. We explore a range of easy-to-implement methods of adapting existing part-of-speech taggers to improve their performance on Internet texts. Our results show that these methods can improve tagger performance substantially.
KW  - Natural language processing
KW  - Part-of-speech tagging
KW  - Computer-mediated communication
KW  - Corpus
KW  - Internet
KW  - Natural language
KW  - Automatic language analysis
Y1  - 2015
UN  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-43565
SN  - 1618-2162
SS  - 1618-2162
U6  - https://doi.org/10.1007/s13222-014-0172-z
DO  - https://doi.org/10.1007/s13222-014-0172-z
N1  - This article is not freely accessible for copyright reasons.
VL  - 15
IS  - 1
SP  - 41
EP  - 47
ER  -