TY - CHAP U1 - Konferenzveröffentlichung A1 - Sanguinetti, Manuela A1 - Bosco, Cristina A1 - Cassidy, Lauren A1 - Çetinoğlu, Özlem A1 - Cignarella, Alessandra Teresa A1 - Lynn, Teresa A1 - Rehbein, Ines A1 - Ruppenhofer, Josef A1 - Seddah, Djamé A1 - Zeldes, Amir ED - Calzolari, Nicoletta ED - Béchet, Frédéric ED - Blache, Philippe ED - Choukri, Khalid ED - Cieri, Christopher ED - Declerck, Thierry ED - Goggi, Sara ED - Isahara, Hitoshi ED - Maegaard, Bente ED - Mariani, Joseph ED - Mazo, Hélène ED - Moreno, Asuncion ED - Odijk, Jan ED - Piperidis, Stelios T1 - Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies T2 - Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC), May 11-16, 2020, Palais du Pharo, Marseille, France N2 - The paper presents a discussion on the main linguistic phenomena of user-generated texts found in web and social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework. Given on the one hand the increasing number of treebanks featuring user-generated content, and its somewhat inconsistent treatment in these resources on the other, the aim of this paper is twofold: (1) to provide a short, though comprehensive, overview of such treebanks - based on available literature - along with their main features and a comparative analysis of their annotation criteria, and (2) to propose a set of tentative UD-based annotation guidelines, to promote consistent treatment of the particular phenomena found in these types of texts. The main goal of this paper is to provide a common framework for those teams interested in developing similar resources in UD, thus enabling cross-linguistic consistency, which is a principle that has always been in the spirit of UD. KW - Web KW - treebanks KW - Universal Dependencies KW - annotation guidelines KW - UGC KW - Strukturbaum KW - Social Media KW - Annotation KW - Natürliche Sprache KW - User Generated Content Y1 - 2020 U6 - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-98686 UN - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-98686 UR - http://www.lrec-conf.org/proceedings/lrec2020/index.html#5240 SN - 979-10-95546-34-4 SB - 979-10-95546-34-4 SP - 5240 EP - 5250 PB - European Language Resources Association CY - Paris ER -