Refine
Year of publication
- 2012 (102) (remove)
Document Type
- Part of a Book (53)
- Conference Proceeding (24)
- Article (22)
- Book (1)
- Other (1)
- Part of Periodical (1)
Keywords
- Deutsch (31)
- Korpus <Linguistik> (18)
- Computerlinguistik (9)
- Konversationsanalyse (8)
- Englisch (7)
- Kontrastive Grammatik (7)
- Sprachpolitik (7)
- Metadaten (6)
- Annotation (5)
- Datenmanagement (5)
Publicationstate
- Veröffentlichungsversion (102) (remove)
Reviewstate
- (Verlags)-Lektorat (68)
- Peer-Review (31)
- Peer-review (1)
- Verlags-Lektorat (1)
Publisher
"wer ich bin? dein schlimmster alptraum, baby!" Cybermobbing - ein Thema für den Deutschunterricht
(2012)
This paper presents the application of the <tiger2/> format to various linguistic scenarios with the aim of making it the standard serialisation for the ISO 24615 [1] (SynAF) standard. After outlining the main characteristics of both the SynAF metamodel and the <tiger2/> format, as extended from the initial Tiger XML format [2], we show through a range of different language families how <tiger2/> covers a variety of constituency and dependency based analyses.
We present a gold standard for semantic relation extraction in the food domain for German. The relation types that we address are motivated by scenarios for which IT applications present a commercial potential, such as virtual customer advice in which a virtual agent assists a customer in a supermarket in finding those products that satisfy their needs best. Moreover, we focus on those relation types that can be extracted from natural language text corpora, ideally content from the internet, such as web forums, that are easy to retrieve. A typical relation type that meets these requirements are pairs of food items that are usually consumed together. Such a relation type could be used by a virtual agent to suggest additional products available in a shop that would potentially complement the items a customer has already in their shopping cart. Our gold standard comprises structural data, i.e. relation tables, which encode relation instances. These tables are vital in order to evaluate natural language processing systems that extract those relations.
Creating and maintaining metadata for various kinds of resources requires appropriate tools to assist the user. The paper presents the metadata editor ProFormA for the creation and editing of CMDI (Component Metadata Infrastructure) metadata in web forms. This editor supports a number of CMDI profiles currently being provided for different types of resources. Since the editor is based on XForms and server-side processing, users can create and modify CMDI files in their standard browser without the need for further processing. Large parts of ProFormA are implemented as web services in order to reuse them in other contexts and programs.
This paper presents the system architecture as well as the underlying workflow of the Extensible Repository System of Digital Objects (ERDO) which has been developed for the sustainable archiving of language resources within the Tübingen CLARIN-D project. In contrast to other approaches focusing on archiving experts, the described workflow can be used by researchers without required knowledge in the field of long-term storage for transferring data from their local file systems into a persistent repository.
This paper presents Release 2.0 of the SALSA corpus, a German resource for lexical semantics. The new corpus release provides new annotations for German nouns, complementing the existing annotations of German verbs in Release 1.0. The corpus now includes around 24,000 sentences with more than 36,000 annotated instances. It was designed with an eye towards NLP applications such as semantic role labeling but will also be a useful resource for linguistic studies in lexical semantics.
Die adnominalen (attributiven) Verwendungsmöglichkeiten von temporalen und lokalen Adverbien im Deutschen werden untersucht und mit denen aus vier anderen europäischen Nachbarsprachen – Englisch, Französisch, Polnisch, Ungarisch – verglichen. Gezeigt wird, wie diese Sprachen unterschiedliche Anbindungsstrategien nutzen, um Adverbien in attributiver Funktion einsetzen zu können. Drei solcher Strategien werden unterschieden: Juxtaposition, Adjektivierung und formale Verknüpfung. Die Anbindungsstrategien sind in den Vergleichssprachen unterschiedlich verteilt und in unterschiedlichem Maße dominant. Verfügt eine Sprache über zwei oder mehr Anbindungsstrategien, so können diese in Abhängigkeit von der semantischen Teilklasse des Attributs mit verschiedenen semantischen Beschränkungen und Effekten korreliert sein. Diese bezeichnen wir als temporale bzw. lokale Kompatibilität, Persistenz und Oppositivität. Es lassen sich z.T. übereinzelsprachlich bestimmte Form-Funktions-Korrelationen zwischen Anbindungsstrategien und semantischen Beschränkungen bzw. Effekten feststellen. So können adjektivische und formal verknüpfte Attribute Persistenz und Oppositivität kodieren, juxtaponierte dagegen grundsätzlich nicht.
Am Anfang war die Lücke
(2012)