"Wie Schule Sprache macht"
(2019)
This paper presents the prototype of a lexicographic resource for spoken German in interaction, which was conceived within the framework of the LeGeDe project (LeGeDe = Lexik des gesprochenen Deutsch). It first summarizes the theoretical and methodological approaches used in the initial planning of the resource. The headword candidates were selected on the basis of corpus data: to this end, data from two corpora (written and spoken German) were compared using quantitative methods. The information gathered on the selected headword candidates falls into two sections: meanings and functions in interaction.
Additionally, two studies were carried out on the expectations that future users have of the resource. The results of these studies were also taken into account in the development of the prototype. Focusing on the presentation of the resource's content, the paper shows both the different kinds of lexicographical information in selected dictionary entries and the information offered by the hyperlinks and external texts provided. In conclusion, it summarizes the most important innovative aspects that were developed specifically for the implementation of such a resource.
We present a descriptive analysis of the two datasets from the shared task on Source, Subjective Expression and Target Extraction from Political Speeches (STEPS), the only existing German dataset for opinion role extraction of its size. Our analysis discusses the individual properties of the three components (subjective expressions, sources and targets) and their relations to each other. Our observations should help practitioners and researchers who are building a system to extract opinion roles from German data.
Classical null hypothesis significance tests are not appropriate in corpus linguistics, because the randomness assumption underlying these testing procedures is not fulfilled. Nevertheless, there are numerous scenarios in which it would be beneficial to have some kind of test for judging the relevance of a result (e.g. a difference between two corpora), that is, for deciding whether the attribute of interest is pronounced enough to warrant the conclusion that it is substantial and not due to chance. In this paper, I outline such a test.
A supervised learning approach for the extraction of opinion sources and targets from German text
(2019)
We present the first systematic supervised learning approach for the extraction of opinion sources and targets from German-language data. A wide range of features is presented, in particular syntactic features and generalization features. We point out specific differences between opinion sources and targets. Moreover, we explain why implicit sources can be extracted even with fairly generic features. To ensure comparability, our classifier is trained and tested on the dataset of the STEPS shared task.
The Lehnwortportal Deutsch (2012 seqq.) serves as an integrated online information system on German lexical borrowings into other languages, synthesizing a growing number of dictionaries and providing basic cross-resource search options. The paper discusses the far-reaching revision of the system's conceptual, lexicographical and technological underpinnings that is currently under way, focusing on its relevance for multilingual loanword lexicography.