Refine
Year of publication
- 2019 (137) (remove)
Document Type
- Article (53)
- Conference Proceeding (30)
- Part of a Book (23)
- Book (15)
- Other (5)
- Review (5)
- Working Paper (4)
- Part of Periodical (2)
Language
- German (76)
- English (57)
- Multiple languages (2)
- Ukrainian (2)
Keywords
- Deutsch (46)
- Korpus <Linguistik> (31)
- Gesprochene Sprache (13)
- Automatische Sprachanalyse (12)
- Konversationsanalyse (9)
- corpus linguistics (9)
- Interaktion (7)
- Social Media (7)
- Kommunikation (6)
- Sprachstatistik (6)
- corpus processing (6)
- Annotation (5)
- Computerlinguistik (5)
- Diskursanalyse (5)
- Gespräch (5)
- Interaktionsanalyse (5)
- Mundart (5)
- Sprache (5)
- Sprachpolitik (5)
- Computerunterstütztes Verfahren (4)
- Fremdsprachenlernen (4)
- Grammatik (4)
- Mehrsprachigkeit (4)
- Neologismus (4)
- Online-Wörterbuch (4)
- Sprachkritik (4)
- Sprecherwechsel (4)
- Syntax (4)
- Transkription (4)
- corpus management (4)
- web corpora (4)
- Anonymität (3)
- Automatische Sprachverarbeitung (3)
- Deutschunterricht (3)
- Englisch (3)
- German (3)
- Gewalt (3)
- Lexikographie (3)
- Massenmedien (3)
- Mobiles Endgerät (3)
- Multimodalität (3)
- Natürliche Sprache (3)
- Privatheit (3)
- Semantik (3)
- Smartphone (3)
- Spracherwerb (3)
- Sprachgebrauch (3)
- Sprachnorm (3)
- Sprachwandel (3)
- Wiedervereinigung <Deutschland> (3)
- online lexicography (3)
- spoken German (3)
- Öffentlichkeit (3)
- Arabisch (2)
- Augenzeuge (2)
- Beleidigung (2)
- Biografisches Interview (2)
- Datenmanagement (2)
- Dialog (2)
- Diskurs (2)
- Estnisch (2)
- Estonia (2)
- Experimentelle Psychologie (2)
- Flüchtling (2)
- Framing (2)
- Framing-Effekt (2)
- Fremdsprache (2)
- Gestik (2)
- Hassparolen (2)
- Internet (2)
- Kognitive Linguistik (2)
- Kontrastive Linguistik (2)
- Korrektur (2)
- Leibniz-Institut für Deutsche Sprache <Mannheim> (2)
- Leichte Sprache (2)
- Lettisch (2)
- Lettland (2)
- Linguistic Landscape (2)
- Linguistik (2)
- Metapher (2)
- Minderheitensprache (2)
- Morphosyntax (2)
- Nationalsozialismus (2)
- OCR-Schrift (2)
- Politische Sprache (2)
- Populismus (2)
- Propositionale Einstellung (2)
- Protest (2)
- Rechtschreibung (2)
- Rezension (2)
- Rhetorik (2)
- Russisch (2)
- Schimpfwort (2)
- Schwedisch (2)
- Segmentierung (2)
- Spanisch (2)
- Sprachkurs (2)
- Sprachpflege (2)
- Sprachpurismus (2)
- Sprachverarbeitung <Psycholinguistik> (2)
- Sprachwechsel (2)
- Sprachwirkung (2)
- Text Encoding Initiative (2)
- Textproduktion (2)
- Ukrainisch (2)
- Verb (2)
- Wortbildung (2)
- Wortschatz (2)
- comparable corpora (2)
- conversation analysis (2)
- conversation-analytic transcription (2)
- corpus infrastructures (2)
- corpus-based (2)
- deduplication (2)
- directionality (2)
- minority languages (2)
- multilingual data (2)
- multilingual transcripts (2)
- multimodal analysis (2)
- online dictionaries (2)
- parallel corpora (2)
- spoken Arabic (2)
- systemisation (2)
- temporality (2)
- turn taking (2)
- varieties (2)
- Abendmahlsgottesdienst (1)
- Agens (1)
- Akkulturation (1)
- Akzent (1)
- Analyse (1)
- Angewandte Linguistik (1)
- Anpassung (1)
- Architektur (1)
- Argumentation (1)
- Augenfolgebewegung (1)
- Autochthon (1)
- Automatische Spracherkennung (1)
- Baltic States (1)
- Baltikum (1)
- Bautzen (1)
- Begegnung (1)
- Beiläufiges Schreiben (1)
- Benennung (1)
- Benutzeroberfläche (1)
- Berufliche Integration (1)
- Beschwerdebrief (1)
- Bibliothekskatalog (1)
- Bildung (1)
- Bildungsreform (1)
- Bildungssystem (1)
- Bilinguale Wörterbücher (1)
- Blickbewegung (1)
- Bürger (1)
- CMC (1)
- Communion (1)
- Computergestützte Lexikographie (1)
- Computerlingustik (1)
- Computerspiel (1)
- Computerunterstützte Lexikogaphie (1)
- Computerunterstützte Lexikografie (1)
- Computerunterstützte Lexikographie (1)
- Cyber-Mobbing (1)
- Datenschutz (1)
- Datenschutzrichtlinie (1)
- Demonstration (1)
- Dependenzgrammatik (1)
- Deutsch als Fremdsprache (1)
- Deutsch als Muttersprache (1)
- Deutschland <DDR> / Revolution <1989> (1)
- Deutschland. Bundesregierung. Presse- und Informationsamt (1)
- Didaktik (1)
- Digital Humanities (1)
- Digitalisierung (1)
- Diskriminierung (1)
- Diskursmarker (1)
- Distribution <Linguistik> (1)
- Doktorand (1)
- Eberhard-Schöck-Stiftung (1)
- Einbettung <Linguistik> (1)
- Einleitung (1)
- Einsprachigkeit (1)
- English (1)
- Entropie (1)
- Enzyklopädie (1)
- Episteme (1)
- Epistemic Stance (1)
- Erlebte Rede (1)
- Estland (1)
- Ethik (1)
- Ethnologie (1)
- Ethnomethodologie (1)
- Europa (1)
- Expressionismus (1)
- FAZ (1)
- Fachsprache (1)
- Fandrych, Christian (1)
- Festakt (1)
- Flexion (1)
- Font (1)
- Forschungsbericht (1)
- Forschungsdaten (1)
- Frage (1)
- Frame-Semantik (1)
- Fremd (1)
- Fremdsprachenunterricht (1)
- Fremdsprachige Textproduktion (1)
- Frendsprache (1)
- Gemüse (1)
- Gerichtetheit (1)
- German as foreign language (1)
- German as mother tongue (1)
- German lessons (1)
- German, Italian, Spanish (1)
- Geschichte 1989-1990 (1)
- Geschichtsschreibung (1)
- Geschlecht (1)
- Gesprochenes Arabisch (1)
- Gesprächsanalytische Transkription (1)
- Gesprächskorpus (1)
- Gesture (1)
- Globalisierung (1)
- Graphem (1)
- Graphemik (1)
- Hashtag (1)
- Hate Speech (1)
- Henning-Kaufmann-Stiftung (1)
- Heteroglossie (1)
- Hispanophone learners of GFL (1)
- Historische Sprachwissenschaft (1)
- Hochschullehre (1)
- Ideologie (1)
- Indirekte Rede (1)
- Information Retrieval (1)
- Informationsplattform (1)
- Interactional Linguistics (1)
- Interaktionslinguistik (1)
- Interaktivität (1)
- Interdisziplinäre Forschung (1)
- Interview (1)
- Islamischer Staat (1)
- Italienisch (1)
- Jensen-Shannon divergence (1)
- Journalismus (1)
- Journalistische Textproduktion (1)
- Kommunikationsgeschichte <Fach> (1)
- Kommunikationsverben (1)
- Kommunikative Grammatik (1)
- Kommunikatives Handeln (1)
- Komposition (1)
- Komposition <Wortbildung> (1)
- Kompositum <Wortbildung> (1)
- Kongressbericht (1)
- Konjunktiv I (1)
- Konrad-Duden-Preis 2018 (1)
- Konzeptgeschichte (1)
- Korpusaufbereitung (1)
- Korpuslinguistik (1)
- Kritische Diskursanalyse (1)
- Kräuter (1)
- Language Policy (1)
- Language attitudes (1)
- Languages in education (1)
- Latgalian (1)
- Latvia (1)
- Laudatio (1)
- Lehnwort (1)
- Lektürehinweis (1)
- Lexikografie (1)
- Liedtext (1)
- Lindenberg, Udo (1)
- Linked Data (1)
- Litauisch (1)
- Literarizität (1)
- Literaturunterricht (1)
- Literaturwissenschaft (1)
- Lithuanian (1)
- Logdatei (1)
- Logische Partikel (1)
- Manufacturing Consent (1)
- Maschinelles Lernen (1)
- Medien (1)
- Medienkonsum (1)
- Medienlinguistik (1)
- Mehrheit (1)
- Mehrsprachige Daten (1)
- Mehrsprachigkeit in Transkripten (1)
- Mehrworteinheit (1)
- Metadaten (1)
- Minderheit (1)
- Mobilität (1)
- Modalität (1)
- Modalpartikel (1)
- Modalverb (1)
- Morphem (1)
- Morphologie (1)
- Morphologie <Linguistik> (1)
- Multimedia (1)
- Mundart Russlanddeutsch (1)
- Museum (1)
- Muttersprache (1)
- N400 (1)
- Name (1)
- Naming (1)
- Narratives Interview (1)
- Nationalbewusstsein (1)
- Nationalsprache (1)
- Native speaker (1)
- Nebensatz (1)
- Neologismenlexikografie (1)
- Neue Medien (1)
- Neurolinguistik (1)
- New speakers (1)
- Niederdeutsch (1)
- Niederrhein (1)
- Noam Chomsky (1)
- Non-native speaker (1)
- O.K. (1)
- Objektsatz (1)
- Obst (1)
- Online-Ressource (1)
- Onlinelexikografie (1)
- P300 (1)
- Periphrase (1)
- Pfälzisch (1)
- Phonologie (1)
- Phraseologie (1)
- Poetizität (1)
- Polish (1)
- Political Correctness (1)
- Politiker (1)
- Politische Kommunikation (1)
- Politolinguistik (1)
- Polnisch (1)
- Popmusik (1)
- Poster (1)
- Pragmatik (1)
- Progressionsanalyse (1)
- Projektorkonstruktion (1)
- Projektseminar (1)
- Propaganda (1)
- Prosodie (1)
- Protestbewegung (1)
- Prototyp <Linguistik> (1)
- Präsident (1)
- Psycholinguistik (1)
- RDF <Informatik> (1)
- Rassismus (1)
- Raum (1)
- Rechtspopulismus (1)
- Redewiedergabe (1)
- Relativpronomen (1)
- Relativsatz (1)
- Revolution <1989> (1)
- Ritual (1)
- Rockmusik (1)
- Russian (1)
- SAT (1)
- Sakkade (1)
- Satz (1)
- Schreibvariante (1)
- Schriftsprache (1)
- Schriftsystem (1)
- Semantische Analyse (1)
- Simultanübersetzen (1)
- Sorbian (1)
- Sorbisch (1)
- Sozialkompetenz (1)
- Sozialtopografie (1)
- Soziolinguistik (1)
- Spanischsprachige DaF-Lernender (1)
- Spanisch–Deutsch (1)
- Spanish–German (1)
- Spoken German (1)
- Sport (1)
- Spracheinstellungen (1)
- Sprachgesellschaft (1)
- Sprachhandeln (1)
- Sprachinsel (1)
- Sprachkompetenz (1)
- Sprachkontakt (1)
- Sprachkultur (1)
- Sprachliche Minderheit (1)
- Sprachliche Universalien (1)
- Sprachliches Weltbild (1)
- Sprachtabu (1)
- Sprachunterricht (1)
- Sprachvariante (1)
- Sprachverarbeitung (1)
- Sprachverein (1)
- Sprachvermittlung (1)
- Srachvariante (1)
- Standardisierung (1)
- Standardsprache (1)
- Statistischer Test (1)
- Stichprobenumfang (1)
- Stickel, Gerhard (1)
- Straßenverkehr (1)
- Stuttgart 21 (1)
- Swahili (1)
- Swedish (1)
- TEI (1)
- Tattoo (1)
- Temporalität (1)
- Thematische Relation (1)
- Titling (1)
- Tourismus (1)
- Transdisziplinarität (1)
- Transdisziplinäre Forschung (1)
- Transitives Verb (1)
- Transkript (1)
- Translanguaging (1)
- Tschechisch (1)
- Tun (1)
- Tweet (1)
- Twitter <Softwareplattform> (1)
- Twittern (1)
- Typografie (1)
- Ukrainian (1)
- Ukrainian language (1)
- Ukrainian national identity (1)
- Ulrich, Engel (1)
- Uralische Sprachen (1)
- Urban Studies (1)
- User <Benutzer> (1)
- VDS (1)
- Valenzgrammatik (1)
- Varietäten (1)
- Verbalagression (1)
- Versdichtung (1)
- Verständlichkeit (1)
- Verwaltungssprache (1)
- Videointerview (1)
- Visualisierung (1)
- Vokal (1)
- Vorhersagbarkeit (1)
- Vorumaisch (1)
- Võro (1)
- Website (1)
- Werbeslogan (1)
- WhatsApp (1)
- Whatsapp (1)
- Wikipedia (1)
- Worthäufigkeit (1)
- Wortverbindung (1)
- Wörterbuchartikel (1)
- YouTube (1)
- ZDF (1)
- Zeitlichkeit (1)
- Zeitschrift (1)
- Zipf's law (1)
- Zipfsches Gesetz (1)
- Zivilgesellschaft (1)
- Zulu-Sprache (1)
- Zweisprachiges Wörterbuch (1)
- acceptability ratings (1)
- accounts (1)
- adaptive design (1)
- agent prominence (1)
- agent prototypicality (1)
- allemand parlé (1)
- analyse conversationnelle (1)
- analyse multimodale (1)
- announcements (1)
- architecture-for-interaction (1)
- bilingual dictionaries (1)
- causal adverbial connectives (1)
- celebrity video interview (1)
- cmc corpora (1)
- code-switching (1)
- communicative deviation (1)
- communicative failure (1)
- computerunterstützte Lexikographie (1)
- coordination (1)
- corpora of talk in interaction (1)
- corpora of talk-in-interaction (1)
- corpus frequencies (1)
- corpus linguistic methodology (1)
- corpus-based lexicography (1)
- deshalb (1)
- deswegen (1)
- dialog (1)
- dictionaries (1)
- digitization (1)
- discourse analysis (1)
- dual task (1)
- ethno-regionalism (1)
- ethnography (1)
- folk linguistics (1)
- foreshadowing (1)
- generalized divergence (1)
- generalized entropy (1)
- geschlechtergerechter Sprachgebrauch (1)
- gesprochene Sprache (1)
- gesture (1)
- globalization (1)
- heute-journal (1)
- interaction (1)
- interaction space (1)
- interactional linguistics (1)
- kalbos žanrai (1)
- kalbų sąveika (1)
- korpusbasierte Lexikografie (1)
- language attitudes (1)
- language complexity (1)
- language discourses (1)
- language learning (1)
- language policy (1)
- language regards (1)
- language teaching (1)
- language universals (1)
- learning motivation (1)
- lexicography (1)
- linguistic acculturation (1)
- linguistic diversity (1)
- linguistic landscape (1)
- linguistic niche hypothesis (1)
- log files (1)
- marqueurs de réponse (1)
- mediation talks (1)
- messenger communication (1)
- minority–majority relations (1)
- mixed-effects modeling (1)
- mobile Medien (1)
- mobile devices (1)
- modal verbs (1)
- monospaced font (1)
- multilingualism (1)
- multimodal interaction analysis (1)
- multimodal storytelling (1)
- multimodality (1)
- mundane technology use (1)
- narrative analysis (1)
- neologism lexicography (1)
- nepavykęs komunikacinis aktas (1)
- null-hypothesis testing (1)
- okay (1)
- ordinary conversation (1)
- post-soviet states (1)
- predictive processing (1)
- processing load (1)
- progression analysis (1)
- proportional font (1)
- prosody (1)
- quantitative approaches (1)
- reading speed (1)
- response tokens (1)
- sample size (1)
- sans-serif (1)
- semantic roles (1)
- sentence processing (1)
- serif (1)
- smartphones (1)
- social media storytelling (1)
- social topography (1)
- socio-spatial positioning (1)
- speech genres (1)
- speech planning (1)
- spoken language (1)
- standard (1)
- syntactic projection phenomena (1)
- task-evoked pupillary responses (1)
- text length (1)
- text production in a foreign language (1)
- text production in journalism (1)
- tourism (1)
- transdisciplinary research (1)
- ukrainiečių kalba (1)
- user interface (1)
- user-centred design (1)
- utterance (1)
- vaizdo pokalbiai su garsenybėmis (1)
- verbs of communication (1)
- video interview (1)
- visibility of ritual meaning (1)
- visualisation (1)
- vocabulary of quotation expressions (1)
- voice messages (1)
- vokiečių kalba (1)
- wikibasierte Wörterbücher (1)
- word predictability (1)
- worship (1)
- writing-by-the-way (1)
- відеоінтерв’ю (1)
- комунікативна невдача (1)
- українська мова (1)
- українська національна ідентичність (1)
Publicationstate
- Veröffentlichungsversion (137) (remove)
Reviewstate
- Peer-Review (75)
- (Verlags)-Lektorat (57)
Publisher
- Leibniz-Institut für Deutsche Sprache (IDS) (26)
- Leibniz-Institut für Deutsche Sprache (19)
- German Society for Computational Linguistics & Language Technology und Friedrich-Alexander-Universität Erlangen-Nürnberg (9)
- Lexical Computing CZ s.r.o. (6)
- Institut für Deutsche Sprache (5)
- de Gruyter (5)
- Heidelberg University Publishing (4)
- Narr (4)
- The Association for Computational Linguistics (4)
- Spektrum der Wissenschaft Verlagsgesellschaft (3)
Although the N400 was originally discovered in a paradigm designed to elicit a P300 (Kutas and Hillyard, 1980), its relationship with the P300 and how both overlapping event-related potentials (ERPs) determine behavioral profiles is still elusive. Here we conducted an ERP (N = 20) and a multiple-response speed-accuracy tradeoff (SAT) experiment (N = 16) on distinct participant samples using an antonym paradigm (The opposite of black is white/nice/yellow with acceptability judgment). We hypothesized that SAT profiles incorporate processes of task-related decision-making (P300) and stimulus-related expectation violation (N400). We replicated previous ERP results (Roehm et al., 2007): in the correct condition (white), the expected target elicits a P300, while both expectation violations engender an N400 [reduced for related (yellow) vs. unrelated targets (nice)]. Using multivariate Bayesian mixed-effects models, we modeled the P300 and N400 responses simultaneously and found that correlation between residuals and subject-level random effects of each response window was minimal, suggesting that the components are largely independent. For the SAT data, we found that antonyms and unrelated targets had a similar slope (rate of increase in accuracy over time) and an asymptote at ceiling, while related targets showed both a lower slope and a lower asymptote, reaching only approximately 80% accuracy. Using a GLMM-based approach (Davidson and Martin, 2013), we modeled these dynamics using response time and condition as predictors. Replacing the predictor for condition with the averaged P300 and N400 amplitudes from the ERP experiment, we achieved identical model performance. We then examined the piecewise contribution of the P300 and N400 amplitudes with partial effects (see Hohenstein and Kliegl, 2015). Unsurprisingly, the P300 amplitude was the strongest contributor to the SAT-curve in the antonym condition and the N400 was the strongest contributor in the unrelated condition. In brief, this is the first demonstration of how overlapping ERP responses in one sample of participants predict behavioral SAT profiles of another sample. The P300 and N400 reflect two independent but interacting processes and the competition between these processes is reflected differently in behavioral parameters of speed and accuracy.
Preface
(2019)
This paper aims at investigating the usage of present subjunctive (Konjunktiv I), which is traditionally labelled as a feature of standard written language and therefore as typically occurring in communication genres based on it such as press texts and reporting, in everyday spoken German. Through an analysis of corpus data performed according to theory and method of Interactional Linguistics and encompassing private, institutional and public interactional domains, the paper will show how this particular verb form expresses different epistemic stances according to its syntactic embedment.
This paper focuses on so called syntactic projection phenomena in the German language. This term from the German Gesprächsforschung is used to define the fact that an utterance or part of it foreshadows another one. This paper aims at pointing out how such projection phenomena are consciously exploited for rhethorical purposes. This will be observed on the basis of excerpts from the Stuttgart 21 mediation talks. The linguistic analysis carried out in this paper will focus on syntactic projection phenomena involving the use of causal adverbial connectives deshalb and deswegen.
In this paper, we describe a data processing pipeline used for annotated spoken corpora of Uralic languages created in the INEL (Indigenous Northern Eurasian Languages) project. With this processing pipeline we convert the data into a loss-less standard format (ISO/TEI) for long-term preservation while simultaneously enabling a powerful search in this version of the data. For each corpus, the input we are working with is a set of files in EXMARaLDA XML format, which contain transcriptions, multimedia alignment, morpheme segmentation and other kinds of annotation. The first step of processing is the conversion of the data into a certain subset of TEI following the ISO standard ’Transcription of spoken language’ with the help of an XSL transformation. The primary purpose of this step is to obtain a representation of our data in a standard format, which will ensure its long-term accessibility. The second step is the conversion of the ISO/TEI files to a JSON format used by the “Tsakorpus” search platform. This step allows us to make the corpora available through a web-based search interface. As an addition, the existence of such a converter allows other spoken corpora with ISO/TEI annotation to be made accessible online in the future.
As the Web ought to be considered as a series of sources rather than as a source in itself, a problem facing corpus construction resides in meta-information and categorization. In addition, we need focused data to shed light on particular subfields of the digital public sphere. Blogs are relevant to that end, especially if the resulting web texts can be extracted along with metadata and made available in coherent and clearly describable collections.
Speech planning is a sophisticated process. In dialog, it regularly starts in overlap with an incoming turn by a conversation partner. We show that planning spoken responses in overlap with incoming turns is associated with higher processing load than planning in silence. In a dialogic experiment, participants took turns with a confederate describing lists of objects. The confederate’s utterances (to which participants responded) were pre-recorded and varied in whether they ended in a verb or an object noun and whether this ending was predictable or not. We found that response planning in overlap with sentence-final verbs evokes larger task-evoked pupillary responses, while end predictability had no effect. This finding indicates that planning in overlap leads to higher processing load for next speakers in dialog and that next speakers do not proactively modulate the time course of their response planning based on their predictions of turn endings. The turn-taking system exerts pressure on the language processing system by pushing speakers to plan in overlap despite the ensuing increase in processing load.
Since 2013 representatives of several French and German CMC corpus projects have developed three customizations of the TEI-P5 standard for text encoding in order to adapt the encoding schema and models provided by the TEI to the structural peculiarities of CMC discourse. Based on the three schema versions, a 4th version has been created which takes into account the experiences from encoding our corpora and which is specifically designed for the submission of a feature request to the TEI council. On our poster we would present the structure of this schema and its relations (commonalities and differences) to the previous schemas.
Wie werden Wörter im Deutschen und im Englischen geschrieben? Wo sind Gemeinsamkeiten, wo sind Unterschiede? Diese Fragen werden aus morphologisch-graphematischer Perspektive bearbeitet. Es geht hier also nicht um Bezüge zwischen Schrift und Lautform (traditionell oft im Fokus der Graphematik), sondern um Korrespondenzen zwischen Schrift und Morphologie. Das betrifft zum einen den Aufbau von Morphemen. Welche Beschränkungen lassen sich hier für die Abfolge der Buchstaben formulieren? Was sind minimale, was sind prototypische Stämme und Affixe? Zum anderen geht es um Fragen der Einheitlichkeit (Wie uniform wird ein Morphem in der Schrift repräsentiert?) und der Eindeutigkeit (Wie distinkt verweist eine Schreibung auf ein Morphem?). Insgesamt zeigt sich, dass im Englischen eher Affixe verlässlich kodiert werden (oft eindeutig und einheitlich), während im Deutschen häufig Stämme einheitlich kodiert werden. Das sind zwei grundsätzlich unterschiedliche Strategien der Leseerleichterung.
Most authors agree that modal particles - a dass of function words widely considered characteristic of Modem German - cannot receive prosodic stress, though the reasons for this restriction have not yet been satisfactorily explained. This paper argues that unstressability follows from the general contribution of modal particles to compositional utterance meaning, which requires them to take scope over focus-background structures. Form and function of modal particle meanings are modelled and illustrated for five representative examples - the particles wohl, ja, eigentlich, eben and halt. It is argued that these as well as other particles, whenever they occur under prosodic stress, cannot preserve the meaning nor the syntactic behaviour of modal particles. All instances of stressed particles in German must therefore be categorized in other functional classes.
Modern theoretical linguistics lives by the insight that the meanings of complex expressions derive from the meanings of their parts and the way these are composed. However, the currently dominating theories of the syntax-semantics interface hastily relegate important aspects of meaning which cannot readily be aligned with visible structure to empty projecting heads non-reductively (mainstream Generative Grammar) or to the syntactic construction holistically (Construction Grammar). This book develops an alternative, compositional analysis of the hidden aspectual-temporal, modal and comparative meanings of a range of productive constructions of which pseudorefl exive, excessive and directional complement constructions take center stage. Accordingly, a contradiction-inducing hence semantically problematic part of literally coded meaning is locally ignored and systematically realized „expatriately“ with respect to parts of structure that achieve the indexical anchoring of propositional contents in terms of times, worlds and standards of comparison, thus yielding the observed hidden meanings.
This contribution presents a quantitative approach to speech, thought and writing representation (ST&WR) and steps towards its automatic detection. Automatic detection is necessary for studying ST&WR in a large number of texts and thus identifying developments in form and usage over time and in different types of texts. The contribution summarizes results of a pilot study: First, it describes the manual annotation of a corpus of short narrative texts in relation to linguistic descriptions of ST&WR. Then, two different techniques of automatic detection – a rule-based and a machine learning approach – are described and compared. Evaluation of the results shows success with automatic detection, especially for direct and indirect ST&WR.
In this paper, we present our work-inprogress to automatically identify free indirect representation (FI), a type of thought representation used in literary texts. With a deep learning approach using contextual string embeddings, we achieve f1 scores between 0.45 and 0.5 (sentence-based evaluation for the FI category) on two very different German corpora, a clear improvement on earlier attempts for this task. We show how consistently marked direct speech can help in this task. In our evaluation, we also consider human inter-annotator scores and thus address measures of certainty for this difficult phenomenon.
Wenn Menschen heute auf der Straße protestieren, sind immer auch digitale Medien im Spiel. Ob zur Mobilisierung oder Koordination, zur Vor- oder Nachbereitung: Proteste sind durch die sprachlichen Interaktionen und medialen Praktiken der Teilnehmenden bestimmt. Mark Dang-Anh widmet sich der situativen Protestkommunikation in digitalen Medien mit einem Fokus auf Interaktionen im Mikrobloggingdienst Twitter. Anhand zweier Falluntersuchungen von Protesten gegen rechte Aufmärsche analysiert er die vielschichtigen Relationen zwischen Sprache, Medien und der sozialen Praxis des Protestierens.
Nearly all of the very large corpora of English are “static”, which allows a wide range of one-time, pre-processed data, such as collocates. The challenge comes with large “dynamic” corpora, which are updated regularly, and where preprocessing is much more difficult. This paper provides an overview of the NOW corpus (News on the Web), which is currently 8.2 billion words in size, and which grows by about 170 million words each month. We discuss the architecture of NOW, and provide many examples that show how data from NOW can (uniquely) be extracted to look at a wide range of ongoing changes in English.
We propose a Cross-lingual Encoder-Decoder model that simultaneously translates and generates sentences with Semantic Role Labeling annotations in a resource-poor target language. Unlike annotation projection techniques, our model does not need parallel data during inference time. Our approach can be applied in monolingual, multilingual and cross-lingual settings and is able to produce dependencybased and span-based SRL annotations. We benchmark the labeling performance of our model in different monolingual and multilingual settings using well-known SRL datasets. We then train our model in a cross-lingual setting to generate new SRL labeled data. Finally, we measure the effectiveness of our method by using the generated data to augment the training basis for resource-poor languages and perform manual evaluation to show that it produces high-quality sentences and assigns accurate semantic role annotations. Our proposed architecture offers a flexible method for leveraging SRL data in multiple languages.
In an earlier publication it was claimed that there is no useful relationship between Swahili-English dictionary look-up frequencies and the occurrence frequencies for the same wordforms in Swahili-English corpora, at least not beyond the top few thousand wordforms. This result was challenged using data for German by a different team of researchers using an improved methodology. In the present article the original Swahili-English data is revisited, using ten years’ worth of it rather than just two, and using the improved methodology. We conclude that there is indeed a positive relationship. In addition, we show that online dictionary look-up behaviour is remarkably similar across languages, even when, as in our case, one is dealing with languages from very dissimilar language families. Furthermore, online dictionaries turn out to have minimum look-up success rates, below which they simply cannot go. These minima are language-sensitive and vary depending on the regularity of the searched-for entries, but are otherwise constant no matter the size of randomly sampled dictionaries. Corpus-informed sampling always improves on any random method. Lastly, from the point of view of the graphical user interface, we argue that the average user of an online bilingual dictionary is better served with a single search box, rather than separate search boxes for each dictionary side.
How do people communicate in mobile settings of interaction? How does mobility affect the way we speak? How does mobility exert influence on the manner in which talk itself is consequential for how we move in space? Recently, questions of this sort have attracted increasing attention in the human and social sciences. This Special Issue contributes to the emerging body of studies on mobility and talk by inspecting an ordinary and ubiquitous phenomenon in which communication among mobile participants is paramount: participation in traffic. This editorial presents previous work on mobility in natural settings, as carried out by interactionally oriented researchers. It also shows how the investigation into traffic participation adds new perspectives to research on language and communication.
Статтю присвячено дослідженню комунікативних невдач у мовленнєвому жанрі відеоінтерв’ю крізь призму української національної ідентичності. Визначено тематику, типи і жанрово-мовну специфіку українського відеоінтерв’ю як зразка діалогічного мовлення. Встановлено специфіку комунікативних невдач у цьому жанрі (зі спортсменами, політиками і культурними діячами) з огляду на позиції комунікантів, структурні рівні досліджуваного жанру та максими спілкування.
The article deals with communicative failures of journalists in “YouTube” celebrity video interviews in the Ukrainian and German linguacultures from the point of view of social interaction and the theory of speech genres at all structural levels of the communicative genre construction, establishing common and distinctive features in both linguacultures. The analysis made it possible to conclude that behind a language (speech) failure there is a violation caused by a journalist, a respondent, or an external noise.
Man muss glaube ich unterscheiden. Poetisch zu sein ist das eine, literarisch das andere. Das lässt sich auch auf die zugehörigen schwierigen Substantive beziehen, die Literarizität und die Poetizität. Und dann kann man auch über die poetische Funktion nachdenken, einem Postulat aus der Ergänzung der Bühler’schen Funktionstrias im Kontext des Prager Funktionalismus, die wir Roman Jakobson verdanken. Dass man unterscheiden muss, gilt vor allem oder auch noch mehr in einer nicht mehr (so) regelgeleiteten Moderne – und für eine sprachwissenschaftlich basierte Antwort.
We report on a new project building a Natural Language Processing resource for Zulu by making use of resources already available. Combining tagging results with the results of morphological analysis semi-automatically, we expect to reduce the amount of manual work when generating a finely-grained gold standard corpus usable for training a tagger. From the tagged corpus, we plan to extract verb-argument pairs with the aim of compiling a verb valency lexicon for Zulu.
Der vorliegende Beitrag setzt sich mit dem computergestützten Transkriptionsverfahren arabisch-deutscher Gesprächsdaten für interaktionsbezogene Untersuchungen auseinander. Zunächst werden wesentliche methodische Herausforderungen der gesprächsanalytischen Arbeit adressiert: Hinsichtlich der derzeitigen Korpustechnologie ermöglicht die Verwendung von arabischen Schriftzeichen in einem mehrsprachigen, bidirektionalen Transkript keine analysegerechte Rekonstruktion von Reziprozität, Linearität und Simultaneität sprachlichen Handelns. Zudem ist die Verschriftung von arabischen Gesprächsdaten aufgrund der unzureichenden (gesprächsanalytischen) Beschäftigung mit den standardfernen Varietäten und gesprochensprachlichen Phänomenen erschwert. Daher widmet sich der zweite Teil des Beitrags den bisher erarbeiteten und erprobten Lösungsansätzen ̶ einem stringenten, gesprächsanalytisch fundierten Transkriptionssystem für gesprochenes Arabisch.
The paper deals with the process of computer-aided transcription regarding Arabic-German data material for interaction-based studies. First of all, it sheds light upon some major methodological challenges posed by the conversation-analytic approaches: due to current corpus technology, the reciprocity, linearity, and simultaneity of linguistic activities cannot be reconstructed in an analytically proper way when using the Arabic characters in multilingual and bidirectional transcripts. The difficulty of transcribing Arabic encounters is also compounded by the fact that Spoken Arabic as well as its varieties and phenomena have not been standardised enough (for conversation-analytic purposes). Therefore, the second part of this paper is dedicated to preliminary, self-developed solutions, namely a systematic method for transcribing Spoken Arabic.
Lektürehinweis
(2019)
Das Handbuch ist eine periodische und mehrsprachige Online-Publikation. Die bisher veröffentlichten Bände wurden bereits über 8.500 Mal heruntergeladen. Für Leserinnen und Leser, die das haptische Leseerlebnis bevorzugen, ist die Publikation zudem im Printformat erhältlich. Zu ausgewählten Konzepten der Sprachkritik werden sukzessive enzyklopädische Artikel veröffentlicht, die ein sprachkritisches Schlüsselkonzept betreffen und die für die europäische Perspektive von kultureller Bedeutung sind. Das Ziel ist demnach, eine Konzeptgeschichte der europäischen Sprachkritik zu präsentieren. Zum einen liefert das Handbuch einen spezifischen Blick auf die jeweiligen Sprachkulturen. Zum anderen werden diese vergleichend in den Blick genommen.
Einleitung
(2019)
Der vorliegende Band „Sprachinstitutionen und Sprachkritik“ weist eine unmittelbare Verbindung zu den ersten drei Bänden unserer Handbuchreihe und der Frage auf, wie sich das viel diskutierte und diskursiv konstituierte Konzept der sprachlichen Normierung und Standardisierung einer Nationalsprache im Vergleich der Sprachkulturen entwickelt hat und wie es sich aktuell wandelt. Diese Gesichtspunkte lassen aufschlussreiche Verbindungen zum ersten Handbuchband „Sprachnormierung und Sprachkritik“ erkennen, aber auch zum zweiten („Standardisierung und Sprachkritik“) und zum dritten Handbuchband („Sprachpurismus und Sprachkritik“).
Einleitung
(2019)
Die lexikografische Behandlung von Neologismen aus der Perspektive hispanophoner DaF-Lernender
(2019)
Anhand von einigen medialen Kommunikationsverben wie mailen oder twittern wird das lexikografische Informationsangebot zu Neologismen auf seine Adäquatheit für die fremdsprachige Produktion untersucht. Die Untersuchung erfolgt aus der Perspektive eines spanischsprachigen DaF-Lernenden. Zur Analyse werden sowohl Neologismenwörterbücher und -datenbanken für das Deutsche als auch gängige, bilinguale Online-Wörterbücher für das Sprachenpaar Spanisch–Deutsch gezogen. Die Ergebnisse der lexikografischen Untersuchung werden exemplarisch mit korpusbasierten Daten aus einer Doktorarbeit verglichen. Die Befunde zeigen den Bedarf und die Notwendigkeit auf, die lexikografische Behandlung von (verbalen) Neologismen im spanisch–deutschen Kontext zu optimieren. Dabei soll — insbesondere — die fremdsprachige Textproduktion berücksichtigt werden.
Text corpora come in many different shapes and sizes and carry heterogeneous annotations, depending on their purpose and design. The true benefit of corpora is rooted in their annotation and the method by which this data is encoded is an important factor in their interoperability. We have accumulated a large collection of multilingual and parallel corpora and encoded it in a unified format which is compatible with a broad range of NLP tools and corpus linguistic applications. In this paper, we present our corpus collection and describe a data model and the extensions to the popular CoNLL-U format that enable us to encode it.
Für das Deutsche gibt es keine staatliche Institution, abgesehen vom Rat für deutsche Rechtschreibung, die den politischen Auftrag hat, sprachliche Normierung und schließlich die Standardisierung einer Nationalsprache legitimiert zu begleiten. Vor diesem Hintergrund werden in dem Artikel die verschiedenen nichtstaatlichen Sprachinstitutionen im Deutschen dargelegt. Im deutschen Sprachraum waren die Sprachakademien der Nachbarländer stets Vorbild, herausgebildet haben sich aber vor allem Sprachgesellschaften und Sprachvereine, die sich sprachkultivierend einbrachten.
Im vorliegenden Artikel werden einleitend Gegenstand, Fragestellung und Ziele einer Studie zu „absoluten“ Verwendungen von Modalverben in verbaler Interaktion vorgestellt, gefolgt von Bemerkungen zu Forschungskontext, Theorie, Methodik und Datengrundlage. Ergebnisse der Untersuchung werden unter drei Perspektiven präsentiert: Erstens geht es um Modalverbverwendungen, die sich in der Forschung zwischen Vollverbund Ellipsenerklärungen verorten, zweitens um Strukturen mit (grammatischen) Kontextbezügen, drittens um Konstruktionen und usuelle Handlungsformate. Den Abschluss bilden eine Diskussion der Befunde und ein Ausblick auf Vermittlungspotenziale interaktionslinguistischer Befunde im Bereich Deutsch als Fremdsprache.