Refine
Year of publication
Document Type
- Part of a Book (581)
- Conference Proceeding (561)
- Article (453)
- Book (66)
- Working Paper (26)
- Doctoral Thesis (21)
- Other (18)
- Part of Periodical (12)
- Preprint (12)
- Contribution to a Periodical (6)
Language
- English (1765) (remove)
Keywords
- Korpus <Linguistik> (416)
- Deutsch (410)
- Computerlinguistik (161)
- Konversationsanalyse (138)
- Interaktion (116)
- Englisch (112)
- Annotation (97)
- Gesprochene Sprache (93)
- Automatische Sprachanalyse (75)
- Wörterbuch (73)
- Semantik (63)
- German (61)
- Forschungsdaten (54)
- Natürliche Sprache (53)
- Syntax (52)
- Online-Wörterbuch (50)
- Computerunterstützte Lexikographie (49)
- Grammatik (47)
- Lexikografie (47)
- Mehrsprachigkeit (46)
- Verb (44)
- conversation analysis (42)
- Kommunikation (40)
- Neologismus (39)
- Datenmanagement (37)
- Kognitive Linguistik (37)
- Metadaten (35)
- Digital Humanities (34)
- Maschinelles Lernen (34)
- Sprachstatistik (34)
- Polnisch (33)
- Sprachpolitik (33)
- Fremdsprachenlernen (32)
- Information Extraction (32)
- Multimodalität (31)
- Prosodie (29)
- corpus linguistics (28)
- Kontrastive Linguistik (27)
- Lexikographie (27)
- Standardisierung (27)
- Lehnwort (26)
- Minderheitensprache (26)
- Sprachkontakt (26)
- Sprachvariante (26)
- Text Mining (26)
- Diskursanalyse (25)
- Sprecherwechsel (24)
- XML (24)
- Französisch (23)
- Soziolinguistik (23)
- Morphologie <Linguistik> (22)
- Pragmatik (22)
- Sprachdaten (22)
- Sprachgebrauch (22)
- Sprachwandel (21)
- COVID-19 (20)
- Infrastruktur (20)
- Psycholinguistik (20)
- Syntaktische Analyse (20)
- CLARIN (19)
- Computerunterstützte Kommunikation (19)
- Interaktionsanalyse (19)
- Semantische Analyse (19)
- Argumentstruktur (18)
- Corpus linguistics (18)
- Datensatz (18)
- Russisch (18)
- Social Media (18)
- Text Encoding Initiative (18)
- Wortbildung (18)
- Wortschatz (18)
- Automatische Spracherkennung (17)
- Europa (17)
- Metapher (17)
- Terminologie (17)
- Internet (16)
- Transkription (16)
- Automatische Sprachverarbeitung (15)
- Computerunterstützte Lexikografie (15)
- Forschung (15)
- Frame-Semantik (15)
- Kollokation (15)
- Sentimentanalyse (15)
- Texttechnologie (15)
- Urheberrecht (15)
- Wortstellung (15)
- Zweisprachiges Wörterbuch (15)
- computerunterstützte Lexikographie (15)
- Datenbank (14)
- Forschungsmethode (14)
- Gespräch (14)
- Information Retrieval (14)
- Mundart (14)
- Phonetik (14)
- Phraseologie (14)
- Semasiologie (14)
- Sprachgeschichte (14)
- Experimentelle Psychologie (13)
- Historische Sprachwissenschaft (13)
- Pandemie (13)
- Phonologie (13)
- Soziale Wahrnehmung (13)
- Spracherwerb (13)
- Thematische Relation (13)
- Worthäufigkeit (13)
- interactional linguistics (13)
- lexicography (13)
- Corpus technology (12)
- Deutsches Referenzkorpus (DeReKo) (12)
- Dialog (12)
- Körpersprache (12)
- Methodologie (12)
- Negation (12)
- Sprache (12)
- Visualisierung (12)
- gesprochene Sprache (12)
- Auszeichnungssprache (11)
- Beleidigung (11)
- Enzyklopädie (11)
- Ethnolinguistik (11)
- Head-driven phrase structure grammar (11)
- Kognitive Grammatik (11)
- Lebensmittel (11)
- Linguistic Landscape (11)
- Linguistik (11)
- Polarität (11)
- Propositionale Einstellung (11)
- Recht (11)
- Rumänisch (11)
- Sprachliche Minderheit (11)
- Sprachwechsel (11)
- Textlinguistik (11)
- Verstehen (11)
- metadata (11)
- sentiment analysis (11)
- Augenfolgebewegung (10)
- Bedeutung (10)
- Daten (10)
- Datenschutz (10)
- Diskurs (10)
- Einsprachiges Wörterbuch (10)
- Elektronisches Wörterbuch (10)
- Formale Semantik (10)
- Identität (10)
- Italienisch (10)
- Kasus (10)
- Kongress (10)
- Langzeitarchivierung (10)
- Paronym (10)
- Polish (10)
- Prädikat (10)
- Schriftsprache (10)
- Sprachverarbeitung (10)
- TEI (10)
- Valenz <Linguistik> (10)
- language policy (10)
- prosody (10)
- spoken German (10)
- Akzent (9)
- Digitalisierung (9)
- Frage (9)
- Germanische Sprachen (9)
- Kolonialismus (9)
- Kontrastive Grammatik (9)
- Korpuslinguistik (9)
- Nominalphrase (9)
- Präposition (9)
- Raum (9)
- Spanisch (9)
- Standardsprache (9)
- Tempus (9)
- Tschechisch (9)
- language contact (9)
- multimodality (9)
- research infrastructure (9)
- Übersetzung (9)
- Artikulation (8)
- Automatische Textanalyse (8)
- Benutzer (8)
- Blickbewegung (8)
- CMC (8)
- Data Mining (8)
- Datenanalyse (8)
- Datenqualität (8)
- Dialektologie (8)
- French (8)
- HPSG (8)
- Imperativ (8)
- Institut für Deutsche Sprache <Mannheim> (8)
- Intersubjektivität (8)
- Kempelen, Wolfgang von (8)
- Lettland (8)
- Lexikon (8)
- Methode (8)
- Morphologie (8)
- Neue Medien (8)
- Parser (8)
- Politische Sprache (8)
- Pronomen (8)
- Rezension (8)
- Segmentierung (8)
- Software (8)
- Sprachproduktion (8)
- Sprachverarbeitung <Psycholinguistik> (8)
- Sprechakt (8)
- Textkorpus (8)
- Twitter <Softwareplattform> (8)
- Wikipedia (8)
- YouTube (8)
- Zeit (8)
- Zweisprachigkeit (8)
- corpora (8)
- neologisms (8)
- Bildung (7)
- Biografie (7)
- Computergestützte Lexikographie (7)
- Datenbanksystem (7)
- Datenschutz-Grundverordnung (7)
- Deutschland (7)
- Ethnopsychologie (7)
- Finnisch (7)
- FrameNet (7)
- Instruktion (7)
- Interoperabilität (7)
- Jugendsprache (7)
- Konstruktionsgrammatik (7)
- Korrektur (7)
- Large corpora (7)
- Latgalian (7)
- Lemma (7)
- Lettisch (7)
- Längsschnittuntersuchung (7)
- Maschinelle Übersetzung (7)
- Mehrworteinheit (7)
- Morphosyntax (7)
- N400 (7)
- Online-Ressource (7)
- Ontologie <Wissensverarbeitung> (7)
- Partikel (7)
- Smartphone (7)
- Synonym (7)
- Temporalität (7)
- Textverstehen (7)
- Theater (7)
- Variation (7)
- Wissenschaftssprache (7)
- corpus analysis (7)
- language attitudes (7)
- language resources (7)
- machine learning (7)
- natural language processing (7)
- online dictionary (7)
- Adjektiv (6)
- Ambiguität (6)
- Antwort (6)
- Artikulatorische Phonetik (6)
- Aspekt <Linguistik> (6)
- Benutzerforschung (6)
- Benutzeroberfläche (6)
- Computerunterstützte Lexikogaphie (6)
- Computerunterstütztes Verfahren (6)
- Corpus annotation (6)
- Datenformat (6)
- Empirische Linguistik (6)
- Ethnomethodologie (6)
- Europäische Union (6)
- Experiment (6)
- Fachsprache (6)
- Fahrschule (6)
- Fallstudie (6)
- Forschungsinfrastruktur (6)
- Grammis (6)
- Griechisch (6)
- Handlung (6)
- Historische Lexikografie (6)
- Kontrastive Syntax (6)
- Korpusanalyseplattform (KorAP) (6)
- Mobiles Endgerät (6)
- Name (6)
- Neurolinguistik (6)
- Non-native speaker (6)
- Objektsatz (6)
- Opinion Mining (6)
- Personenbezogene Daten (6)
- Politische Kommunikation (6)
- Rechtschreibung (6)
- Repository <Informatik> (6)
- Soziale Identität (6)
- Soziales Handeln (6)
- Sozialwissenschaften (6)
- Sprachtypologie (6)
- Theaterprobe (6)
- Tonhöhe (6)
- Türkisch (6)
- USA (6)
- Ungarisch (6)
- Videoaufzeichnung (6)
- annotation (6)
- automatische Sprachproduktion (6)
- comparable corpora (6)
- computer-mediated communication (6)
- corpus (6)
- corpus processing (6)
- early responses (6)
- morphology (6)
- multilingualism (6)
- multimodal analysis (6)
- online dictionaries (6)
- video data (6)
- web corpora (6)
- word embeddings (6)
- API (5)
- Abfragesprache (5)
- Algorithmus (5)
- Althochdeutsch (5)
- Anapher <Syntax> (5)
- Audiovisuelles Material (5)
- Aufforderung (5)
- Bibliografie (5)
- Bibliographie (5)
- Compterunterstützte Lexikographie (5)
- Concurrent Markup/Overlap (5)
- Conversation Analysis (5)
- Conversation analysis (5)
- Corpus management (5)
- Datenmodell (5)
- Datenstruktur (5)
- Datenverarbeitung (5)
- Deutsche (5)
- Empirische Forschung (5)
- English (5)
- Entlehnung (5)
- Ergänzung <Linguistik> (5)
- Erwartung (5)
- Familie (5)
- Fremdsprache (5)
- Gefühl (5)
- Geschlechtergerechte Sprache (5)
- Gesellschaft (5)
- Gesprächsanalyse (5)
- Globalisierung (5)
- Grammatikalisation (5)
- Konferenz (5)
- Kontrastive Pragmatik (5)
- Kooperation (5)
- Kopulasatz (5)
- Lasisch (5)
- Latein (5)
- Latvia (5)
- Lettgallen (5)
- Lexikalische Semantik (5)
- Linguist (5)
- Linguistische Datenverarbeitung (5)
- Lyrik / Lyrik (5)
- Massenmedien (5)
- Mehrsprachiges Wörterbuch (5)
- Modalverb (5)
- National corpus (5)
- Negativer Polaritätsausdruck (5)
- Niederländisch (5)
- Nominalkompositum (5)
- O.K. (5)
- Open Source (5)
- Parlamentsdebatte (5)
- Phraseologismus (5)
- Portugiesisch (5)
- Prosody (5)
- Psychotherapie (5)
- Regionalsprache (5)
- Soziale Norm (5)
- Spaltsatz (5)
- Sprachgeografie (5)
- Sprachhandeln (5)
- Sprachunterricht (5)
- Statistik (5)
- Stereotyp (5)
- Strukturbaum (5)
- Südkaukasische Sprachen (5)
- Text Encoding Initiative (TEI) (5)
- Textanalyse (5)
- Textproduktion (5)
- Verständlichkeit (5)
- Vokal (5)
- Web Services (5)
- Zulu-Sprache (5)
- accountability (5)
- action formation (5)
- agency (5)
- agentivity (5)
- argument structure (5)
- copyright (5)
- corpus annotation (5)
- dictionary use (5)
- historische Phonetik (5)
- intersubjectivity (5)
- language learning (5)
- minority language (5)
- phonetics (5)
- semantic similarity (5)
- semantics (5)
- spoken language (5)
- survey (5)
- syntax (5)
- transcription (5)
- Abweichung (4)
- Akustische Phonetik (4)
- Angewandte Linguistik (4)
- Antonym (4)
- Archivierung (4)
- Baltikum (4)
- Bantusprachen (4)
- Bedeutungswandel (4)
- Beschimpfung (4)
- Bewegung (4)
- Bias (4)
- CLARIAH-DE (4)
- CLARIN-D (4)
- Chatten <Kommunikation> (4)
- Czech (4)
- Datensammlung (4)
- Dativ (4)
- Definition (4)
- Direktiv (4)
- Distribution <Linguistik> (4)
- Einbettung <Linguistik> (4)
- Erzählforschung (4)
- Ethik (4)
- Etymologie (4)
- Fehleranalyse (4)
- Fokus <Linguistik> (4)
- Fremdsprachenunterricht (4)
- Fußball (4)
- Geisteswissenschaften (4)
- German language (4)
- Gestik (4)
- Handlungsstruktur <Literatur> (4)
- Hypertext (4)
- Informationsmanagement (4)
- Informationstheorie (4)
- Intonation <Linguistik> (4)
- Isländisch (4)
- Kind (4)
- Komposition <Wortbildung> (4)
- Kontrastive Morphologie (4)
- Konversation (4)
- Kulturkontakt (4)
- Künstliche Intelligenz (4)
- Latin (4)
- Leibniz-Institut für Deutsche Sprache (IDS) (4)
- Metadatenmodell (4)
- Migration (4)
- Mikrozensus (4)
- Mittelhochdeutsch (4)
- Morphem (4)
- Multimodal interaction (4)
- Muttersprache (4)
- Nationalsozialismus (4)
- Natural Language Processing (4)
- Nichtverbale Kommunikation (4)
- Nominalisierung (4)
- Open Science (4)
- Optimalitätstheorie (4)
- Partikelverb (4)
- Patient (4)
- Pidgin (4)
- Planung (4)
- Polysemie (4)
- Privatsphäre (4)
- Proposition (4)
- Präsidentenwahl (4)
- Russland (4)
- Sakkade (4)
- Schwedisch (4)
- Sentiment Analysis (4)
- Slawische Sprachen (4)
- Sportsprache (4)
- Sprachkompetenz (4)
- Sprachverstehen (4)
- Straßenverkehr (4)
- Subjektivität (4)
- Suchmaschine (4)
- Syntaktische Kongruenz (4)
- Technische Infrastruktur (4)
- Textsorte (4)
- Ukrainisch (4)
- Unserdeutsch (4)
- Vergleichende politische Wissenschaft (4)
- Videospiel (4)
- Wort (4)
- Wortlänge (4)
- Wortverbindung (4)
- XML (Extensible Markup Language) (4)
- Zulu (4)
- Zustandsverb (4)
- abusive language (4)
- author name disambiguation (4)
- cognitive linguistics (4)
- colonialism (4)
- corpus management (4)
- discourse analysis (4)
- formulations (4)
- historical lexicography (4)
- historische Lexikographie (4)
- inference (4)
- information theory (4)
- infrastructure (4)
- instructions (4)
- interaction (4)
- language (4)
- language change (4)
- language complexity (4)
- legal issues (4)
- linguistic diversity (4)
- methodology (4)
- multimodal interaction (4)
- oral corpora (4)
- personal data (4)
- phraseology (4)
- prediction (4)
- reply relations (4)
- rules (4)
- sequentiality (4)
- social interaction (4)
- software (4)
- stereotypes (4)
- syllable prominence (4)
- time series analysis (4)
- turn-taking (4)
- youth language (4)
- 17th century (3)
- Adverb (3)
- Afrikanische Sprachen (3)
- Agens (3)
- Anonymisierung (3)
- Argumentation (3)
- Arzt (3)
- Augenbewegung (3)
- Aussprache (3)
- Bibliothekskatalog (3)
- Bildungspolitik (3)
- British English (3)
- Bulgarian (3)
- Bulgarisch (3)
- CLARIAH (3)
- CMDI (3)
- Cluster <Datenanalyse> (3)
- CoRoLa (3)
- Coaching (3)
- Component MetaData Infrastructure (CMDI) (3)
- Component Metadata Infrastructure (CMDI) (3)
- Computerspiel (3)
- Computerunterstütztes Informationssystem (3)
- Computerunterstütztes Lernen (3)
- Computeruntertützte Lexikographie (3)
- Corpus query language (3)
- Culture (3)
- Datenerhebung (3)
- Debatte (3)
- Deixis (3)
- Deklination (3)
- Dictionary use (3)
- Digitale Sprachressourcen (3)
- Diphthong (3)
- Direkte Rede (3)
- Diskriminierung (3)
- Diskursmarker (3)
- Dokumentation (3)
- Elektronische Publikation (3)
- Ellipse <Linguistik> (3)
- Entropie (3)
- Epistemics (3)
- Epistemische Logik (3)
- Estland (3)
- Estonia (3)
- Ethnizität (3)
- Ethnologie (3)
- Europäisierung (3)
- Evaluation (3)
- Feldforschung (3)
- Festschrift (3)
- Food item (3)
- Formale Sprache (3)
- Forschungseinrichtung (3)
- Forschungsprojekt (3)
- Frage-Antwort-System (3)
- Frame semantics (3)
- Freezing principle (3)
- GDPR (3)
- Gebärdensprache (3)
- Germanic (3)
- Geschlechterstereotyp (3)
- Gesture (3)
- Gleichberechtigung (3)
- Glossar (3)
- Grammar (3)
- Grammatiktheorie (3)
- Grassfields Bantu (3)
- Historical lexicography (3)
- Historische Phonetik (3)
- Höheres Bildungswesen (3)
- Hörgerät (3)
- Hörschädigung (3)
- ISO-Norm (3)
- Icelandic (3)
- Informationsstruktur (3)
- Informationssystem (3)
- Informationsverarbeitung (3)
- Interactional Linguistics (3)
- Interdisziplinarität (3)
- Isolationismus (3)
- Japanisch (3)
- Jugendlicher (3)
- Kognition (3)
- Kognitive Semantik (3)
- Kompositum (3)
- Konditional (3)
- Konditionalsatz (3)
- Konflikt (3)
- Kongressbericht (3)
- Konjunktion (3)
- KorAP (3)
- Kreolische Sprachen (3)
- Kroatisch (3)
- Language attitude (3)
- Lautschrift (3)
- Lautstärke (3)
- Lernerwörterbuch (3)
- Lexikogaphie (3)
- Lexikologie (3)
- Lexikostatistik (3)
- Linguistische Informationswissenschaft (3)
- Lokativ (3)
- Lächeln (3)
- Mannheim (3)
- Medien (3)
- Mediensprache (3)
- Medizin (3)
- Meinungsäußerung (3)
- Metadata (3)
- Minderheit (3)
- Modality (3)
- Modalität (3)
- Modalität <Linguistik> (3)
- Modus (3)
- Morphemanalyse (3)
- Morphology (3)
- Nachhaltigkeit (3)
- Named Entity Recognition (3)
- Native speaker (3)
- Natural language processing (3)
- Niederdeutsch (3)
- Normung (3)
- Northern Sotho (3)
- Nutzungsrecht (3)
- Online-Datenbank (3)
- Open Access (3)
- Open Data (3)
- P300 (3)
- Partizipation (3)
- Pedi-Sprache (3)
- Peer-Group (3)
- Pennsylvaniadeutsch (3)
- Personalpronomen (3)
- Perspektivität (3)
- Pidgin-Sprachen (3)
- Plurizentrische Sprache (3)
- Positionsverb (3)
- Programmiersprache (3)
- Progressiv (3)
- Prosa (3)
- Prototyp <Linguistik> (3)
- Quantitative Analyse (3)
- Quantitative Methode (3)
- Raumvorstellung (3)
- Rechtssprache (3)
- Rechtsstellung (3)
- Regel (3)
- Regressionsanalyse (3)
- Relation extraction (3)
- Rezipient (3)
- Richtlinie (3)
- Routinearbeit (3)
- Semantic Web (3)
- Semantische Relation (3)
- Semantisches Netz (3)
- Sequentialanalyse (3)
- Slowenisch (3)
- Sorbisch (3)
- Soziolekt (3)
- Spielregel (3)
- Sprachanalyse (3)
- Sprachnorm (3)
- Sprachstil (3)
- Sprichwort (3)
- Statistischer Test (3)
- Stimmgebung (3)
- Suffix (3)
- Südpazifik (3)
- Tagebuch (3)
- Technologie (3)
- Textgestaltung (3)
- Textverarbeitung (3)
- Textverarbeitung <Psycholinguistik> (3)
- Tourismus (3)
- Twitter (3)
- Ukrainian language (3)
- Verantwortlichkeit (3)
- Verbalphrase (3)
- Vergangenheitstempus (3)
- Verkehrssprache (3)
- Veröffentlichung (3)
- Vorhersagbarkeit (3)
- Vorurteil (3)
- Web corpora (3)
- Website (3)
- Zeitung (3)
- Zipfsches Gesetz (3)
- Zusammenkunft (3)
- abstractness (3)
- acoustic correlates (3)
- action ascription (3)
- agent prominence (3)
- agreement (3)
- animacy (3)
- articulation (3)
- aspect (3)
- collocations (3)
- commitment (3)
- computational models of narrative (3)
- constructicography (3)
- construction grammar (3)
- context (3)
- corpus infrastructures (3)
- corpus linguistic methodology (3)
- corpus-based (3)
- cross-language differences (3)
- dictionary (3)
- dictionary of neologisms (3)
- digital humanities (3)
- digital research infrastructure (3)
- digitale Infrastruktur (3)
- eLexiko (3)
- electronic lexicography (3)
- expectancy violations (3)
- eye movements (3)
- formal semantics (3)
- gender linguistics (3)
- gender-fair language (3)
- grammaticalization (3)
- graph database (3)
- hearing aid use (3)
- hearing impairment (3)
- heritage language (3)
- humanities (3)
- identity (3)
- impression formation (3)
- instruction (3)
- interactional history (3)
- interoperability (3)
- kontrastive Linguistik (3)
- language documentation (3)
- language models (3)
- language planning (3)
- language politics (3)
- language processing (3)
- language technology (3)
- large corpora (3)
- learning (3)
- lexical borrowings (3)
- lexical data (3)
- lexicon (3)
- linguistic research software (3)
- linked data (3)
- loanword (3)
- modality (3)
- multilingual lexicography (3)
- multimodal (3)
- multiword expressions (3)
- neologism (3)
- nonnative accents (3)
- null-hypothesis testing (3)
- online lexicography (3)
- online resources (3)
- paronyms (3)
- pitch range (3)
- pitch variation (3)
- planning (3)
- policy convergence (3)
- positioning (3)
- pragmatics (3)
- projection (3)
- prominence (3)
- psychotherapy (3)
- quantitative approaches (3)
- questions (3)
- reading (3)
- recipient design (3)
- request (3)
- requests (3)
- research data (3)
- research data management (3)
- research into dictionary use (3)
- reusability (3)
- semantic roles (3)
- semantic web (3)
- sentience (3)
- smartphone use (3)
- speech (3)
- speech acts (3)
- spoken language corpora (3)
- spoken language data (3)
- standardization (3)
- tense (3)
- terminology (3)
- tokenization (3)
- transition (3)
- treebanks (3)
- understanding (3)
- variability (3)
- word formation (3)
- word frequency (3)
- word order (3)
- word structure (3)
- Übersetzungswissenschaft (3)
- ASR (2)
- Abfrage (2)
- Ableitung <Linguistik> (2)
- Affirmativer Polaritätsausdruck (2)
- Agency-Theorie (2)
- Akkusativ (2)
- Akustik (2)
- Allomorph (2)
- Amazonas (2)
- Amerikanismus (2)
- Amondawa-Sprache (2)
- Amtssprache (2)
- Anpassung (2)
- Apokope (2)
- Appellativum (2)
- Arabic (2)
- Arabisch (2)
- Archiv für Gesprochenes Deutsch (AGD) (2)
- Argument <Linguistik> (2)
- Articulography (2)
- Aspekt (2)
- Attributsatz (2)
- Auftritt (2)
- Ausgrenzung (2)
- Aussagesatz (2)
- Autofahren (2)
- Automatic recognition of speech (2)
- Automatische Sprachproduktion (2)
- Automatisches Beweisverfahren (2)
- Autor (2)
- BNC (2)
- Baltic States (2)
- Baltische Sprachen (2)
- Begegnung (2)
- Benutzerverhalten (2)
- Bericht (2)
- Beurteilung (2)
- Bewegungsverb (2)
- Bibliografische Daten (2)
- Bibliothek (2)
- Bildungssystem (2)
- Biographie (2)
- Blickkontakt (2)
- Bologna-Prozess (2)
- Brasilien (2)
- CELEX (2)
- COHA (2)
- Case (2)
- Categories of PSMs (2)
- Chinesisch (2)
- Church Slavonic (2)
- Clarin (2)
- Compterunterstützte Lexikografie (2)
- Computational linguistics (2)
- Computer-Mediated Communication (2)
- Computer-mediated communication (2)
- Computerlingustik (2)
- Constraint-Erfüllung (2)
- Conversational alignment (2)
- Copyright (2)
- Corpora (2)
- Corpus Linguistics (2)
- Creative Commons (2)
- DARIAH (2)
- DMC (2)
- Dank (2)
- Decision Trees (2)
- Dekomposition (2)
- Denken (2)
- Dependenzgrammatik (2)
- Deskriptive Linguistik (2)
- Deutsch als Fremdsprache (2)
- Deutschland <Bundesrepublik> (2)
- Deutschland. Deutscher Bundestag (2)
- Dezentralisation (2)
- Dialectology (2)
- Dictionaries (2)
- Didaktik (2)
- Digitale Kommunikation (2)
- Disambiguierung (2)
- Diskurssemantik (2)
- Dokumentenserver (2)
- Dortmunder Chat-Korpus (2)
- E-Learning (2)
- EFNIL (2)
- ERP (2)
- Einbettungssatz <Linguistik> (2)
- Einleitung (2)
- Einsprachigkeit (2)
- Einstellung (2)
- Einwanderer (2)
- Elektronische Bibliothek (2)
- Elektronisches Forum (2)
- Elektrophysiologie (2)
- Emotion (2)
- Empfehlungssystem (2)
- Empirical research (2)
- Empowerment (2)
- Entscheidungsbaum (2)
- Entscheidungsfrage (2)
- Estnisch (2)
- Europäische Föderation Nationaler Sprachinstitutionen (EFNIL) (2)
- Europäische Sportkonferenz (2)
- Europäische Union : Datenschutz-Grundverordnung (2)
- Evaluation methodologies (2)
- FAIR data principles (2)
- Fahrstunde (2)
- Faux amis (2)
- Fernsehsendung (2)
- Fertigkeit (2)
- Flexion (2)
- Folgerung (2)
- Font (2)
- Formalisierung (2)
- Formulierung (2)
- Frame-Theorie (2)
- Framing-Effekt (2)
- Frequency (2)
- Frequenz (2)
- Friesisch (2)
- Frühneuhochdeutsch (2)
- Funktionale Grammatik (2)
- Futur (2)
- Färöisch (2)
- Gedächtnis (2)
- Gefangenenliteratur (2)
- Gefühlsverb (2)
- Geistiges Eigentum (2)
- Generalized additive modeling (2)
- Generalversammlung (2)
- Generative Semantik (2)
- Genitiv (2)
- Genitive Classification (2)
- Genus (2)
- Geoinformationssystem (2)
- GermaNet (2)
- German Americans (2)
- German as a foreign language (2)
- German-based (2)
- Geschlechterforschung (2)
- Gesprächsforschung (2)
- Geste (2)
- Gleichheit (2)
- Google Books Ngram corpora (2)
- Google Ngram Corpora (2)
- Graded tense (2)
- Gruppenidentität (2)
- Gälisch-Schottisch (2)
- Haftung (2)
- Hamlet (2)
- Hausa-Sprache (2)
- Hebrew (2)
- Hebräisch (2)
- Helfen (2)
- Hilfsverb (2)
- Historische Soziolinguistik (2)
- Hochlettisch (2)
- Hochschulbildung (2)
- Honesty (2)
- ISO (2)
- Ideologie (2)
- Imperfekt (2)
- Implementation (2)
- Indirect speech (2)
- Indirekte Rede (2)
- Infinitkonstruktion (2)
- Information Science (2)
- Inklusion <Soziologie> (2)
- Institut für Deutsche Sprache (2)
- Integration (2)
- Intention (2)
- Interactional history (2)
- Interactional linguistics (2)
- Interaktionale Linguistik (2)
- Interferenz <Linguistik> (2)
- Internationale Politik (2)
- Interoperability (2)
- Interpretation (2)
- Intransitives Verb (2)
- Jensen-Shannon divergence (2)
- Jugend (2)
- Kategorialgrammatik (2)
- Kategorisierung (2)
- Kausalität (2)
- Kiezdeutsch (2)
- Kindersprache (2)
- Klassifikation (2)
- Kognitiver Prozess (2)
- Koloniallinguistik (2)
- Kolonie (2)
- Komitativ <Kasus> (2)
- Kommunikationsverb (2)
- Kommunikativer Sinn (2)
- Komplementierer (2)
- Komposition (2)
- Konsonant (2)
- Konstruktion <Linguistik> (2)
- Kontrolle <Linguistik> (2)
- Konversationanalyse (2)
- Koordination <Linguistik> (2)
- KorAP (Korpusanalyseplattform der nächsten Generation) (2)
- Koreanisch (2)
- Korpus <Linguistik (2)
- Korpustechnologie (2)
- Kultur (2)
- Kulturpsychologie (2)
- Kulturvergleich (2)
- Language (2)
- Language Variation (2)
- Lautquantität (2)
- Lautwandel (2)
- Lehrmittel (2)
- Lernen (2)
- Lernsoftware (2)
- Lesen (2)
- Leseverhalten (2)
- Lexem (2)
- Lexicon (2)
- Lexikalisch funktionale Grammatik (2)
- Lexikgraphie (2)
- Lingua Franca (2)
- Linked Data (2)
- Literary corpus (2)
- Logdatei (2)
- Logische Semantik (2)
- Lower Sorbian (2)
- Lyrics <Lyrik> (2)
- MMAX (2)
- MTAS (2)
- Machine Leaming (2)
- Mandarin (2)
- Maschinelle Sprachverarbeitung (2)
- Meaning (2)
- Mediation (2)
- Medienlinguistik (2)
- Meinung (2)
- Meinungsverb (2)
- Mental verb constructions (2)
- Mikrostruktur (2)
- Mimik (2)
- Mobilität (2)
- Modalpartikel (2)
- Modeling (2)
- Morphology of the Folktale (2)
- Multikulturelle Gesellschaft (2)
- Multimedia (2)
- MySQL (2)
- Mündliche Kommunikation (2)
- NFDI (2)
- Narrative (2)
- Nationalsozialistische Verbrechen (2)
- Nationalsprache (2)
- Neologie (2)
- Neologimus (2)
- Neologisms (2)
- Neumelanesisch (2)
- Nicht-kanonisches Subjekt (2)
- Norwegen. Sameting (2)
- Norwegisch (2)
- Notation (2)
- NottDeuYTSch corpus (2)
- NottDeuYTSch-Korpus (2)
- OCR (2)
- OCR-Schrift (2)
- OWID (2)
- Online dictionary (2)
- Online-Medien (2)
- Online-Spiel (2)
- Opinion Inference (2)
- Optische Zeichenerkennung (2)
- Ortsadverb (2)
- Ortsname (2)
- Papua-Neuguinea (2)
- Paradigma (2)
- Parlament (2)
- Paronymie (2)
- Parsing (2)
- Part-of-Speech-Tagging (2)
- Part-of-Speech-Tagging = POS (2)
- Past interpretation (2)
- Perfekt (2)
- Perspektivierung (2)
- Phonem (2)
- Pitch contour (2)
- Pitch matching (2)
- Pokorny, Julius (2)
- Polarity classification (2)
- Politik (2)
- Politische Beteiligung (2)
- Preference organization (2)
- Pro-Drop-Parameter (2)
- Problem Solving Methods (2)
- Projection (2)
- Projektion <Psychologie> (2)
- Prose (2)
- Prosodic similarity (2)
- Prädikation (2)
- Psychologie (2)
- Psychologische Diagnostik (2)
- Psychverb (2)
- Python <Programmiersprache> (2)
- Qualitative Methode (2)
- Quantitative Linguistik (2)
- Reaktion (2)
- Rechtsfrage (2)
- Redeerwähnung (2)
- Redigieren (2)
- Referenz <Linguistik> (2)
- Register <Linguistik> (2)
- Relativsatz (2)
- Requests (2)
- Resources (2)
- Reuse (2)
- Rezeption (2)
- Rhetorik (2)
- Ripuarian (2)
- Romanische Sprachen (2)
- Russian (2)
- Russlanddeutsche (2)
- Rückmeldesignal (2)
- SGML (2)
- SKOS (2)
- Sapir-Whorf-Hypothese (2)
- Satzakzent (2)
- Satzanalyse (2)
- Satzverbindung (2)
- Schimpfwort (2)
- Schottland (2)
- Schriftstück (2)
- Schwerhörigkeit (2)
- Scottish Gaelic (2)
- SemEval (2)
- Semantics (2)
- Semantische Verbklasse (2)
- Sentiment Analyse (2)
- Sentiment analysis (2)
- Serbisch (2)
- Service provider (2)
- Shakespeare, William (2)
- Similarities (2)
- Slowakisch (2)
- Smile (2)
- Softwarewerkzeug (2)
- Sorbian institute (2)
- Sozialberuf (2)
- Soziale Software (2)
- Speech synthesis (2)
- Sport (2)
- Sprachentwicklung (2)
- Spracherhaltung (2)
- Sprachkonflikt (2)
- Sprachphilosophie (2)
- Sprachpurismus (2)
- Sprachstörung (2)
- Sprachvariation (2)
- Sprachvergleich (2)
- Sprachverlust (2)
- Sprecher (2)
- Sprechmaschine (2)
- Standard (2)
- Stereotypisierung (2)
- Stichprobenumfang (2)
- Studie (2)
- Subjekt <Linguistik> (2)
- Swedish (2)
- Syntagma (2)
- Tagging (2)
- Technischer Fortschritt (2)
- Telefonieren (2)
- Temporaladverb (2)
- Terminologiemanagement (2)
- Text (2)
- Text classification (2)
- Text-to-Speech (2)
- Textkohärenz (2)
- Textstruktur (2)
- Theater rehearsals (2)
- Thesaurus (2)
- Tok Pisin (2)
- Topikalisierung (2)
- Tote Sprachen (2)
- Tracy, Rosemarie (2)
- Treebanks (2)
- Trees/Graphs (2)
- Trump, Donald (2)
- Tupi-Guarani-Sprachen (2)
- Turn construction (2)
- Turn-beginnings (2)
- UGC (2)
- Understanding in interaction (2)
- Universal Dependencies (2)
- Universitätsbibliothek (2)
- Uralische Sprachen (2)
- Valenz (2)
- Validating (2)
- Verbalaggression (2)
- Verbbedeutung (2)
- Verbsemantik (2)
- Verhalten (2)
- Verwaltungssprache (2)
- Vielfalt (2)
- Vietnamese (2)
- Virtual Language Observatory (VLO) (2)
- Virtuelle Realität (2)
- Vollversammlung (2)
- Vorumaisch (2)
- Wahlverhalten (2)
- Warlpiri (2)
- Web (2)
- Westsamoa (2)
- Wiedervereinigung <Deutschland> (2)
- Wiktionary (2)
- Wirtschaft (2)
- Wissenschaftler (2)
- Wissenschaftliche Kooperation (2)
- Wissenschaftsforschung (2)
- Wissenspräsentation (2)
- Wissensvermittlung (2)
- Word formation (2)
- Word length (2)
- WordNet (2)
- Workplace studies (2)
- Wortart (2)
- Wortwahl (2)
- XQuery (2)
- Zeichensprache (2)
- Zeitungsartikel (2)
- Zipf's law (2)
- Zipf’s law (2)
- Zufriedenheit (2)
- Zusammenfassung (2)
- Zuschauer (2)
- Zweitsprache (2)
- Zweitspracherwerb (2)
- acceptability judgements (2)
- access structures (2)
- acoustic analysis (2)
- advanced search options (2)
- affect (2)
- agent prototypicality (2)
- annotation guidelines (2)
- annotation scheme (2)
- annotation tool (2)
- anonymisation (2)
- anotación de corpus (2)
- artificial intelligence (2)
- audiovisual data (2)
- automatic transcription (2)
- bibliographic metadata (2)
- bilingual dictionaries (2)
- blending (2)
- categorisation (2)
- clipping (2)
- closing (2)
- cmc corpora (2)
- co-presence (2)
- cognitive lexicography (2)
- collocation (2)
- collocation analysis (2)
- colonial language contact (2)
- colonial linguistics (2)
- computational linguistics (2)
- computer-assisted language learning (2)
- computer-mediated communication (CMC) (2)
- conditional connectives (2)
- conditionals (2)
- confusables (2)
- conjunction (2)
- constructional meaning (2)
- contrastive analysis (2)
- contrastive linguistics (2)
- controlled natural language (2)
- cooperation (2)
- coordination (2)
- copula (2)
- corpus curation (2)
- corpus frequency (2)
- corpus reusability (2)
- corpus semantics (2)
- corpus study (2)
- corpus-based lexicography (2)
- correlate (2)
- courses of action (2)
- culture (2)
- data (2)
- data analysis (2)
- data collection (2)
- data migration (2)
- data mining (2)
- data quality (2)
- data repositories (2)
- database (2)
- declarative (2)
- deduplication (2)
- dependency parsing (2)
- diachronic corpora (2)
- diachronic corpus linguistics (2)
- dialog (2)
- dictionaries (2)
- dictionary culture (2)
- dictionary design (2)
- dictionary writing system (2)
- disambiguation (2)
- discourse (2)
- discourse semantics (2)
- discrimination (2)
- distributional semantics (2)
- driving lessons (2)
- driving school (2)
- e-lexicography (2)
- easily confused words (2)
- eben (2)
- embodied responses (2)
- embodiment (2)
- ethics (2)
- ethnography (2)
- event-related brain potentials (2)
- expectation (2)
- experiencer (2)
- eye-movements (2)
- eyetracking (2)
- face (2)
- false friends (2)
- family language policy (2)
- feedback (2)
- fixation-related potentials (2)
- flagging (2)
- formulation (2)
- free variation (2)
- gaze (2)
- gender equality (2)
- gender-inclusive language (2)
- general assembly (2)
- general language dictionaries (2)
- general language dictionary (2)
- generalized divergence (2)
- generalized entropy (2)
- genre and register variation (2)
- geschriebene Sprache (2)
- gesture (2)
- glossaries (2)
- grammar (2)
- grammar and syntax (2)
- grammatical information system (2)
- grammatical terminology (2)
- grammatical variation (2)
- grammatische Terminologie (2)
- grammis (2)
- gratitude (2)
- helping relationship (2)
- higher education research (2)
- history of lexicography (2)
- history of phonetics (2)
- ideology (2)
- impact assessment (2)
- impact categories (2)
- imperative (2)
- informal interaction (2)
- interactional semantics (2)
- interaktionale Semanitik (2)
- international language (2)
- internet forums (2)
- internet lexicography (2)
- interpretation (2)
- intonation (2)
- it (2)
- it-clefts (2)
- kognitive Linguistik (2)
- kontextuelle Bedeutung (2)
- language comprehension (2)
- language functions (2)
- language ideology (2)
- language learners (2)
- language portal (2)
- language statistics (2)
- language status (2)
- language universals (2)
- late positivity (2)
- lay-lexicography (2)
- learner corpus (2)
- lexical borrowing (2)
- lexical database (2)
- lexical information system (2)
- lexical innovation (2)
- lexical richness (2)
- lexical semantics (2)
- lexicographic database (2)
- lexicography and war (2)
- lexicography equality (2)
- lexicology (2)
- liability (2)
- linguistic data (2)
- linguistic variation (2)
- loanword lexicography (2)
- log files (2)
- long-term archival (2)
- longitudinal study (2)
- machine learning methods (2)
- mechanical speech synthesis (2)
- membership categorization (2)
- metaphor (2)
- methods (2)
- microstructure (2)
- minority languages (2)
- missionary linguistics (2)
- modal verbs (2)
- morphological analysis (2)
- motion verbs (2)
- multi-level annotation (2)
- multi-party dialog (2)
- multiparty interaction (2)
- mysql (2)
- n-grams (2)
- negation (2)
- neological lexicography (2)
- neologism dictionaries (2)
- newsmark (2)
- non-native speech (2)
- norms (2)
- noun–pronoun ratio (2)
- ob <Wort> (2)
- online language (2)
- opinion frames (2)
- opinion mining (2)
- optimality theory (2)
- ordinary conversation (2)
- organized helping (2)
- orthography (2)
- overlapping talk (2)
- parallel corpora (2)
- parser adaptation (2)
- part-of-speech (POS) (2)
- part-of-speech ontology (2)
- participation framework (2)
- pedagogical lexicography (2)
- perception (2)
- perception experiment (2)
- phonemic representation (2)
- phonological grammar (2)
- phonology (2)
- plurale tantum (2)
- pluricentric (2)
- policy diffusion (2)
- politischer Diskurs (2)
- polysemy (2)
- power law (2)
- practice (2)
- precision (2)
- predictability (2)
- predictive coding (2)
- predictive processing (2)
- prepositions (2)
- priming (2)
- privacy (2)
- pro-drop (2)
- problem-solving approach (2)
- production (2)
- professional lexicography (2)
- pronoun resolution (2)
- pseudo-coordination (2)
- public space (2)
- punctual verb (2)
- query (2)
- read speech (2)
- reference corpora (2)
- register variation (2)
- relationship (2)
- relationship management (2)
- relationships (2)
- repair (2)
- representativeness (2)
- response tokens (2)
- responsive action (2)
- sample size (2)
- scalability (2)
- second language acquisition (2)
- second language learning (2)
- semantic role labeling (2)
- sense discrimination (2)
- sentence processing (2)
- serif (2)
- shortening (2)
- showing sequences (2)
- sitzen <Wort> (2)
- smartphones (2)
- social action (2)
- software quality management (2)
- specialized lexicography (2)
- specification (2)
- specificational clause (2)
- speech corpus (2)
- speech planning (2)
- speech production (2)
- spoken Czech (2)
- standard (2)
- standards (2)
- stehen <Wort> (2)
- stereotyping (2)
- stops (2)
- subjectivity (2)
- subjunctive (2)
- synonyms (2)
- syntactic complexity (2)
- tagging (2)
- talk-in-interaction (2)
- task-oriented dialogue (2)
- text classification (2)
- text corpus (2)
- text length (2)
- text mining (2)
- text production (2)
- theater (2)
- transnational communication (2)
- tun (2)
- turn taking (2)
- type token ratio (2)
- typology (2)
- understudied languages (2)
- usage-based model (2)
- user guidance (2)
- user interface (2)
- valency (2)
- variation (2)
- verbs (2)
- virtual collections (2)
- visualisation (2)
- visualization (2)
- vocabulary size (2)
- von Kempelen, Wolfgang (2)
- wenn (2)
- wiktionary (2)
- word predictability (2)
- word senses (2)
- workplace studies (2)
- written language (2)
- Älterer Mensch (2)
- Ästhetik (2)
- Öffentlicher Raum (2)
- Österreich (2)
- Übung (2)
- (discrepancy of) expectation (1)
- (enhanced) webcomics (1)
- (multimodal) instructions (1)
- (multimodale) Instruktionen (1)
- (re-)openings (1)
- (un)certainty (1)
- 0nline dictionary (1)
- 1/3 power law (1)
- 19th Century (1)
- 2008 (1)
- 3-Circle-Model (1)
- ASD (1)
- Ablaut (1)
- Absentiv (1)
- Abstractness (1)
- AcI (1)
- Access Control (1)
- Acquisition (1)
- Action formation (1)
- Active Learning (1)
- Active learning (1)
- Additional Language of Society (1)
- Adjazenz (1)
- Adjective (1)
- Adposition (1)
- Adressat (1)
- Adressatenzuschnitt (1)
- Adverbial Noun Phrases (AdvNps) (1)
- Adverbiale (1)
- Aerodynamik (1)
- Affekt (1)
- Affirmative (1)
- Affix (1)
- Affixoid (1)
- African languages (1)
- African languages dictionaries (1)
- Afrikaans (1)
- Afrikatale (1)
- Agreement <Syntax> (1)
- Agumentation (1)
- Aichinger, Ilse (1)
- Akademischer Grad (1)
- Akkadisch (1)
- Akkulturation (1)
- Akronym (1)
- Akteur (1)
- Akustische Analyse (1)
- Akzeptabilität (1)
- Allgemeinwissen (1)
- Allomorphy (1)
- Alltag (1)
- Alltagsgespräche (1)
- Alltagssprache (1)
- Altchinesisch (1)
- Altenbild (1)
- Alter (1)
- Altertumswissenschaft (1)
- Altgriechisch (1)
- Altägyptisch (1)
- Alveolar (1)
- Amazonia (1)
- Amazonian languages (1)
- American politics (1)
- Amerikanisches Englisch (1)
- Anapher (1)
- Anapher <Rhetorik> (1)
- Ancient Greek (1)
- Ancient Greek language (1)
- Ancient Greek scholarship (1)
- Angst (1)
- Annotation guidelines (1)
- Annotation of causal language (1)
- Annotation of discourse relations (DRs) (1)
- Annotations (1)
- Annotator Agreement (1)
- Anspielung (1)
- Antezedens <Linguistik> (1)
- Antizipation (1)
- Antwortpartikel (1)
- Antwortrelationen (1)
- Antwortstrukturen (1)
- Anweisung (1)
- Anwendung (1)
- Anwendungsbereich (1)
- Anwesenheit (1)
- Anxiety (1)
- Arbeitsbündnis (1)
- Arbeitsplatz (1)
- Arbeitsstudie (1)
- Architectures (1)
- Architektur (1)
- Areallinguistik <Typologie> (1)
- Argument (1)
- Argument structure (1)
- Argumentrealisierung (1)
- Arizona (1)
- Articulatory settings (1)
- Arzt-Patient-Interaktion (1)
- Asian Americans (1)
- Aspect (1)
- Assertion (1)
- Assimilation <Soziologie> (1)
- Assistance (1)
- Assoziationsexperiment (1)
- Assoziationsmaß (1)
- Astrolabe-Bay (1)
- Attribution (1)
- Audio-video Synchronisation (1)
- Auffforderung (1)
- Aufforderungssatz (1)
- Aufsatzsammlung (1)
- Aushandlung (1)
- Auskunftsanspruch (1)
- Auslassung (1)
- Auslaut (1)
- Ausrichten <Technik> (1)
- Austausch (1)
- Authentische Ressourcen (1)
- Authentizität (1)
- Autismus (1)
- Autochthon (1)
- Autocorrelated errors (1)
- Autokorrelation (1)
- Automated information (1)
- Automatisch (1)
- Automatische Indexierung (1)
- Automatische Klassifikation (1)
- Automatische Lauterkennung (1)
- Automatische Lautidentifizierung (1)
- Automatische Sprachanalyse; (1)
- Automatische Worterkennung (1)
- Automobil <Personenkraftwagen> (1)
- Autonomie (1)
- Autorin (1)
- Autorschaft (1)
- Außenpolitik (1)
- BERT (1)
- Bairisch (1)
- Balkansprachen (1)
- Baltic states (1)
- Bangante Sprache (1)
- Bantu morphology (1)
- Barack Obama (1)
- Bartmiński, Jerzy (1)
- Baskisch (1)
- Basnage de Beauval (1)
- Basque language (1)
- Bautzen (1)
- Bayesian inference (1)
- Bearbeitung von Korpusanfragen (1)
- Bedeutungserweiterung (1)
- Bedeutungsvielfalft (1)
- Bedienungsanleitung (1)
- Bedrohte Sprache (1)
- Begriffsgeschichte <Fach> (1)
- Beispiel (1)
- Benennung (1)
- Benin (1)
- Benin (West Africa) (1)
- Benutzerfreundlichkeit (1)
- Benutzerführung (1)
- Benutzung (1)
- Benutzungsforschung (1)
- Beratung (1)
- Berufsbezeichnung (1)
- Beschreibung (1)
- Beschuldigung (1)
- Beschwerdebrief (1)
- Best-Practice (1)
- Beteiligung (1)
- Betrieb (1)
- Bewertung (1)
- Bibel. Altes Testament (1)
- Bibliographie 1960-1985 (1)
- Bibliography (1)
- Big Two (1)
- Bildungswesen (1)
- Bilingualised dictionary (1)
- Bilingualismus (1)
- Bittbrief (1)
- Blended learning (1)
- Blick (1)
- Blickregistrierung (1)
- Blickverhalten (1)
- Blindheit (1)
- Bologna Process (1)
- Bootstrapping methods (1)
- Borrowing (1)
- Bosnian (1)
- Bosnisch (1)
- Brazilian Portuguese dictionaries (1)
- Brettspiel (1)
- British National Corpus (1)
- British twenty first century lexicography (1)
- Brown clustering (1)
- Buchstabe (1)
- Buchstabenhäufigkeit (1)
- Burgenland (1)
- C++ (1)
- CAQDAS (1)
- CART (1)
- CLARIN Knowledge Sharing Infrastructure (1)
- CLARIN Legal Issues Committee (CLIC) (1)
- CLARIN infrastructure (1)
- CMC (International Conference on Cooperative Multimodal Communication) <2023, Mannheim> (1)
- CMC Corpora (1)
- CMC corpora (1)
- CMC corpus (1)
- CMDI experiences (1)
- CMDI infrastructure use (1)
- CMDI metadata (1)
- CMDI profile creation (1)
- CNL (1)
- COVID-19 discourse (1)
- CQLF (1)
- CSC (1)
- CTS (1)
- Canonical text services (1)
- Carl Friedrich Aichinger (1)
- Ceteris paribus laws (1)
- Chadic (1)
- Change (1)
- China (1)
- Chirurgie (1)
- Christian Ludwig (1)
- Chunking (1)
- Cinie Louw (1)
- Citizen Science (1)
- Clarín (1)
- Clusters (1)
- Co-Reference (1)
- CoMParS (1)
- CoRDI 2023 (1)
- Code (1)
- Codierung (1)
- Cognitive Bootstrapping (1)
- Cognitive artefacts (1)
- Collocation analysis (1)
- Collocations (1)
- Comic (1)
- Comitative Construction (1)
- Comitative Preposition (1)
- Comitative case (1)
- Common ground (1)
- Communicative Functions (1)
- Communion (1)
- Community theatre (1)
- Comparable Corpus (1)
- Comparable corpora (1)
- Comparison of representations and representational formats (1)
- Competence Theories (1)
- Complexity theory (1)
- Component Metadata Description Infrastructure (1)
- Composition (1)
- Compositional Semantics (1)
- Computational lexicography (1)
- Computationelle Semantik (1)
- Computer-Assisted Language Learning (CALL) (1)
- Computerprogramm (1)
- Computerunterstützte Übersetzung (1)
- Computervermittelte Kommunikation (1)
- Computing in the Humanities (1)
- Conceptual metaphor (1)
- Concurrency (1)
- Concurrent markup (1)
- Consonant (1)
- Construction Grammar (1)
- Consultation behavior (1)
- Context (1)
- Contextual meaning (1)
- Contrary and complementary opposites (1)
- Contrast (1)
- Contrastive linguistics (1)
- Controlled Natural Language (CNL) (1)
- Conversational Feedback (1)
- Conversational analysis (1)
- Conversational rhetoric (1)
- Coordination (1)
- Coreference (1)
- Corpora (Linguistics) (1)
- Corporate Identity (1)
- Corpus (1)
- Corpus Analysis (1)
- Corpus Comparison (1)
- Corpus Management (1)
- Corpus Tools (1)
- Corpus query platform (1)
- Corpus-based retrieval (1)
- Corruption (1)
- Couplet (1)
- Covariation (1)
- Creole languages (1)
- Croatian (1)
- Cross references (1)
- Cross-cultural psychology (1)
- Cross-linguistic conversation analysis (1)
- Crowdsourcing (1)
- Cultural metric (1)
- Cyber-Mobbing (1)
- Cyrillic (1)
- DARIAH-DE (1)
- DKPro repository (1)
- DMPTY (1)
- DO-cleft (1)
- DRs in spoken and written genres (1)
- DRuKoLA (1)
- DSSSL (1)
- Dagestan (1)
- Darmstadt Knowledge Processing Software Repository (1)
- Darstellungsart (1)
- Data Architecture (1)
- Data Augmentation (1)
- Data Formats (1)
- Data Governance Act (1)
- Data Innovation Board (1)
- Data Science (1)
- Data Vizualization (1)
- Data altruism (1)
- Data mining (1)
- Database Management Systems (1)
- Dateiformat (1)
- Datenaufbereitung (1)
- Datenaustausch (1)
- Datenbank für Gesprochenes Deutsch (DGD) (1)
- Datenbank für gesprochenes Deutsch = DGD (1)
- Datenerfassung (1)
- Datenkompetenz (1)
- Datenkonvertierung (1)
- Deep learning (1)
- Deeutschamerikaner (1)
- Definitheit (1)
- Definitionen (1)
- Deliberation (1)
- Demonstrativpronomen (1)
- Dependency Parsing (1)
- Dependenz (1)
- Depression (1)
- Derivation (1)
- Derivation <Linguistik> (1)
- Determinator (1)
- Deutsche Gebärdensprache (DGS) (1)
- Deutsches Referenzkorpus zur internetbasierten Kommunikation (DeRiK) (1)
- Deutsches Spracharchiv (1)
- Deutschland (DDR) (1)
- Deutschland (Westliche Länder) (1)
- Deutschland <DDR> (1)
- Developmental Robotics (1)
- Devolution (1)
- Diachronie (1)
- Dialekt (1)
- Dialektgeografie (1)
- Dialogue (1)
- Diccionario de la lengua Española (Madrid) (1)
- Diccionario histórico de la lengua española (1)
- Dichtersprache (1)
- Dictionary and text analysis (1)
- Dictionary editing software (1)
- Dictionary encoding (1)
- Dictionary use strategies (1)
- Dictionnaire universel (1)
- Die Sprach-Checker (1)
- Differential object marking (1)
- Differenzielle Objektmarkierung (1)
- Digital Humanities Studium (1)
- Digital Library (1)
- Digital lexical systems (1)
- Digitale Daten (1)
- Digitale Forschungsinfrastruktur (1)
- Digitale Geisteswissenschaften (1)
- Digitale Revolution (1)
- Digitaler Sprachassistent (1)
- Digitales Wörterbuch der deutschen Sprache (DWDS) (1)
- Diminutiv (1)
- Direct Speech (1)
- Direct speech (1)
- Directive 95/46/EC (1)
- Directive on Copyright in the Digital Single Market (1)
- Directive particles (1)
- Disambiguation (1)
- Discourse Representation Theory (1)
- Discourse analysis (1)
- Discourse annotation (1)
- Discourse parsing (1)
- Discourse relations (1)
- Diskursivität (1)
- Diskurstheorie (1)
- Diskurstopik (1)
- Dispositiv (1)
- Distance learning (1)
- Distributional semantics (1)
- Distributionsidiosynkrasie (1)
- Document Classification (1)
- Document Images (1)
- Document structure (1)
- Documentation (1)
- Dokument (1)
- Dokumentenverarbeitung (1)
- Dokumentverarbeitung (1)
- Dolmetschen (1)
- Domain-specific Relation Extraction (1)
- Dominanz (1)
- Donald Trump (1)
- Double verb constructions (1)
- Dublin Core (1)
- Duits (1)
- Dutch (1)
- Dynamische Psychotherapie (1)
- Dzongkha (1)
- Dänisch (1)
- E-dictionary (1)
- E-lexicography (1)
- EEG (1)
- ELEXIS (1)
- EMLex (1)
- EOSC (1)
- EURALEX (20 : 2022 : Mannheim) (1)
- EURALEX International Congress (1)
- Early New High German (ENHG) (1)
- Early responses (1)
- Edition (1)
- Editor (1)
- Educational software (1)
- Effects (1)
- Effizienz (1)
- Egozentrismus (1)
- Ehe (1)
- Eigengruppe (1)
- Eigentum (1)
- Eigentumsrecht (1)
- Einführung (1)
- Einwanderung (1)
- Ejektiv (1)
- Electronic Lexicography (1)
- Electronic dictionaries (1)
- Electronic dictionary (1)
- Elektronisches Buch (1)
- Elizabeth Weir (1)
- Ellipse (1)
- Eltern (1)
- Embodiment (1)
- Emergence (1)
- Empfehlung (1)
- Empfindung (1)
- Empirical database (1)
- Endlicher Zustandsraum (1)
- Englisch als Lingua Franca-Interaktionen (1)
- Englischunterricht (1)
- English lingua franca interactions (1)
- English monolingual learner’s dictionaries (1)
- Entwicklungspsychologie (1)
- Epistemizität (1)
- Ereignis (1)
- Ereignisdatenanalyse (1)
- Ereigniskorreliertes Potenzial (1)
- Ereignissemantik (1)
- Erlebte Rede (1)
- Error analysis (1)
- Error classification (1)
- Erwachsenenbildung (1)
- Erzählen (1)
- Erzählstruktur (1)
- Erzähltheorie (1)
- Erzählung (1)
- Ethnische Gruppe (1)
- EuReCo (1)
- Europarat (1)
- European Americans (1)
- European Association for Lexicography (1)
- European Reference Corpus (EuReCo) (1)
- European Strategy for Data (1)
- Europeanisms (1)
- Europäische Kommission. Digital Single Market (1)
- Europäische Sprachen (1)
- Europäische Union: Datenschutz-Grundverordnung (1)
- Event mapping (1)
- Events (1)
- Evidentialität (1)
- Evolution (1)
- Evoziertes Potenzial (1)
- Experte (1)
- Expertenmeinung (1)
- Explanation (1)
- Expletiv (1)
- Expressionismus (1)
- FAIR (1)
- FAIR Index (1)
- FAIR data (1)
- FML (1)
- FO prediction (1)
- FSR (1)
- Fachwissen (1)
- Fahrunterricht (1)
- Fair Use (1)
- Fantasiespiel (1)
- Faroese (1)
- Feature engineering (1)
- Federated Content Search (FCS) (1)
- Feedback (1)
- Feedback marker (1)
- Fehler (1)
- Feldpost (1)
- Fernsehduell (1)
- Fernsehen (1)
- Fernsehinterview (1)
- Fernsehsprache (1)
- Fernunterricht (1)
- Fiktion (1)
- Film (1)
- Filmkritik (1)
- Finalsatz (1)
- Finnic minorities of Ingria (1)
- Fokus (1)
- Food Domain (1)
- Food domain (1)
- Footing Shifts (1)
- Formal learning (1)
- Formalization (1)
- Formulation (1)
- Forschungs- und Lehrkorpus Gesprochenes Deutsch (FOLK) (1)
- Forschungs- und Lehrkorpus Gesprochenes Deutsch = FOLK (1)
- Forschungsbericht (1)
- Forschungsprozess (1)
- Fortschrittlichkeit (1)
- Forum Deutsche Sprache (1)
- Fotografie (1)
- Fracto-morphèmes (1)
- Fragen (1)
- Fragment (1)
- France (1)
- Frankreich (1)
- Frau (1)
- Frauenforschung (1)
- Frauensport (1)
- Freiheit (1)
- Fremd-initiierte Reparaturen (1)
- Fremdgruppe (1)
- Fremdwort (1)
- French-German (1)
- Frequenzanalyse (1)
- Frisian (1)
- Frisian Act (1)
- Frühneuhochdeutsche Wörterbuch (1)
- Frühneuhochdeutsches Wörterbuch (1)
- Funktionale Kategorie (1)
- Funktionelle Kernspintomografie (1)
- Fußballsprache (1)
- Förderung (1)
- Führungskraft (1)
- GB-Theorie (1)
- GDE-V (1)
- GIS (1)
- GOLD standard (1)
- GUI (1)
- Gamification (1)
- Gangsta-Rap (1)
- Gastarbeiterdeutsch (1)
- Gebrauchsstandard (1)
- Gefühlsausdruck (1)
- Gemeinschaft (1)
- Gender (1)
- Gender egalitarianism (1)
- Gender stereotypes (1)
- General Data Protection Regulation (GDPR) (1)
- Generation (1)
- Generative Syntax (1)
- Generative Transformationsgrammatik (1)
- Generic Document Structure (1)
- Generic Search (GS) (1)
- GeoBib (1)
- GeoHumantities (1)
- Geopolitik (1)
- Georgian language (1)
- Georgisch (1)
- Geriatrie (1)
- Germaans (1)
- German Language Atlas (1)
- German Microcensus (1)
- German Reference Corpus (DeReKo) (1)
- German Verschmelzungsformen (contracted forms) (1)
- German clause structure (1)
- German clause-embedding predicates (1)
- German colonialism (1)
- German data (1)
- German definitions on garments (1)
- German dialects (1)
- German grammar (1)
- German interrogative embedding predicates (1)
- German mission society (1)
- German phraseological patterns (1)
- German reference corpus (1)
- German spoken language (1)
- German vowels (1)
- German, Italian, Spanish (1)
- German-American relation (1)
- German-Canadian (1)
- German-Italian (1)
- Geschichte (1)
- Geschichte 1700-1800 (1)
- Geschichte 1945-1955 (1)
- Geschichte 1989-1990 (1)
- Geschichte 1995-1999 (1)
- Geschichte <1700-1900> (1)
- Geschichte <1884-1914> (1)
- Geschichte <1989-1990> (1)
- Geschichte <1989-1994> (1)
- Geschichtskarte (1)
- Geschlecht (1)
- Geschlechtsidentität (1)
- Gesellschaftsleben (1)
- Gesicht (1)
- Gesprochenes Deutsch (1)
- Gesprächsführung (1)
- Gestural matching (1)
- Gestures (1)
- Gesundheit (1)
- Gigafida 2.1 corpus (1)
- Gitksan-Sprache (1)
- Goodwin, Charles W. (1)
- Google Ngram (1)
- Google Translate (1)
- Gospel <Musik> (1)
- Governance (1)
- Gradability (1)
- Grafische Darstellung (1)
- Grammatical Categories (1)
- Grammaticalization (1)
- Grammatikalisierung (1)
- Grammatikografie (1)
- Graph (1)
- Graph cluster (1)
- Graphdatenbank (1)
- Graphem (1)
- Graphemik (1)
- Graphische Benutzeroberfläche (1)
- Graphisches Symbol (1)
- Grasland-Bantu <Sprachfamilie> (1)
- Grasland-Bantu-Sprachen (1)
- Gravity's Rainbow (1)
- Greek Sign Language (1)
- Grewendorf, Günther (1)
- Grundschule (1)
- Guided self-help (1)
- Gälische Sprachen (1)
- HTML (Hypertext Markup Language) (1)
- Haltung (1)
- Handgeste (1)
- Handlung <Literatur> (1)
- Handlungskonstitution (1)
- Handlungstheoretische Semantik (1)
- Handschrift (1)
- Hass (1)
- Hassrede (1)
- Hausa (1)
- Head-Driven Phrase Structure Grammar (HPSG) (1)
- Heroismus (1)
- Hester Piozzi (1)
- Hethitisch (1)
- Hierarchical modeling (1)
- Hieroglyphe (1)
- Higher Education (1)
- Higher education (1)
- Hilfesystem (1)
- Hip-Hop (1)
- Historical Corpora (1)
- Historical Maps (1)
- Historische Korpora (1)
- Historische Lexikographie (1)
- Historische Syntax (1)
- Historsche Sprachsynthese (1)
- History of lexicography (1)
- Hochschulpolitik (1)
- Holocaust (1)
- Home environment (1)
- Homographie (1)
- Homonym (1)
- Human Robot Interaction (HRI) (1)
- Humanities (1)
- Humor (1)
- Hungarian (1)
- Hyperkorrektur (1)
- Hyperlink (1)
- Häufigkeitsverteilung (1)
- Hören (1)
- Hörverlust (1)
- ICC corpus (1)
- ICE corpus (1)
- IDS (1)
- IP Rights (1)
- ISO/TC 37/SC 4 (1)
- ISO/TEI (1)
- ISOcat (1)
- ISOcat registry (1)
- IT infrastructure (1)
- IVK-Ler corpus of German (1)
- Illustration (1)
- Imageloss Compensation (1)
- Immigrants (1)
- Imperative (1)
- Impersonale (1)
- Implicit attitudes (1)
- Implikatur (1)
- Impression formation (1)
- Improvisation (1)
- Inchoativ (1)
- Inclusive lexicography (1)
- Indefinite pronoun (1)
- Indefinitpronomen (1)
- Index (1)
- Index Generation (1)
- Indexierung <Inhaltserschließung> (1)
- Indikativ (1)
- Indikator (1)
- Indirekte Anapher (1)
- Individual differences (1)
- Infinitiv (1)
- Infinitivkonstruktion (1)
- Inflectional morphology (1)
- Informatik (1)
- Information (1)
- Information-Retrieval-System (1)
- Informationsgehalt (1)
- Informationsintegration (1)
- Informationsmodellierung (1)
- Innovation (1)
- Inspektionssequenzen (1)
- Institut für Corpuslinguistik und Texttechnologie (ICLTT) (1)
- Instructions (1)
- Instruktionen (1)
- Integer Linear Program (1)
- Intelligence (1)
- Intensität <Phonetik> (1)
- Intensivierung (1)
- Interaction (1)
- Interactional Semantics (1)
- Interactional semantics (1)
- Interactional sociolinguistics (1)
- Interaktionales Projekt (1)
- Interaktiv (1)
- Interaktive Medien (1)
- Interdisciplinarity (1)
- Interfacedesign (1)
- Intermedialität (1)
- Internal and external coherence (1)
- International Conference on Conversation Analysis (ICCA) (1)
- International Conference on Language Resources and Evaluation (12. : 2020 : Marseille) (1)
- International Contrastive Linguistics Conference (1)
- International Corpus of English (1)
- International Society of Conversation Analysis (ISCA) (1)
- Internationale Migration (1)
- Internationales Urheberrecht (1)
- Internationalismus (1)
- Internetdictionary (1)
- Internetforum (1)
- Internetportal (1)
- Internetwörterbuch (1)
- Interoperability of annotation schemes (1)
- Interozeption (1)
- Interpretative Semantik (1)
- Interrelated document grammars (1)
- Interrogativlogik (1)
- Interrogativpronomen (1)
- Interrogativsatz (1)
- Intersektionalität (1)
- Intersubjectivity (1)
- Intertextuality (1)
- Intertextualität (1)
- Intonation (1)
- Inuktitut (1)
- Invariance (1)
- Inversion (1)
- Irisch (1)
- Isolationism (1)
- Italian (1)
- Japanese (1)
- Japanese controlled language (1)
- Java (1)
- Jesuiten (1)
- Johann Andreas Schmeller (1)
- Joint digital storytelling (1)
- Jost Trier (1)
- Journalismus (1)
- Jueju (1)
- Jugendkultur (1)
- Junktion (1)
- Kanada (1)
- Kanji (1)
- Karl Duncker (1)
- Kaukasus (Süd) (1)
- Kind / Sprachentwicklung (1)
- Kinder (1)
- Kindergarten (1)
- Kinderspiel (1)
- Kindesmisshandlung (1)
- Kirchensprache (1)
- Klient (1)
- Klima (1)
- Knowledge Acquisition (1)
- Knowledge Graph (1)
- Knowledge Level Descriptions (1)
- Knowledge Map (1)
- Kochbuch (1)
- Kognitionswissenschaft (1)
- Kognitive Entwicklung (1)
- Kognitivie Linguistik (1)
- Kollaborative Filterung (1)
- Komitative Präposition (1)
- Komitativkonstruktion (1)
- Kommentar (1)
- Kommunikationsforschung (1)
- Kommunikationstechnik (1)
- Kommunikationsverhalten (1)
- Kommunikative Kompetenz (1)
- Kompensation (1)
- Komplement (1)
- Komplement <Linguistik> (1)
- Komplexität (1)
- Kompositinalität (1)
- Kompositionelle Semantik (1)
- Kompositum <Wortbildung> (1)
- Konfigurationsmanagement (1)
- Kongruenz <Linguistik> (1)
- Konjunktiv (1)
- Konkordanz (1)
- Kontamination <Wortbildung> (1)
- Kontext (1)
- Kontextanalyse (1)
- Kontingenzen (1)
- Kontrastive Phonetik (1)
- Kontrastive Phraseologie (1)
- Kontrastive Semantik (1)
- Konvention (1)
- Konversationsanalysse (1)
- Koordination (1)
- Korpora (1)
- Korpusannotation (1)
- Korpusaufbereitung (1)
- Korpusmanagement (1)
- Korpusvergleich (1)
- Korrelationsanalyse (1)
- Kosraean (1)
- Krankenschwester (1)
- Kratzenstein, Christian Gottlieb (1)
- Kreativität (1)
- Kriminalität (1)
- Kritik (1)
- Kritische Diskursanalyse (1)
- Kulturelle Vielfalt (1)
- Kulturerbe (1)
- Kulturgeschichte (1)
- Kulturrelativismus (1)
- Kulturwandel (1)
- Kulturwissenschaften (1)
- Kurdisch (1)
- Kurzwort (1)
- Kuturvergleich (1)
- Kymrisch (1)
- Kyrillische Schrift <Druckschrift> (1)
- L1 error correcttion (1)
- L2 Russian (1)
- L2 effects (1)
- LFG (1)
- LIVE-Data (1)
- LMF (1)
- LR infrastructures and architectures (1)
- LRTwiki (1)
- LSP dictionaries (1)
- Labeling approach (1)
- Labial (1)
- Lachen (1)
- Lafourche Basin (1)
- Lafourche Parish (1)
- Laie (1)
- Language Policy (1)
- Language attitudes (1)
- Language biographies (1)
- Language concept (1)
- Language contact (1)
- Language laws (1)
- Language resources (1)
- Language statistics (1)
- Language technology (1)
- Languages in education (1)
- Langzeitarchierung (1)
- Large Classes (1)
- Large Corpora (1)
- Large Language Models (1)
- Laryngal (1)
- Lateinunterricht (1)
- Latin Americans (1)
- Latin grammar (1)
- Latin morphology (1)
- Latin syntax (1)
- Latvian (1)
- Latvian as a medium of instruction (1)
- Latvian as second language (1)
- Learner’s lexicography (1)
- Lebenslauf (1)
- Lehnwortportal Deutsch (LWPD) (1)
- Lehnwörter (1)
- Leibliche Displays (1)
- Leibniz-Zentrum Allgemeine Sprachwissenschaft (1)
- Leichte Sprache (1)
- Lelxikographie (1)
- Lemmata (1)
- Lemmatisierung (1)
- Lernhilfe (1)
- Lerntheorie (1)
- Lesbarkeit (1)
- Lesekompetenz (1)
- Leseverstehen (1)
- Let's Play (1)
- Let's Plays (1)
- Lettischunterricht (1)
- Let’s Play (1)
- Levelled Study Corpus of Russian (LeStCoR) (1)
- LexMeta (1)
- Lexical Database (1)
- Lexical Functional Grammar (LFG) (1)
- Lexical Semantics (1)
- Lexical functional grammar (1)
- Lexical resources metadata (1)
- Lexical semantics (1)
- Lexicographically interpreted information (1)
- Lexicography (1)
- Lexikalische Analyse (1)
- Lexikalisierung (1)
- Lexikon <Psycholinguistik> (1)
- Lexonomy (1)
- License (1)
- Light verbs (1)
- Likelihood-Quotienten-Test (1)
- Linguistic Category Model (1)
- Linguistic Relativity (1)
- Linguistic Retrieval (1)
- Linguistic annotation (1)
- Linguistic annotations (1)
- Linguistic processing (1)
- Linguistically informed feature engineering (1)
- Linked Open Data (1)
- Linking-Regel (1)
- Linksverzweigende Konstruktion (1)
- Litauen (1)
- Litauisch (1)
- Literatur (1)
- Literaturauswertung (1)
- Literaturdatenbank (1)
- Literature (1)
- Literaturunterricht (1)
- Literaturverwaltung (1)
- Literaturwissenschaft (1)
- Lithuanian (1)
- Livevideostream (1)
- Lizenz (1)
- Lizenzierung (1)
- Lizenzvergabe (1)
- Lizenzvertrag (1)
- Local and global effectiveness (1)
- Logical Document Structure (1)
- Logische Partikel (1)
- Logit-Modell (1)
- Lokalisation (1)
- Lokalismus (1)
- Long-Term Archiving (1)
- Lorraine (1)
- Lothringen (1)
- Louisiana French (1)
- Low German (1)
- Luxembourg (1)
- MARC 21 (1)
- META-SHARE (1)
- MLP (1)
- MLSA (1)
- Machine Learning (1)
- Machine Learning Algorithms (1)
- Machine Leraning (1)
- Machine Translation (1)
- Machine learning (1)
- Machine translating (1)
- Magnetoencephalographie (1)
- Makrostruktur (1)
- Malaga (1)
- Mann, Thomas (1)
- Manner of articulation (1)
- Mannheim-Neckarstadt (West) (1)
- Map Task (1)
- Margrethe Thiele (1)
- Markup Languages (1)
- Material objects (1)
- Mathematik (1)
- Matomo (1)
- Maya-Sprachen (1)
- Mean reciprocal rank (1)
- Mechanismus der Menschlichen Sprache (1)
- Mediale Durchformung (1)
- Medialität (1)
- Mediatisierung (1)
- Mediendiskurse (1)
- Medieninteraktion (1)
- Medienkompetenz (1)
- Medienkonsum (1)
- Medienpraktiken (1)
- Mehrheit (1)
- Meinungsfreiheit (1)
- Mental Lexicon (1)
- Mentalität (1)
- Menzerath (1)
- Menzerath's Law (1)
- Menzerathsches Gesetz (1)
- Merkmal (1)
- Meta Modeling (1)
- Metadata Management (1)
- Metakommunikation (1)
- Metalexicography (1)
- Metalinguistik (1)
- Metasprache (1)
- Methodik (1)
- Methods (1)
- Middle High German (MHG) (1)
- Migrant (1)
- Migrationshintergrund (1)
- Minimalist program <Linguistik> (1)
- Mission (1)
- Missionsgesellschaft (1)
- Missverständnis (1)
- Mitarbeit (1)
- Mitschrift (1)
- Modaladverb (1)
- Modaler Infinitiv (1)
- Modelltheoretische Semantik (1)
- Modern Icelandic (1)
- Modifikation <Linguistik> (1)
- Monitor corpus (1)
- Montague-Grammatik (1)
- Morality in interaction (1)
- Moralität (1)
- Morph Moulder (MoMo) (1)
- Morphemik (1)
- Morphologie<Linguistik> (1)
- Morphonologie (1)
- Morphophonologie (1)
- Multi- Word Patterns (1)
- Multi-Strategy Learning (1)
- Multi-layer Annotation (1)
- Multi-modality (1)
- Multilingual Corpus (1)
- Multilingual corpora (1)
- Multilingual corpus (1)
- Multilingual dictionary (1)
- Multilingual lexicography (1)
- Multilingualismus (1)
- Multimodal (1)
- Multimodal interaction (1)
- Multimodale Analyse (1)
- Multimodale Interaktion (1)
- Multimodality (1)
- Multinomial modeling (1)
- Multiple annotations (1)
- Mundart Schwäbisch <Kaukasus> (1)
- N-Gram (1)
- N-N compound (1)
- N-gram modeling (1)
- NFDI section (1)
- NLP pipeline (1)
- NPI (1)
- NZSL Share (1)
- NaLiDa (1)
- Nachfeld (1)
- Namenforschung (1)
- Namenkunde (1)
- Namespaces (1)
- Naming (1)
- Narrative Interaktion (1)
- Narrativität (1)
- Nasal (1)
- Nationalbewusstsein (1)
- Nationale Forschungsdateninfrastruktur (NFDI) e.V. (1)
- Nationalismus (1)
- Nationalitätenpolitik (1)
- Natural Language Processing (NLP) (1)
- Near synonymy (1)
- Nebensatz (1)
- Neg-raising (1)
- Negationsanhebung (1)
- Negationen (1)
- Negotiation (1)
- Neighbour classifier (1)
- NeoRate (1)
- Neologismenwörterbuch (1)
- Netzwerk (1)
- Neugriechisch (1)
- Neurolinguistisches Programmieren (1)
- Neurologie (1)
- Neuseeland (1)
- Neutralisation <Linguistik> (1)
- New Guinea (1)
- New Zealand Sign Language (NZSL) (1)
- New speakers (1)
- Newspaper (1)
- Niedersorbisch (1)
- Nomen (1)
- Nominalsyntagma (1)
- Non-projecting words (1)
- Nonverbal communication (1)
- Nord-Sotho (1)
- Nordchinesisch (1)
- Nordsotho (1)
- Norm <Ethik> (1)
- Normativität (1)
- Normdatei (1)
- North Frisian (1)
- Norwegen (1)
- Norwegian Nynorsk (1)
- NottDeuYTSch Corpus (1)
- Null instantiation (1)
- Null-Subjekt (1)
- Nurse-patient communication (1)
- Nutzen (1)
- Nutzer (1)
- OAuth (1)
- OBELEX (1)
- OCR-Verarbeitung (1)
- OEC (1)
- OED (1)
- OO-correspondence (1)
- OTRS (1)
- OWL-Ontology (1)
- Objektsprache (1)
- Old High German (OHG) (1)
- Old Norse (1)
- Old Romanian (1)
- Old Testament (1)
- Older German (OHG, MHG, OS, MLG) (1)
- On-line syntax (1)
- Online thesaurus (1)
- Online-Dienst (1)
- Online-Informationssystem (1)
- Online-Publikation (1)
- Online-Wortschatz-Informationssystem Deutsch (OWID) (1)
- Onlinekommentare (1)
- Onomasiologie (1)
- OntoLex-Lemon (1)
- Ontologie (1)
- Ontology (1)
- Ontology development (1)
- Open Information (1)
- Opposition (1)
- Optimality Theory (1)
- Optimality theory (1)
- Oral history (1)
- Ost-West-Konflikt (1)
- Osthoff, Hermann (1)
- P600 (1)
- PCFG (1)
- POS-Tagging (1)
- Pacific (1)
- Palauan (1)
- Parallel European Corpus of Informal Interaction (PECII) (1)
- Parallel corpora (1)
- Parallelismus (1)
- Parsing Systems (1)
- Part-of-speech tagging (1)
- Parteipolitik (1)
- Particle Verbs (1)
- Parts of speech (1)
- Pathogener Mikroorganismus (1)
- Patiens (1)
- Pazifischer Ozean (1)
- Pazifischer Ozean <Süd> (1)
- Pearson Korrelation (1)
- Pennsylvania German (1)
- Performanz <Linguistik> (1)
- Periscope (1)
- Periscope <Programm> (1)
- Persian (1)
- Persisch (1)
- Persistent identifier (1)
- Personal Learning Environment (1)
- Personal data (1)
- Persönlichkeitsrecht (1)
- Petition (1)
- Pflegeheim (1)
- Phonatory behavior (1)
- Phonemic level (1)
- Phonesthemes (1)
- Phonetics (1)
- Phonology (1)
- Phrase <Syntagma> (1)
- Phrase Based Active Dictionary (PAD) (1)
- Phrasenstruktur (1)
- Phrasenstrukturgrammatik (1)
- Pitch Range (1)
- Pivot (1)
- Place reference (1)
- Plenum (1)
- Pleonastic Prepositions (1)
- Plesionymy (1)
- Plural Comitative Construction (PCC) (1)
- Poetik (1)
- Polarity Shifter (1)
- Polarity items (1)
- Polaritätsprofil (1)
- Poliqarp (1)
- Polish dialectology (1)
- Politiker (1)
- Politische Berichterstattung (1)
- Politische Einstellung (1)
- Politische Entscheidung (1)
- Politische Identität (1)
- Politische Kommunikation im Fernsehen (1)
- Politische Rede (1)
- Politische Willensbildung (1)
- Politischer Protest (1)
- Polysem (1)
- Popmusik (1)
- Portuguese (1)
- Positionierung (1)
- Possessivpronomen (1)
- Post-Soviet (1)
- Postkolonialismus (1)
- Practice (1)
- Pragmalinguistik (1)
- Pragmatic inference (1)
- Praktische Vernunft (1)
- Prepositional object clause (1)
- Preservation (1)
- Presse (1)
- Pressekonferenz (1)
- Priming (1)
- Privacy (1)
- Privacy by Design (1)
- Privatheit (1)
- Pro-Form (1)
- Probe (1)
- Processing (1)
- Produktivität <Linguistik> (1)
- Prognose (1)
- Programmieren <Informatik> (1)
- Prolog (1)
- Propp system (1)
- Propp, Vladimir Jakovlevič (1)
- Propriozeption (1)
- Prosodic Matching (1)
- Prosodic repetition (1)
- Prosody Transplantation (1)
- Proust, Marcel (1)
- Proverb (1)
- Provider (1)
- Prädikatives Adjektiv (1)
- Prädikativsatz (1)
- Prädiktor (1)
- Präferenz (1)
- Präfix be (1)
- Präpositionaler Objektsatz (1)
- Präpositionalobjekt (1)
- Präpositionalphrase (1)
- Präsident (1)
- Präteritum (1)
- Pseudonymisierung (1)
- Psychische Störung (1)
- Psychisches Trauma (1)
- Psychoanalyse (1)
- Psychodynamische Psychotherapie (1)
- Psychose (1)
- Public sector information (1)
- Pynchon, Thomas (1)
- QUEST (1)
- QUEST project (1)
- Qualitative Inhaltsanalyse (1)
- Qualitative research (1)
- Qualitätskontrolle (1)
- Quantitative research (1)
- Query Languages (1)
- Query Rewriting (1)
- Querying (1)
- Question Answering (1)
- Question Answering System (1)
- Questioning sequences (1)
- Quotations (1)
- R <Programm> (1)
- R package (1)
- RDF <Informatik> (1)
- RDM (1)
- RSS newsfeed corpus (1)
- Rabaul Creole German (1)
- Rapmusiker (1)
- Rassismus (1)
- Rat für Deutsche Rechtschreibung (1)
- Rationalität (1)
- Re-Recordings (1)
- Reaktionszeit (1)
- Rechtschreibreform (1)
- Rechtsschutz (1)
- Rechtsversetzung (1)
- Recipient Design (1)
- Redaktionssystem (1)
- Redefreiheit (1)
- Reduplikation (1)
- Reference Corpora (1)
- Reflexitität <Linguistik> (1)
- Regeln (1)
- Register (1)
- Reibelaut (1)
- Reim (1)
- Reisebericht (1)
- Reiseführer (1)
- Reiseliteratur (1)
- Relation type (1)
- Relative pronoun (1)
- Relativism (1)
- Relativpronomen (1)
- Religion and Psychology (1)
- Repair (1)
- Replication (1)
- Replikat (1)
- Reproduzierbarkeit (1)
- Repräsentation <Politik> (1)
- Republican Party (USA) (1)
- Research Data Infrastructure (RDI) (1)
- Research infrastructure (1)
- Research infrastructures (1)
- Response tokens (1)
- Ressourcen (1)
- Rheinische Missions-Gesellschaft (1)
- Robot Language (1)
- Robotik (1)
- Romanian corpus (1)
- Romanian lexicography (1)
- Romantische Liebe (1)
- Routines (1)
- Rule-following (1)
- Russia (1)
- Russian-Germans (1)
- Russophones (1)
- Rückläufiges Wörterbuch (1)
- Rückmeldepartikel (1)
- Rückmeldung (1)
- Rēzekne (1)
- SABIO-RK (1)
- SALSA (1)
- SALSA corpus (1)
- SAT (1)
- SCAD-zbMATH (1)
- SCyDia (1)
- SDO (1)
- SIS (1)
- SOA (1)
- SQL (1)
- SSH (1)
- Sachverhalt (1)
- Samen <Volk> (1)
- Samoan German (1)
- Satz (1)
- Satzeinbettendes Prädikat (1)
- Satzende (1)
- Satzkonjunktion (1)
- Satzlänge (1)
- Satzsemantik (1)
- Satztyp (1)
- Scale (1)
- Schauspielkunst (1)
- Schema Languages (1)
- Scherz (1)
- Schleswig-Holstein (1)
- Schlieren photography (1)
- Schlüsselwort (1)
- Schottland. Parliament (1)
- Schreiben (1)
- Schriftzeichen (1)
- Schulbildung (1)
- Schulbuch (1)
- Schuld (1)
- Schule (1)
- Schulung (1)
- Schulwahl (1)
- Schwa (1)
- Schweigen (1)
- Schwäbisch (1)
- Schüler (1)
- SciLogs (1)
- Science theory (1)
- Searle, John R. (1)
- Second Language Learning (1)
- Segmentdauer (1)
- Sehbehinderung (1)
- Selbst (1)
- Selbstdarstellung (1)
- Selbstgesteuertes Lernen (1)
- Selbsthilfe (1)
- Selbstorganisation (1)
- Selbstreflexion (1)
- Self-Regulated Learning (1)
- Semantic (1)
- Semantic Analysis (1)
- Semantic Interoperability (1)
- Semantic analysis (1)
- Semantic opposition (1)
- Semantic relation (1)
- Semantic role labelling (1)
- Semantic roles (1)
- Semantic similarity (1)
- Semi-automatic annotation (1)
- Sentence connectives (1)
- Sentence level (1)
- Sentence processing (1)
- SentiFrameNet (1)
- Sepedi (1)
- Sequenz (1)
- Serbian (1)
- Serbian language (1)
- Server (1)
- Serviceintegration (1)
- Serviceorientierte Architektur (1)
- Sexismus (1)
- Sexuelle Belästigung (1)
- Sichtbarkeit (1)
- Sie (1)
- Sie <Wort> (1)
- Sign Languages (1)
- Sign language dictionary (1)
- Sign-Based Construction Grammar (1)
- Silbentrennung (1)
- Simultanübersetzen (1)
- Situatives Involvement (1)
- Sketch Engine (1)
- Sketch engine (1)
- Slavic languages (1)
- Slavische Sprachen (1)
- Slawisch (1)
- Slawische Minderheit (1)
- Slawistik (1)
- Slips (1)
- Slovak (1)
- Slovene (1)
- Slovenisch (1)
- Slowenien (1)
- Smartphone-Gebrauch (1)
- Smartphones (1)
- Smiley (1)
- Social cognition (1)
- Social interaction (1)
- Social media (1)
- Social perception (1)
- Social sciences and humanities (1)
- Socio-Economic Panel (SOEP) (1)
- Softwareergonomie (1)
- Softwarewiederverwendung (1)
- Solidarität (1)
- Sonora (1)
- Sorbian (1)
- Sorbian languages in Germany (1)
- Sotho-Sprache (1)
- Source/goal assymetry (1)
- South Caucasian (1)
- South Tyrol (1)
- Sowjetunion (1)
- Soziale Integration (1)
- Soziale Rolle (1)
- Soziale Sanktion (1)
- Sozialer Konflikt (1)
- Sozialer Prozess (1)
- Sozialer Wandel (1)
- Sozialisation (1)
- Sozialkompetenz (1)
- Sozialtopografie (1)
- Sozialverhalten (1)
- Space (1)
- Space in language (1)
- Spanien (1)
- Spanish (1)
- Spanish Royal Academy (1)
- Spanish lexicography (1)
- Sparkling wine (1)
- Spatial cases (1)
- Special field lexicography (1)
- Speech Corpora (1)
- Speech Lexica (1)
- Speech production (1)
- Spezifikation (1)
- Spiel (1)
- Spieler (1)
- Spielrahmen (1)
- Spoken Language Data (1)
- Sprachakt (1)
- Sprachbiographien (1)
- Sprachdeterminismus (1)
- Spracheinstellung (1)
- Spracheinstellungen (1)
- Spracherkennung (1)
- Sprachfertigkeit (1)
- Sprachgemeinschaft (1)
- Sprachkritik (1)
- Sprachkurs (1)
- Sprachliche Universalien (1)
- Sprachliche Varietät (1)
- Sprachliches Relativitätsprinzip (1)
- Sprachplanung (1)
- Sprachressource (1)
- Sprachstudie (1)
- Sprachsynthese (1)
- Sprachtheorie (1)
- Sprachursprung (1)
- Sprachvarietät (1)
- Sprachzeichen (1)
- Sprachübersetzung (1)
- Sprechakte (1)
- Sprechakttheorie (1)
- Sprechen (1)
- Sprechererkennung (1)
- Sprichwortforschung (1)
- Spurious regression (1)
- Staatssprache (1)
- Stadtmundart (1)
- State-of-Affairs (1)
- Statistical Learning (1)
- Statistical methods (1)
- Statistische Analyse (1)
- Statistische Linguistik (1)
- Statistisches Modell (1)
- Stimmapparat (1)
- Storage Requirements (1)
- Strategie (1)
- Stressbewältigung (1)
- Struktur (1)
- Student (1)
- Studium (1)
- Subjectivity (1)
- Subjektivierung <Linguistik> (1)
- Subkultur (1)
- Subordination <Linguistik> (1)
- Substantiv (1)
- Substrat <Linguistik> (1)
- Such- und Recherchesysteme (1)
- Suchtechnologie (1)
- Suffigierung (1)
- Summary (1)
- Supervised Classification (1)
- Surface pattern (1)
- Suspendierung (1)
- Sustainability (1)
- Swahili (1)
- Symptom (1)
- Synchronizität (1)
- Syncretism (1)
- Synkretismus (1)
- Synonymie (1)
- Syrer (1)
- Szene (1)
- Sámi (1)
- Sámi languages in Finland (1)
- Südtirol (1)
- Südwestdeutsch (1)
- Südwestdeutschland (1)
- T-shirt lexicography (1)
- TBX (1)
- TEI LingSIG (1)
- TEI XML (1)
- TEI encoding (1)
- TEI-Lex0 (1)
- TEI/XML (1)
- TRP (1)
- TSPP Model (1)
- Tabelle (1)
- Tag (1)
- Tagung (1)
- Tagungsbericht (1)
- Take-In-Interaction (1)
- Target relation (1)
- Teamwork (1)
- Technik (1)
- Technologiegebrauch (1)
- Telepräsenz (1)
- Temporal Reference (1)
- Tense (1)
- Tenseless Languages (1)
- Terminologiedatenbank (1)
- Terminology (1)
- Terrebonne Parish (1)
- Testdaten (1)
- Testproduktion (1)
- Text Categorisation (1)
- Text Classification (1)
- Text Technology (1)
- Text data (1)
- Text mining (1)
- Text retrieval (1)
- Text technology (1)
- Text+ (1)
- Textanalyse ; Diskursanalyse ; Computerlinguistik (1)
- Textbaustein (1)
- Textklassifikation (1)
- Textklassifizierung (1)
- Textlingustik (1)
- Textplus NFDI (1)
- Textverstehendes System (1)
- Thailändisch (1)
- The Oxford English dictionary (1)
- Theaterspiel (1)
- Thema-Rhema-Gliederung (1)
- Thematische Rolle (1)
- Theodor Arnold (1)
- Theorie und Praxis (1)
- Thurneysen, Eduard Rudolf (1)
- Thurneysen, Eduard Rudolf (1)
- Tiefenpsychologisch fundierte Psychotherapie (1)
- Tiersprache (1)
- Time (1)
- Timing (1)
- Titling (1)
- Token <Linguistik> (1)
- Topic map (1)
- Topik-drop (1)
- Topikmodellierung (1)
- Totalitarismus (1)
- Traffic (1)
- Training (1)
- Transformative Sequences (1)
- Transformatives Lernen (1)
- Transitives Verb (1)
- Transitivity (1)
- Transitivität (1)
- Transkripte (1)
- Transkritpion (1)
- Transparenz (1)
- Treebank (1)
- Tschadische Sprachen (1)
- Tsingtau (1)
- Tunnel DP-algorithm (1)
- Tunnel Matrix (1)
- Turn Competition (1)
- Turn design (1)
- Tweet (1)
- Type-Token Verhältnis (1)
- Typologie (1)
- Türkei (1)
- Türkischer Jugendlicher (1)
- UIMA (1)
- Ukraine (1)
- Uncertainty (1)
- Uncertainty avoidance (1)
- Unconnected node (1)
- Unfähigkeit (1)
- Ungenauigkeit (1)
- Union of Soviet Socialist Republics (USSR) (1)
- Universalgrammatik (1)
- Universalität (1)
- Universität zu Köln (1)
- Unterricht (1)
- Unterrichtsmethode (1)
- Unterrichtsprache (1)
- Unterrichtssprache (1)
- Unvollständige TCUs (1)
- Urban dialects (1)
- Usability (1)
- UseNet (1)
- User <Benutzer> (1)
- User Generated Content (1)
- VLO (1)
- VR-games (1)
- Valences (1)
- Valenztheorie <Linguistik> (1)
- Varianz <Linguistik> (1)
- Variation des gesprochenen Deutsch (1)
- Ventspils University of Applied Sciences (VUAS) (1)
- Verb <verdienen> (1)
- Verb-Erst-Stellung (1)
- Verbal fluency (1)
- Verbalagression (1)
- Verbale Äußerung (1)
- Verbzweit (1)
- Vereinfachung (1)
- Vereinheitlichung (1)
- Verfahren der Zeichenprozessierung (1)
- Verfügbarkeit (1)
- Vergessen (1)
- Vergewaltigung (1)
- Vergleich (1)
- Vergleich <Rhetorik> (1)
- Vergleichbarkeit (1)
- Verhaltensmodifikation (1)
- Verhandlung (1)
- Verlaufsform (1)
- Vermutung <Linguistik> (1)
- Versdichtung (1)
- Verstehen und Intersubjektivität (1)
- Verständigung (1)
- Verwandtschaftsbezeichnung (1)
- Very Large Corpora (1)
- Veränderungsmessung (1)
- Videaufzeichnung (1)
- Video (1)
- Videodaten (1)
- Videointerview (1)
- Videokonferenz (1)
- Vietnamesisch (1)
- Vision (1)
- Visualization (1)
- Visualizations (1)
- Visueller Kontrast (1)
- Vokabellernen (1)
- Vokalisierung (1)
- Volksabstimmung (1)
- Volltext (1)
- Vorlesung (1)
- Vormachen (1)
- Vorschlagen (1)
- Vortragstechnik (1)
- Vorwort (1)
- Võro (1)
- WCC (1)
- WH-cleft (1)
- WOrd eMBedding dATabase (WOMBAT) (1)
- WSD (1)
- Wabi Sabi (1)
- Wahlforschung (1)
- Wahrnehmungsverb (1)
- Walbiri-Sprache (1)
- Walisisch (1)
- Walter Porzig (1)
- Web corpus (1)
- Web spam (1)
- WebLicht (1)
- Weblog (1)
- Weißrussisch (1)
- Welsh (1)
- Werbung (1)
- West Germanic (1)
- Westeuropa (1)
- What-about questions (1)
- WhatsApp (1)
- Widerstand (1)
- Widget bundle (1)
- Wien <2018> (1)
- Wikibase (1)
- Wikipedia articles (1)
- Wikipedia talk pages (1)
- Wiktionary revision history (1)
- Wirtschaftssprache (1)
- Wisconsin (1)
- Wissenschaft (1)
- Wissenschaftsentwicklung (1)
- Wissenschaftskommunikation (1)
- Wissenschaftspublizistik (1)
- Wissensextraktion (1)
- Wissensextration (1)
- Wissensgraph (1)
- Wissensrepräsentation (1)
- Wissensverarbeitung (1)
- Witz (1)
- Word associations (1)
- Word history (1)
- Word selection (1)
- Word‐Length (1)
- World War I (1)
- World War II (1)
- World Wide Web (1)
- Wortbedeutung <Semasiologie> (1)
- Wortfamilie (1)
- Wortfeld (1)
- Wortfolge (1)
- Wortgeschichte digital (Digital Word History) (1)
- Wortgrenze (1)
- Wortliste (1)
- Wortphonologie (1)
- Wortspiel (1)
- Writing (1)
- Writing process (1)
- Writing research (1)
- Writing technology (1)
- Wörterbuch Geschichte (1)
- Wörterbuch der deutschen Gegenwartssprache (WDG) (1)
- Wörterbucharbeit (1)
- Wörterbücher afrikanischer Sprachen (1)
- XForms (1)
- XML applications (1)
- XML database (1)
- XQuery Full Text (1)
- XSL Transformation (1)
- XSLT (1)
- YouTube comments (1)
- ZAS Database of Clause-Embedding Predicates (1)
- ZAS-Datenbank satzeinbettender Prädikate (1)
- Zeichen (1)
- Zeigesequenzen (1)
- Zeitreihenanalyse (1)
- Zeitschrift (1)
- Zeitsemantik (1)
- Zeitwahrnehmung (1)
- Zertifizierung (1)
- Zipf (1)
- Zipf–Mandelbrot law (1)
- Zugehörigkeit (1)
- Zuverlässigkeit (1)
- Zweierbeziehung (1)
- Zwischenmenschliche Beziehung (1)
- Zäsur <Metrik> (1)
- aanlyn woordeboeke (1)
- aboriginal culture in northern Russia (1)
- abusive comparisons (1)
- abusive emojis (1)
- abusive remarks (1)
- abusive words (1)
- academic dictionary (1)
- accent (1)
- acceptability ratings (1)
- access structure (1)
- accounting (1)
- accounts (1)
- accusation (1)
- acquisition (1)
- acting technique (1)
- action (1)
- action recognition (1)
- action-ascription (1)
- actuation problem (1)
- acute hospital (1)
- adaptive design (1)
- addition (1)
- adjacency pair (1)
- adjectives (1)
- ado file (1)
- adposition (1)
- adult education (1)
- adverb (1)
- adverbial connective (1)
- aerodynamics (1)
- aesthetic concept (1)
- aesthetic evaluation (1)
- aesthetics (1)
- affective stance (1)
- affiliation (1)
- affirmation of the consequent (1)
- african languages dictionaries (1)
- afrikataalwoordeboeke (1)
- age of acquisition (1)
- age stereotypes (1)
- agent (1)
- agent role (1)
- agentivity effect (1)
- aging (1)
- aikuiskoulutus (1)
- algorithms (1)
- allemand parlé (1)
- allomorphy (1)
- allostructions (1)
- ambiguous words (1)
- ambivalent sexism (1)
- analepsis (1)
- analogy (1)
- analyse conversationnelle (1)
- analyse multimodale (1)
- analytical opacity (1)
- anaphor (1)
- anaphoric relations (1)
- ancestry (1)
- animation (1)
- annotated corpora (1)
- annotation schema (1)
- annotation tools (1)
- announcements (1)
- anonymization (1)
- anotación multinivel (1)
- antecedence (1)
- anticipatory mechanism (1)
- application (1)
- application domain (1)
- applied language studies (1)
- applied linguistics (1)
- arbitrary scripts (1)
- architecture-for-interaction (1)
- archiving support (1)
- archiving workflow (1)
- argumentation (1)
- art reception (1)
- artefacts (1)
- articulography (1)
- assertion (1)
- assessment (1)
- assistance (1)
- attitudes towards dictionaries (1)
- audio-visual data (1)
- authentic language (1)
- authentic materials (1)
- author name homography (1)
- author name variability (1)
- authority records (1)
- automated tracking (1)
- automatic classification (1)
- automatic processing (1)
- automatic summarization (1)
- automatic term extraction (1)
- automatic translators (1)
- automatische Annotation (1)
- automotive domain (1)
- auxiliary selection (1)
- availability (1)
- avatars (1)
- average prediction complexity (1)
- aviation terminology (1)
- avun mobilisointi (1)
- base recognition model application (1)
- beliefs (1)
- benefit (1)
- bias awareness (1)
- bibliographic database (1)
- biconditional (1)
- bidirectionality (1)
- big data (1)
- bilingual (1)
- bilingual community (1)
- bilingual dictionaries in electronic format (1)
- bilingual electronic dictionaries (1)
- bilingual paronyms (1)
- bilingual resources (1)
- bilingual thesaurus (1)
- bilingualism (1)
- bilingualized dictionary (1)
- biomedical language processing (1)
- blindness (1)
- blog corpus (1)
- bodily conduct (1)
- bodily response (1)
- borrowing (1)
- bound word (1)
- boundary effects (1)
- brain rhythms (1)
- bridging relations (1)
- bridging resolution (1)
- business coaching (1)
- business data (1)
- business research (1)
- búsqueda (1)
- car racing (1)
- case (1)
- case syncretism (1)
- casual conversation (1)
- category detection (1)
- causal tagger (1)
- census (1)
- centres (1)
- cessation implicatures (1)
- cesuras (1)
- change-of-state token (1)
- child-directed speech (1)
- children (1)
- children’s specialised lexicography (1)
- children’s vocabulary (1)
- cipient (1)
- classification (1)
- clause linkage (1)
- clause union (1)
- climate (1)
- clitic climbing (1)
- close reading of dictionaries (1)
- closed vocabulary (1)
- clusivity (1)
- cluster analysis (1)
- clustering (1)
- co-training (1)
- code of ethics (1)
- code-switching (1)
- coding (1)
- coercion (1)
- cognitive availability (1)
- cognitive impairment (1)
- cognitive processing (1)
- cognitive salience (1)
- coherence (1)
- coherent construction (1)
- cohering affixes (1)
- collaboration (1)
- collaborative dictionary (1)
- collaborative filtering (1)
- collective emotions (1)
- collo-profile (1)
- collocated smartphone use (1)
- collocational behaviour (1)
- collostructional analysis (1)
- colonial group construction (1)
- combination of methods (1)
- combinatoric semantics (1)
- commonly confused words (1)
- communication (1)
- communication verb (1)
- communication verbs (1)
- communicative competence (1)
- communicative deviation (failure) (1)
- communicative deviations (1)
- community engagement (1)
- community size (1)
- comparable corpus (1)
- comparative lexicographic principles (1)
- comparative political science (1)
- comparison (1)
- compatibility (1)
- competence (1)
- complaint (1)
- complement clause (1)
- complementizer (1)
- complex graphemes (1)
- complex preposition (1)
- complex prepositions (CPs) (1)
- compositionality (1)
- compound family (1)
- compound formation (1)
- compound interpretation (1)
- compounding (1)
- comprehensibility (1)
- comprehension (1)
- compression (1)
- compuer-assisted language learning (1)
- computational language models (1)
- computer game (1)
- computer-assisted language learning (CALL) (1)
- computer-assisted pronunciation training (CAPT) (1)
- computerized grammar (1)
- comunicación mediada por computadora (CMC) (1)
- conative construction (1)
- concept scheme (1)
- concept system (1)
- concept system visualization (1)
- concept systems (1)
- conceptual approach (1)
- conceptual domain (1)
- conceptual field (1)
- conceptual history (1)
- conceptual metaphor theory (1)
- conceptualisation (1)
- concersation analysis (1)
- conflict (1)
- confusion (1)
- connectives (1)
- constrained poetic structure (1)
- constraint optimization (1)
- constraint satisfaction (1)
- constraint solving (1)
- construal (1)
- constructional ambiguity (1)
- constructional synonymy (1)
- contact linguistics (1)
- content management platform (1)
- content questions (1)
- context markers (1)
- contexts of dictionary use (1)
- contextual framework (1)
- contextual meaning (1)
- contingencies (1)
- continuer (1)
- continuers (1)
- contradiction (1)
- contrast (1)
- contrastive entries (1)
- contrastive focus (1)
- contrastive lexicography (1)
- controlled vocabularies (1)
- conversation (1)
- conversation analyses (1)
- conversation-analytic transcription (1)
- conversational analysis (1)
- conversational constructions (1)
- conversational narrative (1)
- convolutional neural networks (1)
- coordination of verbal and embodied action (1)
- copular clauses (1)
- copular constructions (1)
- copulatives (1)
- copyright laws (1)
- corona-neologism (1)
- coronacorpus (1)
- coronavirus (1)
- corpora of talk-in-interaction (1)
- corpus linguistics (1)
- corpus CMC (1)
- corpus access (1)
- corpus analysis tools (1)
- corpus architecture (1)
- corpus compilation (1)
- corpus construction (1)
- corpus creation (1)
- corpus de aprendices (1)
- corpus development (1)
- corpus driven approach (1)
- corpus exploitation (1)
- corpus frequencies (1)
- corpus information (1)
- corpus management systems (1)
- corpus pragmatics (1)
- corpus query processing (1)
- corpus query protocol (1)
- corpus querying (1)
- corpus retrieval (1)
- corpus search engine (1)
- corpus search platform (1)
- corpus size (1)
- corpus storage (1)
- corpus-based evaluation (1)
- corpus-based lexicon building (1)
- corpus-based methods (1)
- corpus-based statistical methods (1)
- corpus-based terminography (1)
- corpus-driven lexicography (1)
- corpus-lexicographic tool (1)
- corrections (1)
- correspondence (1)
- corpus-based lexicography (1)
- counterfactual recipient design (1)
- couple interaction (1)
- creole (1)
- crime (1)
- critical events (1)
- cross-cultural (1)
- cross-cultural research (1)
- cross-linguistic analysis (1)
- cross-linguistic data (1)
- cross-national policy convergence (1)
- crosswalks (1)
- cultural diversity (1)
- cultural heritage resources (1)
- culture specific items (1)
- curation (1)
- da (1)
- das <Wort> (1)
- data category (1)
- data control mechanism (1)
- data curation (1)
- data deposition (1)
- data dissemination (1)
- data exploration (1)
- data modeling (1)
- data modelling (1)
- data presentation (1)
- data processing (1)
- data provision (1)
- data referencing (1)
- data sets (1)
- data sustainability (1)
- data visualization (1)
- database applications (1)
- database systems (1)
- dataset (1)
- dative (1)
- decentralization (1)
- decision tree modelling (1)
- decision tree structure (1)
- decision-making (1)
- deep learning (1)
- deep-level morphological analyses (1)
- deep-structure morphological analyses (1)
- definiteness (1)
- definition (1)
- definitions (1)
- delayed completion (1)
- demonstration (1)
- demonstrative (1)
- denial of the antecedent (1)
- deontic (1)
- deontic modality (1)
- depiction (1)
- derivation (1)
- derivational morphology (1)
- derived subject (1)
- desambiguación (1)
- description (1)
- description of neologisms (1)
- descriptive (1)
- detection of neologisms (1)
- determinologisation (1)
- deviation (1)
- diachronic change (1)
- diachronic variation in language use (1)
- dialect (1)
- dialect competence (1)
- dialect lexicography (1)
- dialectometry (1)
- dialektometrie (1)
- dialogue interpreting (1)
- diary omission (1)
- diaspora communities (1)
- dictionaries as social agents (1)
- dictionarisability (1)
- dictionary didactics (1)
- dictionary editing system (1)
- dictionary encoding (1)
- dictionary of language contact (1)
- dictionary portal (1)
- dictionary teaching (1)
- dictionary typology (1)
- dictionary+ (1)
- dictionnaire des néologismes (1)
- didactic corpus (1)
- difference (1)
- diffusion mechanism (1)
- diffusion studies (1)
- digitaaliset taidot (1)
- digital collocation database (1)
- digital communication (1)
- digital lexicography (1)
- digital libraries (1)
- digital library (1)
- digital skills (1)
- digitally-mediated communication (1)
- diphthongs (1)
- directionality (1)
- directives (1)
- discourse deixis (1)
- discourse dictionary (1)
- discourse history (1)
- discourse keywords (DKW) (1)
- discourse markers (1)
- discourse metaphor (1)
- discourse metaphors (1)
- discourse parsing (1)
- discourse particles (1)
- discourse processing (1)
- discourse structure (1)
- discourse-level associations (1)
- discovering collocations in corpora (1)
- disengagement (1)
- disjunction (1)
- dissemination (1)
- do-support (1)
- doctor-patient interaction (1)
- document management and text processing (1)
- document processing (1)
- document triage (1)
- domain label (1)
- domain-specific solutions (1)
- double object (1)
- download vs. citation patterns (1)
- driving (1)
- drop out (1)
- dual task (1)
- duration (1)
- duration prediction (1)
- dyadic coping (1)
- dynamic lexicography (1)
- e-dictionary (1)
- e-dictionary application (1)
- eHumanities (1)
- early response (1)
- ecolinguistics (1)
- economic conditions (1)
- economic data (1)
- economy principles (1)
- editorial (1)
- editorial process (1)
- edutainment (1)
- efficiency (1)
- ego-documents (1)
- egocentrism (1)
- einsprachiges Wörterbuch (1)
- elderspeak (1)
- electromagnetic articulography (1)
- electronic corpus (1)
- electronic dictionaries (1)
- electronic dictionary (1)
- elektroniese woordeboeke (1)
- elicitation (1)
- ellipsis (1)
- embedded tense (1)
- embodied action (1)
- embodied displays (1)
- embodied other-initiation of repair (1)
- embodied withdrawal (1)
- emergence (1)
- emotional valence (1)
- empirical aesthetics (1)
- encoding (1)
- encounter (1)
- encyclopedic-conceptual approach (1)
- entropy (1)
- environment (1)
- epistemic priority (1)
- epistemicity (1)
- epistemische Priorität (1)
- equi-complexity hypothesis (1)
- error collection (1)
- es (1)
- ethnicity (1)
- ethno-regionalism (1)
- ethnolinguistic identity (1)
- ethnolinguistic vitality (1)
- ethnomethodology (1)
- etymological data base (1)
- etymology (1)
- europeanization (1)
- event structure (1)
- event-related brain potentials (ERP) (1)
- event-related potentials (1)
- evidentiality (1)
- evoked potentials (1)
- evolution of Scientific English (1)
- exceptional case marking (1)
- excessive (1)
- exclusive particles (1)
- existential, tense (1)
- experience (1)
- experiment (1)
- experimental evidence (1)
- experimental linguistics (1)
- experimental syntax (1)
- experimentation (1)
- experimentelle Phonetik (1)
- expertise (1)
- expert–novice (1)
- explicit and integrated intervention program (1)
- exploration of CMDI metadata (1)
- extended search (1)
- extensibility (1)
- extralexicographic features (1)
- eye tracking (1)
- eye-tracking (1)
- f0 accommodation (1)
- face-to-face interaction (1)
- factuality (1)
- family interaction (1)
- family relationships (1)
- family studies (1)
- feature compound (1)
- feature structure representation (1)
- fiabilidad (1)
- fieldwork (1)
- figurative meaning (1)
- finite state (1)
- finite state tokenization (1)
- first pair part (1)
- first person plural pronouns (1)
- fixation duration (1)
- focus (1)
- focus alternatives (1)
- focus phrase (1)
- folk linguistics (1)
- fonologie (1)
- food photography (1)
- footing shifts (1)
- foreign accent (1)
- foreign language learner (1)
- foreign language teacher (1)
- foreign language teaching (1)
- forgetfulness (1)
- form of communication (1)
- formal mathematics (1)
- formal model (1)
- format migration (1)
- formation de mots (1)
- formats (1)
- forms of representation in digital lexicography (1)
- frame semantics (1)
- frame structure (1)
- frame-based contrastive analysis (1)
- framing (1)
- free-sorting (1)
- frequency (1)
- fuck (1)
- full form systems (1)
- functional categories (1)
- functional status (1)
- future (1)
- fuzziness (1)
- gam (1)
- gathering (1)
- gebruikersleiding (1)
- gender (1)
- gender and language (1)
- gender differences (1)
- gender en taal (1)
- gender identity (1)
- gender stereotypes (1)
- genderstereotipes (1)
- general dictionary (1)
- general monolingual dictionary (1)
- generating information on demand (1)
- genericity (1)
- genre conceptions (1)
- genre expectations (1)
- genre-specific literary reading (1)
- genre-specific reading strategies (1)
- gestural hold (1)
- globaLex (Körperschaft) (1)
- global biodiversity in the early 21st century (1)
- global extension (1)
- global extinction of languages (1)
- global structural information (1)
- globalization (1)
- gold standard corpus (1)
- governance (1)
- govorni njemački u interakciji (1)
- gradable adjectives (1)
- grammar acquisistion (1)
- grammar competition (1)
- grammar development (1)
- grammar engineering (1)
- grammar learning (1)
- grammar testing (1)
- grammar-based language learning (1)
- grammatical KOS (1)
- grammatical complexity (1)
- grammatical construction (1)
- grammatical framework (1)
- grammatical information (1)
- grammatical particle (1)
- graph databases (1)
- graph-based dictionaries (1)
- graphematics (1)
- graphemic representation (1)
- graphetics (1)
- guidelines (1)
- handwriting (1)
- head alignment (1)
- head nod (1)
- headword (1)
- help desk (1)
- helping interaction (1)
- heroism (1)
- high-variability training (1)
- high-vowel laxing (1)
- higher education policy (1)
- hiperskakels (1)
- historical corpora (1)
- historical encyclopedias (1)
- historical lexicology (1)
- historical word formation of German (1)
- history of science (1)
- hosting provider (1)
- household work (1)
- human annotation studies (1)
- human cognition (1)
- human learning (1)
- humor (1)
- hyperlinks (1)
- identities in talk (1)
- identity construction (1)
- identity effects (1)
- identity groups (1)
- idiom detection (1)
- idiosyncrasy (1)
- imagination (1)
- impact indicator (1)
- imperatives (1)
- imperfective (1)
- impersonal (1)
- impersonal deontic statement (1)
- impersonal structures (1)
- implicit abuse (1)
- implicit association test (IAT) (1)
- implicitly abusive comparisons (1)
- implicitly abusive language (1)
- inbreath (1)
- incoherent construction (1)
- incomplete TCUs (1)
- increments (1)
- indirect questions (1)
- indirekter Sprechakt (1)
- individual alpha frequency (1)
- individual differences (1)
- inferences (1)
- infinite canvas (1)
- infinitival complements (1)
- inflected complementizer (1)
- inflected form (1)
- inflected forms (1)
- inflection (1)
- información de corpus (1)
- information density (1)
- information extraction (1)
- information infrastructure (1)
- information presentation devices (1)
- information retrieval (1)
- informing (1)
- infrastructure technology (1)
- infrastructures and architectures (1)
- inligtingsaanbiedingsinstrumente (1)
- innovation (1)
- inspection sequences (1)
- institutional action (1)
- instructional imteratives (1)
- integrated e-dictionary (1)
- integration (1)
- integriertes Lernen (1)
- intelligence (1)
- intensification (1)
- intention (1)
- intention ascription (1)
- inter-annotator reliability (1)
- inter-rater variability (1)
- interaction space (1)
- interactional competence (1)
- interactional grammar (1)
- interactional histories (1)
- interactional phonetics (1)
- interactional project (1)
- interactive editing (1)
- interactive graph visualisation (1)
- interactive turn space (1)
- interactivity (1)
- interakcijsko jezikoslovlje (1)
- interaktives Editieren (1)
- intercultural communication (1)
- intergroup relations (1)
- interlocking organization (1)
- intermediality (1)
- international comparable corpus (1)
- international comparison (1)
- international school (1)
- international work settings (1)
- internetbasierte Kommunikation (1)
- internetbasierte Kommunikation (IBK) (1)
- interpret (1)
- interpretation practices (1)
- interrogatives (1)
- intersectionality (1)
- intersemiotic translation adequacy (1)
- intervention (1)
- intonation units (1)
- intra-rater variability (1)
- intra-writer variation (1)
- inversion (1)
- island (1)
- iso24613 (1)
- isomorphism (1)
- item variability (1)
- joint projects (1)
- joint utterance formulation (1)
- joke (1)
- justification (1)
- keuse-boomstruktuur (1)
- keyphrase extraction (1)
- keyword analysis (1)
- kinship terminology (1)
- knowledge sources (1)
- kognitive Semantik (1)
- kollokasies (1)
- kontrafaktischer Adressatenzuschnitt (1)
- kontrastive Grammatik (1)
- kontrastive Lexikologie (1)
- kopulatiewe (1)
- korpusgebaseerde leksikografie (1)
- landscape (1)
- landscapes (1)
- language Standardization (1)
- language acquisition (1)
- language activism (1)
- language and gender (1)
- language area (1)
- language attitude (1)
- language awareness (1)
- language comparison (1)
- language corpora (1)
- language data (1)
- language discourses (1)
- language efficiency (1)
- language endangerment (1)
- language fixedness (1)
- language legislation (1)
- language marketing (1)
- language model (1)
- language modelling (1)
- language narratives (1)
- language regards (1)
- language resource (1)
- language shift (1)
- language socialisation (1)
- language structure (1)
- language studies (1)
- language teaching (1)
- language use (1)
- language variation (1)
- languages in Mari El (1)
- languages in Udmurtia (1)
- languages in the Russian Federation (1)
- large corpus data (1)
- large-scale corpora (1)
- latent semantic analysis (1)
- laughter (1)
- law (1)
- lean syntax (1)
- learner corpora (1)
- learner corpus of adolescent (1)
- learner's dictionary (1)
- learners’ dictionary (1)
- learning activities (1)
- learning motivation (1)
- lecture (1)
- legal aspects (1)
- legal lexicon (1)
- leksikografiese model (1)
- leksikografski izvori (1)
- lemma (1)
- length (1)
- lenguaje oral (1)
- less-resourced languages (1)
- lexcial decomposition (1)
- lexical borrowings (1)
- lexical analysis (1)
- lexical decision (1)
- lexical fields (1)
- lexical frequency (1)
- lexical level (1)
- lexical loans (1)
- lexical markup framework (1)
- lexical resources (1)
- lexical-functional grammar (1)
- lexicographers’ needs (1)
- lexicographic data (1)
- lexicographic functions (1)
- lexicographic model (1)
- lexicographic neology (1)
- lexicographic practices (1)
- lexicographic situation (1)
- lexicographical neology (1)
- lexicographical resource (1)
- lexicographical system (1)
- lexicon generation (1)
- lexicon graph (1)
- lexicon graphs (1)
- lexicon model (1)
- lexicon model formalism (1)
- lexicon structure (1)
- lexicotainment (1)
- lexikalische Repräsentation (1)
- lexikography (1)
- lexis (1)
- license (1)
- life science (1)
- lifelong learning (1)
- light-verb constructions (1)
- lightweight annotation (1)
- likelihood ratio test (1)
- linguistic abstractness (1)
- linguistic acculturation (1)
- linguistic and cultural diversity (1)
- linguistic annotation (1)
- linguistic borrowings (1)
- linguistic change (1)
- linguistic expectancy bias (LEB) (1)
- linguistic integration (1)
- linguistic intergroup bias (LIB) (1)
- linguistic landscape (1)
- linguistic landscapes (1)
- linguistic locational reference (1)
- linguistic minorities (1)
- linguistic minority (1)
- linguistic niche hypothesis (1)
- linguistic prominence (1)
- linguistic repair (1)
- linguistic rights of national groups (1)
- linguistic technology (1)
- linguistic typology (1)
- linguistically based measures (1)
- linguistics (1)
- linguistique interactionnelle (1)
- linking patterns (1)
- list of headwords (1)
- literary comprehension (1)
- literary processing (1)
- live video stream (1)
- loan translation (1)
- loan words (1)
- loans (1)
- loanwords (1)
- local ecology (1)
- locally uninstantiated arguments (1)
- locative vs. goal adverbial (1)
- log file (1)
- log file analysis (1)
- logical information systems (1)
- logical problem of language change (1)
- logistic regression (1)
- lokalistische Hypothese (1)
- machine translation (1)
- macrostructure (1)
- major reference work (1)
- makrostruktuur (1)
- mantenimiento (1)
- manual database curation (1)
- manual information extraction (1)
- marital satisfaction (1)
- markup framework (1)
- markup language (1)
- marqueurs de réponse (1)
- mashup (1)
- material culture (1)
- mathematical language (1)
- mathematical terms (1)
- mathematics (1)
- maximum likelihood (1)
- meaning (1)
- measurement (1)
- mechanisms (1)
- media discourse (1)
- media effects (1)
- media linguistics (1)
- media literacy (1)
- media practices (1)
- media technology (1)
- mediated interaction (1)
- mediation (1)
- mediostructure (1)
- mediostruktuur (1)
- mediterranean (1)
- meeting talk (1)
- meetings (1)
- mental health services (1)
- mental illness (1)
- mentalitiy (1)
- message effectiveness (1)
- messenger communication (1)
- meta-language (1)
- meta-pragmatic accounts (1)
- meta-semantic effects (1)
- metacommunication (1)
- metadata analysis (1)
- metadata curation (1)
- metadata editor (1)
- metadata formats (1)
- metadata quality (1)
- metadata quality assessment (1)
- metadata score (1)
- metadata standards (1)
- metaphor theory (1)
- metaphorical extension (1)
- metodologia (1)
- micro-constructions (1)
- micro-sequential relationship (1)
- microservices (1)
- microstructure bilingual dictionaries of linguistics (1)
- migration (1)
- migration linguistics (1)
- mikrostruktura (1)
- mikrostruktuur (1)
- minorities in Germany (1)
- minority language protection (1)
- minority language revitalisation (1)
- minority language speakers (1)
- minority languages and cultures (1)
- minority protection (1)
- minority–majority relations (1)
- mission societies (1)
- mixed-effects logistic regression models (1)
- mixed-effects modeling (1)
- mobile devices (1)
- mobilising assistance (1)
- mobiliy (1)
- mobilizing response (1)
- mock story (1)
- modal enrichment (1)
- modal meaning (1)
- modal particles (1)
- modal verb constructions (1)
- modalne čestice (1)
- modern forms of prejudice (1)
- modular pivot (1)
- modus ponens (1)
- monolingualised dictionary (1)
- monospaced font (1)
- mood (1)
- morfologie (1)
- morphemic categories (1)
- morpho-syntactic argument realization (1)
- morpho-syntactic database (1)
- morphological analyses (1)
- morphological complexity (1)
- morphological level (1)
- morphological parsing (1)
- morphological productivity (1)
- morphological treebank (1)
- mot d'emprunt (1)
- motion verb (1)
- motivation to control prejudiced responding (1)
- movie recommendation (1)
- mrežni rječnik (1)
- multi-activity and multi-party settings (1)
- multi-layer annotation (1)
- multi-layer corpora (1)
- multi-lingual grammar (1)
- multi-modality (1)
- multi-party dialogues (1)
- multi-relational learning (1)
- multi-turn conversations (1)
- multi-unit turn (1)
- multi-word expression (1)
- multiactivity (1)
- multidimensional scaling (1)
- multidimensionele skalering (1)
- multidisciplinarity (1)
- multifunctional lexical resource (1)
- multifunksionele leksikale bron (1)
- multilevel modeling (1)
- multilingual corpora (1)
- multilingual data (1)
- multilingual database (1)
- multilingual grammar (1)
- multilingual matter (1)
- multilingual platform (1)
- multilingual setting (1)
- multilingual transcripts (1)
- multilinguality (1)
- multimedia (1)
- multimodaalinen keskustelunanalyysi (1)
- multimodal conversation analysis (1)
- multimodal corpora (1)
- multimodal database (1)
- multimodal interaction analysis (1)
- multimodal storytelling (1)
- multiparty setting (1)
- multiple etymologies (1)
- mundane technology use (1)
- murder (1)
- naming (1)
- narrative (1)
- narrative analysis (1)
- narrative comparison (1)
- narratives in interaction (1)
- national and subnational standard varieties (1)
- national corpora (1)
- national identification (1)
- nationaler Mythos (1)
- nationalistic purism (1)
- native speech (1)
- natürlichsprachliche Systeme (1)
- negation Raising (1)
- negation content words (1)
- negation modeling (1)
- negation particle (1)
- negotiation (1)
- neologism detection (1)
- neologisms in Brazilian Portuguese (1)
- neology (1)
- neoterm (1)
- network analysis (1)
- neural oscillations and entrainment (1)
- neural phase precession (1)
- new media (1)
- new public management (1)
- newsfeed (1)
- newspaper reports (1)
- nodding (1)
- non-players (1)
- nonnative speakers (1)
- nonnative speech (1)
- nonstandard accent (1)
- normalisation (1)
- normalization (1)
- normativity (1)
- norms and rules (1)
- noun phrase (1)
- null complementation (1)
- null subject (1)
- néologismes des médias sociaux (1)
- object manipulation (1)
- objektorientierte Graphdatenbank (1)
- observation study (1)
- observational study (1)
- offers (1)
- official language (1)
- oh that’s right (1)
- okay (1)
- online dictionaries of linguistics (1)
- online discourse (1)
- online grammars (1)
- online information systems (1)
- online lexicographic resources (1)
- onomasiological search (1)
- onomastics (1)
- ontology (1)
- open class repair initiators (1)
- open dictionary (1)
- open educational trainer (1)
- open science (1)
- open source software (1)
- operationalized psychodynamic diagnosis (1)
- opinion extraction (1)
- opinion inference (1)
- opinion role extraction (1)
- opinion verb (1)
- opinion verbs (1)
- oral and written skills (1)
- oral corpus platform (1)
- oral history corpora (1)
- oral language (1)
- other-initiated repair (1)
- overlap resolution (1)
- overtaking (1)
- own experience (1)
- pandemic neologism (1)
- paradigm uniformity (1)
- parallel text corpus (1)
- parallelism (1)
- parameters (1)
- parental interventions (1)
- parliaments (1)
- paronym dictionaries (1)
- paronyms, easily confused words (1)
- paronymy (1)
- parser evaluation (1)
- parsing (1)
- part-of-speech tagging (1)
- participant opacity (1)
- participation (1)
- passive (1)
- past (1)
- patientivity (1)
- pattern-based lexicography (1)
- patterns (1)
- pean languages (1)
- pedagogical lexicography Greek (1)
- peer-group interaction (1)
- perceptual evaluation (1)
- perfect (1)
- performativity (1)
- permutation testing (1)
- persistent identifiers (1)
- person agreement (1)
- person perception (1)
- person reference (1)
- personal designations (1)
- personal learning environments (1)
- perspective (1)
- phi-features (1)
- phonetic databases (1)
- phonetic ending (1)
- phonological status (1)
- phonological word (1)
- picture naming (1)
- pitch (1)
- pitch contour matching (1)
- place names (1)
- plurilingualism (1)
- poetic diction (1)
- poetic language (1)
- poetic structure (1)
- poetry (1)
- poetry comprehension (1)
- pointing gesture (1)
- polar question (1)
- polarity sensitive items (1)
- polarity shifter (1)
- policy analysis (1)
- policy preference (1)
- policy transfer (1)
- political debate (1)
- political discourse (1)
- political relations (1)
- political text analysis (1)
- political video interviews (1)
- political views (1)
- politics (1)
- politische Willensbildung (1)
- pop lyrics (1)
- popular knowledge (1)
- positionally-sensitive grammar (1)
- positioning analysis (1)
- positioning of self and other (1)
- possessives (1)
- post-soviet states (1)
- post-war history (1)
- postcolonialism (1)
- postlexical processes (1)
- posture verb (1)
- posture verbs (1)
- practical contexts (1)
- practical reasoning (1)
- pragmatic focus (1)
- praxeological context (1)
- pre-school choice (1)
- predication (1)
- predicative adjectives (1)
- prediction error (1)
- predictive approach (1)
- prefabs (1)
- preface (1)
- prejudice and discrimination (1)
- preposition (1)
- preposition-noun combinations (1)
- preposition-pronoun contraction (PPC) (1)
- prepositional clause (1)
- prepositional object clauses (1)
- prepositional object construction (1)
- prescriptive (1)
- present (1)
- presentation (1)
- presidential debate (1)
- pretend play frame (1)
- preterite (1)
- prevalence (1)
- primary research data repository (1)
- print lexicography (1)
- prior talk (1)
- privative adjectives comprehension (1)
- probabilistic approach (1)
- processing fluency (1)
- processing load (1)
- processing pipeline (1)
- product feature extraction (1)
- productivity (1)
- productivity measures (1)
- progressive (1)
- progressive aspect (1)
- prohibitive (1)
- prohibitive markers (1)
- project report (1)
- projective mechanism (1)
- promotion of junior researchers (1)
- pronominal agreement (1)
- pronouns (1)
- pronunciation (1)
- proof checking (1)
- proportional font (1)
- proposing (1)
- propositional argument (1)
- prosodic constituency (1)
- prosodic form (1)
- prosodic organization (1)
- prosodic word (pword) (1)
- prospective possession (1)
- proverb (1)
- pseudonymisation (1)
- psychoanalysis (1)
- psychodiagnostic interview (1)
- psycholinguistics (1)
- public discourse (1)
- public mediation (1)
- public/ political discourse (1)
- publishing model (1)
- quality (1)
- quality checking (1)
- quality evaluation (1)
- quantitative analysis (1)
- quantitative and qualitative methods (1)
- quantitative linguistics (1)
- quantitative quality metrics (1)
- quantitative typology (1)
- query building (1)
- query language (1)
- query languages (1)
- question (1)
- question under discussion (1)
- question-word questions (1)
- questioning sequences (1)
- questionnaire (1)
- raising (1)
- random forests (1)
- rape myth acceptance (1)
- rapid serial visual presentation (1)
- rating scales (1)
- reading speed (1)
- reading strategies (1)
- reading strategy (1)
- reading time (1)
- realia (1)
- reanalysis (1)
- reciprocity (1)
- recollection (1)
- recommendation system (1)
- recommender (1)
- recording (1)
- recruitment (1)
- recursos (1)
- redress (1)
- reduplication construction (1)
- reference corpus (1)
- reference dictionary (1)
- reference resolution (1)
- reference tools (1)
- referencing strategies (1)
- referendum (1)
- referentiality (1)
- reflexivity (1)
- regional languages (1)
- regional phonetic variation (1)
- regional variation (1)
- register (1)
- regressions (1)
- rehearsals (1)
- relaciones de respuesta (1)
- relation (1)
- relation registry (1)
- relational database (1)
- relationship satisfaction (1)
- reliability (1)
- reminders (1)
- repair sequences (1)
- repair-initiation (1)
- repatriation (1)
- repositories (1)
- repository (1)
- representación semántica superficial (1)
- request sequences (1)
- requesting examples (1)
- research infrastructures (1)
- research literature (1)
- research methods (1)
- research overview (1)
- research report (1)
- research reports (1)
- research tools (1)
- resources (1)
- respondent (1)
- response latency (1)
- retro-digitization (1)
- retro-digitized dictionaries (1)
- retro-gedigitaliseerde woordeboeke (1)
- reusability of research data (1)
- revision (1)
- revitalization of endangered languages (1)
- rhetoric (1)
- rhetorical device (1)
- rhetorical structure (1)
- right-dislocation (1)
- role decomposition (1)
- role prototypicality (1)
- romantic relationship (1)
- routines (1)
- rule enforcement (1)
- rule formulations (1)
- saami languages (1)
- sanctioning (1)
- sans-serif (1)
- scalar rhetoric (1)
- schema.org (1)
- schematicity (1)
- school choice (1)
- schwa (1)
- scientific communication (1)
- screen-based interaction (1)
- search (1)
- search engine (1)
- search strategies (1)
- search systems (1)
- search technology (1)
- second pair part (1)
- second position (1)
- selection of textual sources (1)
- self (1)
- self-paced reading (1)
- self-reflection (1)
- self-regulated learning (1)
- semantic change (1)
- semantic classification (1)
- semantic extension (1)
- semantic frames (1)
- semantic information management (1)
- semantic interoperability (1)
- semantic map (1)
- semantic network (1)
- semantic predictability (1)
- semantic presence/absence (1)
- semantic processing (1)
- semantic relatedness (1)
- semantic reversal anomalies (1)
- semantic role (1)
- semantische Analyse (1)
- semiotic mediation (1)
- semiotic resource (1)
- semiotics (1)
- sentence boundary detection (1)
- sentiment (1)
- sentiment polarity (1)
- separation of adjectives (1)
- sequence (1)
- sequence of tense (1)
- sequential analysis (1)
- sequential organization (1)
- service integration (1)
- service interoperability (1)
- service provider (1)
- sexual harassment (1)
- shallow semantic representation (1)
- shared courses of action (1)
- shared gameplay (1)
- shared meaning (1)
- shared task (1)
- sharing data (1)
- sign language resources (1)
- signs (1)
- silences (1)
- simplification (1)
- single player games (1)
- single word borrowings (1)
- sintaksis (1)
- situational involvement (1)
- skills training (1)
- small clause (1)
- smile (1)
- social action format (1)
- social categorization (1)
- social cognition (1)
- social coordination (1)
- social grammar (1)
- social identity theory (1)
- social integration (1)
- social judgment (1)
- social media (1)
- social media interaction (1)
- social media neologisms (1)
- social media storytelling (1)
- social perception (1)
- social relevance (1)
- social roles (1)
- social rules (1)
- social sanctioning (1)
- social topography (1)
- societal inclusion (1)
- societal multilingualism (1)
- socio-spatial positioning (1)
- sociocultural situatedness (1)
- sociolinguistic ethnolinguistic variation (1)
- sociolinguistics (1)
- soft governance (1)
- software tools (1)
- solution-oriented questions (1)
- sostenibilidad (1)
- soveltava kielentutkimus (1)
- soveltava kielitiede (1)
- sowieso <Lemma> (1)
- soziale Interaktion (1)
- space-delimited languages (1)
- speaker variability (1)
- speakership (1)
- speaking machine (1)
- specialised languages (1)
- specialist corpora (1)
- specialized dictionary (1)
- specialized knowledge (1)
- specialized language (1)
- specificational copular clauses (1)
- spectating (1)
- speech act verb (1)
- speech communities (1)
- speech content grouping (1)
- speech corpora (1)
- speech data (1)
- speech database (1)
- speech segmentation (1)
- speech signal processing (1)
- speech technology (1)
- speech thought writing representation (1)
- speed-curvature relation (1)
- spelling reform (1)
- spoken (colloquial) standard (1)
- spoken Arabic (1)
- spoken German in interaction (1)
- spoken corpora (1)
- spoken language transcripts (1)
- spoken syntax (1)
- spoken vs. written (1)
- stance (1)
- stance management (1)
- standardisation (1)
- standardology (1)
- standards for LRs (1)
- standoff annotation (1)
- state change (1)
- statistical complexity (1)
- statistical significance (1)
- status (1)
- stereotype content model (1)
- strategic reading (1)
- strategy (1)
- strategy ascription (1)
- structural information (1)
- sub-grammar extraction (1)
- subextraction (1)
- subject island (1)
- subject-to-object-raising (1)
- subjectification (1)
- subjective comprehensibility (1)
- subtraction (1)
- subtraction neglect (1)
- survey design (1)
- suspension (1)
- sustainability (1)
- sustainable archives (1)
- swing vote (1)
- syllable (1)
- syllable duration (1)
- symbolic prosody prediction (1)
- synonymity (1)
- synonymy (1)
- syntactic competence (1)
- syntactic extensions (1)
- syntactic processing (1)
- syntactical level (1)
- syntactico-semantic argument structure (1)
- syntax-semantics interface (1)
- systemisation (1)
- task-evoked pupillary responses (1)
- technical neologisms (1)
- technologieunterstütztes Lernen (1)
- technology use (1)
- technology watch (1)
- teksproduksie (1)
- teksresepsie (1)
- tele-presence (1)
- telephone interpreting (1)
- telicity (1)
- temporal organization (1)
- temporal phraseological units (1)
- temporality (1)
- tentative taxonomy (1)
- term (1)
- term base exchange format (1)
- terminography (1)
- terminological neologism (1)
- terminological structurer (1)
- terminology visualisation (1)
- test (1)
- text (1)
- text analysis (1)
- text analytics (1)
- text categorization (1)
- text complexity (1)
- text parsing (1)
- text reception (1)
- text-to-speech (1)
- thanking (1)
- that (1)
- theater rehearsals (1)
- theory and practice (1)
- therapeutic alliance (1)
- there (1)
- third position (1)
- third-position repair (1)
- time reckoning (1)
- time windows and constants (1)
- time-series analysis (1)
- timing of turn-taking (1)
- tipologie (1)
- toegangstruktuur (1)
- top-down (1)
- topic drop (1)
- topic management (1)
- topic models (1)
- topic shift (1)
- topical event (1)
- topicalization (1)
- topologisches Feldermodell (1)
- tourism (1)
- traduction de prêt (1)
- traffic (1)
- training software (1)
- transcripción (1)
- translation exercises (1)
- translation studies (1)
- translation tools (1)
- translators (1)
- transmission problem (1)
- travel guides (1)
- treebank (1)
- trends (1)
- trosanalise (1)
- trouble sources (1)
- turn competition (1)
- turn design (1)
- turn-design (1)
- turn-final particles (1)
- tutkimusaineistot (1)
- tutkimusmenetelmät (1)
- type frequency (1)
- uncertainty (1)
- uncertainty avoidance (1)
- under-resourced language (1)
- under-resourced language varieties (1)
- underspecification (1)
- understanding in interaction (1)
- uniform information density (1)
- unregistered words (1)
- unrestricted dialog (1)
- urban youth language (1)
- usability (1)
- usability study (1)
- usage labels (1)
- use cases (1)
- user behavior (1)
- user communities (1)
- user interface design (1)
- user preference (1)
- user research (1)
- user satisfication (1)
- user studies (1)
- user support (1)
- user survey (1)
- user-centred design (1)
- user-generated content (1)
- utterance interpretation (1)
- valency changes (1)
- variable analysis (1)
- variasie (1)
- variation management (1)
- varieties (1)
- vehicular language (1)
- verb valency (1)
- verb-argument linking (1)
- verbale Interaktion (1)
- verbsemantik (1)
- vernacular lexicography (1)
- verwantskapsterminologie (1)
- very large corpora (1)
- video (1)
- video-mediated interactions (1)
- videogames (1)
- violation (1)
- virtual corpus (1)
- virtual embodiment (1)
- virtual worlds (1)
- visibility of ritual meaning (1)
- visual world paradigm (1)
- visualisering (1)
- visually impaired children (1)
- vocabulary (1)
- vocabulary growth (1)
- vocabulary of quotation expressions (1)
- vocabulary organization in dictionaries (1)
- voice (1)
- voice messages (1)
- volition (1)
- vowels (1)
- wabi sabi (1)
- warmth (1)
- ways of spectating (1)
- weakeniing (1)
- web application (1)
- web crawling (1)
- web data (1)
- web service (1)
- web-based information system (1)
- web-based platform (1)
- websites (1)
- wh-movement (1)
- widget (1)
- widget store (1)
- wir (1)
- wisdom of the crowd (1)
- wollen (1)
- women (1)
- woordeboeke as sosiale werktuie (1)
- woordeboekontwerp (1)
- word (1)
- word embedding (1)
- word family database (1)
- word formation in German (1)
- word frequency distribution (1)
- word history (1)
- word meaning relationship (1)
- word recognition (1)
- word segmentation (1)
- word selection (1)
- word sense alignment (1)
- word trees (1)
- word-level alignment (1)
- word-sense disambiguation (1)
- worship (1)
- writing (1)
- writing support tool (1)
- youth (1)
- zipf (1)
- zipf-mandelbrot (1)
- Ägyptisch (1)
- Ähnlichkeitssuche (1)
- Öffentliche Meinung (1)
- Öffentlichkeit (1)
- überhaupt <Lemma> (1)
- żeby (1)
- комунікативна компетентність (1)
- комунікативна девіація (невдача) (1)
- комунікативні девіації (1)
- міжкультурна комунікація (1)
- німецька мова (1)
- німецька мова як іноземна (1)
- політичне відеоінтерв’ю (1)
- респондент (1)
- українська мова як іноземна (1)
- українськамова (1)
Publicationstate
- Veröffentlichungsversion (961)
- Zweitveröffentlichung (248)
- Postprint (236)
- Ahead of Print (6)
- Preprint (5)
- Erstveröffentlichung (2)
Reviewstate
- Peer-Review (828)
- (Verlags)-Lektorat (410)
- Peer-review (24)
- Qualifikationsarbeit (Dissertation, Habilitationsschrift) (18)
- Verlags-Lektorat (14)
- Peer-Revied (8)
- Review-Status-unbekannt (6)
- Abschlussarbeit (Bachelor, Master, Diplom, Magister) (Bachelor, Master, Diss.) (3)
- (Verlags-)Lektorat (2)
- Peer review (2)
Publisher
- de Gruyter (104)
- Benjamins (87)
- IDS-Verlag (81)
- Springer (63)
- European Language Resources Association (ELRA) (56)
- Association for Computational Linguistics (46)
- European Language Resources Association (42)
- Oxford University Press (35)
- Elsevier (33)
- Institut für Deutsche Sprache (33)
Speakers’ dialogical orientation to the particular others they talk to is implemented by practices of recipient-design. One such practice is the use of negation as a means to constrain interpretations of speaker’s actions by the partner. The paper situates this use of negation within the larger context of other recipient-designed uses of negation which negate assumptions the speaker makes about what the addressee holds to be true (second-order assumptions) or what the addressee assumes the speaker holds to be true (third- order assumptions). The focus of the study is on the ways in which speakers use negation to disclaim interpretations of their turns which partners have displayed or may possibly arrive at. Special emphasis is given to the positionally sensitive uses of negation, which may occur before, after or inserted between the nucleus actions whose interpretation is constrained by the negation. Interactional motivations and rhetorical potentials of the practice are pointed out, partly depending on the position of the negation vis-à-vis the nucleus action. The analysis shows that the concept of ‘recipient design’ is in need of distinctions which have not been in focus in prior research.
"Standard language" is a contested concept, ideologically, empirically and theoretically. This is particularly true for a language such as German, where the standardization of the spoken language was based on the written standard and was established with respect to a communicative situation, i.e. public speech on stage (Bühnenaussprache), which most speakers never come across. As a consequence, the norms of the oral standard exhibit many features which are infrequent in the everyday speech even of educated speakers. This paper discusses ways to arrive at a more realistic conception of (spoken) standard German, which will be termed "standard usage". It must be founded on empirical observations of speakers linguistic choices in everyday situations. Arguments in favor of a corpus-based notion of standard have to consider sociolinguistic, political, and didactic concerns. We report on the design of a large study of linguistic variation conducted at the Institute for the German Language (project "Variation in Spoken German", Variation des gesprochenen Deutsch) with the aim of arriving at a representative picture of "standard usage" in contemporary German. It systematically takes into account both diatopic variation covering the multi-national space in which German an official language, and diastratic variation in terms of varying degrees of formality. Results of the study of phonetic and morphosyntactic variation are discussed. At least for German, a corpus-based notion of "standard usage" inevitably includes some degree of pluralism concerning areal variation, and it needs to do justice to register-based variation as well.
With the advent of mobile devices, mediatized political discourse became more dynamic. I assume that the microblog Twitter can be considered as a medium for spatial coordination during protests. Therefore, the case of neo-Nazi demonstrations and counter-protests in the city of Dresden that occurred in February 2012 is analysed. Data consists of microposts that occurred during the event. Quantitative analysis of hashtag and retweet frequencies was performed as well as qualitative speech act pattern analysis and a tempo-spatial discourse analysis on selected subsets of microposts. Results show that a common linguistic practice is verbal georeferencing and by that constructing space. Empirical analysis indicates a strong relation between communicational online space and physical offline place: Protest participants permanently reconfigure spatial context discursively and thus the contested protest area becomes a temporarily meaningful place.
"What makes this so complicated?" On the value of disorienting dilemmas in language instruction
(2017)
This thesis is a corpus linguistic investigation of the language used by young German speakers online, examining lexical, morphological, orthographic, and syntactic features and changes in language use over time. The study analyses the language in the Nottinghamer Korpus deutscher YouTube‐Sprache ("Nottingham corpus of German YouTube language", or NottDeuYTSch corpus), one of the first large corpora of German‐language comments taken from the videosharing website YouTube, and built specifically for this project. The metadatarich corpus comprises c.33 million tokens from more than 3 million comments posted underneath videos uploaded by mainstream German‐language youthorientated YouTube channels from 2008‐2018.
The NottDeuYTSch corpus was created to enable corpus linguistic approaches to studying digital German youth language (Jugendsprache), having identified the need for more specialised web corpora (see Barbaresi 2019). The methodology for compiling the corpus is described in detail in the thesis to facilitate future construction of web corpora. The thesis is situated at the intersection of Computer‐Mediated Communication (CMC) and youth language, which have been important areas of sociolinguistic scholarship since the 1980s, and explores what we can learn from a corpus‐driven, longitudinal approach to (online) youth language. To do so, the thesis uses corpus linguistic methods to analyse three main areas:
1. Lexical trends and the morphology of polysemous lexical items. For this purpose, the analysis focuses on geil, one of the most iconic and productive words in youth language, and presents a longitudinal analysis, demonstrating that usage of geil has decreased, and identifies lexical items that have emerged as potential replacements. Additionally, geil is used to analyse innovative morphological productiveness, demonstrating how different senses of geil are used as a base lexeme or affixoid in compounding and derivation.
2. Syntactic developments. The novel grammaticalization of several subordinating conjunctions into both coordinating conjunctions and discourse markers is examined. The investigation is supported by statistical analyses that demonstrate an increase in the use of non‐standard syntax over the timeframe of the corpus and compares the results with other corpora of written language.
3. Orthography and the metacommunicative features of digital writing. This analysis identifies orthographic features and strategies in the corpus, e.g. the repetition of certain emoji, and develops a holistic framework to study metacommunicative functions, such as the communication of illocutionary force, information structure, or the expression of identities. The framework unifies previous research that had focused on individual features, integrating a wide range of metacommunicative strategies within a single, robust system of analysis.
By using qualitative and computational analytical frameworks within corpus linguistic methods, the thesis identifies emergent linguistic features in digital youth language in German and sheds further light on lexical and morphosyntactic changes and trends in the language of young people over the period 2008‐2018. The study has also further developed and augmented existing analytical frameworks to widen the scope of their application to orthographic features associated with digital writing.
Positioning analysis, a variant of discourse analysis, was used to explore the narratives of 40 psychiatric patients (11 females and 29 males; mean age = 40 years) who had manifest difficulties with engagement with statutory mental health services. Positioning analysis is a qualitative method that captures how people linguistically position the roles and identities of themselves and others in their day-to-day lives and narratives. The language of disengagement incorporated the passive positioning of self in relation to their lives and treatment through the use of metaphor, the passive voice and them and us attribution, while the discourse of engagement incorporated more active positioning of self, achieved through the use of the personal pronoun we and metaphoric references to balanced relationships. The findings corroborate previous thematic analysis that highlighted the importance of identity and agency in the ‘making or breaking’ of therapeutic relationships (Priebe et al. 2005). Implications are discussed in relation to how positioning analysis may help signal and emphasize important life and therapeutic experiences in spoken narratives as well as clinical consultations.
The purpose of this paper is to describe the functions of ‘where’-based relative elements' in six Balkan languages, paying particular attention to non-standard varieties.2 Relative elements based on an originally interrogative pronoun meaning ‘where’ are attested in all Balkan languages and, more generally, in all European languages. In accordance with the locative meaning of the original pronoun, ‘where’-based relative elements are primarily used to relativize locatives. However, it will be shown that in some Balkan languages, and especially in non-standard varieties, these elements have extended their functional domain. This process does not appear to be random, but rather to pattern with the following hierarchy: locative > unspecific connector > other syntactic positions (indirect/direct object, subject).3 Additionally, ‘where’-based relative elements will be compared with ‘what’-based ones in order to highlight common patterns of development.
The present investigation targets the phenomenon commonly called control. Many languages including German and Polish employ non-finite clauses (besides finite clauses) as propositional complements. The subject of these complement clauses is left unexpressed and must generally be interpreted co-referentially with the subject or object of the matrix clause (subject or object control). However. there are also infinitive-selecting verbs that do not allow for a co- referential interpretation of the embedded subject - semantically, the embedded infinitives of these anti-control verbs are thus less dependent on or less unifiable with the matrix proposition. In Polish anti-control constructions, non-finite complements are overtly marked with the complementizer zeby, suggesting that they are structurally more complex (namely. containing a C-projection) than the non-finite complements in control constructions lacking zeby (modulo special contexts. viz. 'control switch'). In a comparative perspective, the paper brings corpuslinguistic and experimental evidence to bear on the question whether surface appearances notwithstanding, the infinitival complements of anti-control verbs in German should similarly be analyzed as truly sentential, i.e., C-headed structures.
The paper reports the results of the curation project ChatCorpus2CLARIN. The goal of the project was to develop a workflow and resources for the integration of an existing chat corpus into the CLARIN-D research infrastructure for language resources and tools in the Humanities and the Social Sciences (http://clarin-d.de). The paper presents an overview of the resources and practices developed in the project, describes the added value of the resource after its integration and discusses, as an outlook, to what extent these practices can be considered best practices which may be useful for the annotation and representation of other CMC and social media corpora.
This study explores how ‘gatherings’ turn into ‘encounters’ in a virtual world (VW) context. Most communication technologies enable only focused encounters between distributed participants, but in VWs both gatherings and encounters can occur. We present close sequential analysis of moments when after a silent gathering, interaction among participants in a VW is gradually resumed, and also investigate the social actions in the verbal (re-)opening turns. Our findings show that like in face-to-face situations, also in VWs participants often use different types of embodied resources to achieve the transition, rather than rely on verbal means only. However, the transition process in VWs has distinctive characteristics compared to the one in face-to-face situations. We discuss how participants in a VW use virtually embodied pre-beginnings to display what we call encounter-readiness, instead of displaying lack of presence by avatar stillness. The data comprise 40 episodes of video-recorded team interactions in a VW.
This conference booklet provides information about 10th International Contrastive Linguistics Conference (ICLC-10) that took place in Mannheim, Germany, from 18 to 21 July 2023. It contains
– a description of the conference aims,
– details on the conference venue,
– information on committees,
– the conference program,
– the abstracts of the keynotes, oral and poster presentations, and
– an author index.
This paper focusss on the first Slavonic-Romanian lexicons, compiled in the second half of the 17th century and their use(rs), proposing a method of investigating the manner in which lexical information available in the above corpus relates, if at all, to the vocabulary of texts from the same period. We chose to investigate their relation to an anonymous Old Testament translation made from Church Slavonic, also from the second half of the 17th century, which was supposed to be produced in the same geographical area, in the same Church Slavonic school or even by the same author as the lexicons. After applying a lemmatizer on both the Biblical text (Books of Genesis and Daniel) and the Romanian material from the lexicons, we analyse the results and double the statistical analysis with a series of case studies, focusing on some common lexemes that might be an indicator of the relatedness of the texts. Even if the analysis points out that the lexicons might not have been compiled as a tool for the translation of religious texts, it proves to be a useful method that reveals interesting data and provides the basis for more extensive approaches.
This paper presents the application of the <tiger2/> format to various linguistic scenarios with the aim of making it the standard serialisation for the ISO 24615 [1] (SynAF) standard. After outlining the main characteristics of both the SynAF metamodel and the <tiger2/> format, as extended from the initial Tiger XML format [2], we show through a range of different language families how <tiger2/> covers a variety of constituency and dependency based analyses.
A "polyglottal" speech synthesis - modifications for a replica of Kempelen's speaking machine
(2019)
This introductory tutorial describes a strictly corpus-driven approach for uncovering indications for aspects of use of lexical items. These aspects include ‘(lexical) meaning’ in a very broad sense and involve different dimensions, they are established in and emerge from respective discourses. Using data-driven mathematical-statistical methods with minimal (linguistic) premises, a word’s usage spectrum is summarized as a collocation profile. Self-organizing methods are applied to visualize the complex similarity structure spanned by these profiles. These visualizations point to the typical aspects of a word’s use, and to the common and distinctive aspects of any two words.
In the management of cooperation, the fit of a requested action with what the addressee is presently doing is a pervasively relevant consideration. We present evidence that imperative turns are adapted to, and reflexively create, contexts in which the other person is committed to the course of action advanced by the imperative. This evidence comes from systematic variation in the design of imperative turns, relative to the fittedness of the imperatively mandated action to the addressee’s ongoing trajectory of actions, what we call the “dine of commitment”. We present four points on this dine: Responsive imperatives perform an operation on the deontic dimension of what the addressee has announced or already begun to do (in particular its permissibility); local-project imperatives formulate a new action advancing a course of action in which the addressee is already actively engaged; global-project-imperatives target a next task for which the addressee is available on the grounds of their participation in the overall event, and in the absence of any competing work; and competitive imperatives draw on a presently otherwise engaged addressee on the grounds of their social commitment to the relevant course of actions. These four turn shapes are increasingly complex, reflecting the interactional work required to bridge the increasing distance between what the addressee is currently doing, and what the imperative mandates. We present data from German and Polish informal and institutional settings.
This manual introduces a conversation analytically informed coding scheme for episodes involving the direct social sanctioning of problem behavior in informal social interaction which was developed in the project Norms, Rules, and Morality across Languages (NoRM-aL) at the Leibniz-Institute for the German Language. It outlines the background for its development, delimits the phenomena to which the coding scheme can be applied and provides instructions for its use.
The scheme asks for basic information about the recording and the participants involved in the episode, before taking stock of different features of the sanctioning episode as a whole. This is followed by sets of specific coding questions about the sanctioning move itself (such as its timing and composition) and the reaction it engenders. The coding enables researchers to get a bird’s eye view on recurrent features of such episodes in larger quantities of data and allows for comparisons across different languages and informal settings.
To build a comparable Wikipedia corpus of German, French, Italian, Norwegian, Polish and Hungarian for contrastive grammar research, we used a set of XSLT stylesheets to transform the mediawiki anntations to XML. Furthermore, the data has been amnntated with word class information using different taggers. The outcome is a corpus with rich meta data and linguistic annotation that can be used for multilingual research in various linguistic topics.
A comparison between morphological complexity measures: typological data vs. language corpora
(2016)
Language complexity is an intriguing phenomenon argued to play an important role in both language learning and processing. The need to compare languages with regard to their complexity resulted in a multitude of approaches and methods, ranging from accounts targeting specific structural features to global quantification of variation more generally. In this paper, we investigate the degree to which morphological complexity measures are mutually correlated in a sample of more than 500 languages of 101 language families. We use human expert judgements from the World Atlas of Language Structures (WALS), and compare them to four quantitative measures automatically calculated from language corpora. These consist of three previously defined corpus-derived measures, which are all monolingual, and one new measure based on automatic word-alignment across pairs of languages. We find strong correlations between all the measures, illustrating that both expert judgements and automated approaches converge to similar complexity ratings, and can be used interchangeably.
Authors like Fillmore 1986 and Goldberg 2006 have made a strong case for regarding argument omission in English as a lexical and construction-based affordance rather than one based on general semantico-pragmatic constraints. They do not, however, address the question of how grammatical restrictions on null complementation might interact with broader narrative conventions, in particular those of genre. In this paper, we attempt to remedy this oversight by presenting a comprehensive overview of genre-based argument omissions and offering a construction-based analysis of genre-based omission conventions. We consider five genre-based omission types: instructional imperatives (Culy 1996, Bender 1999), labelese, diary style (Haegeman 1990), match reports (Ruppenhofer 2004) and quotative clauses. We show that these omission types share important traits; all, for example, have anaphoric rather than indefinite construals. We also show, however, that the omission types differ from each other in idiosyncratic ways. We then address several interrelated representational problems posed by the grammatical treatment of genre-based omissions. For example, the constructions that represent genre-based omission conventions must interact with the lexical entries of verbs, many of which do not generally permit omitted arguments. Accordingly, we offer constructional analyses of genre-based omissions that allow constructions to override lexical valence constraints.
Song lyrics can be considered as a text genre that has features of both written and spoken discourse, and potentially provides extensive linguistic and cultural information to scientists from various disciplines. However, pop songs play a rather subordinate role in empirical language research so far - most likely due to the absence of scientifically valid and sustainable resources. The present paper introduces a multiply annotated corpus of German lyrics as a publicly available basis for multidisciplinary research. The resource contains three types of data for the investigation and evaluation of quite distinct phenomena: TEI-compliant song lyrics as primary data, linguistically and literary motivated annotations, and extralinguistic metadata. It promotes empirically/statistically grounded analyses of genre-specific features, systemic-structural correlations and tendencies in the texts of contemporary pop music. The corpus has been stratified into thematic and author-specific archives; the paper presents some basic descriptive statistics, as well as the public online frontend with its built-in evaluation forms and live visualisations.
This report presents a corpus of articulations recorded with Schlieren photography, a recording technique to visualize aeroflow dynamics for two purposes. First, as a means to investigate aerodynamic processes during speech production without any obstruction of the lips and the nose. Second, to provide material for lecturers of phonetics to illustrates these aerodynamic processes. Speech production was recorded with 10 kHz frame rate for statistical video analyses. Downsampled videos (500 Hz) were uplodad to a youtube channel for illustrative purposes. Preliminary analyses demonstrate potential in applying Schlieren photography in research.
In this paper, we will present a first attempt to classify commonly confused words in German by consulting their communicative functions in corpora. Although the use of so-called paronyms causes frequent uncertainties due to similarities in spelling, sound and semantics, up until now the phenomenon has attracted little attention either from the perspective of corpus linguistics or from cognitive linguistics. Existing investigations rely on structuralist models, which do not account for empirical evidence. Still, they have developed an elaborate model based on formal criteria, primarily on word formation (cf. Lăzărescu 1999). Looking from a corpus perspective, such classifications are incompatible with language in use and cognitive elements of misuse.
This article sketches first lexicological insights into a classification model as derived from semantic analyses of written communication. Firstly, a brief description of the project will be provided. Secondly, corpus-assisted paronym detection will be focused. Thirdly, in the main section the paper concerns the description of the datasets for paronym classification and the classification procedures. As a work in progress, new insights will continually be extended once spoken and CMC data are added to the investigations.
This paper investigates the conditions that govern the choice between the German neuter singular relative pronouns das ‘that’ and was ‘what’. We show that das requires a lexical head noun, while in all other cases was is usually the preferred option; therefore, the distribution of das and was is most successfully captured by an approach that does not treat was as an exception but analyzes it as the elsewhere case that applies when the relativizer fails to pick up a lexical gender feature from the head noun. We furthermore show how the non-uniform behavior of different types of nominalized adjectives (positives allow both options, while superlatives trigger was) can be attributed to semantic differences rooted in syntactic structure. In particular, we argue that superlatives select was due to the presence of a silent counterpart of the quantifier alles ‘all’ that is part of the superlative structure.
This paper presents a short insight into a new project at the "Institute for the German Language” (IDS) (Mannheim). It gives an insight into some basic ideas for a corpus-based dictionary of spoken German, which will be developed and compiled by the new project "The Lexicon of spoken German” (Lexik des gesprochenen Deutsch, LeGeDe). The work is based on the "Research and Teaching Corpus of Spoken German” (Forschungs- und Lehrkorpus Gesprochenes Deutsch, FOLK), which is implemented in the "Database for Spoken German” (Datenbank für Gesprochenes Deutsch, DGD). Both resources, the database and the corpus, have been developed at the IDS.
This paper presents the prototype of a lexicographic resource for spoken German in interaction, which was conceived within the framework of the LeGeDe-project (LeGeDe=Lexik des gesprochenen Deutsch). First of all, it summarizes the theoretical and methodological approaches that were used for the initial planning of the resource. The headword candidates were selected by analyzing corpus-based data. Therefore, the data of two corpora (written and spoken German) were compared with quantitative methods. The information that was gathered on the selected headword candidates can be assigned to two different sections: meanings and functions in interaction.
Additionally, two studies on the expectations of future users towards the resource were carried out. The results of these two studies were also taken into account in the development of the prototype. Focusing on the presentation of the resource’s content, the paper shows both the different lexicographical information in selected dictionary entries, and the information offered by the provided hyperlinks and external texts. As a conclusion, it summarizes the most important innovative aspects that were specifically developed for the implementation of such a resource.
Ph@ttSessionz and Deutsch heute are two large German speech databases. They were created for different purposes: Ph@ttSessionz to test Internet-based recordings and to adapt speech recognizers to the voices of adolescent speakers, Deutsch heute to document regional variation of German. The databases differ in their recording technique, the selection of recording locations and speakers, elicitation mode, and data processing.
In this paper, we outline how the recordings were performed, how the data was processed and annotated, and how the two databases were imported into a single relational database system. We present acoustical measurements on the digit items of both databases. Our results confirm that the elicitation technique affects the speech produced, that f0 is quite comparable despite different recording procedures, and that large speech technology databases with suitable metadata may well be used for the analysis of regional variation of speech.
There have been several attempts to annotate communicative functions to utterances of verbal feedback in English previously. Here, we suggest an annotation scheme for verbal and non-verbal feedback utterances in French including the categories base, attitude, previous and visual. The data comprises conversations, maptasks and negotiations from which we extracted ca. 13,000 candidate feedback utterances and gestures. 12 students were recruited for the annotation campaign of ca. 9,500 instances. Each instance was annotated by between 2 and 7 raters. The evaluation of the annotation agreement resulted in an average best-pair kappa of 0.6. While the base category with the values acknowledgement, evaluation, answer, elicit and other achieves good agreement, this is not the case for the other main categories. The data sets, which also include automatic extractions of lexical, positional and acoustic features, are freely available and will further be used for machine learning classification experiments to analyse the form-function relationship of feedback.
In this paper, an exploratory data-driven method is presented that extracts word-types from diachronic corpora that have undergone the most pronounced change in frequency of occurrence in a given period of time. Combined with statistical methods from time series analysis, the method is able to find meaningful patterns and relationships in diachronic corpora, an idea that is still uncommon in linguistics. This indicates that the approach can facilitate an improved understanding of diachronic processes.
The main objective of this article is to describe the current activities at the Mannheim Institute for German Language regarding the implementation of a domain-specific ontology for German grammar. We differentiate ontology bases from ontology management Systems, point out the benefits of database-driven Solutions, and go Step by Step through all phases of the ontology lifecycle. In Order to demonstrate the practical use of our approach, we outline the interface between our ontology and the grammis web Information System, and compare the ontology-based retrieval mechanism with traditional full text search.
We present a descriptive analysis on the two datasets from the shared task on Source, Subjective Expression and Target Extraction from Political Speeches (STEPS), the only existing German dataset for opinion role extraction of its size. Our analysis discusses the individual properties of the three components, subjective expressions, sources and targets and their relations towards each other. Our observations should help practitioners and researchers when building a system to extract opinion roles from German data.
The present paper reports the first results of the compilation and annotation of a blog corpus for German. The main aim of the project is the representation of the blog discourse structure and relations between its elements (blog posts, comments) and participants (bloggers, commentators). The data included in the corpus were manually collected from the scientific blog portal SciLogs. The feature catalogue for the corpus annotation includes three types of information which is directly or indirectly provided in the blog or can be construed by means of statistical analysis or computational tools. At this point, only directly available information (e.g. title of the blog post, name of the blogger etc.) has been annotated. We believe, our blog corpus can be of interest for the general study of blog structure or related research questions as well as for the development of NLP methods and techniques (e.g. for authorship detection).
Most cultures have metaphors for time that involve movement, for example, ‘time passes’. Although time is objectively measured, it is subjectively understood, as we can perceive time as stationary, whereby we move towards future events, or we can perceive ourselves as stationary, with time moving past us and events moving towards us. This paper reports a series of studies that first examines whether people think about time in a metaphor-consistent manner (Study 1) and then explores the relationship between ‘time perspective’, level of perceived personal agency, and time representations (Study 2), the relationship between emotional experiences and time representation (Study 3), and whether this relationship is bidirectional by manipulating either emotional experiences (Study 4) or time representation (Study 5). Results provide bidirectional evidence for an ego-moving representation of time, with happiness eliciting more agentic control, and evidence for a time-moving passivity associated with emotional experiences of anxiety and depression. This bidirectional relationship suggests that our representation of time is malleable, and therefore, current emotional experiences may change through modification of time representations.
We present an implemented XML data model and a new, simplified query language for multi-level annotated corpora. The new query language involves automatic conversion of queries into the underlying, more complicated MMAXQL query language. It supports queries for sequential and hierarchical, but also associative (e.g. coreferential) relations. The simplified query language has been designed with non-expert users in mind.
This paper discusses a specific subclass of English it-clefts posited in the theoretical literature, so-called predicational clefts. The main point of the paper is to show that there is no need to postulate such a separate class. Predicational clefts look special because of the narrow focus on the adjective within an indefinite pivot, but their special properties can all be derived from this narrow focus in a focus analysis in which it-clefts express contrasting focus. Contrasting focus means that besides the assertion of the proposition expressed in the cleft, there is one contrasting proposition which is excluded. The focus on the adjective in apparent predicational clefts gives rise to a narrow set of relevant alternatives, all of which differ only in the adjectival property within the pivot. The analysis developed here can account for many of the observations for apparent predicational clefts. Other properties are shown to be not conclusive. Thus, predicational clefts need not be considered a special subclass beyond their special focus characteristics.
A key difference between traditional humanities research and the emerging field of digital humanities is that the latter aims to complement qualitative methods with quantitative data. In linguistics, this means the use of large corpora of text, which are usually annotated automatically using natural language processing tools. However, these tools do not exist for historical texts, so scholars have to work with unannotated data. We have developed a system for systematic iterative exploration and annotation of historical text corpora, which relies on an XML database (BaseX) and in particular on the Full Text and Update facilities of XQuery.
In this paper, a method for measuring synchronic corpus (dis-)similarity put forward by Kilgarriff (2001) is adapted and extended to identify trends and correlated changes in diachronic text data, using the Corpus of Historical American English (Davies 2010a) and the Google Ngram Corpora (Michel et al. 2010a). This paper shows that this fully data-driven method, which extracts word types that have undergone the most pronounced change in frequency in a given period of time, is computationally very cheap and that it allows interpretations of diachronic trends that are both intuitively plausible and motivated from the perspective of information theory. Furthermore, it demonstrates that the method is able to identify correlated linguistic changes and diachronic shifts that can be linked to historical events. Finally, it can help to improve diachronic POS tagging and complement existing NLP approaches. This indicates that the approach can facilitate an improved understanding of diachronic processes in language change.
Linguistic query systems are special purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.
So far, there have been few descriptions on creating structures capable of storing lexicographic data, ISO 24613:2008 being one of the latest. Another one is by Spohr (2012), who designs a multifunctional lexical resource which is able to store data of different types of dictionaries in a user-oriented way. Technically, his design is based on the principle of a hierarchical XML/OWL (eXtensible Markup Language/Web Ontology Language) representation model. This article follows another route in describing a model based on entities and relations between them; MySQL (usually referred to as: Structured Query Language) describes a database system of tables containing data and definitions of relations between them. The model was developed in the context of the project "Scientific eLexicography for Africa" and the lexicographic database to be built thereof will be implemented with MySQL. The principles of the ISO model and of Spohr's model are adhered to with one major difference in the implementation strategy: we do not place the lemma in the centre of attention, but the sense description — all other elements, including the lemma, depend on the sense description. This article also describes the contained lexicographic data sets and how they have been collected from different sources. As our aim is to compile several prototypical internet dictionaries (a monolingual Northern Sotho dictionary, a bilingual learners' Xhosa–English dictionary and a bilingual Zulu–English dictionary), we describe the necessary microstructural elements for each of them and which principles we adhere to when designing different ways of accessing them. We plan to make the model and the (empty) database with all graphical user interfaces that have been developed, freely available by mid-2015.
We present a gold standard for semantic relation extraction in the food domain for German. The relation types that we address are motivated by scenarios for which IT applications present a commercial potential, such as virtual customer advice in which a virtual agent assists a customer in a supermarket in finding those products that satisfy their needs best. Moreover, we focus on those relation types that can be extracted from natural language text corpora, ideally content from the internet, such as web forums, that are easy to retrieve. A typical relation type that meets these requirements are pairs of food items that are usually consumed together. Such a relation type could be used by a virtual agent to suggest additional products available in a shop that would potentially complement the items a customer has already in their shopping cart. Our gold standard comprises structural data, i.e. relation tables, which encode relation instances. These tables are vital in order to evaluate natural language processing systems that extract those relations.
This paper argues that there is a correlation between functional and purely grammatical patterning in language, yet the nature of this correlation has to be explored. This claim is based on the results of a corpus-driven study of the Slavic aspect, drawing on the socalled Distributional Hypothesis. According to the East-West Theory of the Slavic aspect, there is a broad east-west isogloss dividing the Slavic languages into an eastern group and a western group. There are also two transitional zones in the north and south, which share some properties with each group (Dickey 2000; Barentsen 1998, 2008). The East-West Theory uses concepts of cognitive grammar such as totality and temporal definiteness, and is based on various parameters of aspectual usage in discourse, including contexts such as habituals, general factuals, historical (narrative) present, performatives, sequenced events in the past etc. The purpose of the above-mentioned study is to challenge the semantic approach to the Slavic aspect by comparing the perfective and imperfective verbal aspect on the basis of purely grammatical co-occurrence patterns (see also Janda & Lyashevskaya 2011). The study focused on three Slavic languages: Russian, which, following the East-West Theory, belongs to the eastern group, Czech, which belongs to the western group, and Polish, which is considered as transitional in its aspectual patterning.
We present a testsuite for POS tagging German web data. Our testsuite provides the original raw text as well as the gold tokenisations and is annotated for parts-of-speech. The testsuite includes a new dataset for German tweets, with a current size of 3,940 tokens. To increase the size of the data, we harmonised the annotations in already existing web corpora, based on the Stuttgart-Tübingen Tag Set. The current version of the corpus has an overall size of 48,344 tokens of web data, around half of it from Twitter. We also present experiments, showing how different experimental setups (training set size, additional out-of-domain training data, self-training) influence the accuracy of the taggers. All resources and models will be made publicly available to the research community.
Large classes at universities(> 1600 students) create their own challenges for teaching and learning. Audience feedback is lacking and fine tuning of lectures, courses and exam preparation to address individual needs is very difficult to achieve. At RWTH Aachen University, a course concept and a knowledge map learning tool aimed to support individual students to prepare for exams in information science through theme-based exercises were developed and evaluated. The tool was grounded in the notion of self-regul ated learning with the goal of enabling students to learn
independently.
One of the fundamental questions about human language is whether all languages are equally complex. Here, we approach this question from an information-theoretic perspective. We present a large scale quantitative cross-linguistic analysis of written language by training a language model on more than 6500 different documents as represented in 41 multilingual text collections consisting of ~ 3.5 billion words or ~ 9.0 billion characters and covering 2069 different languages that are spoken as a native language by more than 90% of the world population. We statistically infer the entropy of each language model as an index of what we call average prediction complexity. We compare complexity rankings across corpora and show that a language that tends to be more complex than another language in one corpus also tends to be more complex in another corpus. In addition, we show that speaker population size predicts entropy. We argue that both results constitute evidence against the equi-complexity hypothesis from an information-theoretic perspective.
This paper argues that a lectometric approach may shed light on the distinction between destandardization and demotization, a pair of concepts that plays a key role in ongoing discussions about contemporary trends in standard languages. Instead of a binary distinction, the paper proposes three different types of destandardization, defined as quantitatively measurable changes in a stratigraphic language continuum. The three types are illustrated on the basis of a case study describing changes in the vocabulary of Dutch in The Netherlands and Flanders between 1990 and 2010.
We apply a decision tree based approach to pronoun resolution in spoken dialogue. Our system deals with pronouns with NP- and non-NP-antecedents. We present a set of features designed for pronoun resolution in spoken dialogue and determine the most promising features. We evaluate the system on twenty Switchboard dialogues and show that it compares well to Byron’s (2002) manually tuned system.
Creating and maintaining metadata for various kinds of resources requires appropriate tools to assist the user. The paper presents the metadata editor ProFormA for the creation and editing of CMDI (Component Metadata Infrastructure) metadata in web forms. This editor supports a number of CMDI profiles currently being provided for different types of resources. Since the editor is based on XForms and server-side processing, users can create and modify CMDI files in their standard browser without the need for further processing. Large parts of ProFormA are implemented as web services in order to reuse them in other contexts and programs.
In this paper we present a new approach to lexicographical design for the description of German speech act verbs. This approach is based on an action-theoretical semantic conception. The several conditions for linguistic action provide the basis for the elaboration of the central semantic features. The systematic relationship of these features is reflected in the organization of a lexical database which allows various possibilities of access to different types of lexical information.
In the following paper we shall give an outline of the semantic framework for describing speech act verbs, i. e. verbs of communication, with the practical goal of a semantical database for a (dictionary of) synonymy of German speech act verbs which enables the user not only to find a list of synonymous verbs but also enables him to gain an insight into the semantic relations between the words.
The semantic framework is based on
(i) a set of conditions for performing speech acts as the relevant domain of reference
(ii) the introduction of a notion of situation, or better type of situation
The performative as well as the descriptive use of the verbs can be reduced to their fundamental dependency on the situations in which they are used: on the one hand with regard to the possibility of the action itself, and on the other hand with regard to the possibility of their designation. For both ways of use the relevant aspects of the situation constitute the necessary conditions.
This paper presents three electronic collections of polarity items: (i) negative polarity items in Romanian, (ii) negative polarity items in German, and (iii) positive polarity items in German. The presented collections are a part of a linguistic resource on lexical units with highly idiosyncratic occurrence patterns. The motivation for collecting and documenting polarity items was to provide a solid empirical basis for linguistic investigations of these expressions. Our databe provides general information about the collected items, specifies their syntactic properties, and describes the environment that licenses a given item. For each licensing context, examples from various corpora and the Internet are introduced. Finally, the type of polarity (negative or positive) and the class (superstrong, strong, weak or open) associated with a given item is speci ed. Our database is encoded in XML and is available via the Internet, offering dynamic and exible access.
The authors present a multilingual electronic database of lexical items with idiosyncratic occurrence patterns. Currently, our database consists of: (1) a collection of 444 bound words in German; (2) a collection of 77 bound words in English; (3) a collection of 58 negative polarity items in Romanian; (4) a collection of 84 negative polarity items in German; and (5) a collection of 52 positive polarity items in German. The database is encoded in XML and is available via the Internet, offering dynamic and flexible access.
This paper outlines the generation process of a specifi computational linguistic representation termed the Multilingual Time Map, conceptually a multi-tape finit state transducer encoding linguistic data at different levels of granularity. The fi st component acquires phonological data from syllable labeled speech data, the second component define feature profiles the third component generates feature hierarchies and augments the acquired data with the define feature profiles and the fourth component displays the Multilingual Time Map as a graph.
One of the most popular techniques used in HPSG-based studies to describe linguistic phenomena is the raising mechanism. Besides ordinary raising verbs or adjectives, this tool has been applied for handling verbal complexes and discontinuous constituents, among other phenomena. In this paper, a new application for raising within the HPSG paradigm will be discussed, thereby investigating data from the prepositional domain. We will analyze linguistic properties of word combinations in German consisting of a preposition, a noun, and another preposition (such as auf Grund von (‘by virtue of’)), thus arguing that raising is the most appropriate method for satisfactorily describing the crucial syntactic features which are typical for those expressions. The objective of this paper is thus to demonstrate the efficiency of the raising mechanism as used in HPSG, and therefore, to emphasize the importance of designing a satisfactory uniform theory of raising within this grammar framework.
One of the most popular techniques used in HPSG-based studies to describe linguistic phenomena is the raising mechanism. Besides ordinary raising verbs or adjectives, this tool has been applied for handling verbal complexes and discontinuous constituents, among other phenomena. In this paper, a new application for raising within the HPSG paradigm will be discussed, thereby investigating data from the prepositional domain. We will analyze linguistic properties of word combinations in German consisting of a preposition, a noun, and another preposition (such as auf Grund von (‘by virtue of’)), thus arguing that raising is the most appropriate method for satisfactorily describing the crucial syntactic features which are typical for those expressions. The objective of this paper is thus to demonstrate the efficiency of the raising mechanism as used in HPSG, and therefore, to emphasize the importance of designing a satisfactory uniform theory of raising within this grammar framework.
We present a new resource for German causal language, with annotations in context for verbs, nouns and adpositions. Our dataset includes 4,390 annotated instances for more than 150 different triggers. The annotation scheme distinguishes three different types of causal events (CONSEQUENCE, MOTIVATION, PURPOSE). We also provide annotations for semantic roles, i.e. of the cause and effect for the causal event as well as the actor and affected party, if present. In the paper, we present inter-annotator agreement scores for our dataset and discuss problems for annotating causal language. Finally, we present experiments where we frame causal annotation as a sequence labelling problem and report baseline results for the prediciton of causal arguments and for predicting different types of causation.
Classical null hypothesis significance tests are not appropriate in corpus linguistics, because the randomness assumption underlying these testing procedures is not fulfilled. Nevertheless, there are numerous scenarios where it would be beneficial to have some kind of test in order to judge the relevance of a result (e.g. a difference between two corpora) by answering the question whether the attribute of interest is pronounced enough to warrant the conclusion that it is substantial and not due to chance. In this paper, I outline such a test.
The understanding of story variation, whether motivated by cultural currents or other factors, is important for applications of formal models of narrative such as story generation or story retrieval. We present the first stage of an experiment to elicit natural narrative variation data suitable for evaluation with respect to story similarity, to qualitative and quantitative analysis of story variation, and also for data processing. We also present few preliminary results from the first stage of the experiment, using Red Riding Hood and Romeo and Juliet as base texts.
XML has been designed for creating structured documents, but the information that is encoded in these structures are, by definition, out of scope for XML. Additional sources, normally not easily interpretable by computers, such as documentation are needed to determine the intention of specific tags in a tag-set. The Component Metadata Infrastructure (CMDI) takes a rather pragmatic approach to foster interoperability between XML instances in the domain of metadata descriptions for language resources. This paper gives an overview of this approach.
This paper presents the current results of an ongoing research project on corpus distribution of prepositions and pronouns within Polish preposition-pronoun contractions. The goal of the project is to provide a quantitative description of Polish preposition-pronoun contractions taking into consideration morphosyntactic properties of their components. It is expected that the results will provide a basis for a revision of the traditionally assumed inflectional paradigms of Polish pronouns and, thus, for a possible remodeling of these paradigms. The results of corpus-based investigations of the distribution of prepositions within preposition-pronoun contractions can be used for grammar-theoretical and lexicographic purposes.
The paper deals with the use of ICH WEIß NICHT (‘I don’t know’) in German talk-in-interaction. Pursuing an Interactional Linguistics approach, we identify different interactional uses of ICH WEIß NICHT and discuss their relationship to variation in argument structure (SV (O), (O)VS, V-only). After ICH WEIß NICHT with full complementation, speakers emphasize their lack of knowledge or display reluctance to answer. In contrast, after variants without an object complement, in contrast, speakers display uncertainty about the truth of the following proposition or about its sufficiency as an answer. Thus, while uses with both subject and object tend to close a sequence or display lack of knowledge, responses without an object, in contrast, function as a prepositioned epistemic hedge or a pragmatic marker framing the following TCU. When ICH WEIß NICHT is used in response to a statement, it indexes disagreement (independently from all complementation patterns).
Our paper deals with the use of ICH WEIß NICHT (‘I don’t know’) in German talk-in-interaction. Pursuing an Interactional Linguistics approach, we identify different interactional uses of ICH WEIß NICHT and discuss their relationship to variation in argument structure (SV (O), (O)VS, V-only). After ICH WEIß NICHT with full complementation, speakers emphasize their lack of knowledge or display reluctance to answer. In contrast, after variants without an object complement, in contrast, speakers display uncertainty about the truth of the following proposition or about its sufficiency as an answer. Thus, while uses with both subject and object tend to close a sequence or display lack of knowledge, responses without an object, in contrast, function as a prepositioned epistemic hedge or a pragmatic marker framing the following TCU. When ICH WEIß NICHT is used in response to a statement, it indexes disagreement (independently from all complementation patterns).
In a number of languages, agreement in specificational copular sentences can or must be with the second of the two nominals, even when it is the first that occupies the canonical subject position. Béjar & Kahnemuyipour (2017) show that Persian and Eastern Armenian are two such languages. They then argue that ‘NP2 agreement’ occurs because the nominal in subject position (NP1) is not accessible to an external probe. It follows that actual agreement with NP1 should never be possible: the alternative to NP2 agreement should be ‘default’ agreement. We show that this prediction is false. In addition to showing that English has NP1, not default, agreement, we present new data from Icelandic, a language with rich agreement morphology, including cases that involve ‘plurale tantum’ nominals as NP1. These allow us to control for any confound from the fact that typically in a specificational sentence with two nominals differing in number, it is NP2 that is plural. We show that even in this case, the alternative to agreement with NP2 is agreement with NP1, not a default. Hence, we conclude that whatever the correct analysis of specificational sentences turns out to be, it must not predict obligatory failure of NP1 agreement.
This paper presents the system architecture as well as the underlying workflow of the Extensible Repository System of Digital Objects (ERDO) which has been developed for the sustainable archiving of language resources within the Tübingen CLARIN-D project. In contrast to other approaches focusing on archiving experts, the described workflow can be used by researchers without required knowledge in the field of long-term storage for transferring data from their local file systems into a persistent repository.
This paper describes the lexical database tool LOLA (Linguistic-Oriented Lexical database Approach) which has been developed for the construction and maintenance of lexicons for the machine translation system LMT. First, the requirements such a tool should meet are discussed, then LMT and the lexical information it requires, and some issues concerning vocabulary acquisition are presented. Afterwards the architecture and the components of the LOLA system are described and it is shown how we tried to meet the requirements worked out earlier. Although LOLA originally has been designed and implemented for the German-English LMT prototype, it aimed from the beginning at a representation of lexical data that can be reused for other LMT or MT prototypes or even other NLP applications. A special point of discussion will therefore be the adaptability of the tool and its components as well as the reusability of the lexical data stored in the database for the lexicon development for LMT or for other applications.
Connectives are conjunctions, prepositions, adverbs and other particles which share the function of encoding semantic relations between sentences, or rather, between semantic objects some of which can be meanings of sentences. The relata linked by any such relation will fall into one of four distinct categories: they will be physical objects, states of affairs, propositions, or pragmatic options (the atoms of human interaction). Physical objects constitute the conceptual domain of space, states of affairs the domain of time, propositions the epistemic domain, and pragmatic options the deontic domain. The relations encodable in any of these domains can be divided into four basic types: similarity relations, situating relations, conditional relations, and causal relations. Conceptual domains and types of relations define the universe of possible connections between semantic objects.
Connectives differ as to the interpretations they permit in terms of conceptual domains and types of relations. Very few connectives are specialized on relata of one certain category and relations of one certain type. Possible examples in German are später (‘later on’) and zwischenzeitlich (‘in the meantime’), which encode situating relations between states of affairs. Other connectives are specialized on relata of one certain category, but are underspecified with respect to the type of relation. An example is German sobald (‘as soon as’), which can only connect states of affairs, but accepts situating, conditional and causal readings. Connectives of a third group are specialized on relations of a certain type, but are underspecified with respect to the category of the relata. Examples of this kind are German weil (‘because’) and trotzdem (‘nevertheless’), which encode causal relations, but accept states of affairs, propositions and pragmatic options as their relata. Connectives of a fourth group are underspecified both for the category of relata and the type of relation. An example is German da (‘there’), which accepts relata of any category and allows for situating, conditional and causal readings. Connectives like und (‘and’) and oder (‘or’) exhibit an even higher degree of under specification, in that they allow for all kinds of relations and relata.
Feedback utterances are among the most frequent in dialogue. Feedback is also a crucial aspect of linguistic theories that take social interaction, involving language, into account. This paper introduces the corpora and datasets of a project scrutinizing this kind of feedback utterances in French. We present the genesis of the corpora (for a total of about 16 hours of transcribed and phone force-aligned speech) involved in the project. We introduce the resulting datasets and discuss how they are being used in on-going work with focus on the form-function relationship of conversational feedback. All the corpora created and the datasets produced in the framework of this project will be made available for research purposes.
We present a study on gaps in spoken language interaction as a potential candidate for syntactic boundaries. On the basis of an online annotation experiment, we can show that there is an effect of gap duration and gap type on its likelihood of being a syntactic boundary. We discuss the potential of these findings for an automation of the segmentation process.
A Supervised learning approach for the extraction of opinion sources and targets from German text
(2019)
We present the first systematic supervised learning approach for the extraction of opinion sources and targets on German language data. A wide choice of different features is presented, particularly syntactic features and generalization features. We point out specific differences between opinion sources and targets. Moreover, we explain why implicit sources can be extracted even with fairly generic features. In order to ensure comparability our classifier is trained and tested on the dataset of the STEPS shared task.
This paper presents a survey on hate speech detection. Given the steadily growing body of social media content, the amount of online hate speech is also increasing. Due to the massive scale of the web, methods that automatically detect hate speech are required. Our survey describes key areas that have been explored to automatically recognize these types of utterances using natural language processing. We also discuss limits of those approaches.
This paper presents a survey on the role of negation in sentiment analysis. Negation is a very common linguistic construction that affects polarity and, therefore, needs to be taken into consideration in sentiment analysis.
We will present various computational approaches modeling negation in sentiment analysis. We will, in particular, focus on aspects such as level of representation used for sentiment analysis, negation word detection and scope of negation. We will also discuss limits and challenges of negation modeling on that task.
We question the growing consensus in the literature that European Americans behave as a homogenous pan-ethnic coalition of voters. Seemingly below the radar of scholarship on voting groups in American politics, we identify a group of white voters that behaves differently from others: German Americans, the largest ethnic group, regionally concentrated in the ‘Swinging Midwest’. Using county level voting returns, ancestry group information from the American Community Survey (ACS), current survey data and historical census data going back as early as 1910, we provide evidence for a partisan and a non-partisan pathway that motivated German Americans to vote for Trump in 2016: a historically grown association with the Republican Party and an acquired taste for isolationist attitudes that mobilizes non-partisan German Americans to support isolationist candidates. Our findings indicate that European American experiences of migration and integration still echo into the political arena of today.
A syntax-based scheme for the annotation and segmentation of German spoken language interactions
(2018)
Unlike corpora of written language where segmentation can mainly be derived from orthographic punctuation marks, the basis for segmenting spoken language corpora is not predetermined by the primary data, but rather has to be established by the corpus compilers. This impedes consistent querying and visualization of such data. Several ways of segmenting have been proposed,
some of which are based on syntax. In this study, we developed and evaluated annotation and segmentation guidelines in reference to the topological field model for German. We can show that these guidelines are used consistently across annotators. We also investigated the influence of various interactional settings with a rather simple measure, the word-count per segment and unit-type. We observed that the word count and the distribution of each unit type differ in varying interactional settings and that our developed segmentation and annotation guidelines are used consistently across annotators. In conclusion, our syntax-based segmentations reflect interactional properties that are intrinsic to the social interactions that participants are involved in. This can be used for further analysis of social interaction and opens the possibility for automatic segmentation of transcripts.
This article presents a revised version of GAT, a transcription system first devel-oped by a group of German conversation analysts and interactional linguists in 1998. GAT tries to follow as many principles and conventions as possible of the Jefferson-style transcription used in Conversation Analysis, yet proposes some conventions which are more compatible with linguistic and phonetic analyses of spoken language, especially for the representation of prosody in talk-in-interaction. After ten years of use by researchers in conversation and discourse analysis, the original GAT has been revised, against the background of past experience and in light of new necessities for the transcription of corpora arising from technologi-cal advances and methodological developments over recent years. The present text makes GAT accessible for the English-speaking community. It presents the GAT 2 transcription system with all its conventions and gives detailed instructions on how to transcribe spoken interaction at three levels of delicacy: minimal, basic and fine. In addition, it briefly introduces some tools that may be helpful for the user: the German online tutorial GAT-TO and the transcription editing software FOLKER.
A tale of many stories: explaining policy diffusion between European higher education systems
(2013)
The thesis ”A Tale of Many Stories - Explaining Policy Diffusion between European Higher Education Systems" systematically examines diffusion processes and their effects with regard to a rather neglected policy area – the case of European higher education policy. The thesis contributes to the slowly growing number of comparative and mechanism-based studies on policy diffusion and represents the first study on the diffusion of policies between European Higher Education Systems. The main aim is to contrast and compare testable and coherent explanatory models on the functioning of different diffusion mechanisms. Three sets of explanatory models on the relationship between variables triggering and conditioning diffusion mechanisms and their impact on policy adoption are drawn from mechanism-based thinking on policy diffusion: on learning, socialization, and externalities. These approaches conceptualize the policy process in terms of interdependencies between international and national actors. Explanatory models based on assumptions about domestic policies and the common responses of countries to similar policy problems extend this theoretical framework. The thesis is based on event history modelling of policy change and adoption in higher education systems of 16 West European countries between the yeas 1980 and 1998. Overall 14 policy items describing performance-orientated reforms for public universities ranging from the adoption of external quality assurance systems to tuition fees are examined. Empirically, the main research question is what international, national and policy-specific factors cause and condition diffusion processes and the adoption of public policies? Evidence can be found for and against all of the four theoretical approaches tested. In comparison, many of the assumptions related to interdependencies lack robustness, whereas the common response model is the most stable one. This does not mean that explanatory models based on interdependent decision-making are not suitable for analysing policy diffusion in higher education. Rather interdependency is a multi- dimensional concept that requires a comparative assessment of diffusion mechanisms. Some of explanatory factors based on interdependent decision- making are still supported by the empirical analysis though. From this point of view, the recommendation for analysing diffusion is to start with a model based on domestic politics, that is successively extended by explanatory factors dealing with interdependencies between international and national actors. Diffusion variables matter – but it is only one side of the tale on policy diffusion.
In this paper we present an evaluation of rule-based morphological components for German for use in an interactive editing environment. The criteria for the evaluation are deduced from the intended use of these components, namely availability, performance, programming interfaces, and analysis quality. We evaluated systems developed and maintained since decades as well as new systems. However, we note serious general shortcomings when looking closer at recent implementations and come to the conclusion that the oldest system is the only one that satisfies our requirements.
This paper describes work in progress on I5, a TEI-based document grammar for the corpus holdings of the Institut für Deutsche Sprache (IDS) in Mannheim and the text model used by IDS in its work. The paper begins with background information on the nature and purposes of the corpora collected at IDS and the motivation for the I5 project (section 1). It continues with a description of the origin and history of the IDS text model (section 2), and a description (section 3) of the techniques used to automate, as far as possible, the preparation of the ODD file documenting the IDS text model. It ends with some concluding remarks (section 4). A survey of the additional features of the IDS-XCES realization of the IDS text model is given in an appendix.
The paper presents an XML schema for the representation of genres of computer-mediated communication (CMC) that is compliant with the encoding framework defined by the TEI. It was designed for the annotation of CMC documents in the project Deutsches Referenzkorpus zur internetbasierten Kommunikation (DeRiK), which aims at building a corpus on language use in the most popular CMC genres on the German-speaking Internet. The focus of the schema is on those CMC genres which are written and dialogic―such as forums, bulletin boards, chats, instant messaging, wiki and weblog discussions, microblogging on Twitter, and conversation on “social network” sites.
The schema provides a representation format for the main structural features of CMC discourse as well as elements for the annotation of those units regarded as “typical” for language use on the Internet. The schema introduces an element <posting>, which describes stretches of text that are sent to the server by a user at a certain point in time. Postings are the main constituting elements of threads and logfiles, which, in our schema, are the two main types of CMC macrostructures. For the microlevel of CMC documents (that is, the structure of the <posting> content), the schema introduces elements for selected features of Internet jargon such as emoticons, interaction words and addressing terms. It allows for easy anonymization of CMC data for purposes in which the annotated data are made publicly available and includes metadata which are necessary for referencing random excerpts from the data as references in dictionary entries or as results of corpus queries.
Documentation of the schema as well as encoding examples can be retrieved from the web at http://www.empirikom.net/bin/view/Themen/CmcTEI. The schema is meant to be a core model for representing CMC that can be modified and extended by others according to their own specific perspectives on CMC data. It could be a first step towards an integration of features for the representation of CMC genres into a future new version of the TEI Guidelines.
This paper formulates a proposal for standardising spoken language transcription, as practised in conversation analysis, sociolinguistics, dialectology and related fields, with the help of the TEI guidelines. Two areas relevant to standardisation are identified and discussed: first, the macro structure of transcriptions, as embodied in the data models and file formats of transcription tools such as ELAN, Praat or EXMARaLDA; second, the micro structure of transcriptions as embodied in transcription conventions such as CA, HIAT or GAT. A two-step process is described in which first the macro structure is represented in a generic TEI format based on elements defined in the P5 version of the Guidelines. In the second step, character data in this representation is parsed according to the regularities of a transcription convention resulting in a more fine-grained TEI markup which is also based on P5. It is argued that this two step process can, on the one hand, map idiosyncratic differences in tool formats and transcription conventions onto a unified representation. On the other hand, differences motivated by different theoretical decisions can be retained in a manner which still allows a common processing of data from different sources. In order to make the standard usable in practice, a conversion tool—TEI Drop—is presented which uses XSL transformations to carry out the conversion between different tool formats (CHAT, ELAN, EXMARaLDA, FOLKER and Transcriber) and the TEI representation of transcription macro structure (and vice versa) and which also provides methods for parsing the micro structure of transcriptions according to two different transcription conventions (HIAT and cGAT). Using this tool, transcribers can continue to work with software they are familiar with while still producing TEI-conformant transcription files. The paper concludes with a discussion of the work needed in order to establish the proposed standard. It is argued that both tool formats and the TEI guidelines are in a sufficiently mature state to serve as a basis for standardisation. Most work consequently remains in analysing and standardising differences between different transcription conventions.
The paper will give a concise account of the theory of Lexical Event Structures. The paper has three objectives which correspond to the following three sections. In section 2 I will sketch the theory and discuss the empirical goals the theory pursues (section 2.1) and the semantic components Lexical Event Structures consist of (section 2.2). Section 3 is devoted to linguistic phenomena whose explanation depends on Lexical Event Structures. In section 3.1 I will briefly illustrate in how far Lexical Event Structures are related to phenomena from five central empirical domains of lexical semantics and in section 3.2 it will be shown how Lexical Event Structures function in a linking theory. Section 4 aims to show how the central semantic concepts in Lexical Event Structures can be anchored to concepts which are well-founded in cognitive science. Section 4.1 discusses the event concept employed and illustrates the relation between the perception of movements and the use of verbs of movement. Section 4.2 deals with the concept of volition with respect to the licensing conditions for intransitive verb passives. In section 4.3 the distinction between durativity and punctuality, which has proven relevant for a number of verb semantic phenomena, is tied to the way we perceive events and structure our own actions. Section 5 provides a conclusion.
Travel guides and travel reports constitute an important source for the generation and spread of popular geopolitical epistemes and assumptions. With regard to colonial attitudes and their possible perpetuation, it is therefore of great interest what kind of information such texts convey regarding (post)colonial places, and how they contextualize it. The paper compares descriptions of Qingdao (Tsingtau), a German colonized territory between 1897 and 1914, in travel guides and related material from colonial and postcolonial times and in different European languages. It investigates what differences can be found between these descriptions in relation to time, language, and medium (print or online) of publication. Of particular interest is the question whether, and in what ways, colonial perspectives are perpetuated in present-day (especially German) travel literature.
The Lehnwortportal Deutsch (2012 seqq.) serves as an integrated online information system on German lexical borrowings into other languages, synthesizing an increasing number of lexicographical dictionaries and providing basic cross-resource search options. The paper discusses the far-reaching revision of the system’s conceptual, lexicographical and technological underpinnings currently under way, focussing on their relevance for multilingual loanword lexicography.
We present SPLICR, the Web-based Sustainability Platform for Linguistic Corpora and Resources. The system is aimed at people who work in Linguistics or Computational Linguistics: a comprehensive database of metadata records can be explored in order to find language resources that could be appropriate for one’s spe cific research needs. SPLICR also provides a graphical interface that enables users to query and to visualise corpora. The project in which the system is developed aims at sustainably archiving the ca. 60 language resources that have been constructed in three collaborative research centres. Our project has two primary goals: (a) To process and to archive sustainably the resources so that they are still available to the research community in five, ten, or even 20 years time. (b) To enable researchers to query the resources both on the level of their metadata as well as on the level of linguistic annotations. In more general terms, our goal is to enable solutions that leverage the interoperability, reusability, and sustainability of heterogeneous collec- tions of language resources.
In this paper we present an experimental semantic search function, based on word embeddings, for an integrated online information system on German lexical borrowings into other languages, the Lehnwortportal Deutsch (LWPD). The LWPD synthesizes an increasing number of lexicographical resources and provides basic cross-resource search options. Onomasiological access to the lexical units of the portal is a highly desirable feature for many research questions, such as the likelihood of borrowing lexical units with a given meaning (Haspelmath & Tadmor, 2009; Zeller, 2015). The search technology is based on multilingual pre-trained word embeddings, and individual word senses in the portal are associated with word vectors. Users may select one or more among a very large number of search terms, and the database returns lexical items with word sense vectors similar to these terms. We give a preliminary assessment of the feasibility, usability and efficacy of our approach, in particular in comparison to search options based on semantic domains or fields.
Abertura/Opening
(2010)
In this paper I explore the theoretical significance of phonologically conditioned gaps in word formation. The data support the original approach to gaps in Optimality Theory proposed by Prince & Smolensky (1993), which crucially involves MPARSE as a ranked and violable constraint. The alternative CONTROL model proposed by Orgun & Sprouse (1999) is found to be inadequate because of lost generalisations and technical flaws. It is shown that a careful distinction between various morphophonological effects (e.g. paradigm uniformity effects, phonological repair and ‘stem selection’) is necessary to shed light on the morphology–phonology interface. The data investigated here support affixspecific constraint rankings, but argue against any stratal organisation of morphology.
The Manatee corpus management system on which the Sketch Engine is built is efficient, but unable to harness the power of today’s multiprocessor machines. We describe a new, compatible implementation of Manatee which we develop in the Go language and report on the performance gains that we obtained.
Accentuation, Uncertainty and Exhaustivity - Towards a Model of Pragmatic Focus Interpretation
(2010)
This paper presents a model of pragmatic focus interpretation that is assumed to be part of a complete language comprehension model and that is inspired by Levelt's language processing model. The model is derived from our empirical data on the role of accentuation, prosodic indicators of uncertainty and context for pragmatic focus interpretation. In its present state, the model is restricted to these data, but nevertheless generates predictions.
We present an approach to an aspect of managing complex access scenarios to large and heterogeneous corpora that involves handling user queries that, intentionally or due to the complexity of the queried resource, target texts or annotations outside of the given user’s permissions. We first outline the overall architecture of the corpus analysis platform KorAP, devoting some attention to the way in which it handles multiple query languages, by implementing ISO CQLF (Corpus Query Lingua Franca), which in turn constitutes a component crucial for the functionality discussed here. Next, we look at query rewriting as it is used by KorAP and zoom in on one kind of this procedure, namely the rewriting of queries that is forced by data access restrictions.
This paper is concerned with a novel methodology for generating phonetic questions used in tree-based state tying for speech recognition. In order to implement a speech recognition system, language-dependent knowledge which goes beyond annotated material is usually required. The approach presented here generates phonetic questions for decision trees are based on a feature table that summarizes the articulatory characteristics of each sound. On the one hand, this method allows better language-specific triphone models to be defined given only a feature-table as linguistic input. On the other hand, the feature-table approach facilitates efficient definition of triphone models for other languages since again only a feature table for this language is required. The approach is exemplified with speech recognition systems for English and Thai.
In this paper, we present an overview of freely available web applications providing online access to spoken language corpora. We explore and discuss various solutions with which the corpus providers and corpus platform developers address the needs of researchers who are working with spoken language. The paper aims to contribute to the long-overdue exchange and discussion of methods and best practices in the design of online access to spoken language corpora.
This study investigates high vowel laxing in the Louisiana French of the Lafourche Basin. Unlike Canadian French, in which the high vowels /i, y, u/ are traditionally described as undergoing laxing (to [I, Y, U]) in word-final syllables closed by any consonant other than a voiced fricative (see Poliquin 2006), Oukada (1977) states that in the Louisiana French of Lafourche Parish, any coda consonant will trigger high vowel laxing of /i/; he excludes both /y/ and /u/ from his discussion of high vowel laxing. The current study analyzes tokens of /i, y, u/ from pre-recorded interviews with three older male speakers from Terrebonne Parish. We measured the first and second formants and duration for high vowel tokens produced in four phonetic environments, crossing syllable type (open vs. closed) by consonant type (voiced fricative vs. any consonant other than a voiced fricative). Results of the acoustic analysis show optional laxing for /i/ and /y/ and corroborate the finding that high vowels undergo laxing in word-final closed syllables, regardless of consonant type. Data for /u/ show that the results vary widely by speaker, with the dominant pattern (shown by two out of three speakers) that of lowering and backing in the vowel space of closed syllable tokens. Duration data prove inconclusive, likely due to the effects of stress. The formant data published here constitute the first acoustic description of high vowels for any variety of Louisiana French and lay the groundwork for future study on these endangered varieties.