Refine
Document Type
- Part of a Book (581)
- Conference Proceeding (561)
- Article (453)
- Book (66)
- Working Paper (26)
- Doctoral Thesis (21)
- Other (18)
- Part of Periodical (12)
- Preprint (12)
- Contribution to a Periodical (6)
Language
- English (1765)
Keywords
- Korpus <Linguistik> (416)
- Deutsch (410)
- Computerlinguistik (161)
- Konversationsanalyse (138)
- Interaktion (116)
- Englisch (112)
- Annotation (97)
- Gesprochene Sprache (93)
- Automatische Sprachanalyse (75)
- Wörterbuch (73)
- Semantik (63)
- German (61)
- Forschungsdaten (54)
- Natürliche Sprache (53)
- Syntax (52)
- Online-Wörterbuch (50)
- Computerunterstützte Lexikographie (49)
- Grammatik (47)
- Lexikografie (47)
- Mehrsprachigkeit (46)
- Verb (44)
- conversation analysis (42)
- Kommunikation (40)
- Neologismus (39)
- Datenmanagement (37)
- Kognitive Linguistik (37)
- Metadaten (35)
- Digital Humanities (34)
- Maschinelles Lernen (34)
- Sprachstatistik (34)
- Polnisch (33)
- Sprachpolitik (33)
- Fremdsprachenlernen (32)
- Information Extraction (32)
- Multimodalität (31)
- Prosodie (29)
- corpus linguistics (28)
- Kontrastive Linguistik (27)
- Lexikographie (27)
- Standardisierung (27)
- Lehnwort (26)
- Minderheitensprache (26)
- Sprachkontakt (26)
- Sprachvariante (26)
- Text Mining (26)
- Diskursanalyse (25)
- Sprecherwechsel (24)
- XML (24)
- Französisch (23)
- Soziolinguistik (23)
- Morphologie <Linguistik> (22)
- Pragmatik (22)
- Sprachdaten (22)
- Sprachgebrauch (22)
- Sprachwandel (21)
- COVID-19 (20)
- Infrastruktur (20)
- Psycholinguistik (20)
- Syntaktische Analyse (20)
- CLARIN (19)
- Computerunterstützte Kommunikation (19)
- Interaktionsanalyse (19)
- Semantische Analyse (19)
- Argumentstruktur (18)
- Corpus linguistics (18)
- Datensatz (18)
- Russisch (18)
- Social Media (18)
- Text Encoding Initiative (18)
- Wortbildung (18)
- Wortschatz (18)
- Automatische Spracherkennung (17)
- Europa (17)
- Metapher (17)
- Terminologie (17)
- Internet (16)
- Transkription (16)
- Automatische Sprachverarbeitung (15)
- Computerunterstützte Lexikografie (15)
- Forschung (15)
- Frame-Semantik (15)
- Kollokation (15)
- Sentimentanalyse (15)
- Texttechnologie (15)
- Urheberrecht (15)
- Wortstellung (15)
- Zweisprachiges Wörterbuch (15)
- computerunterstützte Lexikographie (15)
- Datenbank (14)
- Forschungsmethode (14)
- Gespräch (14)
- Information Retrieval (14)
- Mundart (14)
- Phonetik (14)
- Phraseologie (14)
- Semasiologie (14)
- Sprachgeschichte (14)
- Experimentelle Psychologie (13)
- Historische Sprachwissenschaft (13)
- Pandemie (13)
- Phonologie (13)
- Soziale Wahrnehmung (13)
- Spracherwerb (13)
- Thematische Relation (13)
- Worthäufigkeit (13)
- interactional linguistics (13)
- lexicography (13)
- Corpus technology (12)
- Deutsches Referenzkorpus (DeReKo) (12)
- Dialog (12)
- Körpersprache (12)
- Methodologie (12)
- Negation (12)
- Sprache (12)
- Visualisierung (12)
- gesprochene Sprache (12)
- Auszeichnungssprache (11)
- Beleidigung (11)
- Enzyklopädie (11)
- Ethnolinguistik (11)
- Head-driven phrase structure grammar (11)
- Kognitive Grammatik (11)
- Lebensmittel (11)
- Linguistic Landscape (11)
- Linguistik (11)
- Polarität (11)
- Propositionale Einstellung (11)
- Recht (11)
- Rumänisch (11)
- Sprachliche Minderheit (11)
- Sprachwechsel (11)
- Textlinguistik (11)
- Verstehen (11)
- metadata (11)
- sentiment analysis (11)
- Augenfolgebewegung (10)
- Bedeutung (10)
- Daten (10)
- Datenschutz (10)
- Diskurs (10)
- Einsprachiges Wörterbuch (10)
- Elektronisches Wörterbuch (10)
- Formale Semantik (10)
- Identität (10)
- Italienisch (10)
- Kasus (10)
- Kongress (10)
- Langzeitarchivierung (10)
- Paronym (10)
- Polish (10)
- Prädikat (10)
- Schriftsprache (10)
- Sprachverarbeitung (10)
- TEI (10)
- Valenz <Linguistik> (10)
- language policy (10)
- prosody (10)
- spoken German (10)
- Akzent (9)
- Digitalisierung (9)
- Frage (9)
- Germanische Sprachen (9)
- Kolonialismus (9)
- Kontrastive Grammatik (9)
- Korpuslinguistik (9)
- Nominalphrase (9)
- Präposition (9)
- Raum (9)
- Spanisch (9)
- Standardsprache (9)
- Tempus (9)
- Tschechisch (9)
- language contact (9)
- multimodality (9)
- research infrastructure (9)
- Übersetzung (9)
- Artikulation (8)
- Automatische Textanalyse (8)
- Benutzer (8)
- Blickbewegung (8)
- CMC (8)
- Data Mining (8)
- Datenanalyse (8)
- Datenqualität (8)
- Dialektologie (8)
- French (8)
- HPSG (8)
- Imperativ (8)
- Institut für Deutsche Sprache <Mannheim> (8)
- Intersubjektivität (8)
- Kempelen, Wolfgang von (8)
- Lettland (8)
- Lexikon (8)
- Methode (8)
- Morphologie (8)
- Neue Medien (8)
- Parser (8)
- Politische Sprache (8)
- Pronomen (8)
- Rezension (8)
- Segmentierung (8)
- Software (8)
- Sprachproduktion (8)
- Sprachverarbeitung <Psycholinguistik> (8)
- Sprechakt (8)
- Textkorpus (8)
- Twitter <Softwareplattform> (8)
- Wikipedia (8)
- YouTube (8)
- Zeit (8)
- Zweisprachigkeit (8)
- corpora (8)
- neologisms (8)
- Bildung (7)
- Biografie (7)
- Computergestützte Lexikographie (7)
- Datenbanksystem (7)
- Datenschutz-Grundverordnung (7)
- Deutschland (7)
- Ethnopsychologie (7)
- Finnisch (7)
- FrameNet (7)
- Instruktion (7)
- Interoperabilität (7)
- Jugendsprache (7)
- Konstruktionsgrammatik (7)
- Korrektur (7)
- Large corpora (7)
- Latgalian (7)
- Lemma (7)
- Lettisch (7)
- Längsschnittuntersuchung (7)
- Maschinelle Übersetzung (7)
- Mehrworteinheit (7)
- Morphosyntax (7)
- N400 (7)
- Online-Ressource (7)
- Ontologie <Wissensverarbeitung> (7)
- Partikel (7)
- Smartphone (7)
- Synonym (7)
- Temporalität (7)
- Textverstehen (7)
- Theater (7)
- Variation (7)
- Wissenschaftssprache (7)
- corpus analysis (7)
- language attitudes (7)
- language resources (7)
- machine learning (7)
- natural language processing (7)
- online dictionary (7)
- Adjektiv (6)
- Ambiguität (6)
- Antwort (6)
- Artikulatorische Phonetik (6)
- Aspekt <Linguistik> (6)
- Benutzerforschung (6)
- Benutzeroberfläche (6)
- Computerunterstützte Lexikogaphie (6)
- Computerunterstütztes Verfahren (6)
- Corpus annotation (6)
- Datenformat (6)
- Empirische Linguistik (6)
- Ethnomethodologie (6)
- Europäische Union (6)
- Experiment (6)
- Fachsprache (6)
- Fahrschule (6)
- Fallstudie (6)
- Forschungsinfrastruktur (6)
- Grammis (6)
- Griechisch (6)
- Handlung (6)
- Historische Lexikografie (6)
- Kontrastive Syntax (6)
- Korpusanalyseplattform (KorAP) (6)
- Mobiles Endgerät (6)
- Name (6)
- Neurolinguistik (6)
- Non-native speaker (6)
- Objektsatz (6)
- Opinion Mining (6)
- Personenbezogene Daten (6)
- Politische Kommunikation (6)
- Rechtschreibung (6)
- Repository <Informatik> (6)
- Soziale Identität (6)
- Soziales Handeln (6)
- Sozialwissenschaften (6)
- Sprachtypologie (6)
- Theaterprobe (6)
- Tonhöhe (6)
- Türkisch (6)
- USA (6)
- Ungarisch (6)
- Videoaufzeichnung (6)
- annotation (6)
- automatische Sprachproduktion (6)
- comparable corpora (6)
- computer-mediated communication (6)
- corpus (6)
- corpus processing (6)
- early responses (6)
- morphology (6)
- multilingualism (6)
- multimodal analysis (6)
- online dictionaries (6)
- video data (6)
- web corpora (6)
- word embeddings (6)
- API (5)
- Abfragesprache (5)
- Algorithmus (5)
- Althochdeutsch (5)
- Anapher <Syntax> (5)
- Audiovisuelles Material (5)
- Aufforderung (5)
- Bibliografie (5)
- Bibliographie (5)
- Compterunterstützte Lexikographie (5)
- Concurrent Markup/Overlap (5)
- Conversation Analysis (5)
- Conversation analysis (5)
- Corpus management (5)
- Datenmodell (5)
- Datenstruktur (5)
- Datenverarbeitung (5)
- Deutsche (5)
- Empirische Forschung (5)
- English (5)
- Entlehnung (5)
- Ergänzung <Linguistik> (5)
- Erwartung (5)
- Familie (5)
- Fremdsprache (5)
- Gefühl (5)
- Geschlechtergerechte Sprache (5)
- Gesellschaft (5)
- Gesprächsanalyse (5)
- Globalisierung (5)
- Grammatikalisation (5)
- Konferenz (5)
- Kontrastive Pragmatik (5)
- Kooperation (5)
- Kopulasatz (5)
- Lasisch (5)
- Latein (5)
- Latvia (5)
- Lettgallen (5)
- Lexikalische Semantik (5)
- Linguist (5)
- Linguistische Datenverarbeitung (5)
- Lyrik / Lyrik (5)
- Massenmedien (5)
- Mehrsprachiges Wörterbuch (5)
- Modalverb (5)
- National corpus (5)
- Negativer Polaritätsausdruck (5)
- Niederländisch (5)
- Nominalkompositum (5)
- O.K. (5)
- Open Source (5)
- Parlamentsdebatte (5)
- Phraseologismus (5)
- Portugiesisch (5)
- Prosody (5)
- Psychotherapie (5)
- Regionalsprache (5)
- Soziale Norm (5)
- Spaltsatz (5)
- Sprachgeografie (5)
- Sprachhandeln (5)
- Sprachunterricht (5)
- Statistik (5)
- Stereotyp (5)
- Strukturbaum (5)
- Südkaukasische Sprachen (5)
- Text Encoding Initiative (TEI) (5)
- Textanalyse (5)
- Textproduktion (5)
- Verständlichkeit (5)
- Vokal (5)
- Web Services (5)
- Zulu-Sprache (5)
- accountability (5)
- action formation (5)
- agency (5)
- agentivity (5)
- argument structure (5)
- copyright (5)
- corpus annotation (5)
- dictionary use (5)
- historische Phonetik (5)
- intersubjectivity (5)
- language learning (5)
- minority language (5)
- phonetics (5)
- semantic similarity (5)
- semantics (5)
- spoken language (5)
- survey (5)
- syntax (5)
- transcription (5)
- Abweichung (4)
- Akustische Phonetik (4)
- Angewandte Linguistik (4)
- Antonym (4)
- Archivierung (4)
- Baltikum (4)
- Bantusprachen (4)
- Bedeutungswandel (4)
- Beschimpfung (4)
- Bewegung (4)
- Bias (4)
- CLARIAH-DE (4)
- CLARIN-D (4)
- Chatten <Kommunikation> (4)
- Czech (4)
- Datensammlung (4)
- Dativ (4)
- Definition (4)
- Direktiv (4)
- Distribution <Linguistik> (4)
- Einbettung <Linguistik> (4)
- Erzählforschung (4)
- Ethik (4)
- Etymologie (4)
- Fehleranalyse (4)
- Fokus <Linguistik> (4)
- Fremdsprachenunterricht (4)
- Fußball (4)
- Geisteswissenschaften (4)
- German language (4)
- Gestik (4)
- Handlungsstruktur <Literatur> (4)
- Hypertext (4)
- Informationsmanagement (4)
- Informationstheorie (4)
- Intonation <Linguistik> (4)
- Isländisch (4)
- Kind (4)
- Komposition <Wortbildung> (4)
- Kontrastive Morphologie (4)
- Konversation (4)
- Kulturkontakt (4)
- Künstliche Intelligenz (4)
- Latin (4)
- Leibniz-Institut für Deutsche Sprache (IDS) (4)
- Metadatenmodell (4)
- Migration (4)
- Mikrozensus (4)
- Mittelhochdeutsch (4)
- Morphem (4)
- Multimodal interaction (4)
- Muttersprache (4)
- Nationalsozialismus (4)
- Natural Language Processing (4)
- Nichtverbale Kommunikation (4)
- Nominalisierung (4)
- Open Science (4)
- Optimalitätstheorie (4)
- Partikelverb (4)
- Patient (4)
- Pidgin (4)
- Planung (4)
- Polysemie (4)
- Privatsphäre (4)
- Proposition (4)
- Präsidentenwahl (4)
- Russland (4)
- Sakkade (4)
- Schwedisch (4)
- Sentiment Analysis (4)
- Slawische Sprachen (4)
- Sportsprache (4)
- Sprachkompetenz (4)
- Sprachverstehen (4)
- Straßenverkehr (4)
- Subjektivität (4)
- Suchmaschine (4)
- Syntaktische Kongruenz (4)
- Technische Infrastruktur (4)
- Textsorte (4)
- Ukrainisch (4)
- Unserdeutsch (4)
- Vergleichende politische Wissenschaft (4)
- Videospiel (4)
- Wort (4)
- Wortlänge (4)
- Wortverbindung (4)
- XML (Extensible Markup Language) (4)
- Zulu (4)
- Zustandsverb (4)
- abusive language (4)
- author name disambiguation (4)
- cognitive linguistics (4)
- colonialism (4)
- corpus management (4)
- discourse analysis (4)
- formulations (4)
- historical lexicography (4)
- historische Lexikographie (4)
- inference (4)
- information theory (4)
- infrastructure (4)
- instructions (4)
- interaction (4)
- language (4)
- language change (4)
- language complexity (4)
- legal issues (4)
- linguistic diversity (4)
- methodology (4)
- multimodal interaction (4)
- oral corpora (4)
- personal data (4)
- phraseology (4)
- prediction (4)
- reply relations (4)
- rules (4)
- sequentiality (4)
- social interaction (4)
- software (4)
- stereotypes (4)
- syllable prominence (4)
- time series analysis (4)
- turn-taking (4)
- youth language (4)
- 17th century (3)
- Adverb (3)
- Afrikanische Sprachen (3)
- Agens (3)
- Anonymisierung (3)
- Argumentation (3)
- Arzt (3)
- Augenbewegung (3)
- Aussprache (3)
- Bibliothekskatalog (3)
- Bildungspolitik (3)
- British English (3)
- Bulgarian (3)
- Bulgarisch (3)
- CLARIAH (3)
- CMDI (3)
- Cluster <Datenanalyse> (3)
- CoRoLa (3)
- Coaching (3)
- Component MetaData Infrastructure (CMDI) (3)
- Component Metadata Infrastructure (CMDI) (3)
- Computerspiel (3)
- Computerunterstütztes Informationssystem (3)
- Computerunterstütztes Lernen (3)
- Computeruntertützte Lexikographie (3)
- Corpus query language (3)
- Culture (3)
- Datenerhebung (3)
- Debatte (3)
- Deixis (3)
- Deklination (3)
- Dictionary use (3)
- Digitale Sprachressourcen (3)
- Diphthong (3)
- Direkte Rede (3)
- Diskriminierung (3)
- Diskursmarker (3)
- Dokumentation (3)
- Elektronische Publikation (3)
- Ellipse <Linguistik> (3)
- Entropie (3)
- Epistemics (3)
- Epistemische Logik (3)
- Estland (3)
- Estonia (3)
- Ethnizität (3)
- Ethnologie (3)
- Europäisierung (3)
- Evaluation (3)
- Feldforschung (3)
- Festschrift (3)
- Food item (3)
- Formale Sprache (3)
- Forschungseinrichtung (3)
- Forschungsprojekt (3)
- Frage-Antwort-System (3)
- Frame semantics (3)
- Freezing principle (3)
- GDPR (3)
- Gebärdensprache (3)
- Germanic (3)
- Geschlechterstereotyp (3)
- Gesture (3)
- Gleichberechtigung (3)
- Glossar (3)
- Grammar (3)
- Grammatiktheorie (3)
- Grassfields Bantu (3)
- Historical lexicography (3)
- Historische Phonetik (3)
- Höheres Bildungswesen (3)
- Hörgerät (3)
- Hörschädigung (3)
- ISO-Norm (3)
- Icelandic (3)
- Informationsstruktur (3)
- Informationssystem (3)
- Informationsverarbeitung (3)
- Interactional Linguistics (3)
- Interdisziplinarität (3)
- Isolationismus (3)
- Japanisch (3)
- Jugendlicher (3)
- Kognition (3)
- Kognitive Semantik (3)
- Kompositum (3)
- Konditional (3)
- Konditionalsatz (3)
- Konflikt (3)
- Kongressbericht (3)
- Konjunktion (3)
- KorAP (3)
- Kreolische Sprachen (3)
- Kroatisch (3)
- Language attitude (3)
- Lautschrift (3)
- Lautstärke (3)
- Lernerwörterbuch (3)
- Lexikogaphie (3)
- Lexikologie (3)
- Lexikostatistik (3)
- Linguistische Informationswissenschaft (3)
- Lokativ (3)
- Lächeln (3)
- Mannheim (3)
- Medien (3)
- Mediensprache (3)
- Medizin (3)
- Meinungsäußerung (3)
- Metadata (3)
- Minderheit (3)
- Modality (3)
- Modalität (3)
- Modalität <Linguistik> (3)
- Modus (3)
- Morphemanalyse (3)
- Morphology (3)
- Nachhaltigkeit (3)
- Named Entity Recognition (3)
- Native speaker (3)
- Natural language processing (3)
- Niederdeutsch (3)
- Normung (3)
- Northern Sotho (3)
- Nutzungsrecht (3)
- Online-Datenbank (3)
- Open Access (3)
- Open Data (3)
- P300 (3)
- Partizipation (3)
- Pedi-Sprache (3)
- Peer-Group (3)
- Pennsylvaniadeutsch (3)
- Personalpronomen (3)
- Perspektivität (3)
- Pidgin-Sprachen (3)
- Plurizentrische Sprache (3)
- Positionsverb (3)
- Programmiersprache (3)
- Progressiv (3)
- Prosa (3)
- Prototyp <Linguistik> (3)
- Quantitative Analyse (3)
- Quantitative Methode (3)
- Raumvorstellung (3)
- Rechtssprache (3)
- Rechtsstellung (3)
- Regel (3)
- Regressionsanalyse (3)
- Relation extraction (3)
- Rezipient (3)
- Richtlinie (3)
- Routinearbeit (3)
- Semantic Web (3)
- Semantische Relation (3)
- Semantisches Netz (3)
- Sequentialanalyse (3)
- Slowenisch (3)
- Sorbisch (3)
- Soziolekt (3)
- Spielregel (3)
- Sprachanalyse (3)
- Sprachnorm (3)
- Sprachstil (3)
- Sprichwort (3)
- Statistischer Test (3)
- Stimmgebung (3)
- Suffix (3)
- Südpazifik (3)
- Tagebuch (3)
- Technologie (3)
- Textgestaltung (3)
- Textverarbeitung (3)
- Textverarbeitung <Psycholinguistik> (3)
- Tourismus (3)
- Twitter (3)
- Ukrainian language (3)
- Verantwortlichkeit (3)
- Verbalphrase (3)
- Vergangenheitstempus (3)
- Verkehrssprache (3)
- Veröffentlichung (3)
- Vorhersagbarkeit (3)
- Vorurteil (3)
- Web corpora (3)
- Website (3)
- Zeitung (3)
- Zipfsches Gesetz (3)
- Zusammenkunft (3)
- abstractness (3)
- acoustic correlates (3)
- action ascription (3)
- agent prominence (3)
- agreement (3)
- animacy (3)
- articulation (3)
- aspect (3)
- collocations (3)
- commitment (3)
- computational models of narrative (3)
- constructicography (3)
- construction grammar (3)
- context (3)
- corpus infrastructures (3)
- corpus linguistic methodology (3)
- corpus-based (3)
- cross-language differences (3)
- dictionary (3)
- dictionary of neologisms (3)
- digital humanities (3)
- digital research infrastructure (3)
- digitale Infrastruktur (3)
- eLexiko (3)
- electronic lexicography (3)
- expectancy violations (3)
- eye movements (3)
- formal semantics (3)
- gender linguistics (3)
- gender-fair language (3)
- grammaticalization (3)
- graph database (3)
- hearing aid use (3)
- hearing impairment (3)
- heritage language (3)
- humanities (3)
- identity (3)
- impression formation (3)
- instruction (3)
- interactional history (3)
- interoperability (3)
- kontrastive Linguistik (3)
- language documentation (3)
- language models (3)
- language planning (3)
- language politics (3)
- language processing (3)
- language technology (3)
- large corpora (3)
- learning (3)
- lexical borrowings (3)
- lexical data (3)
- lexicon (3)
- linguistic research software (3)
- linked data (3)
- loanword (3)
- modality (3)
- multilingual lexicography (3)
- multimodal (3)
- multiword expressions (3)
- neologism (3)
- nonnative accents (3)
- null-hypothesis testing (3)
- online lexicography (3)
- online resources (3)
- paronyms (3)
- pitch range (3)
- pitch variation (3)
- planning (3)
- policy convergence (3)
- positioning (3)
- pragmatics (3)
- projection (3)
- prominence (3)
- psychotherapy (3)
- quantitative approaches (3)
- questions (3)
- reading (3)
- recipient design (3)
- request (3)
- requests (3)
- research data (3)
- research data management (3)
- research into dictionary use (3)
- reusability (3)
- semantic roles (3)
- semantic web (3)
- sentience (3)
- smartphone use (3)
- speech (3)
- speech acts (3)
- spoken language corpora (3)
- spoken language data (3)
- standardization (3)
- tense (3)
- terminology (3)
- tokenization (3)
- transition (3)
- treebanks (3)
- understanding (3)
- variability (3)
- word formation (3)
- word frequency (3)
- word order (3)
- word structure (3)
- Übersetzungswissenschaft (3)
- ASR (2)
- Abfrage (2)
- Ableitung <Linguistik> (2)
- Affirmativer Polaritätsausdruck (2)
- Agency-Theorie (2)
- Akkusativ (2)
- Akustik (2)
- Allomorph (2)
- Amazonas (2)
- Amerikanismus (2)
- Amondawa-Sprache (2)
- Amtssprache (2)
- Anpassung (2)
- Apokope (2)
- Appellativum (2)
- Arabic (2)
- Arabisch (2)
- Archiv für Gesprochenes Deutsch (AGD) (2)
- Argument <Linguistik> (2)
- Articulography (2)
- Aspekt (2)
- Attributsatz (2)
- Auftritt (2)
- Ausgrenzung (2)
- Aussagesatz (2)
- Autofahren (2)
- Automatic recognition of speech (2)
- Automatische Sprachproduktion (2)
- Automatisches Beweisverfahren (2)
- Autor (2)
- BNC (2)
- Baltic States (2)
- Baltische Sprachen (2)
- Begegnung (2)
- Benutzerverhalten (2)
- Bericht (2)
- Beurteilung (2)
- Bewegungsverb (2)
- Bibliografische Daten (2)
- Bibliothek (2)
- Bildungssystem (2)
- Biographie (2)
- Blickkontakt (2)
- Bologna-Prozess (2)
- Brasilien (2)
- CELEX (2)
- COHA (2)
- Case (2)
- Categories of PSMs (2)
- Chinesisch (2)
- Church Slavonic (2)
- Clarin (2)
- Compterunterstützte Lexikografie (2)
- Computational linguistics (2)
- Computer-Mediated Communication (2)
- Computer-mediated communication (2)
- Computerlingustik (2)
- Constraint-Erfüllung (2)
- Conversational alignment (2)
- Copyright (2)
- Corpora (2)
- Corpus Linguistics (2)
- Creative Commons (2)
- DARIAH (2)
- DMC (2)
- Dank (2)
- Decision Trees (2)
- Dekomposition (2)
- Denken (2)
- Dependenzgrammatik (2)
- Deskriptive Linguistik (2)
- Deutsch als Fremdsprache (2)
- Deutschland <Bundesrepublik> (2)
- Deutschland. Deutscher Bundestag (2)
- Dezentralisation (2)
- Dialectology (2)
- Dictionaries (2)
- Didaktik (2)
- Digitale Kommunikation (2)
- Disambiguierung (2)
- Diskurssemantik (2)
- Dokumentenserver (2)
- Dortmunder Chat-Korpus (2)
- E-Learning (2)
- EFNIL (2)
- ERP (2)
- Einbettungssatz <Linguistik> (2)
- Einleitung (2)
- Einsprachigkeit (2)
- Einstellung (2)
- Einwanderer (2)
- Elektronische Bibliothek (2)
- Elektronisches Forum (2)
- Elektrophysiologie (2)
- Emotion (2)
- Empfehlungssystem (2)
- Empirical research (2)
- Empowerment (2)
- Entscheidungsbaum (2)
- Entscheidungsfrage (2)
- Estnisch (2)
- Europäische Föderation Nationaler Sprachinstitutionen (EFNIL) (2)
- Europäische Sportkonferenz (2)
- Europäische Union : Datenschutz-Grundverordnung (2)
- Evaluation methodologies (2)
- FAIR data principles (2)
- Fahrstunde (2)
- Faux amis (2)
- Fernsehsendung (2)
- Fertigkeit (2)
- Flexion (2)
- Folgerung (2)
- Font (2)
- Formalisierung (2)
- Formulierung (2)
- Frame-Theorie (2)
- Framing-Effekt (2)
- Frequency (2)
- Frequenz (2)
- Friesisch (2)
- Frühneuhochdeutsch (2)
- Funktionale Grammatik (2)
- Futur (2)
- Färöisch (2)
- Gedächtnis (2)
- Gefangenenliteratur (2)
- Gefühlsverb (2)
- Geistiges Eigentum (2)
- Generalized additive modeling (2)
- Generalversammlung (2)
- Generative Semantik (2)
- Genitiv (2)
- Genitive Classification (2)
- Genus (2)
- Geoinformationssystem (2)
- GermaNet (2)
- German Americans (2)
- German as a foreign language (2)
- German-based (2)
- Geschlechterforschung (2)
- Gesprächsforschung (2)
- Geste (2)
- Gleichheit (2)
- Google Books Ngram corpora (2)
- Google Ngram Corpora (2)
- Graded tense (2)
- Gruppenidentität (2)
- Gälisch-Schottisch (2)
- Haftung (2)
- Hamlet (2)
- Hausa-Sprache (2)
- Hebrew (2)
- Hebräisch (2)
- Helfen (2)
- Hilfsverb (2)
- Historische Soziolinguistik (2)
- Hochlettisch (2)
- Hochschulbildung (2)
- Honesty (2)
- ISO (2)
- Ideologie (2)
- Imperfekt (2)
- Implementation (2)
- Indirect speech (2)
- Indirekte Rede (2)
- Infinitkonstruktion (2)
- Information Science (2)
- Inklusion <Soziologie> (2)
- Institut für Deutsche Sprache (2)
- Integration (2)
- Intention (2)
- Interactional history (2)
- Interactional linguistics (2)
- Interaktionale Linguistik (2)
- Interferenz <Linguistik> (2)
- Internationale Politik (2)
- Interoperability (2)
- Interpretation (2)
- Intransitives Verb (2)
- Jensen-Shannon divergence (2)
- Jugend (2)
- Kategorialgrammatik (2)
- Kategorisierung (2)
- Kausalität (2)
- Kiezdeutsch (2)
- Kindersprache (2)
- Klassifikation (2)
- Kognitiver Prozess (2)
- Koloniallinguistik (2)
- Kolonie (2)
- Komitativ <Kasus> (2)
- Kommunikationsverb (2)
- Kommunikativer Sinn (2)
- Komplementierer (2)
- Komposition (2)
- Konsonant (2)
- Konstruktion <Linguistik> (2)
- Kontrolle <Linguistik> (2)
- Konversationanalyse (2)
- Koordination <Linguistik> (2)
- KorAP (Korpusanalyseplattform der nächsten Generation) (2)
- Koreanisch (2)
- Korpus <Linguistik (2)
- Korpustechnologie (2)
- Kultur (2)
- Kulturpsychologie (2)
- Kulturvergleich (2)
- Language (2)
- Language Variation (2)
- Lautquantität (2)
- Lautwandel (2)
- Lehrmittel (2)
- Lernen (2)
- Lernsoftware (2)
- Lesen (2)
- Leseverhalten (2)
- Lexem (2)
- Lexicon (2)
- Lexikalisch funktionale Grammatik (2)
- Lexikgraphie (2)
- Lingua Franca (2)
- Linked Data (2)
- Literary corpus (2)
- Logdatei (2)
- Logische Semantik (2)
- Lower Sorbian (2)
- Lyrics <Lyrik> (2)
- MMAX (2)
- MTAS (2)
- Machine Leaming (2)
- Mandarin (2)
- Maschinelle Sprachverarbeitung (2)
- Meaning (2)
- Mediation (2)
- Medienlinguistik (2)
- Meinung (2)
- Meinungsverb (2)
- Mental verb constructions (2)
- Mikrostruktur (2)
- Mimik (2)
- Mobilität (2)
- Modalpartikel (2)
- Modeling (2)
- Morphology of the Folktale (2)
- Multikulturelle Gesellschaft (2)
- Multimedia (2)
- MySQL (2)
- Mündliche Kommunikation (2)
- NFDI (2)
- Narrative (2)
- Nationalsozialistische Verbrechen (2)
- Nationalsprache (2)
- Neologie (2)
- Neologimus (2)
- Neologisms (2)
- Neumelanesisch (2)
- Nicht-kanonisches Subjekt (2)
- Norwegen. Sameting (2)
- Norwegisch (2)
- Notation (2)
- NottDeuYTSch corpus (2)
- NottDeuYTSch-Korpus (2)
- OCR (2)
- OCR-Schrift (2)
- OWID (2)
- Online dictionary (2)
- Online-Medien (2)
- Online-Spiel (2)
- Opinion Inference (2)
- Optische Zeichenerkennung (2)
- Ortsadverb (2)
- Ortsname (2)
- Papua-Neuguinea (2)
- Paradigma (2)
- Parlament (2)
- Paronymie (2)
- Parsing (2)
- Part-of-Speech-Tagging (2)
- Part-of-Speech-Tagging = POS (2)
- Past interpretation (2)
- Perfekt (2)
- Perspektivierung (2)
- Phonem (2)
- Pitch contour (2)
- Pitch matching (2)
- Pokorny, Julius (2)
- Polarity classification (2)
- Politik (2)
- Politische Beteiligung (2)
- Preference organization (2)
- Pro-Drop-Parameter (2)
- Problem Solving Methods (2)
- Projection (2)
- Projektion <Psychologie> (2)
- Prose (2)
- Prosodic similarity (2)
- Prädikation (2)
- Psychologie (2)
- Psychologische Diagnostik (2)
- Psychverb (2)
- Python <Programmiersprache> (2)
- Qualitative Methode (2)
- Quantitative Linguistik (2)
- Reaktion (2)
- Rechtsfrage (2)
- Redeerwähnung (2)
- Redigieren (2)
- Referenz <Linguistik> (2)
- Register <Linguistik> (2)
- Relativsatz (2)
- Requests (2)
- Resources (2)
- Reuse (2)
- Rezeption (2)
- Rhetorik (2)
- Ripuarian (2)
- Romanische Sprachen (2)
- Russian (2)
- Russlanddeutsche (2)
- Rückmeldesignal (2)
- SGML (2)
- SKOS (2)
- Sapir-Whorf-Hypothese (2)
- Satzakzent (2)
- Satzanalyse (2)
- Satzverbindung (2)
- Schimpfwort (2)
- Schottland (2)
- Schriftstück (2)
- Schwerhörigkeit (2)
- Scottish Gaelic (2)
- SemEval (2)
- Semantics (2)
- Semantische Verbklasse (2)
- Sentiment Analyse (2)
- Sentiment analysis (2)
- Serbisch (2)
- Service provider (2)
- Shakespeare, William (2)
- Similarities (2)
- Slowakisch (2)
- Smile (2)
- Softwarewerkzeug (2)
- Sorbian institute (2)
- Sozialberuf (2)
- Soziale Software (2)
- Speech synthesis (2)
- Sport (2)
- Sprachentwicklung (2)
- Spracherhaltung (2)
- Sprachkonflikt (2)
- Sprachphilosophie (2)
- Sprachpurismus (2)
- Sprachstörung (2)
- Sprachvariation (2)
- Sprachvergleich (2)
- Sprachverlust (2)
- Sprecher (2)
- Sprechmaschine (2)
- Standard (2)
- Stereotypisierung (2)
- Stichprobenumfang (2)
- Studie (2)
- Subjekt <Linguistik> (2)
- Swedish (2)
- Syntagma (2)
- Tagging (2)
- Technischer Fortschritt (2)
- Telefonieren (2)
- Temporaladverb (2)
- Terminologiemanagement (2)
- Text (2)
- Text classification (2)
- Text-to-Speech (2)
- Textkohärenz (2)
- Textstruktur (2)
- Theater rehearsals (2)
- Thesaurus (2)
- Tok Pisin (2)
- Topikalisierung (2)
- Tote Sprachen (2)
- Tracy, Rosemarie (2)
- Treebanks (2)
- Trees/Graphs (2)
- Trump, Donald (2)
- Tupi-Guarani-Sprachen (2)
- Turn construction (2)
- Turn-beginnings (2)
- UGC (2)
- Understanding in interaction (2)
- Universal Dependencies (2)
- Universitätsbibliothek (2)
- Uralische Sprachen (2)
- Valenz (2)
- Validating (2)
- Verbalaggression (2)
- Verbbedeutung (2)
- Verbsemantik (2)
- Verhalten (2)
- Verwaltungssprache (2)
- Vielfalt (2)
- Vietnamese (2)
- Virtual Language Observatory (VLO) (2)
- Virtuelle Realität (2)
- Vollversammlung (2)
- Vorumaisch (2)
- Wahlverhalten (2)
- Warlpiri (2)
- Web (2)
- Westsamoa (2)
- Wiedervereinigung <Deutschland> (2)
- Wiktionary (2)
- Wirtschaft (2)
- Wissenschaftler (2)
- Wissenschaftliche Kooperation (2)
- Wissenschaftsforschung (2)
- Wissenspräsentation (2)
- Wissensvermittlung (2)
- Word formation (2)
- Word length (2)
- WordNet (2)
- Workplace studies (2)
- Wortart (2)
- Wortwahl (2)
- XQuery (2)
- Zeichensprache (2)
- Zeitungsartikel (2)
- Zipf's law (2)
- Zipf’s law (2)
- Zufriedenheit (2)
- Zusammenfassung (2)
- Zuschauer (2)
- Zweitsprache (2)
- Zweitspracherwerb (2)
- acceptability judgements (2)
- access structures (2)
- acoustic analysis (2)
- advanced search options (2)
- affect (2)
- agent prototypicality (2)
- annotation guidelines (2)
- annotation scheme (2)
- annotation tool (2)
- anonymisation (2)
- anotación de corpus (2)
- artificial intelligence (2)
- audiovisual data (2)
- automatic transcription (2)
- bibliographic metadata (2)
- bilingual dictionaries (2)
- blending (2)
- categorisation (2)
- clipping (2)
- closing (2)
- cmc corpora (2)
- co-presence (2)
- cognitive lexicography (2)
- collocation (2)
- collocation analysis (2)
- colonial language contact (2)
- colonial linguistics (2)
- computational linguistics (2)
- computer-assisted language learning (2)
- computer-mediated communication (CMC) (2)
- conditional connectives (2)
- conditionals (2)
- confusables (2)
- conjunction (2)
- constructional meaning (2)
- contrastive analysis (2)
- contrastive linguistics (2)
- controlled natural language (2)
- cooperation (2)
- coordination (2)
- copula (2)
- corpus curation (2)
- corpus frequency (2)
- corpus reusability (2)
- corpus semantics (2)
- corpus study (2)
- corpus-based lexicography (2)
- correlate (2)
- courses of action (2)
- culture (2)
- data (2)
- data analysis (2)
- data collection (2)
- data migration (2)
- data mining (2)
- data quality (2)
- data repositories (2)
- database (2)
- declarative (2)
- deduplication (2)
- dependency parsing (2)
- diachronic corpora (2)
- diachronic corpus linguistics (2)
- dialog (2)
- dictionaries (2)
- dictionary culture (2)
- dictionary design (2)
- dictionary writing system (2)
- disambiguation (2)
- discourse (2)
- discourse semantics (2)
- discrimination (2)
- distributional semantics (2)
- driving lessons (2)
- driving school (2)
- e-lexicography (2)
- easily confused words (2)
- eben (2)
- embodied responses (2)
- embodiment (2)
- ethics (2)
- ethnography (2)
- event-related brain potentials (2)
- expectation (2)
- experiencer (2)
- eye-movements (2)
- eyetracking (2)
- face (2)
- false friends (2)
- family language policy (2)
- feedback (2)
- fixation-related potentials (2)
- flagging (2)
- formulation (2)
- free variation (2)
- gaze (2)
- gender equality (2)
- gender-inclusive language (2)
- general assembly (2)
- general language dictionaries (2)
- general language dictionary (2)
- generalized divergence (2)
- generalized entropy (2)
- genre and register variation (2)
- geschriebene Sprache (2)
- gesture (2)
- glossaries (2)
- grammar (2)
- grammar and syntax (2)
- grammatical information system (2)
- grammatical terminology (2)
- grammatical variation (2)
- grammatische Terminologie (2)
- grammis (2)
- gratitude (2)
- helping relationship (2)
- higher education research (2)
- history of lexicography (2)
- history of phonetics (2)
- ideology (2)
- impact assessment (2)
- impact categories (2)
- imperative (2)
- informal interaction (2)
- interactional semantics (2)
- interaktionale Semanitik (2)
- international language (2)
- internet forums (2)
- internet lexicography (2)
- interpretation (2)
- intonation (2)
- it (2)
- it-clefts (2)
- kognitive Linguistik (2)
- kontextuelle Bedeutung (2)
- language comprehension (2)
- language functions (2)
- language ideology (2)
- language learners (2)
- language portal (2)
- language statistics (2)
- language status (2)
- language universals (2)
- late positivity (2)
- lay-lexicography (2)
- learner corpus (2)
- lexical borrowing (2)
- lexical database (2)
- lexical information system (2)
- lexical innovation (2)
- lexical richness (2)
- lexical semantics (2)
- lexicographic database (2)
- lexicography and war (2)
- lexicography equality (2)
- lexicology (2)
- liability (2)
- linguistic data (2)
- linguistic variation (2)
- loanword lexicography (2)
- log files (2)
- long-term archival (2)
- longitudinal study (2)
- machine learning methods (2)
- mechanical speech synthesis (2)
- membership categorization (2)
- metaphor (2)
- methods (2)
- microstructure (2)
- minority languages (2)
- missionary linguistics (2)
- modal verbs (2)
- morphological analysis (2)
- motion verbs (2)
- multi-level annotation (2)
- multi-party dialog (2)
- multiparty interaction (2)
- mysql (2)
- n-grams (2)
- negation (2)
- neological lexicography (2)
- neologism dictionaries (2)
- newsmark (2)
- non-native speech (2)
- norms (2)
- noun–pronoun ratio (2)
- ob <Wort> (2)
- online language (2)
- opinion frames (2)
- opinion mining (2)
- optimality theory (2)
- ordinary conversation (2)
- organized helping (2)
- orthography (2)
- overlapping talk (2)
- parallel corpora (2)
- parser adaptation (2)
- part-of-speech (POS) (2)
- part-of-speech ontology (2)
- participation framework (2)
- pedagogical lexicography (2)
- perception (2)
- perception experiment (2)
- phonemic representation (2)
- phonological grammar (2)
- phonology (2)
- plurale tantum (2)
- pluricentric (2)
- policy diffusion (2)
- politischer Diskurs (2)
- polysemy (2)
- power law (2)
- practice (2)
- precision (2)
- predictability (2)
- predictive coding (2)
- predictive processing (2)
- prepositions (2)
- priming (2)
- privacy (2)
- pro-drop (2)
- problem-solving approach (2)
- production (2)
- professional lexicography (2)
- pronoun resolution (2)
- pseudo-coordination (2)
- public space (2)
- punctual verb (2)
- query (2)
- read speech (2)
- reference corpora (2)
- register variation (2)
- relationship (2)
- relationship management (2)
- relationships (2)
- repair (2)
- representativeness (2)
- response tokens (2)
- responsive action (2)
- sample size (2)
- scalability (2)
- second language acquisition (2)
- second language learning (2)
- semantic role labeling (2)
- sense discrimination (2)
- sentence processing (2)
- serif (2)
- shortening (2)
- showing sequences (2)
- sitzen <Wort> (2)
- smartphones (2)
- social action (2)
- software quality management (2)
- specialized lexicography (2)
- specification (2)
- specificational clause (2)
- speech corpus (2)
- speech planning (2)
- speech production (2)
- spoken Czech (2)
- standard (2)
- standards (2)
- stehen <Wort> (2)
- stereotyping (2)
- stops (2)
- subjectivity (2)
- subjunctive (2)
- synonyms (2)
- syntactic complexity (2)
- tagging (2)
- talk-in-interaction (2)
- task-oriented dialogue (2)
- text classification (2)
- text corpus (2)
- text length (2)
- text mining (2)
- text production (2)
- theater (2)
- transnational communication (2)
- tun (2)
- turn taking (2)
- type token ratio (2)
- typology (2)
- understudied languages (2)
- usage-based model (2)
- user guidance (2)
- user interface (2)
- valency (2)
- variation (2)
- verbs (2)
- virtual collections (2)
- visualisation (2)
- visualization (2)
- vocabulary size (2)
- von Kempelen, Wolfgang (2)
- wenn (2)
- wiktionary (2)
- word predictability (2)
- word senses (2)
- workplace studies (2)
- written language (2)
- Älterer Mensch (2)
- Ästhetik (2)
- Öffentlicher Raum (2)
- Österreich (2)
- Übung (2)
- (discrepancy of) expectation (1)
- (enhanced) webcomics (1)
- (multimodal) instructions (1)
- (multimodale) Instruktionen (1)
- (re-)openings (1)
- (un)certainty (1)
- Online dictionary (1)
- 1/3 power law (1)
- 19th Century (1)
- 2008 (1)
- 3-Circle-Model (1)
- ASD (1)
- Ablaut (1)
- Absentiv (1)
- Abstractness (1)
- AcI (1)
- Access Control (1)
- Acquisition (1)
- Action formation (1)
- Active Learning (1)
- Active learning (1)
- Additional Language of Society (1)
- Adjazenz (1)
- Adjective (1)
- Adposition (1)
- Adressat (1)
- Adressatenzuschnitt (1)
- Adverbial Noun Phrases (AdvNps) (1)
- Adverbiale (1)
- Aerodynamik (1)
- Affekt (1)
- Affirmative (1)
- Affix (1)
- Affixoid (1)
- African languages (1)
- African languages dictionaries (1)
- Afrikaans (1)
- Afrikatale (1)
- Agreement <Syntax> (1)
- Argumentation (1)
- Aichinger, Ilse (1)
- Akademischer Grad (1)
- Akkadisch (1)
- Akkulturation (1)
- Akronym (1)
- Akteur (1)
- Akustische Analyse (1)
- Akzeptabilität (1)
- Allgemeinwissen (1)
- Allomorphy (1)
- Alltag (1)
- Alltagsgespräche (1)
- Alltagssprache (1)
- Altchinesisch (1)
- Altenbild (1)
- Alter (1)
- Altertumswissenschaft (1)
- Altgriechisch (1)
- Altägyptisch (1)
- Alveolar (1)
- Amazonia (1)
- Amazonian languages (1)
- American politics (1)
- Amerikanisches Englisch (1)
- Anapher (1)
- Anapher <Rhetorik> (1)
- Ancient Greek (1)
- Ancient Greek language (1)
- Ancient Greek scholarship (1)
- Angst (1)
- Annotation guidelines (1)
- Annotation of causal language (1)
- Annotation of discourse relations (DRs) (1)
- Annotations (1)
- Annotator Agreement (1)
- Anspielung (1)
- Antezedens <Linguistik> (1)
- Antizipation (1)
- Antwortpartikel (1)
- Antwortrelationen (1)
- Antwortstrukturen (1)
- Anweisung (1)
- Anwendung (1)
- Anwendungsbereich (1)
- Anwesenheit (1)
- Anxiety (1)
- Arbeitsbündnis (1)
- Arbeitsplatz (1)
- Arbeitsstudie (1)
- Architectures (1)
- Architektur (1)
- Areallinguistik <Typologie> (1)
- Argument (1)
- Argument structure (1)
- Argumentrealisierung (1)
- Arizona (1)
- Articulatory settings (1)
- Arzt-Patient-Interaktion (1)
- Asian Americans (1)
- Aspect (1)
- Assertion (1)
- Assimilation <Soziologie> (1)
- Assistance (1)
- Assoziationsexperiment (1)
- Assoziationsmaß (1)
- Astrolabe-Bay (1)
- Attribution (1)
- Audio-video Synchronisation (1)
- Aufforderung (1)
- Aufforderungssatz (1)
- Aufsatzsammlung (1)
- Aushandlung (1)
- Auskunftsanspruch (1)
- Auslassung (1)
- Auslaut (1)
- Ausrichten <Technik> (1)
- Austausch (1)
- Authentische Ressourcen (1)
- Authentizität (1)
- Autismus (1)
- Autochthon (1)
- Autocorrelated errors (1)
- Autokorrelation (1)
- Automated information (1)
- Automatisch (1)
- Automatische Indexierung (1)
- Automatische Klassifikation (1)
- Automatische Lauterkennung (1)
- Automatische Lautidentifizierung (1)
- Automatische Sprachanalyse (1)
- Automatische Worterkennung (1)
- Automobil <Personenkraftwagen> (1)
- Autonomie (1)
- Autorin (1)
- Autorschaft (1)
- Außenpolitik (1)
- BERT (1)
- Bairisch (1)
- Balkansprachen (1)
- Baltic states (1)
- Bangante Sprache (1)
- Bantu morphology (1)
- Barack Obama (1)
- Bartmiński, Jerzy (1)
- Baskisch (1)
- Basnage de Beauval (1)
- Basque language (1)
- Bautzen (1)
- Bayesian inference (1)
- Bearbeitung von Korpusanfragen (1)
- Bedeutungserweiterung (1)
- Bedeutungsvielfalt (1)
- Bedienungsanleitung (1)
- Bedrohte Sprache (1)
- Begriffsgeschichte <Fach> (1)
- Beispiel (1)
- Benennung (1)
- Benin (1)
- Benin (West Africa) (1)
- Benutzerfreundlichkeit (1)
- Benutzerführung (1)
- Benutzung (1)
- Benutzungsforschung (1)
- Beratung (1)
- Berufsbezeichnung (1)
- Beschreibung (1)
- Beschuldigung (1)
- Beschwerdebrief (1)
- Best-Practice (1)
- Beteiligung (1)
- Betrieb (1)
- Bewertung (1)
- Bibel. Altes Testament (1)
- Bibliographie 1960-1985 (1)
- Bibliography (1)
- Big Two (1)
- Bildungswesen (1)
- Bilingualised dictionary (1)
- Bilingualismus (1)
- Bittbrief (1)
- Blended learning (1)
- Blick (1)
- Blickregistrierung (1)
- Blickverhalten (1)
- Blindheit (1)
- Bologna Process (1)
- Bootstrapping methods (1)
- Borrowing (1)
- Bosnian (1)
- Bosnisch (1)
- Brazilian Portuguese dictionaries (1)
- Brettspiel (1)
- British National Corpus (1)
- British twenty first century lexicography (1)
- Brown clustering (1)
- Buchstabe (1)
- Buchstabenhäufigkeit (1)
- Burgenland (1)
- C++ (1)
- CAQDAS (1)
- CART (1)
- CLARIN Knowledge Sharing Infrastructure (1)
- CLARIN Legal Issues Committee (CLIC) (1)
- CLARIN infrastructure (1)
- CMC (International Conference on Cooperative Multimodal Communication) <2023, Mannheim> (1)
- CMC Corpora (1)
- CMC corpora (1)
- CMC corpus (1)
- CMDI experiences (1)
- CMDI infrastructure use (1)
- CMDI metadata (1)
- CMDI profile creation (1)
- CNL (1)
- COVID-19 discourse (1)
- CQLF (1)
- CSC (1)
- CTS (1)
- Canonical text services (1)
- Carl Friedrich Aichinger (1)
- Ceteris paribus laws (1)
- Chadic (1)
- Change (1)
- China (1)
- Chirurgie (1)
- Christian Ludwig (1)
- Chunking (1)
- Cinie Louw (1)
- Citizen Science (1)
- Clarín (1)
- Clusters (1)
- Co-Reference (1)
- CoMParS (1)
- CoRDI 2023 (1)
- Code (1)
- Codierung (1)
- Cognitive Bootstrapping (1)
- Cognitive artefacts (1)
- Collocation analysis (1)
- Collocations (1)
- Comic (1)
- Comitative Construction (1)
- Comitative Preposition (1)
- Comitative case (1)
- Common ground (1)
- Communicative Functions (1)
- Communion (1)
- Community theatre (1)
- Comparable Corpus (1)
- Comparable corpora (1)
- Comparison of representations and representational formats (1)
- Competence Theories (1)
- Complexity theory (1)
- Component Metadata Description Infrastructure (1)
- Composition (1)
- Compositional Semantics (1)
- Computational lexicography (1)
- Computationelle Semantik (1)
- Computer-Assisted Language Learning (CALL) (1)
- Computerprogramm (1)
- Computerunterstützte Übersetzung (1)
- Computervermittelte Kommunikation (1)
- Computing in the Humanities (1)
- Conceptual metaphor (1)
- Concurrency (1)
- Concurrent markup (1)
- Consonant (1)
- Construction Grammar (1)
- Consultation behavior (1)
- Context (1)
- Contextual meaning (1)
- Contrary and complementary opposites (1)
- Contrast (1)
- Contrastive linguistics (1)
- Controlled Natural Language (CNL) (1)
- Conversational Feedback (1)
- Conversational analysis (1)
- Conversational rhetoric (1)
- Coordination (1)
- Coreference (1)
- Corpora (Linguistics) (1)
- Corporate Identity (1)
- Corpus (1)
- Corpus Analysis (1)
- Corpus Comparison (1)
- Corpus Management (1)
- Corpus Tools (1)
- Corpus query platform (1)
- Corpus-based retrieval (1)
- Corruption (1)
- Couplet (1)
- Covariation (1)
- Creole languages (1)
- Croatian (1)
- Cross references (1)
- Cross-cultural psychology (1)
- Cross-linguistic conversation analysis (1)
- Crowdsourcing (1)
- Cultural metric (1)
- Cyber-Mobbing (1)
- Cyrillic (1)
- DARIAH-DE (1)
- DKPro repository (1)
- DMPTY (1)
- DO-cleft (1)
- DRs in spoken and written genres (1)
- DRuKoLA (1)
- DSSSL (1)
- Dagestan (1)
- Darmstadt Knowledge Processing Software Repository (1)
- Darstellungsart (1)
- Data Architecture (1)
- Data Augmentation (1)
- Data Formats (1)
- Data Governance Act (1)
- Data Innovation Board (1)
- Data Science (1)
- Data Visualization (1)
- Data altruism (1)
- Data mining (1)
- Database Management Systems (1)
- Dateiformat (1)
- Datenaufbereitung (1)
- Datenaustausch (1)
- Datenbank für Gesprochenes Deutsch (DGD) (1)
- Datenbank für gesprochenes Deutsch = DGD (1)
- Datenerfassung (1)
- Datenkompetenz (1)
- Datenkonvertierung (1)
- Deep learning (1)
- Deutschamerikaner (1)
- Definitheit (1)
- Definitionen (1)
- Deliberation (1)
- Demonstrativpronomen (1)
- Dependency Parsing (1)
- Dependenz (1)
- Depression (1)
- Derivation (1)
- Derivation <Linguistik> (1)
- Determinator (1)
- Deutsche Gebärdensprache (DGS) (1)
- Deutsches Referenzkorpus zur internetbasierten Kommunikation (DeRiK) (1)
- Deutsches Spracharchiv (1)
- Deutschland (DDR) (1)
- Deutschland (Westliche Länder) (1)
- Deutschland <DDR> (1)
- Developmental Robotics (1)
- Devolution (1)
- Diachronie (1)
- Dialekt (1)
- Dialektgeografie (1)
- Dialogue (1)
- Diccionario de la lengua Española (Madrid) (1)
- Diccionario histórico de la lengua española (1)
- Dichtersprache (1)
- Dictionary and text analysis (1)
- Dictionary editing software (1)
- Dictionary encoding (1)
- Dictionary use strategies (1)
- Dictionnaire universel (1)
- Die Sprach-Checker (1)
- Differential object marking (1)
- Differenzielle Objektmarkierung (1)
- Digital Humanities Studium (1)
- Digital Library (1)
- Digital lexical systems (1)
- Digitale Daten (1)
- Digitale Forschungsinfrastruktur (1)
- Digitale Geisteswissenschaften (1)
- Digitale Revolution (1)
- Digitaler Sprachassistent (1)
- Digitales Wörterbuch der deutschen Sprache (DWDS) (1)
- Diminutiv (1)
- Direct Speech (1)
- Direct speech (1)
- Directive 95/46/EC (1)
- Directive on Copyright in the Digital Single Market (1)
- Directive particles (1)
- Disambiguation (1)
- Discourse Representation Theory (1)
- Discourse analysis (1)
- Discourse annotation (1)
- Discourse parsing (1)
- Discourse relations (1)
- Diskursivität (1)
- Diskurstheorie (1)
- Diskurstopik (1)
- Dispositiv (1)
- Distance learning (1)
- Distributional semantics (1)
- Distributionsidiosynkrasie (1)
- Document Classification (1)
- Document Images (1)
- Document structure (1)
- Documentation (1)
- Dokument (1)
- Dokumentenverarbeitung (1)
- Dokumentverarbeitung (1)
- Dolmetschen (1)
- Domain-specific Relation Extraction (1)
- Dominanz (1)
- Donald Trump (1)
- Double verb constructions (1)
- Dublin Core (1)
- Duits (1)
- Dutch (1)
- Dynamische Psychotherapie (1)
- Dzongkha (1)
- Dänisch (1)
- E-dictionary (1)
- E-lexicography (1)
- EEG (1)
- ELEXIS (1)
- EMLex (1)
- EOSC (1)
- EURALEX (20 : 2022 : Mannheim) (1)
- EURALEX International Congress (1)
- Early New High German (ENHG) (1)
- Early responses (1)
- Edition (1)
- Editor (1)
- Educational software (1)
- Effects (1)
- Effizienz (1)
- Egozentrismus (1)
- Ehe (1)
- Eigengruppe (1)
- Eigentum (1)
- Eigentumsrecht (1)
- Einführung (1)
- Einwanderung (1)
- Ejektiv (1)
- Electronic Lexicography (1)
- Electronic dictionaries (1)
- Electronic dictionary (1)
- Elektronisches Buch (1)
- Elizabeth Weir (1)
- Ellipse (1)
- Eltern (1)
- Embodiment (1)
- Emergence (1)
- Empfehlung (1)
- Empfindung (1)
- Empirical database (1)
- Endlicher Zustandsraum (1)
- Englisch als Lingua Franca-Interaktionen (1)
- Englischunterricht (1)
- English lingua franca interactions (1)
- English monolingual learner’s dictionaries (1)
- Entwicklungspsychologie (1)
- Epistemizität (1)
- Ereignis (1)
- Ereignisdatenanalyse (1)
- Ereigniskorreliertes Potenzial (1)
- Ereignissemantik (1)
- Erlebte Rede (1)
- Error analysis (1)
- Error classification (1)
- Erwachsenenbildung (1)
- Erzählen (1)
- Erzählstruktur (1)
- Erzähltheorie (1)
- Erzählung (1)
- Ethnische Gruppe (1)
- EuReCo (1)
- Europarat (1)
- European Americans (1)
- European Association for Lexicography (1)
- European Reference Corpus (EuReCo) (1)
- European Strategy for Data (1)
- Europeanisms (1)
- Europäische Kommission. Digital Single Market (1)
- Europäische Sprachen (1)
- Europäische Union: Datenschutz-Grundverordnung (1)
- Event mapping (1)
- Events (1)
- Evidentialität (1)
- Evolution (1)
- Evoziertes Potenzial (1)
- Experte (1)
- Expertenmeinung (1)
- Explanation (1)
- Expletiv (1)
- Expressionismus (1)
- FAIR (1)
- FAIR Index (1)
- FAIR data (1)
- FML (1)
- FO prediction (1)
- FSR (1)
- Fachwissen (1)
- Fahrunterricht (1)
- Fair Use (1)
- Fantasiespiel (1)
- Faroese (1)
- Feature engineering (1)
- Federated Content Search (FCS) (1)
- Feedback (1)
- Feedback marker (1)
- Fehler (1)
- Feldpost (1)
- Fernsehduell (1)
- Fernsehen (1)
- Fernsehinterview (1)
- Fernsehsprache (1)
- Fernunterricht (1)
- Fiktion (1)
- Film (1)
- Filmkritik (1)
- Finalsatz (1)
- Finnic minorities of Ingria (1)
- Fokus (1)
- Food Domain (1)
- Food domain (1)
- Footing Shifts (1)
- Formal learning (1)
- Formalization (1)
- Formulation (1)
- Forschungs- und Lehrkorpus Gesprochenes Deutsch (FOLK) (1)
- Forschungs- und Lehrkorpus Gesprochenes Deutsch = FOLK (1)
- Forschungsbericht (1)
- Forschungsprozess (1)
- Fortschrittlichkeit (1)
- Forum Deutsche Sprache (1)
- Fotografie (1)
- Fracto-morphèmes (1)
- Fragen (1)
- Fragment (1)
- France (1)
- Frankreich (1)
- Frau (1)
- Frauenforschung (1)
- Frauensport (1)
- Freiheit (1)
- Fremd-initiierte Reparaturen (1)
- Fremdgruppe (1)
- Fremdwort (1)
- French-German (1)
- Frequenzanalyse (1)
- Frisian (1)
- Frisian Act (1)
- Frühneuhochdeutsche Wörterbuch (1)
- Frühneuhochdeutsches Wörterbuch (1)
- Funktionale Kategorie (1)
- Funktionelle Kernspintomografie (1)
- Fußballsprache (1)
- Förderung (1)
- Führungskraft (1)
- GB-Theorie (1)
- GDE-V (1)
- GIS (1)
- GOLD standard (1)
- GUI (1)
- Gamification (1)
- Gangsta-Rap (1)
- Gastarbeiterdeutsch (1)
- Gebrauchsstandard (1)
- Gefühlsausdruck (1)
- Gemeinschaft (1)
- Gender (1)
- Gender egalitarianism (1)
- Gender stereotypes (1)
- General Data Protection Regulation (GDPR) (1)
- Generation (1)
- Generative Syntax (1)
- Generative Transformationsgrammatik (1)
- Generic Document Structure (1)
- Generic Search (GS) (1)
- GeoBib (1)
- GeoHumanities (1)
- Geopolitik (1)
- Georgian language (1)
- Georgisch (1)
- Geriatrie (1)
- Germaans (1)
- German Language Atlas (1)
- German Microcensus (1)
- German Reference Corpus (DeReKo) (1)
- German Verschmelzungsformen (contracted forms) (1)
- German clause structure (1)
- German clause-embedding predicates (1)
- German colonialism (1)
- German data (1)
- German definitions on garments (1)
- German dialects (1)
- German grammar (1)
- German interrogative embedding predicates (1)
- German mission society (1)
- German phraseological patterns (1)
- German reference corpus (1)
- German spoken language (1)
- German vowels (1)
- German, Italian, Spanish (1)
- German-American relation (1)
- German-Canadian (1)
- German-Italian (1)
- Geschichte (1)
- Geschichte 1700-1800 (1)
- Geschichte 1945-1955 (1)
- Geschichte 1989-1990 (1)
- Geschichte 1995-1999 (1)
- Geschichte <1700-1900> (1)
- Geschichte <1884-1914> (1)
- Geschichte <1989-1990> (1)
- Geschichte <1989-1994> (1)
- Geschichtskarte (1)
- Geschlecht (1)
- Geschlechtsidentität (1)
- Gesellschaftsleben (1)
- Gesicht (1)
- Gesprochenes Deutsch (1)
- Gesprächsführung (1)
- Gestural matching (1)
- Gestures (1)
- Gesundheit (1)
- Gigafida 2.1 corpus (1)
- Gitksan-Sprache (1)
- Goodwin, Charles W. (1)
- Google Ngram (1)
- Google Translate (1)
- Gospel <Musik> (1)
- Governance (1)
- Gradability (1)
- Grafische Darstellung (1)
- Grammatical Categories (1)
- Grammaticalization (1)
- Grammatikalisierung (1)
- Grammatikografie (1)
- Graph (1)
- Graph cluster (1)
- Graphdatenbank (1)
- Graphem (1)
- Graphemik (1)
- Graphische Benutzeroberfläche (1)
- Graphisches Symbol (1)
- Grasland-Bantu <Sprachfamilie> (1)
- Grasland-Bantu-Sprachen (1)
- Gravity's Rainbow (1)
- Greek Sign Language (1)
- Grewendorf, Günther (1)
- Grundschule (1)
- Guided self-help (1)
- Gälische Sprachen (1)
- HTML (Hypertext Markup Language) (1)
- Haltung (1)
- Handgeste (1)
- Handlung <Literatur> (1)
- Handlungskonstitution (1)
- Handlungstheoretische Semantik (1)
- Handschrift (1)
- Hass (1)
- Hassrede (1)
- Hausa (1)
- Head-Driven Phrase Structure Grammar (HPSG) (1)
- Heroismus (1)
- Hester Piozzi (1)
- Hethitisch (1)
- Hierarchical modeling (1)
- Hieroglyphe (1)
- Higher Education (1)
- Higher education (1)
- Hilfesystem (1)
- Hip-Hop (1)
- Historical Corpora (1)
- Historical Maps (1)
- Historische Korpora (1)
- Historische Lexikographie (1)
- Historische Syntax (1)
- Historische Sprachsynthese (1)
- History of lexicography (1)
- Hochschulpolitik (1)
- Holocaust (1)
- Home environment (1)
- Homographie (1)
- Homonym (1)
- Human Robot Interaction (HRI) (1)
- Humanities (1)
- Humor (1)
- Hungarian (1)
- Hyperkorrektur (1)
- Hyperlink (1)
- Häufigkeitsverteilung (1)
- Hören (1)
- Hörverlust (1)
- ICC corpus (1)
- ICE corpus (1)
- IDS (1)
- IP Rights (1)
- ISO/TC 37/SC 4 (1)
- ISO/TEI (1)
- ISOcat (1)
- ISOcat registry (1)
- IT infrastructure (1)
- IVK-Ler corpus of German (1)
- Illustration (1)
- Imageloss Compensation (1)
- Immigrants (1)
- Imperative (1)
- Impersonale (1)
- Implicit attitudes (1)
- Implikatur (1)
- Impression formation (1)
- Improvisation (1)
- Inchoativ (1)
- Inclusive lexicography (1)
- Indefinite pronoun (1)
- Indefinitpronomen (1)
- Index (1)
- Index Generation (1)
- Indexierung <Inhaltserschließung> (1)
- Indikativ (1)
- Indikator (1)
- Indirekte Anapher (1)
- Individual differences (1)
- Infinitiv (1)
- Infinitivkonstruktion (1)
- Inflectional morphology (1)
- Informatik (1)
- Information (1)
- Information-Retrieval-System (1)
- Informationsgehalt (1)
- Informationsintegration (1)
- Informationsmodellierung (1)
- Innovation (1)
- Inspektionssequenzen (1)
- Institut für Corpuslinguistik und Texttechnologie (ICLTT) (1)
- Instructions (1)
- Instruktionen (1)
- Integer Linear Program (1)
- Intelligence (1)
- Intensität <Phonetik> (1)
- Intensivierung (1)
- Interaction (1)
- Interactional Semantics (1)
- Interactional semantics (1)
- Interactional sociolinguistics (1)
- Interaktionales Projekt (1)
- Interaktiv (1)
- Interaktive Medien (1)
- Interdisciplinarity (1)
- Interfacedesign (1)
- Intermedialität (1)
- Internal and external coherence (1)
- International Conference on Conversation Analysis (ICCA) (1)
- International Conference on Language Resources and Evaluation (12. : 2020 : Marseille) (1)
- International Contrastive Linguistics Conference (1)
- International Corpus of English (1)
- International Society of Conversation Analysis (ISCA) (1)
- Internationale Migration (1)
- Internationales Urheberrecht (1)
- Internationalismus (1)
- Internetdictionary (1)
- Internetforum (1)
- Internetportal (1)
- Internetwörterbuch (1)
- Interoperability of annotation schemes (1)
- Interozeption (1)
- Interpretative Semantik (1)
- Interrelated document grammars (1)
- Interrogativlogik (1)
- Interrogativpronomen (1)
- Interrogativsatz (1)
- Intersektionalität (1)
- Intersubjectivity (1)
- Intertextuality (1)
- Intertextualität (1)
- Intonation (1)
- Inuktitut (1)
- Invariance (1)
- Inversion (1)
- Irisch (1)
- Isolationism (1)
- Italian (1)
- Japanese (1)
- Japanese controlled language (1)
- Java (1)
- Jesuiten (1)
- Johann Andreas Schmeller (1)
- Joint digital storytelling (1)
- Jost Trier (1)
- Journalismus (1)
- Jueju (1)
- Jugendkultur (1)
- Junktion (1)
- Kanada (1)
- Kanji (1)
- Karl Duncker (1)
- Kaukasus (Süd) (1)
- Kind / Sprachentwicklung (1)
- Kinder (1)
- Kindergarten (1)
- Kinderspiel (1)
- Kindesmisshandlung (1)
- Kirchensprache (1)
- Klient (1)
- Klima (1)
- Knowledge Acquisition (1)
- Knowledge Graph (1)
- Knowledge Level Descriptions (1)
- Knowledge Map (1)
- Kochbuch (1)
- Kognitionswissenschaft (1)
- Kognitive Entwicklung (1)
- Kognitive Linguistik (1)
- Kollaborative Filterung (1)
- Komitative Präposition (1)
- Komitativkonstruktion (1)
- Kommentar (1)
- Kommunikationsforschung (1)
- Kommunikationstechnik (1)
- Kommunikationsverhalten (1)
- Kommunikative Kompetenz (1)
- Kompensation (1)
- Komplement (1)
- Komplement <Linguistik> (1)
- Komplexität (1)
- Kompositionalität (1)
- Kompositionelle Semantik (1)
- Kompositum <Wortbildung> (1)
- Konfigurationsmanagement (1)
- Kongruenz <Linguistik> (1)
- Konjunktiv (1)
- Konkordanz (1)
- Kontamination <Wortbildung> (1)
- Kontext (1)
- Kontextanalyse (1)
- Kontingenzen (1)
- Kontrastive Phonetik (1)
- Kontrastive Phraseologie (1)
- Kontrastive Semantik (1)
- Konvention (1)
- Konversationsanalyse (1)
- Koordination (1)
- Korpora (1)
- Korpusannotation (1)
- Korpusaufbereitung (1)
- Korpusmanagement (1)
- Korpusvergleich (1)
- Korrelationsanalyse (1)
- Kosraean (1)
- Krankenschwester (1)
- Kratzenstein, Christian Gottlieb (1)
- Kreativität (1)
- Kriminalität (1)
- Kritik (1)
- Kritische Diskursanalyse (1)
- Kulturelle Vielfalt (1)
- Kulturerbe (1)
- Kulturgeschichte (1)
- Kulturrelativismus (1)
- Kulturwandel (1)
- Kulturwissenschaften (1)
- Kurdisch (1)
- Kurzwort (1)
- Kulturvergleich (1)
- Kymrisch (1)
- Kyrillische Schrift <Druckschrift> (1)
- L1 error correction (1)
- L2 Russian (1)
- L2 effects (1)
- LFG (1)
- LIVE-Data (1)
- LMF (1)
- LR infrastructures and architectures (1)
- LRTwiki (1)
- LSP dictionaries (1)
- Labeling approach (1)
- Labial (1)
- Lachen (1)
- Lafourche Basin (1)
- Lafourche Parish (1)
- Laie (1)
- Language Policy (1)
- Language attitudes (1)
- Language biographies (1)
- Language concept (1)
- Language contact (1)
- Language laws (1)
- Language resources (1)
- Language statistics (1)
- Language technology (1)
- Languages in education (1)
- Langzeitarchivierung (1)
- Large Classes (1)
- Large Corpora (1)
- Large Language Models (1)
- Laryngal (1)
- Lateinunterricht (1)
- Latin Americans (1)
- Latin grammar (1)
- Latin morphology (1)
- Latin syntax (1)
- Latvian (1)
- Latvian as a medium of instruction (1)
- Latvian as second language (1)
- Learner’s lexicography (1)
- Lebenslauf (1)
- Lehnwortportal Deutsch (LWPD) (1)
- Lehnwörter (1)
- Leibliche Displays (1)
- Leibniz-Zentrum Allgemeine Sprachwissenschaft (1)
- Leichte Sprache (1)
- Lexikographie (1)
- Lemmata (1)
- Lemmatisierung (1)
- Lernhilfe (1)
- Lerntheorie (1)
- Lesbarkeit (1)
- Lesekompetenz (1)
- Leseverstehen (1)
- Let's Play (1)
- Let's Plays (1)
- Lettischunterricht (1)
- Let’s Play (1)
- Levelled Study Corpus of Russian (LeStCoR) (1)
- LexMeta (1)
- Lexical Database (1)
- Lexical Functional Grammar (LFG) (1)
- Lexical Semantics (1)
- Lexical functional grammar (1)
- Lexical resources metadata (1)
- Lexical semantics (1)
- Lexicographically interpreted information (1)
- Lexicography (1)
- Lexikalische Analyse (1)
- Lexikalisierung (1)
- Lexikon <Psycholinguistik> (1)
- Lexonomy (1)
- License (1)
- Light verbs (1)
- Likelihood-Quotienten-Test (1)
- Linguistic Category Model (1)
- Linguistic Relativity (1)
- Linguistic Retrieval (1)
- Linguistic annotation (1)
- Linguistic annotations (1)
- Linguistic processing (1)
- Linguistically informed feature engineering (1)
- Linked Open Data (1)
- Linking-Regel (1)
- Linksverzweigende Konstruktion (1)
- Litauen (1)
- Litauisch (1)
- Literatur (1)
- Literaturauswertung (1)
- Literaturdatenbank (1)
- Literature (1)
- Literaturunterricht (1)
- Literaturverwaltung (1)
- Literaturwissenschaft (1)
- Lithuanian (1)
- Livevideostream (1)
- Lizenz (1)
- Lizenzierung (1)
- Lizenzvergabe (1)
- Lizenzvertrag (1)
- Local and global effectiveness (1)
- Logical Document Structure (1)
- Logische Partikel (1)
- Logit-Modell (1)
- Lokalisation (1)
- Lokalismus (1)
- Long-Term Archiving (1)
- Lorraine (1)
- Lothringen (1)
- Louisiana French (1)
- Low German (1)
- Luxembourg (1)
- MARC 21 (1)
- META-SHARE (1)
- MLP (1)
- MLSA (1)
- Machine Learning (1)
- Machine Learning Algorithms (1)
- Machine Learning (1)
- Machine Translation (1)
- Machine learning (1)
- Machine translating (1)
- Magnetoencephalographie (1)
- Makrostruktur (1)
- Malaga (1)
- Mann, Thomas (1)
- Manner of articulation (1)
- Mannheim-Neckarstadt (West) (1)
- Map Task (1)
- Margrethe Thiele (1)
- Markup Languages (1)
- Material objects (1)
- Mathematik (1)
- Matomo (1)
- Maya-Sprachen (1)
- Mean reciprocal rank (1)
- Mechanismus der Menschlichen Sprache (1)
- Mediale Durchformung (1)
- Medialität (1)
- Mediatisierung (1)
- Mediendiskurse (1)
- Medieninteraktion (1)
- Medienkompetenz (1)
- Medienkonsum (1)
- Medienpraktiken (1)
- Mehrheit (1)
- Meinungsfreiheit (1)
- Mental Lexicon (1)
- Mentalität (1)
- Menzerath (1)
- Menzerath's Law (1)
- Menzerathsches Gesetz (1)
- Merkmal (1)
- Meta Modeling (1)
- Metadata Management (1)
- Metakommunikation (1)
- Metalexicography (1)
- Metalinguistik (1)
- Metasprache (1)
- Methodik (1)
- Methods (1)
- Middle High German (MHG) (1)
- Migrant (1)
- Migrationshintergrund (1)
- Minimalist program <Linguistik> (1)
- Mission (1)
- Missionsgesellschaft (1)
- Missverständnis (1)
- Mitarbeit (1)
- Mitschrift (1)
- Modaladverb (1)
- Modaler Infinitiv (1)
- Modelltheoretische Semantik (1)
- Modern Icelandic (1)
- Modifikation <Linguistik> (1)
- Monitor corpus (1)
- Montague-Grammatik (1)
- Morality in interaction (1)
- Moralität (1)
- Morph Moulder (MoMo) (1)
- Morphemik (1)
- Morphologie <Linguistik> (1)
- Morphonologie (1)
- Morphophonologie (1)
- Multi-Word Patterns (1)
- Multi-Strategy Learning (1)
- Multi-layer Annotation (1)
- Multi-modality (1)
- Multilingual Corpus (1)
- Multilingual corpora (1)
- Multilingual corpus (1)
- Multilingual dictionary (1)
- Multilingual lexicography (1)
- Multilingualismus (1)
- Multimodal (1)
- Multimodal interaction (1)
- Multimodale Analyse (1)
- Multimodale Interaktion (1)
- Multimodality (1)
- Multinomial modeling (1)
- Multiple annotations (1)
- Mundart Schwäbisch <Kaukasus> (1)
- N-Gram (1)
- N-N compound (1)
- N-gram modeling (1)
- NFDI section (1)
- NLP pipeline (1)
- NPI (1)
- NZSL Share (1)
- NaLiDa (1)
- Nachfeld (1)
- Namenforschung (1)
- Namenkunde (1)
- Namespaces (1)
- Naming (1)
- Narrative Interaktion (1)
- Narrativität (1)
- Nasal (1)
- Nationalbewusstsein (1)
- Nationale Forschungsdateninfrastruktur (NFDI) e.V. (1)
- Nationalismus (1)
- Nationalitätenpolitik (1)
- Natural Language Processing (NLP) (1)
- Near synonymy (1)
- Nebensatz (1)
- Neg-raising (1)
- Negationsanhebung (1)
- Negationen (1)
- Negotiation (1)
- Neighbour classifier (1)
- NeoRate (1)
- Neologismenwörterbuch (1)
- Netzwerk (1)
- Neugriechisch (1)
- Neurolinguistisches Programmieren (1)
- Neurologie (1)
- Neuseeland (1)
- Neutralisation <Linguistik> (1)
- New Guinea (1)
- New Zealand Sign Language (NZSL) (1)
- New speakers (1)
- Newspaper (1)
- Niedersorbisch (1)
- Nomen (1)
- Nominalsyntagma (1)
- Non-projecting words (1)
- Nonverbal communication (1)
- Nord-Sotho (1)
- Nordchinesisch (1)
- Nordsotho (1)
- Norm <Ethik> (1)
- Normativität (1)
- Normdatei (1)
- North Frisian (1)
- Norwegen (1)
- Norwegian Nynorsk (1)
- NottDeuYTSch Corpus (1)
- Null instantiation (1)
- Null-Subjekt (1)
- Nurse-patient communication (1)
- Nutzen (1)
- Nutzer (1)
- OAuth (1)
- OBELEX (1)
- OCR-Verarbeitung (1)
- OEC (1)
- OED (1)
- OO-correspondence (1)
- OTRS (1)
- OWL-Ontology (1)
- Objektsprache (1)
- Old High German (OHG) (1)
- Old Norse (1)
- Old Romanian (1)
- Old Testament (1)
- Older German (OHG, MHG, OS, MLG) (1)
- On-line syntax (1)
- Online thesaurus (1)
- Online-Dienst (1)
- Online-Informationssystem (1)
- Online-Publikation (1)
- Online-Wortschatz-Informationssystem Deutsch (OWID) (1)
- Onlinekommentare (1)
- Onomasiologie (1)
- OntoLex-Lemon (1)
- Ontologie (1)
- Ontology (1)
- Ontology development (1)
- Open Information (1)
- Opposition (1)
- Optimality Theory (1)
- Optimality theory (1)
- Oral history (1)
- Ost-West-Konflikt (1)
- Osthoff, Hermann (1)
- P600 (1)
- PCFG (1)
- POS-Tagging (1)
- Pacific (1)
- Palauan (1)
- Parallel European Corpus of Informal Interaction (PECII) (1)
- Parallel corpora (1)
- Parallelismus (1)
- Parsing Systems (1)
- Part-of-speech tagging (1)
- Parteipolitik (1)
- Particle Verbs (1)
- Parts of speech (1)
- Pathogener Mikroorganismus (1)
- Patiens (1)
- Pazifischer Ozean (1)
- Pazifischer Ozean <Süd> (1)
- Pearson Korrelation (1)
- Pennsylvania German (1)
- Performanz <Linguistik> (1)
- Periscope (1)
- Periscope <Programm> (1)
- Persian (1)
- Persisch (1)
- Persistent identifier (1)
- Personal Learning Environment (1)
- Personal data (1)
- Persönlichkeitsrecht (1)
- Petition (1)
- Pflegeheim (1)
- Phonatory behavior (1)
- Phonemic level (1)
- Phonesthemes (1)
- Phonetics (1)
- Phonology (1)
- Phrase <Syntagma> (1)
- Phrase Based Active Dictionary (PAD) (1)
- Phrasenstruktur (1)
- Phrasenstrukturgrammatik (1)
- Pitch Range (1)
- Pivot (1)
- Place reference (1)
- Plenum (1)
- Pleonastic Prepositions (1)
- Plesionymy (1)
- Plural Comitative Construction (PCC) (1)
- Poetik (1)
- Polarity Shifter (1)
- Polarity items (1)
- Polaritätsprofil (1)
- Poliqarp (1)
- Polish dialectology (1)
- Politiker (1)
- Politische Berichterstattung (1)
- Politische Einstellung (1)
- Politische Entscheidung (1)
- Politische Identität (1)
- Politische Kommunikation im Fernsehen (1)
- Politische Rede (1)
- Politische Willensbildung (1)
- Politischer Protest (1)
- Polysem (1)
- Popmusik (1)
- Portuguese (1)
- Positionierung (1)
- Possessivpronomen (1)
- Post-Soviet (1)
- Postkolonialismus (1)
- Practice (1)
- Pragmalinguistik (1)
- Pragmatic inference (1)
- Praktische Vernunft (1)
- Prepositional object clause (1)
- Preservation (1)
- Presse (1)
- Pressekonferenz (1)
- Priming (1)
- Privacy (1)
- Privacy by Design (1)
- Privatheit (1)
- Pro-Form (1)
- Probe (1)
- Processing (1)
- Produktivität <Linguistik> (1)
- Prognose (1)
- Programmieren <Informatik> (1)
- Prolog (1)
- Propp system (1)
- Propp, Vladimir Jakovlevič (1)
- Propriozeption (1)
- Prosodic Matching (1)
- Prosodic repetition (1)
- Prosody Transplantation (1)
- Proust, Marcel (1)
- Proverb (1)
- Provider (1)
- Prädikatives Adjektiv (1)
- Prädikativsatz (1)
- Prädiktor (1)
- Präferenz (1)
- Präfix be (1)
- Präpositionaler Objektsatz (1)
- Präpositionalobjekt (1)
- Präpositionalphrase (1)
- Präsident (1)
- Präteritum (1)
- Pseudonymisierung (1)
- Psychische Störung (1)
- Psychisches Trauma (1)
- Psychoanalyse (1)
- Psychodynamische Psychotherapie (1)
- Psychose (1)
- Public sector information (1)
- Pynchon, Thomas (1)
- QUEST (1)
- QUEST project (1)
- Qualitative Inhaltsanalyse (1)
- Qualitative research (1)
- Qualitätskontrolle (1)
- Quantitative research (1)
- Query Languages (1)
- Query Rewriting (1)
- Querying (1)
- Question Answering (1)
- Question Answering System (1)
- Questioning sequences (1)
- Quotations (1)
- R <Programm> (1)
- R package (1)
- RDF <Informatik> (1)
- RDM (1)
- RSS newsfeed corpus (1)
- Rabaul Creole German (1)
- Rapmusiker (1)
- Rassismus (1)
- Rat für Deutsche Rechtschreibung (1)
- Rationalität (1)
- Re-Recordings (1)
- Reaktionszeit (1)
- Rechtschreibreform (1)
- Rechtsschutz (1)
- Rechtsversetzung (1)
- Recipient Design (1)
- Redaktionssystem (1)
- Redefreiheit (1)
- Reduplikation (1)
- Reference Corpora (1)
- Reflexivität <Linguistik> (1)
- Regeln (1)
- Register (1)
- Reibelaut (1)
- Reim (1)
- Reisebericht (1)
- Reiseführer (1)
- Reiseliteratur (1)
- Relation type (1)
- Relative pronoun (1)
- Relativism (1)
- Relativpronomen (1)
- Religion and Psychology (1)
- Repair (1)
- Replication (1)
- Replikat (1)
- Reproduzierbarkeit (1)
- Repräsentation <Politik> (1)
- Republican Party (USA) (1)
- Research Data Infrastructure (RDI) (1)
- Research infrastructure (1)
- Research infrastructures (1)
- Response tokens (1)
- Ressourcen (1)
- Rheinische Missions-Gesellschaft (1)
- Robot Language (1)
- Robotik (1)
- Romanian corpus (1)
- Romanian lexicography (1)
- Romantische Liebe (1)
- Routines (1)
- Rule-following (1)
- Russia (1)
- Russian-Germans (1)
- Russophones (1)
- Rückläufiges Wörterbuch (1)
- Rückmeldepartikel (1)
- Rückmeldung (1)
- Rēzekne (1)
- SABIO-RK (1)
- SALSA (1)
- SALSA corpus (1)
- SAT (1)
- SCAD-zbMATH (1)
- SCyDia (1)
- SDO (1)
- SIS (1)
- SOA (1)
- SQL (1)
- SSH (1)
- Sachverhalt (1)
- Samen <Volk> (1)
- Samoan German (1)
- Satz (1)
- Satzeinbettendes Prädikat (1)
- Satzende (1)
- Satzkonjunktion (1)
- Satzlänge (1)
- Satzsemantik (1)
- Satztyp (1)
- Scale (1)
- Schauspielkunst (1)
- Schema Languages (1)
- Scherz (1)
- Schleswig-Holstein (1)
- Schlieren photography (1)
- Schlüsselwort (1)
- Schottland. Parliament (1)
- Schreiben (1)
- Schriftzeichen (1)
- Schulbildung (1)
- Schulbuch (1)
- Schuld (1)
- Schule (1)
- Schulung (1)
- Schulwahl (1)
- Schwa (1)
- Schweigen (1)
- Schwäbisch (1)
- Schüler (1)
- SciLogs (1)
- Science theory (1)
- Searle, John R. (1)
- Second Language Learning (1)
- Segmentdauer (1)
- Sehbehinderung (1)
- Selbst (1)
- Selbstdarstellung (1)
- Selbstgesteuertes Lernen (1)
- Selbsthilfe (1)
- Selbstorganisation (1)
- Selbstreflexion (1)
- Self-Regulated Learning (1)
- Semantic (1)
- Semantic Analysis (1)
- Semantic Interoperability (1)
- Semantic analysis (1)
- Semantic opposition (1)
- Semantic relation (1)
- Semantic role labelling (1)
- Semantic roles (1)
- Semantic similarity (1)
- Semi-automatic annotation (1)
- Sentence connectives (1)
- Sentence level (1)
- Sentence processing (1)
- SentiFrameNet (1)
- Sepedi (1)
- Sequenz (1)
- Serbian (1)
- Serbian language (1)
- Server (1)
- Serviceintegration (1)
- Serviceorientierte Architektur (1)
- Sexismus (1)
- Sexuelle Belästigung (1)
- Sichtbarkeit (1)
- Sie (1)
- Sie <Wort> (1)
- Sign Languages (1)
- Sign language dictionary (1)
- Sign-Based Construction Grammar (1)
- Silbentrennung (1)
- Simultanübersetzen (1)
- Situatives Involvement (1)
- Sketch Engine (1)
- Sketch engine (1)
- Slavic languages (1)
- Slavische Sprachen (1)
- Slawisch (1)
- Slawische Minderheit (1)
- Slawistik (1)
- Slips (1)
- Slovak (1)
- Slovene (1)
- Slovenisch (1)
- Slowenien (1)
- Smartphone-Gebrauch (1)
- Smartphones (1)
- Smiley (1)
- Social cognition (1)
- Social interaction (1)
- Social media (1)
- Social perception (1)
- Social sciences and humanities (1)
- Socio-Economic Panel (SOEP) (1)
- Softwareergonomie (1)
- Softwarewiederverwendung (1)
- Solidarität (1)
- Sonora (1)
- Sorbian (1)
- Sorbian languages in Germany (1)
- Sotho-Sprache (1)
- Source/goal asymmetry (1)
- South Caucasian (1)
- South Tyrol (1)
- Sowjetunion (1)
- Soziale Integration (1)
- Soziale Rolle (1)
- Soziale Sanktion (1)
- Sozialer Konflikt (1)
- Sozialer Prozess (1)
- Sozialer Wandel (1)
- Sozialisation (1)
- Sozialkompetenz (1)
- Sozialtopografie (1)
- Sozialverhalten (1)
- Space (1)
- Space in language (1)
- Spanien (1)
- Spanish (1)
- Spanish Royal Academy (1)
- Spanish lexicography (1)
- Sparkling wine (1)
- Spatial cases (1)
- Special field lexicography (1)
- Speech Corpora (1)
- Speech Lexica (1)
- Speech production (1)
- Spezifikation (1)
- Spiel (1)
- Spieler (1)
- Spielrahmen (1)
- Spoken Language Data (1)
- Sprachakt (1)
- Sprachbiographien (1)
- Sprachdeterminismus (1)
- Spracheinstellung (1)
- Spracheinstellungen (1)
- Spracherkennung (1)
- Sprachfertigkeit (1)
- Sprachgemeinschaft (1)
- Sprachkritik (1)
- Sprachkurs (1)
- Sprachliche Universalien (1)
- Sprachliche Varietät (1)
- Sprachliches Relativitätsprinzip (1)
- Sprachplanung (1)
- Sprachressource (1)
- Sprachstudie (1)
- Sprachsynthese (1)
- Sprachtheorie (1)
- Sprachursprung (1)
- Sprachvarietät (1)
- Sprachzeichen (1)
- Sprachübersetzung (1)
- Sprechakte (1)
- Sprechakttheorie (1)
- Sprechen (1)
- Sprechererkennung (1)
- Sprichwortforschung (1)
- Spurious regression (1)
- Staatssprache (1)
- Stadtmundart (1)
- State-of-Affairs (1)
- Statistical Learning (1)
- Statistical methods (1)
- Statistische Analyse (1)
- Statistische Linguistik (1)
- Statistisches Modell (1)
- Stimmapparat (1)
- Storage Requirements (1)
- Strategie (1)
- Stressbewältigung (1)
- Struktur (1)
- Student (1)
- Studium (1)
- Subjectivity (1)
- Subjektivierung <Linguistik> (1)
- Subkultur (1)
- Subordination <Linguistik> (1)
- Substantiv (1)
- Substrat <Linguistik> (1)
- Such- und Recherchesysteme (1)
- Suchtechnologie (1)
- Suffigierung (1)
- Summary (1)
- Supervised Classification (1)
- Surface pattern (1)
- Suspendierung (1)
- Sustainability (1)
- Swahili (1)
- Symptom (1)
- Synchronizität (1)
- Syncretism (1)
- Synkretismus (1)
- Synonymie (1)
- Syrer (1)
- Szene (1)
- Sámi (1)
- Sámi languages in Finland (1)
- Südtirol (1)
- Südwestdeutsch (1)
- Südwestdeutschland (1)
- T-shirt lexicography (1)
- TBX (1)
- TEI LingSIG (1)
- TEI XML (1)
- TEI encoding (1)
- TEI-Lex0 (1)
- TEI/XML (1)
- TRP (1)
- TSPP Model (1)
- Tabelle (1)
- Tag (1)
- Tagung (1)
- Tagungsbericht (1)
- Take-In-Interaction (1)
- Target relation (1)
- Teamwork (1)
- Technik (1)
- Technologiegebrauch (1)
- Telepräsenz (1)
- Temporal Reference (1)
- Tense (1)
- Tenseless Languages (1)
- Terminologiedatenbank (1)
- Terminology (1)
- Terrebonne Parish (1)
- Testdaten (1)
- Testproduktion (1)
- Text Categorisation (1)
- Text Classification (1)
- Text Technology (1)
- Text data (1)
- Text mining (1)
- Text retrieval (1)
- Text technology (1)
- Text+ (1)
- Textanalyse ; Diskursanalyse ; Computerlinguistik (1)
- Textbaustein (1)
- Textklassifikation (1)
- Textklassifizierung (1)
- Textlingustik (1)
- Textplus NFDI (1)
- Textverstehendes System (1)
- Thailändisch (1)
- The Oxford English dictionary (1)
- Theaterspiel (1)
- Thema-Rhema-Gliederung (1)
- Thematische Rolle (1)
- Theodor Arnold (1)
- Theorie und Praxis (1)
- Thurneysen, Eduard Rudolf (1)
- Tiefenpsychologisch fundierte Psychotherapie (1)
- Tiersprache (1)
- Time (1)
- Timing (1)
- Titling (1)
- Token <Linguistik> (1)
- Topic map (1)
- Topik-drop (1)
- Topikmodellierung (1)
- Totalitarismus (1)
- Traffic (1)
- Training (1)
- Transformative Sequences (1)
- Transformatives Lernen (1)
- Transitives Verb (1)
- Transitivity (1)
- Transitivität (1)
- Transkripte (1)
- Transkritpion (1)
- Transparenz (1)
- Treebank (1)
- Tschadische Sprachen (1)
- Tsingtau (1)
- Tunnel DP-algorithm (1)
- Tunnel Matrix (1)
- Turn Competition (1)
- Turn design (1)
- Tweet (1)
- Type-Token Verhältnis (1)
- Typologie (1)
- Türkei (1)
- Türkischer Jugendlicher (1)
- UIMA (1)
- Ukraine (1)
- Uncertainty (1)
- Uncertainty avoidance (1)
- Unconnected node (1)
- Unfähigkeit (1)
- Ungenauigkeit (1)
- Union of Soviet Socialist Republics (USSR) (1)
- Universalgrammatik (1)
- Universalität (1)
- Universität zu Köln (1)
- Unterricht (1)
- Unterrichtsmethode (1)
- Unterrichtsprache (1)
- Unterrichtssprache (1)
- Unvollständige TCUs (1)
- Urban dialects (1)
- Usability (1)
- UseNet (1)
- User <Benutzer> (1)
- User Generated Content (1)
- VLO (1)
- VR-games (1)
- Valences (1)
- Valenztheorie <Linguistik> (1)
- Varianz <Linguistik> (1)
- Variation des gesprochenen Deutsch (1)
- Ventspils University of Applied Sciences (VUAS) (1)
- Verb <verdienen> (1)
- Verb-Erst-Stellung (1)
- Verbal fluency (1)
- Verbalaggression (1)
- Verbale Äußerung (1)
- Verbzweit (1)
- Vereinfachung (1)
- Vereinheitlichung (1)
- Verfahren der Zeichenprozessierung (1)
- Verfügbarkeit (1)
- Vergessen (1)
- Vergewaltigung (1)
- Vergleich (1)
- Vergleich <Rhetorik> (1)
- Vergleichbarkeit (1)
- Verhaltensmodifikation (1)
- Verhandlung (1)
- Verlaufsform (1)
- Vermutung <Linguistik> (1)
- Versdichtung (1)
- Verstehen und Intersubjektivität (1)
- Verständigung (1)
- Verwandtschaftsbezeichnung (1)
- Very Large Corpora (1)
- Veränderungsmessung (1)
- Videaufzeichnung (1)
- Video (1)
- Videodaten (1)
- Videointerview (1)
- Videokonferenz (1)
- Vietnamesisch (1)
- Vision (1)
- Visualization (1)
- Visualizations (1)
- Visueller Kontrast (1)
- Vokabellernen (1)
- Vokalisierung (1)
- Volksabstimmung (1)
- Volltext (1)
- Vorlesung (1)
- Vormachen (1)
- Vorschlagen (1)
- Vortragstechnik (1)
- Vorwort (1)
- Võro (1)
- WCC (1)
- WH-cleft (1)
- WOrd eMBedding dATabase (WOMBAT) (1)
- WSD (1)
- Wabi Sabi (1)
- Wahlforschung (1)
- Wahrnehmungsverb (1)
- Walbiri-Sprache (1)
- Walisisch (1)
- Walter Porzig (1)
- Web corpus (1)
- Web spam (1)
- WebLicht (1)
- Weblog (1)
- Weißrussisch (1)
- Welsh (1)
- Werbung (1)
- West Germanic (1)
- Westeuropa (1)
- What-about questions (1)
- WhatsApp (1)
- Widerstand (1)
- Widget bundle (1)
- Wien <2018> (1)
- Wikibase (1)
- Wikipedia articles (1)
- Wikipedia talk pages (1)
- Wiktionary revision history (1)
- Wirtschaftssprache (1)
- Wisconsin (1)
- Wissenschaft (1)
- Wissenschaftsentwicklung (1)
- Wissenschaftskommunikation (1)
- Wissenschaftspublizistik (1)
- Wissensextraktion (1)
- Wissensextration (1)
- Wissensgraph (1)
- Wissensrepräsentation (1)
- Wissensverarbeitung (1)
- Witz (1)
- Word associations (1)
- Word history (1)
- Word selection (1)
- Word‐Length (1)
- World War I (1)
- World War II (1)
- World Wide Web (1)
- Wortbedeutung <Semasiologie> (1)
- Wortfamilie (1)
- Wortfeld (1)
- Wortfolge (1)
- Wortgeschichte digital (Digital Word History) (1)
- Wortgrenze (1)
- Wortliste (1)
- Wortphonologie (1)
- Wortspiel (1)
- Writing (1)
- Writing process (1)
- Writing research (1)
- Writing technology (1)
- Wörterbuch Geschichte (1)
- Wörterbuch der deutschen Gegenwartssprache (WDG) (1)
- Wörterbucharbeit (1)
- Wörterbücher afrikanischer Sprachen (1)
- XForms (1)
- XML applications (1)
- XML database (1)
- XQuery Full Text (1)
- XSL Transformation (1)
- XSLT (1)
- YouTube comments (1)
- ZAS Database of Clause-Embedding Predicates (1)
- ZAS-Datenbank satzeinbettender Prädikate (1)
- Zeichen (1)
- Zeigesequenzen (1)
- Zeitreihenanalyse (1)
- Zeitschrift (1)
- Zeitsemantik (1)
- Zeitwahrnehmung (1)
- Zertifizierung (1)
- Zipf (1)
- Zipf–Mandelbrot law (1)
- Zugehörigkeit (1)
- Zuverlässigkeit (1)
- Zweierbeziehung (1)
- Zwischenmenschliche Beziehung (1)
- Zäsur <Metrik> (1)
- aanlyn woordeboeke (1)
- aboriginal culture in northern Russia (1)
- abusive comparisons (1)
- abusive emojis (1)
- abusive remarks (1)
- abusive words (1)
- academic dictionary (1)
- accent (1)
- acceptability ratings (1)
- access structure (1)
- accounting (1)
- accounts (1)
- accusation (1)
- acquisition (1)
- acting technique (1)
- action (1)
- action recognition (1)
- action-ascription (1)
- actuation problem (1)
- acute hospital (1)
- adaptive design (1)
- addition (1)
- adjacency pair (1)
- adjectives (1)
- ado file (1)
- adposition (1)
- adult education (1)
- adverb (1)
- adverbial connective (1)
- aerodynamics (1)
- aesthetic concept (1)
- aesthetic evaluation (1)
- aesthetics (1)
- affective stance (1)
- affiliation (1)
- affirmation of the consequent (1)
- african languages dictionaries (1)
- afrikataalwoordeboeke (1)
- age of acquisition (1)
- age stereotypes (1)
- agent (1)
- agent role (1)
- agentivity effect (1)
- aging (1)
- aikuiskoulutus (1)
- algorithms (1)
- allemand parlé (1)
- allomorphy (1)
- allostructions (1)
- ambiguous words (1)
- ambivalent sexism (1)
- analepsis (1)
- analogy (1)
- analyse conversationnelle (1)
- analyse multimodale (1)
- analytical opacity (1)
- anaphor (1)
- anaphoric relations (1)
- ancestry (1)
- animation (1)
- annotated corpora (1)
- annotation schema (1)
- annotation tools (1)
- announcements (1)
- anonymization (1)
- anotación multinivel (1)
- antecedence (1)
- anticipatory mechanism (1)
- application (1)
- application domain (1)
- applied language studies (1)
- applied linguistics (1)
- arbitrary scripts (1)
- architecture-for-interaction (1)
- archiving support (1)
- archiving workflow (1)
- argumentation (1)
- art reception (1)
- artefacts (1)
- articulography (1)
- assertion (1)
- assessment (1)
- assistance (1)
- attitudes towards dictionaries (1)
- audio-visual data (1)
- authentic language (1)
- authentic materials (1)
- author name homography (1)
- author name variability (1)
- authority records (1)
- automated tracking (1)
- automatic classification (1)
- automatic processing (1)
- automatic summarization (1)
- automatic term extraction (1)
- automatic translators (1)
- automatische Annotation (1)
- automotive domain (1)
- auxiliary selection (1)
- availability (1)
- avatars (1)
- average prediction complexity (1)
- aviation terminology (1)
- avun mobilisointi (1)
- base recognition model application (1)
- beliefs (1)
- benefit (1)
- bias awareness (1)
- bibliographic database (1)
- biconditional (1)
- bidirectionality (1)
- big data (1)
- bilingual (1)
- bilingual community (1)
- bilingual dictionaries in electronic format (1)
- bilingual electronic dictionaries (1)
- bilingual paronyms (1)
- bilingual resources (1)
- bilingual thesaurus (1)
- bilingualism (1)
- bilingualized dictionary (1)
- biomedical language processing (1)
- blindness (1)
- blog corpus (1)
- bodily conduct (1)
- bodily response (1)
- borrowing (1)
- bound word (1)
- boundary effects (1)
- brain rhythms (1)
- bridging relations (1)
- bridging resolution (1)
- business coaching (1)
- business data (1)
- business research (1)
- búsqueda (1)
- car racing (1)
- case (1)
- case syncretism (1)
- casual conversation (1)
- category detection (1)
- causal tagger (1)
- census (1)
- centres (1)
- cessation implicatures (1)
- cesuras (1)
- change-of-state token (1)
- child-directed speech (1)
- children (1)
- children’s specialised lexicography (1)
- children’s vocabulary (1)
- cipient (1)
- classification (1)
- clause linkage (1)
- clause union (1)
- climate (1)
- clitic climbing (1)
- close reading of dictionaries (1)
- closed vocabulary (1)
- clusivity (1)
- cluster analysis (1)
- clustering (1)
- co-training (1)
- code of ethics (1)
- code-switching (1)
- coding (1)
- coercion (1)
- cognitive availability (1)
- cognitive impairment (1)
- cognitive processing (1)
- cognitive salience (1)
- coherence (1)
- coherent construction (1)
- cohering affixes (1)
- collaboration (1)
- collaborative dictionary (1)
- collaborative filtering (1)
- collective emotions (1)
- collo-profile (1)
- collocated smartphone use (1)
- collocational behaviour (1)
- collostructional analysis (1)
- colonial group construction (1)
- combination of methods (1)
- combinatoric semantics (1)
- commonly confused words (1)
- communication (1)
- communication verb (1)
- communication verbs (1)
- communicative competence (1)
- communicative deviation (failure) (1)
- communicative deviations (1)
- community engagement (1)
- community size (1)
- comparable corpus (1)
- comparative lexicographic principles (1)
- comparative political science (1)
- comparison (1)
- compatibility (1)
- competence (1)
- complaint (1)
- complement clause (1)
- complementizer (1)
- complex graphemes (1)
- complex preposition (1)
- complex prepositions (CPs) (1)
- compositionality (1)
- compound family (1)
- compound formation (1)
- compound interpretation (1)
- compounding (1)
- comprehensibility (1)
- comprehension (1)
- compression (1)
- computer-assisted language learning (1)
- computational language models (1)
- computer game (1)
- computer-assisted language learning (CALL) (1)
- computer-assisted pronunciation training (CAPT) (1)
- computerized grammar (1)
- comunicación mediada por computadora (CMC) (1)
- conative construction (1)
- concept scheme (1)
- concept system (1)
- concept system visualization (1)
- concept systems (1)
- conceptual approach (1)
- conceptual domain (1)
- conceptual field (1)
- conceptual history (1)
- conceptual metaphor theory (1)
- conceptualisation (1)
- concersation analysis (1)
- conflict (1)
- confusion (1)
- connectives (1)
- constrained poetic structure (1)
- constraint optimization (1)
- constraint satisfaction (1)
- constraint solving (1)
- construal (1)
- constructional ambiguity (1)
- constructional synonymy (1)
- contact linguistics (1)
- content management platform (1)
- content questions (1)
- context markers (1)
- contexts of dictionary use (1)
- contextual framework (1)
- contextual meaning (1)
- contingencies (1)
- continuer (1)
- continuers (1)
- contradiction (1)
- contrast (1)
- contrastive entries (1)
- contrastive focus (1)
- contrastive lexicography (1)
- controlled vocabularies (1)
- conversation (1)
- conversation analyses (1)
- conversation-analytic transcription (1)
- conversational analysis (1)
- conversational constructions (1)
- conversational narrative (1)
- convolutional neural networks (1)
- coordination of verbal and embodied action (1)
- copular clauses (1)
- copular constructions (1)
- copulatives (1)
- copyright laws (1)
- corona-neologism (1)
- coronacorpus (1)
- coronavirus (1)
- corpora of talk-in-interaction (1)
- corpus linguistics (1)
- corpus CMC (1)
- corpus access (1)
- corpus analysis tools (1)
- corpus architecture (1)
- corpus compilation (1)
- corpus construction (1)
- corpus creation (1)
- corpus de aprendices (1)
- corpus development (1)
- corpus driven approach (1)
- corpus exploitation (1)
- corpus frequencies (1)
- corpus information (1)
- corpus management systems (1)
- corpus pragmatics (1)
- corpus query processing (1)
- corpus query protocol (1)
- corpus querying (1)
- corpus retrieval (1)
- corpus search engine (1)
- corpus search platform (1)
- corpus size (1)
- corpus storage (1)
- corpus-based evaluation (1)
- corpus-based lexicon building (1)
- corpus-based methods (1)
- corpus-based statistical methods (1)
- corpus-based terminography (1)
- corpus-driven lexicography (1)
- corpus-lexicographic tool (1)
- corrections (1)
- correspondence (1)
- corpus-based lexicography (1)
- counterfactual recipient design (1)
- couple interaction (1)
- creole (1)
- crime (1)
- critical events (1)
- cross-cultural (1)
- cross-cultural research (1)
- cross-linguistic analysis (1)
- cross-linguistic data (1)
- cross-national policy convergence (1)
- crosswalks (1)
- cultural diversity (1)
- cultural heritage resources (1)
- culture specific items (1)
- curation (1)
- da (1)
- das <Wort> (1)
- data category (1)
- data control mechanism (1)
- data curation (1)
- data deposition (1)
- data dissemination (1)
- data exploration (1)
- data modeling (1)
- data modelling (1)
- data presentation (1)
- data processing (1)
- data provision (1)
- data referencing (1)
- data sets (1)
- data sustainability (1)
- data visualization (1)
- database applications (1)
- database systems (1)
- dataset (1)
- dative (1)
- decentralization (1)
- decision tree modelling (1)
- decision tree structure (1)
- decision-making (1)
- deep learning (1)
- deep-level morphological analyses (1)
- deep-structure morphological analyses (1)
- definiteness (1)
- definition (1)
- definitions (1)
- delayed completion (1)
- demonstration (1)
- demonstrative (1)
- denial of the antecedent (1)
- deontic (1)
- deontic modality (1)
- depiction (1)
- derivation (1)
- derivational morphology (1)
- derived subject (1)
- desambiguación (1)
- description (1)
- description of neologisms (1)
- descriptive (1)
- detection of neologisms (1)
- determinologisation (1)
- deviation (1)
- diachronic change (1)
- diachronic variation in language use (1)
- dialect (1)
- dialect competence (1)
- dialect lexicography (1)
- dialectometry (1)
- dialektometrie (1)
- dialogue interpreting (1)
- diary omission (1)
- diaspora communities (1)
- dictionaries as social agents (1)
- dictionarisability (1)
- dictionary didactics (1)
- dictionary editing system (1)
- dictionary encoding (1)
- dictionary of language contact (1)
- dictionary portal (1)
- dictionary teaching (1)
- dictionary typology (1)
- dictionary+ (1)
- dictionnaire des néologismes (1)
- didactic corpus (1)
- difference (1)
- diffusion mechanism (1)
- diffusion studies (1)
- digitaaliset taidot (1)
- digital collocation database (1)
- digital communication (1)
- digital lexicography (1)
- digital libraries (1)
- digital library (1)
- digital skills (1)
- digitally-mediated communication (1)
- diphthongs (1)
- directionality (1)
- directives (1)
- discourse deixis (1)
- discourse dictionary (1)
- discourse history (1)
- discourse keywords (DKW) (1)
- discourse markers (1)
- discourse metaphor (1)
- discourse metaphors (1)
- discourse parsing (1)
- discourse particles (1)
- discourse processing (1)
- discourse structure (1)
- discourse-level associations (1)
- discovering collocations in corpora (1)
- disengagement (1)
- disjunction (1)
- dissemination (1)
- do-support (1)
- doctor-patient interaction (1)
- document management and text processing (1)
- document processing (1)
- document triage (1)
- domain label (1)
- domain-specific solutions (1)
- double object (1)
- download vs. citation patterns (1)
- driving (1)
- drop out (1)
- dual task (1)
- duration (1)
- duration prediction (1)
- dyadic coping (1)
- dynamic lexicography (1)
- e-dictionary (1)
- e-dictionary application (1)
- eHumanities (1)
- early response (1)
- ecolinguistics (1)
- economic conditions (1)
- economic data (1)
- economy principles (1)
- editorial (1)
- editorial process (1)
- edutainment (1)
- efficiency (1)
- ego-documents (1)
- egocentrism (1)
- einsprachiges Wörterbuch (1)
- elderspeak (1)
- electromagnetic articulography (1)
- electronic corpus (1)
- electronic dictionaries (1)
- electronic dictionary (1)
- elektroniese woordeboeke (1)
- elicitation (1)
- ellipsis (1)
- embedded tense (1)
- embodied action (1)
- embodied displays (1)
- embodied other-initiation of repair (1)
- embodied withdrawal (1)
- emergence (1)
- emotional valence (1)
- empirical aesthetics (1)
- encoding (1)
- encounter (1)
- encyclopedic-conceptual approach (1)
- entropy (1)
- environment (1)
- epistemic priority (1)
- epistemicity (1)
- epistemische Priorität (1)
- equi-complexity hypothesis (1)
- error collection (1)
- es (1)
- ethnicity (1)
- ethno-regionalism (1)
- ethnolinguistic identity (1)
- ethnolinguistic vitality (1)
- ethnomethodology (1)
- etymological data base (1)
- etymology (1)
- europeanization (1)
- event structure (1)
- event-related brain potentials (ERP) (1)
- event-related potentials (1)
- evidentiality (1)
- evoked potentials (1)
- evolution of Scientific English (1)
- exceptional case marking (1)
- excessive (1)
- exclusive particles (1)
- existential, tense (1)
- experience (1)
- experiment (1)
- experimental evidence (1)
- experimental linguistics (1)
- experimental syntax (1)
- experimentation (1)
- experimentelle Phonetik (1)
- expertise (1)
- expert–novice (1)
- explicit and integrated intervention program (1)
- exploration of CMDI metadata (1)
- extended search (1)
- extensibility (1)
- extralexicographic features (1)
- eye tracking (1)
- eye-tracking (1)
- f0 accommodation (1)
- face-to-face interaction (1)
- factuality (1)
- family interaction (1)
- family relationships (1)
- family studies (1)
- feature compound (1)
- feature structure representation (1)
- fiabilidad (1)
- fieldwork (1)
- figurative meaning (1)
- finite state (1)
- finite state tokenization (1)
- first pair part (1)
- first person plural pronouns (1)
- fixation duration (1)
- focus (1)
- focus alternatives (1)
- focus phrase (1)
- folk linguistics (1)
- fonologie (1)
- food photography (1)
- footing shifts (1)
- foreign accent (1)
- foreign language learner (1)
- foreign language teacher (1)
- foreign language teaching (1)
- forgetfulness (1)
- form of communication (1)
- formal mathematics (1)
- formal model (1)
- format migration (1)
- formation de mots (1)
- formats (1)
- forms of representation in digital lexicography (1)
- frame semantics (1)
- frame structure (1)
- frame-based contrastive analysis (1)
- framing (1)
- free-sorting (1)
- frequency (1)
- fuck (1)
- full form systems (1)
- functional categories (1)
- functional status (1)
- future (1)
- fuzziness (1)
- gam (1)
- gathering (1)
- gebruikersleiding (1)
- gender (1)
- gender and language (1)
- gender differences (1)
- gender en taal (1)
- gender identity (1)
- gender stereotypes (1)
- genderstereotipes (1)
- general dictionary (1)
- general monolingual dictionary (1)
- generating information on demand (1)
- genericity (1)
- genre conceptions (1)
- genre expectations (1)
- genre-specific literary reading (1)
- genre-specific reading strategies (1)
- gestural hold (1)
- globaLex (Körperschaft) (1)
- global biodiversity in the early 21st century (1)
- global extension (1)
- global extinction of languages (1)
- global structural information (1)
- globalization (1)
- gold standard corpus (1)
- governance (1)
- govorni njemački u interakciji (1)
- gradable adjectives (1)
- grammar acquisition (1)
- grammar competition (1)
- grammar development (1)
- grammar engineering (1)
- grammar learning (1)
- grammar testing (1)
- grammar-based language learning (1)
- grammatical KOS (1)
- grammatical complexity (1)
- grammatical construction (1)
- grammatical framework (1)
- grammatical information (1)
- grammatical particle (1)
- graph databases (1)
- graph-based dictionaries (1)
- graphematics (1)
- graphemic representation (1)
- graphetics (1)
- guidelines (1)
- handwriting (1)
- head alignment (1)
- head nod (1)
- headword (1)
- help desk (1)
- helping interaction (1)
- heroism (1)
- high-variability training (1)
- high-vowel laxing (1)
- higher education policy (1)
- hiperskakels (1)
- historical corpora (1)
- historical encyclopedias (1)
- historical lexicology (1)
- historical word formation of German (1)
- history of science (1)
- hosting provider (1)
- household work (1)
- human annotation studies (1)
- human cognition (1)
- human learning (1)
- humor (1)
- hyperlinks (1)
- identities in talk (1)
- identity construction (1)
- identity effects (1)
- identity groups (1)
- idiom detection (1)
- idiosyncrasy (1)
- imagination (1)
- impact indicator (1)
- imperatives (1)
- imperfective (1)
- impersonal (1)
- impersonal deontic statement (1)
- impersonal structures (1)
- implicit abuse (1)
- implicit association test (IAT) (1)
- implicitly abusive comparisons (1)
- implicitly abusive language (1)
- inbreath (1)
- incoherent construction (1)
- incomplete TCUs (1)
- increments (1)
- indirect questions (1)
- indirekter Sprechakt (1)
- individual alpha frequency (1)
- individual differences (1)
- inferences (1)
- infinite canvas (1)
- infinitival complements (1)
- inflected complementizer (1)
- inflected form (1)
- inflected forms (1)
- inflection (1)
- información de corpus (1)
- information density (1)
- information extraction (1)
- information infrastructure (1)
- information presentation devices (1)
- information retrieval (1)
- informing (1)
- infrastructure technology (1)
- infrastructures and architectures (1)
- inligtingsaanbiedingsinstrumente (1)
- innovation (1)
- inspection sequences (1)
- institutional action (1)
- instructional imperatives (1)
- integrated e-dictionary (1)
- integration (1)
- integriertes Lernen (1)
- intelligence (1)
- intensification (1)
- intention (1)
- intention ascription (1)
- inter-annotator reliability (1)
- inter-rater variability (1)
- interaction space (1)
- interactional competence (1)
- interactional grammar (1)
- interactional histories (1)
- interactional phonetics (1)
- interactional project (1)
- interactive editing (1)
- interactive graph visualisation (1)
- interactive turn space (1)
- interactivity (1)
- interakcijsko jezikoslovlje (1)
- interaktives Editieren (1)
- intercultural communication (1)
- intergroup relations (1)
- interlocking organization (1)
- intermediality (1)
- international comparable corpus (1)
- international comparison (1)
- international school (1)
- international work settings (1)
- internetbasierte Kommunikation (1)
- internetbasierte Kommunikation (IBK) (1)
- interpret (1)
- interpretation practices (1)
- interrogatives (1)
- intersectionality (1)
- intersemiotic translation adequacy (1)
- intervention (1)
- intonation units (1)
- intra-rater variability (1)
- intra-writer variation (1)
- inversion (1)
- island (1)
- iso24613 (1)
- isomorphism (1)
- item variability (1)
- joint projects (1)
- joint utterance formulation (1)
- joke (1)
- justification (1)
- keuse-boomstruktuur (1)
- keyphrase extraction (1)
- keyword analysis (1)
- kinship terminology (1)
- knowledge sources (1)
- kognitive Semantik (1)
- kollokasies (1)
- kontrafaktischer Adressatenzuschnitt (1)
- kontrastive Grammatik (1)
- kontrastive Lexikologie (1)
- kopulatiewe (1)
- korpusgebaseerde leksikografie (1)
- landscape (1)
- landscapes (1)
- language Standardization (1)
- language acquisition (1)
- language activism (1)
- language and gender (1)
- language area (1)
- language attitude (1)
- language awareness (1)
- language comparison (1)
- language corpora (1)
- language data (1)
- language discourses (1)
- language efficiency (1)
- language endangerment (1)
- language fixedness (1)
- language legislation (1)
- language marketing (1)
- language model (1)
- language modelling (1)
- language narratives (1)
- language regards (1)
- language resource (1)
- language shift (1)
- language socialisation (1)
- language structure (1)
- language studies (1)
- language teaching (1)
- language use (1)
- language variation (1)
- languages in Mari El (1)
- languages in Udmurtia (1)
- languages in the Russian Federation (1)
- large corpus data (1)
- large-scale corpora (1)
- latent semantic analysis (1)
- laughter (1)
- law (1)
- lean syntax (1)
- learner corpora (1)
- learner corpus of adolescent (1)
- learner's dictionary (1)
- learners’ dictionary (1)
- learning activities (1)
- learning motivation (1)
- lecture (1)
- legal aspects (1)
- legal lexicon (1)
- leksikografiese model (1)
- leksikografski izvori (1)
- lemma (1)
- length (1)
- lenguaje oral (1)
- less-resourced languages (1)
- lexical decomposition (1)
- lexical borrowings (1)
- lexical analysis (1)
- lexical decision (1)
- lexical fields (1)
- lexical frequency (1)
- lexical level (1)
- lexical loans (1)
- lexical markup framework (1)
- lexical resources (1)
- lexical-functional grammar (1)
- lexicographers’ needs (1)
- lexicographic data (1)
- lexicographic functions (1)
- lexicographic model (1)
- lexicographic neology (1)
- lexicographic practices (1)
- lexicographic situation (1)
- lexicographical neology (1)
- lexicographical resource (1)
- lexicographical system (1)
- lexicon generation (1)
- lexicon graph (1)
- lexicon graphs (1)
- lexicon model (1)
- lexicon model formalism (1)
- lexicon structure (1)
- lexicotainment (1)
- lexikalische Repräsentation (1)
- lexikography (1)
- lexis (1)
- license (1)
- life science (1)
- lifelong learning (1)
- light-verb constructions (1)
- lightweight annotation (1)
- likelihood ratio test (1)
- linguistic abstractness (1)
- linguistic acculturation (1)
- linguistic and cultural diversity (1)
- linguistic annotation (1)
- linguistic borrowings (1)
- linguistic change (1)
- linguistic expectancy bias (LEB) (1)
- linguistic integration (1)
- linguistic intergroup bias (LIB) (1)
- linguistic landscape (1)
- linguistic landscapes (1)
- linguistic locational reference (1)
- linguistic minorities (1)
- linguistic minority (1)
- linguistic niche hypothesis (1)
- linguistic prominence (1)
- linguistic repair (1)
- linguistic rights of national groups (1)
- linguistic technology (1)
- linguistic typology (1)
- linguistically based measures (1)
- linguistics (1)
- linguistique interactionnelle (1)
- linking patterns (1)
- list of headwords (1)
- literary comprehension (1)
- literary processing (1)
- live video stream (1)
- loan translation (1)
- loan words (1)
- loans (1)
- loanwords (1)
- local ecology (1)
- locally uninstantiated arguments (1)
- locative vs. goal adverbial (1)
- log file (1)
- log file analysis (1)
- logical information systems (1)
- logical problem of language change (1)
- logistic regression (1)
- lokalistische Hypothese (1)
- machine translation (1)
- macrostructure (1)
- major reference work (1)
- makrostruktuur (1)
- mantenimiento (1)
- manual database curation (1)
- manual information extraction (1)
- marital satisfaction (1)
- markup framework (1)
- markup language (1)
- marqueurs de réponse (1)
- mashup (1)
- material culture (1)
- mathematical language (1)
- mathematical terms (1)
- mathematics (1)
- maximum likelihood (1)
- meaning (1)
- measurement (1)
- mechanisms (1)
- media discourse (1)
- media effects (1)
- media linguistics (1)
- media literacy (1)
- media practices (1)
- media technology (1)
- mediated interaction (1)
- mediation (1)
- mediostructure (1)
- mediostruktuur (1)
- mediterranean (1)
- meeting talk (1)
- meetings (1)
- mental health services (1)
- mental illness (1)
- mentality (1)
- message effectiveness (1)
- messenger communication (1)
- meta-language (1)
- meta-pragmatic accounts (1)
- meta-semantic effects (1)
- metacommunication (1)
- metadata analysis (1)
- metadata curation (1)
- metadata editor (1)
- metadata formats (1)
- metadata quality (1)
- metadata quality assessment (1)
- metadata score (1)
- metadata standards (1)
- metaphor theory (1)
- metaphorical extension (1)
- metodologia (1)
- micro-constructions (1)
- micro-sequential relationship (1)
- microservices (1)
- microstructure bilingual dictionaries of linguistics (1)
- migration (1)
- migration linguistics (1)
- mikrostruktura (1)
- mikrostruktuur (1)
- minorities in Germany (1)
- minority language protection (1)
- minority language revitalisation (1)
- minority language speakers (1)
- minority languages and cultures (1)
- minority protection (1)
- minority–majority relations (1)
- mission societies (1)
- mixed-effects logistic regression models (1)
- mixed-effects modeling (1)
- mobile devices (1)
- mobilising assistance (1)
- mobility (1)
- mobilizing response (1)
- mock story (1)
- modal enrichment (1)
- modal meaning (1)
- modal particles (1)
- modal verb constructions (1)
- modalne čestice (1)
- modern forms of prejudice (1)
- modular pivot (1)
- modus ponens (1)
- monolingualised dictionary (1)
- monospaced font (1)
- mood (1)
- morfologie (1)
- morphemic categories (1)
- morpho-syntactic argument realization (1)
- morpho-syntactic database (1)
- morphological analyses (1)
- morphological complexity (1)
- morphological level (1)
- morphological parsing (1)
- morphological productivity (1)
- morphological treebank (1)
- mot d'emprunt (1)
- motion verb (1)
- motivation to control prejudiced responding (1)
- movie recommendation (1)
- mrežni rječnik (1)
- multi-activity and multi-party settings (1)
- multi-layer annotation (1)
- multi-layer corpora (1)
- multi-lingual grammar (1)
- multi-modality (1)
- multi-party dialogues (1)
- multi-relational learning (1)
- multi-turn conversations (1)
- multi-unit turn (1)
- multi-word expression (1)
- multiactivity (1)
- multidimensional scaling (1)
- multidimensionele skalering (1)
- multidisciplinarity (1)
- multifunctional lexical resource (1)
- multifunksionele leksikale bron (1)
- multilevel modeling (1)
- multilingual corpora (1)
- multilingual data (1)
- multilingual database (1)
- multilingual grammar (1)
- multilingual matter (1)
- multilingual platform (1)
- multilingual setting (1)
- multilingual transcripts (1)
- multilinguality (1)
- multimedia (1)
- multimodaalinen keskustelunanalyysi (1)
- multimodal conversation analysis (1)
- multimodal corpora (1)
- multimodal database (1)
- multimodal interaction analysis (1)
- multimodal storytelling (1)
- multiparty setting (1)
- multiple etymologies (1)
- mundane technology use (1)
- murder (1)
- naming (1)
- narrative (1)
- narrative analysis (1)
- narrative comparison (1)
- narratives in interaction (1)
- national and subnational standard varieties (1)
- national corpora (1)
- national identification (1)
- nationaler Mythos (1)
- nationalistic purism (1)
- native speech (1)
- natürlichsprachliche Systeme (1)
- negation Raising (1)
- negation content words (1)
- negation modeling (1)
- negation particle (1)
- negotiation (1)
- neologism detection (1)
- neologisms in Brazilian Portuguese (1)
- neology (1)
- neoterm (1)
- network analysis (1)
- neural oscillations and entrainment (1)
- neural phase precession (1)
- new media (1)
- new public management (1)
- newsfeed (1)
- newspaper reports (1)
- nodding (1)
- non-players (1)
- nonnative speakers (1)
- nonnative speech (1)
- nonstandard accent (1)
- normalisation (1)
- normalization (1)
- normativity (1)
- norms and rules (1)
- noun phrase (1)
- null complementation (1)
- null subject (1)
- néologismes des médias sociaux (1)
- object manipulation (1)
- objektorientierte Graphdatenbank (1)
- observation study (1)
- observational study (1)
- offers (1)
- official language (1)
- oh that’s right (1)
- okay (1)
- online dictionaries of linguistics (1)
- online discourse (1)
- online grammars (1)
- online information systems (1)
- online lexicographic resources (1)
- onomasiological search (1)
- onomastics (1)
- ontology (1)
- open class repair initiators (1)
- open dictionary (1)
- open educational trainer (1)
- open science (1)
- open source software (1)
- operationalized psychodynamic diagnosis (1)
- opinion extraction (1)
- opinion inference (1)
- opinion role extraction (1)
- opinion verb (1)
- opinion verbs (1)
- oral and written skills (1)
- oral corpus platform (1)
- oral history corpora (1)
- oral language (1)
- other-initiated repair (1)
- overlap resolution (1)
- overtaking (1)
- own experience (1)
- pandemic neologism (1)
- paradigm uniformity (1)
- parallel text corpus (1)
- parallelism (1)
- parameters (1)
- parental interventions (1)
- parliaments (1)
- paronym dictionaries (1)
- paronyms, easily confused words (1)
- paronymy (1)
- parser evaluation (1)
- parsing (1)
- part-of-speech tagging (1)
- participant opacity (1)
- participation (1)
- passive (1)
- past (1)
- patientivity (1)
- pattern-based lexicography (1)
- patterns (1)
- pean languages (1)
- pedagogical lexicography Greek (1)
- peer-group interaction (1)
- perceptual evaluation (1)
- perfect (1)
- performativity (1)
- permutation testing (1)
- persistent identifiers (1)
- person agreement (1)
- person perception (1)
- person reference (1)
- personal designations (1)
- personal learning environments (1)
- perspective (1)
- phi-features (1)
- phonetic databases (1)
- phonetic ending (1)
- phonological status (1)
- phonological word (1)
- picture naming (1)
- pitch (1)
- pitch contour matching (1)
- place names (1)
- plurilingualism (1)
- poetic diction (1)
- poetic language (1)
- poetic structure (1)
- poetry (1)
- poetry comprehension (1)
- pointing gesture (1)
- polar question (1)
- polarity sensitive items (1)
- polarity shifter (1)
- policy analysis (1)
- policy preference (1)
- policy transfer (1)
- political debate (1)
- political discourse (1)
- political relations (1)
- political text analysis (1)
- political video interviews (1)
- political views (1)
- politics (1)
- politische Willensbildung (1)
- pop lyrics (1)
- popular knowledge (1)
- positionally-sensitive grammar (1)
- positioning analysis (1)
- positioning of self and other (1)
- possessives (1)
- post-soviet states (1)
- post-war history (1)
- postcolonialism (1)
- postlexical processes (1)
- posture verb (1)
- posture verbs (1)
- practical contexts (1)
- practical reasoning (1)
- pragmatic focus (1)
- praxeological context (1)
- pre-school choice (1)
- predication (1)
- predicative adjectives (1)
- prediction error (1)
- predictive approach (1)
- prefabs (1)
- preface (1)
- prejudice and discrimination (1)
- preposition (1)
- preposition-noun combinations (1)
- preposition-pronoun contraction (PPC) (1)
- prepositional clause (1)
- prepositional object clauses (1)
- prepositional object construction (1)
- prescriptive (1)
- present (1)
- presentation (1)
- presidential debate (1)
- pretend play frame (1)
- preterite (1)
- prevalence (1)
- primary research data repository (1)
- print lexicography (1)
- prior talk (1)
- privative adjectives comprehension (1)
- probabilistic approach (1)
- processing fluency (1)
- processing load (1)
- processing pipeline (1)
- product feature extraction (1)
- productivity (1)
- productivity measures (1)
- progressive (1)
- progressive aspect (1)
- prohibitive (1)
- prohibitive markers (1)
- project report (1)
- projective mechanism (1)
- promotion of junior researchers (1)
- pronominal agreement (1)
- pronouns (1)
- pronunciation (1)
- proof checking (1)
- proportional font (1)
- proposing (1)
- propositional argument (1)
- prosodic constituency (1)
- prosodic form (1)
- prosodic organization (1)
- prosodic word (pword) (1)
- prospective possession (1)
- proverb (1)
- pseudonymisation (1)
- psychoanalysis (1)
- psychodiagnostic interview (1)
- psycholinguistics (1)
- public discourse (1)
- public mediation (1)
- public/political discourse (1)
- publishing model (1)
- quality (1)
- quality checking (1)
- quality evaluation (1)
- quantitative analysis (1)
- quantitative and qualitative methods (1)
- quantitative linguistics (1)
- quantitative quality metrics (1)
- quantitative typology (1)
- query building (1)
- query language (1)
- query languages (1)
- question (1)
- question under discussion (1)
- question-word questions (1)
- questioning sequences (1)
- questionnaire (1)
- raising (1)
- random forests (1)
- rape myth acceptance (1)
- rapid serial visual presentation (1)
- rating scales (1)
- reading speed (1)
- reading strategies (1)
- reading strategy (1)
- reading time (1)
- realia (1)
- reanalysis (1)
- reciprocity (1)
- recollection (1)
- recommendation system (1)
- recommender (1)
- recording (1)
- recruitment (1)
- recursos (1)
- redress (1)
- reduplication construction (1)
- reference corpus (1)
- reference dictionary (1)
- reference resolution (1)
- reference tools (1)
- referencing strategies (1)
- referendum (1)
- referentiality (1)
- reflexivity (1)
- regional languages (1)
- regional phonetic variation (1)
- regional variation (1)
- register (1)
- regressions (1)
- rehearsals (1)
- relaciones de respuesta (1)
- relation (1)
- relation registry (1)
- relational database (1)
- relationship satisfaction (1)
- reliability (1)
- reminders (1)
- repair sequences (1)
- repair-initiation (1)
- repatriation (1)
- repositories (1)
- repository (1)
- representación semántica superficial (1)
- request sequences (1)
- requesting examples (1)
- research infrastructures (1)
- research literature (1)
- research methods (1)
- research overview (1)
- research report (1)
- research reports (1)
- research tools (1)
- resources (1)
- respondent (1)
- response latency (1)
- retro-digitization (1)
- retro-digitized dictionaries (1)
- retro-gedigitaliseerde woordeboeke (1)
- reusability of research data (1)
- revision (1)
- revitalization of endangered languages (1)
- rhetoric (1)
- rhetorical device (1)
- rhetorical structure (1)
- right-dislocation (1)
- role decomposition (1)
- role prototypicality (1)
- romantic relationship (1)
- routines (1)
- rule enforcement (1)
- rule formulations (1)
- saami languages (1)
- sanctioning (1)
- sans-serif (1)
- scalar rhetoric (1)
- schema.org (1)
- schematicity (1)
- school choice (1)
- schwa (1)
- scientific communication (1)
- screen-based interaction (1)
- search (1)
- search engine (1)
- search strategies (1)
- search systems (1)
- search technology (1)
- second pair part (1)
- second position (1)
- selection of textual sources (1)
- self (1)
- self-paced reading (1)
- self-reflection (1)
- self-regulated learning (1)
- semantic change (1)
- semantic classification (1)
- semantic extension (1)
- semantic frames (1)
- semantic information management (1)
- semantic interoperability (1)
- semantic map (1)
- semantic network (1)
- semantic predictability (1)
- semantic presence/absence (1)
- semantic processing (1)
- semantic relatedness (1)
- semantic reversal anomalies (1)
- semantic role (1)
- semantische Analyse (1)
- semiotic mediation (1)
- semiotic resource (1)
- semiotics (1)
- sentence boundary detection (1)
- sentiment (1)
- sentiment polarity (1)
- separation of adjectives (1)
- sequence (1)
- sequence of tense (1)
- sequential analysis (1)
- sequential organization (1)
- service integration (1)
- service interoperability (1)
- service provider (1)
- sexual harassment (1)
- shallow semantic representation (1)
- shared courses of action (1)
- shared gameplay (1)
- shared meaning (1)
- shared task (1)
- sharing data (1)
- sign language resources (1)
- signs (1)
- silences (1)
- simplification (1)
- single player games (1)
- single word borrowings (1)
- sintaksis (1)
- situational involvement (1)
- skills training (1)
- small clause (1)
- smile (1)
- social action format (1)
- social categorization (1)
- social cognition (1)
- social coordination (1)
- social grammar (1)
- social identity theory (1)
- social integration (1)
- social judgment (1)
- social media (1)
- social media interaction (1)
- social media neologisms (1)
- social media storytelling (1)
- social perception (1)
- social relevance (1)
- social roles (1)
- social rules (1)
- social sanctioning (1)
- social topography (1)
- societal inclusion (1)
- societal multilingualism (1)
- socio-spatial positioning (1)
- sociocultural situatedness (1)
- sociolinguistic ethnolinguistic variation (1)
- sociolinguistics (1)
- soft governance (1)
- software tools (1)
- solution-oriented questions (1)
- sostenibilidad (1)
- soveltava kielentutkimus (1)
- soveltava kielitiede (1)
- sowieso <Lemma> (1)
- soziale Interaktion (1)
- space-delimited languages (1)
- speaker variability (1)
- speakership (1)
- speaking machine (1)
- specialised languages (1)
- specialist corpora (1)
- specialized dictionary (1)
- specialized knowledge (1)
- specialized language (1)
- specificational copular clauses (1)
- spectating (1)
- speech act verb (1)
- speech communities (1)
- speech content grouping (1)
- speech corpora (1)
- speech data (1)
- speech database (1)
- speech segmentation (1)
- speech signal processing (1)
- speech technology (1)
- speech thought writing representation (1)
- speed-curvature relation (1)
- spelling reform (1)
- spoken (colloquial) standard (1)
- spoken Arabic (1)
- spoken German in interaction (1)
- spoken corpora (1)
- spoken language transcripts (1)
- spoken syntax (1)
- spoken vs. written (1)
- stance (1)
- stance management (1)
- standardisation (1)
- standardology (1)
- standards for LRs (1)
- standoff annotation (1)
- state change (1)
- statistical complexity (1)
- statistical significance (1)
- status (1)
- stereotype content model (1)
- strategic reading (1)
- strategy (1)
- strategy ascription (1)
- structural information (1)
- sub-grammar extraction (1)
- subextraction (1)
- subject island (1)
- subject-to-object-raising (1)
- subjectification (1)
- subjective comprehensibility (1)
- subtraction (1)
- subtraction neglect (1)
- survey design (1)
- suspension (1)
- sustainability (1)
- sustainable archives (1)
- swing vote (1)
- syllable (1)
- syllable duration (1)
- symbolic prosody prediction (1)
- synonymity (1)
- synonymy (1)
- syntactic competence (1)
- syntactic extensions (1)
- syntactic processing (1)
- syntactical level (1)
- syntactico-semantic argument structure (1)
- syntax-semantics interface (1)
- systemisation (1)
- task-evoked pupillary responses (1)
- technical neologisms (1)
- technologieunterstütztes Lernen (1)
- technology use (1)
- technology watch (1)
- teksproduksie (1)
- teksresepsie (1)
- tele-presence (1)
- telephone interpreting (1)
- telicity (1)
- temporal organization (1)
- temporal phraseological units (1)
- temporality (1)
- tentative taxonomy (1)
- term (1)
- term base exchange format (1)
- terminography (1)
- terminological neologism (1)
- terminological structurer (1)
- terminology visualisation (1)
- test (1)
- text (1)
- text analysis (1)
- text analytics (1)
- text categorization (1)
- text complexity (1)
- text parsing (1)
- text reception (1)
- text-to-speech (1)
- thanking (1)
- that (1)
- theater rehearsals (1)
- theory and practice (1)
- therapeutic alliance (1)
- there (1)
- third position (1)
- third-position repair (1)
- time reckoning (1)
- time windows and constants (1)
- time-series analysis (1)
- timing of turn-taking (1)
- tipologie (1)
- toegangstruktuur (1)
- top-down (1)
- topic drop (1)
- topic management (1)
- topic models (1)
- topic shift (1)
- topical event (1)
- topicalization (1)
- topologisches Feldermodell (1)
- tourism (1)
- traduction de prêt (1)
- traffic (1)
- training software (1)
- transcripción (1)
- translation exercises (1)
- translation studies (1)
- translation tools (1)
- translators (1)
- transmission problem (1)
- travel guides (1)
- treebank (1)
- trends (1)
- trosanalise (1)
- trouble sources (1)
- turn competition (1)
- turn design (1)
- turn-design (1)
- turn-final particles (1)
- tutkimusaineistot (1)
- tutkimusmenetelmät (1)
- type frequency (1)
- uncertainty (1)
- uncertainty avoidance (1)
- under-resourced language (1)
- under-resourced language varieties (1)
- underspecification (1)
- understanding in interaction (1)
- uniform information density (1)
- unregistered words (1)
- unrestricted dialog (1)
- urban youth language (1)
- usability (1)
- usability study (1)
- usage labels (1)
- use cases (1)
- user behavior (1)
- user communities (1)
- user interface design (1)
- user preference (1)
- user research (1)
- user satisfaction (1)
- user studies (1)
- user support (1)
- user survey (1)
- user-centred design (1)
- user-generated content (1)
- utterance interpretation (1)
- valency changes (1)
- variable analysis (1)
- variasie (1)
- variation management (1)
- varieties (1)
- vehicular language (1)
- verb valency (1)
- verb-argument linking (1)
- verbale Interaktion (1)
- verbsemantik (1)
- vernacular lexicography (1)
- verwantskapsterminologie (1)
- very large corpora (1)
- video (1)
- video-mediated interactions (1)
- videogames (1)
- violation (1)
- virtual corpus (1)
- virtual embodiment (1)
- virtual worlds (1)
- visibility of ritual meaning (1)
- visual world paradigm (1)
- visualisering (1)
- visually impaired children (1)
- vocabulary (1)
- vocabulary growth (1)
- vocabulary of quotation expressions (1)
- vocabulary organization in dictionaries (1)
- voice (1)
- voice messages (1)
- volition (1)
- vowels (1)
- wabi sabi (1)
- warmth (1)
- ways of spectating (1)
- weakening (1)
- web application (1)
- web crawling (1)
- web data (1)
- web service (1)
- web-based information system (1)
- web-based platform (1)
- websites (1)
- wh-movement (1)
- widget (1)
- widget store (1)
- wir (1)
- wisdom of the crowd (1)
- wollen (1)
- women (1)
- woordeboeke as sosiale werktuie (1)
- woordeboekontwerp (1)
- word (1)
- word embedding (1)
- word family database (1)
- word formation in German (1)
- word frequency distribution (1)
- word history (1)
- word meaning relationship (1)
- word recognition (1)
- word segmentation (1)
- word selection (1)
- word sense alignment (1)
- word trees (1)
- word-level alignment (1)
- word-sense disambiguation (1)
- worship (1)
- writing (1)
- writing support tool (1)
- youth (1)
- zipf (1)
- zipf-mandelbrot (1)
- Ägyptisch (1)
- Ähnlichkeitssuche (1)
- Öffentliche Meinung (1)
- Öffentlichkeit (1)
- überhaupt <Lemma> (1)
- żeby (1)
- комунікативна компетентність (1)
- комунікативна девіація (невдача) (1)
- комунікативні девіації (1)
- міжкультурна комунікація (1)
- німецька мова (1)
- німецька мова як іноземна (1)
- політичне відеоінтерв’ю (1)
- респондент (1)
- українська мова як іноземна (1)
- українська мова (1)
Publicationstate
- Veröffentlichungsversion (961)
- Zweitveröffentlichung (248)
- Postprint (236)
- Ahead of Print (6)
- Preprint (5)
- Erstveröffentlichung (2)
Reviewstate
- Peer-Review (828)
- (Verlags)-Lektorat (410)
- Peer-review (24)
- Qualifikationsarbeit (Dissertation, Habilitationsschrift) (18)
- Verlags-Lektorat (14)
- Peer-Revied (8)
- Review-Status-unbekannt (6)
- Abschlussarbeit (Bachelor, Master, Diplom, Magister) (Bachelor, Master, Diss.) (3)
- (Verlags-)Lektorat (2)
- Peer review (2)
Publisher
- de Gruyter (104)
- Benjamins (87)
- IDS-Verlag (81)
- Springer (63)
- European Language Resources Association (ELRA) (56)
- Association for Computational Linguistics (46)
- European Language Resources Association (42)
- Oxford University Press (35)
- Elsevier (33)
- Institut für Deutsche Sprache (33)
Ungoliant: An optimized pipeline for the generation of a very large-scale multilingual web corpus
(2021)
Since the introduction of large language models in Natural Language Processing, large raw corpora have played a crucial role in Computational Linguistics. However, most of these large raw corpora are either available only for English or not available to the general public due to copyright issues. Nevertheless, there are some examples of freely available multilingual corpora for training Deep Learning NLP models, such as the OSCAR and Paracrawl corpora. However, they have quality issues, especially for low-resource languages. Moreover, recreating or updating these corpora is very complex. In this work, we try to reproduce and improve the goclassy pipeline used to create the OSCAR corpus. We propose a new pipeline that is faster, modular, parameterizable, and well documented. We use it to create a corpus similar to OSCAR but larger and based on recent data. Also, unlike OSCAR, the metadata information is at the document level. We release our pipeline under an open source license and publish the corpus under a research-only license.
The aim of this work is to describe the criteria used for the inclusion and treatment of neologisms in dictionaries of Spanish within the framework of pandemic instability. Our starting point is data obtained by the Antenas Neológicas Network (https://www.upf.edu/web/antenas). We analyze how these data are represented in three different lexicographic tools in order to identify problems in the methodology used to dictionarize neologisms during the COVID-19 pandemic, that is, how and which words were selected for inclusion in dictionaries and how they were represented in their entries (sources and corpora of analysis, selection criteria, types of definition, among other aspects). Two of the tools are monolingual, and COVID-19 lexical units were included as part of their updates: the Antenario, a dictionary of neologisms of Spanish varieties, and the Diccionario de la Lengua Española [DLE], a dictionary of general Spanish published by the Real Academia Española [RAE] (Spanish Royal Academy). The third is a bilingual unidirectional English-Spanish dictionary first published as a glossary, the Diccionario de COVID-19 EN-ES [TREMEDICA], entirely made up of neological and non-neological lexical units related to the virus and the pandemic. Thus, the target lexis was either included in existing works or makes up the whole of a new tool located in a portal together with other lexicographic tools. Unlike other collections of COVID-19 vocabulary that kept cropping up as the pandemic unfolded, all three have been designed and written according to well-established lexicographic practices.
Our working hypothesis is that the need to record and define recently created words affects the criteria for the inclusion and treatment of neologisms in dictionaries of Spanish, leading to a certain degree of overlap in features that are traditionally thought to be specific to each type of dictionary.
The annual microcensus provides Germany’s most important official statistics. Unlike a census, it does not cover the whole population but a representative 1% sample of it. In 2017, the German microcensus asked a question on the language of the population, i.e. ‘Which language is mainly spoken in your household?’ Unfortunately, the question, its design and its position within the microcensus questionnaire have several shortcomings. The main shortcoming is that the question cannot capture multilingual repertoires. We therefore recommend improving the microcensus language question: first and foremost, the question (i.e. its wording, design, and answer options) should make it possible to count multilingual repertoires.
This paper explores how attitudes affect the seemingly objective process of counting speakers of varieties using the example of Low German, Germany’s sole regional language. The initial focus is on the basic taxonomy of classifying a variety as a language or a dialect. Three representative surveys then provide data for the analysis: the Germany Survey 2008, the Northern Germany Survey 2016, and the Germany Survey 2017. The results of these surveys indicate that there is no consensus concerning the evaluation of Low German’s status and that attitudes towards Low German are related to, for example, proficiency in the language. These attitudes are shown to matter when counting speakers of Low German and investigating the status it has been accorded.
Language attitudes matter; they influence people’s behaviour and decisions. Therefore, it is crucial to learn more about patterns in the way that languages are evaluated. One means of doing so is using a quantitative approach with data representative of a whole population, so that results mirror dispositions at a societal level. This kind of approach is adopted here, with a focus on the situation in Germany. The article consists of two parts. First, I will present some results of a new representative survey on language attitudes in Germany (the Germany Survey 2017). Second, I will show how language attitudes penetrate even seemingly objective data collection processes by examining the German Microcensus. In 2017, for the first time in eighty years, the German Microcensus included a question on language use ‘at home’. Unfortunately, however, the question was clearly tainted by language attitudes instead of being objective. As a result, the Microcensus significantly misrepresents the linguistic reality of different migrant languages spoken in Germany.
Germany's (single) national official language is German. The dominance of German in schools, politics, the legal system, administration and the entire written public domain is so great that for a long time the lack of a coherent language policy was not seen as a problem. State restraint in this area is due, on the one hand, to historical reasons; on the other hand, it has been promoted by the federal system in Germany, which grants the federal states far-reaching responsibilities in the fields of education and culture. More recently, multilingualism among the population has increased, resulting in a growing interest in understanding the language situation in Germany and, in particular, in taking a closer look at the different minority languages. In 2017, for the first time in about 80 years, there is a question on the language of the population in the German microcensus. The Institute for the German Language has also carried out various representative surveys; in the winter of 2017/2018, a large representative survey with questions on the language repertoire and language attitudes is in the field.
Who understands Low German today and who can speak it? Who makes use of media and cultural events in Low German? What images do people in northern Germany associate with Low German and what is their view of their regional language?
These and further questions are answered in this brochure with the help of representative data collected in a telephone survey of a total of 1,632 people from eight federal states (Bremen, Hamburg, Lower Saxony, Mecklenburg-West Pomerania and Schleswig-Holstein as well as Brandenburg, North Rhine-Westphalia and Saxony-Anhalt).
This paper outlines the generation process of a specific computational linguistic representation termed the Multilingual Time Map, conceptually a multi-tape finite-state transducer encoding linguistic data at different levels of granularity. The first component acquires phonological data from syllable-labeled speech data, the second component defines feature profiles, the third component generates feature hierarchies and augments the acquired data with the defined feature profiles, and the fourth component displays the Multilingual Time Map as a graph.
Although the N400 was originally discovered in a paradigm designed to elicit a P300 (Kutas and Hillyard, 1980), its relationship with the P300 and how both overlapping event-related potentials (ERPs) determine behavioral profiles is still elusive. Here we conducted an ERP (N = 20) and a multiple-response speed-accuracy tradeoff (SAT) experiment (N = 16) on distinct participant samples using an antonym paradigm (The opposite of black is white/nice/yellow with acceptability judgment). We hypothesized that SAT profiles incorporate processes of task-related decision-making (P300) and stimulus-related expectation violation (N400). We replicated previous ERP results (Roehm et al., 2007): in the correct condition (white), the expected target elicits a P300, while both expectation violations engender an N400 [reduced for related (yellow) vs. unrelated targets (nice)]. Using multivariate Bayesian mixed-effects models, we modeled the P300 and N400 responses simultaneously and found that correlation between residuals and subject-level random effects of each response window was minimal, suggesting that the components are largely independent. For the SAT data, we found that antonyms and unrelated targets had a similar slope (rate of increase in accuracy over time) and an asymptote at ceiling, while related targets showed both a lower slope and a lower asymptote, reaching only approximately 80% accuracy. Using a GLMM-based approach (Davidson and Martin, 2013), we modeled these dynamics using response time and condition as predictors. Replacing the predictor for condition with the averaged P300 and N400 amplitudes from the ERP experiment, we achieved identical model performance. We then examined the piecewise contribution of the P300 and N400 amplitudes with partial effects (see Hohenstein and Kliegl, 2015). 
Unsurprisingly, the P300 amplitude was the strongest contributor to the SAT-curve in the antonym condition and the N400 was the strongest contributor in the unrelated condition. In brief, this is the first demonstration of how overlapping ERP responses in one sample of participants predict behavioral SAT profiles of another sample. The P300 and N400 reflect two independent but interacting processes and the competition between these processes is reflected differently in behavioral parameters of speed and accuracy.
In this paper we examine the composition and interactional deployment of suspended assessments in ordinary German conversation. We define suspended assessments as lexicosyntactically incomplete assessing TCUs that share a distinct cluster of prosodic-phonetic features which auditorily makes them come off as 'left hanging' rather than cut-off (e.g., Schegloff/Jefferson/Sacks 1977; Jasperson 2002) or trailing-off (e.g., Local/Kelly 1986; Walker 2012). Using CA/IL methodology (Couper-Kuhlen/Selting 2018) and drawing on a large body of video-recorded face-to-face conversations, we highlight the verbal, vocal and bodily-visual resources participants use to render such unfinished assessing TCUs recognizably incomplete and identify six recurrent usage types. Overall, the suspension of assessing TCUs appears to either serve as a practice for circumventing the production of assessments that are interactionally inapposite, or as a practice for coping with local contingencies that render the very doing of an assessment problematic for the speaker. Data are in German with English translations.
Preface
(2019)
Preface
(2020)
Physicists look at language
(2006)
This paper aims at verifying whether the most important online Brazilian Portuguese dictionaries include some of the neologisms identified in texts published in the 1990s to 2000s, formed with the elements ciber-, e-, bio-, eco- and narco-, which we refer to as fractomorphemes / fracto-morphèmes. Three online dictionaries were analyzed (Aulete, Houaiss and Michaelis), as well as Vocabulário Ortográfico da Língua Portuguesa (VOLP). We were able to conclude that all three dictionaries and VOLP include neologisms with these elements; Michaelis and VOLP do not include separate entries for bound morphemes, whereas Houaiss includes entries for all of them and Aulete includes entries for bio-, eco- and narco-. Aulete also describes the neological meaning of eco- and narco-, whereas Houaiss does not.
This White Paper sets out commonly agreed definitions on activities of consortia within NFDI. It aims to provide a common basis for reporting and reference regarding selected questions of cross-consortial relevance in DFG’s template for the Interim Reports. The questions were prioritised by an NFDI Task Force on Evaluation and Reporting (formerly Task Force Monitoring) as a result of discussing possible answers to the DFG template. In this process, the need to agree on a generalizable meaning of terms commonly used in the context of NFDI, and in reporting in particular, was identified from cross-consortial perspectives. Questions that showed the greatest need for clarification are discussed in this White Paper. As NFDI evolves, the Task Force will likely propose further joint approaches for reporting in information infrastructures.
While each is of broad relevance, the questions addressed relate to substantially different aspects of the consortia’s work. They are thus also structured slightly differently.
Collaborative work in NFDI
(2023)
The non-profit association National Research Data Infrastructure (NFDI) promotes science and research through a National Research Data Infrastructure. Its aim is to develop and establish overarching research data management (RDM) for Germany and to increase the efficiency of the entire German science system. After a two-and-a-half-year build-up phase, the process of adding new consortia, each representing a different data domain, ended in March 2023. NFDI now has 26 disciplinary consortia (and one additional basic service collaboration). The full extent of cross-consortial interaction is now beginning to show.
The automatic recognition of idioms poses a challenging problem for NLP applications. Whereas native speakers can intuitively handle multiword expressions whose compositional meanings are hard to trace back to individual word semantics, there is still ample scope for improvement regarding computational approaches. We assume that idiomatic constructions can be characterized by gradual intensities of semantic non-compositionality, formal fixedness, and unusual usage context, and introduce a number of measures for these characteristics, comprising count-based and predictive collocation measures together with measures of context (un)similarity. We evaluate our approach on a manually labelled gold standard, derived from a corpus of German pop lyrics. To this end, we apply a Random Forest classifier to analyze the individual contribution of features for automatically detecting idioms, and study the trade-off between recall and precision. Finally, we evaluate the classifier on an independent dataset of idioms extracted from a list of Wikipedia idioms, achieving state-of-the-art accuracy.
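As a rough illustration of what a count-based collocation measure can look like, the sketch below computes pointwise mutual information (PMI) for adjacent word pairs. PMI is used here only because it is a familiar example; the abstract does not state which measures were actually used, and the function name `bigram_pmi` and the unsmoothed probability estimates are assumptions made for this sketch.

```python
import math
from collections import Counter


def bigram_pmi(tokens):
    """Return {(w1, w2): PMI} for all adjacent word pairs in a token list.

    PMI(w1, w2) = log2( p(w1, w2) / (p(w1) * p(w2)) ), with probabilities
    estimated from raw counts (no smoothing; an assumption of this sketch).
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)

    scores = {}
    for (w1, w2), count in bigrams.items():
        p_pair = count / n_bi
        p_w1 = unigrams[w1] / n_uni
        p_w2 = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))
    return scores
```

Higher scores mark pairs that co-occur more often than their individual frequencies would suggest; in a setting like the one described, such scores could serve as one feature among several for a classifier.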
In order to differentiate between figurative and literal usage of verb-noun combinations for the shared task on the disambiguation of German Verbal Idioms issued for KONVENS 2021, we apply and extend an approach originally developed for detecting idioms in a dataset consisting of random ngram samples. The classification is done by implementing a rather shallow, statistics-based pipeline without intensive preprocessing and examinations on the morphosyntactic and semantic level. We describe the overall approach, the differences between the original dataset and the dataset of the KONVENS task, provide experimental classification results, and analyse the individual contributions of our feature sets.
This study investigates cross-language differences in pitch range and variation in four languages from two language groups: English and German (Germanic) and Bulgarian and Polish (Slavic). The analysis is based on large multi-speaker corpora (48 speakers for Polish, 60 for each of the other three languages). Linear mixed models were computed that include various distributional measures of pitch level, span and variation, revealing characteristic differences across languages and between language groups. A classification experiment based on the relevant parameter measures (span, kurtosis and skewness values for pitch distributions for each speaker) succeeded in separating the language groups.
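The per-speaker parameters named above (span, skewness and kurtosis of a pitch distribution) can be sketched as follows. This is a minimal illustration, not the study's procedure: the semitone conversion, the use of the full max-min range as span, and the population (uncorrected) moment estimates are all assumptions, and the function name is hypothetical.

```python
import math


def pitch_distribution_measures(f0_hz):
    """Summarise one speaker's f0 samples (in Hz, voiced frames only).

    Returns span (max - min, in semitones), skewness and excess kurtosis
    of the semitone-scaled distribution.
    """
    if len(f0_hz) < 2:
        raise ValueError("need at least two f0 samples")

    # Semitones relative to 1 Hz, so that distances are perceptually scaled.
    st = [12.0 * math.log2(f) for f in f0_hz]
    n = len(st)
    mean = sum(st) / n

    # Central moments (population estimates, no bias correction).
    m2 = sum((x - mean) ** 2 for x in st) / n
    m3 = sum((x - mean) ** 3 for x in st) / n
    m4 = sum((x - mean) ** 4 for x in st) / n
    if m2 == 0:
        raise ValueError("f0 values are constant; moments are undefined")

    return {
        "span_st": max(st) - min(st),
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2 - 3.0,  # excess kurtosis
    }
```

Feature vectors of this kind, one per speaker, are the sort of input a classification experiment on language groups could take.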
This study presents the results of a large-scale comparison of various measures of pitch range and pitch variation in two Slavic (Bulgarian and Polish) and two Germanic (German and British English) languages. The productions of twenty-two speakers per language (eleven male and eleven female) in two different tasks (read passages and number sets) are compared. Significant differences between the language groups are found: German and English speakers use lower pitch maxima, narrower pitch span, and generally less variable pitch than Bulgarian and Polish speakers. These findings support the hypothesis that linguistic communities tend to be characterized by particular pitch profiles.
Based on specific linguistic landmarks in the speech signal, this study investigates pitch level and pitch span differences in English, German, Bulgarian and Polish. The analysis is based on 22 speakers per language (11 males and 11 females). Linear mixed models were computed that include various linguistic measures of pitch level and span, revealing characteristic differences across languages and between language groups. Pitch level appeared to have significantly higher values for the female speakers in the Slavic than the Germanic group. The male speakers showed slightly different results, with only the Polish speakers displaying significantly higher mean values for pitch level than the German males. Overall, the results show that the Slavic speakers tend to have a wider pitch span than the German speakers. But for the linguistic measure, namely for span between the initial peaks and the non-prominent valleys, we only find the difference between Polish and German speakers. We found a flatter intonation contour in German than in Polish, Bulgarian and English male and female speakers and differences in the frequency of the landmarks between languages. Concerning “speaker liveliness” we found that the speakers from the Slavic group are significantly livelier than the speakers from the Germanic group.
New KARL (Knowledge Acquisition and Representation Language) allows all parts of a problem-solving method (PSM) to be specified. It is a formal language with well-defined semantics and thus allows PSMs to be represented precisely and unambiguously while abstracting from implementation detail. This paper shows how the language KARL has been modified and extended to New KARL to better meet the needs of representing PSMs. Based on a conceptual structure of PSMs, new language primitives are introduced for KARL to specify such a conceptual structure and to support the configuration of methods. An important goal of this extension was to preserve three important properties of KARL: to be (i) a conceptual, (ii) a formal, and (iii) an executable language.
This poster summarizes the results of the CLARIAH-DE Work Package 3: Skills Training and Promotion of Junior Researchers.
For a research field that is characterised by rapid technical development, CLARIAH-DE has to include, as part of its objective, the promotion of the data literacy necessary for the efficient use of this digital research infrastructure. To develop, consolidate and refine a common programme in this area, Work Package 3 set itself the following sub-goals:
- Consolidation of the activities from the previous projects into a joint service
- Cataloguing and reflecting on the methods and tools used in the research field, with the aim of identifying remaining gaps
- Skills training of, individual support for and the promotion of junior researchers
An ongoing academic and research program, the “Vocabula Grammatica” lexicon, implemented by the Centre for the Greek Language (Thessaloniki, Greece), aims at lemmatizing all the philological, grammatical, rhetorical, and metrical terms in the written texts of scholars (philologists and scholiasts) who curated the ancient Greek literature from the beginning of the Hellenistic period (4th/3rd c. BC) until the end of the Byzantine era (15th c. AD). In particular, it aspires to fill serious gaps (a) in the study of ancient Greek scholarship and (b) in the lexicography of the ancient Greek language and literature. By providing specific examples, we will highlight the typical and methodological features of the forthcoming dictionary.
In this paper, we describe a data processing pipeline used for annotated spoken corpora of Uralic languages created in the INEL (Indigenous Northern Eurasian Languages) project. With this processing pipeline we convert the data into a loss-less standard format (ISO/TEI) for long-term preservation while simultaneously enabling a powerful search in this version of the data. For each corpus, the input we are working with is a set of files in EXMARaLDA XML format, which contain transcriptions, multimedia alignment, morpheme segmentation and other kinds of annotation. The first step of processing is the conversion of the data into a certain subset of TEI following the ISO standard ’Transcription of spoken language’ with the help of an XSL transformation. The primary purpose of this step is to obtain a representation of our data in a standard format, which will ensure its long-term accessibility. The second step is the conversion of the ISO/TEI files to a JSON format used by the “Tsakorpus” search platform. This step allows us to make the corpora available through a web-based search interface. As an addition, the existence of such a converter allows other spoken corpora with ISO/TEI annotation to be made accessible online in the future.
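The second step of such a pipeline (ISO/TEI to a search-platform JSON) can be sketched roughly as below. This is only an illustration under stated assumptions: it reads `<u>` and `<w>` elements in the TEI namespace, and the output layout (`sentences`, `text`, `words`, `wf`, `wtype`) is a simplified, hypothetical structure chosen for the sketch rather than the documented Tsakorpus format. The first step (EXMARaLDA to ISO/TEI via XSLT) is not shown.

```python
import json
import xml.etree.ElementTree as ET

# TEI namespace used by ISO/TEI transcription files.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}


def tei_utterances_to_json(tei_xml):
    """Convert the <u> elements of a TEI document into a JSON string.

    Each utterance becomes one "sentence" with its word tokens.
    The JSON field names are assumptions made for this sketch.
    """
    root = ET.fromstring(tei_xml)
    sentences = []
    for u in root.iterfind(".//tei:u", TEI_NS):
        words = [
            w.text.strip()
            for w in u.iterfind(".//tei:w", TEI_NS)
            if w.text and w.text.strip()
        ]
        sentences.append(
            {
                "text": " ".join(words),
                "words": [{"wf": wf, "wtype": "word"} for wf in words],
            }
        )
    return json.dumps({"sentences": sentences}, ensure_ascii=False)
```

A real converter would also carry over time alignment, morpheme segmentation and the other annotation layers mentioned above; the sketch keeps only word forms to show the overall shape of the transformation.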
This paper presents the QUEST project and describes concepts and tools that are being developed within its framework. The goal of the project is to establish quality criteria and curation criteria for annotated audiovisual language data. Building on existing resources developed earlier by the participating institutions, QUEST develops tools that could be used to facilitate and verify adherence to these criteria. An important focus of the project is making these tools accessible for researchers without substantial technical background and helping them produce high-quality data. The main tools we intend to provide are the depositors’ questionnaire and automatic quality assurance, both developed as web applications. They are accompanied by a knowledge base, which will contain recommendations and descriptions of best practices established in the course of the project. Conceptually, we split linguistic data into three resource classes (data deposits, collections and corpora). The class of a resource defines the strictness of the quality assurance it should undergo. This division is introduced so that overly strict quality criteria do not prevent researchers from depositing their data.
This paper presents the QUEST project and describes concepts and tools that are being developed within its framework. The goal of the project is to establish quality criteria and curation criteria for annotated audiovisual language data. Building on existing resources developed earlier by the participating institutions, QUEST also develops tools that could be used to facilitate and verify adherence to these criteria. An important focus of the project is making these tools accessible for researchers without substantial technical background and helping them produce high-quality data. The main tools we intend to provide are a questionnaire and automatic quality assurance for depositors of language resources, both developed as web applications. They are accompanied by a knowledge base, which will contain recommendations and descriptions of best practices established in the course of the project. Conceptually, we consider three main data maturity levels in order to decide on a suitable level of strictness of the quality assurance. This division has been introduced so that a set of ideal quality criteria does not prevent researchers from depositing or even assessing their (legacy) data. The tools described in the paper are work in progress and are expected to be released by the end of the QUEST project in 2022.
The CMDI Explorer
(2020)
We present the CMDI Explorer, a tool that empowers users to easily explore the contents of complex CMDI records and to process selected parts of them with little effort. The tool allows users, for instance, to analyse virtual collections represented by CMDI records, and to send collection items to other CLARIN services such as the Switchboard for subsequent processing. The CMDI Explorer hence adds functionality that many users felt was lacking from the CLARIN tool space.
CMDI Explorer
(2021)
We present CMDI Explorer, a tool that empowers users to easily explore the contents of complex CMDI records and to process selected parts of them with little effort. The tool allows users, for instance, to analyse virtual collections represented by CMDI records, and to send collection items to other CLARIN services such as the Switchboard for subsequent processing. CMDI Explorer hence adds functionality that many users felt was lacking from the CLARIN tool space.
This technology watch report discusses digital repository solutions in the context of the research infrastructure projects CLARIAH-DE, CLARIN, and DARIAH. It provides an overview of different repository systems, comparing them and discussing their respective applicability from the perspectives of the project partners at the time of writing.
This paper addresses long-term archiving of large corpora. It focuses on three aspects specific to language resources, namely (1) the removal of resources for legal reasons, (2) versioning of (unchanged) objects in constantly growing resources, especially where objects can be part of multiple releases but also part of different collections, and (3) the conversion of data to new formats for digital preservation. We explain why language resources may have to be changed, and why formats may need to be converted. As a solution, the use of an intermediate proxy object called a signpost is suggested. The approach is exemplified with respect to the corpora of the Leibniz Institute for the German Language in Mannheim, namely the German Reference Corpus (DeReKo) and the Archive for Spoken German (AGD).
Signposts for CLARIN
(2020)
An implementation of CMDI-based signposts and its use is presented in this paper. Arnold et al. 2020 present Signposts as a solution to challenges in long-term preservation of corpora, especially corpora that are continuously extended and subject to modification, e.g., due to legal injunctions, but also may overlap with respect to constituents, and may be subject to migrations to new data formats. We describe the contribution Signposts can make to the CLARIN infrastructure and document the design for the CMDI profile.
Signposts for CLARIN
(2021)
An implementation of CMDI-based signposts and its use is presented in this paper. Arnold, Fisseni et al. (2020) present signposts as a solution to challenges in long-term preservation of corpora. Though applicable to digital resources in general, we focus on corpora, especially those that are continuously extended or subject to modification, e.g., due to legal injunctions, but also may overlap with respect to constituents, and may be subject to migrations to new data formats. We describe the contribution signposts can make to the CLARIN infrastructure, notably virtual collections, and document the design for the CMDI profile.
Prominence has been widely studied on the word level and the syllable level. An extensive study comparing the two approaches is missing in the literature. This study investigates how word and syllable prominence relate to each other in German. We find that perceptual ratings based on the word level are more extreme than those based on the syllable level. The correlations between word prominence and acoustic features are greater than the correlations between syllable prominence and acoustic features.
The current paper presents a corpus containing 35 dialogues of spontaneously spoken southern German, including half an hour of articulography for 13 of the speakers. Speakers were seated in separate recording chambers, mimicking a telephone call, and recorded on individual audio channels. The corpus provides manually corrected word boundaries and automatically aligned segment boundaries. Annotations are provided in the Praat format. In addition to audio recordings, speakers filled out a detailed questionnaire, assessing among others their audio-visual consumption habits.
Sound units play a pivotal role in cognitive models of auditory comprehension. The general consensus is that during perception listeners break down speech into auditory words and subsequently phones. Indeed, cognitive speech recognition is typically taken to be computationally intractable without phones. Here we present a computational model trained on 20 hours of conversational speech that recognizes word meanings within the range of human performance (model 25%, native speakers 20–44%), without making use of phone or word form representations. Our model also successfully generates predictions about the speed and accuracy of human auditory comprehension. At the heart of the model is a ‘wide’ yet sparse two-layer artificial neural network with some hundred thousand input units representing summaries of changes in acoustic frequency bands, and proxies for lexical meanings as output units. We believe that our model holds promise for resolving longstanding theoretical problems surrounding the notion of the phone in linguistic theory.
In our study we use the experimental framework of priming to manipulate our subjects’ expectations of syllable prominence in sentences with a well-defined syntactic and phonological structure. It shows that it is possible to prime prominence patterns and that priming leads to significant differences in the judgment of syllable prominence.
The perception of prosodic prominence is influenced by various sources, such as acoustic cues, linguistic expectations and context. We use a generalized additive model and a random forest to model the perceived prominence on a corpus of spoken German. Both models are able to explain over 80% of the variance. While the random forest gives us some insights into the relative importance of the cues, the generalized additive model gives us insights into the interaction between different cues to prominence.
A frequently replicated finding is that higher frequency words tend to be shorter and contain more strongly reduced vowels. However, little is known about potential differences in the articulatory gestures for high vs. low frequency words. The present study made use of electromagnetic articulography to investigate the production of two German vowels, [i] and [a], embedded in high and low frequency words. We found that word frequency differently affected the production of [i] and [a] at the temporal as well as the gestural level. Higher frequency of use predicted greater acoustic durations for long vowels; reduced durations for short vowels; articulatory trajectories with greater tongue height for [i] and more pronounced downward articulatory trajectories for [a]. These results show that the phonological contrast between short and long vowels is learned better with experience, and challenge both the Smooth Signal Redundancy Hypothesis and current theories of German phonology.
Streefkerk defines prominence as the perceptually outstanding parts in spoken language. An optimal rating scale for syllable prominence has not been found yet. This paper evaluates a 4-point, an 11-point, a 31-point, and a continuous scale for the rating of syllable prominence and gives support for scales using a higher number of levels. Priming effects found by Arnold et al. could only be replicated using the 31-point scale.
In previous research we showed that the priming paradigm can be used to significantly alter the prominence ratings of subjects. In that study we only looked at the changes in the subjects’ ratings. In the present study, we analyzed the acoustic parameters of the stimuli used in the priming study and investigated the correlation between prominence ratings and acoustic parameters. The results show that priming has a significant effect on these correlations. The contribution of acoustic features to perceived prominence was found to depend on the prominence pattern. If a dominantly prominent syllable is present in a given utterance, f0 and intensity contribute most to the perceived prominence, while duration contributes most when no syllable is dominantly prominent.
In many European languages, propositional arguments (PAs) can be realized as different types of structures. Cross-linguistically, complex structures with PAs show a systematic correlation between the strength of the semantic bond and the syntactic union (cf. Givón 2001; Wurmbrand/Lohninger 2023). Also, different languages show similarities with respect to the (lexical) licensing of different PAs (cf. Noonan 1985; Givón 2001; Cristofaro 2003 on different predicate types). However, on a more fine-grained level, variation across languages can be observed with respect to both the syntactic-semantic properties of PAs and their licensing and usage. This presentation takes a multi-contrastive view of different types of PAs as syntactic subjects and objects by looking at five European languages: EN, DE, IT, PL and HU. Our goal is to identify the parameters of variation in the clausal domain with PAs and thereby to contribute to a better understanding of the individual language systems on the one hand and the nature of the linguistic variation in the clausal domain on the other hand. Phenomena and Methodology: We investigate the following types of PAs: direct object (DO) clauses (1), prepositional object (PO) clauses (2), subject clauses (3), and nominalizations (4, 5). Additionally, we discuss clause union phenomena (6, 7). The analyzed parameters include, among others, finiteness, linear position of the PA, (non-)presence of a correlative element, (non-)presence of a complementizer, and lexical-semantic class of the embedding verb. The phenomena are analyzed based on corpus data (using mono- and multilingual corpora), experimental data (acceptability judgement surveys) or introspective data.
This article investigates mundane photo taking practices with personal mobile devices in the co-presence of others, as well as “divergent” self-initiated smartphone use, thereby exploring the impact of everyday technologies on social interaction. Utilizing multimodal conversation analysis, we examined sequences in which young adults take pictures of food and drinks in restaurants and cafés. Although everyday interactions are abundant in opportunities for accomplishing food photography as a side activity, our data show that taking pictures is also often prioritized over other activities. Through a detailed sequential analysis of video recordings and dynamic screen captures of mobile devices, we illustrate how photographers orient to the momentary opportunities for and relevance of photo taking, that is, how they systematically organize their photographing with respect to the ongoing social encounter and the (projected) changes in the material environment. We investigate how the participants multimodally negotiate the “mainness” and “sideness” (Mondada, 2014) of situated food photography and describe some particular features of participants’ conduct in moments of mundane multiactivity.
In this chapter, we will investigate smartphone-based showing sequences in everyday social encounters, that is, moments in which a personal mobile device is used for presenting (audio-)visual content to co-present participants. Despite a growing interest in object-centred sequences and mundane technology use, detailed accounts of the sequential, multimodal, and material dimensions of showing sequences are lacking. Based on video data of social interactions in different languages and on the framework of multimodal interaction analysis, this chapter will explore the link between mobile device use and social practices. We will analyse how smartphone showers and their recipients coordinate the manipulation of a technological object with multiple courses of action, and reflect upon the fundamental complexity of this by-now routine joint activity.
The ubiquity of smartphones has been recognised within conversation analysis as having an impact on conversational structures and on the participants’ interactional involvement. However, most of the previous studies have relied exclusively on video recordings of overall encounters and have not systematically considered what is taking place on the device. Due to the personal nature of smartphones and their small displays, onscreen activities are of limited visibility and are thus potentially opaque for both the co-present participants (“participant opacity”) and the researchers (“analytical opacity”). While opacity can be an inherent feature of smartphones in general, analytical opacity might not be desirable for research purposes. This chapter discusses how a recording set-up consisting of static cameras, wearable cameras and dynamic screen captures allowed us to address the analytical opacity of mobile devices. Excerpts from multi-source video data of everyday encounters will illustrate how the combination of multiple perspectives can increase the visibility of interactional phenomena, reveal new analytical objects and improve analytical granularity. More specifically, these examples will emphasise the analytical advantages and challenges of a combined recording set-up with regard to smartphone use as multiactivity, the role of the affordances of the mobile device, and the prototypicality and “naturalness” of the recorded practices.
Introduction
(2023)
A trainable prosodic model called SFC (Superposition of Functional Contours), proposed by Holm and Bailly, is here applied to German intonation. The training material is the publicly available Siemens Synthesis Corpus, which provides spoken utterances for high-quality speech synthesis. We describe the labeling framework and first evaluation results that compare the original prosody of test sentences of this corpus with their prosodic rendering by the proposed model and by state-of-the-art systems available online.
The classification of verbs in Levin's (1993) English Verb Classes and Alternations: A Preliminary Investigation, on the basis of both intuitive semantic grouping and their participation in valence alternations, is often used by the NLP community as evidence of the semantic similarity of verbs (Jing & McKeown 1998; Lapata & Brew 1999; Kohl et al. 1998). In this paper, we compare the Levin classification with the work of the FrameNet project (Fillmore & Baker 2001), where words (not just verbs) are grouped according to the conceptual structures (frames) that underlie them and their combinatorial patterns are inductively derived from corpus evidence. This means that verbs grouped together in FrameNet (FN) might be semantically similar but have different (or no) alternations, and that verbs which share the same alternation might be represented in two different semantic frames.
Playing videogames is a popular social activity; people play videogames in different places, on different media, in different situations, alone or with partners, online or offline. Unsurprisingly, they thereby share space (physically or virtually) with other playing or non-playing people. The special issue investigates through different contexts and settings how non-players become participants of the gaming interaction and how players and non-players co-construct presence. The introduction provides a problem-related context for the individual contributions and then briefly presents them.
This paper investigates situations in French videogame interactions where non-players who share the same physical space as players participate in the gaming activities as spectators. Through a detailed multimodal and sequential analysis, we show that being a spectator is a local achievement of all co-present participants - players and non-players.
Among the many peculiarities of the German tense system which make its description or reconstruction such a difficult task to perform, there is one outstanding stumbling-block, viz. the relation between the - morphologically simple - Preterite and the - compound - (Present-)Perfect. Disregarding problems of variety - in spoken German in the South, the Preterite either doesn't exist or is restricted to modal verbs, e.g.
In the first part of this contribution, we will present, as a starting point for the following discussions, a simple formal language P containing one stative predicate. We will then discuss, on an intuitive level, how a treatment of predicates of change could be conceived, and how the progressive could be rendered in a formal language.
We will then give a formal definition of a language, TP1, based on P, and we will construct a semantics for TP1, which incorporates the ideas discussed.
We present an approach to an aspect of managing complex access scenarios to large and heterogeneous corpora that involves handling user queries that, intentionally or due to the complexity of the queried resource, target texts or annotations outside of the given user’s permissions. We first outline the overall architecture of the corpus analysis platform KorAP, devoting some attention to the way in which it handles multiple query languages, by implementing ISO CQLF (Corpus Query Lingua Franca), which in turn constitutes a component crucial for the functionality discussed here. Next, we look at query rewriting as it is used by KorAP and zoom in on one kind of this procedure, namely the rewriting of queries that is forced by data access restrictions.
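The idea of access-driven query rewriting can be illustrated with a small sketch. Everything below is hypothetical: the dictionary layout of the query, the field name `licence`, and the `rewrites` record are invented for illustration and are not KorAP's actual query serialization or API. The sketch only shows the general technique of injecting a restriction into the user's corpus constraint and recording that the query was changed.

```python
def restrict_query(query, allowed_licences):
    """Return a copy of `query` limited to texts the user may access.

    `query` is a hypothetical dict with a "query" part and an optional
    "corpus" constraint. The access restriction is combined with any
    existing constraint by logical AND, and the change is recorded so
    that it can be reported back to the user.
    """
    restriction = {
        "field": "licence",          # assumed metadata field name
        "op": "in",
        "values": sorted(allowed_licences),
    }

    rewritten = dict(query)  # shallow copy; the input is not modified
    original = query.get("corpus")
    if original is None:
        rewritten["corpus"] = restriction
    else:
        rewritten["corpus"] = {"op": "and", "operands": [original, restriction]}

    # Keep a trace of the rewrite instead of changing the query silently.
    rewritten["rewrites"] = list(query.get("rewrites", [])) + [
        {"reason": "access restriction", "injected": restriction}
    ]
    return rewritten
```

Recording the injected restriction alongside the query is a design choice of this sketch: it lets a front end explain why results differ from what the original query would have matched.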
As the Web ought to be considered as a series of sources rather than as a source in itself, a problem facing corpus construction resides in meta-information and categorization. In addition, we need focused data to shed light on particular subfields of the digital public sphere. Blogs are relevant to that end, especially if the resulting web texts can be extracted along with metadata and made available in coherent and clearly describable collections.
We present a method to identify and document a phenomenon on which there is very little empirical data: German phrasal compounds occurring in the form of as a single token (without punctuation between their components). Relying on linguistic criteria, our approach implies to have an operational notion of compounds which can be systematically applied as well as (web) corpora which are large and diverse enough to contain rarely seen phenomena. The method is based on word segmentation and morphological analysis, it takes advantage of a data-driven learning process. Our results show that coarse-grained identification of phrasal compounds is best performed with empirical data, whereas fine-grained detection could be improved with a combination of rule-based and frequency-based word lists. Along with the characteristics of web texts, the orthographic realizations seem to be linked to the degree of expressivity.
While adjusting to the COVID-19 pandemic, people around the world started to talk about the “new normal” way of life, and they conveyed feelings and thoughts on the topic through social networks and traditional communication channels, resorting to a set of specific linguistic strategies, such as metaphors and neologisms. The vocabulary in different domains and in everyday speech was expanded to accommodate a complex social, cultural, and professional phenomenon of changes. Therefore, this new life gave birth to a new language – the “coronaspeak”. According to Thorne (2020), the “coronaspeak” has three stages: first, it emerged in the way medical aspects were communicated in everyday language; secondly, it occurred when speakers verbalized the experiences they had undergone and “invented their own terms”; finally, this “new” way of speaking emerged in the government and authorities’ jargon, to ensure that the new rules and policies were understood, and that the population adopted socially responsible behaviours.
In this paper, we will focus on the second stage, because we intend to take stock of how speakers communicate and verbalize this new way of living, particularly on social networks. We are also interested in the context in which the neologism – be it a new word, a new meaning, or a new use – emerged, is used, and is understood. To this end, we observe occurrences of the new word(s) on social networks and in dissemination texts (press) and compare them with those that Portuguese digital dictionaries have attested so far. Different criteria regarding the insertion of new units, the inclusion date, and the lexicographic description of the entries in the dictionaries will be debated.
Linguistics is facing the challenge of many other sciences as it continues to grow into increasingly complex subfields, each with its own separate or overarching branches. While linguists are certainly aware of the overall structure of the research field, they cannot follow all developments other than those of their subfields. It is thus important to help specialists but also newcomers alike to bushwhack through evolved or unknown territory of linguistic data. A considerable amount of research data in linguistics is described with metadata. While studies described and published in archived journals and conference proceedings receive a quite homogeneous set of metadata tags — e.g., author, title, publisher —, this does not hold for the empirical data and analyses that underlie such studies. Moreover, lexicons, grammars, experimental data, and other types of resources come in different forms; and to make things worse, their description in terms of metadata is also not uniform, if existing at all. These problems are well-known and there are now a number of international initiatives — e.g., CLARIN, FlareNet, MetaNet, DARIAH — to build infrastructures for managing linguistic resources. The NaLiDa project, funded by the German Research Foundation, aims at facilitating the management and access to linguistic resources originating from German research institutions. In cooperation with the German SFB 833 research center, we are developing a combination of faceted and full-text search to give integrated access through heterogeneous metadata sets. Our approach is supported by a central registry for metadata field descriptors, and a component repository for structured groups of data categories as larger building blocks.
The long road to a historical dictionary of Lower Sorbian. Towards a lexical information system
(2022)
The Sorbian Institute has been taking preparatory steps for a historical-documentary vocabulary information system for Lower Sorbian for about 10 years. To this end, the entire extant written material (16th–21st centuries) of this strongly endangered European minority language is to be systematically evaluated. An attempt made a few years ago to organise and finance the project as a long-term scientific project was not successful in the end. Therefore, it can only be advanced step by step and via some detours. The article informs about the interim status of the project, especially with respect to the creation of a reliable database.
This paper contributes to the growing body of knowledge on current listeners' responses in talk-in-interaction. In particular, it complements earlier findings on double sayings of German JA by describing some additional prosodic-phonetic parameters and a visual feature of its realization in institutional and semi-private interaction (doctor-patient interaction, Big Brother, TV talk shows). These include pitch contour, pitch range and phonetic ending, on the one hand, and nodding on the other. The paper shows that JAJA is a truly multimodal phenomenon, with the individual features accomplishing interactional functions across sequence-organizational habitats, including (re)claiming epistemic priority in an aside, making continuation relevant, agreeing/acknowledging with reservation and aligning with the continuation of a sequence. Lack of nodding is suggested to have situational as well as misalignment reasons. On the basis of its observations, the paper also raises the question whether it is the applicability of response token variants across action and sequence types which makes them memorizable despite their variability.
This paper presents the concept of the "participant perspective" as an approach to the study of spoken language. It discusses three aspects of this concept and shows that they can offer helpful tools in spoken language research. Employing the participant perspective provides us with an alternative to many of the approaches currently in use in the study of spoken language in that it favours small-scale, qualitative research that aims to uncover categories relevant for the participants. Its results can usefully complement large-scale studies of phenomena on all linguistic dimensions of talk.
Contrasting and turn transition: Prosodic projection with the parallel-opposition constructions
(2009)
The parallel-opposition construction has not yet been widely described as an independent construction type. This article reports on its realization in everyday British-English conversation. In particular, it focusses on prosodic projection in the lexically and syntactically unmarked first component of this syntactic pattern, and thus adds to the body of research investigating the organization of turn-taking in the context of bi-clausal constructions in which the first part lacks explicit lexical hints to their continuation. It is shown that the parallel-opposition construction, next to specific semantic–pragmatic, syntactic and lexical features, also exhibits a relatively fixed range of prosodic features in the first conjunct, among these narrow focus, continuing intonation and/or the avoidance of intonation-unit boundary signals. These are used to project continuation of an otherwise complete utterance and, thus, to secure the floor for the expression of contrast. In addition, the detailed analysis of apparently deviant cases, which takes into account the on-line production of syntax, shows that a lack of prosodically projective features in the first component of the parallel-opposition construction can be explained by the strategic, retrospective use of the construction to resolve problems in turn transition.
The term “pivot” usually refers to two overlapping syntactic units such that the completion of the first unit simultaneously launches the second. In addition, pivots are generally said to be characterized by the smooth prosodic integration of their syntactic parts. This prosodic integration is typically achieved by prosodic-phonetic matching of the pivot components. As research on such turns in a range of languages has illustrated, speakers routinely deploy pivots so as to be able to continue past a point of possible turn completion, in the service of implementing some additional or revised action. This article seeks to build on, and complement, earlier research by exploring two issues in more detail as follows: (1) what exactly do pivotal turn extensions accomplish on the action dimension, and (2) what role does prosodic-phonetic packaging play in this? We will show that pivot constructions not only exhibit various degrees of prosodic-phonetic (non-)integration, i.e., differently strong cesuras, but that they can be ordered on a continuum, and that this cline maps onto the relationship of the actions accomplished by the components of the pivot construction. While tighter prosodic-phonetic integration, i.e., weak(er) cesuring, co-occurs with post-pivot actions whose relationship to that of the pre-pivot tends to be rather retrospective in character, looser prosodic-phonetic integration, i.e., strong(er) cesuring, is associated with a more prospective orientation of the post-pivot’s action. These observations also raise more general questions with regard to the analysis of action.
In conversation, speakers need to plan and comprehend language in parallel in order to meet the tight timing constraints of turn taking. Given that language comprehension and speech production planning both require cognitive resources and engage overlapping neural circuits, these two tasks may interfere with one another in dialogue situations. Interference effects have been reported on a number of linguistic processing levels, including lexicosemantics. This paper reports a study on semantic processing efficiency during language comprehension in overlap with speech planning, where participants responded verbally to questions containing semantic illusions. Participants rejected a smaller proportion of the illusions when planning their response in overlap with the illusory word than when planning their response after the end of the question. The obtained results indicate that speech planning interferes with language comprehension in dialogue situations, leading to reduced semantic processing of the incoming turn. Potential explanatory processing accounts are discussed.
When humans have a conversation with one another, they generally take turns speaking one after the other without overlapping each other's talk or leaving silence between turns for long stretches of time. Previous research has shown that conversation is a structured practice following rules that help interlocutors to manage the flow of conversation interactively. While at the beginning of a conversation it remains open who will speak when about what and for how long, interlocutors regulate the flow of conversation as it unfolds. One basic set of rules that interlocutors operate with governs the allocation of speaking turns, with the central rule stating that whoever starts speaking first at a point in time when speaker change becomes relevant has the rights and obligations to produce the next turn. The organization of turn allocation, therefore, is one reason for conversational turn taking to be so remarkably fast, with the beginnings of turns most often being quite accurately aligned with the ends of the previous turns. Observations of this outstanding speed of turn taking gave rise to a number of questions concerning language processing in conversational situations. The studies presented in this thesis investigate some of these questions from the perspective of the current listener preparing to be the next speaker who will respond to the current turn.
The study presented in Chapter 2 investigates when next speakers begin to plan their own turn with respect to two points in time, (i) the moment when the incoming turn’s message becomes clear enough to make response planning possible and (ii) the moment when the incoming turn terminates. Results of previous studies were inconclusive about the timing of language planning in conversation, with evidence in favour of both late and early response planning. Furthermore, previous studies presented both evidence as well as counter evidence indicating that response planning depends or does not depend on an accurate prediction of the timing of the incoming turn’s end. The study presented here makes use of a novel experimental paradigm which includes a dialogic task that participants need to fulfil in response to critical utterances by a confederate. These critical utterances were structured, on the one hand, so that their message became clear either only at the end of the turn or before the end of the turn, and, on the other hand, so that it was either predictable or not predictable when exactly the turn would end. Participants’ eye-movements as well as their response latencies indicated that they always planned their next turn as early as possible, irrespective of the predictability of the incoming turn’s end. The presented results provide evidence in favour of models of turn taking that predict speech planning to happen in overlap with the incoming turn.
Having established that next speakers begin to plan their turn in overlap, the study presented in Chapter 3 goes into more detail, investigating to what depth language planning progresses while the incoming turn is still unfolding. To this end, a number of psycholinguistic paradigms were combined. In the study’s main experiment, participants had to fulfil a switch-task in which they switched from picture naming in response to an auditorily presented question to making a lexical decision. By manipulating the relatedness of the word for lexical decision with the picture that was prepared to be named before the task-switch it was possible to draw inferences on which processing stages were entered during the speech production process in overlap with the incoming turn. Participants’ behavioural responses in the lexical decision task revealed that they entered the stage of phonological encoding while the incoming turn was still unfolding, showing that planning in overlap is not limited to conceptual preparation but includes all sub-processes of formulation.
Given that speech production regularly enters the stages of formulation in overlap with the incoming turn, as shown in Chapters 2 and 3, the question arises whether planning the next turn in overlap is cognitively more demanding than during the gap between turns. This question is approached in the study presented in Chapter 4 by measuring pupillometric responses of participants in a dialogic task. An increase in pupil diameter during a cognitive task is indicative of increased processing load, and pupillometric responses to planning in overlap with the incoming turn were found to be greater than responses to planning in the gap between turns. These results show that planning in overlap is more demanding than planning during the gap, even though it is highly practiced by speakers.
After Chapters 2 to 4 investigated the timing and mechanisms of speech planning in conversation, Chapter 5 turns towards the timing of articulation of a planned turn, asking what sources of information next speakers use to time the articulation of a planned utterance to start closely after the incoming turn comes to an end. In this chapter’s study, participants taking turns with a confederate responded to utterances containing or not containing different cues to the location of the incoming turn’s end. Participants made use of lexical and turn-final intonational cues, but not of turn-initial intonational cues, responding faster when the relevant cues were present than when they were not present. These results show that the timing of turn initiation in next speakers depends on the recognition of the incoming turn’s point of completion and not merely on the progress in planning the next turn.
All evidence presented in Chapters 2 to 5 is summed up and bundled together in a cognitive model of turn taking, which is presented in Chapter 6. This model assumes, centrally, that the planning of a turn and the timing of its articulation are separate cognitive processes that run in parallel in any next speaker during conversation. Planning generally starts as early as possible, often in overlap with the incoming turn, while the timing of articulation depends on the next speaker’s level of certainty that speaker change has become relevant at a particular moment, with a number of cues to the end of the incoming turn leading to an increase of certainty. Next turns are assumed to often be planned down to fully formulated utterance plans including their phonological form as early as possible on the basis of anticipations of the incoming turn’s message, which are created with the help of the general and situational knowledge about the world, the current speaker and her intentions, as well as the input that has been received so far. The level of certainty that speaker change becomes relevant rises or decreases as lexico-syntactic, prosodic, and pragmatic projections about the development of the current turn are fulfilled or not fulfilled. As the incoming turn progresses towards its end as was projected by the current listener, he becomes certain that speaker change becomes relevant and will initiate articulation of the prepared next turn. Viewing these two processes, planning a next turn and timing of its articulation, as separate makes it possible to explain the observable fast timing of turn taking while still modelling the allocation of turns as interactionally managed by interlocutors, a considerable advantage of the presented model compared to more traditional perspectives on turn taking and conversation.
We present a collection of (currently) about 5,500 commands directed to voice-controlled virtual assistants (VAs) by sixteen initial users of a VA system in their homes. The collection comprises recordings captured by the VA itself and with a conditional voice recorder (CVR) selectively capturing recordings including the VA-directed commands plus some surrounding context. Next to a description of the collection, we present initial findings on the patterns of use of the VA systems during the first weeks after installation, including usage timing, the development of usage frequency, distributions of sentence structures across commands, and (the development of) command success rates. We discuss the advantages and disadvantages of the applied collection-specific recording approach and describe potential research questions that can be investigated in the future, based on the collection, as well as the merit of combining quantitative corpus linguistic approaches with qualitative in-depth analyses of single cases.
To ensure short gaps between turns in conversation, next speakers regularly start planning their utterance in overlap with the incoming turn. Three experiments investigate which stages of utterance planning are executed in overlap. E1 establishes effects of associative and phonological relatedness of pictures and words in a switch-task from picture naming to lexical decision. E2 focuses on effects of phonological relatedness and investigates potential shifts in the time-course of production planning during background speech. E3 required participants to verbally answer questions as a base task. In critical trials, however, participants switched to visual lexical decision just after they began planning their answer. The task-switch was time-locked to participants' gaze for response planning. Results show that word form encoding is done as early as possible and not postponed until the end of the incoming turn. Hence, planning a response during the incoming turn is executed at least until word form activation.
In conversation, turn-taking is usually fluid, with next speakers taking their turn right after the end of the previous turn. Most, but not all, previous studies show that next speakers start to plan their turn early, if possible already during the incoming turn. The present study makes use of the list-completion paradigm (Barthel et al., 2016), analyzing speech onset latencies and eye-movements of participants in a task-oriented dialogue with a confederate. The measures are used to disentangle the contributions to the timing of turn-taking of early planning of content on the one hand and initiation of articulation as a reaction to the upcoming turn-end on the other hand. Participants named objects visible on their computer screen in response to utterances that did, or did not, contain lexical and prosodic cues to the end of the incoming turn. In the presence of an early lexical cue, participants showed earlier gaze shifts toward the target objects and responded faster than in its absence, whereas the presence of a late intonational cue only led to faster response times and did not affect the timing of participants' eye movements. The results show that with a combination of eye-movement and turn-transition time measures it is possible to tease apart the effects of early planning and response initiation on turn timing. They are consistent with models of turn-taking that assume that next speakers (a) start planning their response as soon as the incoming turn's message can be understood and (b) monitor the incoming turn for cues to turn-completion so as to initiate their response when turn-transition becomes relevant.
Speech planning is a sophisticated process. In dialog, it regularly starts in overlap with an incoming turn by a conversation partner. We show that planning spoken responses in overlap with incoming turns is associated with higher processing load than planning in silence. In a dialogic experiment, participants took turns with a confederate describing lists of objects. The confederate’s utterances (to which participants responded) were pre-recorded and varied in whether they ended in a verb or an object noun and whether this ending was predictable or not. We found that response planning in overlap with sentence-final verbs evokes larger task-evoked pupillary responses, while end predictability had no effect. This finding indicates that planning in overlap leads to higher processing load for next speakers in dialog and that next speakers do not proactively modulate the time course of their response planning based on their predictions of turn endings. The turn-taking system exerts pressure on the language processing system by pushing speakers to plan in overlap despite the ensuing increase in processing load.
In conversation, interlocutors rarely leave long gaps between turns, suggesting that next speakers begin to plan their turns while listening to the previous speaker. The present experiment used analyses of speech onset latencies and eye-movements in a task-oriented dialogue paradigm to investigate when speakers start planning their responses. German speakers heard a confederate describe sets of objects in utterances that either ended in a noun [e.g., Ich habe eine Tür und ein Fahrrad (“I have a door and a bicycle”)] or a verb form [e.g., Ich habe eine Tür und ein Fahrrad besorgt (“I have gotten a door and a bicycle”)], while the presence or absence of the final verb either was or was not predictable from the preceding sentence structure. In response, participants had to name any unnamed objects they could see in their own displays with utterances such as Ich habe ein Ei (“I have an egg”). The results show that speakers begin to plan their turns as soon as sufficient information is available to do so, irrespective of further incoming words.
Comprehending conditional statements is fundamental for hypothetical reasoning about situations. However, the online comprehension of conditional statements containing different conditional connectives is still debated. We report two self-paced reading experiments on German conditionals presenting the conditional connectives wenn (‘if’) and nur wenn (‘only if’) in identical discourse contexts. In Experiment 1, participants read a conditional sentence followed by the confirmed antecedent p and the confirmed or negated consequent q. The final, critical sentence was presented word by word and contained a positive or negative quantifier (ein/kein ‘one/no’). Reading times of the two quantifiers did not differ between the two conditional connectives. In Experiment 2, presenting a negated antecedent, reading times for the critical positive quantifier (ein) did not differ between conditional connectives, while reading times for the negative quantifier (kein) were shorter for nur wenn than for wenn. The results show that comprehenders form distinct predictions about discourse continuations due to differences in the lexical semantics of the tested conditional connectives, shedding light on the role of conditional connectives in the online interpretation of conditionals in general.
Having found their way onto the computer screens, comics soon branched into webcomics. These kept a lot of the characteristics of print comic books, but gradually adapted new unexplored modes of representation. Three relatively new ‘enhancements’ to the medium of comics are presented in this article: webcomics enhanced through the use of the infinite canvas, as proposed by Scott McCloud, those enhanced with videos and/or sound, and lastly those enhanced with interactive and ludic elements. All of the mentioned push the medium of comics into new waters, and by doing so they add new layers of meaning and modify their structure based on the make-up of the implemented features. Infinite canvas manages to lift some limitations of print comics without changing the overall feel too drastically, while animated and voiced webcomics, as well as interactive or game comics, have a much higher inclination to transgress into domains of other media and transform themselves in order to accommodate and integrate these novel foreign features.
In this paper we present the results of an automatic classification of Russian texts into three levels of difficulty. Our aim is to build a study corpus of Russian, in which an L2 student is able to select texts of a desired complexity. We are building on a pilot study, in which we classified Russian texts into two levels of difficulty. In the current paper, we apply the classification to an extended corpus of 577 labelled texts. The best-performing combination of features achieves an accuracy of 0.74 within at most one level difference.
In this paper, we present first results of training a classifier for discriminating Russian texts into different levels of difficulty. For the classification we considered both surface-oriented features adopted from readability assessments and more linguistically informed, positional features to classify texts into two levels of difficulty. This text classification is the main focus of our Levelled Study Corpus of Russian (LeStCoR), in which we aim to build a corpus adapted for language learning purposes – selecting simpler texts for beginner second language learners and more complex texts for advanced learners. The most discriminative feature in our pilot study was a lexical feature that approximates accessibility of the vocabulary by the second language learner in terms of the proportion of familiar words in the texts. The best feature setting achieved an accuracy of 0.91 on a pilot corpus of 209 texts.
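The lexical feature mentioned above, the proportion of familiar words in a text, can be sketched as follows. The abstract does not say how the feature was computed, so the tokenisation, the lower-casing, the word list, and the function name `familiar_word_proportion` are assumptions made purely for illustration.

```python
def familiar_word_proportion(tokens, familiar_vocabulary):
    """Share of tokens that appear in a learner's familiar vocabulary.

    Both arguments are illustrative: `tokens` is a pre-tokenised text and
    `familiar_vocabulary` a set of lower-cased word forms assumed known.
    """
    if not tokens:
        return 0.0
    known = sum(1 for token in tokens if token.lower() in familiar_vocabulary)
    return known / len(tokens)


# Hypothetical beginner vocabulary and a four-token sentence.
vocabulary = {"я", "читаю", "книгу"}
tokens = ["Я", "читаю", "интересную", "книгу"]
proportion = familiar_word_proportion(tokens, vocabulary)  # 3 of 4 tokens are known
```

A feature of this kind could be computed over lemmas instead of word forms, which may matter for a richly inflected language such as Russian; the sketch uses surface forms only to stay short.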
We present a method for detecting and reconstructing separated particle verbs in a corpus of spoken German by following an approach suggested for written language. Our study shows that the method can be applied successfully to spoken language, compares different ways of dealing with structures that are specific to spoken language corpora, analyses some remaining problems, and discusses ways of optimising precision or recall for the method. The outlook sketches some possibilities for further work in related areas.
In this paper, we present an overview of freely available web applications providing online access to spoken language corpora. We explore and discuss various solutions with which the corpus providers and corpus platform developers address the needs of researchers who are working with spoken language. The paper aims to contribute to the long-overdue exchange and discussion of methods and best practices in the design of online access to spoken language corpora.
The goal of the MULI (MUltiLingual Information structure) project is to empirically analyse information structure in German and English newspaper texts. In contrast to other projects in which information structure is annotated and investigated (e.g. in the Prague Dependency Treebank, which mirrors the basic information about the topic-focus articulation of the sentence), we do not annotate theory-biased categories like topic-focus or theme-rheme. Trying to be as theory-independent as possible, we annotate those features which are relevant to information structure and on the basis of which typical patterns, co-occurrences or correlations can be determined. We distinguish between three annotation levels: syntax, discourse and prosody. The data is based on the TIGER Corpus for German and the Penn Treebank for English, since the existing information on part-of-speech and syntactic structure can be re-used for our purposes. The actual annotation of an English example sequence illustrates our choice of categories on each level. Their combination offers the possibility to investigate how information structure is realised and can be interpreted.
We present the annotation of information structure in the MULI project. To learn more about the information structuring means in prosody, syntax and discourse, theory-independent features were defined for each level. We describe the features and illustrate them on an example sentence. To investigate the interplay of features, the representation has to allow for inspecting all three layers at the same time. This is realised by a stand-off XML mark-up with the word as the basic unit. The theory-neutral XML stand-off annotation allows integrating this resource with other linguistic resources such as the TIGER Treebank for German or the Penn Treebank for English.
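A stand-off annotation with the word as the basic unit can be sketched as below. The element and attribute names (`w`, `phrase`, `accent`, `from`, `to`, `ref`) are invented for this example and are not the MULI schema, and only two annotation layers are shown for brevity. The point is that each layer lives in its own document and refers to word identifiers, so layers can be inspected side by side.

```python
import xml.etree.ElementTree as ET

# Base layer: the words, each with an identifier (hypothetical markup).
WORDS = """<words>
  <w id="w1">The</w>
  <w id="w2">door</w>
  <w id="w3">opened</w>
</words>"""

# Separate layers that point at word ids instead of wrapping the text.
SYNTAX = """<syntax>
  <phrase cat="NP" from="w1" to="w2"/>
</syntax>"""

PROSODY = """<prosody>
  <accent type="H*" ref="w2"/>
</prosody>"""


def combine_layers(words_xml, syntax_xml, prosody_xml):
    """Build a per-word view that collects annotations from every layer."""
    words = ET.fromstring(words_xml).findall("w")
    ids = [w.get("id") for w in words]
    view = {w.get("id"): {"form": w.text, "phrases": [], "accents": []}
            for w in words}
    for phrase in ET.fromstring(syntax_xml).findall("phrase"):
        start = ids.index(phrase.get("from"))
        end = ids.index(phrase.get("to"))
        # A phrase covers every word from its first to its last id.
        for word_id in ids[start:end + 1]:
            view[word_id]["phrases"].append(phrase.get("cat"))
    for accent in ET.fromstring(prosody_xml).findall("accent"):
        view[accent.get("ref")]["accents"].append(accent.get("type"))
    return view


view = combine_layers(WORDS, SYNTAX, PROSODY)
```

Because no layer embeds another, a further layer (for example discourse) could be added as one more document without touching the existing ones.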
We present an approach to investigating what kind of semantic information is regularly associated with the structural markup of scientific articles. This approach addresses the need for an explicit formal description of the semantics of text-oriented XML documents. The domain of our investigation is a corpus of scientific articles in psychology and linguistics from English and German journals available online. For our analyses, we provide XML markup representing two kinds of semantic levels: the thematic level (i.e. topics in the text world that the article is about) and the functional or rhetorical level. Our hypothesis is that these semantic levels correlate with the articles’ document structure, which is also represented in XML. Articles have been annotated with the appropriate information. Each of the three informational levels is modelled in a separate XML document, since in our domain, the different description levels might conflict so that it is impossible to model them within a single XML document. For comparing and mining the resulting multi-layered XML annotations of one article, a Prolog-based approach is used. It focusses on the comparison of XML markup that is distributed among different documents. Prolog predicates have been defined for inferring relations between levels of information that are modelled in separate XML documents. We demonstrate how the Prolog tool is applied in our corpus analyses.
The KorAP project (“Korpusanalyseplattform der nächsten Generation”, “Corpus-analysis platform of the next generation”), carried out at the Institut für Deutsche Sprache (IDS) in Mannheim, Germany, has as its goal the development of a modern, state-of-the-art corpus-analysis platform, capable of handling very large corpora and opening the perspectives for innovative linguistic research. The platform will facilitate new linguistic findings by making it possible to manage and analyse extremely large amounts of primary data and annotations, while at the same time allowing an undistorted view of the primary un-annotated text, and thus fully satisfying expectations associated with a scientific tool. The project started in July 2011 and is funded till June 2014. The demo presentation in December will be the first version following a preliminary feature freeze, and will open the alpha testing phase of the project.
The paper reviews the results of work done in the context of TEI-Lex0, a joint ENeL / DARIAH / PARTHENOS initiative aimed at formulating guidelines for the encoding of retrodigitized dictionaries by streamlining and simplifying the recommendations of the “Print Dictionaries” chapter of the TEI Guidelines. TEI-Lex0 work is performed by teams concentrating on each of the main components of dictionary entries. The work presented here concerns proposals for constraining TEI-based encoding of orthographic, phonetic, and grammatical information on written and spoken forms of the lemma (headword), including auxiliary inflected forms. We also adduce examples of handling various types of orthographic and phonetic variants, as well as examples of handling the representation of inflectional paradigms, which have received less attention in the TEI Guidelines but which are nonetheless essential for properly exposing data content to the various uses that digitized lexica may have.
It is well known that the distribution of lexical and grammatical patterns is size- and register-sensitive (Biber 1986, and later publications). This fact alone presents a challenge to many corpus-oriented linguistic studies focusing on a single language. When it comes to cross-linguistic studies using corpora, the challenge becomes even greater due to the lack of high-quality multilingual corpora (Kupietz et al. 2020; Kupietz/Trawiński 2022) that are comparable with respect to size and register. That was the motivation for the creation of the European Reference Corpus EuReCo, an initiative started in 2013 at the Leibniz Institute for the German Language (IDS) together with several European partners (Kupietz et al. 2020). EuReCo is an emerging federated corpus, with large virtual comparable corpora across various languages and with an infrastructure supporting contrastive research. The core of the infrastructure is KorAP (Diewald et al. 2016), a scalable open-source platform supporting the analysis and visualisation of properties of texts annotated by multiple and potentially conflicting information layers, and supporting several corpus query languages. Until recently, EuReCo consisted of three monolingual subparts: the German Reference Corpus DeReKo (Kupietz et al. 2018), the Reference Corpus of Contemporary Romanian Language (Barbu Mititelu/Tufiş/Irimia 2018), and the Hungarian National Corpus (Váradi 2002). The goal of the present submission is twofold. On the one hand, it reports on the new component of EuReCo: a sample of the National Corpus of Polish (Przepiórkowski et al. 2010). On the other hand, it presents the results of a new pilot study using the newly extended EuReCo. This pilot study investigates selected Polish collocations involving light verbs and their prepositional / nominal complements (Fig. 1) and extends the collocation analyses of German, Romanian and Hungarian (Fig. 2) discussed in Kupietz/Trawiński (2022).
The present article describes the first stage of the KorAP project, launched recently at the Institut für Deutsche Sprache (IDS) in Mannheim, Germany. The aim of this project is to develop an innovative corpus analysis platform to tackle the increasing demands of modern linguistic research. The platform will facilitate new linguistic findings by making it possible to manage and analyse primary data and annotations in the petabyte range, while at the same time allowing an undistorted view of the primary linguistic data, and thus fully satisfying the demands of a scientific tool. An additional important aim of the project is to make corpus data as openly accessible as possible in light of unavoidable legal restrictions, for instance through support for distributed virtual corpora, user-defined annotations and adaptable user interfaces, as well as interfaces and sandboxes for user-supplied analysis applications. We discuss our motivation for undertaking this endeavour and the challenges that face it. Next, we outline our software implementation plan and describe development to date.
The present paper describes Corpus Query Lingua Franca (ISO CQLF), a specification designed at ISO Technical Committee 37 Subcommittee 4 “Language resource management” for the purpose of facilitating the comparison of properties of corpus query languages. We give an overview of the motivation for this endeavour and present its aims and its general architecture. CQLF is intended as a multi-part specification; here, we concentrate on the basic metamodel, which provides a frame into which the other parts fit.
In mid-2017, as part of our activities within the TEI Special Interest Group for Linguists (LingSIG), we submitted to the TEI Technical Council a proposal for a new attribute class that would gather attributes facilitating simple token-level linguistic annotation. With this proposal, we addressed community feedback complaining about the lack of a specific tagset for lightweight linguistic annotation within the TEI. Apart from @lemma and @lemmaRef, up till now TEI encoders could only resort to using the generic attribute @ana for inline linguistic annotation, or to the quite complex system of feature structures for robust linguistic annotation, the latter requiring relatively complex processing even for the most basic types of linguistic features. As a result, there now exists a small set of basic descriptive devices which have been made available at the cost of only very small changes to the TEI tagset. The merit of a predefined TEI tagset for lightweight linguistic annotation is the homogeneity of tagging and thus better interoperability of simple linguistic resources encoded in the TEI. The present paper introduces the new attributes, makes a case for one more addition, and presents the advantages of the new system over the legacy TEI solutions.
Standards in CLARIN
(2022)
This chapter looks at a fragment of the ongoing work of the CLARIN Standards Committee (CSC) on producing a shared set of recommendations on standards, formats, and related best practices supported by the CLARIN infrastructure and its participating centres. What might at first glance seem to be a straightforward goal has over the years proven to be rather complex, reflecting the robustness and heterogeneity of the emerging distributed digital research infrastructure and the various disciplines and research traditions of the language-based humanities that it serves and represents, and therefore part of the chapter reviews the various initiatives and proposals that strove to produce helpful standards-related guidance. The focus turns next to a subtask initiated in late 2019, its scope narrowed to one of the core activities and responsibilities of CLARIN backbone centres, namely the provision of data deposition services. Centres are obligated to publish their recommendations concerning the repertoire of data formats that are best suited for their research profiles. We look at how this requirement has been met by the particular centres and suggest that having centres maintain their information in the Standards Information System (SIS) is the way to improve on the current state of affairs.
CoMParS is a resource under construction in the context of the long-term project German Grammar in European Comparison (GDE) at the IDS Mannheim. The principal goal of GDE is to create a novel contrastive grammar of German against the background of other European languages. Alongside German, which is the central focus, the core languages for comparison are English, French, Hungarian and Polish, representing different typological classes. Unlike traditional contrastive grammars available for German, which usually cover language pairs and are based on formal grammatical categories, the new GDE grammar is developed in the spirit of functionalist typology. This implies that, instead of formal criteria, cognitively motivated functional domains in the sense of Givón (1984) are used as tertia comparationis. The purpose of CoMParS is to document the empirical basis of the theoretical assumptions of GDE-V and to illustrate the otherwise rather abstract content of grammar books with as many naturally occurring and adequately presented multilingual examples as possible, including information on their use in specific contexts and registers. These examples come from existing parallel corpora, and our presentation will focus on the legal aspects and consequences of this choice of language data.
The present contribution addresses an infrastructural issue of universal relevance, addressed in the specific context of the TEI. We describe a combination of open-source tools and an open-access approach to creating knowledge repositories that have been employed in building a bibliographic reference library for the “TEI for Linguists” special interest group (LingSIG). The authors argue that, for an initiative such as the TEI, it is important to choose open, freely available solutions. If these solutions have the advantage of attracting new users and promoting the initiative itself, so much the better, especially if it is done in a non-committal way: no one using the LingSIG bibliographic repository has to be a member of the LingSIG or a “TEI-er” in general.
Recent typological studies have shown that socio-linguistic factors have a substantial effect on at least certain structures of language. However, we are still far from understanding how such factors should be operationalized and how they interact with other factors in shaping grammar. To address both questions, this study examines the influence of socio-linguistic factors on the number of dedicated conditional constructions in a sample of 374 languages. We test the number of speakers, the degree of multilingualism, the availability of a literature tradition, the use of writing, and the use of the language in the education system. At the same time, we control for genealogical, contact, and bibliographical biases. Our results suggest that the number of speakers is the most informative predictor. However, we find that the association between the number of speakers and the number of dedicated conditional constructions is much weaker than assumed, once genealogical and contact biases are controlled for.
The paper presents best practices and results from projects in four countries dedicated to the creation of corpora of computer-mediated communication and social media interactions (CMC). Even though there are still many open issues related to building and annotating corpora of that type, there already exists a range of accessible solutions which have been tested in projects and which may serve as a starting point for a more precise discussion of how future standards for CMC corpora may (and should) be shaped.
The paper presents best practices and results from projects dedicated to the creation of corpora of computer-mediated communication and social media interactions (CMC) from four different countries. Even though there are still many open issues related to building and annotating corpora of this type, there already exists a range of tested solutions which may serve as a starting point for a comprehensive discussion of how future standards for CMC corpora could (and should) be shaped.