Refine
Year of publication
- 2012 (272) (remove)
Document Type
- Part of a Book (120)
- Article (82)
- Conference Proceeding (35)
- Book (19)
- Part of Periodical (11)
- Doctoral Thesis (2)
- Other (2)
- Review (1)
Keywords
- Deutsch (118)
- Korpus <Linguistik> (28)
- Konversationsanalyse (19)
- Computerlinguistik (16)
- Englisch (11)
- Sprachgebrauch (11)
- Interaktion (10)
- Kontrastive Grammatik (10)
- Deutschland (9)
- Diskursanalyse (9)
- Sprachkontakt (9)
- Sprachwandel (9)
- Grammatik (8)
- Kommunikation (8)
- Standardisierung (8)
- Gesprochene Sprache (7)
- Kolloquium (7)
- Metadaten (7)
- Multimodalität (7)
- Polnisch (7)
- Semantik (7)
- Sprache (7)
- Sprachpolitik (7)
- Syntax (7)
- Verb (7)
- Annotation (6)
- Genitiv (6)
- Interaktionsanalyse (6)
- Lexikographie (6)
- Neuerscheinungen (6)
- Newsletter (6)
- Projekte (6)
- Wörterbuch (6)
- Arzt (5)
- Datenmanagement (5)
- Dialektologie (5)
- Eheschließung (5)
- Europa (5)
- Information Extraction (5)
- Infrastruktur (5)
- Kontrastive Linguistik (5)
- Korpuslinguistik (5)
- Linguistik (5)
- Mehrsprachigkeit (5)
- Minderheitensprache (5)
- Natürliche Sprache (5)
- Soziolinguistik (5)
- Akzent (4)
- Aussiedler (4)
- Biografisches Interview (4)
- Digitalisierung (4)
- Forschung (4)
- Französisch (4)
- Gesprächsanalyse (4)
- Institut für Deutsche Sprache <Mannheim> (4)
- Internet (4)
- Kasus (4)
- Kolonialismus (4)
- Kongressbericht (4)
- Kritische Diskursanalyse (4)
- Lebensmittel (4)
- Mannheim <2012> (4)
- Migration (4)
- Patient (4)
- Rezension (4)
- Wortschatz (4)
- conversation analysis (4)
- Adjektiv (3)
- Altenbild (3)
- Bedeutung (3)
- CLARIN (3)
- Einwanderer (3)
- Ethnolinguistik (3)
- Finite Verbform (3)
- Forschungsdaten (3)
- Fremdsprache (3)
- Hörgerät (3)
- Hörschädigung (3)
- Italienisch (3)
- Kommunikationsverhalten (3)
- Linguistic Landscape (3)
- Medien (3)
- Morphologie (3)
- Mundart (3)
- Mundart Russlanddeutsch (3)
- Negation (3)
- Nominalphrase (3)
- Phraseologismus (3)
- Russlanddeutsche (3)
- Sprachgeschichte (3)
- Sprachkritik (3)
- Sprachvariante (3)
- Standardsprache (3)
- Textgrammatik (3)
- Türkin (3)
- Ungarisch (3)
- Verumfokus (3)
- Wortbildung (3)
- elexiko (3)
- hearing aid use (3)
- hearing impairment (3)
- interaction (3)
- kontrastive Linguistik (3)
- language policy (3)
- Alter (2)
- Archivierung (2)
- Attribut (2)
- Aufsatzsammlung (2)
- Automatische Sprachanalyse (2)
- Biografieforschung (2)
- Bruno Strecker (2)
- Component Metadata Infrastructure (CMDI) (2)
- Computerunterstützte Kommunikation (2)
- Computerunterstützte Lexikographie (2)
- Conversational alignment (2)
- Dativ (2)
- Deklination (2)
- Deutschunterricht (2)
- Diskursethik (2)
- Einstellung (2)
- Erzählforschung (2)
- Forschungsprojekt (2)
- Frame-Semantik (2)
- Fremdsprachenlernen (2)
- Fremdsprachenunterricht (2)
- Fremdwort (2)
- Gefühl (2)
- German (2)
- Gesprächsforschung (2)
- Handlungsstruktur <Literatur> (2)
- Informationsstruktur (2)
- Internetwörterbuch (2)
- Kommunikationsforschung (2)
- Konjunktion (2)
- Konrad-Duden-Preis 2012 (2)
- Kulturwandel (2)
- Language attitude (2)
- Lehrer (2)
- Lemma (2)
- Massenmedien (2)
- Migrationslinguistik (2)
- Neologismus (2)
- Niederländisch (2)
- Online-Wörterbuch (2)
- Ontologie <Wissensverarbeitung> (2)
- Pazifischer Ozean <Süd> (2)
- Phraseologie (2)
- Pitch contour (2)
- Pitch matching (2)
- Politik (2)
- Politische Kommunikation (2)
- Politische Willensbildung (2)
- Pragmatik (2)
- Prosodic similarity (2)
- Prosodie (2)
- Präposition (2)
- Präsentation (2)
- Psycholinguistik (2)
- Recherche (2)
- Russisch (2)
- Schriftsprache (2)
- Schwerhörigkeit (2)
- Sentimentanalyse (2)
- Sozialpsychologie (2)
- Sprachentwicklung (2)
- Spracherhaltung (2)
- Sprachinsel (2)
- Sprachkompetenz (2)
- Sprachkonflikt (2)
- Sprachtypologie (2)
- Sprichwort (2)
- Sprichwortforschung (2)
- Stereotyp (2)
- Stichwörter (2)
- Synonym (2)
- TEI (2)
- Tonhöhe (2)
- Tweed (2)
- Unterricht (2)
- Unterrichtskommunikation (2)
- Virtuelle Kommunikation (2)
- Wandel (2)
- Wissenschaftskommunikation (2)
- Wortstellung (2)
- XML (2)
- computational models of narrative (2)
- computerunterstützte Lexikographie (2)
- language contact (2)
- minority language (2)
- multimodality (2)
- perception experiment (2)
- read speech (2)
- sentiment analysis (2)
- Übersetzung (2)
- AToL-Skala (1)
- Achtundsechziger (1)
- Adjunkt <Linguistik> (1)
- Adverb (1)
- Adverbiale (1)
- Affigierung (1)
- Akkusativ (1)
- Akzentuierung (1)
- Alemannisch (1)
- Amalgamierung <Linguistik> (1)
- Amazonia (1)
- Amtssprache (1)
- Anglizismus (1)
- Apostroph (1)
- Arabic (1)
- Arabismus <Linguistik> (1)
- Arbeitskreis Linguistische Pragmatik (1)
- Arbeitsplatz (1)
- Arbeitstagung (1)
- Arbeitstagung IDS (1)
- Argument <Linguistik> (1)
- Artikel (1)
- Arzt-Patient-Interaktion (1)
- Aufkleber (1)
- Augenfolgebewegung (1)
- Aussprache (1)
- Auszeichnungssprache (1)
- Automatische Textanalyse (1)
- Baden-Württemberg (1)
- Baltikum (1)
- Bantusprachen (1)
- Bartmiński, Jerzy (1)
- Beeinflussung (1)
- Belehrung (1)
- Benutzerforschung (1)
- Benutzerführung (1)
- Benutzung (1)
- Berufliche Qualifikation (1)
- Best-Practice (1)
- Bestimmter Artikel (1)
- Betriebliche Ausbildung (1)
- Bewegungsverb (1)
- Bibliographie (1)
- Bibliothek (1)
- Bilingualismus (1)
- Bioethik (1)
- Biographisches Interview (1)
- CLARIN-D (1)
- CMDI experiences (1)
- CMDI infrastructure use (1)
- CMDI profile creation (1)
- Cleft-Erweiterung (1)
- Comitative Construction (1)
- Comitative Preposition (1)
- Compositional Semantics (1)
- Computer-Assisted Language Learning (CALL) (1)
- Computerunterstütztes Lernen (1)
- Computervermittelte Kommunikation (1)
- Concurrency (1)
- Cyber-Mobbing (1)
- Database Management Systems (1)
- Datenbank (1)
- Datenverarbeitung (1)
- Demonstrativpronomen (1)
- Denglisch (1)
- Dereko (1)
- Deusch (1)
- Deutsch als Fremdsprache (1)
- Deutsch-Neuguinea (1)
- Deutsche Gebärdensprache (DGS) (1)
- Deutsches Referenzkorpus (DeReKo) (1)
- Deutsches Referenzkorpus zur internetbasierten Kommunikation (DeRiK) (1)
- Deutschland <Bundesrepublik> (1)
- Dialekt (1)
- Digitale Daten (1)
- Digitale Sprachressourcen (1)
- Digitalization (1)
- Diphthong (1)
- Direktiv <Akkusativ> (1)
- Diskurs (1)
- Diskursforschung (1)
- Diskursivität (1)
- Dokumentation (1)
- Domain-specific Relation Extraction (1)
- Editor (1)
- Elektronisches Buch (1)
- Elektronisches Wörterbuch (1)
- Emblem (1)
- Empirische Linguistik (1)
- Entscheidungsbaum (1)
- Estnisch (1)
- Ethik (1)
- Ethnografie (1)
- Ethnologie (1)
- Experiment (1)
- Face (1)
- Facebook (1)
- Fachsprache (1)
- Fernsehunterhaltung (1)
- Fiktion (1)
- Flexion (1)
- Flexionsendung (1)
- Flirt (1)
- Flurname (1)
- Fokus (1)
- Food Domain (1)
- Food item (1)
- Formalisierung (1)
- Forschungsberichte (1)
- Forschungsmethode (1)
- Frame semantics (1)
- Frau (1)
- Frauenforschung (1)
- Fremddarstellung (1)
- Frequenz (1)
- Fugenelement (1)
- Funktionale Grammatik (1)
- Gebrauchsgrafik (1)
- Gehen als situierte Praktik (1)
- Geisteswissenschaften (1)
- Geminata (1)
- Generationenbeziehung (1)
- Genitivattribut (1)
- Germanistik (1)
- Germanistische Sprachwissenschaft (1)
- Geräuschverb (1)
- Geschichte 1918 (1)
- Geschichte 1945-1955 (1)
- Geschichte 1967-1968 (1)
- Geschichte <1800-1899> (1)
- Geschichtswissenschaft (1)
- Gesellschaft (1)
- Gesprächseröffnung (1)
- Gestik (1)
- Gestural matching (1)
- Gesture (1)
- Gleitlaut (1)
- Glottisverschlusslaut (1)
- Google (1)
- Gottesdienst (1)
- Grammatikregel (1)
- Gravity's Rainbow (1)
- Griechenland (1)
- Hausa (1)
- Head-driven phrase structure grammar (1)
- Hieroglyphe (1)
- Hilfeplan (1)
- Hörverlust (1)
- ISO/TC 37/SC 4 (1)
- ISOcat (1)
- ISOcat registry (1)
- Ideologie (1)
- Immigranten (1)
- Informationsgesellschaft (1)
- Informationsvermittlung (1)
- Institut für Corpuslinguistik und Texttechnologie (ICLTT) (1)
- Institut für Deutsche Sprache (1)
- Interaktionanalyse (1)
- Interaktionslinguistik (1)
- Interkulturelle Kommunikation (1)
- Internationalität (1)
- Interoperabilität (1)
- Interview (1)
- Intonation (1)
- Intonation <Linguistik> (1)
- Jahrestagung IDS (1)
- Jugendhilfe (1)
- Jugendlicher (1)
- Jugendsprache (1)
- Kindersprache (1)
- Kirchenraum (1)
- Kochbuch (1)
- Kognition (1)
- Kognitive Linguistik (1)
- Koloniallinguistik (1)
- Komitativ <Kasus> (1)
- Komitative Präposition (1)
- Komitativkonstruktion (1)
- Kommunalpolitik (1)
- Kommunikationsstrategie (1)
- Kommunikationstraining (1)
- Kompositionelle Semantik (1)
- Kongress (1)
- Konjugation (1)
- Konnektor (1)
- Konsekutivsatz (1)
- Konstruktion <Linguistik> (1)
- Konstruktionsgrammatik (1)
- Kontrastive Morphologie (1)
- Kontrastive Phonetik (1)
- Kontrastive Phonologie (1)
- Kontrastive Syntax (1)
- Koordination (1)
- Korpora (1)
- Korpusanalyseplattform (KorAP) (1)
- Kultur (1)
- Kulturkontakt (1)
- Kulturwissenschaft (1)
- Kulturwissenschaften (1)
- Kunst (1)
- Körpersprache (1)
- Künstliche Intelligenz (1)
- Langzeitarchivierung (1)
- Latgalian (1)
- Latvia (1)
- Lebensstil (1)
- Lehnwort (1)
- Lehrbuch (1)
- Lehrersprache (1)
- Lernprozess (1)
- Lettisch (1)
- Lexem (1)
- Lexik (1)
- Lexikografie (1)
- Lexikograph (1)
- Lingua Franca (1)
- Linguistic Retrieval (1)
- Linguistic annotation (1)
- Linguistic processing (1)
- Linke Peripherie (1)
- Literaturverwaltung (1)
- Logische Partikel (1)
- Lokativ (1)
- Long-Term Archiving (1)
- Längsschnittuntersuchung (1)
- META-SHARE (1)
- MLSA (1)
- Markiertheit (1)
- Maschinelles Lernen (1)
- Mean reciprocal rank (1)
- Mediatisierung (1)
- Medizin (1)
- Mehrworteinheit (1)
- Meinung (1)
- Metapher (1)
- Methodologie (1)
- Migrationshintergrund (1)
- Migrationsvarietät (1)
- Mimik (1)
- Modality (1)
- Modalverb (1)
- Modewort (1)
- Morphem (1)
- Morphologie <Linguistik> (1)
- Morphology of the Folktale (1)
- Multi-layer Annotation (1)
- Multimedia (1)
- Mundartforschung (1)
- Muttersprache (1)
- Möglichkeit (1)
- Mündliche Kommunikation (1)
- NaLiDa (1)
- Nationalitätenpolitik (1)
- Nationalsozialismus (1)
- Needs assessment (1)
- Neue Medien (1)
- Nichtverbale Kommunikation (1)
- Niederdeutsch (1)
- Niederrhein (1)
- Normung (1)
- Northern Sotho (1)
- Numerus (1)
- OWID (1)
- Online dictionary (1)
- Operator-Skopus-Struktur (1)
- Ortsname (1)
- Partizipation (1)
- Pedi-Sprache (1)
- Periphrase (1)
- Persistent identifier (1)
- Persuasion (1)
- Pfälzisch (1)
- Phonetik (1)
- Plural (1)
- Polish (1)
- Politische Sprache (1)
- Polizei (1)
- Portugiesisch (1)
- Possessivität (1)
- Pro-Form (1)
- Produktanalyse (1)
- Produktionsanalyse (1)
- Progressiv (1)
- Pronomen (1)
- Propositionale Einstellung (1)
- Propp system (1)
- Prosodic repetition (1)
- Proust, Marcel (1)
- Psychologie (1)
- Pynchon, Thomas (1)
- Quantitative Analyse (1)
- Quellenkunde (1)
- Rahmen (1)
- Rassismus (1)
- Raum (1)
- Raum als interaktive Ressource (1)
- Raumwahrnehmung (1)
- Reality-TV (1)
- Rechtschreibung (1)
- Redaktionssystem (1)
- Regiolekt (1)
- Regionalsprache (1)
- Rektion (1)
- Relation type (1)
- Relativpronom (1)
- Relativsatz (1)
- Repository <Informatik> (1)
- Reproduzierbarkeit (1)
- Rollenidentität (1)
- Russia (1)
- Russland (1)
- SALSA (1)
- SOA (1)
- Satz (1)
- Satzanfang (1)
- Satzeröffnung (1)
- Satzverbindung (1)
- Satzverknüpfung (1)
- Schreiben (1)
- Schriftlichkeit (1)
- Schuld (1)
- Schuldenkrise (1)
- Schwäbisch (1)
- Schüler (1)
- Selbstdarstellung (1)
- Semantic role labelling (1)
- Semi-automatic annotation (1)
- SentiFrameNet (1)
- Sequentialanalyse (1)
- Server (1)
- Serviceorientierte Architektur (1)
- Sibirien (1)
- Sichtbarkeit (1)
- Skateboarder (1)
- Skateboardkultur (1)
- Skatesticker (1)
- Slawistik (1)
- Soziale Identität (1)
- Sozialer Wandel (1)
- Sozialwissenschaften (1)
- Soziolekt (1)
- Sparkling wine (1)
- Spezialbibliothek (1)
- Spoken Language Data (1)
- Sprachanalse (1)
- Spracheinstellung (1)
- Spracheinstellungen (1)
- Sprachförderung (1)
- Sprachgeografie (1)
- Sprachkorpora (1)
- Sprachkrise (1)
- Sprachliche Minderheit (1)
- Sprachstatistik (1)
- Sprachstudie (1)
- Sprachunterricht (1)
- Sprachvarianten (1)
- Sprachvariation (1)
- Sprachverbreitung (1)
- Sprachvergleich (1)
- Sprachverstehen (1)
- Sprachwechsel (1)
- Sprecher (1)
- Sprecherwechsel (1)
- Sprechhandlung (1)
- Standard (1)
- Studentenbewegung (1)
- Subkultur (1)
- Substantiv (1)
- Suffix (1)
- Tagebuch (1)
- Teilhabe (1)
- Temporal Reference (1)
- Tempus (1)
- Tenseless Languages (1)
- Terminologie (1)
- Terminologielehre (1)
- Text Encoding Initiative (1)
- Text Mining (1)
- Textkorpus (1)
- Textlinguistik (1)
- Textproduktion (1)
- Textsorte (1)
- Textualität (1)
- Texzgrammatik (1)
- Thematische Relation (1)
- Topik-Drop (1)
- Topikalisierung (1)
- Totalitarismus (1)
- Tourismus (1)
- Transkription (1)
- Tschechisch (1)
- Turing Galaxy (1)
- Türkisch (1)
- Türkisch für Anfänger <Fernsehsendung> (1)
- Türkische Einwanderin (1)
- Ukraine (1)
- Unbestimmter Artikel (1)
- Union of Soviet Socialist Republics (USSR) (1)
- Unternehmen (1)
- Unterrichtssituation (1)
- Unterstützung (1)
- Valenz (1)
- Valenz <Verb> (1)
- Variationslinguistik (1)
- Vergleichbarkeit (1)
- Verlaufsform (1)
- Verlinkung (1)
- Vernetzungsmanager (1)
- Verstehen (1)
- Verwandtschaftsbezeichnung (1)
- Very Large Corpora (1)
- Virtuelle Realität (1)
- Visualisierung (1)
- Vortragstechnik (1)
- WSD (1)
- Wahlkampf (1)
- Web Services (1)
- WebLicht (1)
- Website (1)
- Werbegrafik (1)
- Werbekultur (1)
- Werbung (1)
- Widerstand (1)
- Wissenschaft (1)
- Wissenschaftlicher Text (1)
- Wissenschaftssprache (1)
- Wissenspräsentation (1)
- Wissensvermittlung (1)
- Wortverbindung (1)
- Wörter des Jahres (1)
- Wörterbuchkorpus (1)
- XForms (1)
- XQuery Full Text (1)
- Zeichensetzung (1)
- Zeichensprache (1)
- Zeitungssprache (1)
- Zeugenaussagen (1)
- Zivilgesellschaft (1)
- Zulu (1)
- Zulu-Sprache (1)
- acoustic correlates (1)
- annotation (1)
- archiving support (1)
- archiving workflow (1)
- artefacts (1)
- automation (1)
- bestenfalls (1)
- church service (1)
- combination of methods (1)
- common language (1)
- communication in sciences (1)
- concept scheme (1)
- concept system (1)
- conceptual domain (1)
- context (1)
- coordination (1)
- copulatives (1)
- corpus analysis (1)
- corpus construction (1)
- corpus linguistics (1)
- couple interaction (1)
- cultural effects (1)
- cultural revolution (1)
- cultural skills (1)
- data category (1)
- database (1)
- decision tree structure (1)
- deutsch (1)
- dictionary design (1)
- dictionary software (1)
- dictionary writing system (1)
- discourse (1)
- electronic dictionaries (1)
- elektroniese woordeboeke (1)
- elektronische Lexikografie (1)
- ethnography (1)
- europäische Sprachen (1)
- fixation duration (1)
- gebruikersleiding (1)
- gender studies (1)
- grammar-pragmatics-correlations (1)
- grammis (1)
- griechisch (1)
- household work (1)
- information presentation devices (1)
- infrastructure (1)
- inligtingsaanbiedingsinstrumente (1)
- institutionelle Kommunikation (1)
- inter-rater variability (1)
- interpret (1)
- intra-rater variability (1)
- keuse-boomstruktuur (1)
- kinship terminology (1)
- kognitive Linguistik (1)
- kontrastive Morphologie (1)
- kopulatiewe (1)
- kosten (1)
- landscape (1)
- landscapes (1)
- language activism (1)
- language functions (1)
- language ideology (1)
- language variation (1)
- lecker (1)
- lexicographic working environment (1)
- lexikography (1)
- markup language (1)
- material culture (1)
- mediterranean (1)
- metadata (1)
- metadata editor (1)
- methods (1)
- modality (1)
- morpho-syntactic database (1)
- multilingual matter (1)
- multilingualism (1)
- multimodal interaction analysis (1)
- natural language processing (1)
- neologism (1)
- networking (1)
- normalisation (1)
- normalization (1)
- noun phrase (1)
- onomastics (1)
- part-of-speech ontology (1)
- phraseology (1)
- politics (1)
- politische Frauengruppen (1)
- politische Kommunikation (1)
- polizeiliche Vernehmung (1)
- primary research data repository (1)
- prominence (1)
- prosody (1)
- public space (1)
- qualitative Medienforschung (1)
- rapid serial visual presentation (1)
- reading time (1)
- reductive grammar (1)
- relation registry (1)
- research infrastructure (1)
- semiotic mediation (1)
- sequential analysis (1)
- software tools (1)
- sozialer Sprachstil (1)
- space as interactive resource (1)
- special language (1)
- standardization (1)
- subjectivity (1)
- survey (1)
- syllable duration (1)
- syllable prominence (1)
- tagging (1)
- teksproduksie (1)
- teksresepsie (1)
- text production (1)
- text reception (1)
- time reckoning (1)
- transcription (1)
- tun (1)
- tun-Periphrase (1)
- turn (1)
- ung-Nominalisierung (1)
- user guidance (1)
- verwantskapsterminologie (1)
- walking as situated prac-tice (1)
- web-based information system (1)
- woordeboekontwerp (1)
- word order (1)
- Ägypten (1)
- Ägyptisch (1)
- Älterer Mensch (1)
- Öffentlicher Raum (1)
Publicationstate
- Veröffentlichungsversion (102)
- Zweitveröffentlichung (23)
- Postprint (15)
Reviewstate
Publisher
- de Gruyter (37)
- Institut für Deutsche Sprache (31)
- Narr (17)
- European Language Resources Association (8)
- Lang (8)
- De Gruyter (7)
- European Language Resources Association (ELRA) (5)
- Verl. für Gesprächsforschung (5)
- Akademie Verlag (4)
- Springer (4)
The changes caused by the growing automatisation of processes in the lexicographer´s workstation and in lexicographic work, together with the ensuing needs of lexicographers and their demands for adequately targeted software, have not been discussed sufficiently in meta-lexicographic research. The aim of this paper is therefore to fill this gap, with a focus on academic non-commercial lexicography. After an introduction into the general functionalities of specific dictionary writing software, with the help of a real-life example we will discuss the lexicographic working environment, the new specific demands to lexicographic software as well as different tools. The final aim is to propose some recommendations for how to structure the lexicographic working environment to meet specific project requirements.
A frequently replicated finding is that higher frequency words tend to be shorter and contain more strongly reduced vowels. However, little is known about potential differences in the articulatory gestures for high vs. low frequency words. The present study made use of electromagnetic articulography to investigate the production of two German vowels, [i] and [a], embedded in high and low frequency words. We found that word frequency differently affected the production of [i] and [a] at the temporal as well as the gestural level. Higher frequency of use predicted greater acoustic durations for long vowels; reduced durations for short vowels; articulatory trajectories with greater tongue height for [i] and more pronounced downward articulatory trajectories for [a]. These results show that the phonological contrast between short and long vowels is learned better with experience, and challenge both the Smooth Signal Redundancy Hypothesis and current theories of German phonology.
The present article describes the first stage of the KorAP project, launched recently at the Institut für Deutsche Sprache (IDS) in Mannheim, Germany. The aim of this project is to develop an innovative corpus analysis platform to tackle the increasing demands of modern linguistic research. The platform will facilitate new linguistic findings by making it possible to manage and analyse primary data and annotations in the petabyte range, while at the same time allowing an undistorted view of the primary linguistic data, and thus fully satisfying the demands of a scientific tool. An additional important aim of the project is to make corpus data as openly accessible as possible in light of unavoidable legal restrictions, for instance through support for distributed virtual corpora, user-defined annotations and adaptable user interfaces, as well as interfaces and sandboxes for user-supplied analysis applications. We discuss our motivation for undertaking this endeavour and the challenges that face it. Next, we outline our software implementation plan and describe development to-date.
The present contribution addresses an infrastructural issue of universal relevance, addressed in the specific context of the TEI. We describe a combination of open-source tools and an open-access approach to creating knowledge repositories that have been employed in building a bibliographic reference library for the “TEI for Linguists” special interest group (LingSIG). The authors argue that, for an initiative such as the TEI, it is important to choose open, freely available solutions. If these solutions have the advantage of attracting new users and promoting the initiative itself, so much the better, especially if it is done in a non-committal way: no one using the LingSIG bibliographic repository has to be a member of the LingSIG or a “TEI-er” in general.
The paper presents an XML schema for the representation of genres of computer-mediated communication (CMC) that is compliant with the encoding framework defined by the TEI. It was designed for the annotation of CMC documents in the project Deutsches Referenzkorpus zur internetbasierten Kommunikation (DeRiK), which aims at building a corpus on language use in the most popular CMC genres on the German-speaking Internet. The focus of the schema is on those CMC genres which are written and dialogic―such as forums, bulletin boards, chats, instant messaging, wiki and weblog discussions, microblogging on Twitter, and conversation on “social network” sites.
The schema provides a representation format for the main structural features of CMC discourse as well as elements for the annotation of those units regarded as “typical” for language use on the Internet. The schema introduces an element <posting>, which describes stretches of text that are sent to the server by a user at a certain point in time. Postings are the main constituting elements of threads and logfiles, which, in our schema, are the two main types of CMC macrostructures. For the microlevel of CMC documents (that is, the structure of the <posting> content), the schema introduces elements for selected features of Internet jargon such as emoticons, interaction words and addressing terms. It allows for easy anonymization of CMC data for purposes in which the annotated data are made publicly available and includes metadata which are necessary for referencing random excerpts from the data as references in dictionary entries or as results of corpus queries.
Documentation of the schema as well as encoding examples can be retrieved from the web at http://www.empirikom.net/bin/view/Themen/CmcTEI. The schema is meant to be a core model for representing CMC that can be modified and extended by others according to their own specific perspectives on CMC data. It could be a first step towards an integration of features for the representation of CMC genres into a future new version of the TEI Guidelines.