OPUS 4 | Search

205 search hits

11 to 20

Sort by

Transparent, efficient, and robust word embedding access with WOMBAT (2018)

Müller, Mark-Christoph ; Strube, Michael

We present WOMBAT, a Python tool which supports NLP practitioners in accessing word embeddings from code. WOMBAT addresses common research problems, including unified access, scaling, and robust and reproducible preprocessing. Code that uses WOMBAT for accessing word embeddings is not only cleaner, more readable, and easier to reuse, but also much more efficient than code using standard in-memory methods: a Python script using WOMBAT for evaluating seven large word embedding collections (8.7M embedding vectors in total) on a simple SemEval sentence similarity task involving 250 raw sentence pairs completes in under ten seconds end-to-end on a standard notebook computer.

Erzählen multimodal (2018)

L’hétéro-répétition comme validation des complétions collaboratives. Analyse séquentielle et multimodale de séquences de co-construction (2018)

Oloff, Florence

Cette contribution s'intéresse aux co-constructions d'un tour de parole en interaction, plus spécifiquement, à la manière dont la complétion d'un énoncé de la part d'un co-participant est ensuite réceptionnée par le locuteur dont le tour a été complété. Malgré l'intérét certain porté par l'analyse conversationnelle et la linguistique interactionnelle à la co-énonciation, l'évaluation de cette pratique par le premier locuteur n’a pas fait l’objet d’analyses approfondies. Dans ce qui suit, nous nous focalisons plus particulièrement sur les pratiques interactionnelles qui permettent aux participants de valider une co-construction. Ce travail est issu du projet ANR SPIM (« L'imitation dans la parole »), dans le cadre duquel nous nous sommes interrogée sur la fonction de l'hétéro-répétition (le fait de répéter un énoncé d'un autre locuteur ou une partie de celui-ci, opposée à l'auto-répétition) dans des séquences de co-construction d'un tour de parole. Dans la partie analytique, nous contrastons deux possibilités de validation d'une complétion collaborative, à savoir l'acquiescement simple (« oui ») et l'hétéro-répétition simple. Sur la base d’enregistrements vidéo de conversations naturelles, nous montrons que ces deux pratiques ne valident pas la complétion collaborative de la même manière, mais qu'elles permettent aux locuteurs d’évaluer finement le caractère plus ou moins adéquat des éléments co-construits.

Diversité des répétitions et des reformulations dans les interactions orales : Défis analytiques et conception d’un outil de détection automatique (2018)

Ursi, Biagio ; Etienne, Carole ; Oloff, Florence ; Mondada, Lorenza ; Traverso, Véronique

Cette contribution propose une analyse qualitative et quantitative des reformulations sur des données interactionnelles. Pour la constitution du corpus d’étude, nous nous appuyons sur un outil de détection automatique des hétéro-répétitions, considérées comme indices de reformulation. Après avoir illustré les éléments qui ont présidé à la conception de l’outil, nous présentons le paramétrage de cette ressource, que nous avons testée sur quatre enregistrements de la base de données CLAPI. Cette étude souligne la pertinence de l’approche interactionnelle dans l’analyse des hétéro-répétitions, en en montrant les fonctionnalités multiples, notamment dans les pratiques de reformulation dans la conversation.

Zur Theatralität und Multimodalität des Erzählens in der Fernseh-Unterhaltung (2018)

Oloff, Florence ; König, Katharina

Der vorliegende Beitrag befasst sich mit Erzählen in seiner massenmedialen Vermittlung in einer Unterhaltungsendung im Fernsehen. Ziel ist es, anhand einer multimodalen und medienlinguistischen Analyse eines exemplarischen Ausschnitts aus der TV-Unterhaltungssendung "Zimmer frei" die Spezifik solcher massenmedialen Erzählungen herauszuarbeiten. Zum einen wird aufgezeigt, dass sich massenmediales Erzählen in seinem sequenziellen Auf- und Ausbau aufgrund seiner Einbindung in ein mediales Unterhaltungsformat in systematischer Weise von Alltagserzählungen unterscheidet. Zum anderen wird veranschaulicht, inwieweit theatrale Inszenierungs- und Aufführungsmittel der Fernsehproduktion die Aktivität des Erzählens mitkonstituieren. Erzählungen im Fernsehen, so die analyseleitende Prämisse, können nicht schlicht als durch das Fernsehen übertragene narrative Aktivitäten konzeptualisiert werden. Vielmehr sind sie durch eine mediale Theatralität mitgeprägt. (Para)verbale, körperliche und mediale Inszenierungs- und Aufführungsverfahren greifen konzertiert ineinander, um Erzählungen als "dramas to an audience" (Goffman 1974:508) hervorzubringen.

Ansätze zu einer multimodalen Erzählanalyse. Einführung in das Themenheft (2018)

König, Katharina ; Oloff, Florence

Bisherige linguistische Studien zum mündlichen Erzählen beziehen sich vornehmlich auf die Beschreibung verbaler und vokaler Verfahren. Erzählen findet jedoch häufig unter den Bedingungen der zeitlich-räumlichen Ko-Präsenz der SprecherInnen statt, die den Gebrauch von körperlichen und materiellen Ressourcen ermöglicht. Der vorliegende einleitende Beitrag des Themenheftes modelliert Erzählen daher als körpergebundene und verkörperlichte Praktik, die es im Rahmen von interaktionalen und sequenzorientierten Analyseansätzen zu beschreiben gilt. Im Anschluss an die Darstellung von Entwicklungslinien der soziolinguistischen und interaktional-gesprächsanalytischen Untersuchung konversationellen Erzählens wird ein Überblick über bisherige Befunde zur multimodalen Ausgestaltung des Erzählens in der face-to-face-Interaktion gegeben. Abschließend werden grundlegende Fragestellungen skizziert, deren Beantwortung im Rahmen einer multimodalen Erzählanalyse die tatsächliche Alltagspraxis des Erzählens umfassender zu erschließen vermag.

The importance of linguistic markers of identity and authenticity in German Gangsta rap (2018)

Cotgrove, Louis Alexander

This study investigates the language used by six German Gangsta rappers to establish and maintain their identity and authenticity as rappers, in songs released between 2015 and 2016. Gangsta rap is a subgenre of Hip-Hop that emphasises ‘the rappers’ street credibility in texts describing tough [urban] neighbourhoods, violence, misogyny, and the achievement of material wealth’ (Bower 379). The culture of Gangsta rap attracts overwhelmingly negative mainstream media coverage (Muggs; Roper) and is often accused of corrupting ‘standard’ language (Krummheuer). The lyrical content of the songs is indeed controversial and has been previously covered by many academics (Byrd; Littlejohn and Putnam; Bower; Rollefson), as has the emergence of Hip-Hop in Germany (Elflein; Pennay; Nitzsche and Grünzweig).

Datenmanagement – Gegenstand und Dienst der Computerlinguistik. 40th Annual Conference of the German Linguistic Society. Stuttgart, Germany. (2018)

Trippel, Thorsten

Datenmanagement wird durch die Forschungsföderungsorganisationen (etwa in Horizon 2020 der EU, die Allianz der deutschen Wissenschaftsorganisationen oder in DFG geförderten Projekten) mehr und mehr Teil der Forschungslandschaft. Für die Computerlinguistik ist das Forschungsdatenmanagement aber auch Teil des Forschungsgebietes: Datenmodellierung und Transformation für die nachhaltige Datenspeicherung gehören in den Bereich der Texttechnologie und Textlinguistik, ebenso die Modellierung der beschreibenden Daten zu Datensätzen.

An initial description of syntactic extensions in spoken Czech (2018)

Oloff, Florence ; Havlík, Martin

This paper aims to describe different patterns of syntactic extensions of turns-at-talk in mundane conversations in Czech. Within interactional linguistics, same-speaker continuations of possibly complete syntactic structures have been described for typologically diverse languages, but have not yet been investigated for Slavic languages. Based on previously established descriptions of various types of extensions (Vorreiter 2003; Couper-Kuhlen & Ono 2007), our initial description shall therefore contribute to the cross-linguistic exploration of this phenomenon. While all previously described forms for continuing a turn-constructional unit seem to exist in Czech, some grammatical features of this language (especially free word order and strong case morphology) may lead to problems in distinguishing specific types of syntactic extensions. Consequently, this type of language allows for critically evaluating the cross-linguistic validity of the different categories and underlines the necessity of analysing syntactic phenomena within their specific action contexts.

CLARIN data management activities in the PARTHENOS context (2018)

van Berchum, Marnix ; Trippel, Thorsten

Data Management is one of the core activities of all CLARIN centres providing data and services for the academia. In PARTHENOS, European initiatives and projects in the area of the humanities and social sciences assembled to compare policies and procedures. One of the areas of interest is data management. The data management landscape shows a lot of proliferation, for which an abstraction level is introduced to help centres, such as CLARIN centres, in the process of providing the best possible services to users with data management needs.

11 to 20

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Publicationstate

Reviewstate

Publisher

205 search hits