Refine
Document Type
- Conference Proceeding (9)
- Part of a Book (5)
- Report (3)
- Article (1)
- Other (1)
Has Fulltext
- yes (19)
Keywords
- Korpus <Linguistik> (9)
- Forschungsdaten (8)
- Datenmanagement (4)
- Metadatenmodell (4)
- CLARIAH-DE (3)
- Deutsch (3)
- Digitale Edition (3)
- Edition (3)
- Infrastruktur (3)
- Metadaten (3)
Publicationstate
- Veröffentlichungsversion (19) (remove)
Reviewstate
- Peer-Review (10)
- (Verlags)-Lektorat (5)
Publisher
- Leibniz-Institut für Deutsche Sprache, CLARIAH-DE (3)
- Linköping University Electronic Press (3)
- CLARIN (2)
- Dagstuhl (2)
- Leibniz-Institut für Deutsche Sprache (IDS) (2)
- Waxmann (2)
- Zenodo (2)
- European Language Resources Association (1)
- Institute for Logic, Language and Computation (1)
- Universitätsverlag Rhein-Ruhr (1)
Interested in formally modelling similarity between narratives, we investigate judgements of similarity between narratives in a small corpus of film reviews and book–film comparisons. A main finding is that judgements tend to concern multiple levels of story representation at once. As these texts are pragmatically related to reception contexts, we find many references to reception quality and optimality. We conclude that current formal models of narrative can not capture the task of naturalistic narrative comparisons given in the analysed reviews, but that the development of models containing a more reception-oriented point of view will be necessary.
The understanding of story variation, whether motivated by cultural currents or other factors, is important for applications of formal models of narrative such as story generation or story retrieval. We present the first stage of an experiment to elicit natural narrative variation data suitable for evaluation with respect to story similarity, to qualitative and quantitative analysis of story variation, and also for data processing. We also present few preliminary results from the first stage of the experiment, using Red Riding Hood and Romeo and Juliet as base texts.
Vorwort
(2018)
We present web services which implement a workflow for transcripts of spoken language following the TEI guidelines, in particular ISO 24624:2016 “Language resource management – Transcription of spoken language”. The web services are available at our website and will be available via the CLARIN infrastructure, including the Virtual Language Observatory and WebLicht.
Dieser Beitrag beschreibt, welche Schritte nötig sind, um die Daten des Archivs der Grafen v. Platen (AGP) für Forschungsdateninfrastrukturen (FDI) zugänglich zu machen: die Daten konvertieren, die Metadaten extrahieren, Daten und Metadaten indizieren sowie die Datenmodelle für Daten und Metadaten so ergänzen, dass sie die Bestände des Archivs sinnvoll erfassen. Zugleich wird begründet, weshalb man überhaupt solchen Aufwand treiben sollte: nämlich, damit die Daten einem größeren Publikum zur Verfügung stehen und überdies mit Werkzeugen bearbeitet werden können, die in den Infrastrukturen zur Verfügung stehen, und damit eine weitere Verlinkung und Kombination mit externen Ressourcen erfolgen kann, sodass ein deutlicher Mehrwert entstehen kann.
This paper addresses long-term archival for large corpora. Three aspects specific to language resources are focused, namely (1) the removal of resources for legal reasons, (2) versioning of (unchanged) objects in constantly growing resources, especially where objects can be part of multiple releases but also part of different collections, and (3) the conversion of data to new formats for digital preservation. It is motivated why language resources may have to be changed, and why formats may need to be converted. As a solution, the use of an intermediate proxy object called a signpost is suggested. The approach will be exemplified with respect to the corpora of the Leibniz Institute for the German Language in Mannheim, namely the German Reference Corpus (DeReKo) and the Archive for Spoken German (AGD).