OPUS 4 | Search

Sprachverfall? Einleitung (2014)

Méthodes pour la représentation informatisée de données lexicales / Methoden der Speicherung lexikalischer Daten (2014)

In recent years, new developments in lexicography have not only altered the management, processing and publishing of lexicographical data, but also created new types of products such as electronic dictionaries and thesauri. These expand the range of possible uses of lexical data and give users more flexibility, for instance in assisting human translation. In this article, we give a short and easy-to-understand introduction to the problems involved in the storage, display and interpretation of lexical data. We then describe the main methods and specifications used to build and represent lexical data.

Maschinelle Übersetzung – Gegenwart und Perspektiven (2014)

Werthmann, Antonina ; Witt, Andreas

Communication across all language barriers has long been a goal of humankind. In recent years, new technologies have made this at least partially possible. New approaches and methods in the field of Machine Translation (MT) are continuously being improved, modified, and combined. Significant progress has already been achieved in this area; many automatic translation tools, such as Google Translate and Babelfish, can translate not only short texts, but also complete web pages in real time. More recently, advances have been made in the mobile area; Google's Translate app for Android and iOS, for example, can recognize and translate words within photographs taken by the mobile device (to translate a restaurant menu, for instance). Despite this progress, a “perfect” machine translation system seems to be an impossibility because a machine translation system, however advanced, will always have some limitations. Human languages contain many irregularities and exceptions and go through a constant process of change, which is difficult to measure or to process automatically. This paper gives a short introduction to the state of the art of MT. It examines the following aspects: types of MT, the most conventional and widely developed approaches, and the advantages and disadvantages of these different paradigms.

Identifikation von Kostenfaktoren und Erarbeitung von Kostenmodellen (R 1.1.1). Version 28.03.2014. Arbeitspaket 1.1. Verantwortlicher Partner IDS. TextGrid. Virtuelle Forschungsumgebung für die Geisteswissenschaften. Projekt: TextGrid - Institutionalisierung einer Virtuellen Forschungsumgebung in den Geisteswissenschaften. BMBF Förderkennzeichen: 01UG1203A. Laufzeit: Juni 2012 bis Mai 2015 (2014)

Fiedler, Norman ; Funk, Stefan ; Gietz, Peter ; Küster, Marc C. ; Rapp, Andrea ; Söring, Sibylle ; Vitt, Thorsten ; Wieder, Philipp ; Witt, Andreas

From <tiger2/> to ISOTiger – community driven developments for syntax annotation in SynAF (2014)

Bosch, Sonja ; Eckart, Kerstin ; Faaß, Gertrud ; Heid, Ulrich ; Lee, Kiyong ; Pareja-Lora, Antonio ; Pretorius, Laurette ; Romary, Laurent ; Witt, Andreas ; Zeldes, Amir ; Zipser, Florian

In 2010, ISO published a standard for syntactic annotation, ISO 24615:2010 (SynAF). Back then, the document specified a comprehensive reference model for the representation of syntactic annotations, but no accompanying XML serialisation. ISO’s subcommittee on language resource management (ISO TC 37/SC 4) is working on making the SynAF serialisation ISOTiger an additional part of the standard. This contribution addresses the current state of development of ISOTiger, along with a number of open issues on which we are seeking community feedback in order to ensure that ISOTiger becomes a useful extension to the SynAF reference model.

Forschungsinfrastrukturen in außeruniversitären Forschungseinrichtungen. Forschungsbericht (2014)

Fiedler, Norman ; Werthmann, Antonina ; Stührenberg, Maik ; Schonefeld, Oliver ; Bingel, Joachim ; Witt, Andreas

Forschungsinfrastrukturen am IDS: Gegenwart und Zukunft (2014)

Schonefeld, Oliver ; Witt, Andreas

Erfahrungsbericht Rechtsform: Praxisbewährung und Nutzeranforderungen (R 1.1.2). Arbeitspaket 1.1. TextGrid - Institutionalisierung einer Virtuellen Forschungsumgebung in den Geisteswissenschaften. Laufzeit: Juni 2012 bis Mai 2015 (2014)

Fiedler, Norman ; Witt, Andreas

Data formats for phonological corpora (2014)

Romary, Laurent ; Witt, Andreas

Best practices on long-term archiving of spoken language data (2014)

Fischer, Peter M. ; Witt, Andreas

Access control by query rewriting: the case of KorAP (2014)

Banski, Piotr ; Diewald, Nils ; Hanl, Michael ; Kupietz, Marc ; Witt, Andreas

We present an approach to one aspect of managing complex access scenarios for large and heterogeneous corpora: handling user queries that, intentionally or due to the complexity of the queried resource, target texts or annotations outside of the given user’s permissions. We first outline the overall architecture of the corpus analysis platform KorAP, devoting some attention to the way in which it handles multiple query languages by implementing ISO CQLF (Corpus Query Lingua Franca), a component crucial for the functionality discussed here. Next, we look at query rewriting as it is used by KorAP and focus on one kind of this procedure, namely the rewriting of queries that is forced by data access restrictions.
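The idea of rewriting a query because of access restrictions can be illustrated with a minimal sketch. This is not KorAP's implementation: the `Query` and `RewriteResult` types, the `rewrite_for_access` function, and the corpus names are all hypothetical, and the sketch assumes that permissions are expressed as a plain set of corpus identifiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    """Hypothetical query: a search pattern plus the corpora it targets."""
    pattern: str
    corpora: frozenset


@dataclass(frozen=True)
class RewriteResult:
    """The rewritten query and a record of what the rewrite removed."""
    query: Query
    removed: frozenset


def rewrite_for_access(query: Query, permitted: frozenset) -> RewriteResult:
    """Restrict a query to the corpora the user is permitted to read.

    The search pattern is left unchanged; only the target corpora are
    narrowed, and the dropped corpora are reported back to the caller.
    """
    allowed = query.corpora & permitted
    removed = query.corpora - permitted
    return RewriteResult(Query(query.pattern, allowed), removed)


# Illustrative corpus names (assumptions, not real KorAP resources).
request = Query("[pos=NN]", frozenset({"public-news", "licensed-fiction"}))
result = rewrite_for_access(request, permitted=frozenset({"public-news"}))
print(sorted(result.query.corpora))
print(sorted(result.removed))
```

Returning the removed corpora alongside the rewritten query is a design choice of this sketch: it lets a front end tell the user that the query was narrowed rather than silently returning fewer hits.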


11 search hits