OPUS 4 | Search

25 search hits

Proceedings of the 12th edition of the KONVENS conference (2014)

The 2014 issue of KONVENS is even more a forum for exchange: its main topic is the interaction between Computational Linguistics and Information Science, and the synergies that such interaction, cooperation and integrated views can produce. This topic lies at the crossroads of different research traditions that deal with natural language as a container of knowledge and with methods to extract and manage linguistically represented knowledge. It is close to the heart of many researchers at the Institut für Informationswissenschaft und Sprachtechnologie of Universität Hildesheim: it has long been one of the institute’s research topics, and it has received even more attention over the last few years. The main conference papers deal with this topic from different points of view, involving flat as well as deep representations, automatic methods targeting annotation and hybrid symbolic and statistical processing, as well as new Machine Learning-based approaches, but also the creation of language resources for both machines and humans, and methods for testing the latter to optimize their human-machine interaction properties. In line with the general topic, KONVENS-2014 focuses on areas of research that involve this cooperation of information science and computational linguistics: for example, learning-based approaches, (cross-lingual) Information Retrieval, Sentiment Analysis, paraphrasing, and dictionary and corpus creation, management and usability.

Workshop Proceedings of the 12th edition of the KONVENS conference (2014)

Towards a gold standard corpus for detecting valencies of Zulu verbs (2019)

Faaß, Gertrud ; Bosch, Sonja

We report on a new project that builds a Natural Language Processing resource for Zulu from resources already available. By semi-automatically combining tagging results with the results of morphological analysis, we expect to reduce the amount of manual work needed to generate a fine-grained gold standard corpus usable for training a tagger. From the tagged corpus, we plan to extract verb-argument pairs with the aim of compiling a verb valency lexicon for Zulu.

Building NLP resources for Dzongkha: A tagset and a tagged corpus (2010)

Chungku, Chungku ; Rabgay, Jurmey ; Faaß, Gertrud

This paper describes the application of probabilistic part-of-speech taggers to the Dzongkha language. A tag set of 66 tags, based on the Penn Treebank, was designed. A training corpus of 40,247 tokens was used to train the model. Using a lexicon extracted from the training corpus together with a lexicon from an available word list, we compared two statistical taggers. The best result achieved was 93.1% accuracy in a 10-fold cross-validation on the training set. The winning tagger was then applied to annotate a 570,247-token corpus.

Practice Report. A blended learning approach to teaching NLP for a DH public (2017)

Faaß, Gertrud ; Heid, Ulrich

This paper reports on current practice in a staged approach to introducing NLP principles and techniques to students of information science (IIM) and of international communication and translation (ICT) as part of their curricula. As most of these students are largely unfamiliar with computer science or, in the case of IIM students, with linguistics, we see them as comparable to students of the humanities. We follow a blended learning strategy with lectures, online materials, tutorials, and screencasts. In the first two terms, we focus on linguistics and its formalisation; NLP tools and applications are then introduced from the third term on. The lectures are combined with tutorials and, since the summer term 2017, with a set of screencasts.

Datenmanagement – Gegenstand und Dienst der Computerlinguistik [Data management: subject and service of computational linguistics]. 40th Annual Conference of the German Linguistic Society. Stuttgart, Germany. (2018)

Trippel, Thorsten

Research funding organisations (for example in the EU's Horizon 2020, the Alliance of Science Organisations in Germany, or in DFG-funded projects) are making data management more and more a part of the research landscape. For computational linguistics, however, research data management is also part of the research field itself: data modelling and transformation for sustainable data storage belong to the domain of text technology and text linguistics, as does the modelling of the descriptive data about datasets.

Wie Wörter Wellen werden. Die Untersuchung von Sprachverarbeitung mittels EEG [How words become waves. Investigating language processing using EEG] (2017)

Freunberger, Dominik

After a brief methodological introduction to electroencephalography and event-related potentials, this contribution sets out some key points that should be considered when designing a linguistic EEG experiment. It closes with considerations that deserve particular attention when investigating grammatical variation.

Sprachtechnologie für die Strukturierung digitaler Information [Language technology for structuring digital information] (2000)

Uszkoreit, Hans

This contribution explains the basic ideas and potential of consistent hypermedia linking of information on the WWW. To this end, it presents methods for automatic hyperlinking, discusses the role of language technology in this context, and explains the importance of the XML standards for achieving dense hyperlinking.

Bemerkungen zur andauernden Aktualität des Werks von Ulrich Engel [Remarks on the continuing relevance of Ulrich Engel's work] (2018)

Lobin, Henning

With his publications on German grammar, verb valency and contrastive linguistics, Ulrich Engel has had a great impact on international German linguistics. It is less well known that his work has also influenced other linguistic subdisciplines, which benefit from it to this day. Dependency-based approaches now play a central role in automatic syntactic analysis, and Engel's work has likewise left its mark on the development of machine translation systems. The construction of language resources in the form of "treebanks" can draw on Engel's conception of grammar, and there are also clear links to construction grammar, which has recently been flourishing again. The contribution presents these lesser-known influences of Engel's work on other areas and acknowledges their continuing relevance.

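Computer sind dumm - und daher äußerst nützlich! Eine kurze Einführung in die komplexe Welt der Computerlinguistik [Computers are stupid, and therefore extremely useful! A short introduction to the complex world of computational linguistics] (2010)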

Fisseni, Bernhard
