In 2008 a study was carried out whose main aim was to characterise the current role of the Latgalian language in the education system. This article presents the most important results of that study. The impetus for the study came from the “Mercator Education Centre”, which operates in Leeuwarden (Frisian: Ljouwert), the capital of the province of Friesland in the Netherlands. A full account of the study was published in English, with the support of the Mercator Education Centre, in the “Regional Dossier Series”. This article is intended mainly for readers who are less closely connected with European language research institutions and for whom articles in English may be difficult to understand or to locate. For this reason, a more detailed description of the methods and aims is given at the outset, explaining the structure of the study and the way the results were summarised, together with an overview of the role of the Latgalian language in today's education system. The conclusions outline future perspectives and proposals for making use of the results obtained.
“Linguistic Landscapes” (LL) is a research method which has become increasingly popular in recent years. In this paper, we will first explain the method itself and discuss some of its fundamental assumptions. We will then recall the basic traits of multilingualism in the Baltic States, before presenting results from a project carried out together with a group of Master's students of Philology in several medium-sized towns in the Baltic States, focussing on our home town of Rēzekne in the highly multilingual region of Latgale in Eastern Latvia. In the discussion of some of the results, we will introduce the concept of “Legal Hypercorrection” as a term for stricter compliance with language laws than is necessary. The last part will report on the advantages of LL for education about multilingualism and for developing discussions on multilingualism among the general public.
Beyond the stars: exploiting free-text user reviews to improve the accuracy of movie recommendations
(2009)
In this paper we show that extracting opinions from free-text reviews can improve the accuracy of movie recommendations. We present three approaches to extracting movie aspects as opinion targets and use them as features for collaborative filtering. Each of these approaches requires a different amount of manual interaction. To evaluate the different features, we collected a data set of reviews with corresponding ordinal (star) ratings for several thousand movies. We employ a state-of-the-art collaborative filtering engine for the recommendations during our evaluation and compare its performance with and without the features representing user preferences mined from the free-text reviews provided by the users. The opinion-mining-based features perform significantly better than the baseline, which is based on star ratings and genre information only.
This paper introduces LRTwiki, an improved variant of the Likelihood Ratio Test (LRT). The central idea of LRTwiki is to employ a comprehensive domain-specific knowledge source as an additional “on-topic” data set, and to modify the calculation of the LRT algorithm to take advantage of this new information. The knowledge source is created on the basis of Wikipedia articles. We evaluate on two related tasks, product feature extraction and keyphrase extraction, and find that LRTwiki yields a significant improvement over the original LRT in both tasks.
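For orientation, the following sketch shows a common binomial formulation of the likelihood ratio statistic for comparing a term's frequency in an on-topic corpus with its frequency in a background corpus. It is assumed here to correspond to the original LRT baseline; the LRTwiki modification and its use of Wikipedia data are not described in enough detail in the abstract to reproduce. The function names and the example counts are hypothetical.

```python
import math


def _xlogy(x, y):
    """Return x * log(y), with the convention 0 * log(0) = 0."""
    return 0.0 if x == 0 else x * math.log(y)


def _log_likelihood(k, n, p):
    """Binomial log-likelihood of k successes in n trials (constant term dropped)."""
    return _xlogy(k, p) + _xlogy(n - k, 1.0 - p)


def likelihood_ratio(k1, n1, k2, n2):
    """-2 log lambda for a term seen k1 times in n1 on-topic tokens
    and k2 times in n2 background tokens."""
    p1 = k1 / n1              # rate in the on-topic corpus
    p2 = k2 / n2              # rate in the background corpus
    p = (k1 + k2) / (n1 + n2)  # pooled rate under the null hypothesis
    return 2.0 * (
        _log_likelihood(k1, n1, p1) + _log_likelihood(k2, n2, p2)
        - _log_likelihood(k1, n1, p) - _log_likelihood(k2, n2, p)
    )


# Hypothetical counts: same rate in both corpora vs. a strongly on-topic term.
same_rate = likelihood_ratio(10, 1000, 10, 1000)
on_topic = likelihood_ratio(50, 1000, 5, 1000)
```

Higher scores indicate that the term's rate differs more strongly between the two corpora. The statistic is symmetric, so candidate terms are usually also filtered by requiring the on-topic rate to be the higher one.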
This contribution discusses several issues that emerged from a study of multilingual professional practices. These issues arose from an analysis conducted collaboratively by two teams of researchers, in Lyon and in Paris, who take part in the European project DYLAN (6th Framework Programme) and jointly developed the empirical analysis of an excerpt from a work meeting, recorded as part of a collaboration in the same field setting. The analysis provides an opportunity to address, by way of example, a number of questions raised by the study of language contact in professional contexts, concerning both the epistemological issues and the researcher's engagement in the field.
This paper describes a new approach to improving the analysis and categorization of web documents using statistical methods for template-based clustering as well as semantic analysis based on terminological ontologies. A domain-specific environment serves as a proof of concept. In order to demonstrate the widespread practical benefit of our approach, we outline a combined mathematical and semantic framework for information retrieval on internet resources.
The paper discusses particular logical consistency conditions satisfied by German proposition-embedding predicates which determine the question type (external and internal whether-form as well as exhaustive and non-exhaustive wh-form), the correlate type (es- or da-correlate) as well as the impact of the correlate on the respective consistency condition. It will turn out that some consistency conditions also determine the embedding of verb second and subject-control.
Digital media have by now transformed all areas of life at breakneck speed. They intervene ever further in established structures and increasingly shape our economic, working and social worlds, as well as our private communication and our everyday life. Constant new developments repeatedly confront everyone involved with new challenges. This goes hand in hand with the need to acquire new knowledge continuously. Media literacy is regarded as the key qualification for mastering these new demands in our constantly changing society. Alongside reading, writing and arithmetic, it has become the fourth cultural technique, one that all citizens of our society should master regardless of age, gender and origin. Indeed, this competence must be mastered in order to take part in current social and political developments at all and to remain employable. Teaching it thus becomes a public educational mandate.