OPUS 4 | Search

35 search hits

1 to 10

When readers pay attention to the left: A concurrent eyetracking-fMRI investigation on the neuronal correlates of regressive eye movements during reading (2017)

Weiß, Anna Fiona ; Kretzschmar, Franziska ; Nagels, Arne ; Schlesewsky, Matthias ; Bornkessel-Schlesewsky, Ina ; Tune, Sarah

Keeping Properties with the Data CL-MetaHeaders - An Open Specification (2017)

Vidler, John ; Wattam, Stephen

Corpus researchers, like researchers in many other scientific disciplines, are under continual pressure to show accountability and reproducibility in their work. This is unsurprisingly difficult when the researcher faces a wide array of methods and tools; simply tracking the operations performed can be problematic, especially when toolchains are configured by their developers but left largely as a black box to the user. Here we present a scheme for encoding this ‘metadata’ inside the corpus files themselves in a structured data format, along with a proof-of-concept tool to record the operations performed on a file.

Removing spam from web corpora through supervised learning using FastText (2017)

Suchomel, Vít

Unlike traditional text corpora collected from trustworthy sources, the content of web-based corpora has to be filtered. This study briefly discusses the impact of web spam on corpus usability and emphasizes the importance of removing computer-generated text from web corpora. The paper also presents a keyword comparison of an unfiltered corpus with the same collection of texts cleaned by a supervised classifier trained using FastText. The classifier was able to recognize 71% of web spam documents similar to the training set but lacked both precision and recall when applied to short texts from another data set.

User's Guide for the ZAS Database of Clause-Embedding Predicates (2017)

Stiebels, Barbara ; McFadden, Thomas ; Schwabe, Kerstin ; Solstad, Torgrim ; Kellner, Elisa ; Sommer, Livia ; Stoltmann, Katarzyna

Marital Satisfaction, Sex, Age, Marriage Duration, Religion, Number of Children, Economic Status, Education, and Collectivistic Values: Data from 33 Countries (2017)

Sorokowski, Piotr ; Randall, Ashley K. ; Groyecka, Agata ; Frackowiak, Tomasz ; Cantarero, Katarzyna ; Hilpert, Peter ; Ahmadi, Khodabakhsh ; Alghraibeh, Ahmad M. ; Aryeetey, Richmond ; Bertoni, Anna ; Bettache, Karim ; Błażejewska, Marta ; Bodenmann, Guy ; Bortolini, Tiago S. ; Bosc, Carla ; Butovskaya, Marina ; Castro, Felipe N. ; Cetinkaya, Hakan ; Cunha, Diana ; David, Daniel ; David, Oana A. ; Domínguez Espinosa, Alejandra C. ; Donato, Silvia ; Dronova, Daria ; Dural, Seda ; Fisher, Maryanne ; Akkaya, Aslıhan Hamamcıoğlu ; Hamamura, Takeshi ; Hansen, Karolina ; Hattori, Wallisen T. ; Hromatko, Ivana ; Gulbetekin, Evrim ; Iafrate, Raffaella ; James, Bawo ; Jiang, Feng ; Kimamo, Charles O. ; Koç, Fırat ; Krasnodębska, Anna ; Laar, Amos ; Lopes, Fívia A. ; Martinez, Rocio ; Mesko, Norbert ; Molodovskaya, Natalya ; Qezeli, Khadijeh Moradi ; Motahari, Zahrasadat ; Natividade, Jean C. ; Ntayi, Joseph ; Ojedokun, Oluyinka ; Omar-Fauzee, Mohd S. B. ; Onyishi, Ike E. ; Özener, Barış ; Paluszak, Anna ; Portugal, Alda ; Realo, Anu ; Relvas, Ana P. ; Rizwan, Muhammad ; Sabiniewicz, Agnieszka L. ; Salkičević, Svjetlana ; Sarmány-Schuller, Ivan ; Stamkou, Eftychia ; Stoyanova, Stanislava ; Šukolová, Denisa ; Sutresna, Nina ; Tadinac, Meri ; Teras, Andero ; Ponciano, Edna L. T. ; Tripathi, Ritu ; Tripathi, Nachiketa ; Tripathi, Mamta ; Yamamoto, Maria E. ; Yoo, Gyesook ; Sorokowska, Agnieszka

Forms of committed relationships, including formal marriage arrangements between men and women, exist in almost every culture (Bell, 1997). Yet, similarly to many other psychological constructs (Henrich et al., 2010), marital satisfaction and its correlates have been investigated almost exclusively in Western countries (e.g., Bradbury et al., 2000). Meanwhile, marital relationships are heavily guided by culturally determined norms, customs, and expectations (for review see Berscheid, 1995; Fiske et al., 1998). While we acknowledge the differences existing both between and within cultures, we measured marital satisfaction and several factors that might potentially correlate with it based on self-report data from individuals across 33 countries. The purpose of this paper is to introduce the raw data, which are available to anyone interested in further examining relations between them and other country-level scores obtained elsewhere. Below, we review the central variables that are likely to be related to marital satisfaction.

From Knapsack to Wessi. German loanwords in English: 1600-2000 (2017)

Simpson, John

The present paper examines the rise and fall of Modern High German loanwords in English from 1600 until 2000, principally making use of the record of borrowing documented by the Oxford English Dictionary (OED) in its Third Edition (online version, in revision 2000-). Groups of loanwords are analysed by century, with reference to the changing social and cultural landscape characterising relationships between the relevant nations over this period. This is not a simple picture: each language grows over the period in different ways, and the speakers of English look to German at different times for different types of borrowing, as the political and intellectual balance alters.

A Survey on Hate Speech Detection using Natural Language Processing (2017)

Schmidt, Anna ; Wiegand, Michael

This paper presents a survey on hate speech detection. Given the steadily growing body of social media content, the amount of online hate speech is also increasing. Due to the massive scale of the web, methods that automatically detect hate speech are required. Our survey describes key areas that have been explored to automatically recognize these types of utterances using natural language processing. We also discuss limits of those approaches.

Accelerating corpus search using multiple cores (2017)

Rábara, Radoslav ; Rychlý, Pavel ; Herman, Ondřej ; Jakubíček, Miloš

The Manatee corpus management system, on which the Sketch Engine is built, is efficient but unable to harness the power of today’s multiprocessor machines. We describe a new, compatible implementation of Manatee, which we develop in the Go language, and report on the performance gains that we obtained.

Data point selection for genre-aware parsing (2017)

Rehbein, Ines ; Bildhauer, Felix

In the NLP literature, adapting a parser to new text with properties different from the training data is commonly referred to as domain adaptation. In practice, however, the differences between texts from different sources often reflect a mixture of domain and genre properties, and it is by no means clear what impact each of those has on statistical parsing. In this paper, we investigate how differences between articles in a newspaper corpus relate to the concepts of genre and domain and how they influence parsing performance of a transition-based dependency parser. We do this by applying various similarity measures for data point selection and testing their adequacy for creating genre-aware parsing models.

Theory, data, and the epistemology of syntax (2017)

Pullum, Geoffrey K.

Syntactic theory has tended to vacillate between implausible methodological extremes. Some linguists hold that our theories are accountable solely for the corpus of attested utterances; others assume our subject matter is unobservable intuitive feelings about sentences. Both extremes should be rejected. The subject matter of syntax is neither past utterance production nor the functioning of inaccessible mental machinery; it is normative - a system of tacitly grasped constraints defining correctness of structure. There are interesting parallels between syntactic and moral systems, modulo the key difference that linguistic systems are diverse whereas morality is universal. The appropriate epistemology for justifying formulations of normative systems is familiar in philosophy: it is known as the method of reflective equilibrium.
