TY  - JOUR
U1  - Journal article, scholarly - peer-reviewed (reviewed)
A1  - Wiegand, Michael
A1  - Klakow, Dietrich
T1  - Detecting conditional healthiness of food items from natural language text
JF  - Language Resources and Evaluation
N2  - In this article, we explore the feasibility of extracting suitable and unsuitable food items for particular health conditions from natural language text. We refer to this task as conditional healthiness classification. For that purpose, we annotate a corpus extracted from forum entries of a food-related website. We identify different relation types that hold between food items and health conditions going beyond a binary distinction of suitability and unsuitability and devise various supervised classifiers using different types of features. We examine the impact of different task-specific resources, such as a healthiness lexicon that lists the healthiness status of a food item and a sentiment lexicon. Moreover, we also consider task-specific linguistic features that disambiguate a context in which mentions of a food item and a health condition co-occur and compare them with standard features using bag of words, part-of-speech information and syntactic parses. We also investigate to what extent individual food items and health conditions correlate with specific relation types and try to harness this information for classification.
KW  - Computational linguistics
KW  - Information Extraction
KW  - Polarity
KW  - Food
KW  - Natural language
KW  - Text classification
KW  - Food domain
KW  - Social media
KW  - Linguistically informed feature engineering
KW  - Polarity classification
Y1  - 2015
UN  - https://nbn-resolving.org/urn:nbn:de:bsz:mh39-85428
SN  - 1574-0218
SS  - 1574-0218
U6  - https://doi.org/10.1007/s10579-015-9314-7
DO  - https://doi.org/10.1007/s10579-015-9314-7
N1  - This is a post-peer-review, pre-copyedit version of an article published in Language Resources and Evaluation. The final authenticated version is available online at: http://dx.doi.org/10.1007/s10579-015-9314-7
VL  - 49
IS  - 4
SP  - 777
EP  - 830
PB  - Springer
CY  - Dordrecht
ER  - 