Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political Debates

Paula Carvalho (1), Luís Sarmento (2), Jorge Teixeira (2), Mário J. Silva (1)
(1) Lasige, Faculty of Sciences, University of Lisbon; (2) Labs Sapo UP & University of Porto


We investigate the expression of opinions about human entities in user-generated content (UGC). A set of 2,800 online news comments (8,000 sentences) was manually annotated, following a rich annotation scheme designed for this purpose. We conclude that the challenge in performing opinion mining on this type of content is correctly identifying positive opinions, because (i) they are much less frequent than negative opinions and (ii) they are particularly exposed to verbal irony. We also show that recognizing human targets poses additional challenges for mining opinions from UGC, since these targets are frequently referred to by pronouns, definite descriptions and nicknames.

Full paper: