-
A Crowd-Annotated Spanish Corpus for Humor Analysis
Abstract: Computational Humor involves several tasks, such as humor recognition, humor generation, and humor scoring, for which it is useful to have human-curated data. In this work we present a corpus of 27,000 tweets written in Spanish and crowd-annotated by their humor value and funniness score, with about four annotations per tweet, tagged by 1,300 people over the Internet. It is equally divided between… ▽ More
Submitted 19 July, 2018; v1 submitted 2 October, 2017; originally announced October 2017.
Comments: Camera-ready version of the paper submitted to SocialNLP 2018, with a fixed typo
-
Is This a Joke? Detecting Humor in Spanish Tweets
Abstract: While humor has been historically studied from a psychological, cognitive and linguistic standpoint, its study from a computational perspective is an area yet to be explored in Computational Linguistics. There exist some previous works, but a characterization of humor that allows its automatic recognition and generation is far from being specified. In this work we build a crowdsourced corpus of la… ▽ More
Submitted 28 March, 2017; originally announced March 2017.
Comments: Preprint version, without referral
Journal ref: Presented in Iberamia 2016. The final publication is available at link.springer.com: https://link.springer.com/chapter/10.1007%2F978-3-319-47955-2_12