Skip to main content

Showing 1–31 of 31 results for author: Neves, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.21838  [pdf, other

    cs.IR

    Learning Universal User Representations Leveraging Cross-domain User Intent at Snapchat

    Authors: Clark Mingxuan Ju, Leonardo Neves, Bhuvesh Kumar, Liam Collins, Tong Zhao, Yuwei Qiu, Qing Dou, Yang Zhou, Sohail Nizam, Rengim Ozturk, Yvette Liu, Sen Yang, Manish Malik, Neil Shah

    Abstract: The development of powerful user representations is a key factor in the success of recommender systems (RecSys). Online platforms employ a range of RecSys techniques to personalize user experience across diverse in-app surfaces. User representations are often learned individually through user's historical interactions within each surface and user representations across different surfaces can be sh… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: Accepted to the industrial track of SIGIR'25

  2. arXiv:2412.17245  [pdf, other

    cs.IR cs.SI

    GraphHash: Graph Clustering Enables Parameter Efficiency in Recommender Systems

    Authors: Xinyi Wu, Donald Loveland, Runjin Chen, Yozen Liu, Xin Chen, Leonardo Neves, Ali Jadbabaie, Clark Mingxuan Ju, Neil Shah, Tong Zhao

    Abstract: Deep recommender systems rely heavily on large embedding tables to handle high-cardinality categorical features such as user/item identifiers, and face significant memory constraints at scale. To tackle this challenge, hashing techniques are often employed to map multiple entities to the same embedding and thus reduce the size of the embedding tables. Concurrently, graph-based collaborative signal… ▽ More

    Submitted 8 February, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

    Comments: ACM Web Conference (WWW) 2025, Oral

  3. arXiv:2412.17171  [pdf, other

    cs.LG cs.IR

    Enhancing Item Tokenization for Generative Recommendation through Self-Improvement

    Authors: Runjin Chen, Mingxuan Ju, Ngoc Bui, Dimosthenis Antypas, Stanley Cai, Xiaopeng Wu, Leonardo Neves, Zhangyang Wang, Neil Shah, Tong Zhao

    Abstract: Generative recommendation systems, driven by large language models (LLMs), present an innovative approach to predicting user preferences by modeling items as token sequences and generating recommendations in a generative manner. A critical challenge in this approach is the effective tokenization of items, ensuring that they are represented in a form compatible with LLMs. Current item tokenization… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

  4. arXiv:2411.10142  [pdf, other

    cs.CY

    First Steps towards K-12 Computer Science Education in Portugal -- Experience Report

    Authors: Fernando Luis Neves, Jose Nuno Oliveira

    Abstract: Computer scientists Jeannette Wing and Simon Peyton Jones have catalyzed a pivotal discussion on the need to introduce computing in K-12 mandatory education. In Wing's own words, computing 'represents a universally applicable attitude and skill set everyone, not just computer scientists, would be eager to learn and use.'' The crux of this educational endeavor lies in its execution. This paper repo… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

    ACM Class: K.3.1; K.3.2

  5. arXiv:2406.04106  [pdf, other

    cs.CL

    Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster

    Authors: Agostina Calabrese, Leonardo Neves, Neil Shah, Maarten W. Bos, Björn Ross, Mirella Lapata, Francesco Barbieri

    Abstract: Content moderators play a key role in keeping the conversation on social media healthy. While the high volume of content they need to judge represents a bottleneck to the moderation pipeline, no studies have explored how models could support them to make faster decisions. There is, by now, a vast body of research into detecting hate speech, sometimes explicitly motivated by a desire to help improv… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages, 14 figures, to be published at ACL 2024

  6. arXiv:2403.13344  [pdf, other

    cs.SI cs.AI cs.CL cs.HC cs.IR cs.LG

    USE: Dynamic User Modeling with Stateful Sequence Models

    Authors: Zhihan Zhou, Qixiang Fang, Leonardo Neves, Francesco Barbieri, Yozen Liu, Han Liu, Maarten W. Bos, Ron Dotsch

    Abstract: User embeddings play a crucial role in user engagement forecasting and personalized services. Recent advances in sequence modeling have sparked interest in learning user embeddings from behavioral data. Yet behavior-based user embedding learning faces the unique challenge of dynamic user modeling. As users continuously interact with the apps, user embeddings should be periodically updated to accou… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  7. arXiv:2403.13220  [pdf

    cs.SE

    Elevating Software Quality in Agile Environments: The Role of Testing Professionals in Unit Testing

    Authors: Lucas Neves, Oscar Campos, Robson Santos, Italo Santos, Cleyton Magalhaes, Ronnie de Souza Santos

    Abstract: Testing is an essential quality activity in the software development process. Usually, a software system is tested on several levels, starting with unit testing that checks the smallest parts of the code until acceptance testing, which is focused on the validations with the end-user. Historically, unit testing has been the domain of developers, who are responsible for ensuring the accuracy of thei… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  8. General-Purpose User Modeling with Behavioral Logs: A Snapchat Case Study

    Authors: Qixiang Fang, Zhihan Zhou, Francesco Barbieri, Yozen Liu, Leonardo Neves, Dong Nguyen, Daniel L. Oberski, Maarten W. Bos, Ron Dotsch

    Abstract: Learning general-purpose user representations based on user behavioral logs is an increasingly popular user modeling approach. It benefits from easily available, privacy-friendly yet expressive data, and does not require extensive re-tuning of the upstream user model for different downstream tasks. While this approach has shown promise in search engines and e-commerce applications, its fit for ins… ▽ More

    Submitted 25 July, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: SIGIR 2024

  9. arXiv:2310.14757  [pdf, other

    cs.CL

    SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research

    Authors: Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Leonardo Neves, Kiamehr Rezaee, Luis Espinosa-Anke, Jiaxin Pei, Jose Camacho-Collados

    Abstract: Despite its relevance, the maturity of NLP for social media pales in comparison with general-purpose models, metrics and benchmarks. This fragmented landscape makes it hard for the community to know, for instance, given a task, which is the best performing model and how it compares with others. To alleviate this issue, we introduce a unified benchmark for NLP evaluation in social media, SuperTweet… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  10. arXiv:2309.08999  [pdf, other

    cs.CL

    Context-aware Adversarial Attack on Named Entity Recognition

    Authors: Shuguang Chen, Leonardo Neves, Thamar Solorio

    Abstract: In recent years, large pre-trained language models (PLMs) have achieved remarkable performance on many natural language processing benchmarks. Despite their success, prior studies have shown that PLMs are vulnerable to attacks from adversarial examples. In this work, we focus on the named entity recognition task and study context-aware adversarial attack methods to examine the model's robustness.… ▽ More

    Submitted 2 February, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: Accepted to W-NUT at EACL 2024

  11. arXiv:2308.02142  [pdf, other

    cs.CL cs.SI

    Tweet Insights: A Visualization Platform to Extract Temporal Insights from Twitter

    Authors: Daniel Loureiro, Kiamehr Rezaee, Talayeh Riahi, Francesco Barbieri, Leonardo Neves, Luis Espinosa Anke, Jose Camacho-Collados

    Abstract: This paper introduces a large collection of time series data derived from Twitter, postprocessed using word embedding techniques, as well as specialized fine-tuned language models. This data comprises the past five years and captures changes in n-gram frequency, similarity, sentiment and topic distribution. The interface built on top of this data enables temporal analysis for detecting and charact… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Demo paper. Visualization platform available at https://tweetnlp.org/insights

  12. arXiv:2210.07916  [pdf, other

    cs.CL

    Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition

    Authors: Shuguang Chen, Leonardo Neves, Thamar Solorio

    Abstract: In this work, we take the named entity recognition task in the English language as a case study and explore style transfer as a data augmentation method to increase the size and diversity of training data in low-resource scenarios. We propose a new method to effectively transform the text from a high-resource domain to a low-resource domain by changing its style-related attributes to generate synt… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: To appear at EMNLP 2022 main conference

  13. arXiv:2210.03797  [pdf, other

    cs.CL

    Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts

    Authors: Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados

    Abstract: Recent progress in language model pre-training has led to important improvements in Named Entity Recognition (NER). Nonetheless, this progress has been mainly tested in well-formatted documents such as news, Wikipedia, or scientific articles. In social media the landscape is different, in which it adds another layer of complexity due to its noisy and dynamic nature. In this paper, we focus on NER… ▽ More

    Submitted 15 November, 2022; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: AACL 2022 main conference

  14. arXiv:2210.01108  [pdf, other

    cs.CL cs.CY cs.LG

    SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis

    Authors: Jiaxin Pei, Vítor Silva, Maarten Bos, Yozon Liu, Leonardo Neves, David Jurgens, Francesco Barbieri

    Abstract: We propose MINT, a new Multilingual INTimacy analysis dataset covering 13,372 tweets in 10 languages including English, French, Spanish, Italian, Portuguese, Korean, Dutch, Chinese, Hindi, and Arabic. We benchmarked a list of popular multilingual pre-trained language models. The dataset is released along with the SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis (https://sites.google.com/u… ▽ More

    Submitted 3 February, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis

  15. arXiv:2209.09824  [pdf, other

    cs.CL

    Twitter Topic Classification

    Authors: Dimosthenis Antypas, Asahi Ushio, Jose Camacho-Collados, Leonardo Neves, Vítor Silva, Francesco Barbieri

    Abstract: Social media platforms host discussions about a wide variety of topics that arise everyday. Making sense of all the content and organising it into categories is an arduous task. A common way to deal with this issue is relying on topic modeling, but topics discovered using this technique are difficult to interpret and can differ from corpus to corpus. In this paper, we present a new task based on t… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted at COLING 2022

  16. arXiv:2209.07216  [pdf, other

    cs.CL

    TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media

    Authors: Daniel Loureiro, Aminette D'Souza, Areej Nasser Muhajab, Isabella A. White, Gabriel Wong, Luis Espinosa Anke, Leonardo Neves, Francesco Barbieri, Jose Camacho-Collados

    Abstract: Language evolves over time, and word meaning changes accordingly. This is especially true in social media, since its dynamic nature leads to faster semantic shifts, making it challenging for NLP models to deal with new content and trends. However, the number of datasets and models that specifically address the dynamic nature of these social platforms is scarce. To bridge this gap, we present Tempo… ▽ More

    Submitted 16 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022. Used to create the TempoWiC Shared Task for EvoNLP

  17. arXiv:2206.14774  [pdf, other

    cs.CL

    TweetNLP: Cutting-Edge Natural Language Processing for Social Media

    Authors: Jose Camacho-Collados, Kiamehr Rezaee, Talayeh Riahi, Asahi Ushio, Daniel Loureiro, Dimosthenis Antypas, Joanne Boisson, Luis Espinosa-Anke, Fangyu Liu, Eugenio Martínez-Cámara, Gonzalo Medina, Thomas Buhrmann, Leonardo Neves, Francesco Barbieri

    Abstract: In this paper we present TweetNLP, an integrated platform for Natural Language Processing (NLP) in social media. TweetNLP supports a diverse set of NLP tasks, including generic focus areas such as sentiment analysis and named entity recognition, as well as social media-specific tasks such as emoji prediction and offensive language identification. Task-specific systems are powered by reasonably-siz… ▽ More

    Submitted 25 October, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: EMNLP 2022 Demo paper. TweetNLP: https://tweetnlp.org/

  18. arXiv:2202.03829  [pdf, other

    cs.CL cs.AI

    TimeLMs: Diachronic Language Models from Twitter

    Authors: Daniel Loureiro, Francesco Barbieri, Leonardo Neves, Luis Espinosa Anke, Jose Camacho-Collados

    Abstract: Despite its importance, the time variable has been largely neglected in the NLP and language model literature. In this paper, we present TimeLMs, a set of language models specialized on diachronic Twitter data. We show that a continual learning strategy contributes to enhancing Twitter-based language models' capacity to deal with future and out-of-distribution tweets, while making them competitive… ▽ More

    Submitted 1 April, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted to ACL 2022 (Demo Track) - https://github.com/cardiffnlp/timelms

  19. arXiv:2109.01758  [pdf, other

    cs.CL

    Data Augmentation for Cross-Domain Named Entity Recognition

    Authors: Shuguang Chen, Gustavo Aguilar, Leonardo Neves, Thamar Solorio

    Abstract: Current work in named entity recognition (NER) shows that data augmentation techniques can produce more robust models. However, most existing techniques focus on augmenting in-domain data in low-resource scenarios where annotated data is quite limited. In contrast, we study cross-domain data augmentation for the NER task. We investigate the possibility of leveraging data from high-resource domains… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: To appear at EMNLP 2021 main conference

  20. arXiv:2104.09742  [pdf, other

    cs.CL

    Mitigating Temporal-Drift: A Simple Approach to Keep NER Models Crisp

    Authors: Shuguang Chen, Leonardo Neves, Thamar Solorio

    Abstract: Performance of neural models for named entity recognition degrades over time, becoming stale. This degradation is due to temporal drift, the change in our target variables' statistical properties over time. This issue is especially problematic for social media data, where topics change rapidly. In order to mitigate the problem, data annotation and retraining of models is common. Despite its useful… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted to SocialNLP at NAACL 2021

  21. arXiv:2011.01196  [pdf, other

    cs.CL

    The Devil is in the Details: Evaluating Limitations of Transformer-based Methods for Granular Tasks

    Authors: Brihi Joshi, Neil Shah, Francesco Barbieri, Leonardo Neves

    Abstract: Contextual embeddings derived from transformer-based neural language models have shown state-of-the-art performance for various tasks such as question answering, sentiment analysis, and textual similarity in recent years. Extensive work shows how accurately such models can represent abstract, semantic information present in text. In this expository work, we explore a tangent direction and analyze… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Accepted at COLING 2020. Code available at https://github.com/brihijoshi/granular-similarity-COLING-2020

  22. arXiv:2010.12864  [pdf, other

    cs.CL stat.ML

    On Transferability of Bias Mitigation Effects in Language Model Fine-Tuning

    Authors: Xisen Jin, Francesco Barbieri, Brendan Kennedy, Aida Mostafazadeh Davani, Leonardo Neves, Xiang Ren

    Abstract: Fine-tuned language models have been shown to exhibit biases against protected groups in a host of modeling tasks such as text classification and coreference resolution. Previous works focus on detecting these biases, reducing bias in data representations, and using auxiliary training objectives to mitigate bias during fine-tuning. Although these techniques achieve bias reduction for the task and… ▽ More

    Submitted 11 April, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: 14 pages; Accepted at NAACL 2021

  23. arXiv:2010.12712  [pdf, other

    cs.CL

    Can images help recognize entities? A study of the role of images for Multimodal NER

    Authors: Shuguang Chen, Gustavo Aguilar, Leonardo Neves, Thamar Solorio

    Abstract: Multimodal named entity recognition (MNER) requires to bridge the gap between language understanding and visual context. While many multimodal neural techniques have been proposed to incorporate images into the MNER task, the model's ability to leverage multimodal interactions remains poorly understood. In this work, we conduct in-depth analyses of existing multimodal fusion techniques from differ… ▽ More

    Submitted 19 September, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted to W-NUT 2021 at EMNLP

  24. arXiv:2010.12421  [pdf, other

    cs.CL cs.SI

    TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification

    Authors: Francesco Barbieri, Jose Camacho-Collados, Leonardo Neves, Luis Espinosa-Anke

    Abstract: The experimental landscape in natural language processing for social media is too fragmented. Each year, new shared tasks and datasets are proposed, ranging from classics like sentiment analysis to irony detection or emoji prediction. Therefore, it is unclear what the current state of the art is, as there is no standardized evaluation protocol, neither a strong set of baselines trained on such dom… ▽ More

    Submitted 26 October, 2020; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020. TweetEval benchmark available at https://github.com/cardiffnlp/tweeteval

  25. arXiv:2006.06830  [pdf, other

    cs.LG stat.ML

    Data Augmentation for Graph Neural Networks

    Authors: Tong Zhao, Yozen Liu, Leonardo Neves, Oliver Woodford, Meng Jiang, Neil Shah

    Abstract: Data augmentation has been widely used to improve generalizability of machine learning models. However, comparatively little work studies data augmentation for graphs. This is largely due to the complex, non-Euclidean structure of graphs, which limits possible manipulation operations. Augmentation operations commonly used in vision and language have no analogs for graphs. Our work studies graph da… ▽ More

    Submitted 2 December, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: AAAI 2021. This complete version contains the Appendix

  26. arXiv:2004.07499  [pdf, other

    cs.CL cs.AI cs.LG

    LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation

    Authors: Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Jamin Chen, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves, Xiang Ren

    Abstract: Successfully training a deep neural network demands a huge corpus of labeled data. However, each label only provides limited information to learn from and collecting the requisite number of labels involves massive human effort. In this work, we introduce LEAN-LIFE, a web-based, Label-Efficient AnnotatioN framework for sequence labeling and classification tasks, with an easy-to-use UI that not only… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted to the ACL 2020 (demo). The first two authors contributed equally. Project page: http://inklab.usc.edu/leanlife/

  27. arXiv:1911.01352  [pdf, other

    cs.CL

    Learning from Explanations with Neural Execution Tree

    Authors: Ziqi Wang, Yujia Qin, Wenxuan Zhou, Jun Yan, Qinyuan Ye, Leonardo Neves, Zhiyuan Liu, Xiang Ren

    Abstract: While deep neural networks have achieved impressive performance on a range of NLP tasks, these data-hungry models heavily rely on labeled data, which restricts their applications in scenarios where data annotation is expensive. Natural language (NL) explanations have been demonstrated very useful additional supervision, which can provide sufficient domain knowledge for generating more labeled data… ▽ More

    Submitted 14 February, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

    Comments: 18 pages, 8 figures, 12 tables. Published as a conference paper at ICLR 2020

    ACM Class: I.2.7

  28. arXiv:1909.02177  [pdf, other

    cs.CL

    NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction

    Authors: Wenxuan Zhou, Hongtao Lin, Bill Yuchen Lin, Ziqi Wang, Junyi Du, Leonardo Neves, Xiang Ren

    Abstract: Deep neural models for relation extraction tend to be less reliable when perfectly labeled data is limited, despite their success in label-sufficient scenarios. Instead of seeking more instance-level labels from human annotators, here we propose to annotate frequent surface patterns to form labeling rules. These rules can be automatically mined from large text corpora and generalized via a soft ru… ▽ More

    Submitted 15 January, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted by WWW2020. Code available at https://github.com/INK-USC/NERO

  29. arXiv:1903.12431  [pdf, other

    cs.CL

    Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering

    Authors: Lahari Poddar, Leonardo Neves, William Brendel, Luis Marujo, Sergey Tulyakov, Pradeep Karuturi

    Abstract: Tracking user reported bugs requires considerable engineering effort in going through many repetitive reports and assigning them to the correct teams. This paper proposes a neural architecture that can jointly (1) detect if two bug reports are duplicates, and (2) aggregate them into latent topics. Leveraging the assumption that learning the topic of a bug is a sub-task for detecting duplicates, we… ▽ More

    Submitted 3 April, 2019; v1 submitted 29 March, 2019; originally announced March 2019.

    Comments: Accepted for publication in NAACL 2019

  30. arXiv:1802.07862  [pdf, other

    cs.CL

    Multimodal Named Entity Recognition for Short Social Media Posts

    Authors: Seungwhan Moon, Leonardo Neves, Vitor Carvalho

    Abstract: We introduce a new task called Multimodal Named Entity Recognition (MNER) for noisy user-generated data such as tweets or Snapchat captions, which comprise short text with accompanying images. These social media posts often come in inconsistent or incomplete syntax and lexical notations with very limited surrounding textual contexts, bringing significant challenges for NER. To this end, we create… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.

  31. arXiv:1712.00489  [pdf, other

    cs.CL cs.AI cs.CV cs.LG eess.AS

    Visual Features for Context-Aware Speech Recognition

    Authors: Abhinav Gupta, Yajie Miao, Leonardo Neves, Florian Metze

    Abstract: Automatic transcriptions of consumer-generated multi-media content such as "Youtube" videos still exhibit high word error rates. Such data typically occupies a very broad domain, has been recorded in challenging conditions, with cheap hardware and a focus on the visual modality, and may have been post-processed or edited. In this paper, we extend our earlier work on adapting the acoustic model of… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: 5 pages and 3 figures

    Journal ref: IEEE Xplore (ICASSP) (2017) 5020-5024