Skip to main content

Showing 1–50 of 75 results for author: Neves, L

.
  1. arXiv:2506.05826  [pdf, ps, other

    cs.LG

    Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning

    Authors: Ngoc Bui, Menglin Yang, Runjin Chen, Leonardo Neves, Mingxuan Ju, Rex Ying, Neil Shah, Tong Zhao

    Abstract: Backward compatible representation learning enables updated models to integrate seamlessly with existing ones, avoiding to reprocess stored data. Despite recent advances, existing compatibility approaches in Euclidean space neglect the uncertainty in the old embedding model and force the new model to reconstruct outdated representations regardless of their quality, thereby hindering the learning p… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2505.21811  [pdf, ps, other

    cs.IR cs.AI

    Revisiting Self-attention for Cross-domain Sequential Recommendation

    Authors: Clark Mingxuan Ju, Leonardo Neves, Bhuvesh Kumar, Liam Collins, Tong Zhao, Yuwei Qiu, Qing Dou, Sohail Nizam, Sen Yang, Neil Shah

    Abstract: Sequential recommendation is a popular paradigm in modern recommender systems. In particular, one challenging problem in this space is cross-domain sequential recommendation (CDSR), which aims to predict future behaviors given user interactions across multiple domains. Existing CDSR frameworks are mostly built on the self-attention transformer and seek to improve by explicitly injecting additional… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Accepted to KDD'25

  3. arXiv:2504.21838  [pdf, ps, other

    cs.IR

    Learning Universal User Representations Leveraging Cross-domain User Intent at Snapchat

    Authors: Clark Mingxuan Ju, Leonardo Neves, Bhuvesh Kumar, Liam Collins, Tong Zhao, Yuwei Qiu, Qing Dou, Yang Zhou, Sohail Nizam, Rengim Ozturk, Yvette Liu, Sen Yang, Manish Malik, Neil Shah

    Abstract: The development of powerful user representations is a key factor in the success of recommender systems (RecSys). Online platforms employ a range of RecSys techniques to personalize user experience across diverse in-app surfaces. User representations are often learned individually through user's historical interactions within each surface and user representations across different surfaces can be sh… ▽ More

    Submitted 9 June, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

    Comments: Accepted to the industrial track of SIGIR'25

  4. arXiv:2504.18378  [pdf, ps, other

    cond-mat.mtrl-sci

    Ab initio modeling of TWIP and TRIP effects in $β$-Ti alloys

    Authors: David Holec, Johann Grillitsch, Jose L. Neves, David Obersteiner, Thomas Klein

    Abstract: Transformations in bcc-$β$, hcp-$α$, and the $ω$ phases of Ti alloys are studied using Density Functional Theory for pure Ti and Ti alloyed with Al, Si, V, Cr, Fe, Cu, Nb, Mo, and Sn. The $β$-stabilization caused by alloying Si, Fe, Cr, and Mo was observed, but the most stable phase appears between the $β$ and the $α$ phases, corresponding to the martensitic $α''$ phase. Next, the… ▽ More

    Submitted 6 July, 2025; v1 submitted 25 April, 2025; originally announced April 2025.

    Comments: 26 pages, 8 figures

  5. arXiv:2504.14073  [pdf, other

    quant-ph

    Ptychographic estimation of qudit states encoded in the angular position and orbital angular momentum of single photons

    Authors: A. M. da Costa, L. Neves

    Abstract: Ptychography is a computational imaging technique mainly used in optical and electron microscopy. Its quantum analogue was recently introduced as a simple method for estimating unknown pure quantum states through projections onto partially overlapping subspaces, each one followed by a projective measurement in the Fourier basis. In the end, an iterative algorithm estimates the state from the colle… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: 11 pages, 10 figures. Published version

    Journal ref: Journal of the Optical Society of America B 42, 1013 (2025)

  6. Coherence based on positive operator-valued measures for standard and concatenated quantum state discrimination with inconclusive results

    Authors: L. F. Melo, O. Jiménez, L. Neves

    Abstract: The optimal measurement that discriminates nonorthogonal quantum states with fixed rates of inconclusive outcomes (FRIO) can be decomposed into an assisted separation of the inputs, yielding conclusive and inconclusive outputs, followed by a minimum-error (ME) measurement for the conclusive ones (standard FRIO) or both ones (concatenated FRIO). The implementation of these measurements is underpinn… ▽ More

    Submitted 31 January, 2025; originally announced February 2025.

    Comments: 12 pages, 6 figures. Published version

    Journal ref: Physical Review A 111, 012403 (2025)

  7. arXiv:2412.17245  [pdf, other

    cs.IR cs.SI

    GraphHash: Graph Clustering Enables Parameter Efficiency in Recommender Systems

    Authors: Xinyi Wu, Donald Loveland, Runjin Chen, Yozen Liu, Xin Chen, Leonardo Neves, Ali Jadbabaie, Clark Mingxuan Ju, Neil Shah, Tong Zhao

    Abstract: Deep recommender systems rely heavily on large embedding tables to handle high-cardinality categorical features such as user/item identifiers, and face significant memory constraints at scale. To tackle this challenge, hashing techniques are often employed to map multiple entities to the same embedding and thus reduce the size of the embedding tables. Concurrently, graph-based collaborative signal… ▽ More

    Submitted 8 February, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

    Comments: ACM Web Conference (WWW) 2025, Oral

  8. arXiv:2412.17171  [pdf, other

    cs.LG cs.IR

    Enhancing Item Tokenization for Generative Recommendation through Self-Improvement

    Authors: Runjin Chen, Mingxuan Ju, Ngoc Bui, Dimosthenis Antypas, Stanley Cai, Xiaopeng Wu, Leonardo Neves, Zhangyang Wang, Neil Shah, Tong Zhao

    Abstract: Generative recommendation systems, driven by large language models (LLMs), present an innovative approach to predicting user preferences by modeling items as token sequences and generating recommendations in a generative manner. A critical challenge in this approach is the effective tokenization of items, ensuring that they are represented in a form compatible with LLMs. Current item tokenization… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

  9. Ptychographic estimation of pure multiqubit states in a quantum device

    Authors: Warley M. S. Alves, Leonardo Neves

    Abstract: Quantum ptychography is a method for estimating an unknown pure quantum state by subjecting it to overlapping projections, each one followed by a projective measurement on a single prescribed basis. Here, we present a comprehensive study of this method applied for estimating $n$-qubit states in a circuit-based quantum computer, including numerical simulations and experiments carried out on an IBM… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 15 pages, 10 figures, 5 tables

    Journal ref: APL Quantum 1, 046115 (2024)

  10. Experimental optimal discrimination of $N$ states of a qubit with fixed rates of inconclusive outcomes

    Authors: L. F. Melo, M. A. Solís-Prosser, O. Jiménez, A. Delgado, L. Neves

    Abstract: In a general optimized measurement scheme for discriminating between nonorthogonal quantum states, the error rate is minimized under the constraint of a fixed rate of inconclusive outcomes (FRIO). This so-called optimal FRIO measurement encompasses the standard and well known minimum-error and optimal unambiguous (or maximum-confidence) discrimination strategies as particular cases. Here, we exper… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 12 pages, 6 figures. Published version

    Journal ref: Physical Review Research 5, 043149 (2023)

  11. arXiv:2411.10142  [pdf, other

    cs.CY

    First Steps towards K-12 Computer Science Education in Portugal -- Experience Report

    Authors: Fernando Luis Neves, Jose Nuno Oliveira

    Abstract: Computer scientists Jeannette Wing and Simon Peyton Jones have catalyzed a pivotal discussion on the need to introduce computing in K-12 mandatory education. In Wing's own words, computing 'represents a universally applicable attitude and skill set everyone, not just computer scientists, would be eager to learn and use.'' The crux of this educational endeavor lies in its execution. This paper repo… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

    ACM Class: K.3.1; K.3.2

  12. arXiv:2408.04111  [pdf, ps, other

    quant-ph

    Energy additivity as a requirement for universal quantum thermodynamical frameworks

    Authors: Luis Rodrigo Neves, Frederico Brito

    Abstract: The quest to develop a general framework for thermodynamics, suitable for the regime of strong coupling and correlations between subsystems of an autonomous quantum "universe," has entailed diverging definitions for basic quantities, including internal energy. While most approaches focus solely on the system of interest, we propose that a universal notion of internal energy should also account for… ▽ More

    Submitted 26 June, 2025; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: 13 pages, 1 figure. [V2] Sections II and III swapped, with corresponding adaptations, including abstract. Concept of EHR formalized as Definition 1. No-go result included (Proposition 1). Minor changes through the text. [V3] Introduction rewritten. Subsection IIC expanded, now Section III. References 45-50 added. Minor changes through the text, including abstract

  13. arXiv:2406.04106  [pdf, other

    cs.CL

    Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster

    Authors: Agostina Calabrese, Leonardo Neves, Neil Shah, Maarten W. Bos, Björn Ross, Mirella Lapata, Francesco Barbieri

    Abstract: Content moderators play a key role in keeping the conversation on social media healthy. While the high volume of content they need to judge represents a bottleneck to the moderation pipeline, no studies have explored how models could support them to make faster decisions. There is, by now, a vast body of research into detecting hate speech, sometimes explicitly motivated by a desire to help improv… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages, 14 figures, to be published at ACL 2024

  14. arXiv:2403.13344  [pdf, other

    cs.SI cs.AI cs.CL cs.HC cs.IR cs.LG

    USE: Dynamic User Modeling with Stateful Sequence Models

    Authors: Zhihan Zhou, Qixiang Fang, Leonardo Neves, Francesco Barbieri, Yozen Liu, Han Liu, Maarten W. Bos, Ron Dotsch

    Abstract: User embeddings play a crucial role in user engagement forecasting and personalized services. Recent advances in sequence modeling have sparked interest in learning user embeddings from behavioral data. Yet behavior-based user embedding learning faces the unique challenge of dynamic user modeling. As users continuously interact with the apps, user embeddings should be periodically updated to accou… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  15. arXiv:2403.13220  [pdf

    cs.SE

    Elevating Software Quality in Agile Environments: The Role of Testing Professionals in Unit Testing

    Authors: Lucas Neves, Oscar Campos, Robson Santos, Italo Santos, Cleyton Magalhaes, Ronnie de Souza Santos

    Abstract: Testing is an essential quality activity in the software development process. Usually, a software system is tested on several levels, starting with unit testing that checks the smallest parts of the code until acceptance testing, which is focused on the validations with the end-user. Historically, unit testing has been the domain of developers, who are responsible for ensuring the accuracy of thei… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  16. General-Purpose User Modeling with Behavioral Logs: A Snapchat Case Study

    Authors: Qixiang Fang, Zhihan Zhou, Francesco Barbieri, Yozen Liu, Leonardo Neves, Dong Nguyen, Daniel L. Oberski, Maarten W. Bos, Ron Dotsch

    Abstract: Learning general-purpose user representations based on user behavioral logs is an increasingly popular user modeling approach. It benefits from easily available, privacy-friendly yet expressive data, and does not require extensive re-tuning of the upstream user model for different downstream tasks. While this approach has shown promise in search engines and e-commerce applications, its fit for ins… ▽ More

    Submitted 25 July, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: SIGIR 2024

  17. arXiv:2310.14757  [pdf, other

    cs.CL

    SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research

    Authors: Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Leonardo Neves, Kiamehr Rezaee, Luis Espinosa-Anke, Jiaxin Pei, Jose Camacho-Collados

    Abstract: Despite its relevance, the maturity of NLP for social media pales in comparison with general-purpose models, metrics and benchmarks. This fragmented landscape makes it hard for the community to know, for instance, given a task, which is the best performing model and how it compares with others. To alleviate this issue, we introduce a unified benchmark for NLP evaluation in social media, SuperTweet… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  18. arXiv:2309.08999  [pdf, other

    cs.CL

    Context-aware Adversarial Attack on Named Entity Recognition

    Authors: Shuguang Chen, Leonardo Neves, Thamar Solorio

    Abstract: In recent years, large pre-trained language models (PLMs) have achieved remarkable performance on many natural language processing benchmarks. Despite their success, prior studies have shown that PLMs are vulnerable to attacks from adversarial examples. In this work, we focus on the named entity recognition task and study context-aware adversarial attack methods to examine the model's robustness.… ▽ More

    Submitted 2 February, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: Accepted to W-NUT at EACL 2024

  19. arXiv:2308.02142  [pdf, other

    cs.CL cs.SI

    Tweet Insights: A Visualization Platform to Extract Temporal Insights from Twitter

    Authors: Daniel Loureiro, Kiamehr Rezaee, Talayeh Riahi, Francesco Barbieri, Leonardo Neves, Luis Espinosa Anke, Jose Camacho-Collados

    Abstract: This paper introduces a large collection of time series data derived from Twitter, postprocessed using word embedding techniques, as well as specialized fine-tuned language models. This data comprises the past five years and captures changes in n-gram frequency, similarity, sentiment and topic distribution. The interface built on top of this data enables temporal analysis for detecting and charact… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: Demo paper. Visualization platform available at https://tweetnlp.org/insights

  20. arXiv:2210.07916  [pdf, other

    cs.CL

    Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition

    Authors: Shuguang Chen, Leonardo Neves, Thamar Solorio

    Abstract: In this work, we take the named entity recognition task in the English language as a case study and explore style transfer as a data augmentation method to increase the size and diversity of training data in low-resource scenarios. We propose a new method to effectively transform the text from a high-resource domain to a low-resource domain by changing its style-related attributes to generate synt… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: To appear at EMNLP 2022 main conference

  21. arXiv:2210.03797  [pdf, other

    cs.CL

    Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts

    Authors: Asahi Ushio, Leonardo Neves, Vitor Silva, Francesco Barbieri, Jose Camacho-Collados

    Abstract: Recent progress in language model pre-training has led to important improvements in Named Entity Recognition (NER). Nonetheless, this progress has been mainly tested in well-formatted documents such as news, Wikipedia, or scientific articles. In social media the landscape is different, in which it adds another layer of complexity due to its noisy and dynamic nature. In this paper, we focus on NER… ▽ More

    Submitted 15 November, 2022; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: AACL 2022 main conference

  22. arXiv:2210.01108  [pdf, other

    cs.CL cs.CY cs.LG

    SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis

    Authors: Jiaxin Pei, Vítor Silva, Maarten Bos, Yozon Liu, Leonardo Neves, David Jurgens, Francesco Barbieri

    Abstract: We propose MINT, a new Multilingual INTimacy analysis dataset covering 13,372 tweets in 10 languages including English, French, Spanish, Italian, Portuguese, Korean, Dutch, Chinese, Hindi, and Arabic. We benchmarked a list of popular multilingual pre-trained language models. The dataset is released along with the SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis (https://sites.google.com/u… ▽ More

    Submitted 3 February, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis

  23. arXiv:2209.09824  [pdf, other

    cs.CL

    Twitter Topic Classification

    Authors: Dimosthenis Antypas, Asahi Ushio, Jose Camacho-Collados, Leonardo Neves, Vítor Silva, Francesco Barbieri

    Abstract: Social media platforms host discussions about a wide variety of topics that arise everyday. Making sense of all the content and organising it into categories is an arduous task. A common way to deal with this issue is relying on topic modeling, but topics discovered using this technique are difficult to interpret and can differ from corpus to corpus. In this paper, we present a new task based on t… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted at COLING 2022

  24. arXiv:2209.07216  [pdf, other

    cs.CL

    TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media

    Authors: Daniel Loureiro, Aminette D'Souza, Areej Nasser Muhajab, Isabella A. White, Gabriel Wong, Luis Espinosa Anke, Leonardo Neves, Francesco Barbieri, Jose Camacho-Collados

    Abstract: Language evolves over time, and word meaning changes accordingly. This is especially true in social media, since its dynamic nature leads to faster semantic shifts, making it challenging for NLP models to deal with new content and trends. However, the number of datasets and models that specifically address the dynamic nature of these social platforms is scarce. To bridge this gap, we present Tempo… ▽ More

    Submitted 16 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022. Used to create the TempoWiC Shared Task for EvoNLP

  25. arXiv:2206.14774  [pdf, other

    cs.CL

    TweetNLP: Cutting-Edge Natural Language Processing for Social Media

    Authors: Jose Camacho-Collados, Kiamehr Rezaee, Talayeh Riahi, Asahi Ushio, Daniel Loureiro, Dimosthenis Antypas, Joanne Boisson, Luis Espinosa-Anke, Fangyu Liu, Eugenio Martínez-Cámara, Gonzalo Medina, Thomas Buhrmann, Leonardo Neves, Francesco Barbieri

    Abstract: In this paper we present TweetNLP, an integrated platform for Natural Language Processing (NLP) in social media. TweetNLP supports a diverse set of NLP tasks, including generic focus areas such as sentiment analysis and named entity recognition, as well as social media-specific tasks such as emoji prediction and offensive language identification. Task-specific systems are powered by reasonably-siz… ▽ More

    Submitted 25 October, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: EMNLP 2022 Demo paper. TweetNLP: https://tweetnlp.org/

  26. A constraint on local definitions of quantum internal energy

    Authors: Luis Rodrigo Torres Neves, Frederico Brito

    Abstract: Recent advances in quantum thermodynamics have been focusing on ever more elementary systems of interest, approaching the limit of a single qubit, with correlations, strong coupling and non-equilibrium environments coming into play. Under such scenarios, it is clear that fundamental physical quantities must be revisited. This article questions whether a universal definition of internal energy for… ▽ More

    Submitted 14 October, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 20 pages, 3 figures. Version 2: new references added; further discussions on the hypotheses and connection to other approaches included in Sections I and V. Version 3: bibliographic metadata included

    Journal ref: Phys. Rev. A 108, 042209 (2023)

  27. arXiv:2202.03829  [pdf, other

    cs.CL cs.AI

    TimeLMs: Diachronic Language Models from Twitter

    Authors: Daniel Loureiro, Francesco Barbieri, Leonardo Neves, Luis Espinosa Anke, Jose Camacho-Collados

    Abstract: Despite its importance, the time variable has been largely neglected in the NLP and language model literature. In this paper, we present TimeLMs, a set of language models specialized on diachronic Twitter data. We show that a continual learning strategy contributes to enhancing Twitter-based language models' capacity to deal with future and out-of-distribution tweets, while making them competitive… ▽ More

    Submitted 1 April, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted to ACL 2022 (Demo Track) - https://github.com/cardiffnlp/timelms

  28. Enhanced discrimination of high-dimensional quantum states by concatenated optimal measurement strategies

    Authors: M. A. Solís-Prosser, O. Jiménez, A. Delgado, L. Neves

    Abstract: The impossibility of deterministic and error-free discrimination among nonorthogonal quantum states lies at the core of quantum theory and constitutes a primitive for secure quantum communication. Demanding determinism leads to errors, while demanding certainty leads to some inconclusiveness. One of the most fundamental strategies developed for this task is the optimal unambiguous measurement. It… ▽ More

    Submitted 18 December, 2021; originally announced December 2021.

    Comments: 10 pages, 7 figures. Published version

    Journal ref: Quantum Science and Technology 7, 015017 (2022)

  29. arXiv:2109.01758  [pdf, other

    cs.CL

    Data Augmentation for Cross-Domain Named Entity Recognition

    Authors: Shuguang Chen, Gustavo Aguilar, Leonardo Neves, Thamar Solorio

    Abstract: Current work in named entity recognition (NER) shows that data augmentation techniques can produce more robust models. However, most existing techniques focus on augmenting in-domain data in low-resource scenarios where annotated data is quite limited. In contrast, we study cross-domain data augmentation for the NER task. We investigate the possibility of leveraging data from high-resource domains… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: To appear at EMNLP 2021 main conference

  30. arXiv:2104.09742  [pdf, other

    cs.CL

    Mitigating Temporal-Drift: A Simple Approach to Keep NER Models Crisp

    Authors: Shuguang Chen, Leonardo Neves, Thamar Solorio

    Abstract: Performance of neural models for named entity recognition degrades over time, becoming stale. This degradation is due to temporal drift, the change in our target variables' statistical properties over time. This issue is especially problematic for social media data, where topics change rapidly. In order to mitigate the problem, data annotation and retraining of models is common. Despite its useful… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted to SocialNLP at NAACL 2021

  31. arXiv:2011.01196  [pdf, other

    cs.CL

    The Devil is in the Details: Evaluating Limitations of Transformer-based Methods for Granular Tasks

    Authors: Brihi Joshi, Neil Shah, Francesco Barbieri, Leonardo Neves

    Abstract: Contextual embeddings derived from transformer-based neural language models have shown state-of-the-art performance for various tasks such as question answering, sentiment analysis, and textual similarity in recent years. Extensive work shows how accurately such models can represent abstract, semantic information present in text. In this expository work, we explore a tangent direction and analyze… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Accepted at COLING 2020. Code available at https://github.com/brihijoshi/granular-similarity-COLING-2020

  32. arXiv:2010.15349  [pdf, other

    quant-ph physics.optics

    Ptychographic reconstruction of pure quantum states

    Authors: M. F. Fernandes, M. A. Solís-Prosser, L. Neves

    Abstract: The quantum analogue of ptychography, a powerful coherent diffractive imaging technique, is a simple method for reconstructing $d$-dimensional pure states. It relies on measuring partially overlapping parts of the input state in a single orthonormal basis and feeding the outcomes to an iterative phase-retrieval algorithm for postprocessing. We provide a proof of concept demonstration of this metho… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: 4 pages, 3 figures. Published version

    Journal ref: Optics Letters 45, 6002 (2020)

  33. arXiv:2010.12864  [pdf, other

    cs.CL stat.ML

    On Transferability of Bias Mitigation Effects in Language Model Fine-Tuning

    Authors: Xisen Jin, Francesco Barbieri, Brendan Kennedy, Aida Mostafazadeh Davani, Leonardo Neves, Xiang Ren

    Abstract: Fine-tuned language models have been shown to exhibit biases against protected groups in a host of modeling tasks such as text classification and coreference resolution. Previous works focus on detecting these biases, reducing bias in data representations, and using auxiliary training objectives to mitigate bias during fine-tuning. Although these techniques achieve bias reduction for the task and… ▽ More

    Submitted 11 April, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: 14 pages; Accepted at NAACL 2021

  34. arXiv:2010.12712  [pdf, other

    cs.CL

    Can images help recognize entities? A study of the role of images for Multimodal NER

    Authors: Shuguang Chen, Gustavo Aguilar, Leonardo Neves, Thamar Solorio

    Abstract: Multimodal named entity recognition (MNER) requires to bridge the gap between language understanding and visual context. While many multimodal neural techniques have been proposed to incorporate images into the MNER task, the model's ability to leverage multimodal interactions remains poorly understood. In this work, we conduct in-depth analyses of existing multimodal fusion techniques from differ… ▽ More

    Submitted 19 September, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted to W-NUT 2021 at EMNLP

  35. arXiv:2010.12421  [pdf, other

    cs.CL cs.SI

    TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification

    Authors: Francesco Barbieri, Jose Camacho-Collados, Leonardo Neves, Luis Espinosa-Anke

    Abstract: The experimental landscape in natural language processing for social media is too fragmented. Each year, new shared tasks and datasets are proposed, ranging from classics like sentiment analysis to irony detection or emoji prediction. Therefore, it is unclear what the current state of the art is, as there is no standardized evaluation protocol, neither a strong set of baselines trained on such dom… ▽ More

    Submitted 26 October, 2020; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020. TweetEval benchmark available at https://github.com/cardiffnlp/tweeteval

  36. arXiv:2006.06830  [pdf, other

    cs.LG stat.ML

    Data Augmentation for Graph Neural Networks

    Authors: Tong Zhao, Yozen Liu, Leonardo Neves, Oliver Woodford, Meng Jiang, Neil Shah

    Abstract: Data augmentation has been widely used to improve generalizability of machine learning models. However, comparatively little work studies data augmentation for graphs. This is largely due to the complex, non-Euclidean structure of graphs, which limits possible manipulation operations. Augmentation operations commonly used in vision and language have no analogs for graphs. Our work studies graph da… ▽ More

    Submitted 2 December, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: AAAI 2021. This complete version contains the Appendix

  37. A simple individual-based population growth model with limited resources

    Authors: Luis R. T. Neves, Leonardo Paulo Maia

    Abstract: We address a novel approach for stochastic individual-based modelling of a single species population. Individuals are distinguished by their remaining lifetimes, which are regulated by the interplay between the inexorable running of time and the individual's nourishment history. A food-limited environment induces intraspecific competition and henceforth the carrying capacity of the medium may be f… ▽ More

    Submitted 30 November, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Comments: 11 pages, 7 figures

  38. arXiv:2004.07499  [pdf, other

    cs.CL cs.AI cs.LG

    LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation

    Authors: Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Jamin Chen, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves, Xiang Ren

    Abstract: Successfully training a deep neural network demands a huge corpus of labeled data. However, each label only provides limited information to learn from and collecting the requisite number of labels involves massive human effort. In this work, we introduce LEAN-LIFE, a web-based, Label-Efficient AnnotatioN framework for sequence labeling and classification tasks, with an easy-to-use UI that not only… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted to the ACL 2020 (demo). The first two authors contributed equally. Project page: http://inklab.usc.edu/leanlife/

  39. arXiv:2002.04053  [pdf, other

    quant-ph

    3D compact photonic circuits for realizing quantum state tomography of qudits in any finite dimension

    Authors: Wilder Cardoso, Davi Barros, Leonardo Neves, Sebastião Pádua

    Abstract: In this work, we propose three-dimensional photonic circuit designs that guarantee a considerable reduction in the complexity of circuits for the purpose of performing quantum state tomography of N-dimensional path qudits. The POVM (Positive Operator-Valued Measure) chosen in this work ensures that, for odd dimensions, such process is minimal. Our proposal consists of organizing the waveguides tha… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 9+ pages, 11 figures, comments are welcome!

  40. Photonic Discrete-time Quantum Walks and Applications

    Authors: Leonardo Neves, Graciana Puentes

    Abstract: We present a review of photonic implementations of discrete-time quantum walks (DTQW) in the spatial and temporal domains, based on spatial- and time-multiplexing techniques, respectively. Additionally, we propose a detailed novel scheme for photonic DTQW, using transverse spatial modes of single photons and programmable spatial light modulators (SLM) to manipulate them. Unlike all previous mode-m… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: 17 pages, 5 figures. arXiv admin note: text overlap with arXiv:1609.07572

    Journal ref: Entropy 2018, 20(10), 731

  41. arXiv:1911.01352  [pdf, other

    cs.CL

    Learning from Explanations with Neural Execution Tree

    Authors: Ziqi Wang, Yujia Qin, Wenxuan Zhou, Jun Yan, Qinyuan Ye, Leonardo Neves, Zhiyuan Liu, Xiang Ren

    Abstract: While deep neural networks have achieved impressive performance on a range of NLP tasks, these data-hungry models heavily rely on labeled data, which restricts their applications in scenarios where data annotation is expensive. Natural language (NL) explanations have been demonstrated very useful additional supervision, which can provide sufficient domain knowledge for generating more labeled data… ▽ More

    Submitted 14 February, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

    Comments: 18 pages, 8 figures, 12 tables. Published as a conference paper at ICLR 2020

    ACM Class: I.2.7

  42. arXiv:1909.02177  [pdf, other

    cs.CL

    NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction

    Authors: Wenxuan Zhou, Hongtao Lin, Bill Yuchen Lin, Ziqi Wang, Junyi Du, Leonardo Neves, Xiang Ren

    Abstract: Deep neural models for relation extraction tend to be less reliable when perfectly labeled data is limited, despite their success in label-sufficient scenarios. Instead of seeking more instance-level labels from human annotators, here we propose to annotate frequent surface patterns to form labeling rules. These rules can be automatically mined from large text corpora and generalized via a soft ru… ▽ More

    Submitted 15 January, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted by WWW2020. Code available at https://github.com/INK-USC/NERO

  43. arXiv:1903.12431  [pdf, other

    cs.CL

    Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering

    Authors: Lahari Poddar, Leonardo Neves, William Brendel, Luis Marujo, Sergey Tulyakov, Pradeep Karuturi

    Abstract: Tracking user reported bugs requires considerable engineering effort in going through many repetitive reports and assigning them to the correct teams. This paper proposes a neural architecture that can jointly (1) detect if two bug reports are duplicates, and (2) aggregate them into latent topics. Leveraging the assumption that learning the topic of a bug is a sub-task for detecting duplicates, we… ▽ More

    Submitted 3 April, 2019; v1 submitted 29 March, 2019; originally announced March 2019.

    Comments: Accepted for publication in NAACL 2019

  44. Ptychography of pure quantum states

    Authors: Mário Foganholi Fernandes, Leonardo Neves

    Abstract: Ptychography is an imaging technique in which a localized illumination scans overlapping regions of an object and generates a set of diffraction intensities used to computationally reconstruct its complex-valued transmission function. We propose a quantum analogue of this technique designed to reconstruct $d$-dimensional pure states. A set of $n$ rank-$r$ projectors "scans" overlapping parts of an… ▽ More

    Submitted 16 December, 2019; v1 submitted 29 December, 2018; originally announced December 2018.

    Comments: 11 pages, 6 figures. Published version

    Journal ref: Scientific Reports 9, 16066 (2019)

  45. arXiv:1802.07862  [pdf, other

    cs.CL

    Multimodal Named Entity Recognition for Short Social Media Posts

    Authors: Seungwhan Moon, Leonardo Neves, Vitor Carvalho

    Abstract: We introduce a new task called Multimodal Named Entity Recognition (MNER) for noisy user-generated data such as tweets or Snapchat captions, which comprise short text with accompanying images. These social media posts often come in inconsistent or incomplete syntax and lexical notations with very limited surrounding textual contexts, bringing significant challenges for NER. To this end, we create… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.

  46. arXiv:1712.00489  [pdf, other

    cs.CL cs.AI cs.CV cs.LG eess.AS

    Visual Features for Context-Aware Speech Recognition

    Authors: Abhinav Gupta, Yajie Miao, Leonardo Neves, Florian Metze

    Abstract: Automatic transcriptions of consumer-generated multi-media content such as "Youtube" videos still exhibit high word error rates. Such data typically occupies a very broad domain, has been recorded in challenging conditions, with cheap hardware and a focus on the visual modality, and may have been post-processed or edited. In this paper, we extend our earlier work on adapting the acoustic model of… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: 5 pages and 3 figures

    Journal ref: IEEE Xplore (ICASSP) (2017) 5020-5024

  47. arXiv:1709.04761  [pdf, other

    physics.med-ph physics.app-ph

    A Novel Metamaterial-Inspired RF-coil for Preclinical Dual-Nuclei MRI

    Authors: A. Hurshkainen, A. Nikulin, E. Georget, B. Larrat, D. Berrahou, L. Neves, P. Sabouroux, S. Enoch, I. Melchakova, P. Belov, S. Glybovski, R. Abdeddaim

    Abstract: In this paper we propose, design and test a new dual-nuclei RF-coil inspired by wire metamaterial structures. The coil operates due to resonant excitation of hybridized eigenmodes in multimode flat periodic structures comprising several coupled thin metal strips. It was shown that the field distribution of the coil (i.e. penetration depth) can be controlled independently at two different Larmor fr… ▽ More

    Submitted 14 September, 2017; originally announced September 2017.

  48. arXiv:1708.09112  [pdf, ps, other

    math.AP

    Nonradial solutions for the Hénon equation close to the threshold

    Authors: Pablo Figueroa, Sérgio L. N. Neves

    Abstract: We consider the Hénon problem \begin{equation*} \left\{ \begin{array} - - Δu = |x|^α u^{\frac{N+2+2α}{N-2}-\varepsilon} & \ \ \text{in} \ B_1, \\ u > 0 & \ \ \text{in} \ B_1, \\ u=0 & \ \ \text{on} \ \partial B_1, \end{array} \right. \end{equation*} where $B_1$ is the unit ball in ${\mathbb R}^N$ and $N\geqslant 3$. For $\varepsilon > 0$ small enough, we use $α$ as a paramenter and prove the exist… ▽ More

    Submitted 29 September, 2017; v1 submitted 30 August, 2017; originally announced August 2017.

    Comments: 23 pages, 0 figures

    MSC Class: 35B32; 35J25; 35J60

  49. Proposal for Automated Operations for Single-Photon Multipath Qudits

    Authors: Roberto D. Baldijão, Gilberto F. Borges, Breno Marques, Miguel Solís-prosser, Leonardo Neves, Sebastião Pádua

    Abstract: We propose a method for implementing automated state transformations on single-photon multipath qudits encoded in a one-dimensional transverse spatial domain. It relies on transferring the encoding from this domain to the orthogonal one by applying a spatial phase modulation with diffraction gratings, merging all the initial propagation paths with a stable interferometric network, and filtering ou… ▽ More

    Submitted 31 March, 2017; originally announced March 2017.

    Comments: 13 pages, 12 figures

    Journal ref: Phys. Rev. A 96, 032329 (2017)

  50. arXiv:1703.02961  [pdf, other

    quant-ph physics.optics

    Experimental minimum-error quantum-state discrimination in high dimensions

    Authors: M. A. Solís-Prosser, M. F. Fernandes, O. Jiménez, A. Delgado, L. Neves

    Abstract: Quantum mechanics forbids perfect discrimination among nonorthogonal states through a single shot measurement. To optimize this task, many strategies were devised that later became fundamental tools for quantum information processing. Here, we address the pioneering minimum-error (ME) measurement and give the first experimental demonstration of its application for discriminating nonorthogonal stat… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: 13 pages, 13 figures

    Journal ref: Physical Review Letters 118, 100501 (2017)