Skip to main content

Showing 1–13 of 13 results for author: De Paula, M

Searching in archive cs. Search in all archives.
.
  1. The Effects of Demographic Instructions on LLM Personas

    Authors: Angel Felipe Magnossão de Paula, J. Shane Culpepper, Alistair Moffat, Sachin Pathiyan Cherumanal, Falk Scholer, Johanne Trippas

    Abstract: Social media platforms must filter sexist content in compliance with governmental regulations. Current machine learning approaches can reliably detect sexism based on standardized definitions, but often neglect the subjective nature of sexist language and fail to consider individual users' perspectives. To address this gap, we adopt a perspectivist approach, retaining diverse annotations rather th… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Accepted at SIGIR'25, Padua, Italy

  2. Walert: Putting Conversational Search Knowledge into Action by Building and Evaluating a Large Language Model-Powered Chatbot

    Authors: Sachin Pathiyan Cherumanal, Lin Tian, Futoon M. Abushaqra, Angel Felipe Magnossao de Paula, Kaixin Ji, Danula Hettiachchi, Johanne R. Trippas, Halil Ali, Falk Scholer, Damiano Spina

    Abstract: Creating and deploying customized applications is crucial for operational success and enriching user experiences in the rapidly evolving modern business world. A prominent facet of modern user experiences is the integration of chatbots or voice assistants. The rapid evolution of Large Language Models (LLMs) has provided a powerful tool to build conversational applications. We present Walert, a cus… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: Accepted at 2024 ACM SIGIR CHIIR

  3. arXiv:2307.03385  [pdf, other

    cs.CL cs.CY cs.LG

    AI-UPV at EXIST 2023 -- Sexism Characterization Using Large Language Models Under The Learning with Disagreements Regime

    Authors: Angel Felipe Magnossão de Paula, Giulia Rizzi, Elisabetta Fersini, Damiano Spina

    Abstract: With the increasing influence of social media platforms, it has become crucial to develop automated systems capable of detecting instances of sexism and other disrespectful and hateful behaviors to promote a more inclusive and respectful online environment. Nevertheless, these tasks are considerably challenging considering different hate categories and the author's intentions, especially under the… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 15 pages, 9 tables, 1 figures, conference

  4. arXiv:2307.03377  [pdf, ps, other

    cs.CL cs.LG

    Mitigating Negative Transfer with Task Awareness for Sexism, Hate Speech, and Toxic Language Detection

    Authors: Angel Felipe Magnossão de Paula, Paolo Rosso, Damiano Spina

    Abstract: This paper proposes a novelty approach to mitigate the negative transfer problem. In the field of machine learning, the common strategy is to apply the Single-Task Learning approach in order to train a supervised model to solve a specific task. Training a robust model requires a lot of data and a significant amount of computational resources, making this solution unfeasible in cases where data are… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 8 pages, 2 figures, 5 tables, IJCNN 2023 conference

  5. arXiv:2303.09823  [pdf, other

    cs.CL cs.AI cs.LG

    Transformers and Ensemble methods: A solution for Hate Speech Detection in Arabic languages

    Authors: Angel Felipe Magnossão de Paula, Imene Bensalem, Paolo Rosso, Wajdi Zaghouani

    Abstract: This paper describes our participation in the shared task of hate speech detection, which is one of the subtasks of the CERIST NLP Challenge 2022. Our experiments evaluate the performance of six transformer models and their combination using 2 ensemble approaches. The best results on the training set, in a five-fold cross validation scenario, were obtained by using the ensemble approach based on t… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: 7 pages, 3 tables

  6. arXiv:2112.06080  [pdf, other

    cs.IR cs.AI

    UPV at TREC Health Misinformation Track 2021 Ranking with SBERT and Quality Estimators

    Authors: Ipek Baris Schlicht, Angel Felipe Magnossão de Paula, Paolo Rosso

    Abstract: Health misinformation on search engines is a significant problem that could negatively affect individuals or public health. To mitigate the problem, TREC organizes a health misinformation track. This paper presents our submissions to this track. We use a BM25 and a domain-specific semantic search engine for retrieving initial documents. Later, we examine a health news schema for quality assessment… ▽ More

    Submitted 11 December, 2021; originally announced December 2021.

    Comments: 6 pages; presented at the TREC 2021

  7. arXiv:2111.04551  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Sexism Prediction in Spanish and English Tweets Using Monolingual and Multilingual BERT and Ensemble Models

    Authors: Angel Felipe Magnossão de Paula, Roberto Fray da Silva, Ipek Baris Schlicht

    Abstract: The popularity of social media has created problems such as hate speech and sexism. The identification and classification of sexism in social media are very relevant tasks, as they would allow building a healthier social environment. Nevertheless, these tasks are considerably challenging. This work proposes a system to use multilingual and monolingual BERT and data points translation and ensemble… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: 18 pages, presented at IberLEF: http://ceur-ws.org/Vol-2943/exist_paper2.pdf, the best scoring system at EXIST

  8. arXiv:2111.04530  [pdf, other

    cs.CL cs.CY cs.LG

    AI-UPV at IberLEF-2021 DETOXIS task: Toxicity Detection in Immigration-Related Web News Comments Using Transformers and Statistical Models

    Authors: Angel Felipe Magnossão de Paula, Ipek Baris Schlicht

    Abstract: This paper describes our participation in the DEtection of TOXicity in comments In Spanish (DETOXIS) shared task 2021 at the 3rd Workshop on Iberian Languages Evaluation Forum. The shared task is divided into two related classification tasks: (i) Task 1: toxicity detection and; (ii) Task 2: toxicity level detection. They focus on the xenophobic problem exacerbated by the spread of toxic comments p… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: 20 pages. Presented at IberLEF. See http://ceur-ws.org/Vol-2943/detoxis_paper2.pdf

  9. arXiv:2109.09233  [pdf, other

    cs.CL cs.AI cs.LG

    Unified and Multilingual Author Profiling for Detecting Haters

    Authors: Ipek Baris Schlicht, Angel Felipe Magnossão de Paula

    Abstract: This paper presents a unified user profiling framework to identify hate speech spreaders by processing their tweets regardless of the language. The framework encodes the tweets with sentence transformers and applies an attention mechanism to select important tweets for learning user profiles. Furthermore, the attention layer helps to explain why a user is a hate speech spreader by producing attent… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: 9 pages, 2 figures, see the original paper: http://ceur-ws.org/Vol-2936/paper-157.pdf

    Journal ref: Published at the CLEF 2021

  10. arXiv:2109.09232  [pdf, other

    cs.CL cs.LG

    UPV at CheckThat! 2021: Mitigating Cultural Differences for Identifying Multilingual Check-worthy Claims

    Authors: Ipek Baris Schlicht, Angel Felipe Magnossão de Paula, Paolo Rosso

    Abstract: Identifying check-worthy claims is often the first step of automated fact-checking systems. Tackling this task in a multilingual setting has been understudied. Encoding inputs with multilingual text representations could be one approach to solve the multilingual check-worthiness detection. However, this approach could suffer if cultural bias exists within the communities on determining what is che… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: 11 pages, 2 figures. Link to the original paper: http://ceur-ws.org/Vol-2936/paper-36.pdf

    ACM Class: I.7; J.4

    Journal ref: published at CLEF 2021

  11. arXiv:2107.13461  [pdf, other

    cs.RO cs.AI eess.SY q-bio.NC

    Marine Vehicles Localization Using Grid Cells for Path Integration

    Authors: Ignacio Carlucho, Manuel F. Bailey, Mariano De Paula, Corina Barbalata

    Abstract: Autonomous Underwater Vehicles (AUVs) are platforms used for research and exploration of marine environments. However, these types of vehicles face many challenges that hinder their widespread use in the industry. One of the main limitations is obtaining accurate position estimation, due to the lack of GPS signal underwater. This estimation is usually done with Kalman filters. However, new develop… ▽ More

    Submitted 9 August, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

  12. A reinforcement learning control approach for underwater manipulation under position and torque constraints

    Authors: Ignacio Carlucho, Mariano De Paula, Gerardo G. Acosta, Corina Barbalata

    Abstract: In marine operations underwater manipulators play a primordial role. However, due to uncertainties in the dynamic model and disturbances caused by the environment, low-level control methods require great capabilities to adapt to change. Furthermore, under position and torque constraints the requirements for the control system are greatly increased. Reinforcement learning is a data driven control t… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Journal ref: Global Oceans 2020: Singapore - U.S. Gulf Coast

  13. arXiv:1808.06044  [pdf, ps, other

    cs.CE

    Maximising Throughput in a Complex Coal Export System

    Authors: Mateus Rocha de Paula, Natashia Boland, Andreas Ernst, Alexandre Mendes, Martin Savelsbergh

    Abstract: The Port of Newcastle features three coal export terminals, operating primarily in cargo assembly mode, that share a rail network on their inbound side, and a channel on their outbound side. Maximising throughput at a single coal terminal, taking into account its layout, its equipment, and its operating policies, is already challenging, but maximising throughput of the Hunter Valley coal export sy… ▽ More

    Submitted 22 August, 2018; v1 submitted 18 August, 2018; originally announced August 2018.