Skip to main content

Showing 1–24 of 24 results for author: de Arriba-Pérez, F

Searching in archive cs. Search in all archives.
.
  1. Promoting Security and Trust on Social Networks: Explainable Cyberbullying Detection Using Large Language Models in a Stream-Based Machine Learning Framework

    Authors: Silvia García-Méndez, Francisco De Arriba-Pérez

    Abstract: Social media platforms enable instant and ubiquitous connectivity and are essential to social interaction and communication in our technological society. Apart from its advantages, these platforms have given rise to negative behaviors in the online community, the so-called cyberbullying. Despite the many works involving generative Artificial Intelligence (AI) in the literature lately, there remain… ▽ More

    Submitted 7 April, 2025; originally announced May 2025.

    Journal ref: In 11th International Conference on SNAMS (pp. 25-32). IEEE (2024)

  2. Identification and explanation of disinformation in wiki data streams

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, Fátima Leal, Benedita Malheiro, Juan C Burguillo

    Abstract: Social media platforms, increasingly used as news sources for varied data analytics, have transformed how information is generated and disseminated. However, the unverified nature of this content raises concerns about trustworthiness and accuracy, potentially negatively impacting readers' critical judgment due to disinformation. This work aims to contribute to the automatic data quality validation… ▽ More

    Submitted 3 February, 2025; originally announced March 2025.

    Comments: (2025) Integrated Computer-Aided Engineering

  3. Optimal word order for non-causal text generation with Large Language Models: the Spanish case

    Authors: Andrea Busto-Castiñeira, Silvia García-Méndez, Francisco de Arriba-Pérez, Francisco J. González-Castaño

    Abstract: Natural Language Generation (NLG) popularity has increased owing to the progress in Large Language Models (LLMs), with zero-shot inference capabilities. However, most neural systems utilize decoder-only causal (unidirectional) transformer models, which are effective for English but may reduce the richness of languages with less strict word order, subject omission, or different relative clause atta… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  4. arXiv:2412.17651  [pdf, other

    cs.AI

    Detecting anxiety and depression in dialogues: a multi-label and explainable approach

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez

    Abstract: Anxiety and depression are the most common mental health issues worldwide, affecting a non-negligible part of the population. Accordingly, stakeholders, including governments' health systems, are developing new strategies to promote early detection and prevention from a holistic perspective (i.e., addressing several disorders simultaneously). In this work, an entirely novel system for the multi-la… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

    Journal ref: de Arriba-Pérez, F., García-Méndez, S. (2024). Detecting anxiety and depression in dialogues: a multi-label and explainable approach. In Proceedings of the 3rd AIxIA Workshop on Artificial Intelligence For Healthcare (pp. 257-271)

  5. Explainable cognitive decline detection in free dialogues with a Machine Learning approach based on pre-trained Large Language Models

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, Javier Otero-Mosquera, Francisco J. González-Castaño

    Abstract: Cognitive and neurological impairments are very common, but only a small proportion of affected individuals are diagnosed and treated, partly because of the high costs associated with frequent screening. Detecting pre-illness stages and analyzing the progression of neurological disorders through effective and efficient intelligent systems can be beneficial for timely diagnosis and early interventi… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Journal ref: Applied Intelligence, 1-16 (2024)

  6. Leveraging Large Language Models through Natural Language Processing to provide interpretable Machine Learning predictions of mental deterioration in real time

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez

    Abstract: Based on official estimates, 50 million people worldwide are affected by dementia, and this number increases by 10 million new patients every year. Without a cure, clinical prognostication and early intervention represent the most effective ways to delay its progression. To this end, Artificial Intelligence and computational linguistics can be exploited for natural language analysis, personalized… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  7. Predictability and Causality in Spanish and English Natural Language Generation

    Authors: Andrea Busto-Castiñeira, Francisco J. González-Castaño, Silvia García-Méndez, Francisco de Arriba-Pérez

    Abstract: In recent years, the field of Natural Language Generation (NLG) has been boosted by the recent advances in deep learning technologies. Nonetheless, these new data-intensive methods introduce language-dependent disparities in NLG as the main training data sets are in English. Also, most neural NLG systems use decoder-only (causal) transformer language models, which work well for English, but were n… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Journal ref: Busto-Castiñeira, A., Castaño, F. J. G., García-Méndez, S., & De Arriba-Pérez, F. (2024). Predictability and Causality in Spanish and English Natural Language Generation. IEEE Access

  8. arXiv:2406.15038  [pdf, other

    cs.LG cs.AI cs.CL cs.SI

    Online detection and infographic explanation of spam reviews with data drift adaptation

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, Fátima Leal, Benedita Malheiro, J. C. Burguillo

    Abstract: Spam reviews are a pervasive problem on online platforms due to its significant impact on reputation. However, research into spam detection in data streams is scarce. Another concern lies in their need for transparency. Consequently, this paper addresses those problems by proposing an online solution for identifying and explaining spam reviews, incorporating data drift adaptation. It integrates (i… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Journal ref: Francisco de Arriba-Pérez, Silvia García-Méndez, Fátima Leal, Benedita Malheiro, Juan C. Burguillo, Online Detection and Infographic Explanation of Spam Reviews with Data Drift Adaptation, Informatica(2024), 1-25

  9. Toward data-driven research: preliminary study to predict surface roughness in material extrusion using previously published data with Machine Learning

    Authors: Fátima García-Martínez, Diego Carou, Francisco de Arriba-Pérez, Silvia García-Méndez

    Abstract: Material extrusion is one of the most commonly used approaches within the additive manufacturing processes available. Despite its popularity and related technical advancements, process reliability and quality assurance remain only partially solved. In particular, the surface roughness caused by this process is a key concern. To solve this constraint, experimental plans have been exploited to optim… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  10. Informatics & dairy industry coalition: AI trends and present challenges

    Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, María del Carmen Somoza-López

    Abstract: Artificial Intelligence (AI) can potentially transform the industry, enhancing the production process and minimizing manual, repetitive tasks. Accordingly, the synergy between high-performance computing and powerful mathematical models enables the application of sophisticated data analysis procedures like Machine Learning. However, challenges exist regarding effective, efficient, and flexible proc… ▽ More

    Submitted 19 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  11. arXiv:2406.12762  [pdf, other

    cs.LG cs.AI cs.HC

    Unsupervised explainable activity prediction in competitive Nordic Walking from experimental data

    Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, Francisco J. González-Castaño, Javier Vales-Alonso

    Abstract: Artificial Intelligence (AI) has found application in Human Activity Recognition (HAR) in competitive sports. To date, most Machine Learning (ML) approaches for HAR have relied on offline (batch) training, imposing higher computational and tagging burdens compared to online processing unsupervised approaches. Additionally, the decisions behind traditional ML predictors are opaque and require human… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  12. Automatic generation of insights from workers' actions in industrial workflows with explainable Machine Learning

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, Javier Otero-Mosquera, Francisco J. González-Castaño, Felipe Gil-Castiñeira

    Abstract: New technologies such as Machine Learning (ML) gave great potential for evaluating industry workflows and automatically generating key performance indicators (KPIs). However, despite established standards for measuring the efficiency of industrial machinery, there is no precise equivalent for workers' productivity, which would be highly desirable given the lack of a skilled workforce for the next… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: IEEE Industrial Electronics Magazine (2023)

  13. arXiv:2406.11924  [pdf, other

    cs.SI cs.AI cs.CL cs.LG

    Explainable assessment of financial experts' credibility by classifying social media forecasts and checking the predictions with actual market data

    Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, Jaime González-Gonzáleza, Francisco J. González-Castaño

    Abstract: Social media include diverse interaction metrics related to user popularity, the most evident example being the number of user followers. The latter has raised concerns about the credibility of the posts by the most popular creators. However, most existing approaches to assess credibility in social media strictly consider this problem a binary classification, often based on a priori information, w… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Journal ref: Expert Systems with Applications. 124515 (2024)

  14. arXiv:2405.18542  [pdf, other

    cs.AI cs.CL cs.HC cs.LG

    Automatic detection of cognitive impairment in elderly people using an entertainment chatbot with Natural Language Processing capabilities

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, Francisco J. González-Castaño, Enrique Costa-Montenegro

    Abstract: Previous researchers have proposed intelligent systems for therapeutic monitoring of cognitive impairments. However, most existing practical approaches for this purpose are based on manual tests. This raises issues such as excessive caretaking effort and the white-coat effect. To avoid these issues, we present an intelligent conversational system for entertaining elderly people with news of their… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  15. Explainable machine learning multi-label classification of Spanish legal judgements

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, Francisco J. González-Castaño, Jaime González-González

    Abstract: Artificial Intelligence techniques such as Machine Learning (ML) have not been exploited to their maximum potential in the legal domain. This has been partially due to the insufficient explanations they provided about their decisions. Automatic expert systems with explanatory capabilities can be specially useful when legal practitioners search jurisprudence to gather contextual knowledge for their… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  16. arXiv:2405.14505  [pdf, other

    cs.CL cs.AI cs.CE cs.LG

    Explainable automatic industrial carbon footprint estimation from bank transaction classification using natural language processing

    Authors: Jaime González-González, Silvia García-Méndez, Francisco de Arriba-Pérez, Francisco J. González-Castaño, Óscar Barba-Seara

    Abstract: Concerns about the effect of greenhouse gases have motivated the development of certification protocols to quantify the industrial carbon footprint (CF). These protocols are manual, work-intensive, and expensive. All of the above have led to a shift towards automatic data-driven approaches to estimate the CF, including Machine Learning (ML) solutions. Unfortunately, the decision-making processes i… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  17. A review on the use of large language models as virtual tutors

    Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, María del Carmen Somoza-López

    Abstract: Transformer architectures contribute to managing long-term dependencies for Natural Language Processing, representing one of the most recent changes in the field. These architectures are the basis of the innovative, cutting-edge Large Language Models (LLMs) that have produced a huge buzz in several fields and industrial sectors, among the ones education stands out. Accordingly, these generative Ar… ▽ More

    Submitted 5 September, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Journal ref: Science & Education (2024), 1-16

  18. Exposing and Explaining Fake News On-the-Fly

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, Fátima Leal, Benedita Malheiro, Juan Carlos Burguillo

    Abstract: Social media platforms enable the rapid dissemination and consumption of information. However, users instantly consume such content regardless of the reliability of the shared data. Consequently, the latter crowdsourcing model is exposed to manipulation. This work contributes with an explainable and online classification method to recognize fake news in real-time. The proposed method combines both… ▽ More

    Submitted 5 September, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Journal ref: Mach Learn (2024)

  19. arXiv:2404.08665  [pdf, other

    cs.IR cs.CL cs.LG cs.SI q-fin.TR

    Targeted aspect-based emotion analysis to detect opportunities and precaution in financial Twitter messages

    Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, Ana Barros-Vila, Francisco J. González-Castaño

    Abstract: Microblogging platforms, of which Twitter is a representative example, are valuable information sources for market screening and financial models. In them, users voluntarily provide relevant information, including educated knowledge on investments, reacting to the state of the stock markets in real-time and, often, influencing this state. We are interested in the user forecasts in financial, socia… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  20. arXiv:2404.07224  [pdf, other

    q-fin.ST cs.CE cs.IR cs.LG cs.SI

    Detection of financial opportunities in micro-blogging data with a stacked classification system

    Authors: Francisco de Arriba-Pérez, Silvia García-Méndez, José A. Regueiro-Janeiro, Francisco J. González-Castaño

    Abstract: Micro-blogging sources such as the Twitter social network provide valuable real-time data for market prediction models. Investors' opinions in this network follow the fluctuations of the stock markets and often include educated speculations on market opportunities that may have impact on the actions of other investors. In view of this, we propose a novel system to detect positive predictions in tw… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  21. arXiv:2404.01338  [pdf, other

    cs.CL cs.CE cs.IR cs.LG q-fin.ST

    Automatic detection of relevant information, predictions and forecasts in financial news through topic modelling with Latent Dirichlet Allocation

    Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, Ana Barros-Vila, Francisco J. González-Castaño, Enrique Costa-Montenegro

    Abstract: Financial news items are unstructured sources of information that can be mined to extract knowledge for market screening applications. Manual extraction of relevant information from the continuous stream of finance-related news is cumbersome and beyond the skills of many investors, who, at most, can follow a few sources and authors. Accordingly, we focus on the analysis of financial news to identi… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  22. arXiv:2404.01337  [pdf, other

    cs.CL cs.CE cs.IR cs.LG q-fin.ST

    Detection of Temporality at Discourse Level on Financial News by Combining Natural Language Processing and Machine Learning

    Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, Ana Barros-Vila, Francisco J. González-Castaño

    Abstract: Finance-related news such as Bloomberg News, CNN Business and Forbes are valuable sources of real data for market screening systems. In news, an expert shares opinions beyond plain technical analyses that include context such as political, sociological and cultural factors. In the same text, the expert often discusses the performance of different assets. Some key statements are mere descriptions o… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  23. arXiv:2404.01327  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Entertainment chatbot for the digital inclusion of elderly people without abstraction capabilities

    Authors: Silvia García-Méndez, Francisco de Arriba-Pérez, Francisco J. González-Castaño, José A. Regueiro-Janeiro, Felipe Gil-Castiñeira

    Abstract: Current language processing technologies allow the creation of conversational chatbot platforms. Even though artificial intelligence is still too immature to support satisfactory user experience in many mass market domains, conversational interfaces have found their way into ad hoc applications such as call centres and online shopping assistants. However, they have not been applied so far to socia… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  24. Automatic explanation of the classification of Spanish legal judgments in jurisdiction-dependent law categories with tree estimators

    Authors: Jaime González-González, Francisco de Arriba-Pérez, Silvia García-Méndez, Andrea Busto-Castiñeira, Francisco J. González-Castaño

    Abstract: Automatic legal text classification systems have been proposed in the literature to address knowledge extraction from judgments and detect their aspects. However, most of these systems are black boxes even when their models are interpretable. This may raise concerns about their trustworthiness. Accordingly, this work contributes with a system combining Natural Language Processing (NLP) with Machin… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.