-
Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation
Authors:
Gerard Pons,
Besim Bilalli,
Anna Queralt
Abstract:
Recent advances in Large Language Models (LLMs) have positioned them as a prominent solution for Natural Language Processing tasks. Notably, they can approach these problems in a zero or few-shot manner, thereby eliminating the need for training or fine-tuning task-specific models. However, LLMs face some challenges, including hallucination and the presence of outdated knowledge or missing information from specific domains in the training data. These problems cannot be easily solved by retraining the models with new data as it is a time-consuming and expensive process. To mitigate these issues, Knowledge Graphs (KGs) have been proposed as a structured external source of information to enrich LLMs. With this idea, in this work we use KGs to enhance LLMs for zero-shot Entity Disambiguation (ED). For that purpose, we leverage the hierarchical representation of the entities' classes in a KG to gradually prune the candidate space as well as the entities' descriptions to enrich the input prompt with additional factual knowledge. Our evaluation on popular ED datasets shows that the proposed method outperforms non-enhanced and description-only enhanced LLMs, and has a higher degree of adaptability than task-specific models. Furthermore, we conduct an error analysis and discuss the impact of the leveraged KG's semantic expressivity on the ED performance.
Submitted 6 May, 2025; v1 submitted 5 May, 2025;
originally announced May 2025.
-
Towards Continuous Experiment-driven MLOps
Authors:
Keerthiga Rajenthiram,
Milad Abdullah,
Ilias Gerostathopoulos,
Petr Hnetynka,
Tomáš Bureš,
Gerard Pons,
Besim Bilalli,
Anna Queralt
Abstract:
Despite advancements in MLOps and AutoML, ML development remains challenging for data scientists. First, there is poor support for and limited control over optimizing and evolving ML models. Second, there is a lack of efficient mechanisms for the continuous evolution of ML models that would leverage the knowledge gained in previous optimizations of the same or different models. We propose an experiment-driven MLOps approach which tackles these problems. Our approach relies on the concept of an experiment, which embodies a fully controllable optimization process. It introduces full traceability and repeatability to the optimization process, allows humans to be in full control of it, and enables continuous improvement of the ML system. Importantly, it also establishes knowledge, which is carried over and built across a series of experiments and allows for improving the efficiency of experimentation over time. We demonstrate our approach through its realization and application in the ExtremeXP project (Horizon Europe).
Submitted 5 March, 2025;
originally announced March 2025.
-
Capturing and Anticipating User Intents in Data Analytics via Knowledge Graphs
Authors:
Gerard Pons,
Besim Bilalli,
Anna Queralt
Abstract:
In today's data-driven world, the ability to extract meaningful information from data is becoming essential for businesses, organizations and researchers alike. For that purpose, a wide range of tools and systems exist addressing data-related tasks, from data integration, preprocessing and modeling, to the interpretation and evaluation of the results. As data continues to grow in volume, variety, and complexity, there is an increasing need for advanced but user-friendly tools, such as intelligent discovery assistants (IDAs) or automated machine learning (AutoML) systems, that facilitate the user's interaction with the data. This enables non-expert users, such as citizen data scientists, to leverage powerful data analytics techniques effectively. The assistance offered by IDAs or AutoML tools should not be guided only by the analytical problem's data but should also be tailored to each individual user. To this end, this work explores the usage of Knowledge Graphs (KGs) as a basic framework for capturing complex analytics workflows in a human-centered manner, by storing information not only about the workflow's components, datasets and algorithms but also about the users, their intents and their feedback, among others. The data stored in the generated KG can then be exploited to provide assistance (e.g., recommendations) to the users interacting with these systems. To accomplish this objective, two methods are explored in this work. Initially, the usage of query templates to extract relevant information from the KG is studied. However, upon identifying its main limitations, the usage of link prediction with knowledge graph embeddings is explored, which enhances flexibility and allows leveraging the entire structure and components of the graph. The experiments show that the proposed method is able to capture the graph's structure and to produce sensible suggestions.
Submitted 1 November, 2024;
originally announced November 2024.
-
The evolution of ultra-massive carbon oxygen white dwarfs
Authors:
María E. Camisassa,
Leandro G. Althaus,
Detlev Koester,
Santiago Torres,
Pilar Gil Pons,
Alejandro H. Córsico
Abstract:
Ultra-massive white dwarfs ($\rm M_{WD} \gtrsim 1.05\, M_{\odot}$) are considered powerful tools to study type Ia supernovae explosions, merger events, the occurrence of physical processes in the Super Asymptotic Giant Branch (SAGB) phase, and the existence of high magnetic fields. Traditionally, ultra-massive white dwarfs are expected to harbour oxygen-neon (ONe) cores. However, new observations and recent theoretical studies suggest that the progenitors of some ultra-massive white dwarfs can avoid carbon burning, leading to the formation of ultra-massive white dwarfs harbouring carbon-oxygen (CO) cores. Here we present a set of ultra-massive white dwarf evolutionary sequences with CO cores for a wide range of metallicity and masses. We take into account the energy released by latent heat and phase separation during the crystallization process and by $^{22}$Ne sedimentation. Realistic chemical profiles resulting from the full computation of progenitor evolution are considered. We compare our CO ultra-massive white dwarf models with ONe models. We conclude that CO ultra-massive white dwarfs evolve significantly slower than their ONe counterparts mainly for three reasons: their larger thermal content, the effect of crystallization, and the effect of $^{22}$Ne sedimentation. We also provide colors in several photometric bands on the basis of new model atmospheres. These CO ultra-massive white dwarf models, together with the ONe ultra-massive white dwarf models, provide an appropriate theoretical framework to study the ultra-massive white dwarf population in our Galaxy.
Submitted 23 February, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
The pulsational properties of ultra-massive DB white dwarfs with carbon-oxygen cores coming from single-star evolution
Authors:
Alejandro H. Córsico,
Leandro G. Althaus,
Pilar Gil Pons,
Santiago Torres
Abstract:
Ultra-massive white dwarfs are relevant for their role as type Ia Supernova progenitors, the occurrence of physical processes in the asymptotic giant-branch phase, the existence of high-field magnetic white dwarfs, and the occurrence of double white dwarf mergers. Some hydrogen-rich ultra-massive white dwarfs are pulsating stars, and as such, they offer the possibility of studying their interiors through asteroseismology. On the other hand, pulsating helium-rich ultra-massive white dwarfs could be even more attractive objects for asteroseismology if they were found, as they should be hotter and less crystallized than pulsating hydrogen-rich white dwarfs, something that would pave the way for probing their deep interiors. We explore the pulsational properties of ultra-massive helium-rich white dwarfs with carbon-oxygen and oxygen-neon cores resulting from single stellar evolution. Our goal is to provide a theoretical basis that could eventually help to discern the core composition of ultra-massive white dwarfs and the scenario of their formation through asteroseismology, anticipating the possible future detection of pulsations in this type of star. We find that, given that the white dwarf models coming from the three scenarios considered are characterized by distinct core chemical profiles, their pulsation properties are also different, thus leading to distinctive signatures in the period-spacing and mode-trapping properties. Our results indicate that, in the event of a detection of pulsating ultra-massive helium-rich white dwarfs, it would be possible to derive valuable information encoded in the core of these stars in connection with the origin of such exotic objects. The detection of pulsations in these stars is likely to be achieved soon through observations collected with ongoing space missions.
Submitted 20 December, 2020;
originally announced December 2020.
-
The formation of ultra-massive carbon-oxygen core white dwarfs and their evolutionary and pulsational properties
Authors:
Leandro G. Althaus,
Pilar Gil Pons,
Alejandro H. Córsico,
Marcelo Miller Bertolami,
Francisco De Gerónimo,
María E. Camisassa,
Santiago Torres,
Jordi Gutierrez,
Alberto Rebassa-Mansergas
Abstract:
(Abridged abstract) We explore the formation of ultra-massive ($M_{\rm WD} \gtrsim 1.05\, M_{\odot}$), carbon-oxygen core white dwarfs resulting from single stellar evolution. We also study their evolutionary and pulsational properties and compare them with those of the ultra-massive white dwarfs with oxygen-neon cores resulting from carbon burning in single progenitor stars, and with binary merger predictions. We consider two single-star evolution scenarios for the formation of ultra-massive carbon-oxygen core white dwarfs that involve rotation of the degenerate core after core helium burning and reduced mass-loss rates in massive asymptotic giant-branch stars. We compare our findings with the predictions from ultra-massive white dwarfs resulting from the merger of two equal-mass carbon-oxygen core white dwarfs, by assuming complete mixing between them and a carbon-oxygen core for the merged remnant. The resulting ultra-massive carbon-oxygen core white dwarfs evolve markedly slower than their oxygen-neon counterparts. Our study strongly suggests the formation of ultra-massive white dwarfs with carbon-oxygen cores from single stellar evolution. We find that both the evolutionary and pulsation properties of these white dwarfs are markedly different from those of their oxygen-neon core counterparts and from those of white dwarfs with carbon-oxygen cores that might result from double degenerate mergers. This can eventually be used to discern the core composition of ultra-massive white dwarfs and their formation scenario.
Submitted 20 November, 2020;
originally announced November 2020.
-
VICSOM: VIsual Clues from SOcial Media for psychological assessment
Authors:
Mohammad Mahdi Dehshibi,
Gerard Pons,
Bita Baiani,
David Masip
Abstract:
Sharing multimodal information (typically images, videos or text) in Social Network Sites (SNS) occupies a relevant part of our time. The particular way in which users present themselves in SNS can provide useful information to infer human behaviors. This paper proposes to use multimodal data gathered from Instagram accounts to predict the perceived prototypical needs described in Glasser's choice theory. The contribution is two-fold: (i) we provide a large multimodal database from Instagram public profiles (more than 30,000 images and text captions) annotated by expert psychologists on each perceived behavior according to Glasser's theory, and (ii) we propose to automate the recognition of the (unconsciously) perceived needs by the users. In particular, we propose a baseline using three different feature sets: visual descriptors based on pixel images (SURF and Visual Bag of Words), a high-level descriptor based on the automated scene description using Convolutional Neural Networks, and a text-based descriptor (Word2vec) obtained from processing the captions provided by the users. Finally, we propose a multimodal fusion of these descriptors, obtaining promising results in the multi-label classification problem.
Submitted 15 May, 2019;
originally announced May 2019.
-
Multi-task, multi-label and multi-domain learning with residual convolutional networks for emotion recognition
Authors:
Gerard Pons,
David Masip
Abstract:
Automated emotion recognition in the wild from facial images remains a challenging problem. Although recent advances in Deep Learning have brought a significant breakthrough in this topic, strong changes in pose, orientation and point of view severely harm current approaches. In addition, the acquisition of labeled datasets is costly, and current state-of-the-art deep learning algorithms cannot model all the aforementioned difficulties. In this paper, we propose to apply a multi-task learning loss function to share a common feature representation with other related tasks. In particular, we show that emotion recognition benefits from jointly learning a model with a detector of facial Action Units (collective muscle movements). The proposed loss function addresses the problem of learning multiple tasks with heterogeneously labeled data, improving on previous multi-task approaches. We validate the proposal using two datasets acquired in uncontrolled environments, and an application to predict compound facial emotion expressions.
Submitted 19 February, 2018;
originally announced February 2018.