-
Multimodal Representation Alignment for Cross-modal Information Retrieval
Authors:
Fan Xu,
Luis A. Leiva
Abstract:
Different machine learning models can represent the same underlying concept in different ways. This variability is particularly valuable for in-the-wild multimodal retrieval, where the objective is to identify the corresponding representation in one modality given another modality as input. This challenge can be effectively framed as a feature alignment problem. For example, given a sentence encoded by a language model, retrieve the most semantically aligned image based on features produced by an image encoder, or vice versa. In this work, we first investigate the geometric relationships between visual and textual embeddings derived from both vision-language models and combined unimodal models. We then align these representations using four standard similarity metrics as well as two learned ones, implemented via neural networks. Our findings indicate that the Wasserstein distance can serve as an informative measure of the modality gap, while cosine similarity consistently outperforms alternative metrics in feature alignment tasks. Furthermore, we observe that conventional architectures such as multilayer perceptrons are insufficient for capturing the complex interactions between image and text representations. Our study offers novel insights and practical considerations for researchers working in multimodal information retrieval, particularly in real-world, cross-modal applications.
Submitted 10 June, 2025;
originally announced June 2025.
-
Thalamus: A User Simulation Toolkit for Prototyping Multimodal Sensing Studies
Authors:
Kayhan Latifzadeh,
Luis A. Leiva
Abstract:
Conducting user studies that involve physiological and behavioral measurements is very time-consuming and expensive, as it involves not only careful experiment design, device calibration, etc. but also careful software testing. We propose Thalamus, a software toolkit for collecting and simulating multimodal signals that can help experimenters prepare in advance for unexpected situations before reaching out to the actual study participants and even before having to install or purchase a specific device. Among other features, Thalamus allows the experimenter to modify, synchronize, and broadcast physiological signals (as coming from various data streams) from different devices simultaneously and not necessarily located in the same place. Thalamus is cross-platform, cross-device, and simple to use, thus making it a valuable asset for HCI research.
Submitted 12 May, 2025;
originally announced May 2025.
-
AdSight: Scalable and Accurate Quantification of User Attention in Multi-Slot Sponsored Search
Authors:
Mario Villaizán-Vallelado,
Matteo Salvatori,
Kayhan Latifzadeh,
Antonio Penta,
Luis A. Leiva,
Ioannis Arapakis
Abstract:
Modern Search Engine Results Pages (SERPs) present complex layouts where multiple elements compete for visibility. Attention modelling is crucial for optimising web design and computational advertising, whereas attention metrics can inform ad placement and revenue strategies. We introduce AdSight, a method leveraging mouse cursor trajectories to quantify user attention in multi-slot environments like SERPs in a scalable and accurate manner. AdSight uses a novel Transformer-based sequence-to-sequence architecture where the encoder processes cursor trajectory embeddings, and the decoder incorporates slot-specific features, enabling robust attention prediction across various SERP layouts. We evaluate our approach on two Machine Learning tasks: (1) regression, to predict fixation times and counts; and (2) classification, to determine whether some slot types were noticed. Our findings demonstrate the model's ability to predict attention with unprecedented precision, offering actionable insights for researchers and practitioners.
Submitted 7 May, 2025; v1 submitted 30 April, 2025;
originally announced May 2025.
-
Sparse-to-Sparse Training of Diffusion Models
Authors:
Inês Cardoso Oliveira,
Decebal Constantin Mocanu,
Luis A. Leiva
Abstract:
Diffusion models (DMs) are a powerful type of generative model that have achieved state-of-the-art results in various image synthesis tasks and have shown potential in other domains, such as natural language processing and temporal data modeling. Despite their stable training dynamics and ability to produce diverse high-quality samples, DMs are notorious for requiring significant computational resources, both in the training and inference stages. Previous work has focused mostly on increasing the efficiency of model inference. This paper introduces, for the first time, the paradigm of sparse-to-sparse training to DMs, with the aim of improving both training and inference efficiency. We focus on unconditional generation and train sparse DMs from scratch (Latent Diffusion and ChiroDiff) on six datasets using three different methods (Static-DM, RigL-DM, and MagRan-DM) to study the effect of sparsity on model performance. Our experiments show that sparse DMs are able to match and often outperform their dense counterparts, while substantially reducing the number of trainable parameters and FLOPs. We also identify safe and effective values to perform sparse-to-sparse training of DMs.
Submitted 30 April, 2025;
originally announced April 2025.
-
Brain Signatures of Time Perception in Virtual Reality
Authors:
Sahar Niknam,
Saravanakumar Duraisamy,
Jean Botev,
Luis A. Leiva
Abstract:
Achieving a high level of immersion and adaptation in virtual reality (VR) requires precise measurement and representation of user state. While extrinsic physical characteristics such as locomotion and pose can be accurately tracked in real-time, reliably capturing mental states is more challenging. Quantitative psychology allows considering more intrinsic features like emotion, attention, or cognitive load. Time perception, in particular, is strongly tied to users' mental states, including stress, focus, and boredom. However, research on objectively measuring the pace at which we perceive the passage of time is scarce. In this work, we investigate the potential of electroencephalography (EEG) as an objective measure of time perception in VR, exploring neural correlates with oscillatory responses and time-frequency analysis. To this end, we implemented a variety of time perception modulators in VR, collected EEG recordings, and labeled them with overestimation, correct estimation, and underestimation time perception states. We found clear EEG spectral signatures for these three states that are persistent across individuals, modulators, and modulation duration. These signatures can be integrated and applied to monitor and actively influence time perception in VR, allowing the virtual environment to be purposefully adapted to the individual to increase immersion further and improve user experience. A free copy of this paper and all supplemental materials are available at https://vrarlab.uni.lu/pub/brain-signatures.
Submitted 10 April, 2025;
originally announced April 2025.
-
A Comparative Study of Scanpath Models in Graph-Based Visualization
Authors:
Angela Lopez-Cardona,
Parvin Emami,
Sebastian Idesis,
Saravanakumar Duraisamy,
Luis A. Leiva,
Ioannis Arapakis
Abstract:
Information Visualization (InfoVis) systems utilize visual representations to enhance data interpretation. Understanding how visual attention is allocated is essential for optimizing interface design. However, collecting Eye-tracking (ET) data presents challenges related to cost, privacy, and scalability. Computational models provide alternatives for predicting gaze patterns, thereby advancing InfoVis research. In our study, we conducted an ET experiment with 40 participants who analyzed graphs while responding to questions of varying complexity within the context of digital forensics. We compared human scanpaths with synthetic ones generated by models such as DeepGaze, UMSS, and Gazeformer. Our research evaluates the accuracy of these models and examines how question complexity and number of nodes influence performance. This work contributes to the development of predictive modeling in visual analytics, offering insights that can enhance the design and effectiveness of InfoVis systems.
Submitted 3 June, 2025; v1 submitted 31 March, 2025;
originally announced March 2025.
-
The AI-Therapist Duo: Exploring the Potential of Human-AI Collaboration in Personalized Art Therapy for PICS Intervention
Authors:
Bereket A. Yilma,
Chan Mi Kim,
Geke Ludden,
Thomas van Rompay,
Luis A. Leiva
Abstract:
Post-intensive care syndrome (PICS) is a multifaceted condition that arises from prolonged stays in an intensive care unit (ICU). While preventing PICS among ICU patients is becoming increasingly important, interventions remain limited. Building on evidence supporting the effectiveness of art exposure in addressing the psychological aspects of PICS, we propose a novel art therapy solution through a collaborative Human-AI approach that enhances personalized therapeutic interventions using state-of-the-art Visual Art Recommendation Systems. We developed two Human-in-the-Loop (HITL) personalization methods and assessed their impact through a large-scale user study (N=150). Our findings demonstrate that this Human-AI collaboration not only enhances the personalization and effectiveness of art therapy but also supports therapists by streamlining their workload. While our study centres on PICS intervention, the results suggest that Human-AI collaborative art therapy could potentially benefit other areas where emotional support is critical, such as cases of anxiety and depression.
Submitted 13 February, 2025;
originally announced February 2025.
-
Transfer Learning for Covert Speech Classification Using EEG Hilbert Envelope and Temporal Fine Structure
Authors:
Saravanakumar Duraisamy,
Mateusz Dubiel,
Maurice Rekrut,
Luis A. Leiva
Abstract:
Brain-Computer Interfaces (BCIs) can decode imagined speech from neural activity. However, these systems typically require extensive training sessions where participants repeat words in their imagination, leading to mental fatigue and difficulties identifying the onset of words, especially when imagining sequences of words. This paper addresses these challenges by transferring a classifier trained on overt speech data to covert speech classification. We derived electroencephalogram (EEG) features from the Hilbert envelope and temporal fine structure, and used them to train a bidirectional long short-term memory (BiLSTM) model for classification. Our method reduces the burden of extensive training and achieves state-of-the-art classification accuracy: 86.44% for overt speech and 79.82% for covert speech using the overt speech classifier.
Submitted 6 February, 2025;
originally announced February 2025.
-
Text-to-Image Generation for Vocabulary Learning Using the Keyword Method
Authors:
Nuwan T. Attygalle,
Matjaž Kljun,
Aaron Quigley,
Klen Čopič Pucihar,
Jens Grubert,
Verena Biener,
Luis A. Leiva,
Juri Yoneyama,
Alice Toniolo,
Angela Miguel,
Hirokazu Kato,
Maheshya Weerasinghe
Abstract:
The 'keyword method' is an effective technique for learning vocabulary of a foreign language. It involves creating a memorable visual link between what a word means and what its pronunciation in a foreign language sounds like in the learner's native language. However, these memorable visual links remain implicit in people's minds and are not easy to remember for a large set of words. To enhance the memorisation and recall of the vocabulary, we developed an application that combines the keyword method with text-to-image generators to externalise the memorable visual links into visuals. These visuals represent additional stimuli during the memorisation process. To explore the effectiveness of this approach, we first ran a pilot study to investigate how difficult it is to externalise the descriptions of mental visualisations of memorable links, by asking participants to write them down. We used these descriptions as prompts for a text-to-image generator (DALL-E2) to convert them into images and asked participants to select their favourites. Next, we compared different text-to-image generators (DALL-E2, Midjourney, Stable and Latent Diffusion) to evaluate the perceived quality of the images generated by each. Despite heterogeneous results, participants mostly preferred images generated by DALL-E2, which was also used for the final study. In this study, we investigated whether providing such images enhances the retention of vocabulary being learned, compared to the keyword method only. Our results indicate that people did not encounter difficulties describing their visualisations of memorable links and that providing corresponding images significantly improves memory retention.
Submitted 28 January, 2025;
originally announced January 2025.
-
Do Large Language Models Show Biases in Causal Learning?
Authors:
Maria Victoria Carro,
Francisca Gauna Selasco,
Denise Alejandra Mester,
Margarita Gonzales,
Mario A. Leiva,
Maria Vanina Martinez,
Gerardo I. Simari
Abstract:
Causal learning is the cognitive process of developing the capability of making causal inferences based on available information, often guided by normative principles. This process is prone to errors and biases, such as the illusion of causality, in which people perceive a causal relationship between two variables despite lacking supporting evidence. This cognitive bias has been proposed to underlie many societal problems, including social prejudice, stereotype formation, misinformation, and superstitious thinking. In this research, we investigate whether large language models (LLMs) develop causal illusions, both in real-world and controlled laboratory contexts of causal learning and inference. To this end, we built a dataset of over 2K samples including purely correlational cases, situations with null contingency, and cases where temporal information excludes the possibility of causality by placing the potential effect before the cause. We then prompted the models to make statements or answer causal questions to evaluate their tendencies to infer causation erroneously in these structured settings. Our findings show a strong presence of causal illusion bias in LLMs. Specifically, in open-ended generation tasks involving spurious correlations, the models displayed bias at levels comparable to, or even lower than, those observed in similar studies on human subjects. However, when faced with null-contingency scenarios or temporal cues that negate causal relationships, where they were required to respond on a 0-100 scale, the models exhibited significantly higher bias. These findings suggest that the models have not uniformly, consistently, or reliably internalized the normative principles essential for accurate causal learning.
Submitted 13 December, 2024;
originally announced December 2024.
-
Are UFOs Driving Innovation? The Illusion of Causality in Large Language Models
Authors:
María Victoria Carro,
Francisca Gauna Selasco,
Denise Alejandra Mester,
Mario Alejandro Leiva
Abstract:
Illusions of causality occur when people develop the belief that there is a causal connection between two variables with no supporting evidence. This cognitive bias has been proposed to underlie many societal problems including social prejudice, stereotype formation, misinformation and superstitious thinking. In this research we investigate whether large language models develop the illusion of causality in real-world settings. We evaluated and compared news headlines generated by GPT-4o-Mini, Claude-3.5-Sonnet, and Gemini-1.5-Pro to determine whether the models incorrectly framed correlations as causal relationships. To also measure sycophantic behavior, which occurs when a model aligns with a user's beliefs in order to look favorable even if it is not objectively correct, we additionally incorporated the bias into the prompts, observing whether this manipulation increases the likelihood of the models exhibiting the illusion of causality. We found that Claude-3.5-Sonnet is the model that presents the lowest degree of causal illusion, in line with experiments on Correlation-to-Causation Exaggeration in human-written press releases. On the other hand, our findings suggest that while mimicry sycophancy increases the likelihood of causal illusions in these models, especially in GPT-4o-Mini, Claude-3.5-Sonnet remains the most robust against this cognitive bias.
Submitted 15 October, 2024;
originally announced October 2024.
-
MOSAIC: Multimodal Multistakeholder-aware Visual Art Recommendation
Authors:
Bereket A. Yilma,
Luis A. Leiva
Abstract:
Visual art (VA) recommendation is complex, as it has to consider the interests of users (e.g. museum visitors) and other stakeholders (e.g. museum curators). We study how to effectively account for key stakeholders in VA recommendations while also considering user-centred measures such as novelty, serendipity, and diversity. We propose MOSAIC, a novel multimodal multistakeholder-aware approach using state-of-the-art CLIP and BLIP backbone architectures and two joint optimisation objectives: popularity and representative selection of paintings across different categories. We conducted an offline evaluation using preferences elicited from 213 users followed by a user study with 100 crowdworkers. We found a strong effect of popularity, which was positively perceived by users, and a minimal effect of representativeness. MOSAIC's impact extends beyond visitors, benefiting various art stakeholders. Its user-centric approach has broader applicability, offering advancements for content recommendation across domains that require considering multiple stakeholders.
Submitted 31 July, 2024;
originally announced July 2024.
-
Modeling User Preferences via Brain-Computer Interfacing
Authors:
Luis A. Leiva,
V. Javier Traver,
Alexandra Kawala-Sterniuk,
Tuukka Ruotsalo
Abstract:
Present Brain-Computer Interfacing (BCI) technology allows inference and detection of cognitive and affective states, but fairly little has been done to study scenarios in which such information can facilitate new applications that rely on modeling human cognition. One state that can be quantified from various physiological signals is attention. Estimates of human attention can be used to reveal preferences and novel dimensions of user experience. Previous approaches have tackled these incredibly challenging tasks using a variety of behavioral signals, from dwell-time to click-through data, and computational models of visual correspondence to these behavioral signals. However, behavioral signals are only rough estimations of the real underlying attention and affective preferences of the users. Indeed, users may attend to some content simply because it is salient, but not because it is really interesting, or simply because it is outrageous. With this paper, we put forward a research agenda and example work using BCI to infer users' preferences, their attentional correlates towards visual content, and their associations with affective experience. Subsequently, we link these to relevant applications, such as information retrieval, personalized steering of generative models, and crowdsourcing population estimates of affective experiences.
Submitted 31 May, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
Impact of Design Decisions in Scanpath Modeling
Authors:
Parvin Emami,
Yue Jiang,
Zixin Guo,
Luis A. Leiva
Abstract:
Modeling visual saliency in graphical user interfaces (GUIs) makes it possible to understand how people perceive GUI designs and what elements attract their attention. One aspect that is often overlooked is the fact that computational models depend on a series of design parameters that are not straightforward to decide. We systematically analyze how different design parameters affect scanpath evaluation metrics using a state-of-the-art computational model (DeepGaze++). We particularly focus on three design parameters: input image size, inhibition-of-return decay, and masking radius. We show that even small variations of these design parameters have a noticeable impact on standard evaluation metrics such as DTW or Eyenalysis. These effects also occur in other scanpath models, such as UMSS and ScanGAN, and in other datasets such as MASSVIS. Taken together, our results put forward the impact of design decisions for predicting users' viewing behavior on GUIs.
Submitted 14 May, 2024;
originally announced May 2024.
-
Examining Humanness as a Metaphor to Design Voice User Interfaces
Authors:
Smit Desai,
Mateusz Dubiel,
Luis A. Leiva
Abstract:
Voice User Interfaces (VUIs) increasingly leverage 'humanness' as a foundational design metaphor, adopting roles like 'assistants,' 'teachers,' and 'secretaries' to foster natural interactions. Yet, this approach can sometimes misalign user trust and reinforce societal stereotypes, leading to socio-technical challenges that might impede long-term engagement. This paper explores an alternative approach to navigate these challenges: incorporating non-human metaphors in VUI design. We report on a study with 240 participants examining the effects of human versus non-human metaphors on user perceptions within health and finance domains. Results indicate a preference for the human metaphor (doctor) over the non-human (health encyclopedia) in health contexts for its perceived enjoyability and likeability. In finance, however, user perceptions do not significantly differ between human (financial advisor) and non-human (calculator) metaphors. Importantly, our research reveals that the explicit awareness of a metaphor's use influences adoption intentions, with a marked preference for non-human metaphors when their metaphorical nature is not disclosed. These findings highlight context-specific conversation design strategies required in integrating non-human metaphors into VUI design, suggesting tradeoffs and design considerations that could enhance user engagement and adoption.
Submitted 13 May, 2024;
originally announced May 2024.
-
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
Authors:
Yue Jiang,
Zixin Guo,
Hamed Rezazadegan Tavakoli,
Luis A. Leiva,
Antti Oulasvirta
Abstract:
From a visual perception perspective, modern graphical user interfaces (GUIs) comprise a complex graphics-rich two-dimensional visuospatial arrangement of text, images, and interactive objects such as buttons and menus. While existing models can accurately predict regions and objects that are likely to attract attention "on average", so far there is no scanpath model capable of predicting scanpaths for an individual. To close this gap, we introduce EyeFormer, which leverages a Transformer architecture as a policy network to guide a deep reinforcement learning algorithm that controls gaze locations. Our model has the unique capability of producing personalized predictions when given a few user scanpath samples. It can predict full scanpath information, including fixation positions and duration, across individuals and various stimulus types. Additionally, we demonstrate applications in GUI layout optimization driven by our model. Our software and models will be publicly available.
Submitted 20 April, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
Artful Path to Healing: Using Machine Learning for Visual Art Recommendation to Prevent and Reduce Post-Intensive Care
Authors:
Bereket A. Yilma,
Chan Mi Kim,
Gerald C. Cupchik,
Luis A. Leiva
Abstract:
Staying in the intensive care unit (ICU) is often traumatic, leading to post-intensive care syndrome (PICS), which encompasses physical, psychological, and cognitive impairments. Currently, there are limited interventions available for PICS. Studies indicate that exposure to visual art may help address the psychological aspects of PICS and be more effective if it is personalized. We develop Machine Learning-based Visual Art Recommendation Systems (VA RecSys) to enable personalized therapeutic visual art experiences for post-ICU patients. We investigate four state-of-the-art VA RecSys engines, evaluating the relevance of their recommendations for therapeutic purposes compared to expert-curated recommendations. We conduct an expert pilot test and a large-scale user study (n=150) to assess the appropriateness and effectiveness of these recommendations. Our results suggest all recommendations enhance temporal affective states. Visual and multimodal VA RecSys engines compare favourably with expert-curated recommendations, indicating their potential to support the delivery of personalized art therapy for PICS prevention and treatment.
Submitted 23 February, 2024;
originally announced February 2024.
-
Awareness in robotics: An early perspective from the viewpoint of the EIC Pathfinder Challenge "Awareness Inside"
Authors:
Cosimo Della Santina,
Carlos Hernandez Corbato,
Burak Sisman,
Luis A. Leiva,
Ioannis Arapakis,
Michalis Vakalellis,
Jean Vanderdonckt,
Luis Fernando D'Haro,
Guido Manzi,
Cristina Becchio,
Aïda Elamrani,
Mohsen Alirezaei,
Ginevra Castellano,
Dimos V. Dimarogonas,
Arabinda Ghosh,
Sofie Haesaert,
Sadegh Soudjani,
Sybert Stroeve,
Paul Verschure,
Davide Bacciu,
Ophelia Deroy,
Bahador Bahrami,
Claudio Gallicchio,
Sabine Hauert,
Ricardo Sanz
, et al. (6 additional authors not shown)
Abstract:
Consciousness has been historically a heavily debated topic in engineering, science, and philosophy. On the contrary, awareness had less success in raising the interest of scholars in the past. However, things are changing as more and more researchers are getting interested in answering questions concerning what awareness is and how it can be artificially generated. The landscape is rapidly evolving, with multiple voices and interpretations of the concept being conceived and techniques being developed. The goal of this paper is to summarize and discuss the ones among these voices connected with projects funded by the EIC Pathfinder Challenge called "Awareness Inside", a nonrecurring call for proposals within Horizon Europe designed specifically for fostering research on natural and synthetic awareness. In this perspective, we dedicate special attention to challenges and promises of applying synthetic awareness in robotics, as the development of mature techniques in this new field is expected to have a special impact on generating more capable and trustworthy embodied systems.
Submitted 14 February, 2024;
originally announced February 2024.
-
Impact of Voice Fidelity on Decision Making: A Potential Dark Pattern?
Authors:
Mateusz Dubiel,
Anastasia Sergeeva,
Luis A. Leiva
Abstract:
Manipulative design in user interfaces (conceptualized as dark patterns) has emerged as a significant impediment to the ethical design of technology and a threat to user agency and freedom of choice. While previous research focused on exploring these patterns in the context of graphical user interfaces, the impact of speech has largely been overlooked. We conducted a listening test (N = 50) to elicit participants' preferences regarding different synthetic voices that varied in terms of synthesis method (concatenative vs. neural) and prosodic qualities (speech pace and pitch variance), and then evaluated their impact in an online decision-making study (N = 101). Our results indicate a significant effect of voice qualities on participants' choices, independently of the content of the available options. Our results also indicate that the voice's perceived engagement, ease of understanding, and domain fit directly translate to its impact on participants' behaviour in decision-making tasks.
Submitted 10 February, 2024;
originally announced February 2024.
-
UEyes: An Eye-Tracking Dataset across User Interface Types
Authors:
Yue Jiang,
Luis A. Leiva,
Paul R. B. Houssel,
Hamed R. Tavakoli,
Julia Kylmälä,
Antti Oulasvirta
Abstract:
Different types of user interfaces differ significantly in the number of elements and how they are displayed. To examine how such differences affect the way users look at UIs, we collected and analyzed a large eye-tracking-based dataset, UEyes (62 participants, 1,980 UI screenshots, nearly 20K eye movement sequences), covering four major UI types: webpage, desktop UI, mobile UI, and poster. Furthermore, we analyze and discuss the differences in important factors, such as color, location, and gaze direction across UI types, individual viewing strategies, and potential future directions. This position paper is a derivative of our recent paper with a particular focus on the UEyes dataset.
Submitted 7 February, 2024;
originally announced February 2024.
-
The Elements of Visual Art Recommendation: Learning Latent Semantic Representations of Paintings
Authors:
Bereket A. Yilma,
Luis A. Leiva
Abstract:
Artwork recommendation is challenging because it requires understanding how users interact with highly subjective content, the complexity of the concepts embedded within the artwork, and the emotional and cognitive reflections they may trigger in users. In this paper, we focus on efficiently capturing the elements (i.e., latent semantic relationships) of visual art for personalized recommendation. We propose and study recommender systems based on textual and visual feature learning techniques, as well as their combinations. We then perform a small-scale and a large-scale user-centric evaluation of the quality of the recommendations. Our results indicate that textual features compare favourably with visual ones, whereas a fusion of both captures the most suitable hidden semantic relationships for artwork recommendation. Ultimately, this paper contributes to our understanding of how to deliver content that suitably matches the user's interests and how they are perceived.
Submitted 28 February, 2023;
originally announced March 2023.
-
Resource Allocation in Multicore Elastic Optical Networks: A Deep Reinforcement Learning Approach
Authors:
Juan Pinto-Ríos,
Felipe Calderón,
Ariel Leiva,
Gabriel Hermosilla,
Alejandra Beghelli,
Danilo Bórquez-Paredes,
Astrid Lozada,
Nicolás Jara,
Ricardo Olivares,
Gabriel Saavedra
Abstract:
A deep reinforcement learning approach is applied, for the first time, to solve the routing, modulation, spectrum and core allocation (RMSCA) problem in dynamic multicore fiber elastic optical networks (MCF-EONs). To do so, a new environment - compatible with OpenAI's Gym - was designed and implemented to emulate the operation of MCF-EONs. The new environment processes the agent actions (selection of route, core and spectrum slot) by considering the network state and physical-layer-related aspects. The latter include the available modulation formats and their reach and the inter-core crosstalk (XT), an MCF-related impairment. If the resulting quality of the signal is acceptable, the environment allocates the resources selected by the agent. After processing the agent's action, the environment is configured to give the agent a numerical reward and information about the new network state. The blocking performance of four different agents was compared through simulation to three baseline heuristics used in MCF-EONs. Results obtained for the NSFNet and COST239 network topologies show that the best-performing agent achieves, on average, up to a four-times decrease in blocking probability relative to the best-performing baseline heuristic methods.
Submitted 5 July, 2022;
originally announced July 2022.
-
A Contextual Framework for Adaptive User Interfaces: Modelling the Interaction Environment
Authors:
Mateusz Dubiel,
Bereket Abera Yilma,
Kayhan Latifzadeh,
Luis A. Leiva
Abstract:
The interaction context (or environment) is key to any HCI task and especially to adaptive user interfaces (AUIs), since it represents the conditions under which users interact with computers. Unfortunately, there are currently no formal representations to model said interaction context. In order to address this gap, we propose a contextual framework for AUIs and illustrate a practical application using learning management systems as a case study. We also discuss limitations of our framework and offer discussion points about the realisation of truly context-aware AUIs.
Submitted 31 March, 2022;
originally announced March 2022.
-
Creep Tide Model for the 3-Body Problem. The rotational evolution of a circumbinary planet
Authors:
F. A. Zoppetti,
H. Folonier,
A. M. Leiva,
C. Beaugé
Abstract:
We present a tidal model for treating the rotational evolution in the general three-body problem with arbitrary viscosities, in which all the masses are considered to be extended and all the tidal interactions between pairs are taken into account. Based on the creep tide theory, we present the set of differential equations that describes the rotational evolution of each body, in a formalism that is easily extensible to the N tidally-interacting body problem. We apply our model to the case of a circumbinary planet and use a Kepler-38-like binary system as a working example. We find that, in this low planetary eccentricity case, the most likely final stationary rotation state is the 1:1 spin-orbit resonance, considering an arbitrary planetary viscosity inside the estimated range for the solar system planets. We derive analytical expressions for the mean rotational stationary state, based on high-order power series of the semimajor axes ratio $a_1/a_2$ and low-order expansions of the eccentricities. These are found to reproduce very accurately the mean behaviour of the low-eccentric numerical integrations for arbitrary planetary relaxation factors, and up to $a_1/a_2 \sim 0.4$. Our analytical model is used to predict the stationary rotation of the Kepler circumbinary planets, and we find that most of them are probably rotating in a sub-synchronous state, although the synchrony shift is much less important than the one estimated in Zoppetti et al. (2019, 2020). We present a comparison of our results with those obtained with the Constant Time Lag and find that, unlike what we assumed in our previous works, the cross torques have a non-negligible net secular contribution, and must be taken into account when computing the tides over each body in an N-extended-body system from an arbitrary reference frame. These torques are naturally taken into account in the creep theory.
Submitted 5 May, 2021;
originally announced May 2021.
-
Adapting User Interfaces with Model-based Reinforcement Learning
Authors:
Kashyap Todi,
Gilles Bailly,
Luis A. Leiva,
Antti Oulasvirta
Abstract:
Adapting an interface requires taking into account both the positive and negative effects that changes may have on the user. A carelessly picked adaptation may impose high costs on the user -- for example, due to surprise or relearning effort -- or "trap" the process in a suboptimal design prematurely. However, effects on users are hard to predict as they depend on factors that are latent and evolve over the course of interaction. We propose a novel approach for adaptive user interfaces that yields a conservative adaptation policy: It finds beneficial changes when there are such and avoids changes when there are none. Our model-based reinforcement learning method plans sequences of adaptations and consults predictive HCI models to estimate their effects. We present empirical and simulation results from the case of adaptive menus, showing that the method outperforms both a non-adaptive and a frequency-based policy.
Submitted 11 March, 2021;
originally announced March 2021.
-
Understanding Visual Saliency in Mobile User Interfaces
Authors:
Luis A. Leiva,
Yunfei Xue,
Avya Bansal,
Hamed R. Tavakoli,
Tuğçe Köroğlu,
Niraj R. Dayama,
Antti Oulasvirta
Abstract:
For graphical user interface (UI) design, it is important to understand what attracts visual attention. While previous work on saliency has focused on desktop and web-based UIs, mobile app UIs differ from these in several respects. We present findings from a controlled study with 30 participants and 193 mobile UIs. The results speak to a role of expectations in guiding where users look. A strong bias toward the top-left corner of the display, text, and images was evident, while bottom-up features such as color or size affected saliency less. Classic, parameter-free saliency models showed a weak fit with the data, and data-driven models improved significantly when trained specifically on this dataset (e.g., NSS rose from 0.66 to 0.84). We also release the first annotated dataset for investigating visual saliency in mobile UIs.
Submitted 22 January, 2021;
originally announced January 2021.
-
My Mouse, My Rules: Privacy Issues of Behavioral User Profiling via Mouse Tracking
Authors:
Luis A. Leiva,
Ioannis Arapakis,
Costas Iordanou
Abstract:
This paper aims to stir debate about a disconcerting privacy issue on web browsing that could easily emerge because of unethical practices and uncontrolled use of technology. We demonstrate how straightforward it is to capture behavioral data about users at scale, by unobtrusively tracking their mouse cursor movements, and to predict users' demographic information with reasonable accuracy using five lines of code. Based on our results, we propose an adversarial method to mitigate user profiling techniques that make use of mouse cursor tracking, such as the recurrent neural net we analyze in this paper. We also release our data and a web browser extension that implements our adversarial method, so that others can benefit from this work in practice.
Submitted 22 January, 2021;
originally announced January 2021.
-
Query Abandonment Prediction with Recurrent Neural Models of Mouse Cursor Movements
Authors:
Lukas Brückner,
Ioannis Arapakis,
Luis A. Leiva
Abstract:
Most successful search queries do not result in a click if the user can satisfy their information needs directly on the SERP. Modeling query abandonment in the absence of click-through data is challenging because search engines must rely on other behavioral signals to understand the underlying search intent. We show that mouse cursor movements make a valuable, low-cost behavioral signal that can discriminate good and bad abandonment. We model mouse movements on SERPs using recurrent neural nets and explore several data representations that do not rely on expensive hand-crafted features and do not depend on a particular SERP structure. We also experiment with data resampling and augmentation techniques that we adopt for sequential data. Our results can help search providers to gauge user satisfaction for queries without clicks and ultimately contribute to a better understanding of search engine performance.
Submitted 22 January, 2021;
originally announced January 2021.
-
The 2017 May 20$^{\rm th}$ stellar occultation by the elongated centaur (95626) 2002 GZ$_{32}$
Authors:
P. Santos-Sanz,
J. L. Ortiz,
B. Sicardy,
G. Benedetti-Rossi,
N. Morales,
E. Fernández-Valenzuela,
R. Duffard,
R. Iglesias-Marzoa,
J. L. Lamadrid,
N. Maícas,
L. Pérez,
K. Gazeas,
J. C. Guirado,
V. Peris,
F. J. Ballesteros,
F. Organero,
L. Ana-Hernández,
F. Fonseca,
A. Alvarez-Candal,
Y. Jiménez-Teja,
M. Vara-Lubiano,
F. Braga-Ribas,
J. I. B. Camargo,
J. Desmars,
M. Assafin
, et al. (34 additional authors not shown)
Abstract:
We predicted a stellar occultation of the bright star Gaia DR1 4332852996360346368 (UCAC4 385-75921) (m$_{\rm V}$= 14.0 mag) by the centaur 2002 GZ$_{32}$ for 2017 May 20$^{\rm th}$. Our latest shadow path prediction was favourable to a large region in Europe. Observations were arranged in a broad region inside the nominal shadow path. Series of images were obtained with 29 telescopes throughout Europe and from six of them (five in Spain and one in Greece) we detected the occultation. This is the fourth centaur, besides Chariklo, Chiron and Bienor, for which a multi-chord stellar occultation is reported. By means of an elliptical fit to the occultation chords we obtained the limb of 2002 GZ$_{32}$ during the occultation, resulting in an ellipse with axes of 305 $\pm$ 17 km $\times$ 146 $\pm$ 8 km. From this limb, thanks to a rotational light curve obtained shortly after the occultation, we derived the geometric albedo of 2002 GZ$_{32}$ ($p_{\rm V}$ = 0.043 $\pm$ 0.007) and a 3-D ellipsoidal shape with axes 366 km $\times$ 306 km $\times$ 120 km. This shape is not fully consistent with a homogeneous body in hydrostatic equilibrium for the known rotation period of 2002 GZ$_{32}$. The size (albedo) obtained from the occultation is respectively smaller (greater) than that derived from the radiometric technique but compatible within error bars. No rings or debris around 2002 GZ$_{32}$ were detected from the occultation, but narrow and thin rings cannot be discarded.
Submitted 11 December, 2020;
originally announced December 2020.
-
Human or Machine? It Is Not What You Write, But How You Write It
Authors:
Luis A. Leiva,
Moises Diaz,
Miguel A. Ferrer,
Réjean Plamondon
Abstract:
Online fraud often involves identity theft. Since most security measures are weak or can be spoofed, we investigate a more nuanced and less explored avenue: behavioral biometrics via handwriting movements. This kind of data can be used to verify whether a user is operating a device or a computer application, so it is important to distinguish between human and machine-generated movements reliably. For this purpose, we study handwritten symbols (isolated characters, digits, gestures, and signatures) produced by humans and machines, and compare and contrast several deep learning models. We find that if symbols are presented as static images, they can fool state-of-the-art classifiers (near 75% accuracy in the best case) but can be distinguished with remarkable accuracy if they are presented as temporal sequences (95% accuracy in the average case). We conclude that an accurate detection of fake movements has more to do with how users write, rather than what they write. Our work has implications for computerized systems that need to authenticate or verify legitimate human users, and provides an additional layer of security to keep attackers at bay.
Submitted 25 October, 2020;
originally announced October 2020.
-
Learning Efficient Representations of Mouse Movements to Predict User Attention
Authors:
Ioannis Arapakis,
Luis A. Leiva
Abstract:
Tracking mouse cursor movements can be used to predict user attention on heterogeneous page layouts like SERPs. So far, previous work has relied heavily on handcrafted features, which is a time-consuming approach that often requires domain expertise. We investigate different representations of mouse cursor movements, including time series, heatmaps, and trajectory-based images, to build and contrast both recurrent and convolutional neural networks that can predict user attention to direct displays, such as SERP advertisements. Our models are trained over raw mouse cursor data and achieve competitive performance. We conclude that neural network models should be adopted for downstream tasks involving mouse cursor movements, since they can provide an invaluable implicit feedback signal for re-ranking and evaluation.
Submitted 30 May, 2020;
originally announced June 2020.
-
Omnis Prædictio: Estimating the Full Spectrum of Human Performance with Stroke Gestures
Authors:
Luis A. Leiva,
Radu-Daniel Vatavu,
Daniel Martín-Albo,
Réjean Plamondon
Abstract:
Designing effective, usable, and widely adoptable stroke gesture commands for graphical user interfaces is a challenging task that traditionally involves multiple iterative rounds of prototyping, implementation, and follow-up user studies and controlled experiments for evaluation, verification, and validation. An alternative approach is to employ theoretical models of human performance, which can provide practitioners with insightful information right from the earliest stages of user interface design. However, very few aspects of the large spectrum of human performance with stroke gesture input have been investigated and modeled so far, leaving researchers and practitioners of gesture-based user interface design with a very narrow range of predictable measures of human performance, mostly focused on estimating production time, of which extremely few cases delivered accompanying software tools to assist modeling. We address this problem by introducing "Omnis Praedictio" (Omnis for short), a generic technique and companion web tool that provides accurate user-independent estimations of any numerical stroke gesture feature, including custom features specified in code. Our experimental results on three public datasets show that our model estimations correlate on average r > .9 with ground-truth data. Omnis also enables researchers and practitioners to understand human performance with stroke gestures on many levels and, consequently, raises the bar for human performance models and estimation techniques for stroke gesture input.
Submitted 27 May, 2020;
originally announced May 2020.
-
A Price-Per-Attention Auction Scheme Using Mouse Cursor Information
Authors:
Ioannis Arapakis,
Antonio Penta,
Hideo Joho,
Luis A. Leiva
Abstract:
Payments in online ad auctions are typically derived from click-through rates, so that advertisers do not pay for ineffective ads. But advertisers often care about more than just clicks, for example when they aim to raise brand awareness or visibility. There is thus an opportunity to devise a more effective ad pricing paradigm, in which ads are paid for only if they are actually noticed. This article contributes a novel auction format based on a pay-per-attention (PPA) scheme. We show that the PPA auction inherits the same desirable properties (strategy-proofness and efficiency) as its pay-per-impression and pay-per-click counterparts, and that it also compares favourably in terms of revenues. To make the PPA format feasible, we also contribute a scalable diagnostic technology to predict user attention to ads in sponsored search using raw mouse cursor coordinates only, regardless of the page content and structure. We use the user attention predictions in numerical simulations to evaluate the PPA auction scheme. Our results show that, in relevant economic settings, the PPA revenues would be strictly higher than those of the existing auction payment schemes.
Submitted 21 January, 2020;
originally announced January 2020.
-
Tidal evolution of circumbinary systems with arbitrary eccentricities: applications for Kepler systems
Authors:
F. A. Zoppetti,
A. M. Leiva,
C. Beaugé
Abstract:
We present an extended version of the Constant Time Lag analytical approach for the tidal evolution of circumbinary planets introduced in our previous work. The model is self-consistent, in the sense that all tidal interactions between pairs are computed, regardless of their size. We derive analytical expressions for the variational equations governing the spin and orbital evolution, which are expressed as high-order elliptical expansions in the semimajor axis ratio but retain closed form in terms of the binary and planetary eccentricities. These are found to reproduce the results of the numerical simulations with arbitrary eccentricities very well, as well as reducing to our previous results in the low-eccentric case. Our model is then applied to the well-characterised Kepler circumbinary systems by analysing the tidal timescales and unveiling the tidal flow around each different system. In all cases we find that the spins reach stationary values much faster than the characteristic timescale of the orbital evolution, indicating that all Kepler circumbinary planets are expected to be in a sub-synchronous state. On the other hand, all systems are located in a tidal flow leading to outward migration; thus the proximity of the planets to the orbital instability limit may have been even greater in the past. Additionally, Kepler systems may have suffered a significant tidally induced eccentricity damping, which may be related to their proximity to the capture eccentricity. To help understand the predictions of our model, we also offer a simple geometrical interpretation of our results.
Submitted 18 December, 2019;
originally announced December 2019.
-
A self-consistent weak friction model for the tidal evolution of circumbinary planets
Authors:
F. A. Zoppetti,
C. Beaugé,
A. M. Leiva,
H. Folonier
Abstract:
We present a self-consistent model for the tidal evolution of circumbinary planets. Based on the weak-friction model, we derive expressions of the resulting forces and torques considering complete tidal interactions between all the bodies of the system. Although the tidal deformation suffered by each extended mass must take into account the combined gravitational effects of the other two bodies, the only tidal forces that have a net effect on the dynamics are those that are applied on the same body that exerts the deformation, as long as no mean-motion resonance exists between the masses. We apply the model to the Kepler-38 binary system. The evolution of the spin equations shows that the planet reaches a stationary solution much faster than the stars, and the equilibrium spin frequency is sub-synchronous. The binary components evolve on a longer timescale, reaching a super-synchronous solution very close to that derived for the 2-body problem. After reaching spin stationarity, the eccentricity is damped in all bodies and for all the parameters analyzed here. A similar effect is noted for the binary separation. The semimajor axis of the planet, on the other hand, may migrate inwards or outwards, depending on the masses and orbital parameters. In some cases the secular evolution of the system may also exhibit an alignment of the pericenters, which requires including additional terms in the tidal model. Finally, we derive analytical expressions for the variational equations of the orbital evolution and spin rates based on low-order elliptical expansions in the semimajor axis ratio and the eccentricities. These are found to reduce to the 2-body case when one of the masses is taken equal to zero. This model allows us to find a closed and simple analytical expression for the stationary spin rates of all the bodies, as well as to predict the direction and magnitude of the orbital migration.
Submitted 12 June, 2019;
originally announced June 2019.
-
Secular models and Kozai resonance for planets in coorbital non-coplanar motion
Authors:
Cristian A. Giuppone,
Alejandro M. Leiva
Abstract:
In this work, we construct and test analytical and semianalytical secular models for two planets locked in coorbital non-coplanar motion, comparing some results with the case of the restricted three-body problem.
The analytical average model replicates the numerical N-body integrations, even for moderate eccentricities ($\lesssim$ 0.3) and inclinations ($\lesssim10^\circ$), except for the regions corresponding to quasi-satellite and Lidov-Kozai configurations. Furthermore, this model is also useful in the restricted three-body problem, assuming a very low mass ratio between the planets. We also describe a four-degree-of-freedom semianalytical model valid for any type of coorbital configuration in a wide range of eccentricities and inclinations.
Using an N-body integrator, we find that the phase space of the General Three Body Problem differs from the restricted case for inclined systems, and we establish the location of the Lidov-Kozai equilibrium configurations depending on the mass ratio. We study the stability of periodic orbits in the inclined systems, and find that, apart from the robust configurations $L_4$, $AL_4$, and $QS$, it is possible to harbour two Earth-like planets in orbits previously identified as unstable ($U$) and also in Euler $L_3$ configurations, with bounded chaos.
Submitted 28 April, 2016;
originally announced April 2016.
-
MAMA: An Algebraic Map for the Secular Dynamics of Planetesimals in Tight Binary Systems
Authors:
A. M. Leiva,
J. A. Correa-Otto,
C. Beaugé
Abstract:
We present an algebraic map (MAMA) for the dynamical and collisional evolution of a planetesimal swarm orbiting the main star of a tight binary system (TBS). The orbital evolution of each planetesimal is dictated by the secular perturbations of the secondary star and gas drag due to interactions with a protoplanetary disk. The gas disk is assumed eccentric with a constant precession rate. Gravitational interactions between the planetesimals are ignored. All bodies are assumed coplanar. A comparison with full N-body simulations shows that the map is of the order of 100 times faster, while preserving all the main characteristics of the full system.
In the second part of the work, we apply MAMA to the γ-Cephei system, searching for friendly scenarios that may explain the formation of the giant planet detected in this system. For low-mass protoplanetary disks, we find that a low-eccentricity static disk aligned with the binary yields impact velocities between planetesimals below the disruption threshold. All other scenarios appear hostile to planetary formation.
Submitted 1 October, 2013;
originally announced October 2013.
-
Secular dynamics of planetesimals in tight binary systems: Application to Gamma-Cephei
Authors:
C. A. Giuppone,
A. M. Leiva,
J. Correa-Otto,
C. Beauge
Abstract:
The secular dynamics of small planetesimals in tight binary systems play a fundamental role in establishing the possibility of accretional collisions in such extreme cases. The most important secular parameters are the forced eccentricity and secular frequency, which depend on the initial conditions of the particles, as well as on the mass and orbital parameters of the secondary star. We construct a second-order theory (with respect to the masses) for the planar secular motion of small planetesimals and deduce new expressions for the forced eccentricity and secular frequency. We also reanalyze the radial velocity data available for Gamma-Cephei and present a series of orbital solutions leading to residuals compatible with the best fits. Finally, we discuss how different orbital configurations for Gamma-Cephei may affect the dynamics of small bodies in circumstellar motion. For Gamma-Cephei, we find that the classical first-order expressions for the secular frequency and forced eccentricity lead to large inaccuracies, around 50%, for semimajor axes larger than one tenth of the orbital separation between the stellar components. Low eccentricities and/or masses reduce the importance of the second-order terms. The dynamics of small planetesimals show only a weak dependence on the orbital fits of the stellar components, and the same result is found when including the effects of a nonlinear gas drag. Thus, the possibility of planetary formation in this binary system appears largely insensitive to the orbital fits adopted for the stellar components, and any future alterations in the system parameters (due to new observations) should not change this picture. Finally, we show that planetesimals migrating because of gas drag may be trapped in mean-motion resonances with the binary, even though the migration is divergent.
Submitted 2 May, 2011;
originally announced May 2011.
-
Dynamics of Planetesimals due to Gas Drag from an Eccentric Precessing Disk
Authors:
C. Beauge,
A. M. Leiva,
N. Haghighipour,
J. Correa Otto
Abstract:
We analyze the dynamics of individual kilometer-size planetesimals in circumstellar orbits of a tight binary system. We include both the gravitational perturbations of the secondary star and a non-linear gas drag stemming from an eccentric gas disk with a finite precession rate. We consider several precession rates and eccentricities for the gas, and compare the results with a static disk in circular orbit.
The disk precession introduces three main differences with respect to the classical static case: (i) The equilibrium secular solutions generated by the gas drag are no longer fixed points in the averaged system, but limit cycles with frequency equal to the precession rate of the gas. The amplitude of the cycle is inversely dependent on the body size, reaching negligible values for $\sim 50$ km size planetesimals. (ii) The maximum final eccentricity attainable by small bodies is restricted to the interval between the gas eccentricity and the forced eccentricity, and apsidal alignment is no longer guaranteed for planetesimals strongly coupled with the gas. (iii) The characteristic timescales of orbital decay and secular evolution decrease significantly with increasing precession rates, with values up to two orders of magnitude smaller than for static disks.
Finally, we apply this analysis to the $γ$-Cephei system and estimate impact velocities for different size bodies and values of the gas eccentricity. For high disk eccentricities, we find that the disk precession decreases the velocity dispersion between different size planetesimals, thus contributing to accretional collisions in the outer parts of the disk. The opposite occurs for almost circular gas disks, where precession generates an increase in the relative velocities.
Submitted 8 June, 2010;
originally announced June 2010.
-
Mapping the $ν_\odot$ Secular Resonance for Retrograde Irregular Satellites
Authors:
J. Correa Otto,
A. M. Leiva,
C. A. Giuppone,
C. Beaugé
Abstract:
Constructing dynamical maps from the filtered output of numerical integrations, we analyze the structure of the $ν_\odot$ secular resonance for fictitious irregular satellites in retrograde orbits. This commensurability is associated with the secular angle $θ= \varpi - \varpi_\odot$, where $\varpi$ is the longitude of pericenter of the satellite and $\varpi_\odot$ corresponds to the (fixed) planetocentric orbit of the Sun. Our study is performed in the restricted three-body problem, where the satellites are considered as massless particles around a massive planet and perturbed by the Sun. Depending on the initial conditions, the resonance presents a diversity of possible resonant modes, including librations of $θ$ around zero (as found for Sinope and Pasiphae) or 180 degrees, as well as asymmetric librations (e.g. Narvi). Symmetric modes are present in all giant planets, although each regime appears restricted to certain values of the satellite inclination. Asymmetric solutions, on the other hand, seem absent around Neptune due to its almost circular heliocentric orbit. Simulating the effects of a smooth orbital migration on the satellite, we find that the resonance lock is preserved as long as the induced change in semimajor axis is much slower than the period of the resonant angle (adiabatic limit). However, the librational mode may vary during the process, switching between symmetric and asymmetric oscillations. Finally, we present a simple scaling transformation that allows us to estimate the resonant structure around any giant planet from the results calculated around a single primary mass.
Submitted 12 November, 2009;
originally announced November 2009.
-
The Earth-Moon CR3BP: A full Atlas of low-energy fast periodic transfer orbits
Authors:
Alejandro M. Leiva,
Carlos B. Briozzo
Abstract:
In the framework of the planar CR3BP for mass parameter mu=0.0121505, corresponding to the Earth-Moon system, we identify and describe 80 families of periodic orbits encircling both the Earth and the Moon ("transfer" orbits). All the orbits in these families have very low energies, most of them corresponding to values of the Jacobi constant C for which the Hill surface is closed at the Lagrangian point L2. All of these orbits also have a short period T, generally under six months. Most of the families are composed of orbits that are asymmetric with respect to the Earth-Moon axis.
The main results presented for each family are: (i) the characteristic curves T(h), y(h), v_y(h), and v_x(h) on the Poincare section Sigma_1={x=0.836915310,y,v_x>0,v_y} normal to the Earth-Moon axis at the Lagrangian point L1, parameterized by their energy h=-C/2 in the synodic coordinate system; (ii) the stability parameter along each family; (iii) the intersections x_i(h) of the orbits with the Earth-Moon axis, on the Poincare section Sigma_2={x,y=0,v_x,v_y>0}; (iv) plots of some selected orbits and details of their circumlunar region; and (v) numerical data for the intersection of an orbit with Sigma_1 at a reference value of h. Some possible extensions and applications of this work are also discussed.
Submitted 14 December, 2006;
originally announced December 2006.