-
Membership Inference Attacks on Sequence Models
Authors:
Lorenzo Rossi,
Michael Aerni,
Jie Zhang,
Florian Tramèr
Abstract:
Sequence models, such as Large Language Models (LLMs) and autoregressive image generators, have a tendency to memorize and inadvertently leak sensitive information. While this tendency has critical legal implications, existing tools are insufficient to audit the resulting risks. We hypothesize that those tools' shortcomings are due to mismatched assumptions. Thus, we argue that effectively measuri…
▽ More
Sequence models, such as Large Language Models (LLMs) and autoregressive image generators, have a tendency to memorize and inadvertently leak sensitive information. While this tendency has critical legal implications, existing tools are insufficient to audit the resulting risks. We hypothesize that those tools' shortcomings are due to mismatched assumptions. Thus, we argue that effectively measuring privacy leakage in sequence models requires leveraging the correlations inherent in sequential generation. To illustrate this, we adapt a state-of-the-art membership inference attack to explicitly model within-sequence correlations, thereby demonstrating how a strong existing attack can be naturally extended to suit the structure of sequence models. Through a case study, we show that our adaptations consistently improve the effectiveness of memorization audits without introducing additional computational costs. Our work hence serves as an important stepping stone toward reliable memorization audits for large sequence models.
△ Less
Submitted 5 June, 2025;
originally announced June 2025.
-
Reaction-diffusion equations in periodic media: convergence to pulsating fronts
Authors:
Hongjun Guo,
François Hamel,
Luca Rossi
Abstract:
This paper is concerned with reaction-diffusion-advection equations in spatially periodic media. Under an assumption of weak stability of the constant states 0 and 1, and of existence of pulsating traveling fronts connecting them, we show that fronts' profiles appear, along sequences of times and points, in the large-time dynamics of the solutions of the Cauchy problem, whether their initial suppo…
▽ More
This paper is concerned with reaction-diffusion-advection equations in spatially periodic media. Under an assumption of weak stability of the constant states 0 and 1, and of existence of pulsating traveling fronts connecting them, we show that fronts' profiles appear, along sequences of times and points, in the large-time dynamics of the solutions of the Cauchy problem, whether their initial supports are bounded or unbounded. The types of equations that fit into our assumptions are the combustion and the bistable ones. We also show a generalized Freidlin-G{ä}rtner formula and other geometrical properties of the asymptotic invasion shapes, or spreading sets, of invading solutions, and we relate these sets to the upper level sets of the solutions.
△ Less
Submitted 26 May, 2025;
originally announced May 2025.
-
Increased hydrogen escape from Mars atmosphere during periods of high obliquity
Authors:
Gabriella Gilli,
Francisco González-Galindo,
Jean-Yves Chaufray,
Ehouarn Millour,
François Forget,
Franck Montmessin,
Franck Lefèvre,
Joseph Naar,
Yangcheng Luo,
Margaux Vals,
Loïc Rossi,
Miguel Ángel López-Valverde,
Adrián Brines
Abstract:
It is still unknown how much water has escaped from Mars during its history. Hydrogen escape from Mars's atmosphere probably played a major role in drying the planet, but present-day Hloss rates (about 3x10^26 atoms per second on average) cannot explain the geological evidence for the large volumes of liquid water on ancient Mars. Here we used the three-dimensional Mars-Planetary Climate Model to…
▽ More
It is still unknown how much water has escaped from Mars during its history. Hydrogen escape from Mars's atmosphere probably played a major role in drying the planet, but present-day Hloss rates (about 3x10^26 atoms per second on average) cannot explain the geological evidence for the large volumes of liquid water on ancient Mars. Here we used the three-dimensional Mars-Planetary Climate Model to show that H loss rates could have increased by more than one order of magnitude (6x10^27 atoms per second) during higher spin axis obliquity periods, notably in the last few million years when Mars's obliquity was about 35 deg on average. The resulting accumulated H escape over Mars's history translates into an approx. 80 m global equivalent layer, which is close to the lower limit of geological estimates, assessing the major role of atmospheric escape in drying Mars.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
Digit expansions in rational and algebraic basis
Authors:
Lucía Rossi
Abstract:
Consider $α\in \Q(i)$ satisfying $|α| >1$. Let $\D = \{0,1,\ldots,|a_0|-1\}$, where $a_0$ is the independent coefficient of the minimal primitive polynomial of $α$. We introduce a way of expanding complex numbers in base $α$ with digits in $\D$ that we call $α$-expansions, which generalize rational base number systems introduced by Akiyama, Frougny and Sakarovitch, and are related to rational self…
▽ More
Consider $α\in \Q(i)$ satisfying $|α| >1$. Let $\D = \{0,1,\ldots,|a_0|-1\}$, where $a_0$ is the independent coefficient of the minimal primitive polynomial of $α$. We introduce a way of expanding complex numbers in base $α$ with digits in $\D$ that we call $α$-expansions, which generalize rational base number systems introduced by Akiyama, Frougny and Sakarovitch, and are related to rational self-affine tiles introduced by Steiner and Thuswaldner. We define an algorithm to obtain the expansions for certain Gaussian integers and show results on the language. We then extend the expansions to all $x \in \C$ (or $x \in \R$ when $α= \ab \in \Q$, the rational case will be our starting point) and show that they are unique almost everywhere. We relate them to tilings of the complex plane. We characterize $α$-expansions in terms of $p$-adic completions of $\Q(i)$ with respect to Gaussian primes.
△ Less
Submitted 20 May, 2025;
originally announced May 2025.
-
Future Circular Collider Feasibility Study Report: Volume 2, Accelerators, Technical Infrastructure and Safety
Authors:
M. Benedikt,
F. Zimmermann,
B. Auchmann,
W. Bartmann,
J. P. Burnet,
C. Carli,
A. Chancé,
P. Craievich,
M. Giovannozzi,
C. Grojean,
J. Gutleber,
K. Hanke,
A. Henriques,
P. Janot,
C. Lourenço,
M. Mangano,
T. Otto,
J. Poole,
S. Rajagopalan,
T. Raubenheimer,
E. Todesco,
L. Ulrici,
T. Watson,
G. Wilkinson,
A. Abada
, et al. (1439 additional authors not shown)
Abstract:
In response to the 2020 Update of the European Strategy for Particle Physics, the Future Circular Collider (FCC) Feasibility Study was launched as an international collaboration hosted by CERN. This report describes the FCC integrated programme, which consists of two stages: an electron-positron collider (FCC-ee) in the first phase, serving as a high-luminosity Higgs, top, and electroweak factory;…
▽ More
In response to the 2020 Update of the European Strategy for Particle Physics, the Future Circular Collider (FCC) Feasibility Study was launched as an international collaboration hosted by CERN. This report describes the FCC integrated programme, which consists of two stages: an electron-positron collider (FCC-ee) in the first phase, serving as a high-luminosity Higgs, top, and electroweak factory; followed by a proton-proton collider (FCC-hh) at the energy frontier in the second phase.
FCC-ee is designed to operate at four key centre-of-mass energies: the Z pole, the WW production threshold, the ZH production peak, and the top/anti-top production threshold - delivering the highest possible luminosities to four experiments. Over 15 years of operation, FCC-ee will produce more than 6 trillion Z bosons, 200 million WW pairs, nearly 3 million Higgs bosons, and 2 million top anti-top pairs. Precise energy calibration at the Z pole and WW threshold will be achieved through frequent resonant depolarisation of pilot bunches. The sequence of operation modes remains flexible.
FCC-hh will operate at a centre-of-mass energy of approximately 85 TeV - nearly an order of magnitude higher than the LHC - and is designed to deliver 5 to 10 times the integrated luminosity of the HL-LHC. Its mass reach for direct discovery extends to several tens of TeV. In addition to proton-proton collisions, FCC-hh is capable of supporting ion-ion, ion-proton, and lepton-hadron collision modes.
This second volume of the Feasibility Study Report presents the complete design of the FCC-ee collider, its operation and staging strategy, the full-energy booster and injector complex, required accelerator technologies, safety concepts, and technical infrastructure. It also includes the design of the FCC-hh hadron collider, development of high-field magnets, hadron injector options, and key technical systems for FCC-hh.
△ Less
Submitted 25 April, 2025;
originally announced May 2025.
-
Future Circular Collider Feasibility Study Report: Volume 3, Civil Engineering, Implementation and Sustainability
Authors:
M. Benedikt,
F. Zimmermann,
B. Auchmann,
W. Bartmann,
J. P. Burnet,
C. Carli,
A. Chancé,
P. Craievich,
M. Giovannozzi,
C. Grojean,
J. Gutleber,
K. Hanke,
A. Henriques,
P. Janot,
C. Lourenço,
M. Mangano,
T. Otto,
J. Poole,
S. Rajagopalan,
T. Raubenheimer,
E. Todesco,
L. Ulrici,
T. Watson,
G. Wilkinson,
P. Azzi
, et al. (1439 additional authors not shown)
Abstract:
Volume 3 of the FCC Feasibility Report presents studies related to civil engineering, the development of a project implementation scenario, and environmental and sustainability aspects. The report details the iterative improvements made to the civil engineering concepts since 2018, taking into account subsurface conditions, accelerator and experiment requirements, and territorial considerations. I…
▽ More
Volume 3 of the FCC Feasibility Report presents studies related to civil engineering, the development of a project implementation scenario, and environmental and sustainability aspects. The report details the iterative improvements made to the civil engineering concepts since 2018, taking into account subsurface conditions, accelerator and experiment requirements, and territorial considerations. It outlines a technically feasible and economically viable civil engineering configuration that serves as the baseline for detailed subsurface investigations, construction design, cost estimation, and project implementation planning. Additionally, the report highlights ongoing subsurface investigations in key areas to support the development of an improved 3D subsurface model of the region.
The report describes development of the project scenario based on the 'avoid-reduce-compensate' iterative optimisation approach. The reference scenario balances optimal physics performance with territorial compatibility, implementation risks, and costs. Environmental field investigations covering almost 600 hectares of terrain - including numerous urban, economic, social, and technical aspects - confirmed the project's technical feasibility and contributed to the preparation of essential input documents for the formal project authorisation phase. The summary also highlights the initiation of public dialogue as part of the authorisation process. The results of a comprehensive socio-economic impact assessment, which included significant environmental effects, are presented. Even under the most conservative and stringent conditions, a positive benefit-cost ratio for the FCC-ee is obtained. Finally, the report provides a concise summary of the studies conducted to document the current state of the environment.
△ Less
Submitted 25 April, 2025;
originally announced May 2025.
-
Future Circular Collider Feasibility Study Report: Volume 1, Physics, Experiments, Detectors
Authors:
M. Benedikt,
F. Zimmermann,
B. Auchmann,
W. Bartmann,
J. P. Burnet,
C. Carli,
A. Chancé,
P. Craievich,
M. Giovannozzi,
C. Grojean,
J. Gutleber,
K. Hanke,
A. Henriques,
P. Janot,
C. Lourenço,
M. Mangano,
T. Otto,
J. Poole,
S. Rajagopalan,
T. Raubenheimer,
E. Todesco,
L. Ulrici,
T. Watson,
G. Wilkinson,
P. Azzi
, et al. (1439 additional authors not shown)
Abstract:
Volume 1 of the FCC Feasibility Report presents an overview of the physics case, experimental programme, and detector concepts for the Future Circular Collider (FCC). This volume outlines how FCC would address some of the most profound open questions in particle physics, from precision studies of the Higgs and EW bosons and of the top quark, to the exploration of physics beyond the Standard Model.…
▽ More
Volume 1 of the FCC Feasibility Report presents an overview of the physics case, experimental programme, and detector concepts for the Future Circular Collider (FCC). This volume outlines how FCC would address some of the most profound open questions in particle physics, from precision studies of the Higgs and EW bosons and of the top quark, to the exploration of physics beyond the Standard Model. The report reviews the experimental opportunities offered by the staged implementation of FCC, beginning with an electron-positron collider (FCC-ee), operating at several centre-of-mass energies, followed by a hadron collider (FCC-hh). Benchmark examples are given of the expected physics performance, in terms of precision and sensitivity to new phenomena, of each collider stage. Detector requirements and conceptual designs for FCC-ee experiments are discussed, as are the specific demands that the physics programme imposes on the accelerator in the domains of the calibration of the collision energy, and the interface region between the accelerator and the detector. The report also highlights advances in detector, software and computing technologies, as well as the theoretical tools /reconstruction techniques that will enable the precision measurements and discovery potential of the FCC experimental programme. This volume reflects the outcome of a global collaborative effort involving hundreds of scientists and institutions, aided by a dedicated community-building coordination, and provides a targeted assessment of the scientific opportunities and experimental foundations of the FCC programme.
△ Less
Submitted 25 April, 2025;
originally announced May 2025.
-
The Muon Collider
Authors:
Carlotta Accettura,
Simon Adrian,
Rohit Agarwal,
Claudia Ahdida,
Chiara Aime',
Avni Aksoy,
Gian Luigi Alberghi,
Siobhan Alden,
Luca Alfonso,
Muhammad Ali,
Anna Rita Altamura,
Nicola Amapane,
Kathleen Amm,
David Amorim,
Paolo Andreetto,
Fabio Anulli,
Ludovica Aperio Bella,
Rob Appleby,
Artur Apresyan,
Pouya Asadi,
Mohammed Attia Mahmoud,
Bernhard Auchmann,
John Back,
Anthony Badea,
Kyu Jung Bae
, et al. (433 additional authors not shown)
Abstract:
Muons offer a unique opportunity to build a compact high-energy electroweak collider at the 10 TeV scale. A Muon Collider enables direct access to the underlying simplicity of the Standard Model and unparalleled reach beyond it. It will be a paradigm-shifting tool for particle physics representing the first collider to combine the high-energy reach of a proton collider and the high precision of an…
▽ More
Muons offer a unique opportunity to build a compact high-energy electroweak collider at the 10 TeV scale. A Muon Collider enables direct access to the underlying simplicity of the Standard Model and unparalleled reach beyond it. It will be a paradigm-shifting tool for particle physics representing the first collider to combine the high-energy reach of a proton collider and the high precision of an electron-positron collider, yielding a physics potential significantly greater than the sum of its individual parts. A high-energy muon collider is the natural next step in the exploration of fundamental physics after the HL-LHC and a natural complement to a future low-energy Higgs factory. Such a facility would significantly broaden the scope of particle colliders, engaging the many frontiers of the high energy community.
The last European Strategy for Particle Physics Update and later the Particle Physics Project Prioritisation Panel in the US requested a study of the muon collider, which is being carried on by the International Muon Collider Collaboration. In this comprehensive document we present the physics case, the state of the work on accelerator design and technology, and propose an R\&D project that can make the muon collider a reality.
△ Less
Submitted 30 April, 2025;
originally announced April 2025.
-
High Field Magnet Programme -- European Strategy Input
Authors:
B. Auchmann,
E. Todesco,
A. Ballarino,
N. Bagrets,
A. Milanese,
E. Rochepault,
L. Rossi,
C. Senatore,
F. Toral
Abstract:
In this submission, we describe research goals, implementation, and timelines of the High Field Magnet Programme, hosted by CERN. The programme pursues accelerator-magnet R&D with low-temperature- and high-temperature superconductor technology with a main focus on the FCC-hh. Following a long tradition of magnet R&D for high-energy particle colliders, HFM R&D fosters important societal impact thro…
▽ More
In this submission, we describe research goals, implementation, and timelines of the High Field Magnet Programme, hosted by CERN. The programme pursues accelerator-magnet R&D with low-temperature- and high-temperature superconductor technology with a main focus on the FCC-hh. Following a long tradition of magnet R&D for high-energy particle colliders, HFM R&D fosters important societal impact through synergies with other fields.
△ Less
Submitted 23 April, 2025;
originally announced April 2025.
-
RealHarm: A Collection of Real-World Language Model Application Failures
Authors:
Pierre Le Jeune,
Jiaen Liu,
Luca Rossi,
Matteo Dora
Abstract:
Language model deployments in consumer-facing applications introduce numerous risks. While existing research on harms and hazards of such applications follows top-down approaches derived from regulatory frameworks and theoretical analyses, empirical evidence of real-world failure modes remains underexplored. In this work, we introduce RealHarm, a dataset of annotated problematic interactions with…
▽ More
Language model deployments in consumer-facing applications introduce numerous risks. While existing research on harms and hazards of such applications follows top-down approaches derived from regulatory frameworks and theoretical analyses, empirical evidence of real-world failure modes remains underexplored. In this work, we introduce RealHarm, a dataset of annotated problematic interactions with AI agents built from a systematic review of publicly reported incidents. Analyzing harms, causes, and hazards specifically from the deployer's perspective, we find that reputational damage constitutes the predominant organizational harm, while misinformation emerges as the most common hazard category. We empirically evaluate state-of-the-art guardrails and content moderation systems to probe whether such systems would have prevented the incidents, revealing a significant gap in the protection of AI applications.
△ Less
Submitted 14 April, 2025;
originally announced April 2025.
-
BioX-CPath: Biologically-driven Explainable Diagnostics for Multistain IHC Computational Pathology
Authors:
Amaya Gallagher-Syed,
Henry Senior,
Omnia Alwazzan,
Elena Pontarini,
Michele Bombardieri,
Costantino Pitzalis,
Myles J. Lewis,
Michael R. Barnes,
Luca Rossi,
Gregory Slabaugh
Abstract:
The development of biologically interpretable and explainable models remains a key challenge in computational pathology, particularly for multistain immunohistochemistry (IHC) analysis. We present BioX-CPath, an explainable graph neural network architecture for whole slide image (WSI) classification that leverages both spatial and semantic features across multiple stains. At its core, BioX-CPath i…
▽ More
The development of biologically interpretable and explainable models remains a key challenge in computational pathology, particularly for multistain immunohistochemistry (IHC) analysis. We present BioX-CPath, an explainable graph neural network architecture for whole slide image (WSI) classification that leverages both spatial and semantic features across multiple stains. At its core, BioX-CPath introduces a novel Stain-Aware Attention Pooling (SAAP) module that generates biologically meaningful, stain-aware patient embeddings. Our approach achieves state-of-the-art performance on both Rheumatoid Arthritis and Sjogren's Disease multistain datasets. Beyond performance metrics, BioX-CPath provides interpretable insights through stain attention scores, entropy measures, and stain interaction scores, that permit measuring model alignment with known pathological mechanisms. This biological grounding, combined with strong classification performance, makes BioX-CPath particularly suitable for clinical applications where interpretability is key. Source code and documentation can be found at: https://github.com/AmayaGS/BioX-CPath.
△ Less
Submitted 3 April, 2025; v1 submitted 26 March, 2025;
originally announced March 2025.
-
PHGNN: A Novel Prompted Hypergraph Neural Network to Diagnose Alzheimer's Disease
Authors:
Chenyu Liu,
Luca Rossi
Abstract:
The accurate diagnosis of Alzheimer's disease (AD) and prognosis of mild cognitive impairment (MCI) conversion are crucial for early intervention. However, existing multimodal methods face several challenges, from the heterogeneity of input data, to underexplored modality interactions, missing data due to patient dropouts, and limited data caused by the time-consuming and costly data collection pr…
▽ More
The accurate diagnosis of Alzheimer's disease (AD) and prognosis of mild cognitive impairment (MCI) conversion are crucial for early intervention. However, existing multimodal methods face several challenges, from the heterogeneity of input data, to underexplored modality interactions, missing data due to patient dropouts, and limited data caused by the time-consuming and costly data collection process. In this paper, we propose a novel Prompted Hypergraph Neural Network (PHGNN) framework that addresses these limitations by integrating hypergraph based learning with prompt learning. Hypergraphs capture higher-order relationships between different modalities, while our prompt learning approach for hypergraphs, adapted from NLP, enables efficient training with limited data. Our model is validated through extensive experiments on the ADNI dataset, outperforming SOTA methods in both AD diagnosis and the prediction of MCI conversion.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
SuperCap: Multi-resolution Superpixel-based Image Captioning
Authors:
Henry Senior,
Luca Rossi,
Gregory Slabaugh,
Shanxin Yuan
Abstract:
It has been a longstanding goal within image captioning to move beyond a dependence on object detection. We investigate using superpixels coupled with Vision Language Models (VLMs) to bridge the gap between detector-based captioning architectures and those that solely pretrain on large datasets. Our novel superpixel approach ensures that the model receives object-like features whilst the use of VL…
▽ More
It has been a longstanding goal within image captioning to move beyond a dependence on object detection. We investigate using superpixels coupled with Vision Language Models (VLMs) to bridge the gap between detector-based captioning architectures and those that solely pretrain on large datasets. Our novel superpixel approach ensures that the model receives object-like features whilst the use of VLMs provides our model with open set object understanding. Furthermore, we extend our architecture to make use of multi-resolution inputs, allowing our model to view images in different levels of detail, and use an attention mechanism to determine which parts are most relevant to the caption. We demonstrate our model's performance with multiple VLMs and through a range of ablations detailing the impact of different architectural choices. Our full model achieves a competitive CIDEr score of $136.9$ on the COCO Karpathy split.
△ Less
Submitted 11 March, 2025;
originally announced March 2025.
-
Stability of propagating terraces in spatially periodic multistable equations in $\mathbb{R}^N$
Authors:
Thomas Giletti,
Luca Rossi
Abstract:
In this paper, we study the large time behaviour of solutions of multistable reaction-diffusion equations in $\mathbb{R}^N$, with a spatially periodic heterogeneity. By multistable, we mean that the problem admits a finite -- but arbitrarily large -- number of stable, periodic steady states. In contrast with the more classical monostable and bistable frameworks, which exhibit the emergence of a si…
▽ More
In this paper, we study the large time behaviour of solutions of multistable reaction-diffusion equations in $\mathbb{R}^N$, with a spatially periodic heterogeneity. By multistable, we mean that the problem admits a finite -- but arbitrarily large -- number of stable, periodic steady states. In contrast with the more classical monostable and bistable frameworks, which exhibit the emergence of a single travelling front in the long run, in the present case the large time dynamics is governed by a family of stacked travelling fronts, involving intermediate steady states, called propagating terrace. Their existence in the multidimensional case has been established in our previous work [13]. The first result of the present paper is their uniqueness. Next, we show that the speeds of the propagating terraces in different directions dictate the spreading speeds of solutions of the Cauchy problem, for both planar-like and compactly supported initial data. The latter case turns out to be much more intricate than the former, due to the fact that the propagating terraces in distinct directions may involve different sets of intermediate steady states. Another source of difficulty is that the Wulff shape of the speeds of travelling fronts can be non-smooth, as we show in the bistable case using a result of [4].
△ Less
Submitted 10 March, 2025;
originally announced March 2025.
-
TikTok StitchGraph: Characterizing communication patterns on TikTok through a collection of interaction networks
Authors:
Mads Høgenhaug,
Marcus Friis,
Morten Pedersen,
Luca Rossi
Abstract:
We present TikTok StitchGraph: a collection of 36 graphs based on TikTok stitches. With its rapid growth and widespread popularity, TikTok presents a compelling platform for study, yet given its video-first nature the network structure of the conversations that it hosts remains largely unexplored. Leveraging its recently released APIs, in combination with web scraping, we construct graphs detailin…
▽ More
We present TikTok StitchGraph: a collection of 36 graphs based on TikTok stitches. With its rapid growth and widespread popularity, TikTok presents a compelling platform for study, yet given its video-first nature the network structure of the conversations that it hosts remains largely unexplored. Leveraging its recently released APIs, in combination with web scraping, we construct graphs detailing stitch relations from both a video- and user-centric perspective. Specifically, we focus on user multi-digraphs, with vertices representing users and edges representing directed stitch relations. From the user graphs, we characterize common communication patterns of the stitch using frequent subgraph mining, finding a preference for stars and star-like structures, an aversion towards cyclic structures, and directional disposition favoring in- and out-stars over mixed-direction structures. These structures are augmented with sentiment labels in the form of edge attributes. We then use these subgraphs for graph-level embeddings together with Graph2Vec, we show no clear distinction between topologies for different hashtag topic categories. Lastly, we compare our StitchGraphs to Twitter reply networks and show that a remakable similarity between the conversation networks on the two platforms.
△ Less
Submitted 3 April, 2025; v1 submitted 25 February, 2025;
originally announced February 2025.
-
Generalized principal eigenvalues for parabolic operators in bounded domains
Authors:
Henri Berestycki,
Grégoire Nadin,
Luca Rossi
Abstract:
We introduce here new generalized principal eigenvalues for linear parabolic operators with heterogeneous coefficients in space and time. We consider a bounded spatial domain and an unbounded time interval $I$ : $I=\mathbb{R},\ \mathbb{R}^+$ or $\mathbb{R}^-$, and operators with coefficients having a fairly general dependence on space and time. The notions we introduce rely on the parabolic maximu…
▽ More
We introduce here new generalized principal eigenvalues for linear parabolic operators with heterogeneous coefficients in space and time. We consider a bounded spatial domain and an unbounded time interval $I$ : $I=\mathbb{R},\ \mathbb{R}^+$ or $\mathbb{R}^-$, and operators with coefficients having a fairly general dependence on space and time. The notions we introduce rely on the parabolic maximum principle and extend some earlier definitions introduced for elliptic operators [BNV].
We first show that these eigenvalues hold the key to understanding the large time behavior and entire solutions of heterogeneous Fisher-KPP type equations. We then describe the relation of these quantities with principal Floquet bundles for parabolic operators which provides further characterizations of the principal eigenvalues. These allow us to derive monotonicity properties and comparisons between generalized principal eigenvalues, as well as perturbation results and further properties involving limit operators. We show that the sign of these eigenvalues encodes different versions of the maximum principle for parabolic operators. Lastly, we explicitly compute the generalized principal eigenvalues for several classes of operators such as spatial-independent, periodic, almost periodic, uniquely ergodic or random stationary ergodic coefficients.
△ Less
Submitted 23 February, 2025;
originally announced February 2025.
-
A Neural-Network Extraction of Unpolarised Transverse-Momentum-Dependent Distributions
Authors:
Alessandro Bacchetta,
Valerio Bertone,
Chiara Bissolotti,
Matteo Cerutti,
Marco Radici,
Simone Rodini,
Lorenzo Rossi
Abstract:
We present the first extraction of transverse-momentum-dependent distributions of unpolarised quarks from experimental Drell-Yan data using neural networks to parametrise their nonperturbative part. We show that neural networks outperform traditional parametrisations providing a more accurate description of data. This work establishes the feasibility of using neural networks to explore the multi-d…
▽ More
We present the first extraction of transverse-momentum-dependent distributions of unpolarised quarks from experimental Drell-Yan data using neural networks to parametrise their nonperturbative part. We show that neural networks outperform traditional parametrisations providing a more accurate description of data. This work establishes the feasibility of using neural networks to explore the multi-dimensional partonic structure of hadrons and paves the way for more accurate determinations based on machine-learning techniques.
△ Less
Submitted 6 February, 2025;
originally announced February 2025.
-
Looking into the faintEst WIth MUSE (LEWIS): Exploring the nature of ultra-diffuse galaxies in the Hydra-I cluster II. Stellar kinematics and dynamical masses
Authors:
Chiara Buttitta,
Enrichetta Iodice,
Goran Doll,
Johanna Hartke,
Michael Hilker,
Duncan A. Forbes,
Enrico M. Corsini,
Luca Rossi,
Magda Arnaboldi,
Michele Cantiello,
Giuseppe D'Ago,
Jesus Falcon-Barroso,
Marco Gullieuszik,
Antonio La Marca,
Steffen Mieske,
Marco Mirabile,
Maurizio Paolillo,
Marina Rejkuba,
Marilena Spavone,
Chiara Spiniello,
Marc Sarzi
Abstract:
Context: This paper focuses on a class of galaxies characterised by an extremely low surface brightness: the ultra-diffuse galaxies (UDGs). We used new integral-field spectroscopic data from the ESO Large Programme Looking into the faintEst WIth MUSE (LEWIS) project. Aims: Our main goals are addressing the formation channels and investigating possible correlations of their observational properties…
▽ More
Context: This paper focuses on a class of galaxies characterised by an extremely low surface brightness: the ultra-diffuse galaxies (UDGs). We used new integral-field spectroscopic data from the ESO Large Programme Looking into the faintEst WIth MUSE (LEWIS) project. Aims: Our main goals are addressing the formation channels and investigating possible correlations of their observational properties. In particular, we derive their stellar kinematics and dynamical properties. Methods: We extract the 1D stacked spectrum inside the effective radius to obtain an unbiased measure of $σ_{\rm eff}$. To derive the spatially-resolved stellar kinematics, we first apply the Voronoi tessellation algorithm to bin the spaxels in the datacube and then follow the same prescription adopted for the 1D case. In addition, we extract the velocity profiles along the galaxy's major and minor axes. Results: We find that 7 out of 18 UDGs in LEWIS show a mild rotation, 5 do not have evidence of any rotation, and the remaining 6 UDGs are unconstrained cases. This is the first large census of velocity profiles for UDGs. On average, UDGs in LEWIS are characterised by low values of $σ_{\rm eff}$, comparable with available values from the literature. In the Faber-Jackson relation plane, we found a group of UDGs consistent with the relation within the errorbars, whereas outliers are objects with non-negligible rotation components. UDGs and LSBs in LEWIS have larger dark matter content than dwarf galaxies with similar total luminosity. We do not find clear correlations between the derived properties and the local environment. Conclusions: Based on the stellar kinematics, two classes of UDGs are found in the Hydra I cluster: the rotating and non-rotating systems. This result, combined with other structural properties, can help to discriminate between the several formation scenarios proposed for UDGs.
△ Less
Submitted 27 January, 2025;
originally announced January 2025.
-
Transients versus network interactions give rise to multistability through trapping mechanism
Authors:
Kalel L. Rossi,
Everton S. Medeiros,
Peter Ashwin,
Ulrike Feudel
Abstract:
In networked systems, the interplay between the dynamics of individual subsystems and their network interactions has been found to generate multistability in various contexts. Despite its ubiquity, the specific mechanisms and ingredients that give rise to multistability from such interplay remain poorly understood. In a network of coupled excitable units, we show that this interplay generating mul…
▽ More
In networked systems, the interplay between the dynamics of individual subsystems and their network interactions has been found to generate multistability in various contexts. Despite its ubiquity, the specific mechanisms and ingredients that give rise to multistability from such interplay remain poorly understood. In a network of coupled excitable units, we show that this interplay generating multistability occurs through a competition between the units' transient dynamics and their coupling. Specifically, the diffusive coupling between the units manages to reinject them in the excitability region of their individual state space and effectively trap them there. We show that this trapping mechanism leads to the coexistence of multiple types of oscillations: periodic, quasiperiodic, and even chaotic, although the units separately do not oscillate. Interestingly, we show that the attractors emerge through different types of bifurcations - in particular, the periodic attractors emerge through either saddle-node of limit cycles bifurcations or homoclinic bifurcations - but in all cases the reinjection mechanism is present.
△ Less
Submitted 21 November, 2024;
originally announced November 2024.
-
MuCol Milestone Report No. 5: Preliminary Parameters
Authors:
Carlotta Accettura,
Simon Adrian,
Rohit Agarwal,
Claudia Ahdida,
Chiara Aimé,
Avni Aksoy,
Gian Luigi Alberghi,
Siobhan Alden,
Luca Alfonso,
Nicola Amapane,
David Amorim,
Paolo Andreetto,
Fabio Anulli,
Rob Appleby,
Artur Apresyan,
Pouya Asadi,
Mohammed Attia Mahmoud,
Bernhard Auchmann,
John Back,
Anthony Badea,
Kyu Jung Bae,
E. J. Bahng,
Lorenzo Balconi,
Fabrice Balli,
Laura Bandiera
, et al. (369 additional authors not shown)
Abstract:
This document is comprised of a collection of updated preliminary parameters for the key parts of the muon collider. The updated preliminary parameters follow on from the October 2023 Tentative Parameters Report. Particular attention has been given to regions of the facility that are believed to hold greater technical uncertainty in their design and that have a strong impact on the cost and power…
▽ More
This document is comprised of a collection of updated preliminary parameters for the key parts of the muon collider. The updated preliminary parameters follow on from the October 2023 Tentative Parameters Report. Particular attention has been given to regions of the facility that are believed to hold greater technical uncertainty in their design and that have a strong impact on the cost and power consumption of the facility. The data is collected from a collaborative spreadsheet and transferred to overleaf.
△ Less
Submitted 5 November, 2024;
originally announced November 2024.
-
CFTS-GAN: Continual Few-Shot Teacher Student for Generative Adversarial Networks
Authors:
Munsif Ali,
Leonardo Rossi,
Massimo Bertozzi
Abstract:
Few-shot and continual learning face two well-known challenges in GANs: overfitting and catastrophic forgetting. Learning new tasks results in catastrophic forgetting in deep learning models. In the case of a few-shot setting, the model learns from a very limited number of samples (e.g. 10 samples), which can lead to overfitting and mode collapse. So, this paper proposes a Continual Few-shot Teach…
▽ More
Few-shot and continual learning face two well-known challenges in GANs: overfitting and catastrophic forgetting. Learning new tasks results in catastrophic forgetting in deep learning models. In the case of a few-shot setting, the model learns from a very limited number of samples (e.g. 10 samples), which can lead to overfitting and mode collapse. So, this paper proposes a Continual Few-shot Teacher-Student technique for the generative adversarial network (CFTS-GAN) that considers both challenges together. Our CFTS-GAN uses an adapter module as a student to learn a new task without affecting the previous knowledge. To make the student model efficient in learning new tasks, the knowledge from a teacher model is distilled to the student. In addition, the Cross-Domain Correspondence (CDC) loss is used by both teacher and student to promote diversity and to avoid mode collapse. Moreover, an effective strategy of freezing the discriminator is also utilized for enhancing performance. Qualitative and quantitative results demonstrate more diverse image synthesis and produce qualitative samples comparatively good to very stronger state-of-the-art models.
△ Less
Submitted 17 October, 2024;
originally announced October 2024.
-
MCGM: Mask Conditional Text-to-Image Generative Model
Authors:
Rami Skaik,
Leonardo Rossi,
Tomaso Fontanini,
Andrea Prati
Abstract:
Recent advancements in generative models have revolutionized the field of artificial intelligence, enabling the creation of highly-realistic and detailed images. In this study, we propose a novel Mask Conditional Text-to-Image Generative Model (MCGM) that leverages the power of conditional diffusion models to generate pictures with specific poses. Our model builds upon the success of the Break-a-s…
▽ More
Recent advancements in generative models have revolutionized the field of artificial intelligence, enabling the creation of highly-realistic and detailed images. In this study, we propose a novel Mask Conditional Text-to-Image Generative Model (MCGM) that leverages the power of conditional diffusion models to generate pictures with specific poses. Our model builds upon the success of the Break-a-scene [1] model in generating new scenes using a single image with multiple subjects and incorporates a mask embedding injection that allows the conditioning of the generation process. By introducing this additional level of control, MCGM offers a flexible and intuitive approach for generating specific poses for one or more subjects learned from a single image, empowering users to influence the output based on their requirements. Through extensive experimentation and evaluation, we demonstrate the effectiveness of our proposed model in generating high-quality images that meet predefined mask conditions and improving the current Break-a-scene generative model.
△ Less
Submitted 1 October, 2024;
originally announced October 2024.
-
Reaction-diffusion model for a population structured in phenotype and space I -- Criterion for persistence
Authors:
Nathanaël Boutillon,
Luca Rossi
Abstract:
We consider a reaction-diffusion model for a population structured in phenotype. We assume that the population lives in a heterogeneous periodic environment, so that a given phenotypic trait may be more or less fit according to the spatial location. The model features spatial mobility of individuals as well as mutation.
We first prove the well-posedness of the model. Next, we derive a criterion…
▽ More
We consider a reaction-diffusion model for a population structured in phenotype. We assume that the population lives in a heterogeneous periodic environment, so that a given phenotypic trait may be more or less fit according to the spatial location. The model features spatial mobility of individuals as well as mutation.
We first prove the well-posedness of the model. Next, we derive a criterion for the persistence of the population which involves the generalised principal eigenvalue associated with the linearised elliptic operator. This notion allows us to handle the possible lack of coercivity of the operator. We then obtain a monotonicity result for the generalised principal eigenvalue, in terms of the frequency of spatial fluctuations of the environment and in terms of the spatial diffusivity. We deduce that the more heterogeneous is the environment, or the higher is the mobility of individuals, the harder is the persistence for the species.
This work lays the mathematical foundation to investigate some other optimisation problems for the environment to make persistence as hard or as easy as possible, which will be addressed in the forthcoming companion paper.
△ Less
Submitted 6 March, 2025; v1 submitted 30 September, 2024;
originally announced September 2024.
-
Exploring the three-dimensional momentum distribution of longitudinally polarized quarks in the proton
Authors:
Alessandro Bacchetta,
Alessia Bongallino,
Matteo Cerutti,
Marco Radici,
Lorenzo Rossi
Abstract:
By analyzing experimental data on semi-inclusive deep inelastic scattering off longitudinally polarized targets, we extract the transverse momentum dependence of the quark helicity distribution, i.e., the difference between the three-dimensional motion of quarks with polarization parallel or antiparallel to the longitudinal polarization of the parent hadron. We perform the analysis at next-to-lead…
▽ More
By analyzing experimental data on semi-inclusive deep inelastic scattering off longitudinally polarized targets, we extract the transverse momentum dependence of the quark helicity distribution, i.e., the difference between the three-dimensional motion of quarks with polarization parallel or antiparallel to the longitudinal polarization of the parent hadron. We perform the analysis at next-to-leading (NLL) and next-to-next-to-leading (NNLL) perturbative accuracy. The quality of the fit is very good for both cases, reaching a $χ^2$ per number of data points equal to $1.11$ and $1.09$, respectively. Although the limited number of data points leads to significant uncertainties, the data are consistent with an interpretation in which the helicity distribution is narrower in transverse momentum than the unpolarized distribution.
△ Less
Submitted 26 September, 2024;
originally announced September 2024.
-
Mamba-ST: State Space Model for Efficient Style Transfer
Authors:
Filippo Botti,
Alex Ergasti,
Leonardo Rossi,
Tomaso Fontanini,
Claudio Ferrari,
Massimo Bertozzi,
Andrea Prati
Abstract:
The goal of style transfer is, given a content image and a style source, generating a new image preserving the content but with the artistic representation of the style source. Most of the state-of-the-art architectures use transformers or diffusion-based models to perform this task, despite the heavy computational burden that they require. In particular, transformers use self- and cross-attention…
▽ More
The goal of style transfer is, given a content image and a style source, generating a new image preserving the content but with the artistic representation of the style source. Most of the state-of-the-art architectures use transformers or diffusion-based models to perform this task, despite the heavy computational burden that they require. In particular, transformers use self- and cross-attention layers which have large memory footprint, while diffusion models require high inference time. To overcome the above, this paper explores a novel design of Mamba, an emergent State-Space Model (SSM), called Mamba-ST, to perform style transfer. To do so, we adapt Mamba linear equation to simulate the behavior of cross-attention layers, which are able to combine two separate embeddings into a single output, but drastically reducing memory usage and time complexity. We modified the Mamba's inner equations so to accept inputs from, and combine, two separate data streams. To the best of our knowledge, this is the first attempt to adapt the equations of SSMs to a vision task like style transfer without requiring any other module like cross-attention or custom normalization layers. An extensive set of experiments demonstrates the superiority and efficiency of our method in performing style transfer compared to transformers and diffusion models. Results show improved quality in terms of both ArtFID and FID metrics. Code is available at https://github.com/FilippoBotti/MambaST.
△ Less
Submitted 16 September, 2024;
originally announced September 2024.
-
Interim report for the International Muon Collider Collaboration (IMCC)
Authors:
C. Accettura,
S. Adrian,
R. Agarwal,
C. Ahdida,
C. Aimé,
A. Aksoy,
G. L. Alberghi,
S. Alden,
N. Amapane,
D. Amorim,
P. Andreetto,
F. Anulli,
R. Appleby,
A. Apresyan,
P. Asadi,
M. Attia Mahmoud,
B. Auchmann,
J. Back,
A. Badea,
K. J. Bae,
E. J. Bahng,
L. Balconi,
F. Balli,
L. Bandiera,
C. Barbagallo
, et al. (362 additional authors not shown)
Abstract:
The International Muon Collider Collaboration (IMCC) [1] was established in 2020 following the recommendations of the European Strategy for Particle Physics (ESPP) and the implementation of the European Strategy for Particle Physics-Accelerator R&D Roadmap by the Laboratory Directors Group [2], hereinafter referred to as the the European LDG roadmap. The Muon Collider Study (MuC) covers the accele…
▽ More
The International Muon Collider Collaboration (IMCC) [1] was established in 2020 following the recommendations of the European Strategy for Particle Physics (ESPP) and the implementation of the European Strategy for Particle Physics-Accelerator R&D Roadmap by the Laboratory Directors Group [2], hereinafter referred to as the the European LDG roadmap. The Muon Collider Study (MuC) covers the accelerator complex, detectors and physics for a future muon collider. In 2023, European Commission support was obtained for a design study of a muon collider (MuCol) [3]. This project started on 1st March 2023, with work-packages aligned with the overall muon collider studies. In preparation of and during the 2021-22 U.S. Snowmass process, the muon collider project parameters, technical studies and physics performance studies were performed and presented in great detail. Recently, the P5 panel [4] in the U.S. recommended a muon collider R&D, proposed to join the IMCC and envisages that the U.S. should prepare to host a muon collider, calling this their "muon shot". In the past, the U.S. Muon Accelerator Programme (MAP) [5] has been instrumental in studies of concepts and technologies for a muon collider.
△ Less
Submitted 28 January, 2025; v1 submitted 17 July, 2024;
originally announced July 2024.
-
Urban mobility and learning: analyzing the influence of commuting time on students' GPA at Politecnico di Milano
Authors:
Arianna Burzacchi,
Lidia Rossi,
Tommaso Agasisti,
Anna Maria Paganoni,
Simone Vantini
Abstract:
Despite its crucial role in students' daily lives, commuting time remains an underexplored dimension in higher education research. To address this gap, this study focuses on challenges that students face in urban environments and investigates the impact of commuting time on the academic performance of first-year bachelor students of Politecnico di Milano, Italy. This research employs an innovative…
▽ More
Despite its crucial role in students' daily lives, commuting time remains an underexplored dimension in higher education research. To address this gap, this study focuses on challenges that students face in urban environments and investigates the impact of commuting time on the academic performance of first-year bachelor students of Politecnico di Milano, Italy. This research employs an innovative two-step methodology. In the initial phase, machine learning algorithms trained on privacy-preserving GPS data from anonymous users are used to construct accessibility maps to the university and to obtain an estimate of students' commuting times. In the subsequent phase, authors utilize polynomial linear mixed-effects models and investigate the factors influencing students' academic performance, with a particular emphasis on commuting time. Notably, this investigation incorporates a causal framework, which enables the establishment of causal relationships between commuting time and academic outcomes. The findings underscore the significant impact of travel time on students' performance and may support policies and implications aiming at improving students' educational experience in metropolitan areas. The study's innovation lies both in its exploration of a relatively uncharted factor and the novel methodologies applied in both phases.
△ Less
Submitted 16 July, 2024;
originally announced July 2024.
-
Backstepping control for the sterile mosquitoes technique: stabilization of extinction equilibrium
Authors:
Andrea Cristofaro,
Luca Rossi
Abstract:
The control of a mosquito population using the sterile insect technique is considered. Building on a model-based approach, where the control input is the release rate of sterilized males, we propose a non-negative backstepping control law capable of globally stabilizing the extinction equilibrium of the system. A simulation study supports and validates the theoretical findings, showing the efficac…
▽ More
The control of a mosquito population using the sterile insect technique is considered. Building on a model-based approach, where the control input is the release rate of sterilized males, we propose a non-negative backstepping control law capable of globally stabilizing the extinction equilibrium of the system. A simulation study supports and validates the theoretical findings, showing the efficacy of the approach both on a reduced model, used for control design, and on a complete model of the mosquito population dynamics.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Ordinal Mixed-Effects Random Forest
Authors:
Giulia Bergonzoli,
Lidia Rossi,
Chiara Masci
Abstract:
We propose an innovative statistical method, called Ordinal Mixed-Effect Random Forest (OMERF), that extends the use of random forest to the analysis of hierarchical data and ordinal responses. The model preserves the flexibility and ability of modeling complex patterns of both categorical and continuous variables, typical of tree-based ensemble methods, and, at the same time, takes into account t…
▽ More
We propose an innovative statistical method, called Ordinal Mixed-Effect Random Forest (OMERF), that extends the use of random forest to the analysis of hierarchical data and ordinal responses. The model preserves the flexibility and ability of modeling complex patterns of both categorical and continuous variables, typical of tree-based ensemble methods, and, at the same time, takes into account the structure of hierarchical data, modeling the dependence structure induced by the grouping and allowing statistical inference at all data levels. A simulation study is conducted to validate the performance of the proposed method and to compare it to the one of other state-of-the art models. The application of OMERF is exemplified in a case study focusing on predicting students performances using data from the Programme for International Student Assessment (PISA) 2022. The model identifies discriminating student characteristics and estimates the school-effect.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Flavor dependence of unpolarized quark Transverse Momentum Distributions from a global fit
Authors:
Alessandro Bacchetta,
Valerio Bertone,
Chiara Bissolotti,
Giuseppe Bozzi,
Matteo Cerutti,
Filippo Delcarro,
Marco Radici,
Lorenzo Rossi,
Andrea Signori
Abstract:
We present an extraction of the unpolarized transverse-momentum-dependent parton distribution and fragmentation functions that takes into account possible differences between quark flavors and final-state hadrons. The extraction is based on experimental measurements from Drell-Yan processes and semi-inclusive deep-inelastic scattering, whose combination is essential to distinguish flavor differenc…
▽ More
We present an extraction of the unpolarized transverse-momentum-dependent parton distribution and fragmentation functions that takes into account possible differences between quark flavors and final-state hadrons. The extraction is based on experimental measurements from Drell-Yan processes and semi-inclusive deep-inelastic scattering, whose combination is essential to distinguish flavor differences. The analysis is carried out at N$^3$LL accuracy. The extracted flavor-dependent distributions give a very good description of the data ($χ^2/N_{\rm dat} = 1.08$). The resulting uncertainties take fully into account also the uncertainties in the determination of the corresponding collinear distributions.
△ Less
Submitted 2 October, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
All-fiber, near-infrared, laser system at 780nm for atom cooling
Authors:
Matteo Marchesini,
Michelangelo Dondi,
Leonardo Rossi,
Gabriele Bolognini,
Marco Prevedelli,
Francesco Minardi
Abstract:
One of the prominent platforms for quantum technologies, cold atoms require reliable laser systems. We present the design, implementation, and characterization of a simple, compact, and economical laser system at 780 nm, entirely based on fiber components. Two semiconductor lasers at 1560 nm are amplified in a single Erbium-doped fiber amplifier and frequency-doubled in a periodically-poled lithiu…
▽ More
One of the prominent platforms for quantum technologies, cold atoms require reliable laser systems. We present the design, implementation, and characterization of a simple, compact, and economical laser system at 780 nm, entirely based on fiber components. Two semiconductor lasers at 1560 nm are amplified in a single Erbium-doped fiber amplifier and frequency-doubled in a periodically-poled lithium niobate crystal. We characterize the amplitude noise and the linewidths of the lasers, as well as the SHG efficiency. With a rms relative amplitude noise of 3$\times$10$^{-4}$ at 1 s and linewidths below 1 MHz, our system is suitable for cooling and trapping of Rb atoms.
△ Less
Submitted 17 July, 2024; v1 submitted 21 May, 2024;
originally announced May 2024.
-
Numerical implementation of evolution equations for twist-3 collinear PDFs
Authors:
Simone Rodini,
Lorenzo Rossi,
Alexey Vladimirov
Abstract:
Twist-3 collinear parton distribution functions (PDFs) are matrix elements of quark-gluon-quark or three-gluons light-cone operators. They depend on three momentum fraction variables, which are restricted to a hexagon region, and the evolution kernels are defined via two-dimensional convolution in these variables. We present the numerical realisation of the twist-3 evolution equations at leading o…
▽ More
Twist-3 collinear parton distribution functions (PDFs) are matrix elements of quark-gluon-quark or three-gluons light-cone operators. They depend on three momentum fraction variables, which are restricted to a hexagon region, and the evolution kernels are defined via two-dimensional convolution in these variables. We present the numerical realisation of the twist-3 evolution equations at leading order in the strong coupling for all kinds of twist-3 PDF (quark, gluon, chiral-even/odd, etc). We provide two independent codes (in C and Fortran) that have been extensively cross-checked, and are ready-to-use. We supplement the paper with a review of known properties of twist-3 PDFs.
△ Less
Submitted 9 July, 2024; v1 submitted 2 May, 2024;
originally announced May 2024.
-
Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing
Authors:
Leonardo Rossi,
Vittorio Bernuzzi,
Tomaso Fontanini,
Massimo Bertozzi,
Andrea Prati
Abstract:
Due to the limitations of current optical and sensor technologies and the high cost of updating them, the spectral and spatial resolution of satellites may not always meet desired requirements. For these reasons, Remote-Sensing Single-Image Super-Resolution (RS-SISR) techniques have gained significant interest. In this paper, we propose Swin2-MoSE model, an enhanced version of Swin2SR. Our model i…
▽ More
Due to the limitations of current optical and sensor technologies and the high cost of updating them, the spectral and spatial resolution of satellites may not always meet desired requirements. For these reasons, Remote-Sensing Single-Image Super-Resolution (RS-SISR) techniques have gained significant interest. In this paper, we propose Swin2-MoSE model, an enhanced version of Swin2SR. Our model introduces MoE-SM, an enhanced Mixture-of-Experts (MoE) to replace the Feed-Forward inside all Transformer block. MoE-SM is designed with Smart-Merger, and new layer for merging the output of individual experts, and with a new way to split the work between experts, defining a new per-example strategy instead of the commonly used per-token one. Furthermore, we analyze how positional encodings interact with each other, demonstrating that per-channel bias and per-head bias can positively cooperate. Finally, we propose to use a combination of Normalized-Cross-Correlation (NCC) and Structural Similarity Index Measure (SSIM) losses, to avoid typical MSE loss limitations. Experimental results demonstrate that Swin2-MoSE outperforms any Swin derived models by up to 0.377 - 0.958 dB (PSNR) on task of 2x, 3x and 4x resolution-upscaling (Sen2Venus and OLI2MSI datasets). It also outperforms SOTA models by a good margin, proving to be competitive and with excellent potential, especially for complex tasks. Additionally, an analysis of computational costs is also performed. Finally, we show the efficacy of Swin2-MoSE, applying it to a semantic segmentation task (SeasoNet dataset). Code and pretrained are available on https://github.com/IMPLabUniPr/swin2-mose/tree/official_code
△ Less
Submitted 12 December, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
Self-Balanced R-CNN for Instance Segmentation
Authors:
Leonardo Rossi,
Akbar Karimi,
Andrea Prati
Abstract:
Current state-of-the-art two-stage models on instance segmentation task suffer from several types of imbalances. In this paper, we address the Intersection over the Union (IoU) distribution imbalance of positive input Regions of Interest (RoIs) during the training of the second stage. Our Self-Balanced R-CNN (SBR-CNN), an evolved version of the Hybrid Task Cascade (HTC) model, brings brand new loo…
▽ More
Current state-of-the-art two-stage models on instance segmentation task suffer from several types of imbalances. In this paper, we address the Intersection over the Union (IoU) distribution imbalance of positive input Regions of Interest (RoIs) during the training of the second stage. Our Self-Balanced R-CNN (SBR-CNN), an evolved version of the Hybrid Task Cascade (HTC) model, brings brand new loop mechanisms of bounding box and mask refinements. With an improved Generic RoI Extraction (GRoIE), we also address the feature-level imbalance at the Feature Pyramid Network (FPN) level, originated by a non-uniform integration between low- and high-level features from the backbone layers. In addition, the redesign of the architecture heads toward a fully convolutional approach with FCC further reduces the number of parameters and obtains more clues to the connection between the task to solve and the layers used. Moreover, our SBR-CNN model shows the same or even better improvements if adopted in conjunction with other state-of-the-art models. In fact, with a lightweight ResNet-50 as backbone, evaluated on COCO minival 2017 dataset, our model reaches 45.3% and 41.5% AP for object detection and instance segmentation, with 12 epochs and without extra tricks. The code is available at https://github.com/IMPLabUniPr/mmdetection/tree/sbr_cnn
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Generating Graphs via Spectral Diffusion
Authors:
Giorgia Minello,
Alessandro Bicciato,
Luca Rossi,
Andrea Torsello,
Luca Cosmo
Abstract:
In this paper, we present GGSD, a novel graph generative model based on 1) the spectral decomposition of the graph Laplacian matrix and 2) a diffusion process. Specifically, we propose to use a denoising model to sample eigenvectors and eigenvalues from which we can reconstruct the graph Laplacian and adjacency matrix. Using the Laplacian spectrum allows us to naturally capture the structural char…
▽ More
In this paper, we present GGSD, a novel graph generative model based on 1) the spectral decomposition of the graph Laplacian matrix and 2) a diffusion process. Specifically, we propose to use a denoising model to sample eigenvectors and eigenvalues from which we can reconstruct the graph Laplacian and adjacency matrix. Using the Laplacian spectrum allows us to naturally capture the structural characteristics of the graph and work directly in the node space while avoiding the quadratic complexity bottleneck that limits the applicability of other diffusion-based methods. This, in turn, is accomplished by truncating the spectrum, which, as we show in our experiments, results in a faster yet accurate generative process, and by designing a novel transformer-based architecture linear in the number of nodes. Our permutation invariant model can also handle node features by concatenating them to the eigenvectors of each node. An extensive set of experiments on both synthetic and real-world graphs demonstrates the strengths of our model against state-of-the-art alternatives.
△ Less
Submitted 4 March, 2025; v1 submitted 29 February, 2024;
originally announced February 2024.
-
GNN-LoFI: a Novel Graph Neural Network through Localized Feature-based Histogram Intersection
Authors:
Alessandro Bicciato,
Luca Cosmo,
Giorgia Minello,
Luca Rossi,
Andrea Torsello
Abstract:
Graph neural networks are increasingly becoming the framework of choice for graph-based machine learning. In this paper, we propose a new graph neural network architecture that substitutes classical message passing with an analysis of the local distribution of node features. To this end, we extract the distribution of features in the egonet for each local neighbourhood and compare them against a s…
▽ More
Graph neural networks are increasingly becoming the framework of choice for graph-based machine learning. In this paper, we propose a new graph neural network architecture that substitutes classical message passing with an analysis of the local distribution of node features. To this end, we extract the distribution of features in the egonet for each local neighbourhood and compare them against a set of learned label distributions by taking the histogram intersection kernel. The similarity information is then propagated to other nodes in the network, effectively creating a message passing-like mechanism where the message is determined by the ensemble of the features. We perform an ablation study to evaluate the network's performance under different choices of its hyper-parameters. Finally, we test our model on standard graph classification and regression benchmarks, and we find that it outperforms widely used alternative approaches, including both graph kernels and graph neural networks.
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Non-linear instability of slowly rotating Kerr-AdS black holes
Authors:
Pau Figueras,
Lorenzo Rossi
Abstract:
Generic scalar perturbations on a fixed slowly rotating Kerr-AdS black hole background exhibit stable trapping, that is, the scalar field remains in a region between the exterior of the black hole and the AdS boundary for a very long time, decaying only inverse logarithmically in time. We study this effect employing fully general simulations that take into account the non-linear backreaction of th…
▽ More
Generic scalar perturbations on a fixed slowly rotating Kerr-AdS black hole background exhibit stable trapping, that is, the scalar field remains in a region between the exterior of the black hole and the AdS boundary for a very long time, decaying only inverse logarithmically in time. We study this effect employing fully general simulations that take into account the non-linear backreaction of the scalar field on the geometry. We find that the stable trapping of generic perturbations of Kerr-AdS persists at the non-linear level. Furthermore, the spacetime settles into a time-dependant and non-axisymmetric black hole which differs from Kerr-AdS. Since our perturbations are generic, our results indicate that slowly rotating Kerr-AdS black holes are non-linearly unstable.
△ Less
Submitted 21 February, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification
Authors:
Mateus Roder,
Leandro Aparecido Passos,
João Paulo Papa,
André Luis Debiaso Rossi
Abstract:
Quality classification of wood boards is an essential task in the sawmill industry, which is still usually performed by human operators in small to median companies in developing countries. Machine learning algorithms have been successfully employed to investigate the problem, offering a more affordable alternative compared to other solutions. However, such approaches usually present some drawback…
▽ More
Quality classification of wood boards is an essential task in the sawmill industry, which is still usually performed by human operators in small to median companies in developing countries. Machine learning algorithms have been successfully employed to investigate the problem, offering a more affordable alternative compared to other solutions. However, such approaches usually present some drawbacks regarding the proper selection of their hyperparameters. Moreover, the models are susceptible to the features extracted from wood board images, which influence the induction of the model and, consequently, its generalization power. Therefore, in this paper, we investigate the problem of simultaneously tuning the hyperparameters of an artificial neural network (ANN) as well as selecting a subset of characteristics that better describes the wood board quality. Experiments were conducted over a private dataset composed of images obtained from a sawmill industry and described using different feature descriptors. The predictive performance of the model was compared against five baseline methods as well as a random search, performing either ANN hyperparameter tuning and feature selection. Experimental results suggest that hyperparameters should be adjusted according to the feature set, or the features should be selected considering the hyperparameter values. In summary, the best predictive performance, i.e., a balanced accuracy of $0.80$, was achieved in two distinct scenarios: (i) performing only feature selection, and (ii) performing both tasks concomitantly. Thus, we suggest that at least one of the two approaches should be considered in the context of industrial applications.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
MUSTANG: Multi-Stain Self-Attention Graph Multiple Instance Learning Pipeline for Histopathology Whole Slide Images
Authors:
Amaya Gallagher-Syed,
Luca Rossi,
Felice Rivellese,
Costantino Pitzalis,
Myles Lewis,
Michael Barnes,
Gregory Slabaugh
Abstract:
Whole Slide Images (WSIs) present a challenging computer vision task due to their gigapixel size and presence of numerous artefacts. Yet they are a valuable resource for patient diagnosis and stratification, often representing the gold standard for diagnostic tasks. Real-world clinical datasets tend to come as sets of heterogeneous WSIs with labels present at the patient-level, with poor to no ann…
▽ More
Whole Slide Images (WSIs) present a challenging computer vision task due to their gigapixel size and presence of numerous artefacts. Yet they are a valuable resource for patient diagnosis and stratification, often representing the gold standard for diagnostic tasks. Real-world clinical datasets tend to come as sets of heterogeneous WSIs with labels present at the patient-level, with poor to no annotations. Weakly supervised attention-based multiple instance learning approaches have been developed in recent years to address these challenges, but can fail to resolve both long and short-range dependencies. Here we propose an end-to-end multi-stain self-attention graph (MUSTANG) multiple instance learning pipeline, which is designed to solve a weakly-supervised gigapixel multi-image classification task, where the label is assigned at the patient-level, but no slide-level labels or region annotations are available. The pipeline uses a self-attention based approach by restricting the operations to a highly sparse k-Nearest Neighbour Graph of embedded WSI patches based on the Euclidean distance. We show this approach achieves a state-of-the-art F1-score/AUC of 0.89/0.92, outperforming the widely used CLAM model. Our approach is highly modular and can easily be modified to suit different clinical datasets, as it only requires a patient-level label without annotations and accepts WSI sets of different sizes, as the graphs can be of varying sizes and structures. The source code can be found at https://github.com/AmayaGS/MUSTANG.
△ Less
Submitted 4 October, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Biological invasions and epidemics with nonlocal diffusion along a line
Authors:
Henri Berestycki,
Jean-Michel Roquejoffre,
Luca Rossi
Abstract:
The goal of this work is to understand and quantify how a line with nonlocal diffusion given by an integral enhances a reaction-diffusion process occurring in the surrounding plane. This is part of a long term programme where we aim at modelling, in a mathematically rigorous way, the effect of transportation networks on the speed of biological invasions or propagation of epidemics. We prove the ex…
▽ More
The goal of this work is to understand and quantify how a line with nonlocal diffusion given by an integral enhances a reaction-diffusion process occurring in the surrounding plane. This is part of a long term programme where we aim at modelling, in a mathematically rigorous way, the effect of transportation networks on the speed of biological invasions or propagation of epidemics. We prove the existence of a global propagation speed and characterise in terms of the parameters of the system the situations where such a speed is boosted by the presence of the line. In the course of the study we also uncover unexpected regularity properties of the model. On the quantitative side, the two main parameters are the intensity of the diffusion kernel and the characteristic size of its support. One outcome of this work is that the propagation speed will significantly be enhanced even if only one of the two is large, thus broadening the picture that we have already drawn in our previous works on the subject, with local diffusion modelled by a standard Laplacian. We further investigate the role of the other parameters, enlightening some subtle effects due to the interplay between the diffusion in the half plane and that on the line. Lastly, in the context of propagation of epidemics, we also discuss the model where, instead of a diffusion, displacement on the line comes from a pure transport term.
△ Less
Submitted 9 January, 2024; v1 submitted 15 September, 2023;
originally announced September 2023.
-
GRFolres: A code for modified gravity simulations in strong gravity
Authors:
Llibert Aresté Saló,
Sam E. Brady,
Katy Clough,
Daniela Doneva,
Tamara Evstafyeva,
Pau Figueras,
Tiago França,
Lorenzo Rossi,
Shunhui Yao
Abstract:
GRFolres is an open-source code for performing simulations in modified theories of gravity, based on the publicly available 3+1D numerical relativity code GRChombo.
Note: Submitted for review in the Journal of Open Source Software; Comments welcome; The code can be found at https://github.com/GRChombo/GRFolres
GRFolres is an open-source code for performing simulations in modified theories of gravity, based on the publicly available 3+1D numerical relativity code GRChombo.
Note: Submitted for review in the Journal of Open Source Software; Comments welcome; The code can be found at https://github.com/GRChombo/GRFolres
△ Less
Submitted 13 September, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
TEX (TEst stand for X-band) at LNF
Authors:
C. Di Giulio,
F. Cardelli,
S. Pioli,
D. Alesini,
M. Bellaveglia,
S. Bini,
B. Buonomo,
S. Cantarella,
G. Catuscelli,
M. Ceccarelli,
R. Ceccarelli,
M. Cianfrini,
R. Clementi,
E. Di Pasquale,
G. Di Raddo,
R. Di Raddo,
A. Falone,
A. Gallo,
G. Latini,
A. Liedl,
V. Lollo,
G. Piermarini,
L. Piersanti,
S. Quaglia,
L. A. Rossi
, et al. (5 additional authors not shown)
Abstract:
TEX facility if commissioned for high power testing to characterize accelerating structures and validate them for the operation on future particle accelerators for medical, industrial and research applications. At this aim, TEX is directly involved in the LNF leading project EuPRAXIA@SPARC_Lab. The brief description of the facility and its status and prospective will be provided.
TEX facility if commissioned for high power testing to characterize accelerating structures and validate them for the operation on future particle accelerators for medical, industrial and research applications. At this aim, TEX is directly involved in the LNF leading project EuPRAXIA@SPARC_Lab. The brief description of the facility and its status and prospective will be provided.
△ Less
Submitted 31 August, 2023; v1 submitted 6 August, 2023;
originally announced August 2023.
-
Long time rigidity to flux-induced symmetry breaking in quantum quench dynamics
Authors:
Lorenzo Rossi,
Luca Barbiero,
Jan Carl Budich,
Fabrizio Dolcini
Abstract:
We investigate how the breaking of charge conjugation symmetry $\mathcal{C}$ impacts on the dynamics of a half-filled fermionic lattice system after global quenches. We show that, when the initial state is insulating and the $\mathcal{C}$-symmetry is broken non-locally by a constant magnetic flux, local observables and correlations behave as if the symmetry were unbroken for a time interval propor…
▽ More
We investigate how the breaking of charge conjugation symmetry $\mathcal{C}$ impacts on the dynamics of a half-filled fermionic lattice system after global quenches. We show that, when the initial state is insulating and the $\mathcal{C}$-symmetry is broken non-locally by a constant magnetic flux, local observables and correlations behave as if the symmetry were unbroken for a time interval proportional to the system size $L$. In particular, the local particle density of a quenched dimerized insulator remains pinned to $1/2$ in each lattice site for an extensively long time, while it starts to significantly fluctuate only afterwards. Due to its qualitative resemblance to the sudden arrival of rapidly rising ocean waves, we dub this phenomenon the ``tsunami effect". Notably, it occurs even though the chiral symmetry is dynamically broken right after the quench. Furthermore, we identify a way to quantify the amount of symmetry breaking in the quantum state, showing that in insulators perturbed by a flux it is exponentially suppressed as a function of the system size, while it is only algebraically suppressed in metals and in insulators with locally broken $\mathcal{C}$-symmetry. The robustness of the tsunami effect to weak disorder and interactions is demonstrated, and possible experimental realizations are proposed.
△ Less
Submitted 8 January, 2024; v1 submitted 7 July, 2023;
originally announced July 2023.
-
Spreading, flattening and logarithmic lag for reaction-diffusion equations in R^N: old and new results
Authors:
François Hamel,
Luca Rossi
Abstract:
This paper is concerned with the large-time dynamics of bounded solutions of reaction-diffusion equations with bounded or unbounded initial support in R N. We start with a survey of some old and recent results on the spreading speeds of the solutions and their asymptotic local one-dimensional symmetry. We then derive some flattening properties of the level sets of the solutions if initially suppor…
▽ More
This paper is concerned with the large-time dynamics of bounded solutions of reaction-diffusion equations with bounded or unbounded initial support in R N. We start with a survey of some old and recent results on the spreading speeds of the solutions and their asymptotic local one-dimensional symmetry. We then derive some flattening properties of the level sets of the solutions if initially supported on subgraphs. We also investigate the special case of asymptotically conical-shaped initial conditions. Lastly, we reclaim some known results about the logarithmic lag between the position of the solutions and that of planar or spherical fronts expanding with minimal speed, for almost-planar or compactly supported initial conditions. We then prove some new logarithmic-in-time estimates of the lag of the position of the solutions with respect to that of a planar front, for initial conditions which are supported on subgraphs with logarithmic growth at infinity. These estimates entail in particular that the same lag as for compactly supported initial data holds true for a class of unbounded initial supports. The paper also contains some related conjectures and open problems.
△ Less
Submitted 1 July, 2024; v1 submitted 7 July, 2023;
originally announced July 2023.
-
Dynamical properties and mechanisms of metastability: a perspective in neuroscience
Authors:
Kalel L. Rossi,
Roberto C. Budzinski,
Everton S. Medeiros,
Bruno R. R. Boaretto,
Lyle Muller,
Ulrike Feudel
Abstract:
Metastability, characterized by a variability of regimes in time, is a ubiquitous type of neural dynamics. It has been formulated in many different ways in the neuroscience literature, however, which may cause some confusion. In this Perspective, we discuss metastability from the point of view of dynamical systems theory. We extract from the literature a very simple but general definition through…
▽ More
Metastability, characterized by a variability of regimes in time, is a ubiquitous type of neural dynamics. It has been formulated in many different ways in the neuroscience literature, however, which may cause some confusion. In this Perspective, we discuss metastability from the point of view of dynamical systems theory. We extract from the literature a very simple but general definition through the concept of metastable regimes as long-lived but transient epochs of activity with unique dynamical properties. This definition serves as an umbrella term that encompasses formulations from other works, and readily connects to concepts from dynamical systems theory. This allows us to examine general dynamical properties of metastable regimes, propose in a didactic manner several dynamics-based mechanisms that generate them, and discuss a theoretical tool to characterize them quantitatively. This perspective leads to insights that help to address issues debated in the literature and also suggest pathways for future research.
△ Less
Submitted 21 May, 2024; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Incremental procedural and sensorimotor learning in cognitive humanoid robots
Authors:
Leonardo de Lellis Rossi,
Leticia Mara Berto,
Eric Rohmer,
Paula Paro Costa,
Ricardo Ribeiro Gudwin,
Esther Luna Colombini,
Alexandre da Silva Simoes
Abstract:
The ability to automatically learn movements and behaviors of increasing complexity is a long-term goal in autonomous systems. Indeed, this is a very complex problem that involves understanding how knowledge is acquired and reused by humans as well as proposing mechanisms that allow artificial agents to reuse previous knowledge. Inspired by Jean Piaget's theory's first three sensorimotor substages…
▽ More
The ability to automatically learn movements and behaviors of increasing complexity is a long-term goal in autonomous systems. Indeed, this is a very complex problem that involves understanding how knowledge is acquired and reused by humans as well as proposing mechanisms that allow artificial agents to reuse previous knowledge. Inspired by Jean Piaget's theory's first three sensorimotor substages, this work presents a cognitive agent based on CONAIM (Conscious Attention-Based Integrated Model) that can learn procedures incrementally. Throughout the paper, we show the cognitive functions required in each substage and how adding new functions helps address tasks previously unsolved by the agent. Experiments were conducted with a humanoid robot in a simulated environment modeled with the Cognitive Systems Toolkit (CST) performing an object tracking task. The system is modeled using a single procedural learning mechanism based on Reinforcement Learning. The increasing agent's cognitive complexity is managed by adding new terms to the reward function for each learning phase. Results show that this approach is capable of solving complex tasks incrementally.
△ Less
Submitted 30 April, 2023;
originally announced May 2023.
-
Framework for global stability analysis of dynamical systems
Authors:
George Datseris,
Kalel Luiz Rossi,
Alexandre Wagemakers
Abstract:
Dynamical systems, that are used to model power grids, the brain, and other physical systems, can exhibit coexisting stable states known as attractors. A powerful tool to understand such systems, as well as to better predict when they may ``tip'' from one stable state to the other, is global stability analysis. It involves identifying the initial conditions that converge to each attractor, known a…
▽ More
Dynamical systems, that are used to model power grids, the brain, and other physical systems, can exhibit coexisting stable states known as attractors. A powerful tool to understand such systems, as well as to better predict when they may ``tip'' from one stable state to the other, is global stability analysis. It involves identifying the initial conditions that converge to each attractor, known as the basins of attraction, measuring the relative volume of these basins in state space, and quantifying how these fractions change as a system parameter evolves. By improving existing approaches, we present a comprehensive framework that allows for global stability analysis on any dynamical system. Notably, our framework enables the analysis to be made efficiently and conveniently over a parameter range. As such, it becomes an essential complement to traditional continuation techniques, that only allow for linear stability analysis. We demonstrate the effectiveness of our approach on a variety of models, including climate, power grids, ecosystems, and more. Our framework is available as simple-to-use open-source code as part of the DynamicalSystems.jl library.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
Towards a Muon Collider
Authors:
Carlotta Accettura,
Dean Adams,
Rohit Agarwal,
Claudia Ahdida,
Chiara Aimè,
Nicola Amapane,
David Amorim,
Paolo Andreetto,
Fabio Anulli,
Robert Appleby,
Artur Apresyan,
Aram Apyan,
Sergey Arsenyev,
Pouya Asadi,
Mohammed Attia Mahmoud,
Aleksandr Azatov,
John Back,
Lorenzo Balconi,
Laura Bandiera,
Roger Barlow,
Nazar Bartosik,
Emanuela Barzi,
Fabian Batsch,
Matteo Bauce,
J. Scott Berg
, et al. (272 additional authors not shown)
Abstract:
A muon collider would enable the big jump ahead in energy reach that is needed for a fruitful exploration of fundamental interactions. The challenges of producing muon collisions at high luminosity and 10 TeV centre of mass energy are being investigated by the recently-formed International Muon Collider Collaboration. This Review summarises the status and the recent advances on muon colliders desi…
▽ More
A muon collider would enable the big jump ahead in energy reach that is needed for a fruitful exploration of fundamental interactions. The challenges of producing muon collisions at high luminosity and 10 TeV centre of mass energy are being investigated by the recently-formed International Muon Collider Collaboration. This Review summarises the status and the recent advances on muon colliders design, physics and detector studies. The aim is to provide a global perspective of the field and to outline directions for future work.
△ Less
Submitted 27 November, 2023; v1 submitted 15 March, 2023;
originally announced March 2023.
-
Analogies between hadron-in-jet and dihadron fragmentation
Authors:
Alessandro Bacchetta,
Marco Radici,
Lorenzo Rossi
Abstract:
We describe the formal analogies in the description of the inclusive production in hard processes of hadron pairs (based on dihadron fragmentation functions) and of a single hadron inside a jet (based on hadron-in-jet fragmentation functions). Since several observables involving dihadron fragmentation functions have been proposed in the past, we are able to suggest new interesting observables invo…
▽ More
We describe the formal analogies in the description of the inclusive production in hard processes of hadron pairs (based on dihadron fragmentation functions) and of a single hadron inside a jet (based on hadron-in-jet fragmentation functions). Since several observables involving dihadron fragmentation functions have been proposed in the past, we are able to suggest new interesting observables involving hadron-in-jet fragmentation functions, in lepton-hadron deep-inelastic scattering and hadronic collisions.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Graph Neural Networks in Vision-Language Image Understanding: A Survey
Authors:
Henry Senior,
Gregory Slabaugh,
Shanxin Yuan,
Luca Rossi
Abstract:
2D image understanding is a complex problem within computer vision, but it holds the key to providing human-level scene comprehension. It goes further than identifying the objects in an image, and instead, it attempts to understand the scene. Solutions to this problem form the underpinning of a range of tasks, including image captioning, visual question answering (VQA), and image retrieval. Graphs…
▽ More
2D image understanding is a complex problem within computer vision, but it holds the key to providing human-level scene comprehension. It goes further than identifying the objects in an image, and instead, it attempts to understand the scene. Solutions to this problem form the underpinning of a range of tasks, including image captioning, visual question answering (VQA), and image retrieval. Graphs provide a natural way to represent the relational arrangement between objects in an image, and thus, in recent years graph neural networks (GNNs) have become a standard component of many 2D image understanding pipelines, becoming a core architectural component, especially in the VQA group of tasks. In this survey, we review this rapidly evolving field and we provide a taxonomy of graph types used in 2D image understanding approaches, a comprehensive list of the GNN models used in this domain, and a roadmap of future potential developments. To the best of our knowledge, this is the first comprehensive survey that covers image captioning, visual question answering, and image retrieval techniques that focus on using GNNs as the main part of their architecture.
△ Less
Submitted 12 April, 2024; v1 submitted 7 March, 2023;
originally announced March 2023.