-
One Period to Rule Them All: Identifying Critical Learning Periods in Deep Networks
Authors:
Vinicius Yuiti Fukase,
Heitor Gama,
Barbara Bueno,
Lucas Libanio,
Anna Helena Reali Costa,
Artur Jordao
Abstract:
Critical Learning Periods comprehend an important phenomenon involving deep learning, where early epochs play a decisive role in the success of many training recipes, such as data augmentation. Existing works confirm the existence of this phenomenon and provide useful insights. However, the literature lacks efforts to precisely identify when critical periods occur. In this work, we fill this gap b…
▽ More
Critical Learning Periods comprehend an important phenomenon involving deep learning, where early epochs play a decisive role in the success of many training recipes, such as data augmentation. Existing works confirm the existence of this phenomenon and provide useful insights. However, the literature lacks efforts to precisely identify when critical periods occur. In this work, we fill this gap by introducing a systematic approach for identifying critical periods during the training of deep neural networks, focusing on eliminating computationally intensive regularization techniques and effectively applying mechanisms for reducing computational costs, such as data pruning. Our method leverages generalization prediction mechanisms to pinpoint critical phases where training recipes yield maximum benefits to the predictive ability of models. By halting resource-intensive recipes beyond these periods, we significantly accelerate the learning phase and achieve reductions in training time, energy consumption, and CO$_2$ emissions. Experiments on standard architectures and benchmarks confirm the effectiveness of our method. Specifically, we achieve significant milestones by reducing the training time of popular architectures by up to 59.67%, leading to a 59.47% decrease in CO$_2$ emissions and a 60% reduction in financial costs, without compromising performance. Our work enhances understanding of training dynamics and paves the way for more sustainable and efficient deep learning practices, particularly in resource-constrained environments. In the era of the race for foundation models, we believe our method emerges as a valuable framework. The repository is available at https://github.com/baunilhamarga/critical-periods
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
Quantum block Krylov subspace projector algorithm for computing low-lying eigenenergies
Authors:
Maria Gabriela Jordão Oliveira,
Nina Glaser
Abstract:
Determining eigenvalues is a computationally expensive task that is crucial for countless applications in natural sciences. Toward this end, we introduce the quantum block Krylov subspace projector (QBKSP) algorithm, a multireference quantum variant of the Lanczos algorithm designed to accurately compute low-lying eigenvalues, including degenerate states. We present three different compact quantum…
▽ More
Determining eigenvalues is a computationally expensive task that is crucial for countless applications in natural sciences. Toward this end, we introduce the quantum block Krylov subspace projector (QBKSP) algorithm, a multireference quantum variant of the Lanczos algorithm designed to accurately compute low-lying eigenvalues, including degenerate states. We present three different compact quantum circuits to evaluate the required expectation values, each suited to different problem settings. To investigate the impact of the number and fidelity of the initial reference states, as well as time evolution duration, we perform error-free and limited-precision numerical simulations and quantum circuit simulations. The results demonstrate that using multiple initial reference states improves the convergence of the algorithm, especially in realistic precision-limited simulations and in cases where a single reference fails to simultaneously retrieve all desired eigenvalues. Furthermore, the QBKSP algorithm enables the computation of degenerate eigenstates and respective multiplicity by imposing appropriate convergence criteria.
△ Less
Submitted 11 June, 2025;
originally announced June 2025.
-
IDEIA: A Generative AI-Based System for Real-Time Editorial Ideation in Digital Journalism
Authors:
Victor B. Santos,
Cauã O. Jordão,
Leonardo J. O. Ibiapina,
Gabriel M. Silva,
Mirella E. B. Santana,
Matheus A. Garrido,
Lucas R. C. Farias
Abstract:
This paper presents IDEIA (Intelligent Engine for Editorial Ideation and Assistance), a generative AI-powered system designed to optimize the journalistic ideation process by combining real-time trend analysis with automated content suggestion. Developed in collaboration with the Sistema Jornal do Commercio de Comunicação (SJCC), the largest media conglomerate in Brazil's North and Northeast regio…
▽ More
This paper presents IDEIA (Intelligent Engine for Editorial Ideation and Assistance), a generative AI-powered system designed to optimize the journalistic ideation process by combining real-time trend analysis with automated content suggestion. Developed in collaboration with the Sistema Jornal do Commercio de Comunicação (SJCC), the largest media conglomerate in Brazil's North and Northeast regions, IDEIA integrates the Google Trends API for data-driven topic monitoring and the Google Gemini API for the generation of context-aware headlines and summaries. The system adopts a modular architecture based on Node.js, React, and PostgreSQL, supported by Docker containerization and a CI/CD pipeline using GitHub Actions and Vercel. Empirical results demonstrate a significant reduction in the time and cognitive effort required for editorial planning, with reported gains of up to 70\% in the content ideation stage. This work contributes to the field of computational journalism by showcasing how intelligent automation can enhance productivity while maintaining editorial quality. It also discusses the technical and ethical implications of incorporating generative models into newsroom workflows, highlighting scalability and future applicability across sectors beyond journalism.
△ Less
Submitted 8 June, 2025;
originally announced June 2025.
-
Pruning Everything, Everywhere, All at Once
Authors:
Gustavo Henrique do Nascimento,
Ian Pons,
Anna Helena Reali Costa,
Artur Jordao
Abstract:
Deep learning stands as the modern paradigm for solving cognitive tasks. However, as the problem complexity increases, models grow deeper and computationally prohibitive, hindering advancements in real-world and resource-constrained applications. Extensive studies reveal that pruning structures in these models efficiently reduces model complexity and improves computational efficiency. Successful s…
▽ More
Deep learning stands as the modern paradigm for solving cognitive tasks. However, as the problem complexity increases, models grow deeper and computationally prohibitive, hindering advancements in real-world and resource-constrained applications. Extensive studies reveal that pruning structures in these models efficiently reduces model complexity and improves computational efficiency. Successful strategies in this sphere include removing neurons (i.e., filters, heads) or layers, but not both together. Therefore, simultaneously pruning different structures remains an open problem. To fill this gap and leverage the benefits of eliminating neurons and layers at once, we propose a new method capable of pruning different structures within a model as follows. Given two candidate subnetworks (pruned models), one from layer pruning and the other from neuron pruning, our method decides which to choose by selecting the one with the highest representation similarity to its parent (the network that generates the subnetworks) using the Centered Kernel Alignment metric. Iteratively repeating this process provides highly sparse models that preserve the original predictive ability. Throughout extensive experiments on standard architectures and benchmarks, we confirm the effectiveness of our approach and show that it outperforms state-of-the-art layer and filter pruning techniques. At high levels of Floating Point Operations reduction, most state-of-the-art methods degrade accuracy, whereas our approach either improves it or experiences only a minimal drop. Notably, on the popular ResNet56 and ResNet110, we achieve a milestone of 86.37% and 95.82% FLOPs reduction. Besides, our pruned models obtain robustness to adversarial and out-of-distribution samples and take an important step towards GreenAI, reducing carbon emissions by up to 83.31%. Overall, we believe our work opens a new chapter in pruning.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Improving Fairness in LLMs Through Testing-Time Adversaries
Authors:
Isabela Pereira Gregio,
Ian Pons,
Anna Helena Reali Costa,
Artur Jordão
Abstract:
Large Language Models (LLMs) push the bound-aries in natural language processing and generative AI, driving progress across various aspects of modern society. Unfortunately, the pervasive issue of bias in LLMs responses (i.e., predictions) poses a significant and open challenge, hindering their application in tasks involving ethical sensitivity and responsible decision-making. In this work, we pro…
▽ More
Large Language Models (LLMs) push the bound-aries in natural language processing and generative AI, driving progress across various aspects of modern society. Unfortunately, the pervasive issue of bias in LLMs responses (i.e., predictions) poses a significant and open challenge, hindering their application in tasks involving ethical sensitivity and responsible decision-making. In this work, we propose a straightforward, user-friendly and practical method to mitigate such biases, enhancing the reliability and trustworthiness of LLMs. Our method creates multiple variations of a given sentence by modifying specific attributes and evaluates the corresponding prediction behavior compared to the original, unaltered, prediction/sentence. The idea behind this process is that critical ethical predictions often exhibit notable inconsistencies, indicating the presence of bias. Unlike previous approaches, our method relies solely on forward passes (i.e., testing-time adversaries), eliminating the need for training, fine-tuning, or prior knowledge of the training data distribution. Through extensive experiments on the popular Llama family, we demonstrate the effectiveness of our method in improving various fairness metrics, focusing on the reduction of disparities in how the model treats individuals from different racial groups. Specifically, using standard metrics, we improve the fairness in Llama3 in up to 27 percentage points. Overall, our approach significantly enhances fairness, equity, and reliability in LLM-generated results without parameter tuning or training data modifications, confirming its effectiveness in practical scenarios. We believe our work establishes an important step toward enabling the use of LLMs in tasks that require ethical considerations and responsible decision-making.
△ Less
Submitted 17 May, 2025;
originally announced May 2025.
-
Superradiance Enhanced Light-Matter Interaction in Spatially Ordered Shape and Volume Controlled Single Quantum Dots: Enabling On-Chip Photonic Networks
Authors:
Lucas Jordao,
Qi Huang,
Swarnabha Chattaraj,
Siyuan Lu,
Jiefei Zhang,
Anupam Madhukar
Abstract:
On-chip photonic networks require adequately spatially ordered matter-photon interconversion qubit sources with emission figures-of-merit exceeding the requirements that would enable the desired functional response of the network. The mesa-top single quantum dots (MTSQDs) have recently been demonstrated to meet these requirements. The substrate-encoded size-reducing epitaxy (SESRE) approach underp…
▽ More
On-chip photonic networks require adequately spatially ordered matter-photon interconversion qubit sources with emission figures-of-merit exceeding the requirements that would enable the desired functional response of the network. The mesa-top single quantum dots (MTSQDs) have recently been demonstrated to meet these requirements. The substrate-encoded size-reducing epitaxy (SESRE) approach underpinning the realization of these quantum emitters allows control on the shape, size, and strain (lattice-matched or mismatched) of these epitaxial single quantum dots. We have exploited this unique feature of the MTSQDs to reproducibly create arrays of quantum dots that exhibit single photon superradiance, characteristic of a delicate balance between the confinement potential volume, depth, and the resulting binding energy of the center of mass motion of the exciton and the exciton binding energy. In the exciton's weak confinement regime, direct enhancement of the quantum dot oscillator strength to ~30 is demonstrated for emitters in large arrays. Our findings provide compelling incentive for investigations of the potential of SESRE based tailored MTSQDs of lattice matched and mismatched material combinations for fabricating and studying interconnected networks enabled by these unique matter qubit-light qubit interconversion units.
△ Less
Submitted 10 May, 2025;
originally announced May 2025.
-
Above-room-temperature ferromagnetism in large-area epitaxial Fe3GaTe2/graphene van der Waals heterostructures
Authors:
Tauqir Shinwari,
Kacho Imtiyaz Ali Khan,
Hua Lv,
Atekelte Abebe Kassa,
Frans Munnik,
Simon Josephy,
Achim Trampert,
Victor Ukleev,
Chen Luo,
Florin Radu,
Jens Herfort,
Michael Hanke,
Joao Marcelo Jordao Lopes
Abstract:
Fe3GaTe2 (FGaT), a two-dimensional (2D) layered ferromagnetic metal, exhibits a high Curie temperature (TC) ~ 360 K along with strong perpendicular magnetic anisotropy (PMA), making it a promising material candidate for next-generation energy-efficient magnetic devices. However, the vast majority of studies on FGaT to date have been limited to millimeter-sized bulk crystals and exfoliated flakes,…
▽ More
Fe3GaTe2 (FGaT), a two-dimensional (2D) layered ferromagnetic metal, exhibits a high Curie temperature (TC) ~ 360 K along with strong perpendicular magnetic anisotropy (PMA), making it a promising material candidate for next-generation energy-efficient magnetic devices. However, the vast majority of studies on FGaT to date have been limited to millimeter-sized bulk crystals and exfoliated flakes, which are unsuitable for practical applications and integration into device processing. Also, its combination with other 2D materials to form van der Waals heterostructures has only been achieved by flake stacking. Consequently, the controlled large-scale growth of FGaT and related heterostructures remains largely unexplored. In this work, we demonstrate a breakthrough in the high-quality, large-scale growth of epitaxial FGaT thin films on single-crystalline graphene/SiC templates using molecular beam epitaxy. Structural characterization confirms the high crystalline quality of the continuous FGaT/graphene van der Waals heterostructures. Temperature-dependent magnetization and anomalous Hall measurements reveal robust PMA with an enhanced TC well above room temperature, reaching up to 400 K. Furthermore, X-ray absorption and X-ray magnetic circular dichroism spectra provide insight into the spin and orbital magnetic moment contributions, further validating the high TC and robust PMA. These findings are highly significant for the future development of high-performance spintronic devices based on 2D heterostructures, with potential applications in next-generation data storage, logic processing and quantum technologies.
△ Less
Submitted 9 May, 2025;
originally announced May 2025.
-
Efficient LLMs with AMP: Attention Heads and MLP Pruning
Authors:
Leandro Giusti Mugnaini,
Bruno Lopes Yamamoto,
Lucas Lauton de Alcantara,
Victor Zacarias,
Edson Bollis,
Lucas Pellicer,
Anna Helena Reali Costa,
Artur Jordao
Abstract:
Deep learning drives a new wave in computing systems and triggers the automation of increasingly complex problems. In particular, Large Language Models (LLMs) have significantly advanced cognitive tasks, often matching or even surpassing human-level performance. However, their extensive parameters result in high computational costs and slow inference, posing challenges for deployment in resource-l…
▽ More
Deep learning drives a new wave in computing systems and triggers the automation of increasingly complex problems. In particular, Large Language Models (LLMs) have significantly advanced cognitive tasks, often matching or even surpassing human-level performance. However, their extensive parameters result in high computational costs and slow inference, posing challenges for deployment in resource-limited settings. Among the strategies to overcome the aforementioned challenges, pruning emerges as a successful mechanism since it reduces model size while maintaining predictive ability. In this paper, we introduce AMP: Attention Heads and MLP Pruning, a novel structured pruning method that efficiently compresses LLMs by removing less critical structures within Multi-Head Attention (MHA) and Multilayer Perceptron (MLP). By projecting the input data onto weights, AMP assesses structural importance and overcomes the limitations of existing techniques, which often fall short in flexibility or efficiency. In particular, AMP surpasses the current state-of-the-art on commonsense reasoning tasks by up to 1.49 percentage points, achieving a 30% pruning ratio with minimal impact on zero-shot task performance. Moreover, AMP also improves inference speeds, making it well-suited for deployment in resource-constrained environments. We confirm the flexibility of AMP on different families of LLMs, including LLaMA and Phi.
△ Less
Submitted 29 April, 2025;
originally announced April 2025.
-
Jet evolution in a quantum computer: quark and gluon dynamics
Authors:
Nuno Filipe Castro,
José Guilherme Milhano,
Maria Gabriela Jordão Oliveira
Abstract:
The intrinsic quantum nature of jets and the Quark-Gluon Plasma makes the study of jet quenching a promising candidate to benefit from quantum computing power. Standing as a precursor of the full study of this phenomenon, we study the propagation of SU(3) partons in Quark-Gluon Plasma using quantum simulation algorithms. The algorithms are developed in detail, and the propagation of both quarks an…
▽ More
The intrinsic quantum nature of jets and the Quark-Gluon Plasma makes the study of jet quenching a promising candidate to benefit from quantum computing power. Standing as a precursor of the full study of this phenomenon, we study the propagation of SU(3) partons in Quark-Gluon Plasma using quantum simulation algorithms. The algorithms are developed in detail, and the propagation of both quarks and gluons is analysed and compared with analytical expectations. The results, obtained with quantum simulators, demonstrate that the algorithm successfully simulates parton propagation, yielding results consistent with analytical baseline calculations.
△ Less
Submitted 5 February, 2025;
originally announced February 2025.
-
The Dynamic Creativity of Proto-artifacts in Generative Computational Co-creation
Authors:
Juan Salamanca,
Daniel Gómez-Marín,
Sergi Jordà
Abstract:
This paper explores the attributes necessary to determine the creative merit of intermediate artifacts produced during a computational co-creative process (CCC) in which a human and an artificial intelligence system collaborate in the generative phase of a creative project. In an active listening experiment, subjects with diverse musical training (N=43) judged unfinished pieces composed by the New…
▽ More
This paper explores the attributes necessary to determine the creative merit of intermediate artifacts produced during a computational co-creative process (CCC) in which a human and an artificial intelligence system collaborate in the generative phase of a creative project. In an active listening experiment, subjects with diverse musical training (N=43) judged unfinished pieces composed by the New Electronic Assistant (NEA). The results revealed that a two-attribute definition based on the value and novelty of an artifact (e.g., Corazza's effectiveness and novelty) suffices to assess unfinished work leading to innovative products, instead of Boden's classic three-attribute definition of creativity (value, novelty, and surprise). These findings reduce the creativity metrics needed in CCC processes and simplify the evaluation of the numerous unfinished artifacts generated by computational creative assistants.
△ Less
Submitted 25 November, 2024;
originally announced November 2024.
-
Layer Pruning with Consensus: A Triple-Win Solution
Authors:
Leandro Giusti Mugnaini,
Carolina Tavares Duarte,
Anna H. Reali Costa,
Artur Jordao
Abstract:
Layer pruning offers a promising alternative to standard structured pruning, effectively reducing computational costs, latency, and memory footprint. While notable layer-pruning approaches aim to detect unimportant layers for removal, they often rely on single criteria that may not fully capture the complex, underlying properties of layers. We propose a novel approach that combines multiple simila…
▽ More
Layer pruning offers a promising alternative to standard structured pruning, effectively reducing computational costs, latency, and memory footprint. While notable layer-pruning approaches aim to detect unimportant layers for removal, they often rely on single criteria that may not fully capture the complex, underlying properties of layers. We propose a novel approach that combines multiple similarity metrics into a single expressive measure of low-importance layers, called the Consensus criterion. Our technique delivers a triple-win solution: low accuracy drop, high-performance improvement, and increased robustness to adversarial attacks. With up to 78.80% FLOPs reduction and performance on par with state-of-the-art methods across different benchmarks, our approach reduces energy consumption and carbon emissions by up to 66.99% and 68.75%, respectively. Additionally, it avoids shortcut learning and improves robustness by up to 4 percentage points under various adversarial attacks. Overall, the Consensus criterion demonstrates its effectiveness in creating robust, efficient, and environmentally friendly pruned models.
△ Less
Submitted 21 November, 2024;
originally announced November 2024.
-
Positive definite functions as uniformly ergodic multipliers of the Fourier algebra
Authors:
Jorge Galindo,
Enrique Jordá,
Alberto Rodríguez-Arenas
Abstract:
Let G be a locally compact group and let $φ$ be a positive definite function on G with $φ(e)=1$. This function defines a multiplication operator $M_φ$ on the Fourier algebra $A(G)$ of $G$. The aim of this paper is to classify the ergodic properties of the operators $M_φ$, focusing on several key factors, including the subgroup $H_φ=\{x\in G\colon φ(x)=1\}$, the spectrum of $M_φ$, or how ``spread-o…
▽ More
Let G be a locally compact group and let $φ$ be a positive definite function on G with $φ(e)=1$. This function defines a multiplication operator $M_φ$ on the Fourier algebra $A(G)$ of $G$. The aim of this paper is to classify the ergodic properties of the operators $M_φ$, focusing on several key factors, including the subgroup $H_φ=\{x\in G\colon φ(x)=1\}$, the spectrum of $M_φ$, or how ``spread-out'' a power of $M_φ$ can be. We show that the multiplication operator $M_φ$ is uniformly mean ergodic if and only if $H_φ$ is open and 1 is not an accumulation point of the spectrum of $M_φ$. Equivalently, this happens when some power of $φ$ is not far, in the multiplier norm, from a function supported on finitely many cosets of $H_φ$. Additionally, we show that the powers of $M_φ$ converge in norm if, and only if, the operator is uniformly mean ergodic and $H_φ=\{x\in G\colon |φ(x)|=1\}$.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
Bell state generation and CNOT operation using on-demand identical photons from shape-controlled spatially ordered quantum dots
Authors:
Qi Huang,
Swarnabha Chattaraj,
Lucas Jordao,
Jiefei Zhang,
Siyuan Lu,
Anupam Madhukar
Abstract:
Fault tolerant on-chip photonic quantum computation is enormously helped by (a) deterministic generation of the needed thousands to millions of photon qubits from (b) quantum emitters in designed spatially ordered arrays to enable networks for implementing many-qubit logic circuits. Scaling up photonic quantum information processing systems has, however, been prevented by the lack of such quantum…
▽ More
Fault tolerant on-chip photonic quantum computation is enormously helped by (a) deterministic generation of the needed thousands to millions of photon qubits from (b) quantum emitters in designed spatially ordered arrays to enable networks for implementing many-qubit logic circuits. Scaling up photonic quantum information processing systems has, however, been prevented by the lack of such quantum emitters until the demonstration of the platform of mesa-top single quantum dots (MTSQDs) -- controlled shape, size, and volume single QD -- located in designed regular arrays. Here we demonstrate 2 qubit CNOT gate operation -- a universal gate necessary to enable quantum circuits of arbitrary complexity -- in polarization basis using photons emitted from individual MTSQDs. A Bell state fidelity of 0.825$\pm$0.010 is achieved with two photon interference (TPI) visibility of 0.947$\pm$0.0015 at 4K without Purcell enhancement. The results make a strong case for developing MTSQD arrays for utility scale optical quantum information processing platforms.
△ Less
Submitted 8 November, 2024; v1 submitted 6 November, 2024;
originally announced November 2024.
-
Evolution of global inequality in well-being: A copula-based approach
Authors:
Koen Decancq,
Vanesa Jorda
Abstract:
We employ a flexible parametric model to estimate global income, health, and education distributions from 1980 to 2015. Using these marginal distributions within a copula-based framework, we construct a global joint distribution of well-being. This approach allows us to specifically analyze the impact of dependency structures on global well-being inequality. While inequality decreased in each indi…
▽ More
We employ a flexible parametric model to estimate global income, health, and education distributions from 1980 to 2015. Using these marginal distributions within a copula-based framework, we construct a global joint distribution of well-being. This approach allows us to specifically analyze the impact of dependency structures on global well-being inequality. While inequality decreased in each individual dimension, our findings suggest that multidimensional inequality does not necessarily follow this trend. Its evolution is influenced by the interdependence among dimensions and the chosen inequality aversion parameter.
△ Less
Submitted 30 October, 2024;
originally announced October 2024.
-
Predicting soccer matches with complex networks and machine learning
Authors:
Eduardo Alves Baratela,
Felipe Jordão Xavier,
Thomas Peron,
Paulino Ribeiro Villas-Boas,
Francisco Aparecido Rodrigues
Abstract:
Soccer attracts the attention of many researchers and professionals in the sports industry. Therefore, the incorporation of science into the sport is constantly growing, with increasing investments in performance analysis and sports prediction industries. This study aims to (i) highlight the use of complex networks as an alternative tool for predicting soccer match outcomes, and (ii) show how the…
▽ More
Soccer attracts the attention of many researchers and professionals in the sports industry. Therefore, the incorporation of science into the sport is constantly growing, with increasing investments in performance analysis and sports prediction industries. This study aims to (i) highlight the use of complex networks as an alternative tool for predicting soccer match outcomes, and (ii) show how the combination of structural analysis of passing networks with match statistical data can provide deeper insights into the game patterns and strategies used by teams. In order to do so, complex network metrics and match statistics were used to build machine learning models that predict the wins and losses of soccer teams in different leagues. The results showed that models based on passing networks were as effective as ``traditional'' models, which use general match statistics. Another finding was that by combining both approaches, more accurate models were obtained than when they were used separately, demonstrating that the fusion of such approaches can offer a deeper understanding of game patterns, allowing the comprehension of tactics employed by teams relationships between players, their positions, and interactions during matches. It is worth mentioning that both network metrics and match statistics were important and impactful for the mixed model. Furthermore, the use of networks with a lower granularity of temporal evolution (such as creating a network for each half of the match) performed better than a single network for the entire game.
△ Less
Submitted 19 September, 2024;
originally announced September 2024.
-
New gravity field of comet 67P/C-G based on Rosetta's Doppler and optical data
Authors:
Julien Laurent-Varin,
Théo James,
Jean-Charles Marty,
Laurent Jorda,
Sebastien Le Maistre,
Robert Gaskell
Abstract:
We provide a new gravity field of Comet 67P-C/G up to degree 4. We detect mass heterogeneity in the comet nucleus. The loss of mass is restimated at 0.28\% of the comet's total mass (3 times larger than previous estimate). Comparison of the gravity field between pre- and post-perihelion allowed us to measure a shift in the comet's center of gravity of 35 m northward, attributed to ice sublimation…
▽ More
We provide a new gravity field of Comet 67P-C/G up to degree 4. We detect mass heterogeneity in the comet nucleus. The loss of mass is restimated at 0.28\% of the comet's total mass (3 times larger than previous estimate). Comparison of the gravity field between pre- and post-perihelion allowed us to measure a shift in the comet's center of gravity of 35 m northward, attributed to ice sublimation process.
△ Less
Submitted 4 September, 2024;
originally announced September 2024.
-
Generalized Hilbert operators acting on weighted spaces of holomorphic functions with sup-norms
Authors:
María J. Beltrán-Meneu,
José Bonet,
Enrique Jordá
Abstract:
The behaviour of the generalized Hilbert operator associated with a positive finite Borel measure $μ$ on $[0,1)$ is investigated when it acts on weighted Banach spaces of holomorphic functions on the unit disc defined by sup-norms and on Korenblum type growth Banach spaces. It is studied when the operator is well defined, bounded and compact. To this aim, we study when it can be represented as an…
▽ More
The behaviour of the generalized Hilbert operator associated with a positive finite Borel measure $μ$ on $[0,1)$ is investigated when it acts on weighted Banach spaces of holomorphic functions on the unit disc defined by sup-norms and on Korenblum type growth Banach spaces. It is studied when the operator is well defined, bounded and compact. To this aim, we study when it can be represented as an integral operator. We observe important differences with the behaviour of the Cesàro-type operator acting on these spaces, getting that boundedness and compactness are equivalent concepts for some standard weights. For the space of bounded holomorphic functions on the disc and for the Wiener algebra, we get also this equivalence, which is characterized in turn by the summability of the moments of the measure $μ.$ In the latter case, it is also equivalent to nuclearity. Nuclearity of the generalized Hilbert operator acting on related spaces, such as the classical Hardy space, is also analyzed.
△ Less
Submitted 24 July, 2024;
originally announced July 2024.
-
Source regions of carbonaceous meteorites and NEOs
Authors:
M. Brož,
P. Vernazza,
M. Marsset,
R. P. Binzel,
F. DeMeo,
M. Birlan,
F. Colas,
S. Anghel,
S. Bouley,
C. Blanpain,
J. Gattacceca,
S. Jeanne,
L. Jorda,
J. Lecubin,
A. Malgoyre,
A. Steinhausser,
J. Vaubaillon,
B. Zanda
Abstract:
The present work aims to determine the source regions of carbonaceous chondrites (CM, CI, CO, CV, CK, CR, CH, CB, or C-ungrouped). We studied 38 individual asteroid families, including young and old ones, and determined their contributions to the NEO populations at metre and kilometre sizes using collisional and orbital models. Our models are in agreement with spectroscopic observations of NEOs, c…
▽ More
The present work aims to determine the source regions of carbonaceous chondrites (CM, CI, CO, CV, CK, CR, CH, CB, or C-ungrouped). We studied 38 individual asteroid families, including young and old ones, and determined their contributions to the NEO populations at metre and kilometre sizes using collisional and orbital models. Our models are in agreement with spectroscopic observations of NEOs, cosmic-ray exposure ages of meteorites, statistics of bolides, infrared emission from dust bands, composition of interplanetary dust particles (IDPs), or abundance of extraterrestrial helium-3. We identified the Veritas, Polana and Eos families as the primary sources of CM/CR, CI and CO/CV/CK chondrites, respectively. Substantial contributions are also expected from CM-like König and CI-like Clarissa, Misa and Hoffmeister families. The source regions of kilometre-sized bodies are generally different. The Adeona family is by far the main source of CM-like NEOs, whereas the Polana (low-i) and Euphrosyne (high-i) families are at the origin of most CI-like NEOs. The Polana family is the likely source of both Ryugu and Bennu. We were able to link spectroscopically and dynamically several NEOs to the Baptistina family. Finally, it appears that the pre-atmospheric flux of carbonaceous chondrites at metre sizes is about the same as that of ordinary chondrites. Given the difference in fall statistics between the two groups (80\% versus 4.4\%), this implies either substantial atmospheric fragmentation of carbonaceous bodies at the level of ${\sim}0.5\,{\rm MPa}$, or destruction by thermal cracking and water desorption. The source regions of most meteorites and kilometre-sized NEOs have now been determined; including some minor classes like enstatite chondrites and achondrites (Nysa, Hungaria), acapulcoites/lodranites (Iannini).
△ Less
Submitted 17 July, 2024; v1 submitted 28 June, 2024;
originally announced June 2024.
-
Scalable Training of Trustworthy and Energy-Efficient Predictive Graph Foundation Models for Atomistic Materials Modeling: A Case Study with HydraGNN
Authors:
Massimiliano Lupo Pasini,
Jong Youl Choi,
Kshitij Mehta,
Pei Zhang,
David Rogers,
Jonghyun Bae,
Khaled Z. Ibrahim,
Ashwin M. Aji,
Karl W. Schulz,
Jorda Polo,
Prasanna Balaprakash
Abstract:
We present our work on developing and training scalable, trustworthy, and energy-efficient predictive graph foundation models (GFMs) using HydraGNN, a multi-headed graph convolutional neural network architecture. HydraGNN expands the boundaries of graph neural network (GNN) computations in both training scale and data diversity. It abstracts over message passing algorithms, allowing both reproduct…
▽ More
We present our work on developing and training scalable, trustworthy, and energy-efficient predictive graph foundation models (GFMs) using HydraGNN, a multi-headed graph convolutional neural network architecture. HydraGNN expands the boundaries of graph neural network (GNN) computations in both training scale and data diversity. It abstracts over message passing algorithms, allowing both reproduction of and comparison across algorithmic innovations that define nearest-neighbor convolution in GNNs. This work discusses a series of optimizations that have allowed scaling up the GFMs training to tens of thousands of GPUs on datasets consisting of hundreds of millions of graphs. Our GFMs use multi-task learning (MTL) to simultaneously learn graph-level and node-level properties of atomistic structures, such as energy and atomic forces. Using over 154 million atomistic structures for training, we illustrate the performance of our approach along with the lessons learned on two state-of-the-art United States Department of Energy (US-DOE) supercomputers, namely the Perlmutter petascale system at the National Energy Research Scientific Computing Center and the Frontier exascale system at Oak Ridge Leadership Computing Facility. The HydraGNN architecture enables the GFM to achieve near-linear strong scaling performance using more than 2,000 GPUs on Perlmutter and 16,000 GPUs on Frontier.
△ Less
Submitted 1 November, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
Effective Layer Pruning Through Similarity Metric Perspective
Authors:
Ian Pons,
Bruno Yamamoto,
Anna H. Reali Costa,
Artur Jordao
Abstract:
Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the field. Extensive research demonstrated that pruning structures from these models is a straightforward approach to reducing network complexity. In this direction,…
▽ More
Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the field. Extensive research demonstrated that pruning structures from these models is a straightforward approach to reducing network complexity. In this direction, most efforts focus on removing weights or filters. Studies have also been devoted to layer pruning as it promotes superior computational gains. However, layer pruning often hurts the network predictive ability (i.e., accuracy) at high compression rates. This work introduces an effective layer-pruning strategy that meets all underlying properties pursued by pruning methods. Our method estimates the relative importance of a layer using the Centered Kernel Alignment (CKA) metric, employed to measure the similarity between the representations of the unpruned model and a candidate layer for pruning. We confirm the effectiveness of our method on standard architectures and benchmarks, in which it outperforms existing layer-pruning strategies and other state-of-the-art pruning techniques. Particularly, we remove more than 75% of computation while improving predictive ability. At higher compression regimes, our method exhibits negligible accuracy drop, while other methods notably deteriorate model accuracy. Apart from these benefits, our pruned models exhibit robustness to adversarial and out-of-distribution samples.
△ Less
Submitted 4 November, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Entropy numbers of Reproducing Hilbert Space of zonal positive definite kernels on compact two-point homogeneous spaces
Authors:
Karina Gonzalez,
Thaís Jordão
Abstract:
We present estimates for the covering numbers of the unit ball of Reproducing Kernel Hilbert Spaces (RKHSs) of functions on $M^d$ a d-dimensional compact two-point homogeneous space. The RKHS is generated by a continuous zonal/isotropic positive definite kernel. We employ the representation in terms of the Schoenberg/Fourier series expansion for continuous isotropic positive definite kernels, give…
▽ More
We present estimates for the covering numbers of the unit ball of Reproducing Kernel Hilbert Spaces (RKHSs) of functions on $M^d$ a d-dimensional compact two-point homogeneous space. The RKHS is generated by a continuous zonal/isotropic positive definite kernel. We employ the representation in terms of the Schoenberg/Fourier series expansion for continuous isotropic positive definite kernels, given in terms of a family of orthogonal polynomials on $M^d$. The bounds we present carry accurate information about the asymptotic constants depending on the dimension of the manifold and the decay or growth rate of the coefficients of the kernel. The results we present extend the estimates previously known for continuous isotropic positive definite kernels on the d-dimensional unit sphere. We present the weak asymptotic equivalence for the order of the growth of covering numbers associated to kernels on $M^d$ with a convergent geometric sequence of coefficients. We apply our estimates in order to present a bound for the covering numbers of the spherical Gaussian kernel, and to present bounds for formal examples on $M^d$.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Power boundedness and related properties for weighted composition operators on $\mathscr{S}(\mathbb{R}^d)$
Authors:
Vicente Asensio,
Enrique Jordá,
Thomas Kalmes
Abstract:
We characterize those pairs $(ψ,\varphi)$ of smooth mappings $ψ:\mathbb{R}^d\rightarrow\mathbb{C},\varphi:\mathbb{R}^d\rightarrow\mathbb{R}^d$ for which the corresponding weighted composition operator $C_{ψ,\varphi}f=ψ\cdot(f\circ\varphi)$ acts continuously on $\mathscr{S}(\mathbb{R}^d)$. Additionally, we give several easy-to-check necessary and sufficient conditions of this property for interesti…
▽ More
We characterize those pairs $(ψ,\varphi)$ of smooth mappings $ψ:\mathbb{R}^d\rightarrow\mathbb{C},\varphi:\mathbb{R}^d\rightarrow\mathbb{R}^d$ for which the corresponding weighted composition operator $C_{ψ,\varphi}f=ψ\cdot(f\circ\varphi)$ acts continuously on $\mathscr{S}(\mathbb{R}^d)$. Additionally, we give several easy-to-check necessary and sufficient conditions of this property for interesting special cases. Moreover, we characterize power boundedness and topologizablity of $C_{ψ,\varphi}$ on $\mathscr{S}(\mathbb{R}^d)$ in terms of $ψ,\varphi$. Among other things, as an application of our results we show that for a univariate polynomial $\varphi$ with $\text{deg}(\varphi)\geq 2$, power boundedness of $C_{ψ,\varphi}$ on $\mathscr{S}(\mathbb{R})$ for every $ψ\in\mathscr{O}_M(\mathbb{R})$ only depends on $\varphi$ and that in this case power boundedness of $C_{ψ,\varphi}$ is equivalent to $(C_{ψ,\varphi}^n)_{n\in\mathbb{N}}$ converging to $0$ in $\mathcal{L}_b(\mathscr{S}(\mathbb{R}))$ as well as to the uniform mean ergodicity of $C_{ψ,\varphi}$. Additionally, we give an example of a power bounded and uniformly mean ergodic weighted composition operator $C_{ψ,\varphi}$ on $\mathscr{S}(\mathbb{R})$ for which neither the multiplication operator $f\mapsto ψf$ nor the composition operator $f\mapsto f\circ\varphi$ acts on $\mathscr{S}(\mathbb{R})$. Our results complement and considerably extend various results of Fernández, Galbis, and the second named author.
△ Less
Submitted 10 November, 2024; v1 submitted 2 May, 2024;
originally announced May 2024.
-
Cesàro operators associated with Borel measures acting on weighted spaces of holomorphic functions with sup-norm
Authors:
Maria José Beltrán Meneu,
José Bonet,
Enrique Jordá
Abstract:
Let $μ$ be a positive finite Borel measure on $[0,1).$ Cesàro-type operators $C_μ$ when acting on weighted spaces of holomorphic functions are investigated. In the case of bounded holomorphic functions on the unit disc we prove that $C_μ$ is continuous if and only if it is compact. In the case of weighted Banach spaces of holomorphic function defined by general weights, we give sufficient and nece…
▽ More
Let $μ$ be a positive finite Borel measure on $[0,1).$ Cesàro-type operators $C_μ$ when acting on weighted spaces of holomorphic functions are investigated. In the case of bounded holomorphic functions on the unit disc we prove that $C_μ$ is continuous if and only if it is compact. In the case of weighted Banach spaces of holomorphic function defined by general weights, we give sufficient and necessary conditions for the continuity and compactness. For standard weights, we characterize the continuity and compactness on classical growth Banach spaces of holomorphic functions. We also study the point spectrum and the spectrum of $C_μ$ on the space of holomorphic functions on the disc, on the space of bounded holomorphic functions on the disc, and on the classical growth Banach spaces of holomorphic functions. All characterizations are given in terms of the sequence of moments $(μ_n)_{n\in\N_0}$. The continuity, compactness and spectrum of $C_μ$ acting on Fréchet and (LB) Korenblum type spaces are also considered.
△ Less
Submitted 21 October, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Large-Area Spatially Ordered Mesa Top Single Quantum Dots: Suitable Single Photon Emitters for On-Chip Integrated Quantum Information Processing Platforms
Authors:
Qi Huang,
Lucas Jordao,
Siyuan Lu,
Swarnabha Chattaraj,
Jiefei Zhang,
Anupam Madhukar
Abstract:
Realization of the long sought on-chip scalable photonic quantum information processing networks has been thwarted by the absence of spatially-ordered and scalable on-demand single photon emitters with emission figures-of-merit exceeding the required thresholds across large numbers. The positioning must meet the required degree of accuracy that enables fabricating their interconnection to create t…
▽ More
Realization of the long sought on-chip scalable photonic quantum information processing networks has been thwarted by the absence of spatially-ordered and scalable on-demand single photon emitters with emission figures-of-merit exceeding the required thresholds across large numbers. The positioning must meet the required degree of accuracy that enables fabricating their interconnection to create the desired functional network. Here we report on the realization of large-area spatially-ordered arrays of mesa-top single quantum dots (MTSQDs) that are demonstrated [1] to be on-demand single photon emitters with characteristics that meet the requirements for implementing quantum photonic circuits/platforms aimed at quantum key distribution, linear optical quantum computing, simulations of quantum many-body problems, and metrology/sensing. The reported GaAs/InGaAs/GaAs MTSQD arrays, grown via SESRE (substrate-encoded size-reducing epitaxy) are in multiple arrays of up to 100x100 with 5um pitch, across a centimeter radius area. We show illustrative large-area images of the emission intensity (brightness) and color-coded wavelength distribution exhibiting ~3.35nm standard deviation. Scanning transmission electron microscopy shows a remarkable control on the QD location to within ~3nm accuracy laterally and ~1nm vertically. The primary remaining challenge is the control on the uniformity of the currently wet-chemically etched as-patterned nanomesa lateral size across the substrate, a surmountable technical issue. Thus, SESRE offers the most promising approach to realizing on-chip scalable spatially-ordered arrays of on-demand bright single quantum emitters meeting the figures-of-merit required for on-chip fully integrated quantum photonic circuit platforms-monolithic (such as based upon AlGaAs on insulator) or hybrid that leverage the silicon-on-insulator (SOI) photonic integrated circuit (PIC).
△ Less
Submitted 31 December, 2023; v1 submitted 22 December, 2023;
originally announced December 2023.
-
Large-Scale Multi-Hypotheses Cell Tracking Using Ultrametric Contours Maps
Authors:
Jordão Bragantini,
Merlin Lange,
Loïc Royer
Abstract:
In this work, we describe a method for large-scale 3D cell-tracking through a segmentation selection approach. The proposed method is effective at tracking cells across large microscopy datasets on two fronts: (i) It can solve problems containing millions of segmentation instances in terabyte-scale 3D+t datasets; (ii) It achieves competitive results with or without deep learning, which requires 3D…
▽ More
In this work, we describe a method for large-scale 3D cell-tracking through a segmentation selection approach. The proposed method is effective at tracking cells across large microscopy datasets on two fronts: (i) It can solve problems containing millions of segmentation instances in terabyte-scale 3D+t datasets; (ii) It achieves competitive results with or without deep learning, which requires 3D annotated data, that is scarce in the fluorescence microscopy field. The proposed method computes cell tracks and segments using a hierarchy of segmentation hypotheses and selects disjoint segments by maximizing the overlap between adjacent frames. We show that this method achieves state-of-the-art results in 3D images from the cell tracking challenge and has a faster integer linear programming formulation. Moreover, our framework is flexible and supports segmentations from off-the-shelf cell segmentation models and can combine them into an ensemble that improves tracking. The code is available https://github.com/royerlab/ultrack.
△ Less
Submitted 11 April, 2024; v1 submitted 8 August, 2023;
originally announced August 2023.
-
Inference for Local Projections
Authors:
Atsushi Inoue,
Òscar Jordà,
Guido M. Kuersteiner
Abstract:
Inference for impulse responses estimated with local projections presents interesting challenges and opportunities. Analysts typically want to assess the precision of individual estimates, explore the dynamic evolution of the response over particular regions, and generally determine whether the impulse generates a response that is any different from the null of no effect. Each of these goals requi…
▽ More
Inference for impulse responses estimated with local projections presents interesting challenges and opportunities. Analysts typically want to assess the precision of individual estimates, explore the dynamic evolution of the response over particular regions, and generally determine whether the impulse generates a response that is any different from the null of no effect. Each of these goals requires a different approach to inference. In this article, we provide an overview of results that have appeared in the literature in the past 20 years along with some new procedures that we introduce here.
△ Less
Submitted 13 August, 2024; v1 submitted 5 June, 2023;
originally announced June 2023.
-
Sharp estimates for the covering numbers of the Weierstrass fractal kernel
Authors:
K. Gonzalez,
D. Azevedo,
T. Jordão
Abstract:
In this paper, we present sharp estimates for the covering numbers of the embedding of the reproducing kernel Hilbert space (RKHS) associated with the Weierstrass fractal kernel into the space of continuous functions. The method we apply is based on the characterization of the infinite-dimensional RKHS generated by the Weierstrass fractal kernel and it requires estimates for the norm operator of o…
▽ More
In this paper, we present sharp estimates for the covering numbers of the embedding of the reproducing kernel Hilbert space (RKHS) associated with the Weierstrass fractal kernel into the space of continuous functions. The method we apply is based on the characterization of the infinite-dimensional RKHS generated by the Weierstrass fractal kernel and it requires estimates for the norm operator of orthogonal projections on the RKHS.
△ Less
Submitted 27 April, 2023;
originally announced April 2023.
-
A close look at the entropy numbers of the unit ball of the Reproducing Hilbert Space of isotropic positive definite kernels
Authors:
T. Jordão,
K. Gonzalez
Abstract:
We present accurate upper and lower bounds for the covering numbers, with explicit constants, of the unit ball for two general classes of Reproducing Kernel Hilbert Space (RKHS) on the unit sphere of $\mathbb{R}^{d+1}$. In both classes, the RKHS is generated by an isotropic continuous positive definite kernel. The upper and lower bounds we present carry precise information about the asymptotic con…
▽ More
We present accurate upper and lower bounds for the covering numbers, with explicit constants, of the unit ball for two general classes of Reproducing Kernel Hilbert Space (RKHS) on the unit sphere of $\mathbb{R}^{d+1}$. In both classes, the RKHS is generated by an isotropic continuous positive definite kernel. The upper and lower bounds we present carry precise information about the asymptotic constants, depending on the dimension of the sphere and the monotonic behavior of the Schoenberg/Fourier coefficients of the isotropic kernel.
△ Less
Submitted 12 November, 2023; v1 submitted 27 April, 2023;
originally announced April 2023.
-
When Layers Play the Lottery, all Tickets Win at Initialization
Authors:
Artur Jordao,
George Correa de Araujo,
Helena de Almeida Maia,
Helio Pedrini
Abstract:
Pruning is a standard technique for reducing the computational cost of deep networks. Many advances in pruning leverage concepts from the Lottery Ticket Hypothesis (LTH). LTH reveals that inside a trained dense network exists sparse subnetworks (tickets) able to achieve similar accuracy (i.e., win the lottery - winning tickets). Pruning at initialization focuses on finding winning tickets without…
▽ More
Pruning is a standard technique for reducing the computational cost of deep networks. Many advances in pruning leverage concepts from the Lottery Ticket Hypothesis (LTH). LTH reveals that inside a trained dense network exists sparse subnetworks (tickets) able to achieve similar accuracy (i.e., win the lottery - winning tickets). Pruning at initialization focuses on finding winning tickets without training a dense network. Studies on these concepts share the trend that subnetworks come from weight or filter pruning. In this work, we investigate LTH and pruning at initialization from the lens of layer pruning. First, we confirm the existence of winning tickets when the pruning process removes layers. Leveraged by this observation, we propose to discover these winning tickets at initialization, eliminating the requirement of heavy computational resources for training the initial (over-parameterized) dense network. Extensive experiments show that our winning tickets notably speed up the training phase and reduce up to 51% of carbon emission, an important step towards democratization and green Artificial Intelligence. Beyond computational benefits, our winning tickets exhibit robustness against adversarial and out-of-distribution examples. Finally, we show that our subnetworks easily win the lottery at initialization while tickets from filter removal (the standard structured LTH) hardly become winning tickets.
△ Less
Submitted 19 March, 2024; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Uniformly ergodic probability measures
Authors:
Jorge Galindo,
Enrique Jordá,
Alberto Rodríguez-Arenas
Abstract:
Let $G$ be a locally compact group and $μ$ be a probability measure on $G$. We consider the convolution operator $λ_1(μ)\colon L_1(G)\to L_1(G)$ given by $λ_1(μ)f=μ\ast f$ and its restriction $λ_1^0(μ)$ to the augmentation ideal $L_1^0(G)$. Say that $μ$ is uniformly ergodic if the Cesàro means of the operator $λ_1^0(μ)$ converge uniformly to 0, that is, if $λ_1^0(μ)$ is a uniformly mean ergodic op…
▽ More
Let $G$ be a locally compact group and $μ$ be a probability measure on $G$. We consider the convolution operator $λ_1(μ)\colon L_1(G)\to L_1(G)$ given by $λ_1(μ)f=μ\ast f$ and its restriction $λ_1^0(μ)$ to the augmentation ideal $L_1^0(G)$. Say that $μ$ is uniformly ergodic if the Cesàro means of the operator $λ_1^0(μ)$ converge uniformly to 0, that is, if $λ_1^0(μ)$ is a uniformly mean ergodic operator with limit 0 and that $μ$ is uniformly completely mixing if the powers of the operator $λ_1^0(μ)$ converge uniformly to 0.
We completely characterize the uniform mean ergodicity of the operator $λ_1(μ)$ and the uniform convergence of its powers and see that there is no difference between $λ_1(μ)$ and $λ_1^0(μ) $ in this regard. We prove in particular that $μ$ is uniformly ergodic if and only if $G$ is compact, $μ$ is adapted (its support is not contained in a proper closed subgroup of $G$) and 1 is an isolated point of the spectrum of $μ$. The last of these three conditions is actually equivalent to $μ$ being spread-out (some convolution power of $μ$ is not singular). The measure $μ$ is uniformly completely mixing if and only if $G$ is compact, $μ$ is spread-out and the only unimodular value of the spectrum of $μ$ is 1.
△ Less
Submitted 13 December, 2023; v1 submitted 24 January, 2023;
originally announced January 2023.
-
On the spectrum of isomorphisms defined on the space of smooth functions which are flat at 0
Authors:
Enrique Jordá
Abstract:
In this note we study the spectrum and the Waelbroeck spectrum of the derivative operator composed with isomorphic multiplication oper
In this note we study the spectrum and the Waelbroeck spectrum of the derivative operator composed with isomorphic multiplication oper
△ Less
Submitted 23 January, 2023;
originally announced January 2023.
-
Activity distribution of comet 67P/Churyumov-Gerasimenko from combined measurements of non-gravitational forces and torques
Authors:
Nicholas Attree,
Laurent Jorda,
Olivier Groussin,
Jessica Agarwal,
Riccardo Lasagni Manghi,
Paolo Tortora,
Marco Zannoni,
Raphael Marschall
Abstract:
Aims. Understanding the activity is vital for deciphering the structure, formation, and evolution of comets. We investigate models of cometary activity by comparing them to the dynamics of 67P/Churyumov-Gerasimenko. Methods. We matched simple thermal models of water activity to the combined Rosetta datasets by fitting to the total outgassing rate and four components of the outgassing induced non-g…
▽ More
Aims. Understanding the activity is vital for deciphering the structure, formation, and evolution of comets. We investigate models of cometary activity by comparing them to the dynamics of 67P/Churyumov-Gerasimenko. Methods. We matched simple thermal models of water activity to the combined Rosetta datasets by fitting to the total outgassing rate and four components of the outgassing induced non-gravitational force and torque, with a final manual adjustment of the model parameters to additionally match the other two torque components. We parametrised the thermal model in terms of a distribution of relative activity over the surface of the comet, and attempted to link this to different terrain types. We also tested a more advanced thermal model based on a pebble structure. Results. We confirm a hemispherical dichotomy and non-linear water outgassing response to insolation. The southern hemisphere of the comet and consolidated terrain show enhanced activity relative to the northern hemisphere and dust-covered, unconsolidated terrain types, especially at perihelion. We further find that the non-gravitational torque is especially sensitive to the activity distribution, and to fit the pole-axis orientation in particular, activity must be concentrated (in excess of the already high activity in the southern hemisphere and consolidated terrain) around the south pole and on the body and neck of the comet over its head. This is the case for both the simple thermal model and the pebble-based model. Overall, our results show that water activity cannot be matched by a simple model of sublimating surface ice driven by the insolation alone, regardless of the surface distribution, and that both local spatial and temporal variations are needed to fit the data.
△ Less
Submitted 12 January, 2023;
originally announced January 2023.
-
The equilibrium shape of (65) Cybele: primordial or relic of a large impact?
Authors:
M. Marsset,
M. Brož,
J. Vermersch,
N. Rambaux,
M. Ferrais,
M. Viikinkoski,
J. Hanuš,
E. Jehin,
E. Podlewska-Gaca,
P. Bartczak,
G. Dudzinski,
B. Carry,
P. Vernazza,
R. Szakáts,
R. Duffard,
A. Jones,
D. Molina,
T. Santana-Ros,
Z. Benkhaldoun,
M. Birlan,
C. Dumas,
R. Fétick,
T. Fusco,
L. Jorda,
F. Marchis
, et al. (2 additional authors not shown)
Abstract:
Cybele asteroids constitute an appealing reservoir of primitive material genetically linked to the outer Solar System, and the physical properties of the largest members can be readily accessed by large telescopes. We took advantage of the bright apparition of (65) Cybele in July and August 2021 to acquire high-angular-resolution images and optical light curves of the asteroid with which we aim to…
▽ More
Cybele asteroids constitute an appealing reservoir of primitive material genetically linked to the outer Solar System, and the physical properties of the largest members can be readily accessed by large telescopes. We took advantage of the bright apparition of (65) Cybele in July and August 2021 to acquire high-angular-resolution images and optical light curves of the asteroid with which we aim to analyse its shape and bulk properties. 7 series of images acquired with VLT/SPHERE were combined with optical light curves to reconstruct the shape of the asteroid using the ADAM, MPCD, and SAGE algorithms. The origin of the shape was investigated by means of N-body simulations. Cybele has a volume-equivalent diameter of 263+/-3km and a bulk density of 1.55+/-0.19g.cm-3. Notably, its shape and rotation state are closely compatible with those of a Maclaurin equilibrium figure. The lack of a collisional family associated with Cybele and the higher bulk density of that body with respect to other large P-type asteroids suggest that it never experienced any large disruptive impact followed by rapid re-accumulation. This would imply that its present-day shape represents the original one. However, numerical integration of the long-term dynamical evolution of a hypothetical family shows that it is dispersed by gravitational perturbations and chaotic diffusion over Gyrs of evolution. The very close match between Cybele and an equilibrium figure opens up the possibility that D>260km small bodies from the outer Solar System all formed at equilibrium. However, we cannot rule out an old impact as the origin of the equilibrium shape. Cybele itself is found to be dynamically unstable, implying that it was recently (<1Ga) placed on its current orbit either through slow diffusion from a relatively stable orbit in the Cybele region or, less likely, from an unstable, JFC orbit in the planet-crossing region.
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
Fitting a Collider in a Quantum Computer: Tackling the Challenges of Quantum Machine Learning for Big Datasets
Authors:
Miguel Caçador Peixoto,
Nuno Filipe Castro,
Miguel Crispim Romão,
Maria Gabriela Jordão Oliveira,
Inês Ochoa
Abstract:
Current quantum systems have significant limitations affecting the processing of large datasets with high dimensionality, typical of high energy physics. In the present paper, feature and data prototype selection techniques were studied to tackle this challenge. A grid search was performed and quantum machine learning models were trained and benchmarked against classical shallow machine learning m…
▽ More
Current quantum systems have significant limitations affecting the processing of large datasets with high dimensionality, typical of high energy physics. In the present paper, feature and data prototype selection techniques were studied to tackle this challenge. A grid search was performed and quantum machine learning models were trained and benchmarked against classical shallow machine learning methods, trained both in the reduced and the complete datasets. The performance of the quantum algorithms was found to be comparable to the classical ones, even when using large datasets. Sequential Backward Selection and Principal Component Analysis techniques were used for feature's selection and while the former can produce the better quantum machine learning models in specific cases, it is more unstable. Additionally, we show that such variability in the results is caused by the use of discrete variables, highlighting the suitability of Principal Component analysis transformed data for quantum machine learning applications in the high energy physics context.
△ Less
Submitted 6 December, 2023; v1 submitted 6 November, 2022;
originally announced November 2022.
-
The Next-Generation OS Process Abstraction
Authors:
Rodrigo Siqueira,
Nelson Lago,
Fabio Kon,
Dejan Milojičić
Abstract:
Operating Systems are built upon a set of abstractions to provide resource management and programming APIs for common functionality, such as synchronization, communication, protection, and I/O. The process abstraction is the bridge across these two aspects; unsurprisingly, research efforts pay particular attention to the process abstraction, aiming at enhancing security, improving performance, and…
▽ More
Operating Systems are built upon a set of abstractions to provide resource management and programming APIs for common functionality, such as synchronization, communication, protection, and I/O. The process abstraction is the bridge across these two aspects; unsurprisingly, research efforts pay particular attention to the process abstraction, aiming at enhancing security, improving performance, and supporting hardware innovations. However, given the intrinsic difficulties to implement modifications at the OS level, recent endeavors have not yet been widely adopted in production-oriented OSes. Still, we believe the current hardware evolution and new application requirements provide favorable conditions to change this trend. This paper evaluates recent research on OS process features identifying potential evolution paths. We derive a set of relevant process characteristics, and propose how to extend them as to benefit OSes and applications.
△ Less
Submitted 24 May, 2022;
originally announced May 2022.
-
Dynamics of composition operators on function spaces defined by local and global properties
Authors:
Angela A. Albanese,
Enrique Jordá,
Claudio Mele
Abstract:
In this paper we consider composition operators on locally convex spaces of functions defined on $\mathbb{R}$. We prove results concerning supercyclicity, power boundedness, mean ergodicity and convergence of the iterates in the strong operator topology.
In this paper we consider composition operators on locally convex spaces of functions defined on $\mathbb{R}$. We prove results concerning supercyclicity, power boundedness, mean ergodicity and convergence of the iterates in the strong operator topology.
△ Less
Submitted 21 March, 2022; v1 submitted 16 December, 2021;
originally announced December 2021.
-
Large-Scale Video Analytics through Object-Level Consolidation
Authors:
Daniel Rivas,
Francesc Guim,
Jordà Polo,
David Carrera
Abstract:
As the number of installed cameras grows, so do the compute resources required to process and analyze all the images captured by these cameras. Video analytics enables new use cases, such as smart cities or autonomous driving. At the same time, it urges service providers to install additional compute resources to cope with the demand while the strict latency requirements push compute towards the e…
▽ More
As the number of installed cameras grows, so do the compute resources required to process and analyze all the images captured by these cameras. Video analytics enables new use cases, such as smart cities or autonomous driving. At the same time, it urges service providers to install additional compute resources to cope with the demand while the strict latency requirements push compute towards the end of the network, forming a geographically distributed and heterogeneous set of compute locations, shared and resource-constrained. Such landscape (shared and distributed locations) forces us to design new techniques that can optimize and distribute work among all available locations and, ideally, make compute requirements grow sublinearly with respect to the number of cameras installed. In this paper, we present FoMO (Focus on Moving Objects). This method effectively optimizes multi-camera deployments by preprocessing images for scenes, filtering the empty regions out, and composing regions of interest from multiple cameras into a single image that serves as input for a pre-trained object detection model. Results show that overall system performance can be increased by 8x while accuracy improves 40% as a by-product of the methodology, all using an off-the-shelf pre-trained model with no additional training or fine-tuning.
△ Less
Submitted 30 November, 2021;
originally announced November 2021.
-
Observed tidal evolution of Kleopatra's outer satellite
Authors:
M. Brož,
J. Ďurech,
B. Carry,
F. Vachier,
F. Marchis,
J. Hanuš,
L. Jorda,
P. Vernazza,
D. Vokrouhlický,
M. Walterová,
R. Behrend
Abstract:
The orbit of the outer satellite Alexhelios of (216) Kleopatra is already constrained by adaptive-optics astrometry, obtained with the VLT/SPHERE instrument. However, there is also a preceding occultation event in 1980 attributed to this satellite. Hereinafter, we try to link all observations, spanning 1980--2018. We find the nominal orbit exhibits an unexplained shift by $+60^\circ$ in the true l…
▽ More
The orbit of the outer satellite Alexhelios of (216) Kleopatra is already constrained by adaptive-optics astrometry, obtained with the VLT/SPHERE instrument. However, there is also a preceding occultation event in 1980 attributed to this satellite. Hereinafter, we try to link all observations, spanning 1980--2018. We find the nominal orbit exhibits an unexplained shift by $+60^\circ$ in the true longitude. Using both periodogram analysis and an $\ell = 10$ multipole model suitable for the motion of mutually interacting moons about the irregular body, we confirmed that it is not possible to adjust the respective osculating period $P_2$. Instead, we were forced to use a model with tidal dissipation (and increasing orbital periods) to explain the shift. We also analyzed light curves, spanning 1977--2021, and searched for the expected spin deceleration of Kleopatra. According to our best-fit model, the observed period rate is $\dot P_2 = (1.8\pm 0.1)\cdot 10^{-8}\,{\rm d}\,{\rm d}^{-1}$ and the corresponding time lag $Δt_2 = 42\,{\rm s}$ of tides, for the assumed value of the Love number $k_2 = 0.3$. It is the first detection of tidal evolution for moons orbiting 100-km asteroids. The corresponding dissipation factor $Q$ is comparable with other terrestrial bodies, albeit at a higher loading frequency $2|ω-n|$. We also predict a secular evolution of the inner moon, $\dot P_1 = 5.0\cdot 10^{-8}$, as well as a spin deceleration of Kleopatra, $\dot P_0 = 1.9\cdot 10^{-12}$. In alternative models, with moons captured in the 3:2 mean-motion resonance or more massive moons, the respective values of $Δt_2$ are a factor of 2--3 lower. Future astrometric observations by direct imaging or occultations should allow to distinguish between these models, which is important for the internal structure and mechanical properties of (216) Kleopatra.
△ Less
Submitted 25 October, 2021;
originally announced October 2021.
-
Digital Twin Earth -- Coasts: Developing a fast and physics-informed surrogate model for coastal floods via neural operators
Authors:
Peishi Jiang,
Nis Meinert,
Helga Jordão,
Constantin Weisser,
Simon Holgate,
Alexander Lavin,
Björn Lütjens,
Dava Newman,
Haruko Wainwright,
Catherine Walker,
Patrick Barnard
Abstract:
Developing fast and accurate surrogates for physics-based coastal and ocean models is an urgent need due to the coastal flood risk under accelerating sea level rise, and the computational expense of deterministic numerical models. For this purpose, we develop the first digital twin of Earth coastlines with new physics-informed machine learning techniques extending the state-of-art Neural Operator.…
▽ More
Developing fast and accurate surrogates for physics-based coastal and ocean models is an urgent need due to the coastal flood risk under accelerating sea level rise, and the computational expense of deterministic numerical models. For this purpose, we develop the first digital twin of Earth coastlines with new physics-informed machine learning techniques extending the state-of-art Neural Operator. As a proof-of-concept study, we built Fourier Neural Operator (FNO) surrogates on the simulations of an industry-standard flood and ocean model (NEMO). The resulting FNO surrogate accurately predicts the sea surface height in most regions while achieving upwards of 45x acceleration of NEMO. We delivered an open-source \textit{CoastalTwin} platform in an end-to-end and modular way, to enable easy extensions to other simulations and ML-based surrogate methods. Our results and deliverable provide a promising approach to massively accelerate coastal dynamics simulators, which can enable scientists to efficiently execute many simulations for decision-making, uncertainty quantification, and other research activities.
△ Less
Submitted 13 October, 2021;
originally announced October 2021.
-
Sharp estimates for the covering numbers of the Weierstrass fractal kernel
Authors:
Douglas Azevedo,
Karina Gonzalez,
Thais Jordão
Abstract:
In this paper, we use the infamous continuous and nowhere differentiable Weierstrass function as a prototype to define a Weierstrass fractal kernel. We investigate the properties of the reproducing kernel Hilbert space (RKHS) associated with this kernel by presenting an explicit characterization of this space. In particular, we show that this space has a dense subset composed of continuous but now…
▽ More
In this paper, we use the infamous continuous and nowhere differentiable Weierstrass function as a prototype to define a Weierstrass fractal kernel. We investigate the properties of the reproducing kernel Hilbert space (RKHS) associated with this kernel by presenting an explicit characterization of this space. In particular, we show that this space has a dense subset composed of continuous but nowhere differentiable functions. Moreover, we present sharp estimates for the covering numbers of the unit ball of this space as a subset of the continuous functions.
△ Less
Submitted 9 October, 2021;
originally announced October 2021.
-
(216) Kleopatra, a low density critically rotating M-type asteroid
Authors:
F. Marchis,
L. Jorda,
P. Vernazza,
M. Brož,
J. Hanuš,
M. Ferrais,
F. Vachier,
N. Rambaux,
M. Marsset,
M. Viikinkoski,
E. Jehin,
S. Benseguane,
E. Podlewska-Gaca,
B. Carry,
A. Drouard,
S. Fauvaud,
M. Birlan,
J. Berthier,
P. Bartczak,
C. Dumas,
G. Dudzinski,
J. Durech,
J. Castillo-Rogez,
F. Cipriani,
F. Colas
, et al. (14 additional authors not shown)
Abstract:
Context. The recent estimates of the 3D shape of the M/Xe-type triple asteroid system (216) Kleopatra indicated a density of 5 g.cm$^{-3}$. Such a high density implies a high metal content and a low porosity which is not easy to reconcile with its peculiar dumbbell shape. Aims. Given the unprecedented angular resolution of the VLT/SPHERE/ZIMPOL camera, we aim to constrain the mass and the shape of…
▽ More
Context. The recent estimates of the 3D shape of the M/Xe-type triple asteroid system (216) Kleopatra indicated a density of 5 g.cm$^{-3}$. Such a high density implies a high metal content and a low porosity which is not easy to reconcile with its peculiar dumbbell shape. Aims. Given the unprecedented angular resolution of the VLT/SPHERE/ZIMPOL camera, we aim to constrain the mass and the shape of Kleopatra with high accuracy, hence its density. Methods. We combined our new VLT/SPHERE observations of Kleopatra recorded in 2017 and 2018 with archival data, as well as lightcurve, occultation, and delay-Doppler images, to derive its 3D shape model using two different algorithms (ADAM, MPCD). Furthermore, an N-body dynamical model allowed us to retrieve the orbital elements of the two moons as explained in the accompanying paper. Results. The shape of Kleopatra is very close to an equilibrium dumbbell figure with two lobes and a thick neck. Its volume equivalent diameter (118.75$\pm$1.40) km and mass (2.97$\pm$0.32) 10$^{18}$ kg imply a bulk density of (3.38$\pm$0.50) g cm$^{-3}$. Such a low density for a supposedly metal-rich body indicates a substantial porosity within the primary. This porous structure along with its near-equilibrium shape is compatible with a formation scenario including a giant impact followed by reaccumulation. Kleopatra's current rotation period and dumbbell shape imply that it is in a critically rotating state. The low effective gravity along the equator of the body, together with the equatorial orbits of the moons and possibly rubble-pile structure, opens the possibility that the moons formed via mass shedding. Conclusions. Kleopatra is a puzzling multiple system due to the unique characteristics of the primary. It deserves particular attention in the future, with the Extremely Large Telescopes and possibly a dedicated space mission.
△ Less
Submitted 16 August, 2021;
originally announced August 2021.
-
On the Effect of Pruning on Adversarial Robustness
Authors:
Artur Jordao,
Helio Pedrini
Abstract:
Pruning is a well-known mechanism for reducing the computational cost of deep convolutional networks. However, studies have shown the potential of pruning as a form of regularization, which reduces overfitting and improves generalization. We demonstrate that this family of strategies provides additional benefits beyond computational performance and generalization. Our analyses reveal that pruning…
▽ More
Pruning is a well-known mechanism for reducing the computational cost of deep convolutional networks. However, studies have shown the potential of pruning as a form of regularization, which reduces overfitting and improves generalization. We demonstrate that this family of strategies provides additional benefits beyond computational performance and generalization. Our analyses reveal that pruning structures (filters and/or layers) from convolutional networks increase not only generalization but also robustness to adversarial images (natural images with content modified). Such achievements are possible since pruning reduces network capacity and provides regularization, which have been proven effective tools against adversarial images. In contrast to promising defense mechanisms that require training with adversarial images and careful regularization, we show that pruning obtains competitive results considering only natural images (e.g., the standard and low-cost training). We confirm these findings on several adversarial attacks and architectures; thus suggesting the potential of pruning as a novel defense mechanism against adversarial images.
△ Less
Submitted 24 November, 2021; v1 submitted 10 August, 2021;
originally announced August 2021.
-
On chip scalable highly pure and indistinguishable single photon sources in ordered arrays: Path to Quantum Optical Circuits
Authors:
Jiefei Zhang,
Swarnabha Chattaraj,
Qi Huang,
Lucas Jordao,
Siyuan Lu,
Anupam Madhukar
Abstract:
Realization of quantum optical circuits is at the heart of quantum photonic information processing. A long-standing obstacle however has been the absence of a platform of single photon sources (SPSs) that simultaneously satisfies the following required characteristics: spatially ordered SPS arrays that produce, on-demand, highly pure, and indistinguishable single photons with sufficiently uniform…
▽ More
Realization of quantum optical circuits is at the heart of quantum photonic information processing. A long-standing obstacle however has been the absence of a platform of single photon sources (SPSs) that simultaneously satisfies the following required characteristics: spatially ordered SPS arrays that produce, on-demand, highly pure, and indistinguishable single photons with sufficiently uniform emission characteristics across the array, needed for controlled interference between photons from distinct sources to enable functional quantum optical networks. Here we report on such a platform of SPSs based upon a novel class of epitaxial quantum dots. Under resonant excitation, the SPSs (without Purcell enhancement) show single photon purity >99% (g(2)(0) ~ 0.015), high two-photon Hong-Ou-Mandel interference visibilities of 0.82 $\pm$ 0.03 (at 11.5K), and spectral nonuniformity <3nm - within established locally tunable technology. Our platform of SPSs paves the path to creating on-chip scalable quantum photonic systems.
△ Less
Submitted 7 November, 2021; v1 submitted 3 August, 2021;
originally announced August 2021.
-
An advanced multipole model for (216) Kleopatra triple system
Authors:
M. Brož,
F. Marchis,
L. Jorda,
J. Hanuš,
P. Vernazza,
M. Ferrais,
F. Vachier,
N. Rambaux,
M. Marsset,
M. Viikinkoski,
E. Jehin,
S. Benseguane,
E. Podlewska-Gaca,
B. Carry,
A. Drouard,
S. Fauvaud,
M. Birlan,
J. Berthier,
P. Bartczak,
C. Dumas,
G. Dudziński,
J. Ďurech,
J. Castillo-Rogez,
F. Cipriani,
F. Colas
, et al. (15 additional authors not shown)
Abstract:
To interpret adaptive-optics observations of (216) Kleopatra, we need to describe an evolution of multiple moons, orbiting an extremely irregular body and including their mutual interactions. Such orbits are generally non-Keplerian and orbital elements are not constants. Consequently, we use a modified $N$-body integrator, which was significantly extended to include the multipole expansion of the…
▽ More
To interpret adaptive-optics observations of (216) Kleopatra, we need to describe an evolution of multiple moons, orbiting an extremely irregular body and including their mutual interactions. Such orbits are generally non-Keplerian and orbital elements are not constants. Consequently, we use a modified $N$-body integrator, which was significantly extended to include the multipole expansion of the gravitational field up to the order $\ell = 10$. Its convergence was verified against the `brute-force' algorithm. We computed the coefficients $C_{\ell m},S_{\!\ell m}$ for Kleopatra's shape, assuming a~constant bulk density. For solar-system applications, it was also necessary to implement a variable distance and geometry of observations. Our $χ^2$ metric then accounts for the absolute astrometry, the relative astrometry (2nd moon with respect to 1st), angular velocities, and also silhouettes, constraining the pole orientation. This allowed us to derive the orbital elements of Kleopatra's two moons. Using both archival astrometric data and new VLT/SPHERE observations (ESO LP 199.C-0074), we were able to identify the true periods of the moons, $P_1 = (1.822359\pm0.004156)\,{\rm d}$, $P_2 = (2.745820\pm0.004820)\,{\rm d}$. They orbit very close to the 3:2 mean-motion resonance, but their osculating eccentricities are too small compared to other perturbations (multipole, mutual), so that regular librations of the critical argument are not present. The resulting mass of Kleopatra, $m_1 = (1.49\pm0.16)\cdot10^{-12}\,M_\odot$ or $2.97\cdot10^{18}\,{\rm kg}$, is significantly lower than previously thought. An implication explained in the accompanying paper (Marchis et al.) is that (216) Kleopatra is a critically rotating body.
△ Less
Submitted 19 May, 2021;
originally announced May 2021.
-
Ancient and present surface evolution processes in the Ash regionof comet 67P/Churyumov-Gerasimenko
Authors:
A. Bouquety,
L. Jorda,
O. Groussin,
A. Sejourné,
S. Bouley,
F. Costard
Abstract:
The Rosetta mission provided us with detailed data of the surface of the nucleus of comet 67P/Churyumov-Gerasimenko.In order to better understand the physical processes associated with the comet activity and the surface evolution of its nucleus, we performed a detailed comparative morphometrical analysis of two depressions located in the Ash region. To detect morphological temporal changes, we com…
▽ More
The Rosetta mission provided us with detailed data of the surface of the nucleus of comet 67P/Churyumov-Gerasimenko.In order to better understand the physical processes associated with the comet activity and the surface evolution of its nucleus, we performed a detailed comparative morphometrical analysis of two depressions located in the Ash region. To detect morphological temporal changes, we compared pre- and post-perihelion high-resolution (pixel scale of 0.07-1.75 m) OSIRIS images of the two depressions. We quantified the changes using the dynamic heights and the gravitational slopes calculated from the Digital Terrain Model (DTM) of the studied area using the ArcGIS software before and after perihelion. Our comparative morphometrical analysis allowed us to detect and quantify the temporal changes that occurred in two depressions of the Ash region during the last perihelion passage. We find that the two depressions grew by several meters. The area of the smallest depression (structure I) increased by 90+/-20%, with two preferential growths: one close to the cliff associated with the apparition of new boulders at its foot, and a second one on the opposite side of the cliff. The largest depression (structure II) grew in all directions, increasing in area by 20+/-5%, and no new deposits have been detected. We interpreted these two depression changes as being driven by the sublimation of ices, which explains their global growth and which can also trigger landslides. The deposits associated with depression II reveal a stair-like topography, indicating that they have accumulated during several successive landslides from different perihelion passages. Overall, these observations bring additional evidence of complex active processes and reshaping events occurring on short timescales, such as depression growth and landslides, and on longer timescales, such as cliff retreat.
△ Less
Submitted 28 April, 2021;
originally announced April 2021.
-
Towards Automatic Model Specialization for Edge Video Analytics
Authors:
Daniel Rivas,
Francesc Guim,
Jordà Polo,
Pubudu M. Silva,
Josep Ll. Berral,
David Carrera
Abstract:
Judging by popular and generic computer vision challenges, such as the ImageNet or PASCAL VOC, neural networks have proven to be exceptionally accurate in recognition tasks. However, state-of-the-art accuracy often comes at a high computational price, requiring hardware acceleration to achieve real-time performance, while use cases, such as smart cities, require images from fixed cameras to be ana…
▽ More
Judging by popular and generic computer vision challenges, such as the ImageNet or PASCAL VOC, neural networks have proven to be exceptionally accurate in recognition tasks. However, state-of-the-art accuracy often comes at a high computational price, requiring hardware acceleration to achieve real-time performance, while use cases, such as smart cities, require images from fixed cameras to be analyzed in real-time. Due to the amount of network bandwidth these streams would generate, we cannot rely on offloading compute to a centralized cloud. Thus, a distributed edge cloud is expected to process images locally. However, the edge is, by nature, resource-constrained, which puts a limit on the computational complexity that can execute. Yet, there is a need for a meeting point between the edge and accurate real-time video analytics. Specializing lightweight models on a per-camera basis may help but it quickly becomes unfeasible as the number of cameras grows unless the process is automated. In this paper, we present and evaluate COVA (Contextually Optimized Video Analytics), a framework to assist in the automatic specialization of models for video analytics in edge cameras. COVA automatically improves the accuracy of lightweight models through their specialization. Moreover, we discuss and review each step involved in the process to understand the different trade-offs that each one entails. Additionally, we show how the sole assumption of static cameras allows us to make a series of considerations that greatly simplify the scope of the problem. Finally, experiments show that state-of-the-art models, i.e., able to generalize to unseen environments, can be effectively used as teachers to tailor smaller networks to a specific context, boosting accuracy at a constant computational cost. Results show that our COVA can automatically improve accuracy of pre-trained models by an average of 21%.
△ Less
Submitted 13 December, 2021; v1 submitted 14 April, 2021;
originally announced April 2021.
-
cuConv: A CUDA Implementation of Convolution for CNN Inference
Authors:
Marc Jordà,
Pedro Valero-Lara,
Antonio J. Peña
Abstract:
Convolutions are the core operation of deep learning applications based on Convolutional Neural Networks (CNNs). Current GPU architectures are highly efficient for training and deploying deep CNNs, and hence, these are largely used in production for this purpose. State-of-the-art implementations, however, present a lack of efficiency for some commonly used network configurations.
In this paper w…
▽ More
Convolutions are the core operation of deep learning applications based on Convolutional Neural Networks (CNNs). Current GPU architectures are highly efficient for training and deploying deep CNNs, and hence, these are largely used in production for this purpose. State-of-the-art implementations, however, present a lack of efficiency for some commonly used network configurations.
In this paper we propose a GPU-based implementation of the convolution operation for CNN inference that favors coalesced accesses, without requiring prior data transformations. Our experiments demonstrate that our proposal yields notable performance improvements in a range of common CNN forward propagation convolution configurations, with speedups of up to 2.29x with respect to the best implementation of convolution in cuDNN, hence covering a relevant region in currently existing approaches.
△ Less
Submitted 30 March, 2021;
originally announced March 2021.
-
Enabling Homomorphically Encrypted Inference for Large DNN Models
Authors:
Guillermo Lloret-Talavera,
Marc Jorda,
Harald Servat,
Fabian Boemer,
Chetan Chauhan,
Shigeki Tomishima,
Nilesh N. Shah,
Antonio J. Peña
Abstract:
The proliferation of machine learning services in the last few years has raised data privacy concerns. Homomorphic encryption (HE) enables inference using encrypted data but it incurs 100x-10,000x memory and runtime overheads. Secure deep neural network (DNN) inference using HE is currently limited by computing and memory resources, with frameworks requiring hundreds of gigabytes of DRAM to evalua…
▽ More
The proliferation of machine learning services in the last few years has raised data privacy concerns. Homomorphic encryption (HE) enables inference using encrypted data but it incurs 100x-10,000x memory and runtime overheads. Secure deep neural network (DNN) inference using HE is currently limited by computing and memory resources, with frameworks requiring hundreds of gigabytes of DRAM to evaluate small models. To overcome these limitations, in this paper we explore the feasibility of leveraging hybrid memory systems comprised of DRAM and persistent memory. In particular, we explore the recently-released Intel Optane PMem technology and the Intel HE-Transformer nGraph to run large neural networks such as MobileNetV2 (in its largest variant) and ResNet-50 for the first time in the literature. We present an in-depth analysis of the efficiency of the executions with different hardware and software configurations. Our results conclude that DNN inference using HE incurs on friendly access patterns for this memory configuration, yielding efficient executions.
△ Less
Submitted 29 April, 2021; v1 submitted 30 March, 2021;
originally announced March 2021.
-
The phase diagram of a frustrated Heisenberg model: from disorder to order and back again
Authors:
Michel M. J. Miranda,
Igor C. Almeida,
Eric C. Andrade,
José A. Hoyos
Abstract:
We study the effects of bond and site disorder in the classical $J_{1}$-$J_{2}$ Heisenberg model on a square lattice in the order-by-disorder frustrated regime $2J_{2}>\left|J_{1}\right|$. Combining symmetry arguments, numerical energy minimization and large scale Monte Carlo simulations, we establish that the finite temperature Ising-like transition of the clean system is destroyed in the presenc…
▽ More
We study the effects of bond and site disorder in the classical $J_{1}$-$J_{2}$ Heisenberg model on a square lattice in the order-by-disorder frustrated regime $2J_{2}>\left|J_{1}\right|$. Combining symmetry arguments, numerical energy minimization and large scale Monte Carlo simulations, we establish that the finite temperature Ising-like transition of the clean system is destroyed in the presence of any finite concentration of impurities. We explain this finding via a random-field mechanism which generically emerges in systems where disorder locally breaks the same real-space symmetry spontaneously globally broken by the associated order parameter. We also determine that the phase replacing the clean one is a paramagnet polarized in the nematic glass order with non-trivial magnetic response. This is because disorder also induces non-collinear spin-vortex-crystal order and produces a conjugated transverse dipolar random field. As a result of these many competing effects, the associated magnetic susceptibilities are non-monotonic functions of the temperature. As a further application of our methods, we show the generation of random axes in other frustrated magnets with broken SU(2) symmetry. We also discuss the generality of our findings and their relevance to experiments.
△ Less
Submitted 19 August, 2021; v1 submitted 25 March, 2021;
originally announced March 2021.
-
Evidence for differentiation of the most primitive small bodies
Authors:
B. Carry,
P. Vernazza,
F. Vachier,
M. Neveu,
J. Berthier J. Hanus,
M. Ferrais,
L. Jorda,
M. Marsset,
M. Viikinkoski,
P. Bartczak,
R. Behrend,
Z. Benkhaldoun,
M. Birlan,
J. Castillo-Rogez,
F. Cipriani,
F. Colas,
A. Drouard,
G. P. Dudzinski,
J. Desmars,
C. Dumas,
J. Durech,
R. Fetick,
T. Fusco,
J. Grice,
E. Jehin
, et al. (18 additional authors not shown)
Abstract:
Dynamical models of Solar System evolution have suggested that P-/D-type volatile-rich asteroids formed in the outer Solar System and may be genetically related to the Jupiter Trojans, the comets and small KBOs. Indeed, their spectral properties resemble that of anhydrous cometary dust. High-angular-resolution images of P-type asteroid (87) Sylvia with VLT/SPHERE were used to reconstruct its 3D sh…
▽ More
Dynamical models of Solar System evolution have suggested that P-/D-type volatile-rich asteroids formed in the outer Solar System and may be genetically related to the Jupiter Trojans, the comets and small KBOs. Indeed, their spectral properties resemble that of anhydrous cometary dust. High-angular-resolution images of P-type asteroid (87) Sylvia with VLT/SPHERE were used to reconstruct its 3D shape, and to study the dynamics of its two satellites. We also model Sylvia's thermal evolution. The shape of Sylvia appears flattened and elongated. We derive a volume-equivalent diameter of 271 +/- 5 km, and a low density of 1378 +/- 45 kg.m-3. The two satellites orbit Sylvia on circular, equatorial orbits. The oblateness of Sylvia should imply a detectable nodal precession which contrasts with the fully-Keplerian dynamics of the satellites. This reveals an inhomogeneous internal structure, suggesting that Sylvia is differentiated. Sylvia's low density and differentiated interior can be explained by partial melting and mass redistribution through water percolation. The outer shell would be composed of material similar to interplanetary dust particles (IDPs) and the core similar to aqueously altered IDPs or carbonaceous chondrite meteorites such as the Tagish Lake meteorite. Numerical simulations of the thermal evolution of Sylvia show that for a body of such size, partial melting was unavoidable due to the decay of long-lived radionuclides. In addition, we show that bodies as small as 130-150 km in diameter should have followed a similar thermal evolution, while smaller objects, such as comets and the KBO Arrokoth, must have remained pristine, in agreement with in situ observations of these bodies. NASA Lucy mission target (617) Patroclus (diameter~140 km) may, however, be differentiated.
△ Less
Submitted 10 March, 2021;
originally announced March 2021.