-
Comment on "Storage properties of a quantum perceptron"
Authors:
Mauro Pastore
Abstract:
The recent paper "Storage properties of a quantum perceptron" [Phys. Rev. E 110, 024127] considers a quadratic constraint satisfaction problem, motivated by a quantum version of the perceptron. In particular, it derives its critical capacity, the density of constraints at which there is a satisfiability transition. The same problem was considered before in another context (classification of geometrically structured inputs, see [Phys. Rev. Lett. 125, 120601; Phys. Rev. E 102, 032119; J. Stat. Mech. (2021) 113301]), but the results on the critical capacity drastically differ. In this note, I substantiate the claim that the derivation performed in the quantum scenario has issues when inspected closely, I report a more principled way to perform it and I evaluate the critical capacity of an alternative constraint satisfaction problem that I consider more relevant for the quantum perceptron rule proposed by the article in question.
Submitted 23 June, 2025;
originally announced June 2025.
-
Statistical mechanics of extensive-width Bayesian neural networks near interpolation
Authors:
Jean Barbier,
Francesco Camilli,
Minh-Toan Nguyen,
Mauro Pastore,
Rudy Skerk
Abstract:
For three decades statistical mechanics has been providing a framework to analyse neural networks. However, the theoretically tractable models, e.g., perceptrons, random features models and kernel machines, or multi-index models and committee machines with few neurons, remained simple compared to those used in applications. In this paper we help reduce the gap between practical networks and their theoretical understanding through a statistical physics analysis of the supervised learning of a two-layer fully connected network with generic weight distribution and activation function, whose hidden layer is large but remains proportional to the input dimension. This makes it more realistic than infinitely wide networks where no feature learning occurs, but also more expressive than narrow ones or those with fixed inner weights. We focus on the Bayes-optimal learning in the teacher-student scenario, i.e., with a dataset generated by another network with the same architecture. We operate around interpolation, where the number of trainable parameters and of data are comparable and feature learning emerges. Our analysis uncovers a rich phenomenology with various learning transitions as the number of data increases. In particular, the more strongly the features (i.e., hidden neurons of the target) contribute to the observed responses, the less data is needed to learn them. Moreover, when the data is scarce, the model only learns non-linear combinations of the teacher weights, rather than "specialising" by aligning its weights with the teacher's. Specialisation occurs only when enough data becomes available, but it can be hard to find for practical training algorithms, possibly due to statistical-to-computational gaps.
Submitted 30 May, 2025;
originally announced May 2025.
-
Interfacial Behavior from the Atomic Blueprint: Machine Learning-Guided Design of Spatially Functionalized a-SiO2 Surfaces
Authors:
Evgenii Strugovshchikov,
Viktor Mandrolko,
Dominika Lesnicki,
Mariachiara Pastore,
Laurent Chaput,
Mykola Isaiev
Abstract:
Precise control over surface chemistry is essential for tuning interfacial behavior in technologies ranging from catalysis and protective coatings to energy conversion systems. Although chemical functionalization of alpha-quartz (alpha-SiO2) with hydroxyl (OH) and methyl (CH3) groups has been extensively studied, the impact of their spatial distribution at the atomic scale remains largely uncharted. In this work, we integrate density functional theory (DFT), ab initio molecular dynamics (AIMD), and on-the-fly machine-learned force fields (MLFFs) to systematically investigate how different arrangements of OH/CH3 groups modulate surface properties. Our results reveal that spatial patterning governs the formation of hydrogen-bonding networks, alters vibrational signatures, and has a significant influence on the thermodynamic stability of the functionalized surfaces. The MLFF framework enables high-fidelity simulations at unprecedented scales, bridging the gap between quantum accuracy and statistical sampling. By uncovering structure-property relationships inaccessible to conventional approaches, this study establishes spatial arrangement of functionalized groups as a critical and tunable design axis, paving the way for the predictive engineering of silica-based materials with optimized interfacial performance.
Submitted 29 April, 2025;
originally announced April 2025.
-
Optimal generalisation and learning transition in extensive-width shallow neural networks near interpolation
Authors:
Jean Barbier,
Francesco Camilli,
Minh-Toan Nguyen,
Mauro Pastore,
Rudy Skerk
Abstract:
We consider a teacher-student model of supervised learning with a fully-trained two-layer neural network whose width $k$ and input dimension $d$ are large and proportional. We provide an effective theory for approximating the Bayes-optimal generalisation error of the network for any activation function in the regime of sample size $n$ scaling quadratically with the input dimension, i.e., around the interpolation threshold where the number of trainable parameters $kd+k$ and of data $n$ are comparable. Our analysis tackles generic weight distributions. We uncover a discontinuous phase transition separating a "universal" phase from a "specialisation" phase. In the first, the generalisation error is independent of the weight distribution and decays slowly with the sampling rate $n/d^2$, with the student learning only some non-linear combinations of the teacher weights. In the latter, the error is weight distribution-dependent and decays faster due to the alignment of the student towards the teacher network. We thus unveil the existence of a highly predictive solution near interpolation, which is however potentially hard to find by practical algorithms.
Submitted 1 April, 2025; v1 submitted 30 January, 2025;
originally announced January 2025.
-
Feature learning in finite-width Bayesian deep linear networks with multiple outputs and convolutional layers
Authors:
Federico Bassetti,
Marco Gherardi,
Alessandro Ingrosso,
Mauro Pastore,
Pietro Rotondo
Abstract:
Deep linear networks have been extensively studied, as they provide simplified models of deep learning. However, little is known in the case of finite-width architectures with multiple outputs and convolutional layers. In this manuscript, we provide rigorous results for the statistics of functions implemented by the aforementioned class of networks, thus moving closer to a complete characterization of feature learning in the Bayesian setting. Our results include: (i) an exact and elementary non-asymptotic integral representation for the joint prior distribution over the outputs, given in terms of a mixture of Gaussians; (ii) an analytical formula for the posterior distribution in the case of squared error loss function (Gaussian likelihood); (iii) a quantitative description of the feature learning infinite-width regime, using large deviation theory. From a physical perspective, deep architectures with multiple outputs or convolutional layers represent different manifestations of kernel shape renormalization, and our work provides a dictionary that translates this physics intuition and terminology into rigorous Bayesian statistics.
Submitted 16 June, 2025; v1 submitted 5 June, 2024;
originally announced June 2024.
-
Restoring balance: principled under/oversampling of data for optimal classification
Authors:
Emanuele Loffredo,
Mauro Pastore,
Simona Cocco,
Rémi Monasson
Abstract:
Class imbalance in real-world data poses a common bottleneck for machine learning tasks, since achieving good generalization on under-represented examples is often challenging. Mitigation strategies, such as under or oversampling the data depending on their abundances, are routinely proposed and tested empirically, but how they should adapt to the data statistics remains poorly understood. In this work, we determine exact analytical expressions of the generalization curves in the high-dimensional regime for linear classifiers (Support Vector Machines). We also provide a sharp prediction of the effects of under/oversampling strategies depending on class imbalance, first and second moments of the data, and the metrics of performance considered. We show that mixed strategies involving under and oversampling of data lead to performance improvement. Through numerical experiments, we show the relevance of our theoretical predictions on real datasets, on deeper architectures and with sampling strategies based on unsupervised probabilistic models.
Submitted 31 January, 2025; v1 submitted 15 May, 2024;
originally announced May 2024.
-
Random features and polynomial rules
Authors:
Fabián Aguirre-López,
Silvio Franz,
Mauro Pastore
Abstract:
Random features models play a distinguished role in the theory of deep learning, describing the behavior of neural networks close to their infinite-width limit. In this work, we present a thorough analysis of the generalization performance of random features models for generic supervised learning problems with Gaussian data. Our approach, built with tools from the statistical mechanics of disordered systems, maps the random features model to an equivalent polynomial model, and allows us to plot average generalization curves as functions of the two main control parameters of the problem: the number of random features $N$ and the size $P$ of the training set, both assumed to scale as powers of the input dimension $D$. Our results extend the case of proportional scaling between $N$, $P$ and $D$. They are in accordance with rigorous bounds known for certain particular learning tasks and are in quantitative agreement with numerical experiments performed over many orders of magnitude of $N$ and $P$. We find good agreement also far from the asymptotic limits where $D\to \infty$ and at least one of $P/D^K$, $N/D^L$ remains finite.
Submitted 31 January, 2025; v1 submitted 15 February, 2024;
originally announced February 2024.
-
Structure of Working Memory in Children From 3 to 8 Years Old
Authors:
Barbara Carretti,
David Giofre,
Enrico Toffalini,
Cesare Cornoldi,
Massimiliano Pastore,
Silvia Lanfranchi
Abstract:
Several models of working memory (WM) have been proposed in the literature. Most of the research on the architecture of WM is based on adults or older children, but less is known about younger children. In this study, we tested various models of WM on a sample of 739 Italian children from 3 to 8 years old. Participants were assessed with 12 WM tasks, systematically varying the modality and level of executive control required (based on the number of activities to be performed at once: retention alone, ignoring distractors, and dealing with dual tasks). We examined younger children, n = 501, Mage = 56.8 months (SD = 6.4, 48% males) and older children, n = 238, Mage = 80.0 months (SD = 9.0, 58% males) separately using multigroup confirmatory factor analyses. A Bayesian analytical approach was adopted. Our results suggested that a four-factor model distinguishing between verbal, visual, spatial-simultaneous, and spatial-sequential components of WM achieved the best fit. Overall, the WM structure was very similar in the two groups. We further explored this result with an additional model with a central executive factor loaded on high-control tasks only, and found evidence for the presence of an executive control component. The contribution of this factor in terms of explained variance was only modest, however. Our findings demonstrate that it is important to distinguish between WM components in young children.
Submitted 12 October, 2022;
originally announced October 2022.
-
Post-selection Inference in Multiverse Analysis (PIMA): an inferential framework based on the sign flipping score test
Authors:
Paolo Girardi,
Anna Vesely,
Daniël Lakens,
Gianmarco Altoè,
Massimiliano Pastore,
Antonio Calcagnì,
Livio Finos
Abstract:
When analyzing data, researchers make some decisions that are either arbitrary, based on subjective beliefs about the data generating process, or for which equally justifiable alternative choices could have been made. This wide range of data-analytic choices can be abused, and has been one of the underlying causes of the replication crisis in several fields. Recently, the introduction of multiverse analysis has provided researchers with a method to evaluate the stability of the results across reasonable choices that could be made when analyzing data. Multiverse analysis is confined to a descriptive role, lacking a proper and comprehensive inferential procedure. More recently, specification curve analysis has added an inferential procedure to multiverse analysis, but this approach is limited to simple cases related to the linear model, and only allows researchers to infer whether at least one specification rejects the null hypothesis, but not which specifications should be selected. In this paper we present a Post-selection Inference approach to Multiverse Analysis (PIMA), which is a flexible and general inferential approach that accounts for all possible models, i.e., the multiverse of reasonable analyses. The approach allows for a wide range of data specifications (i.e., pre-processing) and any generalized linear model; it allows testing the null hypothesis of a given predictor not being associated with the outcome, by merging information from all reasonable models of multiverse analysis, and provides strong control of the family-wise error rate such that it allows researchers to claim that the null hypothesis can be rejected for each specification that shows a significant effect. The inferential proposal is based on a conditional resampling procedure. To be continued...
Submitted 3 October, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
A statistical mechanics framework for Bayesian deep neural networks beyond the infinite-width limit
Authors:
R. Pacelli,
S. Ariosto,
M. Pastore,
F. Ginelli,
M. Gherardi,
P. Rotondo
Abstract:
Despite the practical success of deep neural networks, a comprehensive theoretical framework that can predict practically relevant scores, such as the test accuracy, from knowledge of the training data is currently lacking. Huge simplifications arise in the infinite-width limit, where the number of units $N_\ell$ in each hidden layer ($\ell=1,\dots, L$, with $L$ the depth of the network) far exceeds the number $P$ of training examples. This idealisation, however, blatantly departs from the reality of deep learning practice. Here, we use the toolset of statistical mechanics to overcome these limitations and derive an approximate partition function for fully-connected deep neural architectures, which encodes information about the trained models. The computation holds in the "thermodynamic limit" where both $N_\ell$ and $P$ are large and their ratio $\alpha_\ell = P/N_\ell$ is finite. This advance allows us to obtain (i) a closed formula for the generalisation error associated with a regression task in a one-hidden layer network with finite $\alpha_1$; (ii) an approximate expression of the partition function for deep architectures (via an "effective action" that depends on a finite number of "order parameters"); (iii) a link between deep neural networks in the proportional asymptotic limit and Student's $t$ processes.
Submitted 9 December, 2023; v1 submitted 11 September, 2022;
originally announced September 2022.
-
Satisfiability transition in asymmetric neural networks
Authors:
Fabián Aguirre-López,
Mauro Pastore,
Silvio Franz
Abstract:
Asymmetry in the synaptic interactions between neurons plays a crucial role in determining the memory storage and retrieval properties of recurrent neural networks. In this work, we analyze the problem of storing random memories in a network of neurons connected by a synaptic matrix with a definite degree of asymmetry. We study the corresponding satisfiability and clustering transitions in the space of solutions of the constraint satisfaction problem associated with finding synaptic matrices given the memories. We find, besides the usual SAT/UNSAT transition at a critical number of memories to store in the network, an additional transition for very asymmetric matrices, where the competing constraints (definite asymmetry vs. memory storage) induce enough frustration in the problem to make it impossible to solve. This finding is particularly striking in the case of a single memory to store, where no quenched disorder is present in the system.
Submitted 10 October, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
Critical properties of the SAT/UNSAT transitions in the classification problem of structured data
Authors:
Mauro Pastore
Abstract:
The classification problem of structured data can be solved with different strategies: a supervised learning approach, starting from a labeled training set, and an unsupervised learning one, where only the structure of the patterns in the dataset is used to find a classification compatible with it. The two strategies can be interpreted as extreme cases of a semi-supervised approach to learn multi-view data, relevant for applications. In this paper I study the critical properties of the two storage problems associated with these tasks, in the case of the linear binary classification of doublets of points sharing the same label, within replica theory. While the first approach presents a SAT/UNSAT transition in a (marginally) stable replica-symmetric phase, in the second one the satisfiability line lies in a full replica-symmetry-broken phase. A similar behavior in the problem of learning with a margin is also pointed out.
Submitted 8 November, 2021; v1 submitted 17 September, 2021;
originally announced September 2021.
-
Self-induced glassy phase in multimodal cavity quantum electrodynamics
Authors:
Vittorio Erba,
Mauro Pastore,
Pietro Rotondo
Abstract:
We provide strong evidence that the effective spin-spin interaction in a multimodal confocal optical cavity gives rise to a self-induced glassy phase, which emerges exclusively from the peculiar Euclidean correlations and is not related to the presence of disorder as in standard spin glasses. As recently shown, this spin-spin effective interaction is both non-local and non-translationally invariant, and randomness in the atom positions produces a spin glass phase. Here we consider the simplest feasible disorder-free setting where atoms form a one-dimensional regular chain and we study the thermodynamics of the resulting effective Ising model. We present extensive results showing that the system has a low-temperature glassy phase. Notably, for rational values of the only free dimensionless parameter $\alpha=p/q$ of the interaction, the number of metastable states at low temperature grows exponentially with $q$ and the problem of finding the ground state rapidly becomes computationally intractable, suggesting that the system develops high energy barriers and ergodicity breaking occurs.
Submitted 11 January, 2021;
originally announced January 2021.
-
Statistical learning theory of structured data
Authors:
Mauro Pastore,
Pietro Rotondo,
Vittorio Erba,
Marco Gherardi
Abstract:
The traditional approach of statistical physics to supervised learning routinely assumes unrealistic generative models for the data: usually inputs are independent random variables, uncorrelated with their labels. Only recently, statistical physicists started to explore more complex forms of data, such as equally-labelled points lying on (possibly low dimensional) object manifolds. Here we provide a bridge between this recently-established research area and the framework of statistical learning theory, a branch of mathematics devoted to inference in machine learning. The overarching motivation is the inadequacy of the classic rigorous results in explaining the remarkable generalization properties of deep learning. We propose a way to integrate physical models of data into statistical learning theory, and address, with both combinatorial and statistical mechanics methods, the computation of the Vapnik-Chervonenkis entropy, which counts the number of different binary classifications compatible with the loss class. As a proof of concept, we focus on kernel machines and on two simple realizations of data structure introduced in recent physics literature: $k$-dimensional simplexes with prescribed geometric relations and spherical manifolds (equivalent to margin classification). Entropy, contrary to what happens for unstructured data, is nonmonotonic in the sample size, in contrast with the rigorous bounds. Moreover, data structure induces a novel transition beyond the storage capacity, which we advocate as a proxy of the nonmonotonicity, and ultimately a cue of low generalization error. The identification of a synaptic volume vanishing at the transition allows a quantification of the impact of data structure within replica theory, applicable in cases where combinatorial methods are not available, as we demonstrate for margin learning.
Submitted 20 May, 2020;
originally announced May 2020.
-
Beyond the storage capacity: data driven satisfiability transition
Authors:
Pietro Rotondo,
Mauro Pastore,
Marco Gherardi
Abstract:
Data structure has a dramatic impact on the properties of neural networks, yet its significance in the established theoretical frameworks is poorly understood. Here we compute the Vapnik-Chervonenkis entropy of a kernel machine operating on data grouped into equally labelled subsets. At variance with the unstructured scenario, entropy is non-monotonic in the size of the training set, and displays an additional critical point besides the storage capacity. Remarkably, the same behavior occurs in margin classifiers even with randomly labelled data, as is elucidated by identifying the synaptic volume encoding the transition. These findings reveal aspects of expressivity lying beyond the condensed description provided by the storage capacity, and they indicate the path towards more realistic bounds for the generalization error of neural networks.
Submitted 20 May, 2020;
originally announced May 2020.
-
Enhancing statistical inference in psychological research via prospective and retrospective design analysis
Authors:
Gianmarco Altoè,
Giulia Bertoldo,
Claudio Zandonella Callegher,
Enrico Toffalini,
Antonio Calcagnì,
Livio Finos,
Massimiliano Pastore
Abstract:
In the past two decades, psychological science has experienced an unprecedented replicability crisis which uncovered several issues. Among others, statistical inference is too often viewed as an isolated procedure limited to the analysis of data that have already been collected. We build on and further develop an idea proposed by Gelman and Carlin (2014) termed "prospective and retrospective design analysis". Rather than focusing only on the statistical significance of a result and on the classical control of type I and type II errors, a comprehensive design analysis involves reasoning about what can be considered a plausible effect size. Furthermore, it introduces two relevant inferential risks: the exaggeration ratio or Type M error (i.e., the predictable average overestimation of an effect that emerges as statistically significant), and the sign error or Type S error (i.e., the risk that a statistically significant effect is estimated in the wrong direction). Another important aspect of design analysis is that it can be usefully carried out both in the planning phase of a study and for the evaluation of studies that have already been conducted, thus increasing researchers' awareness during all phases of a research project. We use a familiar example in psychology where the researcher is interested in analyzing the differences between two independent groups. We examine the case in which the plausible effect size is formalized as a single value, and propose a method in which uncertainty concerning the magnitude of the effect is formalized via probability distributions. Through several examples, we show that even though a design analysis requires considerable effort, it has the potential to contribute to planning more robust and replicable studies. Finally, future developments in the Bayesian framework are discussed.
Submitted 30 September, 2019;
originally announced September 2019.
-
Large deviations of the free energy in the p-spin glass spherical model
Authors:
Mauro Pastore,
Andrea Di Gioacchino,
Pietro Rotondo
Abstract:
We investigate the behavior of the rare fluctuations of the free energy in the p-spin spherical model, evaluating the corresponding rate function via the Gärtner-Ellis theorem. This approach requires the knowledge of the analytic continuation of the disorder-averaged replicated partition function to arbitrary real number of replicas. In zero external magnetic field, we show via a one-step replica symmetry breaking (1RSB) calculation that the rate function is infinite for fluctuations of the free energy above its typical value, corresponding to an anomalous, super-extensive suppression of rare fluctuations. We extend this calculation to non-zero magnetic field, showing that in this case this very large deviation disappears and we try to motivate this finding in light of a geometrical interpretation of the scaled cumulant generating function.
Submitted 25 October, 2019; v1 submitted 13 September, 2019;
originally announced September 2019.
-
Impact of the fac/mer isomerism on the excited states dynamics of pyridyl-carbene Fe(II) complexes
Authors:
Kevin Magra,
Edoardo Domenichini,
Antonio Francés-Monerris,
Cristina Cebrian,
Marc Beley,
Mohamed Darari,
Mariachiara Pastore,
Antonio Monari,
Xavier Assfeld,
Stefan Haacke,
Philippe C. Gros
Abstract:
The control of the photophysical properties of iron complexes, and especially of their excited-state decay, is a great challenge in the search for sustainable alternatives to noble metals in photochemical applications. Herein we report the synthesis and investigation of the photophysics of mer and fac iron complexes bearing bidentate pyridyl-NHC ligands, coordinating the Fe with three ligand-field enhancing carbene bonds. Ultrafast transient absorption spectroscopy reveals two distinct excited-state populations for both mer and fac forms, ascribed to the populations of the T1 and the T2 states, respectively, which decay to the ground state via parallel pathways. We find 3-4 ps and 15-20 ps excited-state lifetimes, with respective amplitudes depending on the isomer. The longer lifetime exceeds the one reported for iron complexes with tridentate ligand analogues involving four iron-carbene bonds. By combining experimental and computational results, a mechanism based on the differential trapping of the triplet states in spin-crossover regions is proposed for the first time to explain the impact of the fac/mer isomerism on the overall excited-state lifetimes. Our results clearly highlight the impact of bidentate pyridyl-NHC ligands on the photophysics of iron complexes, especially the paramount role of fac/mer isomerism in modulating the overall decay process, which can potentially be exploited in the design of new Fe(II)-based photoactive compounds.
Submitted 21 May, 2019;
originally announced May 2019.
-
ssMousetrack: Analysing computerized tracking data via Bayesian state-space models in R
Authors:
Antonio Calcagnì,
Massimiliano Pastore,
Gianmarco Altoè
Abstract:
Recent technological advances have provided new settings to enhance individual-based data collection, and computerized-tracking data have become common in much behavioral and social research. By adopting instantaneous tracking devices such as computer mice, Wii controllers, and joysticks, such data provide new insights for analysing the dynamic unfolding of the response process. ssMousetrack is an R package for modeling and analysing computerized-tracking data by means of a Bayesian state-space approach. The package provides a set of functions to prepare data, fit the model, and assess results via simple diagnostic checks. This paper describes the package and illustrates how it can be used to model and analyse computerized-tracking data. A case study is also included to show the use of the package in empirical settings.
Submitted 23 April, 2019;
originally announced April 2019.
-
A Maximum Entropy Procedure to Solve Likelihood Equations
Authors:
Antonio Calcagnì,
Livio Finos,
Gianmarco Altoè,
Massimiliano Pastore
Abstract:
In this article we provide initial findings regarding the problem of solving likelihood equations by means of a maximum entropy approach. Unlike standard procedures that require equating the score function of the maximum-likelihood problem to zero, we propose an alternative strategy where the score is instead used as an external informative constraint to the maximization of the convex Shannon's entropy function. The problem involves the re-parameterization of the score parameters as expected values of discrete probability distributions where probabilities need to be estimated. This leads to a simpler situation where parameters are searched in a smaller (hyper) simplex space. We assessed our proposal by means of empirical case studies and a simulation study, the latter involving the most critical case of logistic regression under data separation. The results suggested that the maximum entropy re-formulation of the score problem solves the likelihood equation problem. Similarly, when maximum-likelihood estimation is difficult, as in the case of logistic regression under separation, the maximum entropy proposal achieved results (numerically) comparable to those obtained by Firth's bias-corrected approach. Overall, these first findings reveal that a maximum entropy solution can be considered as an alternative technique to solve the likelihood equation.
Submitted 13 June, 2019; v1 submitted 22 April, 2019;
originally announced April 2019.
-
Charge separation: From the topology of molecular electronic transitions to the dye/semiconductor interfacial energetics and kinetics
Authors:
Thibaud Etienne,
Mariachiara Pastore
Abstract:
Charge separation properties, that is, the ability of a chromophore, or a chromophore/semiconductor interface, to separate charges upon light absorption, are crucial characteristics for an efficient photovoltaic device. Starting from this concept, we devote the first part of this book chapter to the topological analysis of molecular electronic transitions induced by photon capture. Such analysis can be either qualitative or quantitative, and is presented here in the framework of the reduced density matrix theory applied to single-reference, multiconfigurational excited states. The qualitative strategies are separated into density-based and wave function-based approaches, while the quantitative methods reported here for analysing the nature of the photoinduced charge transfer are either fragment-based, global or statistical. In the second part of this chapter we extend the analysis to dye-sensitized metal oxide surface models, discussing interfacial charge separation, energetics and electron injection kinetics from the dye excited state to the semiconductor conduction band states.
Submitted 4 December, 2019; v1 submitted 26 November, 2018;
originally announced November 2018.
-
Lattice QCD$_2$ effective action with Bogoliubov transformations
Authors:
Sergio Caracciolo,
Mauro Pastore
Abstract:
In Wilson's lattice formulation of QCD, a fermionic Fock space of states can be explicitly built at each time slice using canonical creation and annihilation operators. The partition function $Z$ is then represented as the trace of the transfer matrix, and its usual functional representation as a path integral of $\exp(- S)$ can be recovered in a standard way. However, applying a Bogoliubov transformation to the canonical operators before passing to the functional formalism, we can isolate a vacuum contribution in the resulting action which depends only on the parameters of the transformation and fixes them via a variational principle. Then, inserting in the trace defining $Z$ an operator projecting onto the meson subspace at each time slice and making the physical assumption that the true partition function is well approximated by the projected one, we can also write an effective quadratic action for mesons. We tested the method in the renowned 't Hooft model, namely QCD in two spacetime dimensions for a large number of colours, in Coulomb gauge.
Submitted 19 November, 2018;
originally announced November 2018.
-
Effective mesonic theory for the 't Hooft model on the lattice
Authors:
Sergio Caracciolo,
Mauro Pastore
Abstract:
We apply to a lattice version of the 't Hooft model, QCD in two space-time dimensions for a large number of colours, a method recently proposed to obtain an effective mesonic action starting from the fundamental, fermionic one. The idea is to pass from a canonical, operatorial representation, where the low-energy states have a direct physical interpretation in terms of a Bogoliubov vacuum and its corresponding quasiparticle excitations, to a functional, path integral representation, via the formalism of the transfer matrix. In this way we obtain a lattice effective theory for mesons in a self-consistent setting. We also verify that well-known results from other approaches are reproduced in the continuum limit.
Submitted 16 November, 2018;
originally announced November 2018.
-
Remarks on replica diagonal collective field condensations in SYK
Authors:
Sergio Caracciolo,
Matteo A. Cardella,
Mauro Pastore
Abstract:
In the Sachdev-Ye-Kitaev model with generic order $q \ge 4$ random couplings, we compute the critical temperature relating the Majorana fermions' high-temperature perturbative vacuum to the vacuum where the replica diagonal collective field $G(τ, τ')$ condenses. We study, by a finite temperature diagrammatic analysis, the effective action of an auxiliary Hubbard-Stratonovich bilocal field related to $G(τ, τ')$ in the large $N$ limit. Subtleties that arise in switching from the operatorial to the functional integral representation of the SYK thermal partition function are also discussed.
Submitted 22 November, 2018; v1 submitted 26 July, 2018;
originally announced July 2018.
-
Spectral geometry of Riemannian Legendre foliations
Authors:
Gabriel Baditoiu,
Stere Ianus,
Anna Maria Pastore
Abstract:
We obtain geometric characterizations of isospectral minimal Riemannian Legendre foliations on compact Sasakian manifolds of constant $φ$-sectional curvature.
Submitted 31 January, 2012; v1 submitted 16 September, 2010;
originally announced September 2010.
-
Lightlike hypersurfaces in indefinite $\mathcal{S}$-manifolds
Authors:
Letizia Brunetti,
Anna Maria Pastore
Abstract:
In a metric $g.f.f$-manifold we study lightlike hypersurfaces $M$ tangent to the characteristic vector fields, and owing to the presence of the $f$-structure, we determine some decompositions of $TM$ and of a chosen screen distribution, obtaining two distributions invariant with respect to the structure. We discuss the existence of a $g.f.f$-structure on a lightlike hypersurface and, under suitable hypotheses, we obtain an indefinite $\mathcal{S}$-structure on the leaves of an integrable distribution. The existence of totally umbilical lightlike hypersurfaces of an indefinite $\mathcal{S}$-space form is also discussed. Finally, we explicitly describe a lightlike hypersurface of an indefinite $\mathcal{S}$-manifold.
Submitted 27 March, 2008;
originally announced March 2008.
-
Mixed metric 3-contact manifolds and paraquaternionic Kähler manifolds
Authors:
Angelo V. Caldarella,
Anna Maria Pastore
Abstract:
We study manifolds endowed with mixed metric 3-contact structures, proving that the distribution spanned by the Reeb vector fields is integrable, with totally geodesic integral manifolds, of constant sectional curvature $k=\pm1$. We also prove a result of projectability of such structures onto paraquaternionic Kählerian structures.
Submitted 7 June, 2008; v1 submitted 20 March, 2008;
originally announced March 2008.
-
Mixed 3-Sasakian structures and curvature
Authors:
Angelo V. Caldarella,
Anna Maria Pastore
Abstract:
In this paper we deal with two classes of mixed metric 3-structures, namely the mixed 3-Sasakian structures and the mixed metric 3-contact structures. Firstly we study some properties of the curvature of mixed 3-Sasakian structures, proving that any manifold endowed with such a structure is Einstein. Then we prove the identity between the class of mixed 3-Sasakian structures and the class of mixed metric 3-contact structures.
Submitted 13 March, 2008;
originally announced March 2008.
-
Curvature of a class of indefinite globally framed $f$-manifolds
Authors:
Letizia Brunetti,
Anna Maria Pastore
Abstract:
We present a comparative analysis of some properties of indefinite almost $\mathcal{S}$-manifolds and indefinite $\mathcal{S}$-manifolds. We give some characterizations in terms of the Levi-Civita connection and of the characteristic vector fields. We study the sectional and $φ$-sectional curvature of indefinite almost $\mathcal{S}$-manifolds and state an expression of the curvature tensor field for the indefinite $\mathcal{S}$-space forms. We analyse the sectional curvature of indefinite $\mathcal{S}$-manifolds in which the number of the spacelike characteristic vector fields is equal to that of the timelike characteristic vector fields. Some examples are also described.
Submitted 27 March, 2008; v1 submitted 4 March, 2008;
originally announced March 2008.