-
How Malicious AI Swarms Can Threaten Democracy
Authors:
Daniel Thilo Schroeder,
Meeyoung Cha,
Andrea Baronchelli,
Nick Bostrom,
Nicholas A. Christakis,
David Garcia,
Amit Goldenberg,
Yara Kyrychenko,
Kevin Leyton-Brown,
Nina Lutz,
Gary Marcus,
Filippo Menczer,
Gordon Pennycook,
David G. Rand,
Frank Schweitzer,
Christopher Summerfield,
Audrey Tang,
Jay Van Bavel,
Sander van der Linden,
Dawn Song,
Jonas R. Kunst
Abstract:
Advances in AI portend a new era of sophisticated disinformation operations. While individual AI systems already create convincing -- and at times misleading -- information, an imminent development is the emergence of malicious AI swarms. These systems can coordinate covertly, infiltrate communities, evade traditional detectors, and run continuous A/B tests, with round-the-clock persistence. The r…
▽ More
Advances in AI portend a new era of sophisticated disinformation operations. While individual AI systems already create convincing -- and at times misleading -- information, an imminent development is the emergence of malicious AI swarms. These systems can coordinate covertly, infiltrate communities, evade traditional detectors, and run continuous A/B tests, with round-the-clock persistence. The result can include fabricated grassroots consensus, fragmented shared reality, mass harassment, voter micro-suppression or mobilization, contamination of AI training data, and erosion of institutional trust. With democratic processes worldwide increasingly vulnerable, we urge a three-pronged response: (1) platform-side defenses -- always-on swarm-detection dashboards, pre-election high-fidelity swarm-simulation stress-tests, transparency audits, and optional client-side "AI shields" for users; (2) model-side safeguards -- standardized persuasion-risk tests, provenance-authenticating passkeys, and watermarking; and (3) system-level oversight -- a UN-backed AI Influence Observatory.
△ Less
Submitted 10 June, 2025; v1 submitted 18 May, 2025;
originally announced June 2025.
-
Temperature anisotropy instabilities of solar wind electrons with regularized Kappa-halos resolved with ALPS
Authors:
Dustin L. Schröder,
Horst Fichtner,
Marian Lazar,
Daniel Verscharen,
Kristopher G. Klein
Abstract:
Space plasmas in various astrophysical setups can often be both very hot and dilute, making them highly susceptible to waves and fluctuations, which are generally self-generated and maintained by kinetic instabilities. In this sense, we have in-situ observational evidence from the solar wind and planetary environments, which reveal not only wave fluctuations at kinetic scales of electrons and prot…
▽ More
Space plasmas in various astrophysical setups can often be both very hot and dilute, making them highly susceptible to waves and fluctuations, which are generally self-generated and maintained by kinetic instabilities. In this sense, we have in-situ observational evidence from the solar wind and planetary environments, which reveal not only wave fluctuations at kinetic scales of electrons and protons, but also non-equilibrium distributions of particle velocities. This paper reports on the progress made in achieving a consistent modeling of the instabilities generated by temperature anisotropy, taking concrete example of those induced by anisotropic electrons, such as, electromagnetic electron-cyclotron (whistler) and firehose instabilities. The effects of the two main electron populations, the quasi-thermal core and the suprathermal halo indicated by the observations, are thus captured. The low-energy core is bi-Maxwellian, and the halo is described for the first time by a regularized (bi-)$κ$-distribution (RKD), which was recently introduced to fix inconsistencies of standard $κ$-distributions (SKD). In the absence of a analytical RKD dispersion kinetic formalism (involving tedious and laborious derivations), both the dispersion and (in)stability properties are directly solved numerically using the numerical Arbitrary Linear Plasma Solver (ALPS). The results have an increased degree of confidence, considering the successful testing of the ALPS on previous results with established distributions.
△ Less
Submitted 22 April, 2025;
originally announced April 2025.
-
Pencils to Pixels: A Systematic Study of Creative Drawings across Children, Adults and AI
Authors:
Surabhi S Nath,
Guiomar del Cuvillo y Schröder,
Claire E. Stevenson
Abstract:
Can we derive computational metrics to quantify visual creativity in drawings across intelligent agents, while accounting for inherent differences in technical skill and style? To answer this, we curate a novel dataset consisting of 1338 drawings by children, adults and AI on a creative drawing task. We characterize two aspects of the drawings -- (1) style and (2) content. For style, we define mea…
▽ More
Can we derive computational metrics to quantify visual creativity in drawings across intelligent agents, while accounting for inherent differences in technical skill and style? To answer this, we curate a novel dataset consisting of 1338 drawings by children, adults and AI on a creative drawing task. We characterize two aspects of the drawings -- (1) style and (2) content. For style, we define measures of ink density, ink distribution and number of elements. For content, we use expert-annotated categories to study conceptual diversity, and image and text embeddings to compute distance measures. We compare the style, content and creativity of children, adults and AI drawings and build simple models to predict expert and automated creativity scores. We find significant differences in style and content in the groups -- children's drawings had more components, AI drawings had greater ink density, and adult drawings revealed maximum conceptual diversity. Notably, we highlight a misalignment between creativity judgments obtained through expert and automated ratings and discuss its implications. Through these efforts, our work provides, to the best of our knowledge, the first framework for studying human and artificial creativity beyond the textual modality, and attempts to arrive at the domain-agnostic principles underlying creativity. Our data and scripts are available on GitHub.
△ Less
Submitted 9 February, 2025;
originally announced February 2025.
-
Matrix Concentration Inequalities and Free Probability II. Two-sided Bounds and Applications
Authors:
Afonso S. Bandeira,
Giorgio Cipolloni,
Dominik Schröder,
Ramon van Handel
Abstract:
The first paper in this series introduced a new family of nonasymptotic matrix concentration inequalities that sharply capture the spectral properties of very general Gaussian (as well as non-Gaussian) random matrices in terms of an associated noncommutative model. These methods achieved matching upper and lower bounds for smooth spectral statistics, but only provided upper bounds for the spectral…
▽ More
The first paper in this series introduced a new family of nonasymptotic matrix concentration inequalities that sharply capture the spectral properties of very general Gaussian (as well as non-Gaussian) random matrices in terms of an associated noncommutative model. These methods achieved matching upper and lower bounds for smooth spectral statistics, but only provided upper bounds for the spectral edges. Here we obtain matching lower bounds for the spectral edges, completing the theory initiated in the first paper. The resulting two-sided bounds enable the study of applications that require an exact determination of the spectral edges to leading order, which is fundamentally beyond the reach of classical matrix concentration inequalities. To illustrate their utility, we undertake a detailed study of phase transition phenomena for spectral outliers of nonhomogeneous random matrices.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Asymptotics of Learning with Deep Structured (Random) Features
Authors:
Dominik Schröder,
Daniil Dmitriev,
Hugo Cui,
Bruno Loureiro
Abstract:
For a large class of feature maps we provide a tight asymptotic characterisation of the test error associated with learning the readout layer, in the high-dimensional limit where the input dimension, hidden layer widths, and number of training samples are proportionally large. This characterization is formulated in terms of the population covariance of the features. Our work is partially motivated…
▽ More
For a large class of feature maps we provide a tight asymptotic characterisation of the test error associated with learning the readout layer, in the high-dimensional limit where the input dimension, hidden layer widths, and number of training samples are proportionally large. This characterization is formulated in terms of the population covariance of the features. Our work is partially motivated by the problem of learning with Gaussian rainbow neural networks, namely deep non-linear fully-connected networks with random but structured weights, whose row-wise covariances are further allowed to depend on the weights of previous layers. For such networks we also derive a closed-form formula for the feature covariance in terms of the weight matrices. We further find that in some cases our results can capture feature maps learned by deep, finite-width neural networks trained under gradient descent.
△ Less
Submitted 10 June, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
Harmful Conspiracies in Temporal Interaction Networks: Understanding the Dynamics of Digital Wildfires through Phase Transitions
Authors:
Kaspara Skovli Gåsvær,
Pedro G. Lind,
Johannes Langguth,
Morten Hjorth-Jensen,
Michael Kreil,
Daniel Thilo Schroeder
Abstract:
Shortly after the first COVID-19 cases became apparent in December 2020, rumors spread on social media suggesting a connection between the virus and the 5G radiation emanating from the recently deployed telecommunications network. In the course of the following weeks, this idea gained increasing popularity, and various alleged explanations for how such a connection manifests emerged. Ultimately, a…
▽ More
Shortly after the first COVID-19 cases became apparent in December 2020, rumors spread on social media suggesting a connection between the virus and the 5G radiation emanating from the recently deployed telecommunications network. In the course of the following weeks, this idea gained increasing popularity, and various alleged explanations for how such a connection manifests emerged. Ultimately, after being amplified by prominent conspiracy theorists, a series of arson attacks on telecommunication equipment follows, concluding with the kidnapping of telecommunication technicians in Peru. In this paper, we study the spread of content related to a conspiracy theory with harmful consequences, a so-called digital wildfire. In particular, we investigate the 5G and COVID-19 misinformation event on Twitter before, during, and after its peak in April and May 2020. For this purpose, we examine the community dynamics in complex temporal interaction networks underlying Twitter user activity. We assess the evolution of such digital wildfires by appropriately defining the temporal dynamics of communication in communities within social networks. We show that, for this specific misinformation event, the number of interactions of the users participating in a digital wildfire, as well as the size of the engaged communities, both follow a power-law distribution. Moreover, our research elucidates the possibility of quantifying the phases of a digital wildfire, as per established literature. We identify one such phase as a critical transition, marked by a shift from sporadic tweets to a global spread event, highlighting the dramatic scaling of misinformation propagation.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Feasibility of Passive Sounding of Uranian Moons using Uranian Kilometric Radiation
Authors:
Andrew Romero-Wolf,
Gregor Steinbruegge,
Julie Castillo-Rogez,
Corey J. Cochrane,
Tom A. Nordheim,
Karl L. Mitchell,
Natalie S. Wolfenbarger,
Dustin M. Schroeder,
Sean T. Peters
Abstract:
We present a feasibility study for passive sounding of Uranian icy moons using Uranian Kilometric Radio (UKR) emissions in the 100 - 900 kHz band. We provide a summary description of the observation geometry, the UKR characteristics, and estimate the sensitivity for an instrument analogous to the Cassini Radio Plasma Wave Science (RPWS) but with a modified receiver digitizer and signal processing…
▽ More
We present a feasibility study for passive sounding of Uranian icy moons using Uranian Kilometric Radio (UKR) emissions in the 100 - 900 kHz band. We provide a summary description of the observation geometry, the UKR characteristics, and estimate the sensitivity for an instrument analogous to the Cassini Radio Plasma Wave Science (RPWS) but with a modified receiver digitizer and signal processing chain. We show that the concept has the potential to directly and unambiguously detect cold oceans within Uranian satellites and provide strong constraints on the interior structure in the presence of warm or no oceans. As part of a geophysical payload, the concept could therefore have a key role in the detection of oceans within the Uranian satellites. The main limitation of the concept is coherence losses attributed to the extended source size of the UKR and dependence on the illumination geometry. These factors represent constraints on the tour design of a future Uranus mission in terms of flyby altitudes and encounter timing.
△ Less
Submitted 5 May, 2023;
originally announced May 2023.
-
Social media in the Global South: A Network Dataset of the Malian Twittersphere
Authors:
Daniel Thilo Schroeder,
Mirjam de Bruijn,
Luca Bruls,
Mulatu Alemayehu Moges,
Samba Dialimpa Badji,
Noëmie Fritz,
Modibo Galy Cisse,
Johannes Langguth,
Bruce Mutsvairo,
Kristin Skare Orgeret
Abstract:
With the expansion of mobile communications infrastructure, social media usage in the Global South is surging. Compared to the Global North, populations of the Global South have had less prior experience with social media from stationary computers and wired Internet. Many countries are experiencing violent conflicts that have a profound effect on their societies. As a result, social networks devel…
▽ More
With the expansion of mobile communications infrastructure, social media usage in the Global South is surging. Compared to the Global North, populations of the Global South have had less prior experience with social media from stationary computers and wired Internet. Many countries are experiencing violent conflicts that have a profound effect on their societies. As a result, social networks develop under different conditions than elsewhere, and our goal is to provide data for studying this phenomenon. In this dataset paper, we present a data collection of a national Twittersphere in a West African country of conflict. While not the largest social network in terms of users, Twitter is an important platform where people engage in public discussion. The focus is on Mali, a country beset by conflict since 2012 that has recently had a relatively precarious media ecology. The dataset consists of tweets and Twitter users in Mali and was collected in June 2022, when the Malian conflict became more violent internally both towards external and international actors. In a preliminary analysis, we assume that the conflictual context influences how people access social media and, therefore, the shape of the Twittersphere and its characteristics. The aim of this paper is to primarily invite researchers from various disciplines including complex networks and social sciences scholars to explore the data at hand further. We collected the dataset using a scraping strategy of the follower network and the identification of characteristics of a Malian Twitter user. The given snapshot of the Malian Twitter follower network contains around seven million accounts, of which 56,000 are clearly identifiable as Malian. In addition, we present the tweets. The dataset is available at: https://osf.io/mj2qt/
△ Less
Submitted 24 October, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Deterministic equivalent and error universality of deep random features learning
Authors:
Dominik Schröder,
Hugo Cui,
Daniil Dmitriev,
Bruno Loureiro
Abstract:
This manuscript considers the problem of learning a random Gaussian network function using a fully connected network with frozen intermediate layers and trainable readout layer. This problem can be seen as a natural generalization of the widely studied random features model to deeper architectures. First, we prove Gaussian universality of the test error in a ridge regression setting where the lear…
▽ More
This manuscript considers the problem of learning a random Gaussian network function using a fully connected network with frozen intermediate layers and trainable readout layer. This problem can be seen as a natural generalization of the widely studied random features model to deeper architectures. First, we prove Gaussian universality of the test error in a ridge regression setting where the learner and target networks share the same intermediate layers, and provide a sharp asymptotic formula for it. Establishing this result requires proving a deterministic equivalent for traces of the deep random features sample covariance matrices which can be of independent interest. Second, we conjecture the asymptotic Gaussian universality of the test error in the more general setting of arbitrary convex losses and generic learner/target architectures. We provide extensive numerical evidence for this conjecture, which requires the derivation of closed-form expressions for the layer-wise post-activation population covariances. In light of our results, we investigate the interplay between architecture design and implicit regularization.
△ Less
Submitted 1 February, 2023;
originally announced February 2023.
-
Optimal Lower Bound on Eigenvector Overlaps for non-Hermitian Random Matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Joscha Henheik,
Dominik Schröder
Abstract:
We consider large non-Hermitian $N\times N$ matrices with an additive independent, identically distributed (i.i.d.) noise for each matrix elements. We show that already a small noise of variance $1/N$ completely thermalises the bulk singular vectors, in particular they satisfy the strong form of Quantum Unique Ergodicity (QUE) with an optimal speed of convergence. In physics terms, we thus extend…
▽ More
We consider large non-Hermitian $N\times N$ matrices with an additive independent, identically distributed (i.i.d.) noise for each matrix elements. We show that already a small noise of variance $1/N$ completely thermalises the bulk singular vectors, in particular they satisfy the strong form of Quantum Unique Ergodicity (QUE) with an optimal speed of convergence. In physics terms, we thus extend the Eigenstate Thermalisation Hypothesis, formulated originally by [Deutsch 1991] and proven for Wigner matrices in [Cipolloni, Erdős, Schröder 2020], to arbitrary non-Hermitian matrices with an i.i.d. noise. As a consequence we obtain an optimal lower bound on the diagonal overlaps of the corresponding non-Hermitian eigenvectors. This quantity, also known as the (square of the) eigenvalue condition number measuring the sensitivity of the eigenvalue to small perturbations, has notoriously escaped rigorous treatment beyond the explicitly computable Ginibre ensemble apart from the very recent upper bounds given in [arXiv:2005.08930] and [arXiv:2005.08908]. As a key tool, we develop a new systematic decomposition of general observables in random matrix theory that governs the size of products of resolvents with deterministic matrices in between.
△ Less
Submitted 11 January, 2024; v1 submitted 9 January, 2023;
originally announced January 2023.
-
Mesoscopic Central Limit Theorem for non-Hermitian Random Matrices
Authors:
Giorgio Cipolloni,
László Endős,
Dominik Schröder
Abstract:
We prove that the mesoscopic linear statistics $\sum_i f(n^a(σ_i-z_0))$ of the eigenvalues $\{σ_i\}_i$ of large $n\times n$ non-Hermitian random matrices with complex centred i.i.d. entries are asymptotically Gaussian for any $H^{2}_0$-functions $f$ around any point $z_0$ in the bulk of the spectrum on any mesoscopic scale $0<a<1/2$. This extends our previous result [arXiv:1912.04100], that was va…
▽ More
We prove that the mesoscopic linear statistics $\sum_i f(n^a(σ_i-z_0))$ of the eigenvalues $\{σ_i\}_i$ of large $n\times n$ non-Hermitian random matrices with complex centred i.i.d. entries are asymptotically Gaussian for any $H^{2}_0$-functions $f$ around any point $z_0$ in the bulk of the spectrum on any mesoscopic scale $0<a<1/2$. This extends our previous result [arXiv:1912.04100], that was valid on the macroscopic scale, $a=0$, to cover the entire mesoscopic regime. The main novelty is a local law for the product of resolvents for the Hermitization of $X$ at spectral parameters $z_1, z_2$ with an improved error term in the entire mesoscopic regime $|z_1-z_2|\gg n^{-1/2}$. The proof is dynamical; it relies on a recursive tandem of the characteristic flow method and the Green function comparison idea combined with a separation of the unstable mode of the underlying stability operator.
△ Less
Submitted 1 March, 2024; v1 submitted 21 October, 2022;
originally announced October 2022.
-
LungViT: Ensembling Cascade of Texture Sensitive Hierarchical Vision Transformers for Cross-Volume Chest CT Image-to-Image Translation
Authors:
Muhammad F. A. Chaudhary,
Sarah E. Gerard,
Gary E. Christensen,
Christopher B. Cooper,
Joyce D. Schroeder,
Eric A. Hoffman,
Joseph M. Reinhardt
Abstract:
Chest computed tomography (CT) at inspiration is often complemented by an expiratory CT to identify peripheral airways disease. Additionally, co-registered inspiratory-expiratory volumes can be used to derive various markers of lung function. Expiratory CT scans, however, may not be acquired due to dose or scan time considerations or may be inadequate due to motion or insufficient exhale; leading…
▽ More
Chest computed tomography (CT) at inspiration is often complemented by an expiratory CT to identify peripheral airways disease. Additionally, co-registered inspiratory-expiratory volumes can be used to derive various markers of lung function. Expiratory CT scans, however, may not be acquired due to dose or scan time considerations or may be inadequate due to motion or insufficient exhale; leading to a missed opportunity to evaluate underlying small airways disease. Here, we propose LungViT - a generative adversarial learning approach using hierarchical vision transformers for translating inspiratory CT intensities to corresponding expiratory CT intensities. LungViT addresses several limitations of the traditional generative models including slicewise discontinuities, limited size of generated volumes, and their inability to model texture transfer at volumetric level. We propose a shifted-window hierarchical vision transformer architecture with squeeze-and-excitation decoder blocks for modeling dependencies between features. We also propose a multiview texture similarity distance metric for texture and style transfer in 3D. To incorporate global information into the training process and refine the output of our model, we use ensemble cascading. LungViT is able to generate large 3D volumes of size 320 x 320 x 320. We train and validate our model using a diverse cohort of 1500 subjects with varying disease severity. To assess model generalizability beyond the development set biases, we evaluate our model on an out-of-distribution external validation set of 200 subjects. Clinical validation on internal and external testing sets shows that synthetic volumes could be reliably adopted for deriving clinical endpoints of chronic obstructive pulmonary disease.
△ Less
Submitted 27 August, 2023; v1 submitted 5 October, 2022;
originally announced October 2022.
-
Localization supervision of chest x-ray classifiers using label-specific eye-tracking annotation
Authors:
Ricardo Bigolin Lanfredi,
Joyce D. Schroeder,
Tolga Tasdizen
Abstract:
Convolutional neural networks (CNNs) have been successfully applied to chest x-ray (CXR) images. Moreover, annotated bounding boxes have been shown to improve the interpretability of a CNN in terms of localizing abnormalities. However, only a few relatively small CXR datasets containing bounding boxes are available, and collecting them is very costly. Opportunely, eye-tracking (ET) data can be col…
▽ More
Convolutional neural networks (CNNs) have been successfully applied to chest x-ray (CXR) images. Moreover, annotated bounding boxes have been shown to improve the interpretability of a CNN in terms of localizing abnormalities. However, only a few relatively small CXR datasets containing bounding boxes are available, and collecting them is very costly. Opportunely, eye-tracking (ET) data can be collected in a non-intrusive way during the clinical workflow of a radiologist. We use ET data recorded from radiologists while dictating CXR reports to train CNNs. We extract snippets from the ET data by associating them with the dictation of keywords and use them to supervise the localization of specific abnormalities. We show that this method improves a model's interpretability without impacting its image-level classification.
△ Less
Submitted 14 December, 2022; v1 submitted 20 July, 2022;
originally announced July 2022.
-
On the rightmost eigenvalue of non-Hermitian random matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder,
Yuanyuan Xu
Abstract:
We establish a precise three-term asymptotic expansion, with an optimal estimate of the error term, for the rightmost eigenvalue of an $n\times n$ random matrix with independent identically distributed complex entries as $n$ tends to infinity. All terms in the expansion are universal.
We establish a precise three-term asymptotic expansion, with an optimal estimate of the error term, for the rightmost eigenvalue of an $n\times n$ random matrix with independent identically distributed complex entries as $n$ tends to infinity. All terms in the expansion are universal.
△ Less
Submitted 22 June, 2023; v1 submitted 9 June, 2022;
originally announced June 2022.
-
Directional Extremal Statistics for Ginibre Eigenvalues
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder,
Yuanyuan Xu
Abstract:
We consider the eigenvalues of a large dimensional real or complex Ginibre matrix in the region of the complex plane where their real parts reach their maximum value. This maximum follows the Gumbel distribution and that these extreme eigenvalues form a Poisson point process, asymptotically as the dimension tends to infinity. In the complex case these facts have already been established by Bender…
▽ More
We consider the eigenvalues of a large dimensional real or complex Ginibre matrix in the region of the complex plane where their real parts reach their maximum value. This maximum follows the Gumbel distribution and that these extreme eigenvalues form a Poisson point process, asymptotically as the dimension tends to infinity. In the complex case these facts have already been established by Bender \cite{MR2594353} and in the real case by Akemann and Phillips \cite{MR3192169} even for the more general elliptic ensemble with a sophisticated saddle point analysis. The purpose of this note is to give a very short direct proof in the Ginibre case with an effective error term. Moreover, our estimates on the correlation kernel in this regime serve as a key input for accurately locating $\max\Re\mathrm{Spec}(X)$ for any large matrix $X$ with i.i.d. entries in the companion paper \cite{2206.04448}.
△ Less
Submitted 15 June, 2022; v1 submitted 9 June, 2022;
originally announced June 2022.
-
Switching between Numerical Black-box Optimization Algorithms with Warm-starting Policies
Authors:
Dominik Schröder,
Diederick Vermetten,
Hao Wang,
Carola Doerr,
Thomas Bäck
Abstract:
When solving optimization problems with black-box approaches, the algorithms gather valuable information about the problem instance during the optimization process. This information is used to adjust the distributions from which new solution candidates are sampled. In fact, a key objective in evolutionary computation is to identify the most effective ways to collect and exploit instance knowledge.…
▽ More
When solving optimization problems with black-box approaches, the algorithms gather valuable information about the problem instance during the optimization process. This information is used to adjust the distributions from which new solution candidates are sampled. In fact, a key objective in evolutionary computation is to identify the most effective ways to collect and exploit instance knowledge. However, while considerable work is devoted to adjusting hyper-parameters of black-box optimization algorithms on the fly or exchanging some of its modular components, we barely know how to effectively switch between different black-box optimization algorithms.
In this work, we build on the recent study of Vermetten et al. [GECCO 2020], who presented a data-driven approach to investigate promising switches between pairs of algorithms for numerical black-box optimization. We replicate their approach with a portfolio of five algorithms and investigate whether the predicted performance gains are realized when executing the most promising switches. Our results suggest that with a single switch between two algorithms, we outperform the best static choice among the five algorithms on 48 out of the 120 considered problem instances, the 24 BBOB functions in five different dimensions. We also show that for switching between BFGS and CMA-ES, a proper warm-starting of the parameters is crucial to realize high-performance gains. Lastly, with a sensitivity analysis, we find the actual performance gain per run is largely affected by the switching point, and in some cases, the switching point yielding the best actual performance differs from the one computed from the theoretical gain.
△ Less
Submitted 12 January, 2023; v1 submitted 13 April, 2022;
originally announced April 2022.
-
Rank-uniform local law for Wigner matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We prove a general local law for Wigner matrices which optimally handles observables of arbitrary rank and thus it unifies the well-known averaged and isotropic local laws. As an application, we prove that the quadratic forms of a general deterministic matrix $A$ on the bulk eigenvectors of a Wigner matrix has approximately Gaussian fluctuation. For the bulk spectrum, we thus generalize our previo…
▽ More
We prove a general local law for Wigner matrices which optimally handles observables of arbitrary rank and thus it unifies the well-known averaged and isotropic local laws. As an application, we prove that the quadratic forms of a general deterministic matrix $A$ on the bulk eigenvectors of a Wigner matrix has approximately Gaussian fluctuation. For the bulk spectrum, we thus generalize our previous result [arXiv:2103.06730] valid for test matrices $A$ of large rank as well as the result of Benigni and Lopatto [arXiv:2103.12013] valid for specific small rank observables.
△ Less
Submitted 6 September, 2023; v1 submitted 3 March, 2022;
originally announced March 2022.
-
Optimal multi-resolvent local laws for Wigner matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We prove local laws, i.e. optimal concentration estimates for arbitrary products of resolvents of a Wigner random matrix with deterministic matrices in between. We find that the size of such products heavily depends on whether some of the deterministic matrices are traceless. Our estimates correctly account for this dependence and they hold optimally down to the smallest possible spectral scale.
We prove local laws, i.e. optimal concentration estimates for arbitrary products of resolvents of a Wigner random matrix with deterministic matrices in between. We find that the size of such products heavily depends on whether some of the deterministic matrices are traceless. Our estimates correctly account for this dependence and they hold optimally down to the smallest possible spectral scale.
△ Less
Submitted 1 November, 2022; v1 submitted 27 December, 2021;
originally announced December 2021.
-
Comparing radiologists' gaze and saliency maps generated by interpretability methods for chest x-rays
Authors:
Ricardo Bigolin Lanfredi,
Ambuj Arora,
Trafton Drew,
Joyce D. Schroeder,
Tolga Tasdizen
Abstract:
The interpretability of medical image analysis models is considered a key research field. We use a dataset of eye-tracking data from five radiologists to compare the outputs of interpretability methods and the heatmaps representing where radiologists looked. We conduct a class-independent analysis of the saliency maps generated by two methods selected from the literature: Grad-CAM and attention ma…
▽ More
The interpretability of medical image analysis models is considered a key research field. We use a dataset of eye-tracking data from five radiologists to compare the outputs of interpretability methods and the heatmaps representing where radiologists looked. We conduct a class-independent analysis of the saliency maps generated by two methods selected from the literature: Grad-CAM and attention maps from an attention-gated model. For the comparison, we use shuffled metrics, which avoid biases from fixation locations. We achieve scores comparable to an interobserver baseline in one shuffled metric, highlighting the potential of saliency maps from Grad-CAM to mimic a radiologist's attention over an image. We also divide the dataset into subsets to evaluate in which cases similarities are higher.
△ Less
Submitted 19 April, 2023; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Single volume lung biomechanics from chest computed tomography using a mode preserving generative adversarial network
Authors:
Muhammad F. A. Chaudhary,
Sarah E. Gerard,
Di Wang,
Gary E. Christensen,
Christopher B. Cooper,
Joyce D. Schroeder,
Eric A. Hoffman,
Joseph M. Reinhardt
Abstract:
Local tissue expansion of the lungs is typically derived by registering computed tomography (CT) scans acquired at multiple lung volumes. However, acquiring multiple scans incurs increased radiation dose, time, and cost, and may not be possible in many cases, thus restricting the applicability of registration-based biomechanics. We propose a generative adversarial learning approach for estimating…
▽ More
Local tissue expansion of the lungs is typically derived by registering computed tomography (CT) scans acquired at multiple lung volumes. However, acquiring multiple scans incurs increased radiation dose, time, and cost, and may not be possible in many cases, thus restricting the applicability of registration-based biomechanics. We propose a generative adversarial learning approach for estimating local tissue expansion directly from a single CT scan. The proposed framework was trained and evaluated on 2500 subjects from the SPIROMICS cohort. Once trained, the framework can be used as a registration-free method for predicting local tissue expansion. We evaluated model performance across varying degrees of disease severity and compared its performance with two image-to-image translation frameworks - UNet and Pix2Pix. Our model achieved an overall PSNR of 18.95 decibels, SSIM of 0.840, and Spearman's correlation of 0.61 at a high spatial resolution of 1 mm3.
△ Less
Submitted 15 October, 2021;
originally announced October 2021.
-
REFLACX, a dataset of reports and eye-tracking data for localization of abnormalities in chest x-rays
Authors:
Ricardo Bigolin Lanfredi,
Mingyuan Zhang,
William F. Auffermann,
Jessica Chan,
Phuong-Anh T. Duong,
Vivek Srikumar,
Trafton Drew,
Joyce D. Schroeder,
Tolga Tasdizen
Abstract:
Deep learning has shown recent success in classifying anomalies in chest x-rays, but datasets are still small compared to natural image datasets. Supervision of abnormality localization has been shown to improve trained models, partially compensating for dataset sizes. However, explicitly labeling these anomalies requires an expert and is very time-consuming. We propose a potentially scalable meth…
▽ More
Deep learning has shown recent success in classifying anomalies in chest x-rays, but datasets are still small compared to natural image datasets. Supervision of abnormality localization has been shown to improve trained models, partially compensating for dataset sizes. However, explicitly labeling these anomalies requires an expert and is very time-consuming. We propose a potentially scalable method for collecting implicit localization data using an eye tracker to capture gaze locations and a microphone to capture a dictation of a report, imitating the setup of a reading room. The resulting REFLACX (Reports and Eye-Tracking Data for Localization of Abnormalities in Chest X-rays) dataset was labeled across five radiologists and contains 3,032 synchronized sets of eye-tracking data and timestamped report transcriptions for 2,616 chest x-rays from the MIMIC-CXR dataset. We also provide auxiliary annotations, including bounding boxes around lungs and heart and validation labels consisting of ellipses localizing abnormalities and image-level labels. Furthermore, a small subset of the data contains readings from all radiologists, allowing for the calculation of inter-rater scores.
△ Less
Submitted 28 June, 2022; v1 submitted 29 September, 2021;
originally announced September 2021.
-
On the Spectral Form Factor for Random Matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
In the physics literature the spectral form factor (SFF), the squared Fourier transform of the empirical eigenvalue density, is the most common tool to test universality for disordered quantum systems, yet previous mathematical results have been restricted only to two exactly solvable models [Forrester 2020]. We rigorously prove the physics prediction on SFF up to an intermediate time scale for a…
▽ More
In the physics literature the spectral form factor (SFF), the squared Fourier transform of the empirical eigenvalue density, is the most common tool to test universality for disordered quantum systems, yet previous mathematical results have been restricted only to two exactly solvable models [Forrester 2020]. We rigorously prove the physics prediction on SFF up to an intermediate time scale for a large class of random matrices using a robust method, the multi-resolvent local laws. Beyond Wigner matrices we also consider the monoparametric ensemble and prove that universality of SFF can already be triggered by a single random parameter, extending the recently proven Wigner-Dyson universality [Cipolloni, Erdős, Schröder 2021] to some larger spectral scales. Remarkably, extensive numerics indicates that our formulas correctly predict the SFF in the entire slope-dip-ramp regime, as customarily called in physics.
△ Less
Submitted 7 March, 2023; v1 submitted 14 September, 2021;
originally announced September 2021.
-
Quenched universality for deformed Wigner matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
Following E. Wigner's original vision, we prove that sampling the eigenvalue gaps within the bulk spectrum of a fixed (deformed) Wigner matrix $H$ yields the celebrated Wigner-Dyson-Mehta universal statistics with high probability. Similarly, we prove universality for a monoparametric family of deformed Wigner matrices $H+xA$ with a deterministic Hermitian matrix $A$ and a fixed Wigner matrix $H$,…
▽ More
Following E. Wigner's original vision, we prove that sampling the eigenvalue gaps within the bulk spectrum of a fixed (deformed) Wigner matrix $H$ yields the celebrated Wigner-Dyson-Mehta universal statistics with high probability. Similarly, we prove universality for a monoparametric family of deformed Wigner matrices $H+xA$ with a deterministic Hermitian matrix $A$ and a fixed Wigner matrix $H$, just using the randomness of a single scalar real random variable $x$. Both results constitute quenched versions of bulk universality that has so far only been proven in annealed sense with respect to the probability space of the matrix ensemble.
△ Less
Submitted 17 April, 2024; v1 submitted 18 June, 2021;
originally announced June 2021.
-
Density of small singular values of the shifted real Ginibre ensemble
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We derive a precise asymptotic formula for the density of the small singular values of the real Ginibre matrix ensemble shifted by a complex parameter $z$ as the dimension tends to infinity. For $z$ away from the real axis the formula coincides with that for the complex Ginibre ensemble we derived earlier in [arXiv:1908.01653]. On the level of the one-point function of the low lying singular value…
▽ More
We derive a precise asymptotic formula for the density of the small singular values of the real Ginibre matrix ensemble shifted by a complex parameter $z$ as the dimension tends to infinity. For $z$ away from the real axis the formula coincides with that for the complex Ginibre ensemble we derived earlier in [arXiv:1908.01653]. On the level of the one-point function of the low lying singular values we thus confirm the transition from real to complex Ginibre ensembles as the shift parameter $z$ becomes genuinely complex; the analogous phenomenon has been well known for eigenvalues. We use the superbosonization formula [arXiv:0707.2929] in a regime where the main contribution comes from a three dimensional saddle manifold.
△ Less
Submitted 2 June, 2021; v1 submitted 28 May, 2021;
originally announced May 2021.
-
On the condition number of the shifted real Ginibre ensemble
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We derive an accurate lower tail estimate on the lowest singular value $σ_1(X-z)$ of a real Gaussian (Ginibre) random matrix $X$ shifted by a complex parameter $z$. Such shift effectively changes the upper tail behaviour of the condition number $κ(X-z)$ from the slower $\mathbf{P}(κ(X-z)\ge t)\lesssim 1/t$ decay typical for real Ginibre matrices to the faster $1/t^2$ decay seen for complex Ginibre…
▽ More
We derive an accurate lower tail estimate on the lowest singular value $σ_1(X-z)$ of a real Gaussian (Ginibre) random matrix $X$ shifted by a complex parameter $z$. Such shift effectively changes the upper tail behaviour of the condition number $κ(X-z)$ from the slower $\mathbf{P}(κ(X-z)\ge t)\lesssim 1/t$ decay typical for real Ginibre matrices to the faster $1/t^2$ decay seen for complex Ginibre matrices as long as $z$ is away from the real axis. This sharpens and resolves a recent conjecture in [arXiv:2005.08930] on the regularizing effect of the real Ginibre ensemble with a genuinely complex shift. As a consequence we obtain an improved upper bound on the eigenvalue condition numbers (known also as the eigenvector overlaps) for real Ginibre matrices. The main technical tool is a rigorous supersymmetric analysis from our earlier work [arXiv:1908.01653].
△ Less
Submitted 1 November, 2022; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Analysis of One-Hidden-Layer Neural Networks via the Resolvent Method
Authors:
Vanessa Piccolo,
Dominik Schröder
Abstract:
In this work, we investigate the asymptotic spectral density of the random feature matrix $M = Y Y^\ast$ with $Y = f(WX)$ generated by a single-hidden-layer neural network, where $W$ and $X$ are random rectangular matrices with i.i.d. centred entries and $f$ is a non-linear smooth function which is applied entry-wise. We prove that the Stieltjes transform of the limiting spectral distribution appr…
▽ More
In this work, we investigate the asymptotic spectral density of the random feature matrix $M = Y Y^\ast$ with $Y = f(WX)$ generated by a single-hidden-layer neural network, where $W$ and $X$ are random rectangular matrices with i.i.d. centred entries and $f$ is a non-linear smooth function which is applied entry-wise. We prove that the Stieltjes transform of the limiting spectral distribution approximately satisfies a quartic self-consistent equation, which is exactly the equation obtained by [Pennington, Worah] and [Benigni, Péché] with the moment method. We extend the previous results to the case of additive bias $Y=f(WX+B)$ with $B$ being an independent rank-one Gaussian random matrix, closer modelling the neural network infrastructures encountered in practice. Our key finding is that in the case of additive bias it is impossible to choose an activation function preserving the layer-to-layer singular value distribution, in sharp contrast to the bias-free case where a simple integral constraint is sufficient to achieve isospectrality. To obtain the asymptotics for the empirical spectral density we follow the resolvent method from random matrix theory via the cumulant expansion. We find that this approach is more robust and less combinatorial than the moment method and expect that it will apply also for models where the combinatorics of the former become intractable. The resolvent method has been widely employed, but compared to previous works, it is applied here to non-linear random matrices.
△ Less
Submitted 11 November, 2021; v1 submitted 11 May, 2021;
originally announced May 2021.
-
Normal fluctuation in quantum ergodicity for Wigner matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We consider the quadratic form of a general deterministic matrix on the eigenvectors of an $N\times N$ Wigner matrix and prove that it has Gaussian fluctuation for each bulk eigenvector in the large $N$ limit. The proof is a combination of the energy method for the Dyson Brownian motion inspired by [Marcinek, Yau 2020] and our recent multi-resolvent local laws [Cipolloni, Erdős, Schröder 2020].
We consider the quadratic form of a general deterministic matrix on the eigenvectors of an $N\times N$ Wigner matrix and prove that it has Gaussian fluctuation for each bulk eigenvector in the large $N$ limit. The proof is a combination of the energy method for the Dyson Brownian motion inspired by [Marcinek, Yau 2020] and our recent multi-resolvent local laws [Cipolloni, Erdős, Schröder 2020].
△ Less
Submitted 3 March, 2022; v1 submitted 11 March, 2021;
originally announced March 2021.
-
Thermalisation for Wigner matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We compute the deterministic approximation of products of Sobolev functions of large Wigner matrices $W$ and provide an optimal error bound on their fluctuation with very high probability. This generalizes Voiculescu's seminal theorem [Voiculescu 1991] from polynomials to general Sobolev functions, as well as from tracial quantities to individual matrix elements. Applying the result to…
▽ More
We compute the deterministic approximation of products of Sobolev functions of large Wigner matrices $W$ and provide an optimal error bound on their fluctuation with very high probability. This generalizes Voiculescu's seminal theorem [Voiculescu 1991] from polynomials to general Sobolev functions, as well as from tracial quantities to individual matrix elements. Applying the result to $\exp(\mathrm{i} tW)$ for large $t$, we obtain a precise decay rate for the overlaps of several deterministic matrices with temporally well separated Heisenberg time evolutions; thus we demonstrate the thermalisation effect of the unitary group generated by Wigner matrices.
△ Less
Submitted 27 January, 2023; v1 submitted 19 February, 2021;
originally announced February 2021.
-
Functional Central Limit Theorems for Wigner Matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We consider the fluctuations of regular functions $f$ of a Wigner matrix $W$ viewed as an entire matrix $f(W)$. Going beyond the well studied tracial mode, $\mathrm{Tr}[f(W)]$, which is equivalent to the customary linear statistics of eigenvalues, we show that $\mathrm{Tr}[f(W)]$ is asymptotically normal for any non-trivial bounded deterministic matrix $A$. We identify three different and asymptot…
▽ More
We consider the fluctuations of regular functions $f$ of a Wigner matrix $W$ viewed as an entire matrix $f(W)$. Going beyond the well studied tracial mode, $\mathrm{Tr}[f(W)]$, which is equivalent to the customary linear statistics of eigenvalues, we show that $\mathrm{Tr}[f(W)]$ is asymptotically normal for any non-trivial bounded deterministic matrix $A$. We identify three different and asymptotically independent modes of this fluctuation, corresponding to the tracial part, the traceless diagonal part and the off-diagonal part of $f(W)$ in the entire mesoscopic regime, where we find that the off-diagonal modes fluctuate on a much smaller scale than the tracial mode. In addition, we determine the fluctuations in the Eigenstate Thermalisation Hypothesis [Deutsch 1991], i.e. prove that the eigenfunction overlaps with any deterministic matrix are asymptotically Gaussian after a small spectral averaging. In particular, in the macroscopic regime our result generalises [Lytova 2013] to complex $W$ and to all crossover ensembles in between. The main technical inputs are the recent multi-resolvent local laws with traceless deterministic matrices from the companion paper [Cipolloni, Erdős, Schröder 2020].
△ Less
Submitted 27 April, 2023; v1 submitted 24 December, 2020;
originally announced December 2020.
-
Eigenstate Thermalization Hypothesis for Wigner Matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We prove that any deterministic matrix is approximately the identity in the eigenbasis of a large random Wigner matrix with very high probability and with an optimal error inversely proportional to the square root of the dimension. Our theorem thus rigorously verifies the Eigenstate Thermalization Hypothesis by Deutsch [Deutsch 1991] for the simplest chaotic quantum system, the Wigner ensemble. In…
▽ More
We prove that any deterministic matrix is approximately the identity in the eigenbasis of a large random Wigner matrix with very high probability and with an optimal error inversely proportional to the square root of the dimension. Our theorem thus rigorously verifies the Eigenstate Thermalization Hypothesis by Deutsch [Deutsch 1991] for the simplest chaotic quantum system, the Wigner ensemble. In mathematical terms, we prove the strong form of Quantum Unique Ergodicity (QUE) with an optimal convergence rate for all eigenvectors simultaneously, generalizing previous probabilistic QUE results in [Bourgade, Yau 2017] and [Bourgade, Yau, Yin 2020].
△ Less
Submitted 3 March, 2021; v1 submitted 24 December, 2020;
originally announced December 2020.
-
Quantifying the Preferential Direction of the Model Gradient in Adversarial Training With Projected Gradient Descent
Authors:
Ricardo Bigolin Lanfredi,
Joyce D. Schroeder,
Tolga Tasdizen
Abstract:
Adversarial training, especially projected gradient descent (PGD), has proven to be a successful approach for improving robustness against adversarial attacks. After adversarial training, gradients of models with respect to their inputs have a preferential direction. However, the direction of alignment is not mathematically well established, making it difficult to evaluate quantitatively. We propo…
▽ More
Adversarial training, especially projected gradient descent (PGD), has proven to be a successful approach for improving robustness against adversarial attacks. After adversarial training, gradients of models with respect to their inputs have a preferential direction. However, the direction of alignment is not mathematically well established, making it difficult to evaluate quantitatively. We propose a novel definition of this direction as the direction of the vector pointing toward the closest point of the support of the closest inaccurate class in decision space. To evaluate the alignment with this direction after adversarial training, we apply a metric that uses generative adversarial networks to produce the smallest residual needed to change the class present in the image. We show that PGD-trained models have a higher alignment than the baseline according to our definition, that our metric presents higher alignment values than a competing metric formulation, and that enforcing this alignment increases the robustness of models.
△ Less
Submitted 19 April, 2023; v1 submitted 10 September, 2020;
originally announced September 2020.
-
Interpretation of Disease Evidence for Medical Images Using Adversarial Deformation Fields
Authors:
Ricardo Bigolin Lanfredi,
Joyce D. Schroeder,
Clement Vachet,
Tolga Tasdizen
Abstract:
The high complexity of deep learning models is associated with the difficulty of explaining what evidence they recognize as correlating with specific disease labels. This information is critical for building trust in models and finding their biases. Until now, automated deep learning visualization solutions have identified regions of images used by classifiers, but these solutions are too coarse,…
▽ More
The high complexity of deep learning models is associated with the difficulty of explaining what evidence they recognize as correlating with specific disease labels. This information is critical for building trust in models and finding their biases. Until now, automated deep learning visualization solutions have identified regions of images used by classifiers, but these solutions are too coarse, too noisy, or have a limited representation of the way images can change. We propose a novel method for formulating and presenting spatial explanations of disease evidence, called deformation field interpretation with generative adversarial networks (DeFI-GAN). An adversarially trained generator produces deformation fields that modify images of diseased patients to resemble images of healthy patients. We validate the method studying chronic obstructive pulmonary disease (COPD) evidence in chest x-rays (CXRs) and Alzheimer's disease (AD) evidence in brain MRIs. When extracting disease evidence in longitudinal data, we show compelling results against a baseline producing difference maps. DeFI-GAN also highlights disease biomarkers not found by previous methods and potential biases that may help in investigations of the dataset and of the adopted learning methods.
△ Less
Submitted 19 April, 2023; v1 submitted 3 July, 2020;
originally announced July 2020.
-
Fluctuation Around the Circular Law for Random Matrices with Real Entries
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We extend our recent result [Cipolloni, Erdős, Schröder 2019] on the central limit theorem for the linear eigenvalue statistics of non-Hermitian matrices $X$ with independent, identically distributed complex entries to the real symmetry class. We find that the expectation and variance substantially differ from their complex counterparts, reflecting (i) the special spectral symmetry of real matrice…
▽ More
We extend our recent result [Cipolloni, Erdős, Schröder 2019] on the central limit theorem for the linear eigenvalue statistics of non-Hermitian matrices $X$ with independent, identically distributed complex entries to the real symmetry class. We find that the expectation and variance substantially differ from their complex counterparts, reflecting (i) the special spectral symmetry of real matrices onto the real axis; and (ii) the fact that real i.i.d. matrices have many real eigenvalues. Our result generalizes the previously known special cases where either the test function is analytic [O'Rourke, Renfrew 2016] or the first four moments of the matrix elements match the real Gaussian [Tao, Vu 2015; Kopel 2015]. The key element of the proof is the analysis of several weakly dependent Dyson Brownian motions (DBMs). The conceptual novelty of the real case compared with [Cipolloni, Erdős, Schröder 2019] is that the correlation structure of the stochastic differentials in each individual DBM is non-trivial, potentially even jeopardising its well-posedness.
△ Less
Submitted 31 January, 2024; v1 submitted 6 February, 2020;
originally announced February 2020.
-
Central Limit Theorem for Linear Eigenvalue Statistics of non-Hermitian Random Matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We consider large non-Hermitian random matrices $X$ with complex, independent, identically distributed centred entries and show that the linear statistics of their eigenvalues are asymptotically Gaussian for test functions having $2+ε$ derivatives. Previously this result was known only for a few special cases; either the test functions were required to be analytic [Rider, Silverstein 2006], or the…
▽ More
We consider large non-Hermitian random matrices $X$ with complex, independent, identically distributed centred entries and show that the linear statistics of their eigenvalues are asymptotically Gaussian for test functions having $2+ε$ derivatives. Previously this result was known only for a few special cases; either the test functions were required to be analytic [Rider, Silverstein 2006], or the distribution of the matrix elements needed to be Gaussian [Rider, Virág 2007], or at least match the Gaussian up to the first four moments [Tao, Vu 2016; Kopel 2015]. We find the exact dependence of the limiting variance on the fourth cumulant that was not known before. The proof relies on two novel ingredients: (i) a local law for a product of two resolvents of the Hermitisation of $X$ with different spectral parameters and (ii) a coupling of several weakly dependent Dyson Brownian Motions. These methods are also the key inputs for our analogous results on the linear eigenvalue statistics of real matrices $X$ that are presented in the companion paper [Cipolloni, Erdős, Schröder 2019].
△ Less
Submitted 13 October, 2023; v1 submitted 9 December, 2019;
originally announced December 2019.
-
Kinetic equations for sterile neutrinos from thermal fluctuations
Authors:
Dietrich Bodeker,
Dennis Schroder
Abstract:
We obtain non-linear kinetic equations for sterile neutrino occupancies and lepton minus baryon numbers by matching real time correlation functions of thermal fluctuations computed in an effective description to those computed in thermal quantum field theory. After expanding in the sterile-neutrino Yukawa couplings, the coefficients in the equations are written as real time correlation functions o…
▽ More
We obtain non-linear kinetic equations for sterile neutrino occupancies and lepton minus baryon numbers by matching real time correlation functions of thermal fluctuations computed in an effective description to those computed in thermal quantum field theory. After expanding in the sterile-neutrino Yukawa couplings, the coefficients in the equations are written as real time correlation functions of Standard Model operators. Our kinetic equations are valid for an arbitrary number of sterile neutrinos of any mass spectrum. They can be used to describe, e.g., low-scale leptogenesis via neutrino oscillations, or sterile neutrino dark matter production in the Higgs phase.
△ Less
Submitted 27 February, 2020; v1 submitted 12 November, 2019;
originally announced November 2019.
-
Towards the bulk universality of non-Hermitian random matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We consider the non-Hermitian analogue of the celebrated Wigner-Dyson-Mehta bulk universality phenomenon, i.e. that in the bulk the local eigenvalue statistics of a large random matrix with independent, identically distributed centred entries are universal, in particular they asymptotically coincide with those of the Ginibre ensemble in the corresponding symmetry class. In this paper we reduce thi…
▽ More
We consider the non-Hermitian analogue of the celebrated Wigner-Dyson-Mehta bulk universality phenomenon, i.e. that in the bulk the local eigenvalue statistics of a large random matrix with independent, identically distributed centred entries are universal, in particular they asymptotically coincide with those of the Ginibre ensemble in the corresponding symmetry class. In this paper we reduce this problem to understanding a certain microscopic regime for the Hermitized resolvent in Girko's formula by showing that all other regimes are negligible.
△ Less
Submitted 15 September, 2020; v1 submitted 13 September, 2019;
originally announced September 2019.
-
Adversarial regression training for visualizing the progression of chronic obstructive pulmonary disease with chest x-rays
Authors:
Ricardo Bigolin Lanfredi,
Joyce D. Schroeder,
Clement Vachet,
Tolga Tasdizen
Abstract:
Knowledge of what spatial elements of medical images deep learning methods use as evidence is important for model interpretability, trustiness, and validation. There is a lack of such techniques for models in regression tasks. We propose a method, called visualization for regression with a generative adversarial network (VR-GAN), for formulating adversarial training specifically for datasets conta…
▽ More
Knowledge of what spatial elements of medical images deep learning methods use as evidence is important for model interpretability, trustiness, and validation. There is a lack of such techniques for models in regression tasks. We propose a method, called visualization for regression with a generative adversarial network (VR-GAN), for formulating adversarial training specifically for datasets containing regression target values characterizing disease severity. We use a conditional generative adversarial network where the generator attempts to learn to shift the output of a regressor through creating disease effect maps that are added to the original images. Meanwhile, the regressor is trained to predict the original regression value for the modified images. A model trained with this technique learns to provide visualization for how the image would appear at different stages of the disease. We analyze our method in a dataset of chest x-rays associated with pulmonary function tests, used for diagnosing chronic obstructive pulmonary disease (COPD). For validation, we compute the difference of two registered x-rays of the same patient at different time points and correlate it to the generated disease effect map. The proposed method outperforms a technique based on classification and provides realistic-looking images, making modifications to images following what radiologists usually observe for this disease. Implementation code is available at https://github.com/ricbl/vrgan.
△ Less
Submitted 27 August, 2019;
originally announced August 2019.
-
Optimal Lower Bound on the Least Singular Value of the Shifted Ginibre Ensemble
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We consider the least singular value of a large random matrix with real or complex i.i.d. Gaussian entries shifted by a constant $z\in\mathbb{C}$. We prove an optimal lower tail estimate on this singular value in the critical regime where $z$ is around the spectral edge thus improving the classical bound of [Sankar, Spielman, Teng, 2006] in the edge regime. Lacking Brézin-Hikami formulas in the re…
▽ More
We consider the least singular value of a large random matrix with real or complex i.i.d. Gaussian entries shifted by a constant $z\in\mathbb{C}$. We prove an optimal lower tail estimate on this singular value in the critical regime where $z$ is around the spectral edge thus improving the classical bound of [Sankar, Spielman, Teng, 2006] in the edge regime. Lacking Brézin-Hikami formulas in the real case, we rely on the superbosonization formula [Littelmann, Sommers, Zirnbauer, 2008].
△ Less
Submitted 1 November, 2022; v1 submitted 5 August, 2019;
originally announced August 2019.
-
Edge Universality for non-Hermitian Random Matrices
Authors:
Giorgio Cipolloni,
László Erdős,
Dominik Schröder
Abstract:
We consider large non-Hermitian real or complex random matrices $X$ with independent, identically distributed centred entries. We prove that their local eigenvalue statistics near the spectral edge, the unit circle, coincide with those of the Ginibre ensemble, i.e. when the matrix elements of $X$ are Gaussian. This result is the non-Hermitian counterpart of the universality of the Tracy-Widom dist…
▽ More
We consider large non-Hermitian real or complex random matrices $X$ with independent, identically distributed centred entries. We prove that their local eigenvalue statistics near the spectral edge, the unit circle, coincide with those of the Ginibre ensemble, i.e. when the matrix elements of $X$ are Gaussian. This result is the non-Hermitian counterpart of the universality of the Tracy-Widom distribution at the spectral edges of the Wigner ensemble.
△ Less
Submitted 9 September, 2020; v1 submitted 2 August, 2019;
originally announced August 2019.
-
Quantum matrix diagonalization visualized
Authors:
Kevin Randles,
Daniel V. Schroeder,
Bruce R. Thomas
Abstract:
We show how to visualize the process of diagonalizing the Hamiltonian matrix to find the energy eigenvalues and eigenvectors of a generic one-dimensional quantum system. Starting in the familiar sine-wave basis of an embedding infinite square well, we display the Hamiltonian matrix graphically with the basis functions alongside. Each step in the diagonalization process consists of selecting a nonz…
▽ More
We show how to visualize the process of diagonalizing the Hamiltonian matrix to find the energy eigenvalues and eigenvectors of a generic one-dimensional quantum system. Starting in the familiar sine-wave basis of an embedding infinite square well, we display the Hamiltonian matrix graphically with the basis functions alongside. Each step in the diagonalization process consists of selecting a nonzero off-diagonal matrix element, then rotating the two corresponding basis vectors in their own subspace until this element is zero. We provide Mathematica code to display the effects of these rotations on both the matrix and the basis functions. As an electronic supplement we also provide a JavaScript web app to interactively carry out this process.
△ Less
Submitted 30 May, 2019;
originally announced May 2019.
-
Reflections On the Anomalous ANITA Events: The Antarctic Subsurface as a Possible Explanation
Authors:
Ian M. Shoemaker,
Alexander Kusenko,
Peter Kuipers Munneke,
Andrew Romero-Wolf,
Dustin M. Schroeder,
Martin J. Siegert
Abstract:
The ANITA balloon experiment was designed to detect radio signals initiated by neutrinos and cosmic ray air showers. These signals are typically discriminated by the polarization and phase inversions of the radio signal. The reflected signal from cosmic rays suffer phase inversion compared to a direct tau neutrino event. In this paper we study sub-surface reflection, which can occur without phase…
▽ More
The ANITA balloon experiment was designed to detect radio signals initiated by neutrinos and cosmic ray air showers. These signals are typically discriminated by the polarization and phase inversions of the radio signal. The reflected signal from cosmic rays suffer phase inversion compared to a direct tau neutrino event. In this paper we study sub-surface reflection, which can occur without phase inversion, in the context of the two anomalous up-going events reported by ANITA. We find that subsurface layers and firn density inversions may plausibly account for the events, while ice fabric layers and wind ablation crusts could also play a role. This hypothesis can be tested with radar surveying of the Antarctic region in the vicinity of the anomalous ANITA events. Future experiments should not use phase inversion as a sole criterion to discriminate between downgoing and upgoing events, unless the subsurface reflection properties are well understood.
△ Less
Submitted 7 May, 2019;
originally announced May 2019.
-
Equilibration of right-handed electrons
Authors:
Dietrich Bodeker,
Dennis Schroder
Abstract:
We study the equilibration of right-handed electrons in the symmetric phase of the Standard Model. Due to the smallness of the electron Yukawa coupling, it happens relatively late in the history of the Universe. We compute the equilibration rate at leading order in the Standard Model couplings, by including gauge interactions, the top Yukawa- and the Higgs self-interaction. The dominant contributi…
▽ More
We study the equilibration of right-handed electrons in the symmetric phase of the Standard Model. Due to the smallness of the electron Yukawa coupling, it happens relatively late in the history of the Universe. We compute the equilibration rate at leading order in the Standard Model couplings, by including gauge interactions, the top Yukawa- and the Higgs self-interaction. The dominant contribution is due to $ 2 \to 2 $ particle scattering, even though the rate of (inverse) Higgs decays is strongly enhanced by multiple soft scattering which is included by Landau-Pomeranchuk-Migdal (LPM) resummation. Our numerical result is substantially larger than approximations presented in previous literature.
△ Less
Submitted 29 May, 2019; v1 submitted 19 February, 2019;
originally announced February 2019.
-
Cusp Universality for Random Matrices II: The Real Symmetric Case
Authors:
Giorgio Cipolloni,
László Erdős,
Torben Krüger,
Dominik Schröder
Abstract:
We prove that the local eigenvalue statistics of real symmetric Wigner-type matrices near the cusp points of the eigenvalue density are universal. Together with the companion paper [arXiv:1809.03971], which proves the same result for the complex Hermitian symmetry class, this completes the last remaining case of the Wigner-Dyson-Mehta universality conjecture after bulk and edge universalities have…
▽ More
We prove that the local eigenvalue statistics of real symmetric Wigner-type matrices near the cusp points of the eigenvalue density are universal. Together with the companion paper [arXiv:1809.03971], which proves the same result for the complex Hermitian symmetry class, this completes the last remaining case of the Wigner-Dyson-Mehta universality conjecture after bulk and edge universalities have been established in the last years. We extend the recent Dyson Brownian motion analysis at the edge [arXiv:1712.03881] to the cusp regime using the optimal local law from [arXiv:1809.03971] and the accurate local shape analysis of the density from [arXiv:1506.05095, arXiv:1804.07752]. We also present a PDE-based method to improve the estimate on eigenvalue rigidity via the maximum principle of the heat flow related to the Dyson Brownian motion.
△ Less
Submitted 22 October, 2019; v1 submitted 9 November, 2018;
originally announced November 2018.
-
Cusp Universality for Random Matrices I: Local Law and the Complex Hermitian Case
Authors:
László Erdős,
Torben Krüger,
Dominik Schröder
Abstract:
For complex Wigner-type matrices, i.e. Hermitian random matrices with independent, not necessarily identically distributed entries above the diagonal, we show that at any cusp singularity of the limiting eigenvalue distribution the local eigenvalue statistics are universal and form a Pearcey process. Since the density of states typically exhibits only square root or cubic root cusp singularities,…
▽ More
For complex Wigner-type matrices, i.e. Hermitian random matrices with independent, not necessarily identically distributed entries above the diagonal, we show that at any cusp singularity of the limiting eigenvalue distribution the local eigenvalue statistics are universal and form a Pearcey process. Since the density of states typically exhibits only square root or cubic root cusp singularities, our work complements previous results on the bulk and edge universality and it thus completes the resolution of the Wigner-Dyson-Mehta universality conjecture for the last remaining universality type in the complex Hermitian class. Our analysis holds not only for exact cusps, but approximate cusps as well, where an extended Pearcey process emerges. As a main technical ingredient we prove an optimal local law at the cusp for both symmetry classes. This result is also used in the companion paper [arXiv:1811.04055] where the cusp universality for real symmetric Wigner-type matrices is proven.
△ Less
Submitted 26 October, 2024; v1 submitted 11 September, 2018;
originally announced September 2018.
-
Correlated Random Matrices: Band Rigidity and Edge Universality
Authors:
Johannes Alt,
László Erdős,
Torben Krüger,
Dominik Schröder
Abstract:
We prove edge universality for a general class of correlated real symmetric or complex Hermitian Wigner matrices with arbitrary expectation. Our theorem also applies to internal edges of the self-consistent density of states. In particular, we establish a strong form of band rigidity which excludes mismatches between location and label of eigenvalues close to internal edges in these general models…
▽ More
We prove edge universality for a general class of correlated real symmetric or complex Hermitian Wigner matrices with arbitrary expectation. Our theorem also applies to internal edges of the self-consistent density of states. In particular, we establish a strong form of band rigidity which excludes mismatches between location and label of eigenvalues close to internal edges in these general models.
△ Less
Submitted 11 December, 2018; v1 submitted 20 April, 2018;
originally announced April 2018.
-
Random Matrices with Slow Correlation Decay
Authors:
László Erdős,
Torben Krüger,
Dominik Schröder
Abstract:
We consider large random matrices with a general slowly decaying correlation among its entries. We prove universality of the local eigenvalue statistics and optimal local laws for the resolvent away from the spectral edges, generalizing the recent result of [arXiv:1604.08188] to allow slow correlation decay and arbitrary expectation. The main novel tool is a systematic diagrammatic control of a mu…
▽ More
We consider large random matrices with a general slowly decaying correlation among its entries. We prove universality of the local eigenvalue statistics and optimal local laws for the resolvent away from the spectral edges, generalizing the recent result of [arXiv:1604.08188] to allow slow correlation decay and arbitrary expectation. The main novel tool is a systematic diagrammatic control of a multivariate cumulant expansion.
△ Less
Submitted 29 May, 2020; v1 submitted 30 May, 2017;
originally announced May 2017.
-
Entanglement isn't just for spin
Authors:
Daniel V. Schroeder
Abstract:
Quantum entanglement occurs not just in discrete systems such as spins, but also in the spatial wave functions of systems with more than one degree of freedom. It is easy to introduce students to entangled wave functions at an early stage, in any course that discusses wave functions. Doing so not only prepares students to learn about Bell's theorem and quantum information science, but can also pro…
▽ More
Quantum entanglement occurs not just in discrete systems such as spins, but also in the spatial wave functions of systems with more than one degree of freedom. It is easy to introduce students to entangled wave functions at an early stage, in any course that discusses wave functions. Doing so not only prepares students to learn about Bell's theorem and quantum information science, but can also provide a deeper understanding of the principles of quantum mechanics and help fight against some common misconceptions. Here I introduce several pictorial examples of entangled wave functions that depend on just two spatial variables. I also show how such wave functions can arise dynamically, and describe how to quantify their entanglement.
△ Less
Submitted 30 March, 2017;
originally announced March 2017.
-
The variational-relaxation algorithm for finding quantum bound states
Authors:
Daniel V. Schroeder
Abstract:
I describe a simple algorithm for numerically finding the ground state and low-lying excited states of a quantum system. The algorithm is an adaptation of the relaxation method for solving Poisson's equation, and is fundamentally based on the variational principle. It is especially useful for two-dimensional systems with nonseparable potentials, for which simpler techniques are inapplicable yet th…
▽ More
I describe a simple algorithm for numerically finding the ground state and low-lying excited states of a quantum system. The algorithm is an adaptation of the relaxation method for solving Poisson's equation, and is fundamentally based on the variational principle. It is especially useful for two-dimensional systems with nonseparable potentials, for which simpler techniques are inapplicable yet the computation time is minimal.
△ Less
Submitted 19 July, 2017; v1 submitted 31 January, 2017;
originally announced January 2017.
-
An Enhanced Lumped Element Electrical Model of a Double Barrier Memristive Device
Authors:
Enver Solan,
Sven Dirkmann,
Mirko Hansen,
Dietmar Schroeder,
Hermann Kohlstedt,
Martin Ziegler,
Thomas Mussenbrock,
Karlheinz Ochs
Abstract:
The massive parallel approach of neuromorphic circuits leads to effective methods for solving complex problems. It has turned out that resistive switching devices with a continuous resistance range are potential candidates for such applications. These devices are memristive systems - nonlinear resistors with memory. They are fabricated in nanotechnology and hence parameter spread during fabricatio…
▽ More
The massive parallel approach of neuromorphic circuits leads to effective methods for solving complex problems. It has turned out that resistive switching devices with a continuous resistance range are potential candidates for such applications. These devices are memristive systems - nonlinear resistors with memory. They are fabricated in nanotechnology and hence parameter spread during fabrication may aggravate reproducible analyses. This issue makes simulation models of memristive devices worthwhile.
Kinetic Monte-Carlo simulations based on a distributed model of the device can be used to understand the underlying physical and chemical phenomena. However, such simulations are very time-consuming and neither convenient for investigations of whole circuits nor for real-time applications, e.g. emulation purposes. Instead, a concentrated model of the device can be used for both fast simulations and real-time applications, respectively. We introduce an enhanced electrical model of a valence change mechanism (VCM) based double barrier memristive device (DBMD) with a continuous resistance range. This device consists of an ultra-thin memristive layer sandwiched between a tunnel barrier and a Schottky-contact. The introduced model leads to very fast simulations by using usual circuit simulation tools while maintaining physically meaningful parameters.
Kinetic Monte-Carlo simulations based on a distributed model and experimental data have been utilized as references to verify the concentrated model.
△ Less
Submitted 19 January, 2017;
originally announced January 2017.
-
Fluctuations of Functions of Wigner Matrices
Authors:
László Erdős,
Dominik Schröder
Abstract:
We show that matrix elements of functions of $N\times N$ Wigner matrices fluctuate on a scale of order $N^{-1/2}$ and we identify the limiting fluctuation. Our result holds for any function $f$ of the matrix that has bounded variation and thus considerably relaxes the regularity requirement imposed in [7,11].
We show that matrix elements of functions of $N\times N$ Wigner matrices fluctuate on a scale of order $N^{-1/2}$ and we identify the limiting fluctuation. Our result holds for any function $f$ of the matrix that has bounded variation and thus considerably relaxes the regularity requirement imposed in [7,11].
△ Less
Submitted 11 August, 2021; v1 submitted 22 October, 2016;
originally announced October 2016.