Skip to main content

Showing 1–24 of 24 results for author: Maggioni, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.09686  [pdf, ps, other

    stat.ML cs.LG

    Conditional regression for the Nonlinear Single-Variable Model

    Authors: Yantao Wu, Mauro Maggioni

    Abstract: Regressing a function $F$ on $\mathbb{R}^d$ without the statistical and computational curse of dimensionality requires special statistical models, for example that impose geometric assumptions on the distribution of the data (e.g., that its support is low-dimensional), or strong smoothness assumptions on $F$, or a special structure $F$. Among the latter, compositional models $F=f\circ g$ with $g$… ▽ More

    Submitted 11 July, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: 57 pages, 10 figures

    MSC Class: 62G08

  2. arXiv:2410.04655  [pdf, other

    cs.LG cs.AI math.SP stat.ME stat.ML

    Graph Fourier Neural Kernels (G-FuNK): Learning Solutions of Nonlinear Diffusive Parametric PDEs on Multiple Domains

    Authors: Shane E. Loeffler, Zan Ahmad, Syed Yusuf Ali, Carolyna Yamamoto, Dan M. Popescu, Alana Yee, Yash Lal, Natalia Trayanova, Mauro Maggioni

    Abstract: Predicting time-dependent dynamics of complex systems governed by non-linear partial differential equations (PDEs) with varying parameters and domains is a challenging task motivated by applications across various fields. We introduce a novel family of neural operators based on our Graph Fourier Neural Kernels, designed to learn solution generators for nonlinear PDEs in which the highest-order ter… ▽ More

    Submitted 9 October, 2024; v1 submitted 6 October, 2024; originally announced October 2024.

  3. arXiv:2408.16138  [pdf, ps, other

    cs.LG math.DG stat.ML

    Thinner Latent Spaces: Detecting Dimension and Imposing Invariance with Conformal Autoencoders

    Authors: George A. Kevrekidis, Zan Ahmad, Mauro Maggioni, Soledad Villar, Yannis G. Kevrekidis

    Abstract: Conformal Autoencoders are a neural network architecture that imposes orthogonality conditions between the gradients of latent variables to obtain disentangled representations of data. In this work we show that orthogonality relations within the latent layer of the network can be leveraged to infer the intrinsic dimensionality of nonlinear manifold data sets (locally characterized by the dimension… ▽ More

    Submitted 10 July, 2025; v1 submitted 28 August, 2024; originally announced August 2024.

  4. arXiv:2402.08412  [pdf, other

    stat.ML cs.LG math.DS math.ST

    Interacting Particle Systems on Networks: joint inference of the network and the interaction kernel

    Authors: Quanjun Lang, Xiong Wang, Fei Lu, Mauro Maggioni

    Abstract: Modeling multi-agent systems on networks is a fundamental challenge in a wide variety of disciplines. We jointly infer the weight matrix of the network and the interaction kernel, which determine respectively which agents interact with which others and the rules of such interactions from data consisting of multiple trajectories. The estimator we propose leads naturally to a non-convex optimization… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 53 pages, 17 figures

    MSC Class: 62F12; 82C22

  5. arXiv:2212.00746  [pdf, other

    cs.IT cs.LG math.OC stat.ML

    Learning Transition Operators From Sparse Space-Time Samples

    Authors: Christian Kümmerle, Mauro Maggioni, Sui Tang

    Abstract: We consider the nonlinear inverse problem of learning a transition operator $\mathbf{A}$ from partial observations at different times, in particular from sparse observations of entries of its powers $\mathbf{A},\mathbf{A}^2,\cdots,\mathbf{A}^{T}$. This Spatio-Temporal Transition Operator Recovery problem is motivated by the recent interest in learning time-varying graph signals that are driven by… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 34 pages, 12 figures

  6. arXiv:2207.05242  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Unsupervised learning of observation functions in state-space models by nonparametric moment methods

    Authors: Qingci An, Yannis Kevrekidis, Fei Lu, Mauro Maggioni

    Abstract: We investigate the unsupervised learning of non-invertible observation functions in nonlinear state-space models. Assuming abundant data of the observation process along with the distribution of the state process, we introduce a nonparametric generalized moment method to estimate the observation function via constrained regression. The major challenge comes from the non-invertibility of the observ… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    MSC Class: 62G05; 68Q32; 62M15

  7. arXiv:2104.02120  [pdf, other

    stat.ML cs.LG math.DS

    Nonlinear model reduction for slow-fast stochastic systems near unknown invariant manifolds

    Authors: Felix X. -F. Ye, Sichen Yang, Mauro Maggioni

    Abstract: We introduce a nonlinear stochastic model reduction technique for high-dimensional stochastic dynamical systems that have a low-dimensional invariant effective manifold with slow dynamics, and high-dimensional, large fast modes. Given only access to a black box simulator from which short bursts of simulation can be obtained, we design an algorithm that outputs an estimate of the invariant manifold… ▽ More

    Submitted 24 October, 2023; v1 submitted 5 April, 2021; originally announced April 2021.

  8. arXiv:2101.05119  [pdf, ps, other

    stat.ML cs.LG math.ST

    Multiscale regression on unknown manifolds

    Authors: Wenjing Liao, Mauro Maggioni, Stefano Vigogna

    Abstract: We consider the regression problem of estimating functions on $\mathbb{R}^D$ but supported on a $d$-dimensional manifold $ \mathcal{M} \subset \mathbb{R}^D $ with $ d \ll D $. Drawing ideas from multi-resolution analysis and nonlinear approximation, we construct low-dimensional coordinates on $\mathcal{M}$ at multiple scales, and perform multiscale regression by local polynomial fitting. We propos… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  9. arXiv:2010.03729  [pdf, other

    stat.ML cs.LG math.DS math.ST

    Learning Theory for Inferring Interaction Kernels in Second-Order Interacting Agent Systems

    Authors: Jason Miller, Sui Tang, Ming Zhong, Mauro Maggioni

    Abstract: Modeling the complex interactions of systems of particles or agents is a fundamental scientific and mathematical problem that is studied in diverse fields, ranging from physics and biology, to economics and machine learning. In this work, we describe a very general second-order, heterogeneous, multivariable, interacting agent model, with an environment, that encompasses a wide variety of known sys… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 68 pages

    MSC Class: 62Gxx; 37Nxx; 68Txx

  10. arXiv:2007.15174  [pdf, other

    math.ST stat.ML

    Learning interaction kernels in stochastic systems of interacting particles from multiple trajectories

    Authors: Fei Lu, Mauro Maggioni, Sui Tang

    Abstract: We consider stochastic systems of interacting particles or agents, with dynamics determined by an interaction kernel which only depends on pairwise distances. We study the problem of inferring this interaction kernel from observations of the positions of the particles, in either continuous or discrete time, along multiple independent trajectories. We introduce a nonparametric inference approach to… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: 38 pages; 9 figures

    MSC Class: 70F17; 62G05; 62M05

  11. arXiv:1912.11123  [pdf, other

    cs.LG math.DS nlin.AO stat.ML

    Data-driven Discovery of Emergent Behaviors in Collective Dynamics

    Authors: Mauro Maggioni, Jason Miller, Ming Zhong

    Abstract: Particle- and agent-based systems are a ubiquitous modeling tool in many disciplines. We consider the fundamental problem of inferring interaction kernels from observations of agent-based dynamical systems given observations of trajectories, in particular for collective dynamical systems exhibiting emergent behaviors with complicated interaction kernels, in a nonparametric fashion, and for kernels… ▽ More

    Submitted 30 March, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

  12. arXiv:1910.04832  [pdf, other

    stat.ML cs.LG math.ST

    Learning interaction kernels in heterogeneous systems of agents from multiple trajectories

    Authors: Fei Lu, Mauro Maggioni, Sui Tang

    Abstract: Systems of interacting particles or agents have wide applications in many disciplines such as Physics, Chemistry, Biology and Economics. These systems are governed by interaction laws, which are often unknown: estimating them from observation data is a fundamental task that can provide meaningful insights and accurate predictions of the behaviour of the agents. In this paper, we consider the inver… ▽ More

    Submitted 14 July, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: 63 pages, revised various places

    MSC Class: 62GXX

  13. arXiv:1905.12989  [pdf, other

    cs.LG math.ST stat.ML

    Learning by Active Nonlinear Diffusion

    Authors: Mauro Maggioni, James M. Murphy

    Abstract: This article proposes an active learning method for high dimensional data, based on intrinsic data geometries learned through diffusion processes on graphs. Diffusion distances are used to parametrize low-dimensional structures on the dataset, which allow for high-accuracy labelings of the dataset with only a small number of carefully chosen labels. The geometric structure of the data suggests reg… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 20 pages, 10 figures

  14. arXiv:1902.05402  [pdf, other

    cs.CV cs.LG stat.ML

    Spectral-Spatial Diffusion Geometry for Hyperspectral Image Clustering

    Authors: James M. Murphy, Mauro Maggioni

    Abstract: An unsupervised learning algorithm to cluster hyperspectral image (HSI) data is proposed that exploits spatially-regularized random walks. Markov diffusions are defined on the space of HSI spectra with transitions constrained to near spatial neighbors. The explicit incorporation of spatial regularity into the diffusion construction leads to smoother random processes that are more adapted for unsup… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

  15. Nonparametric inference of interaction laws in systems of agents from trajectory data

    Authors: Fei Lu, Mauro Maggioni, Sui Tang, Ming Zhong

    Abstract: Inferring the laws of interaction between particles and agents in complex dynamical systems from observational data is a fundamental challenge in a wide variety of disciplines. We propose a non-parametric statistical learning approach to estimate the governing laws of distance-based interactions, with no reference or assumption about their analytical form, from data consisting trajectories of inte… ▽ More

    Submitted 23 March, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

  16. arXiv:1810.06702  [pdf, other

    stat.ML cs.LG

    Learning by Unsupervised Nonlinear Diffusion

    Authors: Mauro Maggioni, James M. Murphy

    Abstract: This paper proposes and analyzes a novel clustering algorithm that combines graph-based diffusion geometry with techniques based on density and mode estimation. The proposed method is suitable for data generated from mixtures of distributions with densities that are both multimodal and have nonlinear shapes. A crucial aspect of this algorithm is the use of time of a data-adapted diffusion process… ▽ More

    Submitted 29 December, 2018; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: 40 Pages, 17 Figures

  17. arXiv:1712.06206  [pdf, other

    stat.ML

    Path-Based Spectral Clustering: Guarantees, Robustness to Outliers, and Fast Algorithms

    Authors: Anna Little, Mauro Maggioni, James M. Murphy

    Abstract: We consider the problem of clustering with the longest-leg path distance (LLPD) metric, which is informative for elongated and irregularly shaped clusters. We prove finite-sample guarantees on the performance of clustering with respect to this metric when random samples are drawn from multiple intrinsically low-dimensional clusters in high-dimensional space, in the presence of a large number of hi… ▽ More

    Submitted 6 March, 2019; v1 submitted 17 December, 2017; originally announced December 2017.

    Comments: 59 pages, 12 figures

  18. arXiv:1709.01233  [pdf, other

    stat.ML

    Supervised Dimensionality Reduction for Big Data

    Authors: Joshua T. Vogelstein, Eric Bridgeford, Minh Tang, Da Zheng, Christopher Douville, Randal Burns, Mauro Maggioni

    Abstract: To solve key biomedical problems, experimentalists now routinely measure millions or billions of features (dimensions) per sample, with the hope that data science techniques will be able to build accurate data-driven inferences. Because sample sizes are typically orders of magnitude smaller than the dimensionality of these data, valid inferences require finding a low-dimensional representation tha… ▽ More

    Submitted 23 January, 2021; v1 submitted 5 September, 2017; originally announced September 2017.

    Comments: 6 figures

  19. arXiv:1611.01179  [pdf, other

    stat.ML cs.IT math.ST

    Adaptive Geometric Multiscale Approximations for Intrinsically Low-dimensional Data

    Authors: Wenjing Liao, Mauro Maggioni

    Abstract: We consider the problem of efficiently approximating and encoding high-dimensional data sampled from a probability distribution $ρ$ in $\mathbb{R}^D$, that is nearly supported on a $d$-dimensional set $\mathcal{M}$ - for example supported on a $d$-dimensional Riemannian manifold. Geometric Multi-Resolution Analysis (GMRA) provides a robust and computationally efficient procedure to construct low-d… ▽ More

    Submitted 18 July, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

  20. Discovering and Deciphering Relationships Across Disparate Data Modalities

    Authors: Joshua T. Vogelstein, Eric Bridgeford, Qing Wang, Carey E. Priebe, Mauro Maggioni, Cencheng Shen

    Abstract: Understanding the relationships between different properties of data, such as whether a connectome or genome has information about disease status, is becoming increasingly important in modern biological datasets. While existing approaches can test whether two properties are related, they often require unfeasibly large sample sizes in real data scenarios, and do not provide any insight into how or… ▽ More

    Submitted 6 December, 2018; v1 submitted 16 September, 2016; originally announced September 2016.

    Journal ref: eLife 8, e41690, 2019

  21. arXiv:1509.07497  [pdf, other

    stat.ML

    High Dimensional Data Modeling Techniques for Detection of Chemical Plumes and Anomalies in Hyperspectral Images and Movies

    Authors: Yi, Wang, Guangliang Chen, Mauro Maggioni

    Abstract: We briefly review recent progress in techniques for modeling and analyzing hyperspectral images and movies, in particular for detecting plumes of both known and unknown chemicals. For detecting chemicals of known spectrum, we extend the technique of using a single subspace for modeling the background to a "mixture of subspaces" model to tackle more complicated background. Furthermore, we use parti… ▽ More

    Submitted 29 January, 2016; v1 submitted 24 September, 2015; originally announced September 2015.

  22. arXiv:1506.03410  [pdf, other

    stat.ML cs.LG

    Sparse Projection Oblique Randomer Forests

    Authors: Tyler M. Tomita, James Browne, Cencheng Shen, Jaewon Chung, Jesse L. Patsolic, Benjamin Falk, Jason Yim, Carey E. Priebe, Randal Burns, Mauro Maggioni, Joshua T. Vogelstein

    Abstract: Decision forests, including Random Forests and Gradient Boosting Trees, have recently demonstrated state-of-the-art performance in a variety of machine learning settings. Decision forests are typically ensembles of axis-aligned decision trees; that is, trees that split only along feature dimensions. In contrast, many recent extensions to decision forests are based on axis-oblique splits. Unfortuna… ▽ More

    Submitted 3 October, 2019; v1 submitted 10 June, 2015; originally announced June 2015.

    Comments: 31 pages; submitted to Journal of Machine Learning Research for review

    MSC Class: 68T10 ACM Class: I.5.2

    Journal ref: Journal of Machine Learning Research 21(104), 1-39, 2020

  23. arXiv:1212.1143  [pdf, other

    cs.AI eess.SY math.OC stat.ML

    Multiscale Markov Decision Problems: Compression, Solution, and Transfer Learning

    Authors: Jake Bouvrie, Mauro Maggioni

    Abstract: Many problems in sequential decision making and stochastic control often have natural multiscale structure: sub-tasks are assembled together to accomplish complex goals. Systematically inferring and leveraging hierarchical structure, particularly beyond a single level of abstraction, has remained a longstanding challenge. We describe a fast multiscale procedure for repeatedly compressing, or homog… ▽ More

    Submitted 5 December, 2012; originally announced December 2012.

    Comments: 86 pages, 15 figures

  24. arXiv:1105.4924  [pdf, ps, other

    math.MG cs.DS stat.ML

    Multiscale Geometric Methods for Data Sets II: Geometric Multi-Resolution Analysis

    Authors: William K. Allard, Guangliang Chen, Mauro Maggioni

    Abstract: Data sets are often modeled as point clouds in $R^D$, for $D$ large. It is often assumed that the data has some interesting low-dimensional structure, for example that of a $d$-dimensional manifold $M$, with $d$ much smaller than $D$. When $M$ is simply a linear subspace, one may exploit this assumption for encoding efficiently the data by projecting onto a dictionary of $d$ vectors in $R^D$ (for… ▽ More

    Submitted 7 September, 2011; v1 submitted 24 May, 2011; originally announced May 2011.

    Comments: Re-formatted using AMS style