Showing 1–21 of 21 results for author: Bertozzi, A L

Searching in archive stat.
  1. arXiv:2412.08016  [pdf, other]

    cs.LG stat.ML

    GLL: A Differentiable Graph Learning Layer for Neural Networks

    Authors: Jason Brown, Bohan Chen, Harris Hardiman-Mostow, Jeff Calder, Andrea L. Bertozzi

    Abstract: Standard deep learning architectures used for classification generate label predictions with a projection head and softmax activation function. Although successful, these methods fail to leverage the relational information between samples in the batch for generating label predictions. In recent works, graph-based learning techniques, namely Laplace learning, have been heuristically combined with n…

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 44 pages, 11 figures. Preprint. Submitted to the Journal of Machine Learning Research

    MSC Class: 68T05; 68T07; 35R02 ACM Class: I.2.6; I.2.10; I.4.0

  2. arXiv:2406.13781  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV stat.ML

    A Primal-Dual Framework for Transformers and Neural Networks

    Authors: Tan M. Nguyen, Tam Nguyen, Nhat Ho, Andrea L. Bertozzi, Richard G. Baraniuk, Stanley J. Osher

    Abstract: Self-attention is key to the remarkable success of transformers in sequence modeling tasks including many applications in natural language processing and computer vision. Like neural network layers, these attention mechanisms are often developed by heuristics and experience. To provide a principled framework for constructing attention layers in transformers, we show that the self-attention corresp…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to ICLR 2023, 26 pages, 4 figures, 14 tables

  3. arXiv:2311.11163  [pdf, other]

    cs.SI stat.AP stat.CO

    Hate speech and hate crimes: a data-driven study of evolving discourse around marginalized groups

    Authors: Malvina Bozhidarova, Jonathn Chang, Aaishah Ale-rasool, Yuxiang Liu, Chongyao Ma, Andrea L. Bertozzi, P. Jeffrey Brantingham, Junyuan Lin, Sanjukta Krishnagopal

    Abstract: This study explores the dynamic relationship between online discourse, as observed in tweets, and physical hate crimes, focusing on marginalized groups. Leveraging natural language processing techniques, including keyword extraction and topic modeling, we analyze the evolution of online discourse after events affecting these groups. Examining sentiment and polarizing tweets, we establish correlati…

    Submitted 18 November, 2023; originally announced November 2023.

  4. Model-Change Active Learning in Graph-Based Semi-Supervised Learning

    Authors: Kevin Miller, Andrea L. Bertozzi

    Abstract: Active learning in semi-supervised classification involves introducing additional labels for unlabelled data to improve the accuracy of the underlying classifier. A challenge is to identify which points to label to best improve performance while limiting the number of new labels. "Model Change" active learning quantifies the resulting change incurred in the classifier by introducing the additional…

    Submitted 14 November, 2024; v1 submitted 14 October, 2021; originally announced October 2021.

    Journal ref: Commun. Appl. Math. Comput. 6, 1270-1298 (2024)

  5. Posterior Consistency of Semi-Supervised Regression on Graphs

    Authors: Andrea L. Bertozzi, Bamdad Hosseini, Hao Li, Kevin Miller, Andrew M. Stuart

    Abstract: Graph-based semi-supervised regression (SSR) is the problem of estimating the value of a function on a weighted graph from its values (labels) on a small subset of the vertices. This paper is concerned with the consistency of SSR in the context of classification, in the setting where the labels have small noise and the underlying graph weighting is consistent with well-clustered nodes. We present…

    Submitted 24 March, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

  6. arXiv:2007.11126  [pdf, other]

    stat.ML cs.LG

    Efficient Graph-Based Active Learning with Probit Likelihood via Gaussian Approximations

    Authors: Kevin Miller, Hao Li, Andrea L. Bertozzi

    Abstract: We present a novel adaptation of active learning to graph-based semi-supervised learning (SSL) under non-Gaussian Bayesian models. We present an approximation of non-Gaussian distributions to adapt previously Gaussian-based acquisition functions to these more general cases. We develop an efficient rank-one update for applying "look-ahead" based methods as well as model retraining. We also introduc…

    Submitted 21 July, 2020; originally announced July 2020.

    Comments: Accepted in ICML Workshop on Real World Experiment Design and Active Learning 2020

  7. arXiv:2006.06919  [pdf, other]

    cs.LG math.DS stat.ML

    MomentumRNN: Integrating Momentum into Recurrent Neural Networks

    Authors: Tan M. Nguyen, Richard G. Baraniuk, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang

    Abstract: Designing deep neural networks is an art that often involves an expensive search over candidate architectures. To overcome this for recurrent neural nets (RNNs), we establish a connection between the hidden state dynamics in an RNN and gradient descent (GD). We then integrate momentum into this framework and propose a new family of RNNs, called MomentumRNNs. We theoretically prove and numeri…

    Submitted 11 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 21 pages, 11 figures, Accepted for publication at Advances in Neural Information Processing Systems (NeurIPS) 2020

    MSC Class: 68T07 ACM Class: I.2

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 2020

  8. arXiv:2003.00631  [pdf, other]

    cs.LG cs.AI stat.ML

    Sparsity Meets Robustness: Channel Pruning for the Feynman-Kac Formalism Principled Robust Deep Neural Nets

    Authors: Thu Dinh, Bao Wang, Andrea L. Bertozzi, Stanley J. Osher

    Abstract: Deep neural nets (DNNs) compression is crucial for adaptation to mobile devices. Though many successful algorithms exist to compress naturally trained DNNs, developing efficient and stable compression algorithms for robustly trained DNNs remains widely open. In this paper, we focus on a co-design of efficient DNN compression algorithms and sparse neural architectures for robust and accurate deep l…

    Submitted 1 March, 2020; originally announced March 2020.

    Comments: 16 pages, 7 figures

    MSC Class: 68T01

  9. arXiv:2002.10583  [pdf, other]

    cs.LG cs.NE stat.ML

    Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent

    Authors: Bao Wang, Tan M. Nguyen, Andrea L. Bertozzi, Richard G. Baraniuk, Stanley J. Osher

    Abstract: Stochastic gradient descent (SGD) with constant momentum and its variants such as Adam are the optimization algorithms of choice for training deep neural networks (DNNs). Since DNN training is incredibly computationally expensive, there is great interest in speeding up the convergence. Nesterov accelerated gradient (NAG) improves the convergence rate of gradient descent (GD) for convex optimizatio…

    Submitted 26 April, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: 35 pages, 16 figures, 18 tables

  10. arXiv:1904.09062  [pdf, other]

    eess.IV stat.ML

    Semi-Supervised First-Person Activity Recognition in Body-Worn Video

    Authors: Honglin Chen, Hao Li, Alexander Song, Matt Haberland, Osman Akar, Adam Dhillon, Tiankuang Zhou, Andrea L. Bertozzi, P. Jeffrey Brantingham

    Abstract: Body-worn cameras are now commonly used for logging daily life, sports, and law enforcement activities, creating a large volume of archived footage. This paper studies the problem of classifying frames of footage according to the activity of the camera-wearer with an emphasis on application to real-world police body-worn video. Real-world datasets pose a different set of challenges from existing e…

    Submitted 18 April, 2019; originally announced April 2019.

  11. arXiv:1902.05113  [pdf, other]

    cs.LG math.OC stat.ML

    A Study on Graph-Structured Recurrent Neural Networks and Sparsification with Application to Epidemic Forecasting

    Authors: Zhijian Li, Xiyang Luo, Bao Wang, Andrea L. Bertozzi, Jack Xin

    Abstract: We study epidemic forecasting on real-world health data by a graph-structured recurrent neural network (GSRNN). We achieve state-of-the-art forecasting accuracy on the benchmark CDC dataset. To improve model efficiency, we sparsify the network weights via transformed-$\ell_1$ penalty and maintain prediction accuracy at the same level with 70% of the network weights being zero.

    Submitted 13 February, 2019; originally announced February 2019.

  12. arXiv:1811.06321  [pdf, other]

    cs.SI eess.SP nlin.AO physics.soc-ph stat.ML

    Multivariate Spatiotemporal Hawkes Processes and Network Reconstruction

    Authors: Baichuan Yuan, Hao Li, Andrea L. Bertozzi, P. Jeffrey Brantingham, Mason A. Porter

    Abstract: There is often latent network structure in spatial and temporal data and the tools of network analysis can yield fascinating insights into such data. In this paper, we develop a nonparametric method for network reconstruction from spatiotemporal data sets using multivariate Hawkes processes. In contrast to prior work on network reconstruction with point-process models, which has often focused on e…

    Submitted 15 November, 2018; originally announced November 2018.

  13. arXiv:1809.08516  [pdf, other]

    cs.LG math.NA stat.ML

    Adversarial Defense via Data Dependent Activation Function and Total Variation Minimization

    Authors: Bao Wang, Alex T. Lin, Wei Zhu, Penghang Yin, Andrea L. Bertozzi, Stanley J. Osher

    Abstract: We improve the robustness of Deep Neural Net (DNN) to adversarial attacks by using an interpolating function as the output activation. This data-dependent activation remarkably improves both the generalization and robustness of DNN. In the CIFAR10 benchmark, we raise the robust accuracy of the adversarially trained ResNet20 from $\sim 46\%$ to $\sim 69\%$ under the state-of-the-art Iterative Fast…

    Submitted 29 April, 2020; v1 submitted 22 September, 2018; originally announced September 2018.

    Comments: 17 pages, 6 figures

    MSC Class: 68Pxx

    Journal ref: Inverse Problems and Imaging, 2020

  14. arXiv:1806.02485  [pdf, other]

    cs.SI cond-mat.stat-mech math.ST nlin.AO stat.ML

    Stochastic Block Models are a Discrete Surface Tension

    Authors: Zachary M. Boyd, Mason A. Porter, Andrea L. Bertozzi

    Abstract: Networks, which represent agents and interactions between them, arise in myriad applications throughout the sciences, engineering, and even the humanities. To understand large-scale structure in a network, a common task is to cluster a network's nodes into sets called "communities", such that there are dense connections within communities but sparse connections between them. A popular and statisti…

    Submitted 24 March, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: to appear in Journal of Nonlinear Science

    MSC Class: 65K10; 49M20; 35Q56; 62H30; 91C20; 91D30; 94C15

  15. arXiv:1804.00684  [pdf, other]

    cs.LG math.NA stat.ML

    Graph-Based Deep Modeling and Real Time Forecasting of Sparse Spatio-Temporal Data

    Authors: Bao Wang, Xiyang Luo, Fangbo Zhang, Baichuan Yuan, Andrea L. Bertozzi, P. Jeffrey Brantingham

    Abstract: We present a generic framework for spatio-temporal (ST) data modeling, analysis, and forecasting, with a special focus on data that is sparse in both space and time. Our multi-scaled framework is a seamless coupling of two major components: a self-exciting point process that models the macroscale statistical behaviors of the ST data and a graph structured recurrent neural network (GSRNN) to discov…

    Submitted 2 April, 2018; originally announced April 2018.

    Comments: 9 pages, 19 figures

    MSC Class: 65-06

  16. arXiv:1711.08833  [pdf, other]

    cs.LG math.NA stat.ML

    Deep Learning for Real-Time Crime Forecasting and its Ternarization

    Authors: Bao Wang, Penghang Yin, Andrea L. Bertozzi, P. Jeffrey Brantingham, Stanley J. Osher, Jack Xin

    Abstract: Real-time crime forecasting is important. However, accurate prediction of when and where the next crime will happen is difficult. No known physical model provides a reasonable approximation to such a complex system. Historical crime data are sparse in both space and time and the signal of interests is weak. In this work, we first present a proper representation of crime data. We then adapt the spa…

    Submitted 23 November, 2017; originally announced November 2017.

    Comments: 14 pages, 7 figures

    MSC Class: 62-07

  17. arXiv:1707.09285  [pdf, other]

    math.OC stat.ML

    Simplified Energy Landscape for Modularity Using Total Variation

    Authors: Zachary Boyd, Egil Bae, Xue-Cheng Tai, Andrea L. Bertozzi

    Abstract: Networks capture pairwise interactions between entities and are frequently used in applications such as social networks, food networks, and protein interaction networks, to name a few. Communities, cohesive groups of nodes, often form in these applications, and identifying them gives insight into the overall organization of the network. One common quality function used to identify community struct…

    Submitted 2 April, 2018; v1 submitted 28 July, 2017; originally announced July 2017.

    Comments: 25 pages, 3 figures, 3 tables, submitted to SIAM J. App. Math

  18. arXiv:1707.03340  [pdf, other]

    math.NA cs.LG stat.ML

    Deep Learning for Real Time Crime Forecasting

    Authors: Bao Wang, Duo Zhang, Duanhao Zhang, P. Jeffrey Brantingham, Andrea L. Bertozzi

    Abstract: Accurate real time crime prediction is a fundamental issue for public safety, but remains a challenging problem for the scientific community. Crime occurrences depend on many complex factors. Compared to many predictable events, crime is sparse. At different spatio-temporal scales, crime distributions display dramatically different patterns. These distributions are of very low regularity in both s…

    Submitted 9 July, 2017; originally announced July 2017.

    Comments: 4 pages, 6 figures, NOLTA, 2017

    MSC Class: 68T05

  19. arXiv:1703.08816  [pdf, other]

    cs.LG stat.ML

    Uncertainty quantification in graph-based classification of high dimensional data

    Authors: Andrea L. Bertozzi, Xiyang Luo, Andrew M. Stuart, Konstantinos C. Zygalakis

    Abstract: Classification of high dimensional data finds wide-ranging applications. In many of these applications equipping the resulting classification with a measure of uncertainty may be as important as the classification itself. In this paper we introduce, develop algorithms for, and investigate the properties of, a variety of Bayesian models for the task of binary classification; via the posterior distr…

    Submitted 8 February, 2018; v1 submitted 26 March, 2017; originally announced March 2017.

    Comments: 33 pages, 14 figures

  20. arXiv:1302.3913  [pdf, other]

    stat.ML

    Multiclass Data Segmentation using Diffuse Interface Methods on Graphs

    Authors: Cristina Garcia-Cardona, Ekaterina Merkurjev, Andrea L. Bertozzi, Arjuna Flenner, Allon Percus

    Abstract: We present two graph-based algorithms for multiclass segmentation of high-dimensional data. The algorithms use a diffuse interface model based on the Ginzburg-Landau functional, related to total variation compressed sensing and image processing. A multiclass extension is introduced using the Gibbs simplex, with the functional's double-well potential modified to handle the multiclass case. The firs…

    Submitted 17 January, 2014; v1 submitted 15 February, 2013; originally announced February 2013.

    Comments: 14 pages

    MSC Class: 62-XX

  21. arXiv:1206.4969  [pdf, other]

    stat.AP cs.SI physics.soc-ph

    Community detection using spectral clustering on sparse geosocial data

    Authors: Yves van Gennip, Blake Hunter, Raymond Ahn, Peter Elliott, Kyle Luh, Megan Halvorson, Shannon Reid, Matt Valasik, James Wo, George E. Tita, Andrea L. Bertozzi, P. Jeffrey Brantingham

    Abstract: In this article we identify social communities among gang members in the Hollenbeck policing district in Los Angeles, based on sparse observations of a combination of social interactions and geographic locations of the individuals. This information, coming from LAPD Field Interview cards, is used to construct a similarity graph for the individuals. We use spectral clustering to identify clusters i…

    Submitted 8 November, 2012; v1 submitted 21 June, 2012; originally announced June 2012.

    Comments: 22 pages, 6 figures (with subfigures)

    MSC Class: 62H30; 91C20; 91D30; 94C15