Skip to main content

Showing 1–34 of 34 results for author: Osher, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.16773  [pdf, ps, other

    stat.CO

    Splitting Regularized Wasserstein Proximal Algorithms for Nonsmooth Sampling Problems

    Authors: Fuqun Han, Stanley Osher, Wuchen Li

    Abstract: Sampling from nonsmooth target probability distributions is essential in various applications, including the Bayesian Lasso. We propose a splitting-based sampling algorithm for the time-implicit discretization of the probability flow for the Fokker-Planck equation, where the score function, defined as the gradient logarithm of the current probability density function, is approximated by the regula… ▽ More

    Submitted 10 July, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

  2. arXiv:2406.13781  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    A Primal-Dual Framework for Transformers and Neural Networks

    Authors: Tan M. Nguyen, Tam Nguyen, Nhat Ho, Andrea L. Bertozzi, Richard G. Baraniuk, Stanley J. Osher

    Abstract: Self-attention is key to the remarkable success of transformers in sequence modeling tasks including many applications in natural language processing and computer vision. Like neural network layers, these attention mechanisms are often developed by heuristics and experience. To provide a principled framework for constructing attention layers in transformers, we show that the self-attention corresp… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to ICLR 2023, 26 pages, 4 figures, 14 tables

  3. arXiv:2402.06162  [pdf, other

    stat.ML cs.LG

    Wasserstein proximal operators describe score-based generative models and resolve memorization

    Authors: Benjamin J. Zhang, Siting Liu, Wuchen Li, Markos A. Katsoulakis, Stanley J. Osher

    Abstract: We focus on the fundamental mathematical structure of score-based generative models (SGMs). We first formulate SGMs in terms of the Wasserstein proximal operator (WPO) and demonstrate that, via mean-field games (MFGs), the WPO formulation reveals mathematical structure that describes the inductive bias of diffusion and score-based models. In particular, MFGs yield optimality conditions in the form… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2308.14945  [pdf, other

    stat.ML cs.LG stat.CO

    Noise-Free Sampling Algorithms via Regularized Wasserstein Proximals

    Authors: Hong Ye Tan, Stanley Osher, Wuchen Li

    Abstract: We consider the problem of sampling from a distribution governed by a potential function. This work proposes an explicit score based MCMC method that is deterministic, resulting in a deterministic evolution for particles rather than a stochastic differential equation evolution. The score term is given in closed form by a regularized Wasserstein proximal, using a kernel convolution that is approxim… ▽ More

    Submitted 2 October, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    MSC Class: 65C05; 62G07

  5. arXiv:2308.05061  [pdf, other

    cs.LG math.NA stat.ML

    Fine-Tune Language Models as Multi-Modal Differential Equation Solvers

    Authors: Liu Yang, Siting Liu, Stanley J. Osher

    Abstract: In the growing domain of scientific machine learning, in-context operator learning has shown notable potential in building foundation models, as in this framework the model is trained to learn operators and solve differential equations using prompted data, during the inference stage without weight updates. However, the current model's overdependence on function data overlooks the invaluable human… ▽ More

    Submitted 1 February, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

  6. arXiv:2304.07993  [pdf, other

    cs.LG math.NA stat.ML

    In-Context Operator Learning with Data Prompts for Differential Equation Problems

    Authors: Liu Yang, Siting Liu, Tingwei Meng, Stanley J. Osher

    Abstract: This paper introduces a new neural-network-based approach, namely In-Context Operator Networks (ICON), to simultaneously learn operators from the prompted data and apply it to new questions during the inference stage, without any weight update. Existing methods are limited to using a neural network to approximate a specific equation solution or a specific operator, requiring retraining when switch… ▽ More

    Submitted 19 September, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: The second and third authors contributed equally. This is an outdated preprint. Please refer to the updated version published in PNAS: www.pnas.org/doi/10.1073/pnas.2310142120 See code in https://github.com/LiuYangMage/in-context-operator-networks

  7. arXiv:2301.00437  [pdf, other

    cs.LG stat.ML

    Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data

    Authors: Hien Dang, Tho Tran, Stanley Osher, Hung Tran-The, Nhat Ho, Tan Nguyen

    Abstract: Modern deep neural networks have achieved impressive performance on tasks from image classification to natural language processing. Surprisingly, these complex systems with massive amounts of parameters exhibit the same structural properties in their last-layer features and classifiers across canonical datasets when training until convergence. In particular, it has been observed that the last-laye… ▽ More

    Submitted 18 June, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

    Comments: 75 pages, 20 figures, 4 tables. Hien Dang and Tho Tran contributed equally to this work

  8. arXiv:2211.15779  [pdf, other

    cs.LG stat.ML

    Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature

    Authors: Khang Nguyen, Hieu Nong, Vinh Nguyen, Nhat Ho, Stanley Osher, Tan Nguyen

    Abstract: Graph Neural Networks (GNNs) had been demonstrated to be inherently susceptible to the problems of over-smoothing and over-squashing. These issues prohibit the ability of GNNs to model complex graph interactions by limiting their effectiveness in taking into account distant information. Our study reveals the key connection between the local graph geometry and the occurrence of both of these issues… ▽ More

    Submitted 31 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted at ICML 2023; 24 pages, 4 figures

  9. arXiv:2209.15092  [pdf, other

    cs.LG stat.ML

    Improving Generative Flow Networks with Path Regularization

    Authors: Anh Do, Duy Dinh, Tan Nguyen, Khuong Nguyen, Stanley Osher, Nhat Ho

    Abstract: Generative Flow Networks (GFlowNets) are recently proposed models for learning stochastic policies that generate compositional objects by sequences of actions with the probability proportional to a given reward function. The central problem of GFlowNets is to improve their exploration and generalization. In this work, we propose a novel path regularization method based on optimal transport theory… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 28 pages, 2 figures, 5 tables. Anh Do, Duy Dinh, and Tan Nguyen contributed equally to this work

  10. arXiv:2206.00206  [pdf, ps, other

    cs.LG stat.ML

    Transformer with Fourier Integral Attentions

    Authors: Tan Nguyen, Minh Pham, Tam Nguyen, Khai Nguyen, Stanley J. Osher, Nhat Ho

    Abstract: Multi-head attention empowers the recent success of transformers, the state-of-the-art models that have achieved remarkable success in sequence modeling and beyond. These attention mechanisms compute the pairwise dot products between the queries and keys, which results from the use of unnormalized Gaussian kernels with the assumption that the queries follow a mixture of Gaussian distribution. Ther… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: 35 pages, 5 tables. Tan Nguyen and Minh Pham contributed equally to this work

  11. arXiv:2110.08678  [pdf, other

    cs.LG cs.CL stat.ML

    Improving Transformers with Probabilistic Attention Keys

    Authors: Tam Nguyen, Tan M. Nguyen, Dung D. Le, Duy Khuong Nguyen, Viet-Anh Tran, Richard G. Baraniuk, Nhat Ho, Stanley J. Osher

    Abstract: Multi-head attention is a driving force behind state-of-the-art transformers, which achieve remarkable performance across a variety of natural language processing (NLP) and computer vision tasks. It has been observed that for many applications, those attention heads learn redundant embedding, and most of them can be removed without degrading the performance of the model. Inspired by this observati… ▽ More

    Submitted 12 June, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: 27 pages, 16 figures, 10 tables

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

  12. arXiv:2008.02200  [pdf, other

    cs.LG stat.ML

    Wasserstein-based Projections with Applications to Inverse Problems

    Authors: Howard Heaton, Samy Wu Fung, Alex Tong Lin, Stanley Osher, Wotao Yin

    Abstract: Inverse problems consist of recovering a signal from a collection of noisy measurements. These are typically cast as optimization problems, with classic approaches using a data fidelity term and an analytic regularizer that stabilizes recovery. Recent Plug-and-Play (PnP) works propose replacing the operator for analytic regularization in optimization methods by a data-driven denoiser. These scheme… ▽ More

    Submitted 14 April, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

    Comments: Revised version uploaded on April 14, 2021

  13. arXiv:2006.06919  [pdf, other

    cs.LG math.DS stat.ML

    MomentumRNN: Integrating Momentum into Recurrent Neural Networks

    Authors: Tan M. Nguyen, Richard G. Baraniuk, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang

    Abstract: Designing deep neural networks is an art that often involves an expensive search over candidate architectures. To overcome this for recurrent neural nets (RNNs), we establish a connection between the hidden state dynamics in an RNN and gradient descent (GD). We then integrate momentum into this framework and propose a new family of RNNs, called {\em MomentumRNNs}. We theoretically prove and numeri… ▽ More

    Submitted 11 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 21 pages, 11 figures, Accepted for publication at Advances in Neural Information Processing Systems (NeurIPS) 2020

    MSC Class: 68T07 ACM Class: I.2

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 2020

  14. arXiv:2005.00218  [pdf, other

    cs.LG stat.ML

    Differentially Private Federated Learning with Laplacian Smoothing

    Authors: Zhicong Liang, Bao Wang, Quanquan Gu, Stanley Osher, Yuan Yao

    Abstract: Federated learning aims to protect data privacy by collaboratively learning a model without sharing private data among users. However, an adversary may still be able to infer the private training data by attacking the released model. Differential privacy provides a statistical protection against such attacks at the price of significantly degrading the accuracy or utility of the trained models. In… ▽ More

    Submitted 10 September, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

  15. arXiv:2003.00631  [pdf, other

    cs.LG cs.AI stat.ML

    Sparsity Meets Robustness: Channel Pruning for the Feynman-Kac Formalism Principled Robust Deep Neural Nets

    Authors: Thu Dinh, Bao Wang, Andrea L. Bertozzi, Stanley J. Osher

    Abstract: Deep neural nets (DNNs) compression is crucial for adaptation to mobile devices. Though many successful algorithms exist to compress naturally trained DNNs, developing efficient and stable compression algorithms for robustly trained DNNs remains widely open. In this paper, we focus on a co-design of efficient DNN compression algorithms and sparse neural architectures for robust and accurate deep l… ▽ More

    Submitted 1 March, 2020; originally announced March 2020.

    Comments: 16 pages, 7 figures

    MSC Class: 68T01

  16. arXiv:2002.10583  [pdf, other

    cs.LG cs.NE stat.ML

    Scheduled Restart Momentum for Accelerated Stochastic Gradient Descent

    Authors: Bao Wang, Tan M. Nguyen, Andrea L. Bertozzi, Richard G. Baraniuk, Stanley J. Osher

    Abstract: Stochastic gradient descent (SGD) with constant momentum and its variants such as Adam are the optimization algorithms of choice for training deep neural networks (DNNs). Since DNN training is incredibly computationally expensive, there is great interest in speeding up the convergence. Nesterov accelerated gradient (NAG) improves the convergence rate of gradient descent (GD) for convex optimizatio… ▽ More

    Submitted 26 April, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: 35 pages, 16 figures, 18 tables

  17. arXiv:2002.10113  [pdf, other

    cs.LG cs.MA math.OC stat.ML

    Alternating the Population and Control Neural Networks to Solve High-Dimensional Stochastic Mean-Field Games

    Authors: Alex Tong Lin, Samy Wu Fung, Wuchen Li, Levon Nurbekyan, Stanley J. Osher

    Abstract: We present APAC-Net, an alternating population and agent control neural network for solving stochastic mean field games (MFGs). Our algorithm is geared toward high-dimensional instances of MFGs that are beyond reach with existing solution methods. We achieve this in two steps. First, we take advantage of the underlying variational primal-dual structure that MFGs exhibit and phrase it as a convex-c… ▽ More

    Submitted 14 July, 2023; v1 submitted 24 February, 2020; originally announced February 2020.

  18. arXiv:1912.01825  [pdf, other

    cs.LG math.NA math.OC stat.ML

    A Machine Learning Framework for Solving High-Dimensional Mean Field Game and Mean Field Control Problems

    Authors: Lars Ruthotto, Stanley Osher, Wuchen Li, Levon Nurbekyan, Samy Wu Fung

    Abstract: Mean field games (MFG) and mean field control (MFC) are critical classes of multi-agent models for efficient analysis of massive populations of interacting agents. Their areas of application span topics in economics, finance, game theory, industrial engineering, crowd motion, and more. In this paper, we provide a flexible machine learning framework for the numerical solution of potential MFG and M… ▽ More

    Submitted 14 February, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: 21 pages, 13 figures, 2 table

    MSC Class: 49N90; 49J20; 49N70

  19. arXiv:1911.00782  [pdf, other

    cs.LG math.NA stat.CO stat.ML

    Laplacian Smoothing Stochastic Gradient Markov Chain Monte Carlo

    Authors: Bao Wang, Difan Zou, Quanquan Gu, Stanley Osher

    Abstract: As an important Markov Chain Monte Carlo (MCMC) method, stochastic gradient Langevin dynamics (SGLD) algorithm has achieved great success in Bayesian learning and posterior sampling. However, SGLD typically suffers from slow convergence rate due to its large variance caused by the stochastic gradient. In order to alleviate these drawbacks, we leverage the recently developed Laplacian Smoothing (LS… ▽ More

    Submitted 2 November, 2019; originally announced November 2019.

    Comments: 27 pages, 5 figures

    MSC Class: 62Dxx; 65Yxx; 65Cxx

  20. arXiv:1907.06800  [pdf, other

    cs.LG math.NA stat.ML

    Graph Interpolating Activation Improves Both Natural and Robust Accuracies in Data-Efficient Deep Learning

    Authors: Bao Wang, Stanley J. Osher

    Abstract: Improving the accuracy and robustness of deep neural nets (DNNs) and adapting them to small training data are primary tasks in deep learning research. In this paper, we replace the output activation function of DNNs, typically the data-agnostic softmax function, with a graph Laplacian-based high dimensional interpolating function which, in the continuum limit, converges to the solution of a Laplac… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: 34 pages, 10 figures

    MSC Class: 68T01; 68T45

  21. arXiv:1906.12056  [pdf, other

    cs.LG cs.CR stat.ML

    DP-LSSGD: A Stochastic Optimization Method to Lift the Utility in Privacy-Preserving ERM

    Authors: Bao Wang, Quanquan Gu, March Boedihardjo, Farzin Barekat, Stanley J. Osher

    Abstract: Machine learning (ML) models trained by differentially private stochastic gradient descent (DP-SGD) have much lower utility than the non-private ones. To mitigate this degradation, we propose a DP Laplacian smoothing SGD (DP-LSSGD) to train ML models with differential privacy (DP) guarantees. At the core of DP-LSSGD is the Laplacian smoothing, which smooths out the Gaussian noise used in the Gauss… ▽ More

    Submitted 7 December, 2019; v1 submitted 28 June, 2019; originally announced June 2019.

    Comments: 21 pages, 7 figures

    MSC Class: 68T05

  22. arXiv:1903.05662  [pdf, other

    cs.LG math.OC stat.ML

    Understanding Straight-Through Estimator in Training Activation Quantized Neural Nets

    Authors: Penghang Yin, Jiancheng Lyu, Shuai Zhang, Stanley Osher, Yingyong Qi, Jack Xin

    Abstract: Training activation quantized neural networks involves minimizing a piecewise constant function whose gradient vanishes almost everywhere, which is undesirable for the standard back-propagation or chain rule. An empirical way around this issue is to use a straight-through estimator (STE) (Bengio et al., 2013) in the backward pass only, so that the "gradient" through the modified chain rule becomes… ▽ More

    Submitted 25 September, 2019; v1 submitted 13 March, 2019; originally announced March 2019.

    Comments: in International Conference on Learning Representations (ICLR) 2019

  23. arXiv:1901.06827  [pdf, other

    cs.LG math.DS math.NA stat.ML

    A Deterministic Gradient-Based Approach to Avoid Saddle Points

    Authors: Lisa Maria Kreusser, Stanley J. Osher, Bao Wang

    Abstract: Loss functions with a large number of saddle points are one of the major obstacles for training modern machine learning models efficiently. First-order methods such as gradient descent are usually the methods of choice for training machine learning models. However, these methods converge to saddle points for certain choices of initial guesses. In this paper, we propose a modification of the recent… ▽ More

    Submitted 28 September, 2020; v1 submitted 21 January, 2019; originally announced January 2019.

  24. arXiv:1811.10745  [pdf, other

    cs.LG cs.CR math.NA stat.ML

    ResNets Ensemble via the Feynman-Kac Formalism to Improve Natural and Robust Accuracies

    Authors: Bao Wang, Binjie Yuan, Zuoqiang Shi, Stanley J. Osher

    Abstract: Empirical adversarial risk minimization (EARM) is a widely used mathematical framework to robustly train deep neural nets (DNNs) that are resistant to adversarial attacks. However, both natural and robust accuracies, in classifying clean and adversarial images, respectively, of the trained robust models are far from satisfactory. In this work, we unify the theory of optimal control of transport eq… ▽ More

    Submitted 10 June, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 18 pages, 6 figures

    MSC Class: 68Txx

  25. arXiv:1811.06492  [pdf, other

    cs.LG cs.CR stat.ML

    Mathematical Analysis of Adversarial Attacks

    Authors: Zehao Dou, Stanley J. Osher, Bao Wang

    Abstract: In this paper, we analyze efficacy of the fast gradient sign method (FGSM) and the Carlini-Wagner's L2 (CW-L2) attack. We prove that, within a certain regime, the untargeted FGSM can fool any convolutional neural nets (CNNs) with ReLU activation; the targeted FGSM can mislead any CNNs with ReLU activation to classify any given image into any prescribed class. For a special two-layer neural network… ▽ More

    Submitted 25 November, 2018; v1 submitted 15 November, 2018; originally announced November 2018.

    Comments: 21 pages

  26. arXiv:1809.08516  [pdf, other

    cs.LG math.NA stat.ML

    Adversarial Defense via Data Dependent Activation Function and Total Variation Minimization

    Authors: Bao Wang, Alex T. Lin, Wei Zhu, Penghang Yin, Andrea L. Bertozzi, Stanley J. Osher

    Abstract: We improve the robustness of Deep Neural Net (DNN) to adversarial attacks by using an interpolating function as the output activation. This data-dependent activation remarkably improves both the generalization and robustness of DNN. In the CIFAR10 benchmark, we raise the robust accuracy of the adversarially trained ResNet20 from $\sim 46\%$ to $\sim 69\%$ under the state-of-the-art Iterative Fast… ▽ More

    Submitted 29 April, 2020; v1 submitted 22 September, 2018; originally announced September 2018.

    Comments: 17 pages, 6 figures

    MSC Class: 68Pxx

    Journal ref: Inverse Problems and Imaging, 2020

  27. arXiv:1808.05240  [pdf, other

    cs.LG cs.CV stat.ML

    Blended Coarse Gradient Descent for Full Quantization of Deep Neural Networks

    Authors: Penghang Yin, Shuai Zhang, Jiancheng Lyu, Stanley Osher, Yingyong Qi, Jack Xin

    Abstract: Quantized deep neural networks (QDNNs) are attractive due to their much lower memory storage and faster inference speed than their regular full precision counterparts. To maintain the same performance level especially at low bit-widths, QDNNs must be retrained. Their training involves piecewise constant activation functions and discrete weights, hence mathematical challenges arise. We introduce th… ▽ More

    Submitted 6 January, 2019; v1 submitted 15 August, 2018; originally announced August 2018.

  28. arXiv:1806.06317  [pdf, other

    cs.LG math.NA stat.ML

    Laplacian Smoothing Gradient Descent

    Authors: Stanley Osher, Bao Wang, Penghang Yin, Xiyang Luo, Farzin Barekat, Minh Pham, Alex Lin

    Abstract: We propose a class of very simple modifications of gradient descent and stochastic gradient descent. We show that when applied to a large variety of machine learning problems, ranging from logistic regression to deep neural nets, the proposed surrogates can dramatically reduce the variance, allow to take a larger step size, and improve the generalization accuracy. The methods only involve multiply… ▽ More

    Submitted 27 April, 2019; v1 submitted 16 June, 2018; originally announced June 2018.

    Comments: 28 pages, 15 figures

    MSC Class: 65-06

  29. arXiv:1802.00168  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Neural Nets with Interpolating Function as Output Activation

    Authors: Bao Wang, Xiyang Luo, Zhen Li, Wei Zhu, Zuoqiang Shi, Stanley J. Osher

    Abstract: We replace the output layer of deep neural nets, typically the softmax function, by a novel interpolating function. And we propose end-to-end training and testing algorithms for this new architecture. Compared to classical neural nets with softmax function as output activation, the surrogate with interpolating function as output activation combines advantages of both deep and manifold learning. Th… ▽ More

    Submitted 16 June, 2018; v1 submitted 1 February, 2018; originally announced February 2018.

    Comments: 11 pages, 4 figures

    MSC Class: 68Txx

  30. arXiv:1711.08833  [pdf, other

    cs.LG math.NA stat.ML

    Deep Learning for Real-Time Crime Forecasting and its Ternarization

    Authors: Bao Wang, Penghang Yin, Andrea L. Bertozzi, P. Jeffrey Brantingham, Stanley J. Osher, Jack Xin

    Abstract: Real-time crime forecasting is important. However, accurate prediction of when and where the next crime will happen is difficult. No known physical model provides a reasonable approximation to such a complex system. Historical crime data are sparse in both space and time and the signal of interests is weak. In this work, we first present a proper representation of crime data. We then adapt the spa… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

    Comments: 14 pages, 7 figures

    MSC Class: 62-07

  31. arXiv:1710.07746  [pdf, other

    math.OC cs.LG stat.ML

    Stochastic Backward Euler: An Implicit Gradient Descent Algorithm for $k$-means Clustering

    Authors: Penghang Yin, Minh Pham, Adam Oberman, Stanley Osher

    Abstract: In this paper, we propose an implicit gradient descent algorithm for the classic $k$-means problem. The implicit gradient step or backward Euler is solved via stochastic fixed-point iteration, in which we randomly sample a mini-batch gradient in every iteration. It is the average of the fixed-point trajectory that is carried over to the next gradient step. We draw connections between the proposed… ▽ More

    Submitted 21 May, 2018; v1 submitted 20 October, 2017; originally announced October 2017.

  32. Sparse Recovery via Differential Inclusions

    Authors: Stanley Osher, Feng Ruan, Jiechao Xiong, Yuan Yao, Wotao Yin

    Abstract: In this paper, we recover sparse signals from their noisy linear measurements by solving nonlinear differential inclusions, which is based on the notion of inverse scale space (ISS) developed in applied mathematics. Our goal here is to bring this idea to address a challenging problem in statistics, \emph{i.e.} finding the oracle estimator which is unbiased and sign-consistent using dynamics. We ca… ▽ More

    Submitted 21 January, 2016; v1 submitted 30 June, 2014; originally announced June 2014.

    Comments: In Applied and Computational Harmonic Analysis, 2016

    Report number: CAM Report 14-61

  33. arXiv:1207.6430  [pdf, other

    stat.ML cs.LG stat.AP

    Optimal Data Collection For Informative Rankings Expose Well-Connected Graphs

    Authors: Braxton Osting, Christoph Brune, Stanley J. Osher

    Abstract: Given a graph where vertices represent alternatives and arcs represent pairwise comparison data, the statistical ranking problem is to find a potential function, defined on the vertices, such that the gradient of the potential function agrees with the pairwise comparisons. Our goal in this paper is to develop a method for collecting data for which the least squares estimator for the ranking proble… ▽ More

    Submitted 4 June, 2014; v1 submitted 26 July, 2012; originally announced July 2012.

    Comments: 31 pages, 10 figures, 3 tables

    Report number: UCLA CAM report 12-32 MSC Class: 62F07; 05C40; 49N45;

  34. A convex model for non-negative matrix factorization and dimensionality reduction on physical space

    Authors: Ernie Esser, Michael Möller, Stanley Osher, Guillermo Sapiro, Jack Xin

    Abstract: A collaborative convex framework for factoring a data matrix $X$ into a non-negative product $AS$, with a sparse coefficient matrix $S$, is proposed. We restrict the columns of the dictionary matrix $A$ to coincide with certain columns of the data matrix $X$, thereby guaranteeing a physically meaningful dictionary and dimensionality reduction. We use $l_{1,\infty}$ regularization to select the dic… ▽ More

    Submitted 4 February, 2011; originally announced February 2011.

    Comments: 14 pages, 9 figures. EE and JX were supported by NSF grants {DMS-0911277}, {PRISM-0948247}, MM by the German Academic Exchange Service (DAAD), SO and MM by NSF grants {DMS-0835863}, {DMS-0914561}, {DMS-0914856} and ONR grant {N00014-08-1119}, and GS was supported by NSF, NGA, ONR, ARO, DARPA, and {NSSEFF.}