Showing 1–50 of 58 results for author: Liang, F

Searching in archive stat.
  1. arXiv:2505.19136  [pdf, ps, other]

    stat.ML cs.LG

    Uncertainty Quantification for Physics-Informed Neural Networks with Extended Fiducial Inference

    Authors: Frank Shih, Zhenghao Jiang, Faming Liang

    Abstract: Uncertainty quantification (UQ) in scientific machine learning is increasingly critical as neural networks are widely adopted to tackle complex problems across diverse scientific disciplines. For physics-informed neural networks (PINNs), a prominent model in scientific machine learning, uncertainty is typically quantified using Bayesian or dropout methods. However, both approaches suffer from a fu…

    Submitted 25 May, 2025; originally announced May 2025.

  2. arXiv:2505.01995  [pdf, ps, other]

    stat.ML cs.LG math.ST stat.CO

    Extended Fiducial Inference for Individual Treatment Effects via Deep Neural Networks

    Authors: Sehwan Kim, Faming Liang

    Abstract: Individual treatment effect estimation has gained significant attention in recent data science literature. This work introduces the Double Neural Network (Double-NN) method to address this problem within the framework of extended fiducial inference (EFI). In the proposed method, deep neural networks are used to model the treatment and control effect functions, while an additional neural network is…

    Submitted 4 May, 2025; originally announced May 2025.

  3. arXiv:2504.04658  [pdf, other]

    cs.CV stat.AP

    3DM-WeConvene: Learned Image Compression with 3D Multi-Level Wavelet-Domain Convolution and Entropy Model

    Authors: Haisheng Fu, Jie Liang, Feng Liang, Zhenman Fang, Guohe Zhang, Jingning Han

    Abstract: Learned image compression (LIC) has recently made significant progress, surpassing traditional methods. However, most LIC approaches operate mainly in the spatial domain and lack mechanisms for reducing frequency-domain correlations. To address this, we propose a novel framework that integrates low-complexity 3D multi-level Discrete Wavelet Transform (DWT) into convolutional layers and entropy cod…

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: 13 pages

  4. arXiv:2411.00969  [pdf, other]

    stat.ML cs.LG

    Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior

    Authors: Mingxuan Zhang, Yan Sun, Faming Liang

    Abstract: Large pretrained transformer models have revolutionized modern AI applications with their state-of-the-art performance in natural language processing (NLP). However, their substantial parameter count poses challenges for real-world deployment. To address this, researchers often reduce model size by pruning parameters based on their magnitude or sensitivity. Previous research has demonstrated the l…

    Submitted 1 November, 2024; originally announced November 2024.

  5. arXiv:2411.00273  [pdf, other]

    cs.LG stat.AP stat.ML

    Efficient Model Compression for Bayesian Neural Networks

    Authors: Diptarka Saha, Zihe Liu, Feng Liang

    Abstract: Model Compression has drawn much attention within the deep learning community recently. Compressing a dense neural network offers many advantages including lower computation cost, deployability to devices of limited storage and memories, and resistance to adversarial attacks. This may be achieved via weight pruning or fully discarding certain input features. Here we demonstrate a novel strategy to…

    Submitted 31 October, 2024; originally announced November 2024.

  6. arXiv:2411.00256  [pdf, ps, other]

    stat.ME

    Bayesian Smoothing and Feature Selection Using Variational Automatic Relevance Determination

    Authors: Zihe Liu, Diptarka Saha, Feng Liang

    Abstract: This study introduces Variational Automatic Relevance Determination (VARD), a novel approach tailored for fitting sparse additive regression models in high-dimensional settings. VARD distinguishes itself by its ability to independently assess the smoothness of each feature while enabling precise determination of whether a feature's contribution to the response is zero, linear, or nonlinear. Furthe…

    Submitted 31 October, 2024; originally announced November 2024.

  7. arXiv:2409.16276  [pdf, other]

    stat.ME

    Bayesian Variable Selection and Sparse Estimation for High-Dimensional Graphical Models

    Authors: Anwesha Chakravarti, Naveen N. Narisetty, Feng Liang

    Abstract: We introduce a novel Bayesian approach for both covariate selection and sparse precision matrix estimation in the context of high-dimensional Gaussian graphical models involving multiple responses. Our approach provides a sparse estimation of the three distinct sparsity structures: the regression coefficient matrix, the conditional dependency structure among responses, and between responses and co…

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 27 pages in main paper, 33 pages in Supplementary, 4 figures

  8. arXiv:2407.21622  [pdf, other]

    stat.ML cs.LG math.ST

    Extended Fiducial Inference: Toward an Automated Process of Statistical Inference

    Authors: Faming Liang, Sehwan Kim, Yan Sun

    Abstract: While fiducial inference was widely considered a big blunder by R.A. Fisher, the goal he initially set, "inferring the uncertainty of model parameters on the basis of observations", has been continually pursued by many statisticians. To this end, we develop a new statistical inference method called extended fiducial inference (EFI). The new method achieves the goal of fiducial inference by leve…

    Submitted 31 July, 2024; originally announced July 2024.

  9. arXiv:2407.09983  [pdf, other]

    stat.AP

    WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model

    Authors: Haisheng Fu, Jie Liang, Zhenman Fang, Jingning Han, Feng Liang, Guohe Zhang

    Abstract: Recently, learned image compression (LIC) has achieved great progress and even outperformed the traditional approach using DCT or discrete wavelet transform (DWT). However, LIC mainly reduces spatial redundancy in the autoencoder networks and entropy coding, but has not fully removed the frequency-domain correlation explicitly as in DCT or DWT. To leverage the best of both worlds, we propose a surp…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 16 pages, ECCV2024

  10. arXiv:2403.18994  [pdf, other]

    stat.ML cs.LG

    Causal-StoNet: Causal Inference for High-Dimensional Complex Data

    Authors: Yaxin Fang, Faming Liang

    Abstract: With the advancement of data science, the collection of increasingly complex datasets has become commonplace. In such datasets, the data dimension can be extremely high, and the underlying data generation process can be unknown and highly nonlinear. As a result, the task of making causal inference with high-dimensional complex data has become a fundamental problem in many disciplines, such as medi…

    Submitted 27 March, 2024; originally announced March 2024.

  11. arXiv:2403.13178  [pdf, other]

    stat.ML cs.AI cs.LG

    Fast Value Tracking for Deep Reinforcement Learning

    Authors: Frank Shih, Faming Liang

    Abstract: Reinforcement learning (RL) tackles sequential decision-making problems by creating agents that interact with their environment. However, existing algorithms often view these problems as static, focusing on point estimates for model parameters to maximize expected rewards, neglecting the stochastic dynamics of agent-environment interactions and the critical role of uncertainty quantification. Our…

    Submitted 19 March, 2024; originally announced March 2024.

  12. arXiv:2402.15602  [pdf, other]

    math.ST cs.IT cs.LG stat.ML

    Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions

    Authors: Kaihong Zhang, Caitlyn H. Yin, Feng Liang, Jingbo Liu

    Abstract: We study the asymptotic error of score-based diffusion model sampling in large-sample scenarios from a non-parametric statistics perspective. We show that a kernel-based score estimator achieves an optimal mean square error of $\widetilde{O}\left(n^{-1} t^{-\frac{d+2}{2}}(t^{\frac{d}{2}} \vee 1)\right)$ for the score function of $p_0*\mathcal{N}(0,t\boldsymbol{I}_d)$, where $n$ and $d$ represent t…

    Submitted 23 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:60134-60178, 2024

  13. arXiv:2401.11093  [pdf, other]

    stat.AP

    Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding

    Authors: Haisheng Fu, Feng Liang, Jie Liang, Zhenman Fang, Guohe Zhang, Jingning Han

    Abstract: Recent advancements in deep learning-based image compression are notable. However, prevalent schemes that employ a serial context-adaptive entropy model to enhance rate-distortion (R-D) performance are markedly slow. Furthermore, the complexities of the encoding and decoding networks are substantially high, rendering them unsuitable for some practical applications. In this paper, we propose two te…

    Submitted 21 March, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted by DCC2024

  14. arXiv:2310.03243  [pdf, other]

    stat.ML cs.AI cs.LG

    Sparse Deep Learning for Time Series Data: Theory and Applications

    Authors: Mingxuan Zhang, Yan Sun, Faming Liang

    Abstract: Sparse deep learning has become a popular technique for improving the performance of deep neural networks in areas such as uncertainty quantification, variable selection, and large-scale network compression. However, most existing research has focused on problems where the observations are independent and identically distributed (i.i.d.), and there has been little work on the problems where the ob…

    Submitted 4 October, 2023; originally announced October 2023.

  15. arXiv:2306.13641  [pdf, other]

    stat.ML cs.LG

    A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules

    Authors: Sehwan Kim, Qifan Song, Faming Liang

    Abstract: The Generative Adversarial Network (GAN) was recently introduced in the literature as a novel machine learning method for training generative models. It has many applications in statistics such as nonparametric clustering and nonparametric conditional independence tests. However, training the GAN is notoriously difficult due to the issue of mode collapse, which refers to the lack of diversity amon…

    Submitted 23 June, 2023; originally announced June 2023.

  16. arXiv:2306.09262  [pdf, other]

    stat.ML cs.LG cs.PL

    A Heavy-Tailed Algebra for Probabilistic Programming

    Authors: Feynman Liang, Liam Hodgkinson, Michael W. Mahoney

    Abstract: Despite the successes of probabilistic models based on passing noise through neural networks, recent work has identified that such methods often fail to capture tail behavior accurately, unless the tails of the base distribution are appropriately calibrated. To overcome this deficiency, we propose a systematic approach for analyzing the tails of random variables, and we illustrate how this approac…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 21 pages, 6 figures

  17. arXiv:2303.02840  [pdf, other]

    stat.ME

    The conditionally studentized test for high-dimensional parametric regressions

    Authors: Feng Liang, Chuhan Wang, Jiaqi Huang, Lixing Zhu

    Abstract: This paper studies model checking for general parametric regression models having no dimension reduction structures on the predictor vector. Using any U-statistic type test as an initial test, this paper combines the sample-splitting and conditional studentization approaches to construct a COnditionally Studentized Test (COST). Whether the initial test is global or local smoothing-based; the dimen…

    Submitted 17 August, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

    Comments: 35 pages, 2 figures

  18. arXiv:2212.04585  [pdf, other]

    stat.ME stat.CO

    A Double Regression Method for Graphical Modeling of High-dimensional Nonlinear and Non-Gaussian Data

    Authors: Siqi Liang, Faming Liang

    Abstract: Graphical models have long been studied in statistics as a tool for inferring conditional independence relationships among a large set of random variables. Most existing works in graphical modeling focus on cases where the data are Gaussian or mixed and the variables are linearly dependent. In this paper, we propose a double regression method for learning graphical models under the high-dim…

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 1 figure

    MSC Class: 62H22

    Journal ref: Statistics and Its Interface 2023

  19. arXiv:2211.10837  [pdf, other]

    cs.LG stat.CO

    Non-reversible Parallel Tempering for Deep Posterior Approximation

    Authors: Wei Deng, Qian Zhang, Qi Feng, Faming Liang, Guang Lin

    Abstract: Parallel tempering (PT), also known as replica exchange, is the go-to workhorse for simulations of multi-modal distributions. The key to the success of PT is to adopt efficient swap schemes. The popular deterministic even-odd (DEO) scheme exploits the non-reversibility property and has successfully reduced the communication cost from $O(P^2)$ to $O(P)$ given sufficiently many $P$ chains. However,…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023

  20. arXiv:2210.04349  [pdf, other]

    cs.LG stat.ML

    Nonlinear Sufficient Dimension Reduction with a Stochastic Neural Network

    Authors: Siqi Liang, Yan Sun, Faming Liang

    Abstract: Sufficient dimension reduction is a powerful tool to extract core information hidden in the high-dimensional data and has potentially many important applications in machine learning tasks. However, the existing nonlinear sufficient dimension reduction methods often lack the scalability necessary for dealing with large-scale data. We propose a new type of stochastic neural network under a rigorous…

    Submitted 9 October, 2022; originally announced October 2022.

  21. arXiv:2205.07918  [pdf, other]

    stat.ML cs.LG

    Fat-Tailed Variational Inference with Anisotropic Tail Adaptive Flows

    Authors: Feynman Liang, Liam Hodgkinson, Michael W. Mahoney

    Abstract: While fat-tailed densities commonly arise as posterior and marginal distributions in robust models and scale mixtures, they present challenges when Gaussian-based variational inference fails to capture tail decay accurately. We first improve previous theory on tails of Lipschitz flows by quantifying how the tails affect the rate of tail decay and by expanding the theory to non-Lipschitz polynomial…

    Submitted 16 May, 2022; originally announced May 2022.

  22. arXiv:2202.09867  [pdf, other]

    stat.ML cs.LG

    Interacting Contour Stochastic Gradient Langevin Dynamics

    Authors: Wei Deng, Siqi Liang, Botao Hao, Guang Lin, Faming Liang

    Abstract: We propose an interacting contour stochastic gradient Langevin dynamics (ICSGLD) sampler, an embarrassingly parallel multiple-chain contour stochastic gradient Langevin dynamics (CSGLD) sampler with efficient interactions. We show that ICSGLD can be theoretically more efficient than a single-chain CSGLD with an equivalent computational budget. We also present a novel random-field function, which f…

    Submitted 20 February, 2022; originally announced February 2022.

    Comments: ICLR 2022

  23. arXiv:2201.05319  [pdf, other]

    stat.ML cs.LG

    A Kernel-Expanded Stochastic Neural Network

    Authors: Yan Sun, Faming Liang

    Abstract: The deep neural network suffers from many fundamental issues in machine learning. For example, it often gets trapped in a local minimum in training, and its prediction uncertainty is hard to assess. To address these issues, we propose the so-called kernel-expanded stochastic neural network (K-StoNet) model, which incorporates support vector regression (SVR) as the first hidden layer and ref…

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: Accepted by JRSSB

  24. arXiv:2110.04232  [pdf, other]

    stat.ML cs.IR cs.LG stat.ME

    Learning Topic Models: Identifiability and Finite-Sample Analysis

    Authors: Yinyin Chen, Shishuang He, Yun Yang, Feng Liang

    Abstract: Topic models provide a useful text-mining tool for learning, extracting, and discovering latent structures in large text corpora. Although a plethora of methods have been proposed for topic modeling, lacking in the literature is a formal theoretical investigation of the statistical identifiability and accuracy of latent topic estimation. In this paper, we propose a maximum likelihood estimator (ML…

    Submitted 10 August, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

  25. arXiv:2110.00653  [pdf, ps, other]

    stat.ML cs.LG

    Sparse Deep Learning: A New Framework Immune to Local Traps and Miscalibration

    Authors: Yan Sun, Wenjun Xiong, Faming Liang

    Abstract: Deep learning has powered recent successes of artificial intelligence (AI). However, the deep neural network, as the basic model of deep learning, has suffered from issues such as local traps and miscalibration. In this paper, we provide a new framework for sparse deep learning, which has the above issues addressed in a coherent way. In particular, we lay down a theoretical foundation for sparse d…

    Submitted 2 December, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  26. arXiv:2105.05363  [pdf, ps, other]

    stat.ME

    A Langevinized Ensemble Kalman Filter for Large-Scale Static and Dynamic Learning

    Authors: Peiyi Zhang, Qifan Song, Faming Liang

    Abstract: The Ensemble Kalman Filter (EnKF) has achieved great successes in data assimilation in atmospheric and oceanic sciences, but its failure in convergence to the right filtering distribution precludes its use for uncertainty quantification. We reformulate the EnKF under the framework of Langevin dynamics, which leads to a new particle filtering algorithm, the so-called Langevinized EnKF. The Langevin…

    Submitted 11 May, 2021; originally announced May 2021.

  27. arXiv:2102.13229  [pdf, other]

    stat.ML cs.LG

    Consistent Sparse Deep Learning: Theory and Computation

    Authors: Yan Sun, Qifan Song, Faming Liang

    Abstract: Deep learning has been the engine powering many successes of data science. However, the deep neural network (DNN), as the basic model of deep learning, is often excessively over-parameterized, causing many difficulties in training, prediction and interpretation. We propose a frequentist-like method for learning sparse DNNs and justify its consistency under the Bayesian framework: the proposed meth…

    Submitted 7 March, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Accepted by JASA

  28. arXiv:2010.12128  [pdf, other]

    cs.LG stat.ML

    Accelerating Metropolis-Hastings with Lightweight Inference Compilation

    Authors: Feynman Liang, Nimar Arora, Nazanin Tehrani, Yucen Li, Michael Tingley, Erik Meijer

    Abstract: In order to construct accurate proposers for Metropolis-Hastings Markov Chain Monte Carlo, we integrate ideas from probabilistic graphical models and neural networks in an open-source framework we call Lightweight Inference Compilation (LIC). LIC implements amortized inference within an open-universe declarative probabilistic programming language (PPL). Graph neural networks are used to parameteri…

    Submitted 22 October, 2020; originally announced October 2020.

    Journal ref: PMLR 130 (2021) 181-189

  29. arXiv:2010.09800  [pdf, other]

    stat.ML cs.LG stat.CO

    A Contour Stochastic Gradient Langevin Dynamics Algorithm for Simulations of Multi-modal Distributions

    Authors: Wei Deng, Guang Lin, Faming Liang

    Abstract: We propose an adaptively weighted stochastic gradient Langevin dynamics algorithm (SGLD), so-called contour stochastic gradient Langevin dynamics (CSGLD), for Bayesian learning in big data statistics. The proposed algorithm is essentially a scalable dynamic importance sampler, which automatically flattens the target distribution such that the simulation for a multi-modal distribution…

    Submitted 23 May, 2022; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted by NeurIPS 2020

  30. arXiv:2010.08864  [pdf, ps, other]

    stat.ME

    Markov Neighborhood Regression for High-Dimensional Inference

    Authors: Faming Liang, Jingnan Xue, Bochao Jia

    Abstract: This paper proposes an innovative method for constructing confidence intervals and assessing p-values in statistical inference for high-dimensional linear models. The proposed method has successfully broken the high-dimensional inference problem into a series of low-dimensional inference problems: For each regression coefficient $β_i$, the confidence interval and $p$-value are computed by regressi…

    Submitted 17 October, 2020; originally announced October 2020.

    Comments: 37 pages, 5 figures

    MSC Class: 62F25; 62J20

    Journal ref: Journal of the American Statistical Association, 2020

  31. arXiv:2010.01084  [pdf, other]

    stat.ML cs.LG math.PR stat.CO stat.ME

    Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction

    Authors: Wei Deng, Qi Feng, Georgios Karagiannis, Guang Lin, Faming Liang

    Abstract: Replica exchange stochastic gradient Langevin dynamics (reSGLD) has shown promise in accelerating the convergence in non-convex learning; however, an excessively large correction for avoiding biases from noisy energy estimators has limited the potential of the acceleration. To address this issue, we study the variance reduction for noisy energy estimators, which promotes much more effective swaps.…

    Submitted 18 March, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted by ICLR 2021

  32. arXiv:2009.09535  [pdf, other]

    stat.ML cs.LG

    Stochastic Gradient Langevin Dynamics Algorithms with Adaptive Drifts

    Authors: Sehwan Kim, Qifan Song, Faming Liang

    Abstract: Bayesian deep learning offers a principled way to address many issues concerning safety of artificial intelligence (AI), such as model uncertainty, model interpretability, and prediction bias. However, due to the lack of efficient Monte Carlo algorithms for sampling from the posterior of deep neural networks (DNNs), Bayesian deep learning has not yet powered our AI system. We propose a class of ada…

    Submitted 20 September, 2020; originally announced September 2020.

    Comments: 27 pages

  33. arXiv:2008.05367  [pdf, other]

    stat.ML cs.LG math.PR stat.ME

    Non-convex Learning via Replica Exchange Stochastic Gradient MCMC

    Authors: Wei Deng, Qi Feng, Liyao Gao, Faming Liang, Guang Lin

    Abstract: Replica exchange Monte Carlo (reMC), also known as parallel tempering, is an important technique for accelerating the convergence of the conventional Markov Chain Monte Carlo (MCMC) algorithms. However, such a method requires the evaluation of the energy function based on the full dataset and is not scalable to big data. The naïve implementation of reMC in mini-batch settings introduces large bias…

    Submitted 22 March, 2021; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: Accepted by ICML 2020

  34. arXiv:2006.10653  [pdf, other]

    cs.LG stat.ML

    Precise expressions for random projections: Low-rank approximation and randomized Newton

    Authors: Michał Dereziński, Feynman Liang, Zhenyu Liao, Michael W. Mahoney

    Abstract: It is often desirable to reduce the dimensionality of a large dataset by projecting it onto a low-dimensional subspace. Matrix sketching has emerged as a powerful technique for performing such dimensionality reduction very efficiently. Even though there is an extensive literature on the worst-case performance of sketching, existing guarantees are typically very different from what is observed in p…

    Submitted 13 June, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: This version of the paper includes a correction to the assumptions in a technical result, Theorem 2. None of the other claims are affected by this change. The conference version of this paper does not include the correction, so we recommend citing this arXiv version when referencing Theorem 2

  35. arXiv:2002.02919  [pdf, ps, other]

    stat.CO cs.LG stat.ML

    Extended Stochastic Gradient MCMC for Large-Scale Bayesian Variable Selection

    Authors: Qifan Song, Yan Sun, Mao Ye, Faming Liang

    Abstract: Stochastic gradient Markov chain Monte Carlo (MCMC) algorithms have received much attention in Bayesian computing for big data problems, but they are only applicable to a small class of problems for which the parameter space has a fixed dimension and the log-posterior density is differentiable with respect to the parameters. This paper proposes an extended stochastic gradient MCMC algorithm which,…

    Submitted 7 February, 2020; originally announced February 2020.

  36. arXiv:1912.04533  [pdf, other]

    cs.LG math.ST stat.ML

    Exact expressions for double descent and implicit regularization via surrogate random design

    Authors: Michał Dereziński, Feynman Liang, Michael W. Mahoney

    Abstract: Double descent refers to the phase transition that is exhibited by the generalization error of unregularized learning models when varying the ratio between the number of parameters and the number of training samples. The recent success of highly over-parameterized machine learning models such as deep neural networks has motivated a theoretical analysis of the double descent phenomenon in classical…

    Submitted 18 June, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Minor typo corrections and clarifications; moved the proofs into the appendix

  37. arXiv:1910.10791  [pdf, other]

    stat.ML cs.LG stat.ME

    An Adaptive Empirical Bayesian Method for Sparse Deep Learning

    Authors: Wei Deng, Xiao Zhang, Faming Liang, Guang Lin

    Abstract: We propose a novel adaptive empirical Bayesian method for sparse deep learning, where the sparsity is ensured via a class of self-adaptive spike-and-slab priors. The proposed method works by alternately sampling from an adaptive hierarchical posterior distribution using stochastic gradient Markov Chain Monte Carlo (MCMC) and smoothly optimizing the hyperparameters using stochastic approximation…

    Submitted 13 April, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: Accepted by NeurIPS 2019; Update the assumption on the regularity of Poisson equation

  38. arXiv:1907.06566  [pdf, other]

    eess.IV cs.LG stat.ML

    Improved Hybrid Layered Image Compression using Deep Learning and Traditional Codecs

    Authors: Haisheng Fu, Feng Liang, Bo Lei, Nai Bian, Qian Zhang, Mohammad Akbari, Jie Liang, Chengjie Tu

    Abstract: Recently, deep learning-based methods have been applied in image compression and achieved many promising results. In this paper, we propose an improved hybrid layered image compression framework by combining deep learning and the traditional image codecs. At the encoder, we first use a convolutional neural network (CNN) to obtain a compact representation of the input image, which is losslessly enco…

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: Submitted to Signal Processing: Image Communication

    Report number: 1907.06566

    Journal ref: Volume 82, March 2020, 115774

  39. arXiv:1906.04133  [pdf, other]

    cs.LG stat.ML

    Bayesian experimental design using regularized determinantal point processes

    Authors: Michał Dereziński, Feynman Liang, Michael W. Mahoney

    Abstract: In experimental design, we are given $n$ vectors in $d$ dimensions, and our goal is to select $k\ll n$ of them to perform expensive measurements, e.g., to obtain labels/responses, for a linear regression task. Many statistical criteria have been proposed for choosing the optimal design, with popular choices including A- and D-optimality. If prior knowledge is given, typically in the form of a…

    Submitted 10 June, 2019; originally announced June 2019.

  40. arXiv:1810.04811  [pdf, other]

    stat.CO

    Stochastic Approximation Hamiltonian Monte Carlo

    Authors: Jonghyun Yun, Minsuk Shin, Ick Hoon Jin, Faming Liang

    Abstract: Recently, Hamiltonian Monte Carlo (HMC) has become widespread as one of the more reliable approaches to efficient sample generation. However, HMC has difficulty sampling from a multimodal posterior distribution because the HMC chain cannot cross the energy barriers between modes due to the energy conservation property. In this paper, we propose a Stochastic Approximation Hamiltonian Monte Carlo (SAH…

    Submitted 19 June, 2020; v1 submitted 10 October, 2018; originally announced October 2018.

  41. arXiv:1807.10025  [pdf, ps, other]

    eess.SP cs.IT stat.ML

    Towards Optimal Power Control via Ensembling Deep Neural Networks

    Authors: Fei Liang, Cong Shen, Wei Yu, Feng Wu

    Abstract: A deep neural network (DNN) based power control method is proposed, which aims at solving the non-convex optimization problem of maximizing the sum rate of a multi-user interference channel. Towards this end, we first present PCNet, which is a multi-layer fully connected neural network that is specifically designed for the power control problem. PCNet takes the channel coefficients as input and ou…

    Submitted 9 March, 2019; v1 submitted 26 July, 2018; originally announced July 2018.

    Comments: 30 pages, 27 figures

  42. arXiv:1805.02620  [pdf, ps, other]

    stat.ME

    Fast Bayesian Integrative Learning of Multiple Gene Regulatory Networks for Type 1 Diabetes

    Authors: Bochao Jia, Faming Liang, the TEDDY Study Group

    Abstract: Motivated by the need to study the molecular mechanism underlying Type 1 Diabetes (T1D) with the gene expression data collected from both the patients and healthy controls at multiple time points, we propose an innovative method for jointly estimating multiple dependent Gaussian graphical models. Compared to the existing methods, the proposed method has a few significant advantages. First, it incl…

    Submitted 7 December, 2018; v1 submitted 7 May, 2018; originally announced May 2018.

  43. arXiv:1805.02547  [pdf, other]

    stat.ME

    Learning Gene Regulatory Networks with High-Dimensional Heterogeneous Data

    Authors: Bochao Jia, Faming Liang

    Abstract: The Gaussian graphical model is a widely used tool for learning gene regulatory networks with high-dimensional gene expression data. Most existing methods for Gaussian graphical models assume that the data are homogeneous, i.e., all samples are drawn from a single Gaussian distribution. However, for many real problems, the data are heterogeneous, which may contain some subgroups or come from diffe…

    Submitted 7 May, 2018; originally announced May 2018.

  44. arXiv:1805.02257  [pdf, other]

    stat.ML cs.LG math.ST stat.CO stat.ME

    Bayesian Regularization for Graphical Models with Unequal Shrinkage

    Authors: Lingrui Gan, Naveen N. Narisetty, Feng Liang

    Abstract: We consider a Bayesian framework for estimating a high-dimensional sparse precision matrix, in which adaptive shrinkage and sparsity are induced by a mixture of Laplace priors. Besides discussing our formulation from the Bayesian standpoint, we investigate the MAP (maximum a posteriori) estimator from a penalized likelihood perspective that gives rise to a new non-convex penalty approximating the…

    Submitted 20 May, 2018; v1 submitted 6 May, 2018; originally announced May 2018.

    Comments: To appear in Journal of the American Statistical Association (Theory & Methods)

  45. arXiv:1802.08308  [pdf, ps, other]

    stat.ME stat.AP

    A Bayesian Mark Interaction Model for Analysis of Tumor Pathology Images

    Authors: Qiwei Li, Xinlei Wang, Faming Liang, Guanghua Xiao

    Abstract: With the advance of imaging technology, digital pathology imaging of tumor tissue slides is becoming a routine clinical procedure for cancer diagnosis. This process produces massive imaging data that capture histological details in high resolution. Recent developments in deep-learning methods have enabled us to identify and classify individual cells from digital pathology images at large scale. Th…

    Submitted 22 February, 2018; originally announced February 2018.

    Comments: 53 pages, 12 figures

    MSC Class: 62M30; 62F15

  46. arXiv:1802.02251  [pdf, ps, other]

    stat.ME

    An Imputation-Consistency Algorithm for High-Dimensional Missing Data Problems and Beyond

    Authors: Faming Liang, Bochao Jia, Jingnan Xue, Qizhai Li, Ye Luo

    Abstract: Missing data are frequently encountered in high-dimensional problems, but they are usually difficult to deal with using standard algorithms, such as the expectation-maximization (EM) algorithm and its variants. To tackle this difficulty, some problem-specific algorithms have been developed in the literature, but there still lacks a general algorithm. This work is to fill the gap: we propose a gene…

    Submitted 6 February, 2018; originally announced February 2018.

    Comments: 30 pages, 1 figure

  47. arXiv:1709.01231  [pdf, ps, other]

    stat.ML cs.LG

    Discriminative Similarity for Clustering and Semi-Supervised Learning

    Authors: Yingzhen Yang, Feng Liang, Nebojsa Jojic, Shuicheng Yan, Jiashi Feng, Thomas S. Huang

    Abstract: Similarity-based clustering and semi-supervised learning methods separate the data into clusters or classes according to the pairwise similarity between the data, and the pairwise similarity is crucial for their performance. In this paper, we propose a novel discriminative similarity learning framework which learns discriminative similarity for either data clustering or semi-supervised learning. T…

    Submitted 5 September, 2017; originally announced September 2017.

  48. An Iterative BP-CNN Architecture for Channel Decoding

    Authors: Fei Liang, Cong Shen, Feng Wu

    Abstract: Inspired by recent advances in deep learning, we propose a novel iterative BP-CNN architecture for channel decoding under correlated noise. This architecture concatenates a trained convolutional neural network (CNN) with a standard belief-propagation (BP) decoder. The standard BP decoder is used to estimate the coded bits, followed by a CNN to remove the estimation errors of the BP decoder and obt…

    Submitted 18 July, 2017; originally announced July 2017.

    Comments: 30 pages, 12 figures

  49. arXiv:1702.05056  [pdf, other]

    stat.ML stat.ME

    An Empirical Bayes Approach for High Dimensional Classification

    Authors: Yunbo Ouyang, Feng Liang

    Abstract: We propose an empirical Bayes estimator based on Dirichlet process mixture model for estimating the sparse normalized mean difference, which could be directly applied to the high dimensional linear classification. In theory, we build a bridge to connect the estimation error of the mean difference and the misclassification error, also provide sufficient conditions of sub-optimal classifiers and opt…

    Submitted 16 February, 2017; originally announced February 2017.

  50. arXiv:1702.04330  [pdf, ps, other]

    stat.ME

    A Nonparametric Bayesian Approach for Sparse Sequence Estimation

    Authors: Yunbo Ouyang, Feng Liang

    Abstract: A nonparametric Bayes approach is proposed for the problem of estimating a sparse sequence based on Gaussian random variables. We adopt the popular two-group prior with one component being a point mass at zero, and the other component being a mixture of Gaussian distributions. Although the Gaussian family has been shown to be suboptimal for this problem, we find that Gaussian mixtures, with a prop…

    Submitted 29 May, 2017; v1 submitted 14 February, 2017; originally announced February 2017.

    Comments: Revise technical conditions