Skip to main content

Showing 1–20 of 20 results for author: Yi, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2501.00212  [pdf, other

    stat.ME

    Denoising Data with Measurement Error Using a Reproducing Kernel-based Diffusion Model

    Authors: Mingyang Yi, Marcos Matabuena, Ruoyu Wang

    Abstract: The ongoing technological revolution in measurement systems enables the acquisition of high-resolution samples in fields such as engineering, biology, and medicine. However, these observations are often subject to errors from measurement devices. Motivated by this challenge, we propose a denoising framework that employs diffusion models to generate denoised data whose distribution closely approxim… ▽ More

    Submitted 30 December, 2024; originally announced January 2025.

  2. arXiv:2412.07075  [pdf, other

    math.OC stat.ML

    Conformal Uncertainty Quantification of Electricity Price Predictions for Risk-Averse Storage Arbitrage

    Authors: Saud Alghumayjan, Ming Yi, Bolun Xu

    Abstract: This paper proposes a risk-averse approach to energy storage price arbitrage, leveraging conformal uncertainty quantification for electricity price predictions. The method addresses the significant challenges posed by the inherent volatility and uncertainty of real-time electricity prices, which create substantial risks of financial losses for energy storage participants relying on future price fo… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  3. arXiv:2311.15982  [pdf, other

    stat.ME math.ST

    Stab-GKnock: Controlled variable selection for partially linear models using generalized knockoffs

    Authors: Han Su, Panxu Yuan, Qingyang Sun, Mengxi Yi, Gaorong Li

    Abstract: The recently proposed fixed-X knockoff is a powerful variable selection procedure that controls the false discovery rate (FDR) in any finite-sample setting, yet its theoretical insights are difficult to show beyond Gaussian linear models. In this paper, we make the first attempt to extend the fixed-X knockoff to partially linear models by using generalized knockoff features, and propose a new stab… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 40 pages, 11 figures, 4 tables

  4. arXiv:2310.20090  [pdf, other

    stat.ML cs.LG stat.CO

    Bridging the Gap Between Variational Inference and Wasserstein Gradient Flows

    Authors: Mingxuan Yi, Song Liu

    Abstract: Variational inference is a technique that approximates a target distribution by optimizing within the parameter space of variational families. On the other hand, Wasserstein gradient flows describe optimization within the space of probability measures where they do not necessarily admit a parametric density function. In this paper, we bridge the gap between these two methods. We demonstrate that,… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  5. arXiv:2309.05019  [pdf, ps, other

    cs.LG stat.ML

    SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models

    Authors: Shuchen Xue, Mingyang Yi, Weijian Luo, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhi-Ming Ma

    Abstract: Diffusion Probabilistic Models (DPMs) have achieved considerable success in generation tasks. As sampling from DPMs is equivalent to solving diffusion SDE or ODE which is time-consuming, numerous fast sampling methods built upon improved differential equation solvers are proposed. The majority of such techniques consider solving the diffusion ODE due to its superior efficiency. However, stochastic… ▽ More

    Submitted 24 June, 2025; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: Accepted in NeurIPS 2023

  6. arXiv:2307.15774  [pdf, other

    stat.ME

    Robust and Resistant Regularized Covariance Matrices

    Authors: David E. Tyler, Mengxi Yi, Klaus Nordhausen

    Abstract: We introduce a class of regularized M-estimators of multivariate scatter and show, analogous to the popular spatial sign covariance matrix (SSCM), that they possess high breakdown points. We also show that the SSCM can be viewed as an extreme member of this class. Unlike the SSCM, this class of estimators takes into account the shape of the contours of the data cloud when down-weighing observation… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: 22 pages, 2 figures, 1 table. arXiv admin note: text overlap with arXiv:2003.00078

    MSC Class: 62H12; 62F35

  7. arXiv:2305.15577  [pdf, other

    stat.ML cs.LG

    Minimizing $f$-Divergences by Interpolating Velocity Fields

    Authors: Song Liu, Jiahao Yu, Jack Simons, Mingxuan Yi, Mark Beaumont

    Abstract: Many machine learning problems can be seen as approximating a \textit{target} distribution using a \textit{particle} distribution by minimizing their statistical discrepancy. Wasserstein Gradient Flow can move particles along a path that minimizes the $f$-divergence between the target and particle distributions. To move particles, we need to calculate the corresponding velocity fields derived from… ▽ More

    Submitted 6 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: This manuscript is an extended version of the ICML2024 version. The code for reproducing our results can be found at https://github.com/anewgithubname/gradest2

  8. arXiv:2302.01075  [pdf, other

    stat.ML cs.LG

    MonoFlow: Rethinking Divergence GANs via the Perspective of Wasserstein Gradient Flows

    Authors: Mingxuan Yi, Zhanxing Zhu, Song Liu

    Abstract: The conventional understanding of adversarial training in generative adversarial networks (GANs) is that the discriminator is trained to estimate a divergence, and the generator learns to minimize this divergence. We argue that despite the fact that many variants of GANs were developed following this paradigm, the current theoretical understanding of GANs and their practical algorithms are inconsi… ▽ More

    Submitted 8 August, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  9. arXiv:2207.13177  [pdf, other

    stat.ML cs.LG

    Sliced Wasserstein Variational Inference

    Authors: Mingxuan Yi, Song Liu

    Abstract: Variational Inference approximates an unnormalized distribution via the minimization of Kullback-Leibler (KL) divergence. Although this divergence is efficient for computation and has been widely used in applications, it suffers from some unreasonable properties. For example, it is not a proper metric, i.e., it is non-symmetric and does not preserve the triangle inequality. On the other hand, opti… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  10. arXiv:2203.11528  [pdf, other

    stat.ML cs.LG

    Out-of-distribution Generalization with Causal Invariant Transformations

    Authors: Ruoyu Wang, Mingyang Yi, Zhitang Chen, Shengyu Zhu

    Abstract: In real-world applications, it is important and desirable to learn a model that performs well on out-of-distribution (OOD) data. Recently, causality has become a powerful tool to tackle the OOD generalization problem, with the idea resting on the causal mechanism that is invariant across domains of interest. To leverage the generally unknown causal mechanism, existing works assume a linear form of… ▽ More

    Submitted 23 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: accepted by cvpr2022

  11. arXiv:2111.00743  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Towards the Generalization of Contrastive Self-Supervised Learning

    Authors: Weiran Huang, Mingyang Yi, Xuyang Zhao, Zihao Jiang

    Abstract: Recently, self-supervised learning has attracted great attention, since it only requires unlabeled data for model training. Contrastive learning is one popular method for self-supervised learning and has achieved promising empirical performance. However, the theoretical understanding of its generalization ability is still limited. To this end, we define a kind of $(σ,δ)$-measure to mathematically… ▽ More

    Submitted 2 March, 2023; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted by ICLR 2023

  12. arXiv:2007.03747  [pdf, ps, other

    eess.SP cs.LG stat.ML

    On Cokriging, Neural Networks, and Spatial Blind Source Separation for Multivariate Spatial Prediction

    Authors: Christoph Muehlmann, Klaus Nordhausen, Mengxi Yi

    Abstract: Multivariate measurements taken at irregularly sampled locations are a common form of data, for example in geochemical analysis of soil. In practical considerations predictions of these measurements at unobserved locations are of great interest. For standard multivariate spatial prediction methods it is mandatory to not only model spatial dependencies but also cross-dependencies which makes it a d… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Journal ref: IEEE Geoscience and Remote Sensing Letters, 18, 1931-1935, 2021

  13. arXiv:2007.03112  [pdf, other

    astro-ph.SR astro-ph.GA stat.ML

    Interpreting Stellar Spectra with Unsupervised Domain Adaptation

    Authors: Teaghan O'Briain, Yuan-Sen Ting, Sébastien Fabbro, Kwang M. Yi, Kim Venn, Spencer Bialek

    Abstract: We discuss how to achieve mapping from large sets of imperfect simulations and observational data with unsupervised domain adaptation. Under the hypothesis that simulated and observed data distributions share a common underlying representation, we show how it is possible to transfer between simulated and observed domains. Driven by an application to interpret stellar spectroscopic sky surveys, we… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: 4 pages, 4 figure, accepted to the ICML 2020 Machine Learning Interpretability for Scientific Discovery workshop. A full 20-page version is submitted to ApJ. The code used in this study is made publicly available on github: https://github.com/teaghan/Cycle_SN

  14. arXiv:2007.03109  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.IM physics.data-an stat.ML

    Cycle-StarNet: Bridging the gap between theory and data by leveraging large datasets

    Authors: Teaghan O'Briain, Yuan-Sen Ting, Sébastien Fabbro, Kwang M. Yi, Kim Venn, Spencer Bialek

    Abstract: The advancements in stellar spectroscopy data acquisition have made it necessary to accomplish similar improvements in efficient data analysis techniques. Current automated methods for analyzing spectra are either (a) data-driven, which requires prior knowledge of stellar parameters and elemental abundances, or (b) based on theoretical synthetic models that are susceptible to the gap between theor… ▽ More

    Submitted 13 November, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: 23 pages, 15 figures, 2 tables, accepted for publication on Nov 12, 2020, Nov 12. A companion 4-page preview is accepted to the ICML 2020 Machine Learning Interpretability for Scientific Discovery workshop. The code used in this study is made publicly available on github: https://github.com/teaghan/Cycle_SN

    Journal ref: 2021, ApJ, 906, 130

  15. arXiv:2003.11249  [pdf, other

    cs.LG cs.CV stat.ML

    VaB-AL: Incorporating Class Imbalance and Difficulty with Variational Bayes for Active Learning

    Authors: Jongwon Choi, Kwang Moo Yi, Jihoon Kim, Jinho Choo, Byoungjip Kim, Jin-Yeop Chang, Youngjune Gwon, Hyung Jin Chang

    Abstract: Active Learning for discriminative models has largely been studied with the focus on individual samples, with less emphasis on how classes are distributed or which classes are hard to deal with. In this work, we show that this is harmful. We propose a method based on the Bayes' rule, that can naturally incorporate class imbalance into the Active Learning framework. We derive that three terms shoul… ▽ More

    Submitted 3 December, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

  16. arXiv:2003.00078  [pdf, ps, other

    stat.ME

    Breakdown points of penalized and hybrid M-estimators of covariance

    Authors: David E. Tyler, Mengxi Yi

    Abstract: We introduce a class of hybrid M-estimators of multivariate scatter which, analogous to the popular spatial sign covariance matrix (SSCM), possess high breakdown points. We also show that the SSCM can be viewed as an extreme member of this class. Unlike the SSCM, but like the regular M-estimators of scatter, this new class of estimators takes into account the shape of the contours of the data clou… ▽ More

    Submitted 28 February, 2020; originally announced March 2020.

    Comments: 8 pages, no figures or tables

    MSC Class: 62H12; 62F35

  17. arXiv:2002.06410  [pdf, other

    stat.ML cs.LG

    Posterior Ratio Estimation of Latent Variables

    Authors: Song Liu, Yulong Zhang, Mingxuan Yi, Mladen Kolar

    Abstract: Density Ratio Estimation has attracted attention from the machine learning community due to its ability to compare the underlying distributions of two datasets. However, in some applications, we want to compare distributions of random variables that are \emph{inferred} from observations. In this paper, we study the problem of estimating the ratio between two posterior probability density functions… ▽ More

    Submitted 25 June, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

  18. arXiv:1903.07120  [pdf, other

    cs.LG stat.ML

    Stabilize Deep ResNet with A Sharp Scaling Factor $τ$

    Authors: Huishuai Zhang, Da Yu, Mingyang Yi, Wei Chen, Tie-Yan Liu

    Abstract: We study the stability and convergence of training deep ResNets with gradient descent. Specifically, we show that the parametric branch in the residual block should be scaled down by a factor $τ=O(1/\sqrt{L})$ to guarantee stable forward/backward process, where $L$ is the number of residual blocks. Moreover, we establish a converse result that the forward process is unbounded when… ▽ More

    Submitted 30 January, 2023; v1 submitted 17 March, 2019; originally announced March 2019.

    Comments: Journal version (Published in Machine Learning Journal), 26 pages

    Journal ref: Machine Learning, 111(9), 3359-3392 (2022)

  19. arXiv:1903.02237  [pdf, other

    cs.LG stat.ML

    Positively Scale-Invariant Flatness of ReLU Neural Networks

    Authors: Mingyang Yi, Qi Meng, Wei Chen, Zhi-ming Ma, Tie-Yan Liu

    Abstract: It was empirically confirmed by Keskar et al.\cite{SharpMinima} that flatter minima generalize better. However, for the popular ReLU network, sharp minimum can also generalize well \cite{SharpMinimacan}. The conclusion demonstrates that the existing definitions of flatness fail to account for the complex geometry of ReLU neural networks because they can't cover the Positively Scale-Invariant (PSI)… ▽ More

    Submitted 6 March, 2019; originally announced March 2019.

  20. arXiv:1805.08300  [pdf, other

    stat.ME

    Lassoing Eigenvalues

    Authors: David E. Tyler, Mengxi Yi

    Abstract: The properties of penalized sample covariance matrices depend on the choice of the penalty function. In this paper, we introduce a class of non-smooth penalty functions for the sample covariance matrix, and demonstrate how this method results in a grouping of the estimated eigenvalues. We refer to this method as "lassoing eigenvalues" or as the "elasso".

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: 18 pages, 6 figures

    MSC Class: 62H12 (primary); 62H25 (secondary)