Showing 1–16 of 16 results for author: Ryu, J J

  1. arXiv:2506.01523  [pdf, ps, other]

    cs.LG stat.ML

    Alignment as Distribution Learning: Your Preference Model is Explicitly a Language Model

    Authors: Jihun Yun, Juno Kim, Jongho Park, Junhyuck Kim, Jongha Jon Ryu, Jaewoong Cho, Kwang-Sung Jun

    Abstract: Alignment via reinforcement learning from human feedback (RLHF) has become the dominant paradigm for controlling the quality of outputs from large language models (LLMs). However, when viewed as `loss + regularization,' the standard RLHF objective lacks theoretical justification and incentivizes degenerate, deterministic solutions, an issue that variants such as Direct Policy Optimization (DPO) al…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 26 pages, 7 tables

  2. arXiv:2502.10826  [pdf, other]

    cs.LG cs.IT stat.ML

    Improved Offline Contextual Bandits with Second-Order Bounds: Betting and Freezing

    Authors: J. Jon Ryu, Jeongyeol Kwon, Benjamin Koppe, Kwang-Sung Jun

    Abstract: We consider the off-policy selection and learning in contextual bandits where the learner aims to select or train a reward-maximizing policy using data collected by a fixed behavior policy. Our contribution is two-fold. First, we propose a novel off-policy selection method that leverages a new betting-based confidence bound applied to an inverse propensity weight sequence. Our theoretical analysis…

    Submitted 15 February, 2025; originally announced February 2025.

    Comments: 36 pages, 8 figures

  3. arXiv:2502.09609  [pdf, other]

    cs.LG cs.AI stat.ML

    Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions

    Authors: Tejas Jayashankar, J. Jon Ryu, Gregory Wornell

    Abstract: We propose Score-of-Mixture Training (SMT), a novel framework for training one-step generative models by minimizing a class of divergences called the $α$-skew Jensen-Shannon divergence. At its core, SMT estimates the score of mixture distributions between real and fake samples across multiple noise levels. Similar to consistency models, our approach supports both training from scratch (SMT) and di…

    Submitted 13 February, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: 27 pages, 9 figures. Title updated to match the title of the manuscript, otherwise identical to v1

  4. arXiv:2409.18209  [pdf, ps, other]

    stat.ML cs.LG math.ST

    A Unified View on Learning Unnormalized Distributions via Noise-Contrastive Estimation

    Authors: J. Jon Ryu, Abhin Shah, Gregory W. Wornell

    Abstract: This paper studies a family of estimators based on noise-contrastive estimation (NCE) for learning unnormalized distributions. The main contribution of this work is to provide a unified perspective on various methods for learning unnormalized distributions, which have been independently proposed and studied in separate research communities, through the lens of NCE. This unified view offers new ins…

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: 35 pages

  5. arXiv:2402.06160  [pdf, other]

    cs.LG stat.ML

    Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?

    Authors: Maohao Shen, J. Jon Ryu, Soumya Ghosh, Yuheng Bu, Prasanna Sattigeri, Subhro Das, Gregory W. Wornell

    Abstract: This paper questions the effectiveness of a modern predictive uncertainty quantification approach, called \emph{evidential deep learning} (EDL), in which a single neural network model is trained to learn a meta distribution over the predictive distribution by minimizing a specific objective function. Despite their perceived strong empirical performance on downstream tasks, a line of recent studies…

    Submitted 31 October, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 35 pages, 14 figures. NeurIPS 2024

  6. arXiv:2402.03683  [pdf, other]

    stat.ME cs.IT math.ST

    Gambling-Based Confidence Sequences for Bounded Random Vectors

    Authors: J. Jon Ryu, Gregory W. Wornell

    Abstract: A confidence sequence (CS) is a sequence of confidence sets that contains a target parameter of an underlying stochastic process at any time step with high probability. This paper proposes a new approach to constructing CSs for means of bounded multivariate stochastic processes using a general gambling framework, extending the recently established coin toss framework for bounded random processes.…

    Submitted 21 August, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 14 pages, 3 figures. ICML 2024

  7. arXiv:2402.03655  [pdf, other]

    cs.LG math.NA stat.ML

    Operator SVD with Neural Networks via Nested Low-Rank Approximation

    Authors: J. Jon Ryu, Xiangxiang Xu, H. S. Melihcan Erol, Yuheng Bu, Lizhong Zheng, Gregory W. Wornell

    Abstract: Computing eigenvalue decomposition (EVD) of a given linear operator, or finding its leading eigenvalues and eigenfunctions, is a fundamental task in many machine learning and scientific computing problems. For high-dimensional eigenvalue problems, training neural networks to parameterize the eigenfunctions is considered as a promising alternative to the classical numerical linear algebra technique…

    Submitted 21 August, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 36 pages, 7 figures. ICML 2024. Almost identical to the conference version, except a few updates for fixing typos and mistakes

  8. arXiv:2302.08077  [pdf, other]

    cs.LG

    Group Fairness with Uncertainty in Sensitive Attributes

    Authors: Abhin Shah, Maohao Shen, Jongha Jon Ryu, Subhro Das, Prasanna Sattigeri, Yuheng Bu, Gregory W. Wornell

    Abstract: Learning a fair predictive model is crucial to mitigate biased decisions against minority groups in high-stakes applications. A common approach to learn such a model involves solving an optimization problem that maximizes the predictive power of the model under an appropriate group fairness constraint. However, in practice, sensitive attributes are often missing or noisy resulting in uncertainty.…

    Submitted 7 June, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

  9. arXiv:2207.12382  [pdf, other]

    math.PR cs.IT stat.ME

    On Confidence Sequences for Bounded Random Processes via Universal Gambling Strategies

    Authors: J. Jon Ryu, Alankrita Bhatt

    Abstract: This paper considers the problem of constructing a confidence sequence, which is a sequence of confidence intervals that hold uniformly over time, for estimating the mean of bounded real-valued random processes. This paper revisits the gambling-based approach established in the recent literature from a natural \emph{two-horse race} perspective, and demonstrates new properties of the resulting algo…

    Submitted 21 August, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: 20 pages, 3 figures. IEEE Transactions on Information Theory (to appear)

  10. arXiv:2202.06005  [pdf, ps, other]

    cs.IT math.PR

    An Information-Theoretic Proof of the Kac--Bernstein Theorem

    Authors: J. Jon Ryu, Young-Han Kim

    Abstract: A short, information-theoretic proof of the Kac--Bernstein theorem, which is stated as follows, is presented: For any independent random variables $X$ and $Y$, if $X+Y$ and $X-Y$ are independent, then $X$ and $Y$ are normally distributed.

    Submitted 21 February, 2022; v1 submitted 12 February, 2022; originally announced February 2022.

    Comments: 4 pages

  11. arXiv:2202.02464  [pdf, other]

    math.ST cs.DC cs.IT cs.LG stat.ML

    Minimax Optimal Algorithms with Fixed-$k$-Nearest Neighbors

    Authors: J. Jon Ryu, Young-Han Kim

    Abstract: This paper presents how to perform minimax optimal classification, regression, and density estimation based on fixed-$k$ nearest neighbor (NN) searches. We consider a distributed learning scenario, in which a massive dataset is split into smaller groups, where the $k$-NNs are found for a query point with respect to each subset of data. We propose \emph{optimal} rules to aggregate the fixed-$k$-NN…

    Submitted 6 September, 2024; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 65 pages, 5 figures. The manuscript has been revised from scratch compared to the previous version. Notable differences include (1) updated statements and corrected proofs for classification and regression, (2) explicit statements and proofs for distance-selective rules, and (3) new analogous estimators for density estimation

  12. arXiv:2202.02431  [pdf, ps, other]

    cs.IT

    On Universal Portfolios with Continuous Side Information

    Authors: Alankrita Bhatt, J. Jon Ryu, Young-Han Kim

    Abstract: A new portfolio selection strategy that adapts to a continuous side-information sequence is presented, with a universal wealth guarantee against a class of state-constant rebalanced portfolios with respect to a state function that maps each side-information symbol to a finite set of states. In particular, given that a state function belongs to a collection of functions of finite Natarajan dimensio…

    Submitted 4 February, 2022; originally announced February 2022.

  13. arXiv:2202.02406  [pdf, other]

    cs.IT cs.LG math.OC

    Parameter-free Online Linear Optimization with Side Information via Universal Coin Betting

    Authors: J. Jon Ryu, Alankrita Bhatt, Young-Han Kim

    Abstract: A class of parameter-free online linear optimization algorithms is proposed that harnesses the structure of an adversarial sequence by adapting to some side information. These algorithms combine the reduction technique of Orabona and Pál (2016) for adapting coin betting algorithms for online linear optimization with universal compression techniques in information theory for incorporating sequent…

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: 23 pages, 5 figures, to appear at AISTATS 2022

  14. arXiv:1911.04018  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Feedback Recurrent AutoEncoder

    Authors: Yang Yang, Guillaume Sautière, J. Jon Ryu, Taco S Cohen

    Abstract: In this work, we propose a new recurrent autoencoder architecture, termed Feedback Recurrent AutoEncoder (FRAE), for online compression of sequential data with temporal dependency. The recurrent structure of FRAE is designed to efficiently extract the redundancy along the time dimension and allows a compact discrete representation of the data to be learned. We demonstrate its effectiveness in spee…

    Submitted 17 February, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Journal ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  15. arXiv:1905.10945  [pdf, other]

    cs.LG stat.ML

    Learning with Succinct Common Representation Based on Wyner's Common Information

    Authors: J. Jon Ryu, Yoojin Choi, Young-Han Kim, Mostafa El-Khamy, Jungwon Lee

    Abstract: A new bimodal generative model is proposed for generating conditional and joint samples, accompanied with a training method with learning a succinct bottleneck representation. The proposed model, dubbed as the variational Wyner model, is designed based on two classical problems in network information theory -- distributed simulation and channel synthesis -- in which Wyner's common information aris…

    Submitted 27 July, 2022; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: 20 pages, 7 figures

  16. arXiv:1805.08342  [pdf, other]

    math.ST cs.IT stat.ME stat.ML

    Nearest neighbor density functional estimation from inverse Laplace transform

    Authors: J. Jon Ryu, Shouvik Ganguly, Young-Han Kim, Yung-Kyun Noh, Daniel D. Lee

    Abstract: A new approach to $L_2$-consistent estimation of a general density functional using $k$-nearest neighbor distances is proposed, where the functional under consideration is in the form of the expectation of some function $f$ of the densities at each point. The estimator is designed to be asymptotically unbiased, using the convergence of the normalized volume of a $k$-nearest neighbor ball to a Gamm…

    Submitted 4 February, 2022; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: 43 pages, 4 figures. IEEE Transactions on Information Theory (to appear)