Showing 1–50 of 107 results for author: Ye, C

Searching in archive stat.
  1. arXiv:2506.24013  [pdf, ps, other]

    stat.ME q-bio.GN stat.AP

    CoMMiT: Co-informed inference of microbiome-metabolome interactions via transfer learning

    Authors: Leiyue Li, Chenglong Ye, Tim Randolph, Meredith Hullar, Johanna Lampe, Marian Neuhouser, Daniel Raftery, Yue Wang

    Abstract: Recent multi-omic microbiome studies enable integrative analysis of microbes and metabolites, uncovering their associations with various host conditions. Such analyses require multivariate models capable of accounting for the complex correlation structures between microbes and metabolites. However, existing multivariate models often suffer from low statistical power for detecting microbiome-metabo…

    Submitted 30 June, 2025; originally announced June 2025.

    Comments: 38 pages, 5 figures

  2. arXiv:2505.17004  [pdf, ps, other]

    cs.LG cs.AI math.NA stat.ML

    Guided Diffusion Sampling on Function Spaces with Applications to PDEs

    Authors: Jiachen Yao, Abbas Mammadov, Julius Berner, Gavin Kerrigan, Jong Chul Ye, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: We propose a general framework for conditional sampling in PDE-based inverse problems, targeting the recovery of whole solutions from extremely sparse or noisy measurements. This is accomplished by a function-space diffusion model and plug-and-play guidance for conditioning. Our method first trains an unconditional discretization-agnostic denoising model using neural operator architectures. At inf…

    Submitted 22 May, 2025; originally announced May 2025.

  3. arXiv:2504.00024  [pdf, other]

    stat.ME cs.AI cs.LG

    A multi-locus predictiveness curve and its summary assessment for genetic risk prediction

    Authors: Changshuai Wei, Ming Li, Yalu Wen, Chengyin Ye, Qing Lu

    Abstract: With the advance of high-throughput genotyping and sequencing technologies, it becomes feasible to comprehensively evaluate the role of massive genetic predictors in disease prediction. There exists, therefore, a critical need for developing appropriate statistical measurements to assess the combined effects of these genetic variants in disease prediction. Predictiveness curve is commonly used as a…

    Submitted 28 March, 2025; originally announced April 2025.

  4. arXiv:2502.07460  [pdf, ps, other]

    cs.LG stat.ML

    Logarithmic Regret for Online KL-Regularized Reinforcement Learning

    Authors: Heyang Zhao, Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang

    Abstract: Recent advances in Reinforcement Learning from Human Feedback (RLHF) have shown that KL-regularization plays a pivotal role in improving the efficiency of RL fine-tuning for large language models (LLMs). Despite its empirical advantage, the theoretical difference between KL-regularized RL and standard RL remains largely under-explored. While there is a recent line of work on the theoretical analys…

    Submitted 30 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  5. arXiv:2502.06516  [pdf, ps, other]

    cs.LG cs.AI cs.CV stat.ML

    Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation

    Authors: Soobin Um, Beomsu Kim, Jong Chul Ye

    Abstract: Minority samples are underrepresented instances located in low-density regions of a data manifold, and are valuable in many generative AI applications, such as data augmentation, creative content generation, etc. Unfortunately, existing diffusion-based minority generators often rely on computationally expensive guidance dedicated for minority generation. To address this, here we present a simple y…

    Submitted 30 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: ICML 2025, 29 pages, 11 figures

  6. arXiv:2502.02486  [pdf, ps, other]

    stat.ML cs.LG

    Catoni Contextual Bandits are Robust to Heavy-tailed Rewards

    Authors: Chenlu Ye, Yujia Jin, Alekh Agarwal, Tong Zhang

    Abstract: Typical contextual bandit algorithms assume that the rewards at each round lie in some fixed range $[0, R]$, and their regret scales polynomially with this reward range $R$. However, many practical scenarios naturally involve heavy-tailed rewards or rewards where the worst-case range can be substantially larger than the variance. In this paper, we develop an algorithmic approach building on Catoni…

    Submitted 4 February, 2025; originally announced February 2025.

  7. arXiv:2412.00156  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models

    Authors: Taesung Kwon, Jong Chul Ye

    Abstract: In this paper, we propose a novel framework for solving high-definition video inverse problems using latent image diffusion models. Building on recent advancements in spatio-temporal optimization for video inverse problems using image diffusion models, our approach leverages latent-space diffusion models to achieve enhanced video quality and resolution. To address the high computational demands of…

    Submitted 6 March, 2025; v1 submitted 29 November, 2024; originally announced December 2024.

    Comments: Project page: https://vision-xl.github.io/

  8. arXiv:2411.04625  [pdf, other]

    cs.LG stat.ML

    Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

    Authors: Heyang Zhao, Chenlu Ye, Quanquan Gu, Tong Zhang

    Abstract: Reverse-Kullback-Leibler (KL) regularization has emerged as a predominant technique used to enhance policy optimization in reinforcement learning (RL) and reinforcement learning from human feedback (RLHF), which forces the learned policy to stay close to a reference policy. While the effectiveness and necessity of KL-regularization have been empirically demonstrated in various practical scenari…

    Submitted 11 February, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

  9. arXiv:2410.10892  [pdf, ps, other]

    stat.ML cs.DS cs.LG

    Replicable Uniformity Testing

    Authors: Sihan Liu, Christopher Ye

    Abstract: Uniformity testing is arguably one of the most fundamental distribution testing problems. Given sample access to an unknown distribution $\mathbf{p}$ on $[n]$, one must decide if $\mathbf{p}$ is uniform or $\varepsilon$-far from uniform (in total variation distance). A long line of work established that uniformity testing has sample complexity $\Theta(\sqrt{n}\varepsilon^{-2})$. However, when the input…

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: To appear in NeurIPS 2024

  10. arXiv:2409.14684  [pdf, other]

    stat.ME

    Consistent Order Determination of Markov Decision Process

    Authors: Chuyun Ye, Lixing Zhu, Ruoqing Zhu

    Abstract: The Markov assumption in Markov Decision Processes (MDPs) is fundamental in reinforcement learning, influencing both theoretical research and practical applications. Existing methods that rely on the Bellman equation benefit tremendously from this assumption for policy evaluation and inference. Testing the Markov assumption or selecting the appropriate order is important for further analysis. Exis…

    Submitted 22 September, 2024; originally announced September 2024.

  11. arXiv:2409.02574  [pdf, other]

    cs.CV cs.AI stat.ML

    Solving Video Inverse Problems Using Image Diffusion Models

    Authors: Taesung Kwon, Jong Chul Ye

    Abstract: Recently, diffusion model-based inverse problem solvers (DIS) have emerged as state-of-the-art approaches for addressing inverse problems, including image super-resolution, deblurring, inpainting, etc. However, their application to video inverse problems arising from spatio-temporal degradation remains largely unexplored due to the challenges in training video diffusion models. To address this iss…

    Submitted 27 February, 2025; v1 submitted 4 September, 2024; originally announced September 2024.

    Comments: ICLR 2025; 25 pages, 17 figures

  12. arXiv:2407.11435  [pdf, other]

    q-bio.GN cs.LG stat.ML

    Genomic Language Models: Opportunities and Challenges

    Authors: Gonzalo Benegas, Chengzhong Ye, Carlos Albors, Jianan Canal Li, Yun S. Song

    Abstract: Large language models (LLMs) are having transformative impacts across a wide range of scientific fields, particularly in the biomedical sciences. Just as the goal of Natural Language Processing is to understand sequences of words, a major objective in biology is to understand biological sequences. Genomic Language Models (gLMs), which are LLMs trained on DNA sequences, have the potential to signif…

    Submitted 22 September, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: Review article; 26 pages, 3 figures, 1 table

    MSC Class: 92-08; 92B20; 68T50; 68T07

  13. arXiv:2406.02628  [pdf, ps, other]

    stat.ML cs.CC cs.DS cs.LG

    Replicability in High Dimensional Statistics

    Authors: Max Hopkins, Russell Impagliazzo, Daniel Kane, Sihan Liu, Christopher Ye

    Abstract: The replicability crisis is a major issue across nearly all areas of empirical science, calling for the formal study of replicability in statistics. Motivated in this context, [Impagliazzo, Lei, Pitassi, and Sorrell STOC 2022] introduced the notion of replicable learning algorithms, and gave basic procedures for $1$-dimensional tasks including statistical queries. In this work, we study the comput…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 119 pages

    ACM Class: F.2.0

  14. arXiv:2403.14830  [pdf, other]

    stat.ML cs.LG

    Deep Clustering Evaluation: How to Validate Internal Clustering Validation Measures

    Authors: Zeya Wang, Chenglong Ye

    Abstract: Deep clustering, a method for partitioning complex, high-dimensional data using deep neural networks, presents unique evaluation challenges. Traditional clustering validation measures, designed for low-dimensional spaces, are problematic for deep clustering, which involves projecting data into lower-dimensional embeddings before partitioning. Two key issues are identified: 1) the curse of dimensio…

    Submitted 21 March, 2024; originally announced March 2024.

  15. arXiv:2403.14183  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

    Authors: Kwanyoung Kim, Yujin Oh, Jong Chul Ye

    Abstract: The recent success of CLIP has demonstrated promising results in zero-shot semantic segmentation by transferring multimodal knowledge to pixel-level classification. However, leveraging pre-trained CLIP knowledge to closely align text embeddings with pixel embeddings still has limitations in existing approaches. To address this issue, we propose OTSeg, a novel multimodal attention mechanism aimed…

    Submitted 11 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: ECCV 2024; 23 pages, 8 tables, 8 figures; Project Page: https://cubeyoung.github.io/OTSeg_project/

  16. arXiv:2402.08991  [pdf, ps, other]

    stat.ML cs.LG

    Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

    Authors: Chenlu Ye, Jiafan He, Quanquan Gu, Tong Zhang

    Abstract: This study tackles the challenges of adversarial corruption in model-based reinforcement learning (RL), where the transition dynamics can be corrupted by an adversary. Existing studies on corruption-robust RL mostly focus on the setting of model-free RL, where robust least-square regression is often employed for value function estimation. However, these techniques cannot be directly applied to mod…

    Submitted 20 July, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  17. arXiv:2402.08222  [pdf, other]

    stat.ME

    Integration of multiview microbiome data for deciphering microbiome-metabolome-disease pathways

    Authors: Lei Fang, Yue Wang, Chenglong Ye

    Abstract: The intricate interplay between host organisms and their gut microbiota has catalyzed research into the microbiome's role in disease, shedding light on novel aspects of disease pathogenesis. However, the mechanisms through which the microbiome exerts its influence on disease remain largely unclear. In this study, we first introduce a structural equation model to delineate the pathways connecting t…

    Submitted 16 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  18. arXiv:2402.07314  [pdf, ps, other]

    cs.LG stat.ML

    Online Iterative Reinforcement Learning from Human Feedback with General Preference Model

    Authors: Chenlu Ye, Wei Xiong, Yuheng Zhang, Hanze Dong, Nan Jiang, Tong Zhang

    Abstract: We investigate Reinforcement Learning from Human Feedback (RLHF) in the context of a general preference oracle. In particular, we do not assume the existence of a reward function and an oracle preference signal drawn from the Bradley-Terry model as most of the prior works do. We consider a standard mathematical formulation, the reverse-KL regularized minimax game between two LLMs for RLHF under ge…

    Submitted 12 November, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: RLHF, Preference Learning, Alignment for LLMs

  19. arXiv:2312.11456  [pdf, other]

    cs.LG cs.AI stat.ML

    Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

    Authors: Wei Xiong, Hanze Dong, Chenlu Ye, Ziqi Wang, Han Zhong, Heng Ji, Nan Jiang, Tong Zhang

    Abstract: This paper studies the alignment process of generative models with Reinforcement Learning from Human Feedback (RLHF). We first identify the primary challenges of existing popular methods like offline PPO and offline DPO as lacking in strategical exploration of the environment. Then, to understand the mathematical principle of RLHF, we consider a standard mathematical formulation, the reverse-KL re…

    Submitted 1 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 53 pages; theoretical study and algorithmic design of iterative RLHF and DPO

  20. arXiv:2311.13180  [pdf, other]

    stat.ML cs.LG

    Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

    Authors: Jianqing Fan, Zhaoran Wang, Zhuoran Yang, Chenlu Ye

    Abstract: We study high-dimensional multi-armed contextual bandits with batched feedback where the $T$ steps of online interactions are divided into $L$ batches. Specifically, each batch collects data according to a policy that depends on previous batches and the rewards are revealed only at the end of the batch. Such a feedback structure is popular in applications such as personalized medicine and online ad…

    Submitted 24 November, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

  21. arXiv:2310.02712  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    ED-NeRF: Efficient Text-Guided Editing of 3D Scene with Latent Space NeRF

    Authors: Jangho Park, Gihyun Kwon, Jong Chul Ye

    Abstract: Recently, there has been a significant advancement in text-to-image diffusion models, leading to groundbreaking performance in 2D image generation. These advancements have been extended to 3D models, enabling the generation of novel 3D objects from textual descriptions. This has evolved into NeRF editing methods, which allow the manipulation of existing 3D objects through textual conditioning. How…

    Submitted 21 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; Project Page: https://jhq1234.github.io/ed-nerf.github.io/

  22. arXiv:2310.01110  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Prompt-tuning latent diffusion models for inverse problems

    Authors: Hyungjin Chung, Jong Chul Ye, Peyman Milanfar, Mauricio Delbracio

    Abstract: We propose a new method for solving imaging inverse problems using text-to-image latent diffusion models as general priors. Existing methods using latent diffusion models for inverse problems typically rely on simple null text prompts, which can lead to suboptimal performance. To address this limitation, we introduce a method for prompt tuning, which jointly optimizes the text embedding on-the-fly…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 22 pages, 10 figures

  23. arXiv:2310.01107  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models

    Authors: Hyeonho Jeong, Jong Chul Ye

    Abstract: Recent endeavors in video editing have showcased promising results in single-attribute editing or style transfer tasks, either by training text-to-video (T2V) models on text-video data or adopting training-free methods. However, when confronted with the complexities of multi-attribute editing scenarios, they exhibit shortcomings such as omitting or overlooking intended attribute changes, modifying…

    Submitted 24 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024, Project Page: http://ground-a-video.github.io

  24. arXiv:2309.02476  [pdf, other]

    stat.ML cs.LG

    Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning

    Authors: Yong Lin, Chen Liu, Chenlu Ye, Qing Lian, Yuan Yao, Tong Zhang

    Abstract: Modern deep learning heavily relies on large labeled datasets, which often come with high costs in terms of both manual labeling and computational resources. To mitigate these challenges, researchers have explored the use of informative subset selection techniques, including coreset selection and active learning. Specifically, coreset selection involves sampling data with both input ($\bx$) and o…

    Submitted 5 September, 2023; originally announced September 2023.

  25. arXiv:2308.07418  [pdf, other]

    cs.LG stat.ML

    Locally Adaptive and Differentiable Regression

    Authors: Mingxuan Han, Varun Shankar, Jeff M Phillips, Chenglong Ye

    Abstract: Over-parameterized models like deep nets and random forests have become very popular in machine learning. However, the natural goals of continuity and differentiability, common in regression models, are now often ignored in modern overparametrized, locally-adaptive models. We propose a general framework to construct a global continuous and differentiable model based on a weighted average of locall…

    Submitted 12 October, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

    Journal ref: Journal of Machine Learning for Modeling and Computing 2023

  26. arXiv:2306.04396  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: Diffusion models have shown significant progress in image translation tasks recently. However, due to their stochastic nature, there's often a trade-off between style transformation and content preservation. Current strategies aim to disentangle style and content, preserving the source image's structure while successfully transitioning from a source to a target domain under text or one-shot image…

    Submitted 7 June, 2023; originally announced June 2023.

  27. arXiv:2305.19809  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Direct Diffusion Bridge using Data Consistency for Inverse Problems

    Authors: Hyungjin Chung, Jeongsol Kim, Jong Chul Ye

    Abstract: Diffusion model-based inverse problem solvers have shown impressive performance, but are limited in speed, mostly as they require reverse diffusion sampling starting from noise. Several recent works have tried to alleviate this problem by building a diffusion process, directly bridging the clean and the corrupted for specific inverse problems. In this paper, we first unify these existing works und…

    Submitted 24 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready. 16 pages, 6 figures

  28. arXiv:2305.16375  [pdf, other]

    cs.LG cs.AI stat.ML

    Data Topology-Dependent Upper Bounds of Neural Network Widths

    Authors: Sangmin Lee, Jong Chul Ye

    Abstract: This paper investigates the relationship between the universal approximation property of deep neural networks and topological characteristics of datasets. Our primary contribution is to introduce data topology-dependent upper bounds on the network width. Specifically, we first show that a three-layer neural network, applying a ReLU activation function and max pooling, can be designed to approximat…

    Submitted 25 May, 2023; originally announced May 2023.

  29. arXiv:2305.15086  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Unpaired Image-to-Image Translation via Neural Schrödinger Bridge

    Authors: Beomsu Kim, Gihyun Kwon, Kwanyoung Kim, Jong Chul Ye

    Abstract: Diffusion models are a powerful class of generative models which simulate stochastic differential equations (SDEs) to generate data from noise. While diffusion models have achieved remarkable progress, they have limitations in unpaired image-to-image (I2I) translation tasks due to the Gaussian prior assumption. Schrödinger Bridge (SB), which learns an SDE to translate between two arbitrary distrib…

    Submitted 2 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: ICLR 2024

  30. arXiv:2305.00520  [pdf, other]

    stat.ML cs.LG

    The ART of Transfer Learning: An Adaptive and Robust Pipeline

    Authors: Boxiang Wang, Yunan Wu, Chenglong Ye

    Abstract: Transfer learning is an essential tool for improving the performance of primary tasks by leveraging information from auxiliary data resources. In this work, we propose Adaptive Robust Transfer Learning (ART), a flexible pipeline of performing transfer learning with generic machine learning algorithms. We establish the non-asymptotic learning theory of ART, providing a provable theoretical guarante…

    Submitted 30 April, 2023; originally announced May 2023.

  31. arXiv:2303.08622  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer

    Authors: Serin Yang, Hyunmin Hwang, Jong Chul Ye

    Abstract: Diffusion models have shown great promise in text-guided image style transfer, but there is a trade-off between style transformation and content preservation due to their stochastic nature. Existing methods require computationally expensive fine-tuning of diffusion models or an additional neural network. To address this, here we propose a zero-shot contrastive loss for diffusion models that doesn't r…

    Submitted 12 April, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  32. arXiv:2303.05754  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems

    Authors: Hyungjin Chung, Suhyeon Lee, Jong Chul Ye

    Abstract: Krylov subspace, which is generated by multiplying a given vector by the matrix of a linear transformation and its successive powers, has been extensively studied in classical optimization literature to design algorithms that converge quickly for large linear inverse problems. For example, the conjugate gradient method (CG), one of the most popular Krylov subspace methods, is based on the idea of…

    Submitted 19 February, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: ICLR 2024; 28 pages, 9 figures

  33. arXiv:2302.03900  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models

    Authors: Hyeonho Jeong, Gihyun Kwon, Jong Chul Ye

    Abstract: Recent advancements in large scale text-to-image models have opened new possibilities for guiding the creation of images through human-devised natural language. However, while prior literature has primarily focused on the generation of individual images, it is essential to consider the capability of these models to ensure coherency within a sequence of images to fulfill the demands of real-world a…

    Submitted 8 February, 2023; originally announced February 2023.

  34. arXiv:2301.12334  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Don't Play Favorites: Minority Guidance for Diffusion Models

    Authors: Soobin Um, Suhyeon Lee, Jong Chul Ye

    Abstract: We explore the problem of generating minority samples using diffusion models. The minority samples are instances that lie on low-density regions of a data manifold. Generating a sufficient number of such minority instances is important, since they often contain some unique attributes of the data. However, the conventional generation process of the diffusion models mostly yields majority samples (t…

    Submitted 26 February, 2024; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: ICLR 2024

  35. arXiv:2301.12171  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

    Authors: Kwanyoung Kim, Yujin Oh, Jong Chul Ye

    Abstract: Recent success of large-scale Contrastive Language-Image Pre-training (CLIP) has led to great promise in zero-shot semantic segmentation by transferring image-text aligned knowledge to pixel-level classification. However, existing methods usually require an additional image encoder or retraining/tuning the CLIP module. Here, we propose a novel Zero-shot segmentation with Optimal Transport (ZegOT)…

    Submitted 30 May, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: 18 pages, 8 figures

  36. arXiv:2301.12003  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Minimizing Trajectory Curvature of ODE-based Generative Models

    Authors: Sangyun Lee, Beomsu Kim, Jong Chul Ye

    Abstract: Recent ODE/SDE-based generative models, such as diffusion models, rectified flows, and flow matching, define a generative process as a time reversal of a fixed forward process. Even though these models show impressive performance on large-scale datasets, numerical simulation requires multiple evaluations of a neural network, leading to a slow sampling speed. We attribute the reason to the high cur…

    Submitted 25 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  37. arXiv:2212.05949  [pdf, ps, other]

    stat.ML cs.LG

    Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

    Authors: Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang

    Abstract: Despite the significant interest and progress in reinforcement learning (RL) problems with adversarial corruption, current works are either confined to the linear setting or lead to an undesired $\tilde{O}(\sqrt{T}\zeta)$ regret bound, where $T$ is the number of rounds and $\zeta$ is the total amount of corruption. In this paper, we consider the contextual bandit with general function approximation and pr…

    Submitted 10 February, 2024; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: We study the corruption-robust MDPs and contextual bandits with general function approximation

    Journal ref: ICML 2023

  38. arXiv:2211.10656  [pdf, other]

    cs.CV cs.LG stat.ML

    Parallel Diffusion Models of Operator and Image for Blind Inverse Problems

    Authors: Hyungjin Chung, Jeongsol Kim, Sehui Kim, Jong Chul Ye

    Abstract: Diffusion model-based inverse problem solvers have demonstrated state-of-the-art performance in cases where the forward operator is known (i.e. non-blind). However, the applicability of the method to blind inverse problems has yet to be explored. In this work, we show that we can indeed solve a family of blind inverse problems by constructing another diffusion prior for the forward operator. Speci…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 25 pages, 13 figures

  39. arXiv:2210.05248  [pdf, other]

    cs.LG cs.AI stat.ML

    Self-supervised debiasing using low rank regularization

    Authors: Geon Yeong Park, Chanyong Jung, Sangmin Lee, Jong Chul Ye, Sang Wan Lee

    Abstract: Spurious correlations can cause strong biases in deep neural networks, impairing generalization ability. While most existing debiasing methods require full supervision on either spurious attributes or target labels, training a debiased model from a limited amount of both annotations is still an open question. To address this issue, we investigate an interesting phenomenon using the spectral analys…

    Submitted 8 October, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  40. arXiv:2210.05247  [pdf, other]

    cs.LG cs.AI stat.ML

    Training Debiased Subnetworks with Contrastive Weight Pruning

    Authors: Geon Yeong Park, Sangmin Lee, Sang Wan Lee, Jong Chul Ye

    Abstract: Neural networks are often biased to spuriously correlated features that provide misleading statistical evidence that does not generalize. This raises an interesting question: "Does an optimal unbiased functional subnetwork exist in a severely biased network? If so, how to extract such a subnetwork?" While empirical evidence has been accumulated about the existence of such unbiased subnetworks, thes…

    Submitted 26 June, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: CVPR 2023, code: https://github.com/ParkGeonYeong/DCWP

  41. arXiv:2209.15264  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Diffusion-based Image Translation using Disentangled Style and Content Representation

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: Diffusion-based image translation guided by semantic texts or a single target image has enabled flexible style transfer which is not limited to the specific domains. Unfortunately, due to the stochastic nature of diffusion models, it is often difficult to maintain the original content of the image during the reverse diffusion. To address this, here we present a novel diffusion-based unsupervised i…

    Submitted 1 February, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 camera ready

  42. arXiv:2209.14687  [pdf, other]

    stat.ML cs.AI cs.CV cs.LG

    Diffusion Posterior Sampling for General Noisy Inverse Problems

    Authors: Hyungjin Chung, Jeongsol Kim, Michael T. Mccann, Marc L. Klasky, Jong Chul Ye

    Abstract: Diffusion models have been recently studied as powerful generative inverse problem solvers, owing to their high quality reconstructions and the ease of combining existing iterative solvers. However, most works focus on solving simple linear inverse problems in noiseless settings, which significantly under-represents the complexity of real-world problems. In this work, we extend diffusion solvers t…

    Submitted 20 May, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 spotlight

  43. arXiv:2208.01864  [pdf, other]

    cs.CV cs.LG stat.ML

    Pyramidal Denoising Diffusion Probabilistic Models

    Authors: Dohoon Ryu, Jong Chul Ye

    Abstract: Recently, diffusion models have demonstrated impressive image generation performances, and have been extensively studied in various computer vision tasks. Unfortunately, training and evaluating diffusion models consume a lot of time and computational resources. To address this problem, here we present a novel pyramidal diffusion model that can generate high resolution images starting from much coar…

    Submitted 30 September, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

  44. arXiv:2206.11944  [pdf, ps, other]

    stat.ME

    High-dimensional Variable Screening via Conditional Martingale Difference Divergence

    Authors: Lei Fang, Qingcong Yuan, Xiangrong Yin, Chenglong Ye

    Abstract: Variable screening has been a useful research area that deals with ultrahigh-dimensional data. When there exist both marginally and jointly dependent predictors to the response, existing methods such as conditional screening or iterative screening often suffer from instability against the selection of the conditional set or the computational burden, respectively. In this article, we propose a new…

    Submitted 6 July, 2023; v1 submitted 23 June, 2022; originally announced June 2022.

  45. arXiv:2206.00941  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Improving Diffusion Models for Inverse Problems using Manifold Constraints

    Authors: Hyungjin Chung, Byeongsu Sim, Dohoon Ryu, Jong Chul Ye

    Abstract: Recently, diffusion models have been used to solve various inverse problems in an unsupervised manner with appropriate modifications to the sampling process. However, the current solvers, which recursively apply a reverse diffusion step followed by a projection-based measurement consistency step, often produce suboptimal results. By studying the generative sampling path, here we show that current…

    Submitted 20 May, 2024; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready; 29 pages, 16 figures

  46. arXiv:2203.09301  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    One-Shot Adaptation of GAN in Just One CLIP

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: There are many recent research efforts to fine-tune a pre-trained generator with a few target images to generate images of a novel domain. Unfortunately, these methods often suffer from overfitting or under-fitting when fine-tuned with a single target image. To address this, here we present a novel single-shot GAN adaptation method through unified CLIP space manipulations. Specifically, our model…

    Submitted 30 January, 2023; v1 submitted 17 March, 2022; originally announced March 2022.

  47. arXiv:2202.05510  [pdf, other]

    cs.LG cs.AI stat.ML

    Support Vectors and Gradient Dynamics of Single-Neuron ReLU Networks

    Authors: Sangmin Lee, Byeongsu Sim, Jong Chul Ye

    Abstract: Understanding implicit bias of gradient descent for generalization capability of ReLU networks has been an important research topic in machine learning research. Unfortunately, even for a single ReLU neuron trained with the square loss, it was recently shown impossible to characterize the implicit regularization in terms of a norm of model parameters (Vardi & Shamir, 2021). In order to close the g…

    Submitted 13 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  48. arXiv:2112.05146  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction

    Authors: Hyungjin Chung, Byeongsu Sim, Jong Chul Ye

    Abstract: Diffusion models have recently attained significant interest within the community owing to their strong performance as generative models. Furthermore, their application to inverse problems has demonstrated state-of-the-art performance. Unfortunately, diffusion models have a critical downside: they are inherently slow to sample from, needing a few thousand steps of iteration to generate images from p…

    Submitted 19 March, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022

  49. arXiv:2112.03696  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Noise Distribution Adaptive Self-Supervised Image Denoising using Tweedie Distribution and Score Matching

    Authors: Kwanyoung Kim, Taesung Kwon, Jong Chul Ye

    Abstract: Tweedie distributions are a special case of exponential dispersion models, which are often used in classical statistics as distributions for generalized linear models. Here, we reveal that Tweedie distributions also play key roles in the modern deep learning era, leading to a distribution-independent self-supervised image denoising formula without clean reference images. Specifically, by combining wit…

    Submitted 4 December, 2021; originally announced December 2021.

  50. arXiv:2106.09246  [pdf, other]

    cs.CV cs.LG stat.ML

    Federated CycleGAN for Privacy-Preserving Image-to-Image Translation

    Authors: Joonyoung Song, Jong Chul Ye

    Abstract: Unsupervised image-to-image translation methods such as CycleGAN learn to convert images from one domain to another using unpaired training data sets from different domains. Unfortunately, these approaches still require centrally collected unpaired records, potentially violating privacy and security issues. Although the recent federated learning (FL) allows a neural network to be trained without d…

    Submitted 17 June, 2021; originally announced June 2021.