Skip to main content

Showing 1–6 of 6 results for author: Jiang, E H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.20995  [pdf, other

    cs.CL

    Multi-head Reward Aggregation Guided by Entropy

    Authors: Xiaomin Li, Xupeng Chen, Jingxuan Fan, Eric Hanchen Jiang, Mingye Gao

    Abstract: Aligning large language models (LLMs) with safety guidelines typically involves reinforcement learning from human feedback (RLHF), relying on human-generated preference annotations. However, assigning consistent overall quality ratings is challenging, prompting recent research to shift towards detailed evaluations based on multiple specific safety criteria. This paper uncovers a consistent observa… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  2. arXiv:2411.18440  [pdf, other

    astro-ph.GA cs.CV

    Learning the Evolution of Physical Structure of Galaxies via Diffusion Models

    Authors: Andrew Lizarraga, Eric Hanchen Jiang, Jacob Nowack, Yun Qi Li, Ying Nian Wu, Bernie Boscoe, Tuan Do

    Abstract: In astrophysics, understanding the evolution of galaxies in primarily through imaging data is fundamental to comprehending the formation of the Universe. This paper introduces a novel approach to conditioning Denoising Diffusion Probabilistic Models (DDPM) on redshifts for generating galaxy images. We explore whether this advanced generative model can accurately capture the physical characteristic… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  3. arXiv:2411.17472  [pdf, other

    cs.CV cs.LG stat.ML

    Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory

    Authors: Eric Hanchen Jiang, Yasi Zhang, Zhi Zhang, Yixin Wan, Andrew Lizarraga, Shufan Li, Ying Nian Wu

    Abstract: Text-to-image (T2I) diffusion models have revolutionized generative modeling by producing high-fidelity, diverse, and visually realistic images from textual prompts. Despite these advances, existing models struggle with complex prompts involving multiple objects and attributes, often misaligning modifiers with their corresponding nouns or neglecting certain elements. Recent attention-based methods… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

  4. arXiv:2411.00401  [pdf, other

    cs.LG cs.AI

    Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory

    Authors: Zhi Zhang, Chris Chow, Yasi Zhang, Yanchao Sun, Haochen Zhang, Eric Hanchen Jiang, Han Liu, Furong Huang, Yuchen Cui, Oscar Hernan Madrid Padilla

    Abstract: Lifelong reinforcement learning (RL) has been developed as a paradigm for extending single-task RL to more realistic, dynamic settings. In lifelong RL, the "life" of an RL agent is modeled as a stream of tasks drawn from a task distribution. We propose EPIC (\underline{E}mpirical \underline{P}AC-Bayes that \underline{I}mproves \underline{C}ontinuously), a novel algorithm designed for lifelong RL u… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  5. arXiv:2410.11359  [pdf, other

    cs.LG cs.RO stat.ML

    DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting

    Authors: Eric Hanchen Jiang, Zhi Zhang, Dinghuai Zhang, Andrew Lizarraga, Chenheng Xu, Yasi Zhang, Siyan Zhao, Zhengjie Xu, Peiyu Yu, Yuer Tang, Deqian Kong, Ying Nian Wu

    Abstract: Advancements in reinforcement learning have led to the development of sophisticated models capable of learning complex decision-making tasks. However, efficiently integrating world models with decision transformers remains a challenge. In this paper, we introduce a novel approach that combines the Dreamer algorithm's ability to generate anticipatory trajectories with the adaptive learning strength… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  6. arXiv:2312.03216  [pdf, other

    cs.LG cs.AI

    SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning

    Authors: Eric H. Jiang, Andrew Lizarraga

    Abstract: In this paper, we introduce a novel algorithm - the Skill-Driven Skill Recombination Algorithm (SDSRA) - an innovative framework that significantly enhances the efficiency of achieving maximum entropy in reinforcement learning tasks. We find that SDSRA achieves faster convergence compared to the traditional Soft Actor-Critic (SAC) algorithm and produces improved policies. By integrating skill-base… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.