Showing 1–18 of 18 results for author: Chen, A S

  1. arXiv:2506.07505  [pdf, ps, other]

    cs.LG cs.AI

    Reinforcement Learning via Implicit Imitation Guidance

    Authors: Perry Dong, Alec M. Lessing, Annie S. Chen, Chelsea Finn

    Abstract: We study the problem of sample efficient reinforcement learning, where prior data such as demonstrations are provided for initialization in lieu of a dense reward signal. A natural approach is to incorporate an imitation learning objective, either as regularization during training or to acquire a reference policy. However, imitation learning objectives can ultimately degrade long-term performance,…

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2503.18419  [pdf, other]

    cs.HC cs.AI cs.ET

    Generative AI in Knowledge Work: Design Implications for Data Navigation and Decision-Making

    Authors: Bhada Yun, Dana Feng, Ace S. Chen, Afshin Nikzad, Niloufar Salehi

    Abstract: Our study of 20 knowledge workers revealed a common challenge: the difficulty of synthesizing unstructured information scattered across multiple platforms to make informed decisions. Drawing on their vision of an ideal knowledge synthesis tool, we developed Yodeai, an AI-enabled system, to explore both the opportunities and limitations of AI in knowledge work. Through a user study with 16 product…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: Accepted to CHI '25 (Conference on Human Factors in Computing Systems), to appear April 26-May 1, 2025, Yokohama, Japan

    ACM Class: H.5.m

  3. arXiv:2503.03707  [pdf, other]

    cs.RO cs.AI cs.LG

    Curating Demonstrations using Online Experience

    Authors: Annie S. Chen, Alec M. Lessing, Yuejiang Liu, Chelsea Finn

    Abstract: Many robot demonstration datasets contain heterogeneous demonstrations of varying quality. This heterogeneity may benefit policy pre-training, but can hinder robot performance when used with a final imitation learning objective. In particular, some strategies in the data may be less reliable than others or may be underrepresented in the data, leading to poor performance when such strategies are sa…

    Submitted 5 March, 2025; originally announced March 2025.

  4. arXiv:2409.19817  [pdf, other]

    cs.LG cs.AI cs.CL

    Calibrating Language Models with Adaptive Temperature Scaling

    Authors: Johnathan Xie, Annie S. Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn

    Abstract: The effectiveness of large language models (LLMs) is not only measured by their ability to generate accurate outputs but also by their calibration: how well their confidence scores reflect the probability of their outputs being correct. While unsupervised pre-training has been shown to yield LLMs with well-calibrated conditional probabilities, recent studies have shown that after fine-tuning with r…

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: EMNLP 2024

  5. arXiv:2408.04752  [pdf, other]

    cs.HC

    A Multi-Level Task Framework for Event Sequence Analysis

    Authors: Kazi Tasnim Zinat, Saimadhav Naga Sakhamuri, Aaron Sun Chen, Zhicheng Liu

    Abstract: Despite the development of numerous visual analytics tools for event sequence data across various domains, including but not limited to healthcare, digital marketing, and user behavior analysis, comparing these domain-specific investigations and transferring the results to new datasets and problem areas remain challenging. Task abstractions can help us go beyond domain-specific details, but existi…

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Task Abstraction, Event Sequence Data

  6. arXiv:2407.18397  [pdf, other]

    cs.LG stat.ML

    Gaussian Process Kolmogorov-Arnold Networks

    Authors: Andrew Siyuan Chen

    Abstract: In this paper, we introduce a probabilistic extension to Kolmogorov Arnold Networks (KANs) by incorporating Gaussian Process (GP) as non-linear neurons, which we refer to as GP-KAN. A fully analytical approach to handling the output distribution of one GP as an input to another GP is achieved by considering the function inner product of a GP function sample with the input distribution. These GP ne…

    Submitted 17 August, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: Related code: https://github.com/siyuan0/gp-kan. Under review at IEEE

  7. arXiv:2407.02666  [pdf, other]

    cs.RO cs.AI

    Commonsense Reasoning for Legged Robot Adaptation with Vision-Language Models

    Authors: Annie S. Chen, Alec M. Lessing, Andy Tang, Govind Chada, Laura Smith, Sergey Levine, Chelsea Finn

    Abstract: Legged robots are physically capable of navigating a diverse variety of environments and overcoming a wide range of obstructions. For example, in a search and rescue mission, a legged robot could climb over debris, crawl through gaps, and navigate out of dead ends. However, the robot's controller needs to respond intelligently to such varied obstacles, and this requires handling unexpected and unu…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 27 pages

  8. arXiv:2402.14789  [pdf, other]

    cs.LG cs.AI

    Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised Learning

    Authors: Johnathan Xie, Yoonho Lee, Annie S. Chen, Chelsea Finn

    Abstract: Self-supervised learning excels in learning representations from large amounts of unlabeled data, demonstrating success across multiple data modalities. Yet, extending self-supervised learning to new modalities is non-trivial because the specifics of existing methods are tailored to each domain, such as domain-specific augmentations which reflect the invariances in the target task. While masked mo…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  9. arXiv:2402.10893  [pdf, other]

    cs.LG cs.AI cs.CL

    RLVF: Learning from Verbal Feedback without Overgeneralization

    Authors: Moritz Stephan, Alexander Khazatsky, Eric Mitchell, Annie S Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn

    Abstract: The diversity of contexts in which large language models (LLMs) are deployed requires the ability to modify or customize default model behaviors to incorporate nuanced requirements and preferences. A convenient interface to specify such model adjustments is high-level verbal feedback, such as "Don't use emojis when drafting emails to my boss." However, while writing high-level feedback is far simp…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 9 pages, 9 figures

  10. arXiv:2311.01059  [pdf, other]

    cs.RO cs.LG

    Adapt On-the-Go: Behavior Modulation for Single-Life Robot Deployment

    Authors: Annie S. Chen, Govind Chada, Laura Smith, Archit Sharma, Zipeng Fu, Sergey Levine, Chelsea Finn

    Abstract: To succeed in the real world, robots must cope with situations that differ from those seen during training. We study the problem of adapting on-the-fly to such novel scenarios during deployment, by drawing upon a diverse repertoire of previously learned behaviors. Our approach, RObust Autonomous Modulation (ROAM), introduces a mechanism based on the perceived value of pre-trained behaviors to selec…

    Submitted 8 December, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  11. arXiv:2306.11120  [pdf, other]

    cs.LG cs.AI

    Confidence-Based Model Selection: When to Take Shortcuts for Subpopulation Shifts

    Authors: Annie S. Chen, Yoonho Lee, Amrith Setlur, Sergey Levine, Chelsea Finn

    Abstract: Effective machine learning models learn both robust features that directly determine the outcome of interest (e.g., an object with wheels is more likely to be a car), and shortcut features (e.g., an object on a road is more likely to be a car). The latter can be a source of error under distributional shift, when the correlations change at test-time. The prevailing sentiment in the robustness liter…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 15 pages, 5 figures

  12. arXiv:2302.12766  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Language-Driven Representation Learning for Robotics

    Authors: Siddharth Karamcheti, Suraj Nair, Annie S. Chen, Thomas Kollar, Chelsea Finn, Dorsa Sadigh, Percy Liang

    Abstract: Recent work in visual representation learning for robotics demonstrates the viability of learning from large video datasets of humans performing everyday tasks. Leveraging methods such as masked autoencoding and contrastive learning, these representations exhibit strong transfer to policy learning for visuomotor control. But, robot learning encompasses a diverse set of problems beyond control incl…

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 30 Pages, 15 Figures

  13. arXiv:2302.05441  [pdf, other]

    cs.LG cs.AI

    Project and Probe: Sample-Efficient Domain Adaptation by Interpolating Orthogonal Features

    Authors: Annie S. Chen, Yoonho Lee, Amrith Setlur, Sergey Levine, Chelsea Finn

    Abstract: Transfer learning with a small amount of target data is an effective and common approach to adapting a pre-trained model to distribution shifts. In some situations, target data labels may be expensive to obtain, so we may only have access to a limited number of target data points. To make the most of a very small target dataset, we propose a lightweight, sample-efficient approach that learns a div…

    Submitted 25 May, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: 22 pages, 9 figures

  14. arXiv:2210.11466  [pdf, other]

    cs.LG cs.AI

    Surgical Fine-Tuning Improves Adaptation to Distribution Shifts

    Authors: Yoonho Lee, Annie S. Chen, Fahim Tajwar, Ananya Kumar, Huaxiu Yao, Percy Liang, Chelsea Finn

    Abstract: A common approach to transfer learning under distribution shift is to fine-tune the last few layers of a pre-trained model, preserving learned features while also adapting to the new task. This paper shows that in such settings, selectively fine-tuning a subset of layers (which we term surgical fine-tuning) matches or outperforms commonly used fine-tuning approaches. Moreover, the type of distribu…

    Submitted 6 June, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: ICLR 2023

  15. arXiv:2210.08863  [pdf, other]

    cs.LG cs.AI

    You Only Live Once: Single-Life Reinforcement Learning

    Authors: Annie S. Chen, Archit Sharma, Sergey Levine, Chelsea Finn

    Abstract: Reinforcement learning algorithms are typically designed to learn a performant policy that can repeatedly and autonomously complete a task, usually starting from scratch. However, in many real-world situations, the goal might not be to learn a policy that can do the task repeatedly, but simply to perform a new task successfully once in a single trial. For example, imagine a disaster relief robot t…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 17 pages

    Journal ref: Neural Information Processing Systems (NeurIPS) 2022

  16. arXiv:2107.09044  [pdf, other]

    cs.LG cs.AI cs.CY stat.ML

    Just Train Twice: Improving Group Robustness without Training Group Information

    Authors: Evan Zheran Liu, Behzad Haghgoo, Annie S. Chen, Aditi Raghunathan, Pang Wei Koh, Shiori Sagawa, Percy Liang, Chelsea Finn

    Abstract: Standard training via empirical risk minimization (ERM) can produce models that achieve high accuracy on average but low accuracy on certain groups, especially in the presence of spurious correlations between the input and label. Prior approaches that achieve high worst-group accuracy, like group distributionally robust optimization (group DRO), require expensive group annotations for each training…

    Submitted 27 September, 2021; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: International Conference on Machine Learning (ICML), 2021

  17. arXiv:2103.16817  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos

    Authors: Annie S. Chen, Suraj Nair, Chelsea Finn

    Abstract: We are motivated by the goal of generalist robots that can complete a wide range of tasks across many environments. Critical to this is the robot's ability to acquire some metric of task success or reward, which is necessary for reinforcement learning, planning, or knowing when to ask for help. For a general-purpose robot operating in the real world, this reward function must also be able to gener…

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: https://sites.google.com/view/dvd-human-videos

  18. arXiv:2010.11917  [pdf, other]

    cs.RO cs.AI cs.LG

    Batch Exploration with Examples for Scalable Robotic Reinforcement Learning

    Authors: Annie S. Chen, HyunJi Nam, Suraj Nair, Chelsea Finn

    Abstract: Learning from diverse offline datasets is a promising path towards learning general purpose robotic agents. However, a core challenge in this paradigm lies in collecting large amounts of meaningful data, while not depending on a human in the loop for data collection. One way to address this challenge is through task-agnostic exploration, where an agent attempts to explore without a task-specific r…

    Submitted 23 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 11 Pages, 11 Figures

    Journal ref: IEEE Robotics and Automation Letters (Volume: 6, Issue: 3, July 2021)