Showing 1–8 of 8 results for author: Yunis, D

  1. arXiv:2503.19574  [pdf, other]

    cs.CL cs.IR

    Context-Efficient Retrieval with Factual Decomposition

    Authors: Yanhong Li, David Yunis, David McAllester, Jiawei Zhou

    Abstract: There has recently been considerable interest in incorporating information retrieval into large language models (LLMs). Retrieval from a dynamically expanding external corpus of text allows a model to incorporate current events and can be viewed as a form of episodic memory. Here we demonstrate that pre-processing the external corpus into semi-structured "atomic facts" makes retrieval more effic…

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: NAACL 2025 Main Conference

  2. arXiv:2410.21597  [pdf, other]

    cs.CL cs.AI cs.LG

    Reducing the Scope of Language Models

    Authors: David Yunis, Siyu Huo, Chulaka Gunasekara, Danish Contractor

    Abstract: We now deploy language models in a wide variety of user-facing applications. Typically, these deployments have some specific purpose, like answering questions about documentation or acting as coding assistants, but they require general language understanding. Under these circumstances these models should not be able to answer irrelevant requests such as, poetry generation or questions about physic…

    Submitted 17 April, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  3. arXiv:2408.11804  [pdf, other]

    cs.LG cs.AI

    Approaching Deep Learning through the Spectral Dynamics of Weights

    Authors: David Yunis, Kumar Kshitij Patel, Samuel Wheeler, Pedro Savarese, Gal Vardi, Karen Livescu, Michael Maire, Matthew R. Walter

    Abstract: We propose an empirical approach centered on the spectral dynamics of weights -- the behavior of singular values and vectors during optimization -- to unify and clarify several phenomena in deep learning. We identify a consistent bias in optimization across various experiments, from small-scale "grokking" to large-scale tasks like image classification with ConvNets, image generation with UNets,…

    Submitted 21 August, 2024; originally announced August 2024.

  4. arXiv:2312.06716  [pdf, other]

    cs.CV

    Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations

    Authors: Xiao Zhang, David Yunis, Michael Maire

    Abstract: We present an approach for analyzing grouping information contained within a neural network's activations, permitting extraction of spatial layout and semantic segmentation from the behavior of large pre-trained vision models. Unlike prior work, our method conducts a holistic analysis of a network's activation state, leveraging features from all layers and obviating the need to guess which part of…

    Submitted 20 June, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted to CVPR2024 (Highlight)

  5. arXiv:2309.04459  [pdf, other]

    cs.LG cs.AI cs.RO

    Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning

    Authors: David Yunis, Justin Jung, Falcon Dai, Matthew Walter

    Abstract: Exploration in sparse-reward reinforcement learning is difficult due to the requirement of long, coordinated sequences of actions in order to achieve any reward. Moreover, in continuous action spaces there are an infinite number of possible actions, which only increases the difficulty of exploration. One class of methods designed to address these issues forms temporally extended actions, often cal…

    Submitted 30 October, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted to NeurIPS 2024

  6. arXiv:2306.17840  [pdf, other]

    cs.RO cs.CL

    Statler: State-Maintaining Language Models for Embodied Reasoning

    Authors: Takuma Yoneda, Jiading Fang, Peng Li, Huanyu Zhang, Tianchong Jiang, Shengjie Lin, Ben Picker, David Yunis, Hongyuan Mei, Matthew R. Walter

    Abstract: There has been a significant research interest in employing large language models to empower intelligent robots with complex reasoning. Existing work focuses on harnessing their abilities to reason about the histories of their actions and observations. In this paper, we explore a new dimension in which large language models may benefit robotics planning. In particular, we propose Statler, a framew…

    Submitted 20 May, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted at ICRA 2024; Project website: https://statler-lm.github.io/

  7. arXiv:1801.01432  [pdf, other]

    cs.RO cs.LG

    Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning

    Authors: Charles Schaff, David Yunis, Ayan Chakrabarti, Matthew R. Walter

    Abstract: The physical design of a robot and the policy that controls its motion are inherently coupled, and should be determined according to the task and environment. In an increasing number of applications, data-driven and learning-based approaches, such as deep reinforcement learning, have proven effective at designing control policies. For most tasks, the only way to evaluate a physical design with res…

    Submitted 14 September, 2018; v1 submitted 4 January, 2018; originally announced January 2018.

  8. arXiv:1703.08612  [pdf, other]

    cs.RO cs.LG

    Jointly Optimizing Placement and Inference for Beacon-based Localization

    Authors: Charles Schaff, David Yunis, Ayan Chakrabarti, Matthew R. Walter

    Abstract: The ability of robots to estimate their location is crucial for a wide variety of autonomous operations. In settings where GPS is unavailable, measurements of transmissions from fixed beacons provide an effective means of estimating a robot's location as it navigates. The accuracy of such a beacon-based localization system depends both on how beacons are distributed in the environment, and how the…

    Submitted 20 September, 2017; v1 submitted 24 March, 2017; originally announced March 2017.

    Comments: Appeared at 2017 International Conference on Intelligent Robots and Systems (IROS)