Showing 1–49 of 49 results for author: Walter, M R

  1. arXiv:2506.03594  [pdf, ps, other]

    cs.GR cs.CV cs.LG cs.MM cs.RO

    SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting

    Authors: Shengjie Lin, Jiading Fang, Muhammad Zubair Irshad, Vitor Campagnolo Guizilini, Rares Andrei Ambrus, Greg Shakhnarovich, Matthew R. Walter

    Abstract: Reconstructing articulated objects prevalent in daily environments is crucial for applications in augmented/virtual reality and robotics. However, existing methods face scalability limitations (requiring 3D supervision or costly annotations), robustness issues (being susceptible to local optima), and rendering shortcomings (lacking speed or photorealism). We introduce SplArt, a self-supervised, ca…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: https://github.com/ripl/splart

  2. arXiv:2505.16892  [pdf, other]

    cs.RO

    FlashBack: Consistency Model-Accelerated Shared Autonomy

    Authors: Luzhe Sun, Jingtian Ji, Xiangshan Tan, Matthew R. Walter

    Abstract: Shared autonomy is an enabling technology that provides users with control authority over robots that would otherwise be difficult if not impossible to directly control. Yet, standard methods make assumptions that limit their adoption in practice, for example, prior knowledge of the user's goals or the objective (i.e., reward) function that they wish to optimize, knowledge of the user's policy, or…

    Submitted 27 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

  3. arXiv:2505.04612  [pdf, other]

    cs.CV

    FastMap: Revisiting Dense and Scalable Structure from Motion

    Authors: Jiahao Li, Haochen Wang, Muhammad Zubair Irshad, Igor Vasiljevic, Matthew R. Walter, Vitor Campagnolo Guizilini, Greg Shakhnarovich

    Abstract: We propose FastMap, a new global structure from motion method focused on speed and simplicity. Previous methods like COLMAP and GLOMAP are able to estimate high-precision camera poses, but suffer from poor scalability when the number of matched keypoint pairs becomes large. We identify two key factors leading to this problem: poor parallelization and computationally expensive optimization steps. T…

    Submitted 19 May, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

    Comments: Project webpage: https://jiahao.ai/fastmap

  4. arXiv:2503.01007  [pdf, other]

    cs.RO

    From Vague Instructions to Task Plans: A Feedback-Driven HRC Task Planning Framework based on LLMs

    Authors: Afagh Mehri Shervedani, Matthew R. Walter, Milos Zefran

    Abstract: Recent advances in large language models (LLMs) have demonstrated their potential as planners in human-robot collaboration (HRC) scenarios, offering a promising alternative to traditional planning methods. LLMs, which can generate structured plans by reasoning over natural language inputs, have the ability to generalize across diverse tasks and adapt to human instructions. This paper investigates…

    Submitted 2 March, 2025; originally announced March 2025.

  5. arXiv:2502.07937  [pdf, ps, other]

    cs.LG stat.ML

    Active Advantage-Aligned Online Reinforcement Learning with Offline Data

    Authors: Xuefeng Liu, Hung T. C. Le, Siyu Chen, Rick Stevens, Zhuoran Yang, Matthew R. Walter, Yuxin Chen

    Abstract: Online reinforcement learning (RL) enhances policies through direct interactions with the environment, but faces challenges related to sample efficiency. In contrast, offline RL leverages extensive pre-collected data to learn policies, but often produces suboptimal results due to limited data coverage. Recent efforts integrate offline and online RL in order to harness the advantages of both approa…

    Submitted 30 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  6. arXiv:2411.17764  [pdf, other]

    cs.RO cs.AI

    PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement

    Authors: Tewodros Ayalew, Xiao Zhang, Kevin Yuanbo Wu, Tianchong Jiang, Michael Maire, Matthew R. Walter

    Abstract: We present PROGRESSOR, a novel framework that learns a task-agnostic reward function from videos, enabling policy training through goal-conditioned reinforcement learning (RL) without manual supervision. Underlying this reward is an estimate of the distribution over task progress as a function of the current, initial, and goal observations that is learned in a self-supervised fashion. Crucially, P…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 15 pages, 13 figures

  7. arXiv:2409.18098  [pdf, other]

    cs.RO

    StackGen: Generating Stable Structures from Silhouettes via Diffusion

    Authors: Luzhe Sun, Takuma Yoneda, Samuel W. Wheeler, Tianchong Jiang, Matthew R. Walter

    Abstract: Humans naturally obtain intuition about the interactions between and the stability of rigid objects by observing and interacting with the world. It is this intuition that governs the way in which we regularly configure objects in our environment, allowing us to build complex structures from simple, everyday objects. Robotic agents, on the other hand, traditionally require an explicit model of the…

    Submitted 18 March, 2025; v1 submitted 26 September, 2024; originally announced September 2024.

  8. arXiv:2408.11804  [pdf, other]

    cs.LG cs.AI

    Approaching Deep Learning through the Spectral Dynamics of Weights

    Authors: David Yunis, Kumar Kshitij Patel, Samuel Wheeler, Pedro Savarese, Gal Vardi, Karen Livescu, Michael Maire, Matthew R. Walter

    Abstract: We propose an empirical approach centered on the spectral dynamics of weights -- the behavior of singular values and vectors during optimization -- to unify and clarify several phenomena in deep learning. We identify a consistent bias in optimization across various experiments, from small-scale "grokking" to large-scale tasks like image classification with ConvNets, image generation with UNets,…

    Submitted 21 August, 2024; originally announced August 2024.

  9. arXiv:2404.19221  [pdf, other]

    cs.CV cs.CL

    Transcrib3D: 3D Referring Expression Resolution through Large Language Models

    Authors: Jiading Fang, Xiangshan Tan, Shengjie Lin, Igor Vasiljevic, Vitor Guizilini, Hongyuan Mei, Rares Ambrus, Gregory Shakhnarovich, Matthew R Walter

    Abstract: If robots are to work effectively alongside people, they must be able to interpret natural language references to objects in their 3D environment. Understanding 3D referring expressions is challenging -- it requires the ability to both parse the 3D structure of the scene and correctly ground free-form language in the presence of distraction and clutter. We introduce Transcrib3D, an approach that b…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: CORLW 2023

  10. arXiv:2404.17034  [pdf, ps, other]

    cs.LG

    Learning Actionable Counterfactual Explanations in Large State Spaces

    Authors: Keziah Naggita, Matthew R. Walter, Avrim Blum

    Abstract: Recourse generators provide actionable insights, often through feature-based counterfactual explanations (CFEs), to help negatively classified individuals understand how to adjust their input features to achieve a positive classification. These feature-based CFEs, which we refer to as low-level CFEs, are overly specific (e.g., coding experience: \(4 \to 5+\) years) and often recommended in…

    Submitted 2 June, 2025; v1 submitted 25 April, 2024; originally announced April 2024.

  11. arXiv:2403.19913  [pdf, other]

    cs.CL cs.AI cs.LG cs.RO

    MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities of Large Language Models

    Authors: Peng Ding, Jiading Fang, Peng Li, Kangrui Wang, Xiaochen Zhou, Mo Yu, Jing Li, Matthew R. Walter, Hongyuan Mei

    Abstract: Large language models such as ChatGPT and GPT-4 have recently achieved astonishing performance on a variety of natural language processing tasks. In this paper, we propose MANGO, a benchmark to evaluate their capabilities to perform text-based mapping and navigation. Our benchmark includes 53 mazes taken from a suite of textgames: each maze is paired with a walkthrough that visits every location b…

    Submitted 8 August, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: COLM 2024 camera-ready

  12. arXiv:2310.17649  [pdf, other]

    cs.RO cs.CV

    6-DoF Stability Field via Diffusion Models

    Authors: Takuma Yoneda, Tianchong Jiang, Gregory Shakhnarovich, Matthew R. Walter

    Abstract: A core capability for robot manipulation is reasoning over where and how to stably place objects in cluttered environments. Traditionally, robots have relied on object-specific, hand-crafted heuristics in order to perform such reasoning, with limited generalizability beyond a small number of object instances and object interaction patterns. Recent approaches instead learn notions of physical inter…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: In submission

  13. arXiv:2310.01737  [pdf, other]

    cs.LG cs.AI stat.ML

    Blending Imitation and Reinforcement Learning for Robust Policy Improvement

    Authors: Xuefeng Liu, Takuma Yoneda, Rick L. Stevens, Matthew R. Walter, Yuxin Chen

    Abstract: While reinforcement learning (RL) has shown promising performance, its sample complexity continues to be a substantial hurdle, restricting its broader application across a variety of domains. Imitation learning (IL) utilizes oracles to improve sample efficiency, yet it is often constrained by the quality of the oracles deployed. which actively interleaves between IL and RL based on an online estim…

    Submitted 4 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

  14. Enhancing scientific exploration of the deep sea through shared autonomy in remote manipulation

    Authors: Amy Phung, Gideon Billings, Andrea F. Daniele, Matthew R. Walter, Richard Camilli

    Abstract: Shared autonomy enables novice remote users to conduct deep-ocean science operations with robotic manipulators.

    Submitted 15 September, 2023; originally announced September 2023.

  15. arXiv:2306.17840  [pdf, other]

    cs.RO cs.CL

    Statler: State-Maintaining Language Models for Embodied Reasoning

    Authors: Takuma Yoneda, Jiading Fang, Peng Li, Huanyu Zhang, Tianchong Jiang, Shengjie Lin, Ben Picker, David Yunis, Hongyuan Mei, Matthew R. Walter

    Abstract: There has been a significant research interest in employing large language models to empower intelligent robots with complex reasoning. Existing work focuses on harnessing their abilities to reason about the histories of their actions and observations. In this paper, we explore a new dimension in which large language models may benefit robotics planning. In particular, we propose Statler, a framew…

    Submitted 20 May, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted at ICRA 2024; Project website: https://statler-lm.github.io/

  16. arXiv:2306.10259  [pdf, other]

    cs.LG cs.AI stat.ML

    Active Policy Improvement from Multiple Black-box Oracles

    Authors: Xuefeng Liu, Takuma Yoneda, Chaoqi Wang, Matthew R. Walter, Yuxin Chen

    Abstract: Reinforcement learning (RL) has made significant strides in various complex domains. However, identifying an effective policy via RL often necessitates extensive exploration. Imitation learning aims to mitigate this issue by using expert demonstrations to guide exploration. In real-world scenarios, one often has access to multiple suboptimal black-box experts, rather than a single optimal oracle.…

    Submitted 5 July, 2023; v1 submitted 17 June, 2023; originally announced June 2023.

  17. arXiv:2305.13307  [pdf, other]

    cs.CV

    NeRFuser: Large-Scale Scene Representation by NeRF Fusion

    Authors: Jiading Fang, Shengjie Lin, Igor Vasiljevic, Vitor Guizilini, Rares Ambrus, Adrien Gaidon, Gregory Shakhnarovich, Matthew R. Walter

    Abstract: A practical benefit of implicit visual representations like Neural Radiance Fields (NeRFs) is their memory efficiency: large scenes can be efficiently stored and shared as small neural nets instead of collections of images. However, operating on these implicit visual data structures requires extending classical image-based vision techniques (e.g., registration, blending) from image sets to neural…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Code available at https://github.com/ripl/nerfuser

  18. arXiv:2302.03805  [pdf, ps, other]

    cs.LG

    Eliciting User Preferences for Personalized Multi-Objective Decision Making through Comparative Feedback

    Authors: Han Shao, Lee Cohen, Avrim Blum, Yishay Mansour, Aadirupa Saha, Matthew R. Walter

    Abstract: In classic reinforcement learning (RL) and decision making problems, policies are evaluated with respect to a scalar reward function, and all optimal policies are the same with regards to their expected return. However, many real-world problems involve balancing multiple, sometimes conflicting, objectives whose relative priority will vary according to the preferences of each user. Consequently, a…

    Submitted 31 October, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  19. arXiv:2207.11773  [pdf, other]

    cs.RO

    N-LIMB: Neural Limb Optimization for Efficient Morphological Design

    Authors: Charles Schaff, Matthew R. Walter

    Abstract: A robot's ability to complete a task is heavily dependent on its physical design. However, identifying an optimal physical design and its corresponding control policy is inherently challenging. The freedom to choose the number of links, their type, and how they are connected results in a combinatorial design space, and the evaluation of any design in that space requires deriving its optimal contro…

    Submitted 19 September, 2022; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: For code and videos, see https://sites.google.com/ttic.edu/nlimb

  20. arXiv:2202.04575  [pdf, other]

    cs.RO

    Soft Robots Learn to Crawl: Jointly Optimizing Design and Control with Sim-to-Real Transfer

    Authors: Charles Schaff, Audrey Sedal, Matthew R. Walter

    Abstract: This work provides a complete framework for the simulation, co-optimization, and sim-to-real transfer of the design and control of soft legged robots. The compliance of soft robots provides a form of "mechanical intelligence" -- the ability to passively exhibit behaviors that would otherwise be difficult to program. Exploiting this capacity requires careful consideration of the coupling between me…

    Submitted 9 February, 2022; originally announced February 2022.

  21. arXiv:2112.08526  [pdf, other]

    cs.LG cs.RO

    Invariance Through Latent Alignment

    Authors: Takuma Yoneda, Ge Yang, Matthew R. Walter, Bradly Stadie

    Abstract: A robot's deployment environment often involves perceptual changes that differ from what it has experienced during training. Standard practices such as data augmentation attempt to bridge this gap by augmenting source images in an effort to extend the support of the training distribution to better cover what the agent might experience at test time. In many cases, however, it is impossible to know…

    Submitted 17 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: To appear in RSS 2022. Here's our project page: https://invariance-through-latent-alignment.github.io

  22. arXiv:2112.03325  [pdf, other]

    cs.CV cs.RO

    Self-Supervised Camera Self-Calibration from Video

    Authors: Jiading Fang, Igor Vasiljevic, Vitor Guizilini, Rares Ambrus, Greg Shakhnarovich, Adrien Gaidon, Matthew R. Walter

    Abstract: Camera calibration is integral to robotics and computer vision algorithms that seek to infer geometric properties of the scene from visual input streams. In practice, calibration is a laborious procedure requiring specialized data collection and careful tuning. This process must be repeated whenever the parameters of the camera change, which can be a frequent occurrence for mobile robots and auton…

    Submitted 1 March, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: The project page: https://sites.google.com/ttic.edu/self-sup-self-calib

  23. arXiv:2109.10957  [pdf, other]

    cs.RO stat.AP

    Real Robot Challenge: A Robotics Competition in the Cloud

    Authors: Stefan Bauer, Felix Widmaier, Manuel Wüthrich, Annika Buchholz, Sebastian Stark, Anirudh Goyal, Thomas Steinbrenner, Joel Akpo, Shruti Joshi, Vincent Berenz, Vaibhav Agrawal, Niklas Funk, Julen Urain De Jesus, Jan Peters, Joe Watson, Claire Chen, Krishnan Srinivasan, Junwu Zhang, Jeffrey Zhang, Matthew R. Walter, Rishabh Madan, Charles Schaff, Takahiro Maeda, Takuma Yoneda, Denis Yarats, et al. (17 additional authors not shown)

    Abstract: Dexterous manipulation remains an open problem in robotics. To coordinate efforts of the research community towards tackling this problem, we propose a shared benchmark. We designed and built robotic platforms that are hosted at MPI for Intelligent Systems and can be accessed remotely. Each platform consists of three robotic fingers that are capable of dexterous object manipulation. Users are able…

    Submitted 10 June, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

  24. arXiv:2105.10396  [pdf, other]

    cs.RO cs.CL

    Language Understanding for Field and Service Robots in a Priori Unknown Environments

    Authors: Matthew R. Walter, Siddharth Patki, Andrea F. Daniele, Ethan Fahnestock, Felix Duvallet, Sachithra Hemachandra, Jean Oh, Anthony Stentz, Nicholas Roy, Thomas M. Howard

    Abstract: Contemporary approaches to perception, planning, estimation, and control have allowed robots to operate robustly as our remote surrogates in uncertain, unstructured environments. This progress now creates an opportunity for robots to operate not only in isolation, but also with and alongside humans in our complex environments. Realizing this opportunity requires an efficient and flexible medium th…

    Submitted 21 December, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: Field Robotics (accepted, to appear)

  25. Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation

    Authors: Niklas Funk, Charles Schaff, Rishabh Madan, Takuma Yoneda, Julen Urain De Jesus, Joe Watson, Ethan K. Gordon, Felix Widmaier, Stefan Bauer, Siddhartha S. Srinivasa, Tapomayukh Bhattacharjee, Matthew R. Walter, Jan Peters

    Abstract: Dexterous manipulation is a challenging and important problem in robotics. While data-driven methods are a promising approach, current benchmarks require simulation or extensive engineering support due to the sample inefficiency of popular methods. We present benchmarks for the TriFinger system, an open-source robotic platform for dexterous manipulation and the focus of the 2020 Real Robot Challen…

    Submitted 8 December, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Journal ref: IEEE Robotics and Automation Letters 7 (2022) 478-485

  26. arXiv:2011.11765  [pdf, other]

    cs.CV cs.LG

    Boosting Contrastive Self-Supervised Learning with False Negative Cancellation

    Authors: Tri Huynh, Simon Kornblith, Matthew R. Walter, Michael Maire, Maryam Khademi

    Abstract: Self-supervised representation learning has made significant leaps fueled by progress in contrastive learning, which seeks to learn transformations that embed positive input pairs nearby, while pushing negative pairs far apart. While positive pairs can be generated reliably (e.g., as different views of the same image), it is difficult to accurately establish negative pairs, defined as samples from…

    Submitted 2 January, 2022; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Code is available at https://github.com/google-research/fnc

  27. arXiv:2009.05940  [pdf, other]

    cs.CL cs.LG cs.MA

    Pow-Wow: A Dataset and Study on Collaborative Communication in Pommerman

    Authors: Takuma Yoneda, Matthew R. Walter, Jason Naradowsky

    Abstract: In multi-agent learning, agents must coordinate with each other in order to succeed. For humans, this coordination is typically accomplished through the use of language. In this work we perform a controlled study of human language use in a competitive team-based game, and search for useful lessons for structuring communication protocol between autonomous agents. We construct Pow-Wow, a new dataset…

    Submitted 13 September, 2020; originally announced September 2020.

    Comments: Accepted at LaReL workshop at ICML 2020

  28. arXiv:2009.04362  [pdf, other]

    cs.RO cs.LG

    Integrated Benchmarking and Design for Reproducible and Accessible Evaluation of Robotic Agents

    Authors: Jacopo Tani, Andrea F. Daniele, Gianmarco Bernasconi, Amaury Camus, Aleksandar Petrov, Anthony Courchesne, Bhairav Mehta, Rohit Suri, Tomasz Zaluska, Matthew R. Walter, Emilio Frazzoli, Liam Paull, Andrea Censi

    Abstract: As robotics matures and increases in complexity, it is more necessary than ever that robot autonomy research be reproducible. Compared to other sciences, there are specific challenges to benchmarking autonomy, such as the complexity of the software stacks, the variability of the hardware and the reliance on data-driven techniques, amongst others. In this paper, we describe a new concept for reprod…

    Submitted 9 September, 2020; originally announced September 2020.

    Comments: IROS 2020; Code available at https://github.com/duckietown

  29. arXiv:2008.01205  [pdf, other]

    cs.LG cs.RO stat.ML

    Concurrent Training Improves the Performance of Behavioral Cloning from Observation

    Authors: Zachary W. Robertson, Matthew R. Walter

    Abstract: Learning from demonstration is widely used as an efficient way for robots to acquire new skills. However, it typically requires that demonstrations provide full access to the state and action sequences. In contrast, learning from observation offers a way to utilize unlabeled demonstrations (e.g., video) to perform imitation learning. One approach to this is behavioral cloning from observation (BCO…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: 13 pages, 2 figures, Submitted to the 4th Conference on Robot Learning (CoRL 2020)

  30. arXiv:2004.05097  [pdf, other]

    cs.RO

    Residual Policy Learning for Shared Autonomy

    Authors: Charles Schaff, Matthew R. Walter

    Abstract: Shared autonomy provides an effective framework for human-robot collaboration that takes advantage of the complementary strengths of humans and robots to achieve common goals. Many existing approaches to shared autonomy make restrictive assumptions that the goal space, environment dynamics, or human policy are known a priori, or are limited to discrete action spaces, preventing those methods from…

    Submitted 10 July, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

    Comments: Published at Robotics: Science and Systems 2020 (RSS)

  31. arXiv:2002.06299  [pdf, other]

    cs.LG stat.ML

    Loop Estimator for Discounted Values in Markov Reward Processes

    Authors: Falcon Z. Dai, Matthew R. Walter

    Abstract: At the working heart of policy iteration algorithms commonly used and studied in the discounted setting of reinforcement learning, the policy evaluation step estimates the value of states with samples from a Markov reward process induced by following a Markov policy in a Markov decision process. We propose a simple and efficient estimator called loop estimator that exploits the regenerative struct…

    Submitted 3 March, 2021; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: accepted to AAAI 2021

  32. arXiv:1910.10034  [pdf, other]

    cs.RO cs.AI cs.CL

    Language-guided Semantic Mapping and Mobile Manipulation in Partially Observable Environments

    Authors: Siddharth Patki, Ethan Fahnestock, Thomas M. Howard, Matthew R. Walter

    Abstract: Recent advances in data-driven models for grounded language understanding have enabled robots to interpret increasingly complex instructions. Two fundamental limitations of these methods are that most require a full model of the environment to be known a priori, and they attempt to reason over a world representation that is flat and unnecessarily detailed, which limits scalability. Recent semantic…

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: To appear at 2019 Conference on Robot Learning (CoRL)

  33. arXiv:1908.00463  [pdf, other]

    cs.CV

    DIODE: A Dense Indoor and Outdoor DEpth Dataset

    Authors: Igor Vasiljevic, Nick Kolkin, Shanyi Zhang, Ruotian Luo, Haochen Wang, Falcon Z. Dai, Andrea F. Daniele, Mohammadreza Mostajabi, Steven Basart, Matthew R. Walter, Gregory Shakhnarovich

    Abstract: We introduce DIODE, a dataset that contains thousands of diverse high resolution color images with accurate, dense, long-range depth measurements. DIODE (Dense Indoor/Outdoor DEpth) is the first public dataset to include RGBD images of indoor and outdoor scenes obtained with one sensor suite. This is in contrast to existing datasets that focus on just one domain/scene type and employ different sen…

    Submitted 29 August, 2019; v1 submitted 1 August, 2019; originally announced August 2019.

  34. arXiv:1907.02114  [pdf, ps, other]

    cs.LG stat.ML

    Maximum Expected Hitting Cost of a Markov Decision Process and Informativeness of Rewards

    Authors: Falcon Z. Dai, Matthew R. Walter

    Abstract: We propose a new complexity measure for Markov decision processes (MDPs), the maximum expected hitting cost (MEHC). This measure tightens the closely related notion of diameter [JOA10] by accounting for the reward structure. We show that this parameter replaces diameter in the upper bound on the optimal value span of an extended MDP, thus refining the associated upper bounds on the regret of sever…

    Submitted 4 November, 2019; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Minor post-review revision. Main paper with appendix. To appear at NeurIPS 2019

  35. arXiv:1906.05948  [pdf, other]

    cs.LG cs.CV cs.NE

    Multigrid Neural Memory

    Authors: Tri Huynh, Michael Maire, Matthew R. Walter

    Abstract: We introduce a novel approach to endowing neural networks with emergent, long-term, large-scale memory. Distinct from strategies that connect neural networks to external memory banks via intricately crafted controllers and hand-designed attentional mechanisms, our memory is internal, distributed, co-located alongside computation, and implicitly addressed, while being drastically simpler than prior…

    Submitted 15 August, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: ICML 2020; Project Website: http://people.cs.uchicago.edu/~trihuynh/multigrid_mem

  36. arXiv:1903.09243  [pdf, other]

    cs.RO cs.AI cs.CL cs.LG

    Inferring Compact Representations for Efficient Natural Language Understanding of Robot Instructions

    Authors: Siddharth Patki, Andrea F. Daniele, Matthew R. Walter, Thomas M. Howard

    Abstract: The speed and accuracy with which robots are able to interpret natural language is fundamental to realizing effective human-robot interaction. A great deal of attention has been paid to developing models and approximate inference algorithms that improve the efficiency of language understanding. However, existing methods still attempt to reason over a representation of the environment that is flat…

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: Accepted to ICRA 2019

  37. arXiv:1903.02503  [pdf, other]

    cs.RO

    The AI Driving Olympics at NeurIPS 2018

    Authors: Julian Zilly, Jacopo Tani, Breandan Considine, Bhairav Mehta, Andrea F. Daniele, Manfred Diaz, Gianmarco Bernasconi, Claudio Ruch, Jan Hakenberg, Florian Golemo, A. Kirsten Bowser, Matthew R. Walter, Ruslan Hristov, Sunil Mallya, Emilio Frazzoli, Andrea Censi, Liam Paull

    Abstract: Despite recent breakthroughs, the ability of deep learning and reinforcement learning to outperform traditional approaches to control physically embodied robotic agents remains largely unproven. To help bridge this gap, we created the 'AI Driving Olympics' (AI-DO), a competition with the objective of evaluating the state of the art in machine learning and artificial intelligence for mobile robotic…

    Submitted 6 March, 2019; originally announced March 2019.

    Comments: Competition, robotics, safety-critical AI, self-driving cars, autonomous mobility on demand, Duckietown

  38. arXiv:1801.01432  [pdf, other]

    cs.RO cs.LG

    Jointly Learning to Construct and Control Agents using Deep Reinforcement Learning

    Authors: Charles Schaff, David Yunis, Ayan Chakrabarti, Matthew R. Walter

    Abstract: The physical design of a robot and the policy that controls its motion are inherently coupled, and should be determined according to the task and environment. In an increasing number of applications, data-driven and learning-based approaches, such as deep reinforcement learning, have proven effective at designing control policies. For most tasks, the only way to evaluate a physical design with res…

    Submitted 14 September, 2018; v1 submitted 4 January, 2018; originally announced January 2018.

  39. arXiv:1704.01133  [pdf, other]

    cs.RO cs.CV cs.LG

    Satellite Image-based Localization via Learned Embeddings

    Authors: Dong-Ki Kim, Matthew R. Walter

    Abstract: We propose a vision-based method that localizes a ground vehicle using publicly available satellite imagery as the only prior knowledge of the environment. Our approach takes as input a sequence of ground-level images acquired by the vehicle as it navigates, and outputs an estimate of the vehicle's pose relative to a georeferenced satellite image. We overcome the significant viewpoint and appearan…

    Submitted 7 March, 2022; v1 submitted 4 April, 2017; originally announced April 2017.

    Comments: Published in IEEE International Conference on Robotics and Automation (ICRA), 2017; arXiv version has updated author information and added video highlight available at https://youtu.be/58K1-0WpGNs

  40. arXiv:1703.08612  [pdf, other]

    cs.RO cs.LG

    Jointly Optimizing Placement and Inference for Beacon-based Localization

    Authors: Charles Schaff, David Yunis, Ayan Chakrabarti, Matthew R. Walter

    Abstract: The ability of robots to estimate their location is crucial for a wide variety of autonomous operations. In settings where GPS is unavailable, measurements of transmissions from fixed beacons provide an effective means of estimating a robot's location as it navigates. The accuracy of such a beacon-based localization system depends both on how beacons are distributed in the environment, and how the…

    Submitted 20 September, 2017; v1 submitted 24 March, 2017; originally announced March 2017.

    Comments: Appeared at 2017 International Conference on Intelligent Robots and Systems (IROS)

  41. arXiv:1611.06997  [pdf, other]

    cs.CL cs.AI

    Coherent Dialogue with Attention-based Language Models

    Authors: Hongyuan Mei, Mohit Bansal, Matthew R. Walter

    Abstract: We model coherent conversation continuation via RNN-based dialogue models equipped with a dynamic attention mechanism. Our attention-RNN language model dynamically increases the scope of attention on the history as the conversation continues, as opposed to standard attention (or alignment) models with a fixed input scope in a sequence-to-sequence model. This allows each generated word to be associ…

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: To appear at AAAI 2017

  42. arXiv:1610.03164  [pdf, other]

    cs.RO cs.AI cs.CL cs.LG

    Navigational Instruction Generation as Inverse Reinforcement Learning with Neural Machine Translation

    Authors: Andrea F. Daniele, Mohit Bansal, Matthew R. Walter

    Abstract: Modern robotics applications that involve human-robot interaction require robots to be able to communicate with humans seamlessly and effectively. Natural language provides a flexible and efficient medium through which robots can exchange information with their human partners. Significant advancements have been made in developing robots capable of interpreting free-form instructions, but less atte…

    Submitted 10 October, 2016; originally announced October 2016.

  43. arXiv:1511.05526  [pdf, other]

    cs.RO cs.CL cs.CV

    Learning Articulated Motion Models from Visual and Lingual Signals

    Authors: Zhengyang Wu, Mohit Bansal, Matthew R. Walter

    Abstract: In order for robots to operate effectively in homes and workplaces, they must be able to manipulate the articulated objects common within environments built for and by humans. Previous work learns kinematic models that prescribe this manipulation from visual demonstrations. Lingual signals, such as natural language descriptions and instructions, offer a complementary means of conveying knowledge o…

    Submitted 1 July, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

  44. arXiv:1510.09171  [pdf, other]

    cs.RO cs.CV

    Accurate Vision-based Vehicle Localization using Satellite Imagery

    Authors: Hang Chu, Hongyuan Mei, Mohit Bansal, Matthew R. Walter

    Abstract: We propose a method for accurately localizing ground vehicles with the aid of satellite imagery. Our approach takes a ground image as input, and outputs the location from which it was taken on a georeferenced satellite image. We perform visual localization by estimating the co-occurrence probabilities between the ground and satellite images based on a ground-satellite feature dictionary. The metho…

    Submitted 30 October, 2015; originally announced October 2015.

    Comments: 9 pages, 8 figures. Full version is submitted to ICRA 2016. Short version is to appear at NIPS 2015 Workshop on Transfer and Multi-Task Learning

  45. arXiv:1509.00838  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment

    Authors: Hongyuan Mei, Mohit Bansal, Matthew R. Walter

    Abstract: We propose an end-to-end, domain-independent neural encoder-aligner-decoder model for selective generation, i.e., the joint task of content selection and surface realization. Our model first encodes a full set of over-determined database event records via an LSTM-based recurrent neural network, then utilizes a novel coarse-to-fine aligner to identify the small subset of salient records to talk abo…

    Submitted 8 January, 2016; v1 submitted 2 September, 2015; originally announced September 2015.

  46. arXiv:1506.04089  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE cs.RO

    Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences

    Authors: Hongyuan Mei, Mohit Bansal, Matthew R. Walter

    Abstract: We propose a neural sequence-to-sequence model for direction following, a task that is essential to realizing effective autonomous agents. Our alignment-based encoder-decoder model with long short-term memory recurrent neural networks (LSTM-RNN) translates natural language instructions to action sequences based upon a representation of the observable world state. We introduce a multi-level aligner…

    Submitted 17 December, 2015; v1 submitted 12 June, 2015; originally announced June 2015.

    Comments: To appear at AAAI 2016 (and an extended version of a NIPS 2015 Multimodal Machine Learning workshop paper)

  47. arXiv:1503.05079  [pdf, other]

    cs.RO

    Learning Models for Following Natural Language Directions in Unknown Environments

    Authors: Sachithra Hemachandra, Felix Duvallet, Thomas M. Howard, Nicholas Roy, Anthony Stentz, Matthew R. Walter

    Abstract: Natural language offers an intuitive and flexible means for humans to communicate with the robots that we will increasingly work alongside in our homes and workplaces. Recent advancements have given rise to robots that are able to interpret natural language manipulation and navigation commands, but these methods require a prior map of the robot's environment. In this paper, we propose a novel lear…

    Submitted 17 March, 2015; originally announced March 2015.

    Comments: ICRA 2015

  48. arXiv:1502.01659  [pdf, other]

    cs.RO cs.CV

    Learning Articulated Motions From Visual Demonstration

    Authors: Sudeep Pillai, Matthew R. Walter, Seth Teller

    Abstract: Many functional elements of human homes and workplaces consist of rigid components which are connected through one or more sliding or rotating linkages. Examples include doors and drawers of cabinets and appliances; laptops; and swivel office chairs. A robotic mobile manipulator would benefit from the ability to acquire kinematic models of such objects from observation. This paper describes a meth…

    Submitted 5 February, 2015; originally announced February 2015.

    Comments: Published in Robotics: Science and Systems X, Berkeley, CA. ISBN: 978-0-9923747-0-9

  49. arXiv:1401.6911  [pdf]

    astro-ph.EP physics.geo-ph

    Hydrothermal alteration at the Panorama Formation, North Pole Dome, Pilbara Craton, Western Australia

    Authors: Adrian J. Brown, Thomas J. Cudahy, Malcolm R. Walter

    Abstract: An airborne hyperspectral remote sensing dataset was obtained of the North Pole Dome region of the Pilbara Craton in October 2002. It has been analyzed for indications of hydrothermal minerals. Here we report on the identification and mapping of hydrothermal minerals in the 3.459 Ga Panorama Formation and surrounding strata. The spatial distribution of a pattern of subvertical pyrophyllite rich ve…

    Submitted 24 January, 2014; originally announced January 2014.

    Comments: 29 pages, 9 figures, 2 tables

    Journal ref: Precambrian Research (2006) 151, 211-223