Skip to main content

Showing 1–46 of 46 results for author: Wang, M Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.05725  [pdf, other

    cs.RO

    Quantitative Hardness Assessment with Vision-based Tactile Sensing for Fruit Classification and Grasping

    Authors: Zhongyuan Liao, Yipai Du, Jianghua Duan, Haobo Liang, Michael Yu Wang

    Abstract: Accurate estimation of fruit hardness is essential for automated classification and handling systems, particularly in determining fruit variety, assessing ripeness, and ensuring proper harvesting force. This study presents an innovative framework for quantitative hardness assessment utilizing vision-based tactile sensing, tailored explicitly for robotic applications in agriculture. The proposed me… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: Accepted to the Novel Approaches for Precision Agriculture and Forestry with Autonomous Robots IEEE ICRA Workshop - 2025

  2. arXiv:2503.10743  [pdf, other

    cs.RO cs.LG stat.ML

    Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation

    Authors: Qi Lv, Hao Li, Xiang Deng, Rui Shao, Yinchuan Li, Jianye Hao, Longxiang Gao, Michael Yu Wang, Liqiang Nie

    Abstract: Despite the significant success of imitation learning in robotic manipulation, its application to bimanual tasks remains highly challenging. Existing approaches mainly learn a policy to predict a distant next-best end-effector pose (NBP) and then compute the corresponding joint rotation angles for motion using inverse kinematics. However, they suffer from two important issues: (1) rarely consideri… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Accepted by CVPR 2025

  3. arXiv:2503.03464  [pdf, other

    cs.RO

    Generative Artificial Intelligence in Robotic Manipulation: A Survey

    Authors: Kun Zhang, Peng Yun, Jun Cen, Junhao Cai, Didi Zhu, Hangjie Yuan, Chao Zhao, Tao Feng, Michael Yu Wang, Qifeng Chen, Jia Pan, Wei Zhang, Bo Yang, Hua Chen

    Abstract: This survey provides a comprehensive review on recent advancements of generative learning models in robotic manipulation, addressing key challenges in the field. Robotic manipulation faces critical bottlenecks, including significant challenges in insufficient data and inefficient data acquisition, long-horizon and complex task planning, and the multi-modality reasoning ability for robust policy le… ▽ More

    Submitted 10 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

  4. arXiv:2502.06359  [pdf, other

    cs.RO

    Occlusion-Aware Contingency Safety-Critical Planning for Autonomous Vehicles

    Authors: Lei Zheng, Rui Yang, Minzhe Zheng, Zengqi Peng, Michael Yu Wang, Jun Ma

    Abstract: Ensuring safe driving while maintaining travel efficiency for autonomous vehicles in dynamic and occluded environments is a critical challenge. This paper proposes an occlusion-aware contingency safety-critical planning approach for real-time autonomous driving in such environments. Leveraging reachability analysis for risk assessment, forward reachable sets of occluded phantom vehicles are comput… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  5. arXiv:2410.20230  [pdf, other

    cs.RO

    FRTree Planner: Robot Navigation in Cluttered and Unknown Environments with Tree of Free Regions

    Authors: Yulin Li, Zhicheng Song, Chunxin Zheng, Zhihai Bi, Kai Chen, Michael Yu Wang, Jun Ma

    Abstract: In this work, we present FRTree planner, a novel robot navigation framework that leverages a tree structure of free regions, specifically designed for navigation in cluttered and unknown environments with narrow passages. The framework continuously incorporates real-time perceptive information to identify distinct navigation options and dynamically expands the tree toward explorable and traversabl… ▽ More

    Submitted 13 February, 2025; v1 submitted 26 October, 2024; originally announced October 2024.

  6. arXiv:2409.10310  [pdf, other

    cs.RO eess.SY

    Safe and Real-Time Consistent Planning for Autonomous Vehicles in Partially Observed Environments via Parallel Consensus Optimization

    Authors: Lei Zheng, Rui Yang, Minzhe Zheng, Michael Yu Wang, Jun Ma

    Abstract: Ensuring safety and driving consistency is a significant challenge for autonomous vehicles operating in partially observed environments. This work introduces a consistent parallel trajectory optimization (CPTO) approach to enable safe and consistent driving in dense obstacle environments with perception uncertainties. Utilizing discrete-time barrier function theory, we develop a consensus safety b… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  7. arXiv:2406.05613  [pdf, other

    cs.RO

    Distributed Motion Control of Multiple Mobile Manipulators for Reducing Interaction Wrench in Object Manipulation

    Authors: Wenhang Liu, Meng Ren, Kun Song, Gaoming Chen, Michael Yu Wang, Zhenhua Xiong

    Abstract: In real-world cooperative manipulation of objects, multiple mobile manipulator systems may suffer from disturbances and asynchrony, leading to excessive interaction wrenches and potentially causing object damage or emergency stops. Existing methods often rely on torque control and dynamic models, which are uncommon in many industrial robots and settings. Additionally, dynamic models often neglect… ▽ More

    Submitted 7 April, 2025; v1 submitted 8 June, 2024; originally announced June 2024.

  8. arXiv:2406.05427  [pdf, other

    cs.LG

    Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL

    Authors: Qi Lv, Xiang Deng, Gongwei Chen, Michael Yu Wang, Liqiang Nie

    Abstract: While the conditional sequence modeling with the transformer architecture has demonstrated its effectiveness in dealing with offline reinforcement learning (RL) tasks, it is struggle to handle out-of-distribution states and actions. Existing work attempts to address this issue by data augmentation with the learned policy or adding extra constraints with the value-based RL algorithm. However, these… ▽ More

    Submitted 22 January, 2025; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted by NeurIPS 2024; Code is available at https://github.com/aopolin-lv/DecisionMamba

    Journal ref: The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024

  9. arXiv:2406.02461  [pdf, other

    cs.CV

    RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

    Authors: Qi Wang, Ruijie Lu, Xudong Xu, Jingbo Wang, Michael Yu Wang, Bo Dai, Gang Zeng, Dan Xu

    Abstract: The advancement of diffusion models has pushed the boundary of text-to-3D object generation. While it is straightforward to composite objects into a scene with reasonable geometry, it is nontrivial to texture such a scene perfectly due to style inconsistency and occlusions between objects. To tackle these problems, we propose a coarse-to-fine 3D scene texturing framework, referred to as RoomTex, t… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  10. arXiv:2404.05242  [pdf, other

    cs.RO

    Collision-Free Trajectory Optimization in Cluttered Environments Using Sums-of-Squares Programming

    Authors: Yulin Li, Chunxin Zheng, Kai Chen, Yusen Xie, Xindong Tang, Michael Yu Wang, Jun Ma

    Abstract: In this work, we propose a trajectory optimization approach for robot navigation in cluttered 3D environments. We represent the robot's geometry as a semialgebraic set defined by polynomial inequalities such that robots with general shapes can be suitably characterized. To address the robot navigation task in obstacle-dense environments, we exploit the free space directly to construct a sequence o… ▽ More

    Submitted 26 August, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  11. arXiv:2404.04929  [pdf, other

    cs.RO

    RoboMP$^2$: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models

    Authors: Qi Lv, Hao Li, Xiang Deng, Rui Shao, Michael Yu Wang, Liqiang Nie

    Abstract: Multimodal Large Language Models (MLLMs) have shown impressive reasoning abilities and general intelligence in various domains. It inspires researchers to train end-to-end MLLMs or utilize large models to generate policies with human-selected prompts for embodied agents. However, these methods exhibit limited generalization capabilities on unseen tasks or scenarios, and overlook the multimodal env… ▽ More

    Submitted 8 June, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by ICML 2024; Project page: https://aopolin-lv.github.io/RoboMP2.github.io/

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:33558-33574, 2024

  12. arXiv:2403.04143  [pdf, other

    cs.RO

    Incremental Bayesian Learning for Fail-Operational Control in Autonomous Driving

    Authors: Lei Zheng, Rui Yang, Zengqi Peng, Wei Yan, Michael Yu Wang, Jun Ma

    Abstract: Abrupt maneuvers by surrounding vehicles (SVs) can typically lead to safety concerns and affect the task efficiency of the ego vehicle (EV), especially with model uncertainties stemming from environmental disturbances. This paper presents a real-time fail-operational controller that ensures the asymptotic convergence of an uncertain EV to a safe state, while preserving task efficiency in dynamic e… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures, accepted for publication in the 22nd European Control Conference (ECC 2024)

  13. Barrier-Enhanced Parallel Homotopic Trajectory Optimization for Safety-Critical Autonomous Driving

    Authors: Lei Zheng, Rui Yang, Michael Yu Wang, Jun Ma

    Abstract: Enforcing safety while preventing overly conservative behaviors is essential for autonomous vehicles to achieve high task performance. In this paper, we propose a barrier-enhanced parallel homotopic trajectory optimization (BPHTO) approach with the over-relaxed alternating direction method of multipliers (ADMM) for real-time integrated decision-making and planning. To facilitate safety interaction… ▽ More

    Submitted 9 May, 2025; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 17 pages, 10 figures

    Journal ref: IEEE Transactions on Intelligent Transportation Systems, vol. 26, no. 2, pp. 2169 - 2186, February 2025

  14. arXiv:2312.07146  [pdf, other

    cs.RO

    CompdVision: Combining Near-Field 3D Visual and Tactile Sensing Using a Compact Compound-Eye Imaging System

    Authors: Lifan Luo, Boyang Zhang, Zhijie Peng, Yik Kin Cheung, Guanlan Zhang, Zhigang Li, Michael Yu Wang, Hongyu Yu

    Abstract: As automation technologies advance, the need for compact and multi-modal sensors in robotic applications is growing. To address this demand, we introduce CompdVision, a novel sensor that employs a compound-eye imaging system to combine near-field 3D visual and tactile sensing within a compact form factor. CompdVision utilizes two types of vision units to address diverse sensing needs, eliminating… ▽ More

    Submitted 18 July, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024

  15. arXiv:2312.06168  [pdf, other

    cs.RO

    A Novel Planning Framework for Complex Flipping Manipulation of Multiple Mobile Manipulators

    Authors: Wenhang Liu, Meng Ren, Kun Song, Michael Yu Wang, Zhenhua Xiong

    Abstract: During complex object manipulation, manipulator systems often face the configuration disconnectivity problem due to closed-chain constraints. Although regrasping can be adopted to get a piecewise connected manipulation, it is a challenging problem to determine whether there is a planning result without regrasping. To address this problem, a novel planning framework is proposed for multiple mobile… ▽ More

    Submitted 25 October, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  16. arXiv:2310.10959  [pdf, other

    cs.RO

    Origami-inspired Bi-directional Actuator with Orthogonal Actuation

    Authors: Shuai Liu, Sheeraz Athar, Michael Yu Wang

    Abstract: Origami offers a promising alternative for designing innovative soft robotic actuators. While features of origami, such as bi-directional motion and structural anisotropy, haven't been extensively explored in the past, this letter presents a novel design inspired by origami tubes for a bi-directional actuator. This actuator is capable of moving in two orthogonal directions and has separate channel… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  17. arXiv:2309.05298  [pdf, other

    cs.RO eess.SY

    Real-Time Parallel Trajectory Optimization with Spatiotemporal Safety Constraints for Autonomous Driving in Congested Traffic

    Authors: Lei Zheng, Rui Yang, Zengqi Peng, Haichao Liu, Michael Yu Wang, Jun Ma

    Abstract: Multi-modal behaviors exhibited by surrounding vehicles (SVs) can typically lead to traffic congestion and reduce the travel efficiency of autonomous vehicles (AVs) in dense traffic. This paper proposes a real-time parallel trajectory optimization method for the AV to achieve high travel efficiency in dynamic and congested environments. A spatiotemporal safety module is developed to facilitate the… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 8 pages, 7 figures, accepted for publication in the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

  18. Spatiotemporal Receding Horizon Control with Proactive Interaction Towards Autonomous Driving in Dense Traffic

    Authors: Lei Zheng, Rui Yang, Zengqi Peng, Michael Yu Wang, Jun Ma

    Abstract: In dense traffic scenarios, ensuring safety while keeping high task performance for autonomous driving is a critical challenge. To address this problem, this paper proposes a computationally-efficient spatiotemporal receding horizon control (ST-RHC) scheme to generate a safe, dynamically feasible, energy-efficient trajectory in control space, where different driving tasks in dense traffic can be a… ▽ More

    Submitted 26 May, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: 16 pages, 13 figures, accepted for publication in IEEE Transactions on Intelligent Vehicles

  19. arXiv:2304.02253  [pdf, other

    cs.RO

    Flipbot: Learning Continuous Paper Flipping via Coarse-to-Fine Exteroceptive-Proprioceptive Exploration

    Authors: Chao Zhao, Chunli Jiang, Junhao Cai, Michael Yu Wang, Hongyu Yu, Qifeng Chen

    Abstract: This paper tackles the task of singulating and grasping paper-like deformable objects. We refer to such tasks as paper-flipping. In contrast to manipulating deformable objects that lack compression strength (such as shirts and ropes), minor variations in the physical properties of the paper-like deformable objects significantly impact the results, making manipulation highly challenging. Here, we p… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to International Conference on Robotics and Automation (ICRA) 2023

  20. Learn to Grasp via Intention Discovery and its Application to Challenging Clutter

    Authors: Chao Zhao, Chunli Jiang, Junhao Cai, Hongyu Yu, Michael Yu Wang, Qifeng Chen

    Abstract: Humans excel in grasping objects through diverse and robust policies, many of which are so probabilistically rare that exploration-based learning methods hardly observe and learn. Inspired by the human learning process, we propose a method to extract and exploit latent intents from demonstrations, and then learn diverse and robust grasping policies through self-exploration. The resulting policy ca… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE Robotics and Automation Letters (RA-L)

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 2, pp. 488-495, Feb. 2023

  21. arXiv:2304.02251  [pdf, other

    cs.RO

    ERRA: An Embodied Representation and Reasoning Architecture for Long-horizon Language-conditioned Manipulation Tasks

    Authors: Chao Zhao, Shuai Yuan, Chunli Jiang, Junhao Cai, Hongyu Yu, Michael Yu Wang, Qifeng Chen

    Abstract: This letter introduces ERRA, an embodied learning architecture that enables robots to jointly obtain three fundamental capabilities (reasoning, planning, and interaction) for solving long-horizon language-conditioned manipulation tasks. ERRA is based on tightly-coupled probabilistic inferences at two granularity levels. Coarse-resolution inference is formulated as sequence generation through a lar… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE Robotics and Automation Letters (RA-L)

  22. arXiv:2210.03340  [pdf, other

    cs.RO cs.MA

    A Novel Graph-based Motion Planner of Multi-Mobile Robot Systems with Formation and Obstacle Constraints

    Authors: Wenhang Liu, Jiawei Hu, Heng Zhang, Michael Yu Wang, Zhenhua Xiong

    Abstract: Multi-mobile robot systems show great advantages over one single robot in many applications. However, the robots are required to form desired task-specified formations, making feasible motions decrease significantly. Thus, it is challenging to determine whether the robots can pass through an obstructed environment under formation constraints, especially in an obstacle-rich environment. Furthermore… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  23. arXiv:2209.06675  [pdf, other

    cs.RO

    Volumetric-based Contact Point Detection for 7-DoF Grasping

    Authors: Junhao Cai, Jingcheng Su, Zida Zhou, Hui Cheng, Qifeng Chen, Michael Y Wang

    Abstract: In this paper, we propose a novel grasp pipeline based on contact point detection on the truncated signed distance function (TSDF) volume to achieve closed-loop 7-degree-of-freedom (7-DoF) grasping on cluttered environments. The key aspects of our method are that 1) the proposed pipeline exploits the TSDF volume in terms of multi-view fusion, contact-point sampling and evaluation, and collision ch… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted to Conference on Robot Learning (CoRL) 2022. Supplementary materials: https://openreview.net/forum?id=SrSCqW4dq9

  24. arXiv:2207.01452  [pdf, other

    cs.CV cs.RO

    Open-world Semantic Segmentation for LIDAR Point Clouds

    Authors: Jun Cen, Peng Yun, Shiwei Zhang, Junhao Cai, Di Luan, Michael Yu Wang, Ming Liu, Mingqian Tang

    Abstract: Current methods for LIDAR semantic segmentation are not robust enough for real-world applications, e.g., autonomous driving, since it is closed-set and static. The closed-set assumption makes the network only able to output labels of trained classes, even for objects never seen before, while a static network cannot update its knowledge base according to what it has seen. Therefore, in this work, w… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022. arXiv admin note: text overlap with arXiv:2011.10033, arXiv:2109.05441 by other authors

  25. arXiv:2204.10082  [pdf, other

    cs.RO

    Viko 2.0: A Hierarchical Gecko-inspired Adhesive Gripper with Visuotactile Sensor

    Authors: Chohei Pang, Qicheng Wang, Kinwing Mak, Hongyu Yu, Michael Yu Wang

    Abstract: Robotic grippers with visuotactile sensors have access to rich tactile information for grasping tasks but encounter difficulty in partially encompassing large objects with sufficient grip force. While hierarchical gecko-inspired adhesives are a potential technique for bridging performance gaps, they require a large contact area for efficient usage. In this work, we present a new version of an adap… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  26. arXiv:2204.08691  [pdf

    cs.RO cs.CV eess.IV

    A Thin Format Vision-Based Tactile Sensor with A Micro Lens Array (MLA)

    Authors: Xia Chen, Guanlan Zhang, Michael Yu Wang, Hongyu Yu

    Abstract: Vision-based tactile sensors have been widely studied in the robotics field for high spatial resolution and compatibility with machine learning algorithms. However, the currently employed sensor's imaging system is bulky limiting its further application. Here we present a micro lens array (MLA) based vison system to achieve a low thickness format of the sensor package with high tactile sensing per… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: 7 pages, 5 figures

  27. arXiv:2202.02179  [pdf, other

    cs.RO

    DelTact: A Vision-based Tactile Sensor Using Dense Color Pattern

    Authors: Guanlan Zhang, Yipai Du, Hongyu Yu, Michael Yu Wang

    Abstract: Tactile sensing is an essential perception for robots to complete dexterous tasks. As a promising tactile sensing technique, vision-based tactile sensors have been developed to improve robot performance in manipulation and grasping. Here we propose a new design of a vision-based tactile sensor, DelTact. The sensor uses a modular hardware architecture for compactness whilst maintaining a contact me… ▽ More

    Submitted 31 May, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 8 pages contents, 1 page references, 8 figures, 2 tables

  28. arXiv:2112.01135  [pdf, other

    cs.CV

    Open-set 3D Object Detection

    Authors: Jun Cen, Peng Yun, Junhao Cai, Michael Yu Wang, Ming Liu

    Abstract: 3D object detection has been wildly studied in recent years, especially for robot perception systems. However, existing 3D object detection is under a closed-set condition, meaning that the network can only output boxes of trained classes. Unfortunately, this closed-set condition is not robust enough for practical use, as it will identify unknown objects as known by mistake. Therefore, in this pap… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: Received by 3DV 2021

  29. arXiv:2108.04562  [pdf, other

    cs.CV

    Deep Metric Learning for Open World Semantic Segmentation

    Authors: Jun Cen, Peng Yun, Junhao Cai, Michael Yu Wang, Ming Liu

    Abstract: Classical close-set semantic segmentation networks have limited ability to detect out-of-distribution (OOD) objects, which is important for safety-critical applications such as autonomous driving. Incrementally learning these OOD objects with few annotations is an ideal way to enlarge the knowledge base of the deep learning models. In this paper, we propose an open world semantic segmentation syst… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV2021

  30. arXiv:2108.02448  [pdf, other

    cs.CV cs.AI cs.RO

    MFuseNet: Robust Depth Estimation with Learned Multiscopic Fusion

    Authors: Weihao Yuan, Rui Fan, Michael Yu Wang, Qifeng Chen

    Abstract: We design a multiscopic vision system that utilizes a low-cost monocular RGB camera to acquire accurate depth estimation. Unlike multi-view stereo with images captured at unconstrained camera poses, the proposed system controls the motion of a camera to capture a sequence of images in horizontally or vertically aligned positions with the same parallax. In this system, we propose a new heuristic me… ▽ More

    Submitted 6 August, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) + IEEE Robotics and Automation Letters (RA-L). arXiv admin note: substantial text overlap with arXiv:2001.08212

  31. arXiv:2105.00680  [pdf, other

    cs.RO

    Viko: An Adaptive Gecko Gripper with Vision-based Tactile Sensor

    Authors: Chohei Pang, Kinwing Mak, Yazhan Zhang, Yang Yang, Yu Alexander Tse, Michael Yu Wang

    Abstract: Monitoring the state of contact is essential for robotic devices, especially grippers that implement gecko-inspired adhesives where intimate contact is crucial for a firm attachment. However, due to the lack of deformable sensors, few have demonstrated tactile sensing for gecko grippers. We present Viko, an adaptive gecko gripper that utilizes vision-based tactile sensors to monitor contact state.… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: This paper is accepted to ICRA2021, please contact corresponding author Y. Tse ([email protected]) for details

  32. arXiv:2104.04170  [pdf, other

    cs.CV

    Stereo Matching by Self-supervision of Multiscopic Vision

    Authors: Weihao Yuan, Yazhan Zhang, Bingkun Wu, Siyu Zhu, Ping Tan, Michael Yu Wang, Qifeng Chen

    Abstract: Self-supervised learning for depth estimation possesses several advantages over supervised learning. The benefits of no need for ground-truth depth, online fine-tuning, and better generalization with unlimited data attract researchers to seek self-supervised solutions. In this work, we propose a new self-supervised framework for stereo matching utilizing multiple images captured at aligned camera… ▽ More

    Submitted 16 August, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  33. arXiv:2103.14359  [pdf, other

    cs.RO

    A Tactile Sensing Foot for Single Robot Leg Stabilization

    Authors: Guanlan Zhang, Yipai Du, Yazhan Zhan, Michael Yu Wang

    Abstract: Tactile sensing on human feet is crucial for motion control, however, has not been explored in robotic counterparts. This work is dedicated to endowing tactile sensing to legged robot's feet and showing that a single-legged robot can be stabilized with only tactile sensing signals from its foot. We propose a robot leg with a novel vision-based tactile sensing foot system and implement a processing… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: 7 pages, 9 figures, ICRA2021

    Journal ref: ICRA2021

  34. arXiv:2103.04027  [pdf, other

    cs.CV cs.RO

    Learning to Predict Vehicle Trajectories with Model-based Planning

    Authors: Haoran Song, Di Luan, Wenchao Ding, Michael Yu Wang, Qifeng Chen

    Abstract: Predicting the future trajectories of on-road vehicles is critical for autonomous driving. In this paper, we introduce a novel prediction framework called PRIME, which stands for Prediction with Model-based Planning. Unlike recent prediction works that utilize neural networks to model scene context and produce unconstrained trajectories, PRIME is designed to generate accurate and feasibility-guara… ▽ More

    Submitted 20 October, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: Accepted by The Conference on Robot Learning (CoRL) 2021. Project page at http://haoran-song.github.io/prime

  35. arXiv:2010.04931  [pdf, other

    cs.RO

    Origami-based Shape Morphing Fingertip to Enhance Grasping Stability and Dexterity

    Authors: Zicheng Kan, Yazhan Zhang, Chohei Pang, Michael Yu Wang

    Abstract: Adaptation to various scene configurations and object properties, stability and dexterity in robotic grasping manipulation is far from explored. This work presents an origami-based shape morphing fingertip design to actively tackle the grasping stability and dexterity problems. The proposed fingertip utilizes origami as its skeleton providing degrees of freedom at desired positions and motor-drive… ▽ More

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: Accepted to CASE2020

  36. Vacuum Driven Auxetic Switching Structure and Its Application on a Gripper and Quadruped

    Authors: Shuai Liu, Sheeraz Athar, Michael Yu Wang

    Abstract: The properties and applications of auxetics have been widely explored in the past years. Through proper utilization of auxetic structures, designs with unprecedented mechanical and structural behaviors can be produced. Taking advantage of this, we present the development of novel and lowcost 3D structures inspired by a simple auxetic unit. The core part, which we call the body in this paper, is a… ▽ More

    Submitted 28 August, 2020; originally announced August 2020.

    Comments: 6 pages, 14 Figures, International Conference on Intelligent Robots and Systems (IROS), 2020

    Journal ref: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA

  37. arXiv:2008.00637  [pdf, other

    cs.CV

    Self-supervised Object Tracking with Cycle-consistent Siamese Networks

    Authors: Weihao Yuan, Michael Yu Wang, Qifeng Chen

    Abstract: Self-supervised learning for visual object tracking possesses valuable advantages compared to supervised learning, such as the non-necessity of laborious human annotations and online training. In this work, we exploit an end-to-end Siamese network in a cycle-consistent self-supervised framework for object tracking. Self-supervision can be performed by taking advantage of the cycle consistency in t… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  38. arXiv:2004.04976  [pdf

    cs.RO

    A Flexible Connector for Soft Modular Robots Based on Micropatterned Intersurface Jamming

    Authors: Yu Alexander Tse, Shuai Liu, Yang Yang, Michael Yu Wang

    Abstract: Soft modular robots enable more flexibility and safer interaction with the changing environment than traditional robots. However, it has remained challenging to create deformable connectors that can be integrated into soft machines. In this work, we propose a flexible connector for soft modular robots based on micropatterned intersurface jamming. The connector is composed of micropatterned dry adh… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

    Comments: Accepted to ROBOSOFT 2020

  39. PiP: Planning-informed Trajectory Prediction for Autonomous Driving

    Authors: Haoran Song, Wenchao Ding, Yuxuan Chen, Shaojie Shen, Michael Yu Wang, Qifeng Chen

    Abstract: It is critical to predict the motion of surrounding vehicles for self-driving planning, especially in a socially compliant and flexible way. However, future prediction is challenging due to the interaction and uncertainty in driving behaviors. We propose planning-informed trajectory prediction (PiP) to tackle the prediction problem in the multi-agent setting. Our approach is differentiated from th… ▽ More

    Submitted 18 January, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: European Conference on Computer Vision (ECCV) 2020; Project page at http://haoran-song.github.io/planning-informed-prediction

  40. arXiv:2001.08212  [pdf, other

    cs.CV

    Active Perception with A Monocular Camera for Multiscopic Vision

    Authors: Weihao Yuan, Rui Fan, Michael Yu Wang, Qifeng Chen

    Abstract: We design a multiscopic vision system that utilizes a low-cost monocular RGB camera to acquire accurate depth estimation for robotic applications. Unlike multi-view stereo with images captured at unconstrained camera poses, the proposed system actively controls a robot arm with a mounted camera to capture a sequence of images in horizontally or vertically aligned positions with the same parallax.… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

    Comments: 8 pages

  41. arXiv:1912.07024  [pdf, other

    cs.RO

    Multi-Object Rearrangement with Monte Carlo Tree Search:A Case Study on Planar Nonprehensile Sorting

    Authors: Haoran Song, Joshua A. Haustein, Weihao Yuan, Kaiyu Hang, Michael Yu Wang, Danica Kragic, Johannes A. Stork

    Abstract: In this work, we address a planar non-prehensile sorting task. Here, a robot needs to push many densely packed objects belonging to different classes into a configuration where these classes are clearly separated from each other. To achieve this, we propose to employ Monte Carlo tree search equipped with a task-specific heuristic function. We evaluate the algorithm on various simulated and real-wo… ▽ More

    Submitted 18 January, 2021; v1 submitted 15 December, 2019; originally announced December 2019.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2020; Project page at http://haoran-song.github.io/mcts-sorting/

  42. arXiv:1910.03973  [pdf, other

    cs.RO cs.CV cs.LG

    Towards Learning to Detect and Predict Contact Events on Vision-based Tactile Sensors

    Authors: Yazhan Zhang, Weihao Yuan, Zicheng Kan, Michael Yu Wang

    Abstract: In essence, successful grasp boils down to correct responses to multiple contact events between fingertips and objects. In most scenarios, tactile sensing is adequate to distinguish contact events. Due to the nature of high dimensionality of tactile information, classifying spatiotemporal tactile signals using conventional model-based methods is difficult. In this work, we propose to predict and c… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: 10 pages, 7 figures, Accepted to conference on Robot Learning (CoRL 2019)

  43. arXiv:1906.09460  [pdf, other

    cs.RO

    Effective Estimation of Contact Force and Torque for Vision-based Tactile Sensor with Helmholtz-Hodge Decomposition

    Authors: Yazhan Zhang, Zicheng Kan, Yang Yang, Alexander Yu Tse, Michael Yu Wang

    Abstract: Retrieving rich contact information from robotic tactile sensing has been a challenging, yet significant task for the effective perception of object properties that the robot interacts with. This work is dedicated to developing an algorithm to estimate contact force and torque for vision-based tactile sensors. We first introduce the observation of the contact deformation patterns of hyperelastic m… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

    Comments: 8 pages, 11 figures, submitted and accepted by IROS2019

  44. arXiv:1810.02653  [pdf, other

    cs.RO cs.LG

    FingerVision Tactile Sensor Design and Slip Detection Using Convolutional LSTM Network

    Authors: Yazhan Zhang, Zicheng Kan, Yu Alexander Tse, Yang Yang, Michael Yu Wang

    Abstract: Tactile sensing is essential to the human perception system, so as to robot. In this paper, we develop a novel optical-based tactile sensor "FingerVision" with effective signal processing algorithms. This sensor is composed of soft skin with embedded marker array bonded to rigid frame, and a web camera with a fisheye lens. While being excited with contact force, the camera tracks the movements of… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

    Comments: 7 pages, 7 figures, submitted to ICRA2019

  45. arXiv:1809.04322  [pdf, other

    cs.RO cs.AI cs.LG

    Reinforcement Learning in Topology-based Representation for Human Body Movement with Whole Arm Manipulation

    Authors: Weihao Yuan, Kaiyu Hang, Haoran Song, Danica Kragic, Michael Y. Wang, Johannes A. Stork

    Abstract: Moving a human body or a large and bulky object can require the strength of whole arm manipulation (WAM). This type of manipulation places the load on the robot's arms and relies on global properties of the interaction to succeed---rather than local contacts such as grasping or non-prehensile pushing. In this paper, we learn to generate motions that enable WAM for holding and transporting of human… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Submitted to RA-L with ICRA 2019

  46. Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning

    Authors: Weihao Yuan, Johannes A. Stork, Danica Kragic, Michael Y. Wang, Kaiyu Hang

    Abstract: Rearranging objects on a tabletop surface by means of nonprehensile manipulation is a task which requires skillful interaction with the physical world. Usually, this is achieved by precisely modeling physical properties of the objects, robot, and the environment for explicit planning. In contrast, as explicitly modeling the physical environment is not always feasible and involves various uncertain… ▽ More

    Submitted 15 March, 2018; originally announced March 2018.

    Comments: 2018 International Conference on Robotics and Automation