Bunny-VisionPro

Real-Time Bimanual Dexterous Teleoperation for Imitation Learning

¹ The University of Hong Kong ² University of California, San Diego * Equal contribution

Bunny-VisionPro is a real-time bimanual teleoperation system that prioritizes safety and minimal delay. It also features human haptic feedback to enhance immersion. High-quality demonstration is collected to improve imitation learning.

Abstract

Teleoperation is a crucial tool for collecting human demonstrations, but controlling robots with bimanual dexterous hands remains a challenge. We introduce Bunny-VisionPro, a real-time bimanual dexterous teleoperation system that leverages a VR headset. Unlike previous vision-based teleoperation systems, we design novel low-cost devices to provide haptic feedback to the operator, enhancing immersion. Our system prioritizes safety by incorporating collision and singularity avoidance while maintaining real-time performance through innovative designs. Bunny-VisionPro outperforms prior systems on a standard task suite, achieving higher success rates and reduced task completion times. Moreover, the high-quality teleoperation demonstrations improve downstream imitation learning performance, leading to better generalizability. Notably, Bunny-VisionPro enables imitation learning with challenging multi-stage, long-horizon dexterous manipulation tasks, which have rarely been addressed in previous work. Our system's ability to handle bimanual manipulations while prioritizing safety and real-time performance makes it a powerful tool for advancing dexterous manipulation and imitation learning.

Skincare Teleoperation

Hand poses captured by Apple Vision Pro are converted into robot motion control commands for real-time teleoperation. We prioritize safety through singularity checks and collision avoidance, enabling the robot to perform human skincare tasks safely in teleoperation mode.

Teleoperation System

Human Haptic Feedback

Effective human manipulation integrates visual and tactile feedback, but many vision-based teleoperation systems overlook haptic feedback. We developed a cost-effective haptic feedback system using ERM actuators to address this. It processes tactile signals from robot hands and drives vibration motors to simulate sensations, allowing operators to perceive and respond to the environment more intuitively, enhancing manipulation performance.

Haptic Feedback Demo

Haptics User Study

w/o haptic feedback

w/ haptic feedback

Imitation Learning

We evaluate the quality of demonstrations collected by our system by training popular imitation learning algorithms — ACT, Diffusion Policy, and DP3 — and testing their generalization performance in term of spatial generalization and unseen scenarios. Additionally, we train on challenging long-horizon tasks to showcase our system's effectiveness in collecting high-quality demonstrations.

Long-horizon Tasks

Generalization Results

Cleaning pan

Spatial Generalization

Unseen Object

Uncovering & Pouring

Spatial Generalization

Unseen Object

Grasping toy

Spatial Generalization

Unseen Object

Safety and Real-time Performance

Arm Motion Control: Bunny-VisionPro vs AnyTeleop+

From the video, AnyTeleop+ shows strange behaviors such as excessive joint movement in joint 2 and the arm lowering unexpectedly. It also suffers from unstable latency, failing to track the target end-effector position (the transluscent ball) in real time. In contrast, Bunny-VisionPro produces more natural, stable and responsive trajectories, not only enhancing teleoperation accuracy and efficiency, but also enabling high-quality data collection for effective imitation learning.

AnyTeleop+

Bunny-VisionPro

Collision Avoidance

In our simulation, we use blue cubes to represent obstacles. The robot's goal is to move around without hitting these obstacles. Without collision avoidance, the robot collides with them, shown as penetration in the simulation, which would be dangerous if deployed in real robots. With collision avoidance, the robot successfully bypasses the obstacles in real time, demonstrating the effectiveness of our collision avoidance system.

w/o collision avoidance

w/ collision avoidance

Comparing Wearable Devices for Hand Pose Tracking and Robot Retargeting

More Teleoperation Tasks

Haptic Feedback Tasks

Opening Cabinet

w/o haptic feedback

w/ haptic feedback

Cleaning Whiteboard

w/o haptic feedback

w/ haptic feedback

Kitchen Tasks

Motion-constrained Tasks

Dynamic Tasks

BibTex


@article{bunny-visionpro,
    title   = {Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning},
    author  = {Runyu Ding, Yuzhe Qin, Jiyue Zhu, Chengzhe Jia, Shiqi Yang, Ruihan Yang, Xiaojuan Qi, and Xiaolong Wang},
    year    = {2024}
}

If you have any questions, please feel free to contact Runyu Ding.