The 2024
and 2025
editions built a sim-to-real pipeline for tactile manipulation with visuo-tactile
sensors, covering tasks such as peg insertion, lock opening, and visuo-tactile fusion.
However, many real-world scenarios — transparent objects, liquids, granular materials and fragile solids —
remain extremely difficult to simulate and render with sufficient fidelity.
Real-world demonstration data offers a powerful complement to simulation,
enabling policy learning for these challenging domains.
The ManiSkill-ViTac Challenge 2026 addresses this gap.
Built on the ViTaMIn-B
bimanual visuo-tactile data collection platform, the challenge provides real-world demonstration
trajectories for contact-rich bimanual tasks. Participants train language-conditioned visuo-tactile
policies directly from these demonstrations and are evaluated on the same physical hardware.
By introducing language guidance, the challenge aims to advance research toward
TVLA (Tactile-Vision-Language-Action) models —
unifying tactile sensing, visual perception and language understanding within a single action policy.
Bridging the Reality Gap
Tasks involve transparent objects, liquids and material fragmentation — scenarios where the reality gap in simulation remains particularly wide, making real-world demonstration data essential.
ViTaMIn-B Bimanual Platform
All demonstration data is collected on ViTaMIn-B, a bimanual visuo-tactile teleoperation system that captures synchronized vision, tactile and proprioceptive streams at scale.
Language-Conditioned TVLA
Each task is paired with natural-language instructions. Policies must ground language into tactile-visual control, pushing the frontier of Tactile-Vision-Language-Action models.