Embodied AI Datasets

A collection of key datasets used for training embodied AI models, including robot manipulation and vision-language-action models.

Ecosystem Snapshot

28 datasets, 1 benchmark

Industry Insights

This page aggregates the core open datasets that underpin embodied AI research and VLA model training. These datasets provide diverse, large-scale robot demonstration data spanning multiple robot embodiments, tasks, scenes, and environments.

Key datasets include Open X-Embodiment (1M+ episodes across 22 robot types), DROID (87K+ in-the-wild trajectories), and BridgeData (60K+ WidowX demonstrations for generalization research).
