EGO datasetGated
Xperience-10M
Multimodal human experience for embodied AI
A large egocentric multimodal dataset with synchronized video streams, audio, depth, poses, mocap, IMU, and hierarchical language annotations.
Signals
videoaudiodepthcamera posehand mocapIMUlanguage
Formats
Hugging Face datasetmultimodal episode files
OpenBot fit
- - World model pretraining
- - Real-to-sim and sim-to-real data alignment
- - Multimodal episode quality checks
Integration notes
- - Very large and controlled-access; the practical OpenBot path is metadata indexing plus targeted subset pulls.
- - Useful as a reference for the signals OpenBot Data should preserve.
