EGO dataset · License required
EgoSchema
Long-form egocentric video QA
A diagnostic video-language benchmark derived from Ego4D, designed to test temporal and causal reasoning over long first-person videos.
Signals
video, questions, answers, temporal reasoning labels
Formats
JSON, Ego4D video references
OpenBot fit
- Video-language model evaluation
- Long-horizon plan checking
- Narrative consistency tests for agents
Integration notes
- Not a manipulation training set, but helpful for evaluating whether agents understand long first-person context.
- Underlying video access follows Ego4D licensing.
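
As a rough illustration of the video-language model evaluation use case, the sketch below scores a predictor against QA records loaded from JSON. The field names (`video_ref`, `question`, `options`, `answer`), the sample records, and the `always_first` baseline are all hypothetical and chosen only for illustration. They are not the official EgoSchema schema, and the sketch does not load any Ego4D video.

```python
import json
from typing import Callable, Dict, List

# Hypothetical records: the field names and contents are assumptions made
# for this sketch, not the official EgoSchema annotation format.
SAMPLE_JSON = """
[
  {"video_ref": "clip-a", "question": "What was the overall goal?",
   "options": ["cooking", "cleaning", "repairing"], "answer": 0},
  {"video_ref": "clip-b", "question": "Why did the person stop?",
   "options": ["tool broke", "task finished", "phone rang"], "answer": 1}
]
"""

Record = Dict[str, object]


def score(records: List[Record], predict: Callable[[Record], int]) -> float:
    """Return the fraction of records where predict() picks the labeled option."""
    if not records:
        return 0.0
    correct = sum(1 for r in records if predict(r) == r["answer"])
    return correct / len(records)


def always_first(record: Record) -> int:
    """Trivial baseline standing in for a real video-language model."""
    return 0


records = json.loads(SAMPLE_JSON)
print(f"accuracy: {score(records, always_first):.2f}")
```

In practice `always_first` would be replaced by a function that resolves `video_ref` to a licensed Ego4D clip, runs the model, and returns the index of the chosen option.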
