OpenBot
EGO dataset, License required

EgoSchema

Long-form egocentric video QA

A diagnostic video-language benchmark derived from Ego4D, designed to test temporal and causal reasoning over long first-person videos.

Signals

video, questions, answers, temporal reasoning labels

Formats

JSON, Ego4D video references

OpenBot fit

  • Video-language model evaluation
  • Long-horizon plan checking
  • Narrative consistency tests for agents

Integration notes

  • Not a manipulation training set, but helpful for evaluating whether agents understand long first-person context.
  • Underlying video access follows Ego4D licensing.
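
As a rough illustration of the evaluation use case, the sketch below scores multiple-choice predictions against a JSON answer key. The layout assumed here (a mapping from question ID to the index of the correct option) and the names `accuracy`, `answers`, and `predictions` are hypothetical; check the released JSON files for the actual field names before adapting it.

```python
import json


def accuracy(predictions, answers):
    """Return the fraction of answered questions predicted correctly.

    Both arguments map a question ID to an option index. Question IDs
    missing from `predictions` count as wrong.
    """
    if not answers:
        return 0.0
    correct = sum(
        1 for qid, gold in answers.items() if predictions.get(qid) == gold
    )
    return correct / len(answers)


# Hypothetical stand-in for an answer-key JSON file (assumed layout).
answers = json.loads('{"q1": 2, "q2": 0, "q3": 4}')

# Hypothetical model outputs: one chosen option index per question ID.
predictions = {"q1": 2, "q2": 1, "q3": 4}

print(f"accuracy = {accuracy(predictions, answers):.3f}")
```

Counting unanswered questions as wrong keeps the score comparable across agents that skip different questions.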