Deep diveMay 12, 2026

Per-step survival: a more honest metric for VLA evaluation

Abstract

A per-step survival metric for long-horizon manipulation, validated across 14 tasks and four embodiments.

Full writeup coming soon. ← Back to research