OpenBot Data

Keep datasets local. Review selected videos.

Audit LeRobot and robot-video datasets locally. Send one selected video to Hosted Data for evidence, approval, and export.

Install openbot-data · Request hosted beta

openbot-data v0.0.3 · Hosted Data is free during 0.0.x

Choose by data scope

Full dataset: Stays on your machine · openbot-data · Audit + readiness · No account, no upload

Selected video: Sent for processing · Hosted Data · Review + export · Free Beta, approval required

Full datasets stay local. Hosted Data receives one selected video or registered reference.

Open source · local

openbot-data

PyPI · v0.0.3

Local inspection, readiness, and safe repair planning.

Scope: Full dataset · video directories · LeRobot v2.1 / v3
Proof: Audit · snapshot · diff · ACT / SmolVLA readiness
Repair: Safe copy plan · official merge verification
Access: No account · no upload

View on PyPI · View source · Read toolkit docs

Hosted · Free Beta

Hosted Data beta

Free during 0.0.x

Evidence-backed annotation, human review, and approved export.

Scope: Selected public / signed MP4 · registered video reference
Proof: Evidence timeline · review history · approval lineage
Export: JSONL · LeRobot sidecar · RLDS-compatible metadata
Price: Free Beta during 0.0.x · no purchase

Request hosted beta · Read Data API docs

Hosted Free Beta capabilities

What Hosted Data adds when a video needs review.

Every generated segment stays reviewable, attributable, and blocked from export until approval.

01
Timestamped subtask timelines
Turn model suggestions into editable timelines for actions, objects, targets, state changes, and outcomes.
02
Evidence before scores
Keep timestamped frames, contact sheets, model version, and prompt provenance with every suggestion.
03
Verified video intake
Download selected videos under strict limits, decode them with OpenCV, and report duration, FPS, resolution, and failures.
04
Human review gate
Check segment IDs and boundaries, edit the full timeline, and approve it before training export.
05
Complete review history
Preserve the original model suggestion and every accepted human revision instead of overwriting history.
06
Approved training metadata exports
Generate JSONL, LeRobot subtask sidecars, or RLDS-compatible metadata from approved timelines.
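
The review gate, review history, and approved export described above can be sketched in a few lines. This is a minimal illustration, not the Hosted Data implementation: the `Segment`, `Revision`, and `Timeline` classes, the field names, and the JSONL record layout are all assumptions made for the example.

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Segment:
    segment_id: str
    start_s: float
    end_s: float
    label: str


@dataclass(frozen=True)
class Revision:
    author: str  # "model" or a reviewer name (hypothetical convention)
    segments: Tuple[Segment, ...]


def check_segments(segments):
    """Reject duplicate IDs, empty or negative spans, and overlapping boundaries."""
    ids = [s.segment_id for s in segments]
    if len(ids) != len(set(ids)):
        raise ValueError("duplicate segment IDs")
    for s in segments:
        if not 0 <= s.start_s < s.end_s:
            raise ValueError(f"bad boundaries for {s.segment_id}")
    ordered = sorted(segments, key=lambda s: s.start_s)
    for earlier, later in zip(ordered, ordered[1:]):
        if later.start_s < earlier.end_s:
            raise ValueError(f"{later.segment_id} overlaps {earlier.segment_id}")


@dataclass
class Timeline:
    video_key: str
    revisions: List[Revision] = field(default_factory=list)
    approved_index: Optional[int] = None

    def add_revision(self, revision):
        # Append only: the model suggestion and earlier edits are never overwritten.
        self.revisions.append(revision)
        self.approved_index = None  # a new revision needs a fresh approval
        return len(self.revisions) - 1

    def approve(self, index):
        check_segments(self.revisions[index].segments)
        self.approved_index = index


def export_jsonl(timeline):
    """Return one JSON object per line, but only for an approved revision."""
    if timeline.approved_index is None:
        raise PermissionError("timeline is not approved; export is blocked")
    revision = timeline.revisions[timeline.approved_index]
    return "\n".join(
        json.dumps(
            {
                "video_key": timeline.video_key,
                "revision": timeline.approved_index,
                "segment_id": s.segment_id,
                "start_s": s.start_s,
                "end_s": s.end_s,
                "label": s.label,
            },
            sort_keys=True,
        )
        for s in revision.segments
    )
```

In this sketch, adding a revision clears any earlier approval, so an export always reflects a revision that a reviewer explicitly approved.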

Pipeline

One selected video. Six traceable stages.

One request record is preserved from intake to export. Human approval stays in the loop.

01
Select
video or episode
02
Queue
one request record
03
Decode
frames + metadata
04
Annotate
evidence-backed timeline
05
Review
human-approved revision
06
Export
JSONL · LeRobot · RLDS
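
One way to picture the six stages is a single request record that only moves forward one stage at a time and cannot reach export without approval. The sketch below is illustrative only: the `Stage` enum, the `RequestRecord` class, and its fields are hypothetical names, not part of the Hosted Data API.

```python
from dataclasses import dataclass, field
from enum import IntEnum


class Stage(IntEnum):
    SELECT = 1
    QUEUE = 2
    DECODE = 3
    ANNOTATE = 4
    REVIEW = 5
    EXPORT = 6


@dataclass
class RequestRecord:
    request_id: str
    stage: Stage = Stage.SELECT
    approved: bool = False
    history: list = field(default_factory=lambda: [Stage.SELECT])

    def advance(self):
        """Move to the next stage; export stays blocked until a human approves."""
        if self.stage is Stage.EXPORT:
            raise ValueError("request already reached the final stage")
        next_stage = Stage(self.stage + 1)
        if next_stage is Stage.EXPORT and not self.approved:
            raise PermissionError("human approval is required before export")
        self.stage = next_stage
        self.history.append(next_stage)  # the same record keeps the full trace
        return next_stage
```

Because the record carries its own `history`, every stage from selection to export stays traceable on one object.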

Compatibility

The hosted V1 contract we can actually verify.

Current hosted input

Public HTTPS MP4 · Short-lived signed MP4 URL · Registered dataset + video key

Approved exports

JSONL segments · LeRobot subtask sidecar · RLDS-compatible metadata

Processor

OpenCV decode + metadata · Timestamped contact sheets · Structured model adapter · Provider-neutral result schema
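
A client could check that a source matches one of the three hosted input forms listed above before submitting it. The sketch below is an assumption-heavy illustration: the `classify_input` function, the `dataset` and `video_key` dictionary keys, and the rule that a query string marks a signed URL are all hypothetical and do not describe the real intake checks.

```python
from urllib.parse import urlsplit


def classify_input(source):
    """Return 'registered_reference', 'signed_mp4', or 'public_mp4'."""
    if isinstance(source, dict):
        # Assumed shape of a registered reference: dataset name plus video key.
        if source.get("dataset") and source.get("video_key"):
            return "registered_reference"
        raise ValueError("registered reference needs 'dataset' and 'video_key'")

    parts = urlsplit(source)
    if parts.scheme != "https" or not parts.netloc:
        raise ValueError("only HTTPS URLs are accepted")
    if not parts.path.lower().endswith(".mp4"):
        raise ValueError("only MP4 files are accepted")
    # Assumption for this sketch: a signed URL carries its signature in the query.
    return "signed_mp4" if parts.query else "public_mp4"
```

For example, `classify_input("https://example.com/clip.mp4")` yields `"public_mp4"`, while an `http://` URL or a non-MP4 path raises `ValueError`.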

Choose the path that fits your data.

Use openbot-data for local dataset inspection. Request the Hosted Data Free Beta when a selected video or episode needs annotation, human review, or approved export. No purchase is required during 0.0.x.

Install openbot-data · Request hosted beta
