Skip to main content

Velociraptor Model

A bipedal predator with distinctive sickle claws, trained for locomotion and predatory strike behavior.

Specifications

PropertyValue
Hip height~0.5m (simulated)
Total mass~15 kg (simulated, scaled)
Hinge joints24
Actuators22
Observation dim67
Action dim22

Anatomy

  • Torso - Elongated body with simplified head/neck
  • Legs - Digitigrade hind limbs (hip pitch/roll, knee, ankle, 2 toe digits per leg)
  • Sickle claws - Retractable claw on digit 2 of each foot (the weapon)
  • Arms - Stub forelimbs with shoulder pitch/roll actuators
  • Tail - 5-segment counterbalance (4 actuated segments)

Action Space (22 dims)

IndexActuatorType
0-5Right leg (hip pitch/roll, knee, ankle, toe d3/d4)Position
6Right sickle clawMotor
7-12Left leg (hip pitch/roll, knee, ankle, toe d3/d4)Position
13Left sickle clawMotor
14-17Tail (pitch 1, yaw 1, pitch 2, pitch 3)Position
18-19Right arm (shoulder pitch/roll)Position
20-21Left arm (shoulder pitch/roll)Position

Training Stages

  1. Balance - Learn to stand without falling
  2. Locomotion - Walk and run forward
  3. Strike - Sprint and attack prey with sickle claws

Diagnostic Metrics (Stage 3)

After each training stage the eval command reports:

MetricDescription
mean_forward_velocityAverage forward speed (m/s)
gait_symmetryLeft/right stride symmetry ∈ [0, 1]
stride_frequencyStep frequency (Hz)
cost_of_transportEnergy efficiency (lower is better)
mean_pelvis_heightUpright stability (m)
mean_heading_alignmentcos θ toward prey ∈ [-1, 1]
success_rateFraction of steps with sickle-claw strike
min_prey_distanceClosest approach to prey (m)

Training Results (PPO, 2026-03-15)

Full 3-stage curriculum training with seed 42, 4 parallel environments. Total training time: 11h 25m 15s.

StageDescriptionTimestepsBest Eval RewardFinal Eval RewardAvg Fwd VelAvg Ep LengthDuration
1 — BalanceStand and balance without falling6,000,0001964.43 +/- 27.391839.99 +/- 252.170.11 +/- 0.07 m/s969.5 steps (9.70s)2h 57m
2 — LocomotionForward walking/running8,000,0002678.68 +/- 4.072677.51 +/- 5.863.47 +/- 0.02 m/s1000.0 steps (10.00s)4h 36m
3 — StrikeSprint and strike prey with sickle claw8,000,0001366.19 +/- 76.291258.93 +/- 305.562.02 +/- 0.30 m/s191.2 steps (1.91s)3h 52m

Key observations:

  • Stage 2 achieves perfect episode length (1000/1000 steps) with very low velocity variance (3.47 +/- 0.02 m/s), indicating extremely stable locomotion
  • Stage 2 final and best rewards are nearly identical (2677.51 vs 2678.68), showing no overfitting or collapse
  • Stage 3 achieves 93.3% strike success rate with short episodes (191.2 steps), demonstrating efficient prey pursuit

Training Videos

Velociraptor PPO training progression across each curriculum stage.

STAGE 1

Balance

Standing without falling

STAGE 2

Locomotion

Walking and running forward

STAGE 3

Strike

Sprinting and attacking with sickle claws

Usage

cd environments/velociraptor

# View the model
python scripts/view_model.py

# Train stage 1
python scripts/train_sb3.py train --stage 1 --timesteps 1000000