gdog-sim

Wheeled quadruped simulator built with Genesis.

randomized robot morphology and terrain
photo-realistic rendering with dynamic shadows
PID-stabilized suspension control
predictive inverted-pendulum longitudinal control (command-aware stance IK + brake feedforward)
Smartphone remote control via gdog-remote companion app with WebSocket and optional WebRTC 4 UDP

Quick Start

Download source code, install dependencies, and run the sim

Clone repo:

git clone https://github.com/Felipegalind0/gdog-sim
cd gdog-sim

Create .venv if it does not exist, activate it, and install dependencies:

python3 -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install -r requirements.txt

Run the sim with interactive viewer:

.venv/bin/python main.py --render --seed 829297643 --quick-tunnel
.venv/bin/python main.py --render --seed 829297643 --quick-tunnel --quick-tunnel-protocol http2 --quick-tunnel-edge-ip-version 4

What This Project Contains

main.py: simulation entrypoint, scene setup, control loop, suspension PID + IK, wheel drive, HUD
procedural_gen.py: randomized robot URDF generation and randomized terrain/albedo generation
gdog.urdf.jinja: robot template used to build a temporary URDF at runtime
camera_controller.py: follow camera, mouse/keyboard hooks, HUD overlays, in-view command input
commands.py: command state container, remote payload parsing, suspension command parser/executor
network.py: FastAPI app exposing /ws and optional /offer
pid.py: PID controller with telemetry for debug overlays
math_utils.py: quaternion helpers, leg IK helpers, camera math

Runtime Behavior

On startup, main.py does the following:

Parses CLI arguments (--render, --video, --seed, --spawn-bone, --steps)
Initializes Genesis with GPU backend and a deterministic runtime seed
Builds a lunar-looking scene:
- random terrain patch layout (center patch forced flat for stable spawn)
- moon-style grayscale albedo texture with crater-like variation
- directional sun-like light
Generates a randomized robot from gdog.urdf.jinja
Splits robot joints into leg joints (position-controlled) and wheel joints (velocity-controlled)
Starts FastAPI control server in a background thread (0.0.0.0:8000)
Runs the main simulation loop:

consumes remote and keyboard commands
computes speed-adaptive drive command envelopes
predicts desired longitudinal acceleration from normalized command and speed error
maps desired acceleration to desired tilt using $\theta_{des} = \arctan\left(\frac{a_{des}}{g}\right)$
converts desired tilt to fore-aft leg placement target and applies smoothed two-link IK
scales brake authority with predictive deceleration demand before velocity ramp limiting
applies roll/pitch stabilization through PID + two-link vertical IK
computes skid-steer wheel targets (left = vx - omega, right = vx + omega)
updates viewer/capture camera and overlays

Longitudinal Control Architecture

The simulator now uses a command-predictive longitudinal controller instead of purely reactive stance shifting.

Control flow per simulation step:

Read longitudinal command and normalize it to [-1, 1] after drive envelope clamping.
Convert normalized command to desired forward speed target.
Compute speed error against measured forward speed.
Convert speed error to desired acceleration target and clamp to physical limits.
Convert desired acceleration to desired pendulum tilt with $\theta_{des} = \arctan\left(\frac{a_{des}}{g}\right)$.
Convert desired tilt to desired leg X placement (inverted-pendulum wheel placement proxy).
Filter and clamp leg X target, then apply IK for each leg.
Compute predictive brake feedforward scale from deceleration alignment and apply it to brake ramp limit.

Sign convention used by stance IK:

positive leg X: wheels move backward relative to body
negative leg X: wheels move forward relative to body

This makes acceleration and deceleration one symmetric signal with opposite sign, while still allowing independent safety limits for traction and tip risk.

Interpretation: the controller estimates the non-gravity longitudinal force needed to balance an inverted pendulum via desired acceleration, then realizes that demand through wheel braking/drive authority plus stance placement.

Key Tuning Parameters

Primary predictive stance and braking knobs in main.py:

STANCE_SHIFT_MAX_LEG_X_M: hard fore-aft IK travel cap
STANCE_SHIFT_CMD_SPEED_MAX_MPS: maps normalized command to desired speed target
STANCE_SHIFT_SPEED_ERROR_TO_ACCEL_GAIN: converts speed error to desired acceleration
STANCE_SHIFT_ACCEL_MAX_MPS2: clamp for desired acceleration magnitude
STANCE_SHIFT_TILT_TO_LEG_X_GAIN: maps desired tilt to leg X shift
STANCE_SHIFT_FILTER_ALPHA: stance target smoothing factor
DRIVE_BRAKE_PREDICTIVE_GAIN: extra brake authority from predictive decel demand

Existing drive safety/ramp controls still apply on top:

DRIVE_ACCEL_RESPONSE_GAIN, DRIVE_BRAKE_RESPONSE_GAIN
DRIVE_VX_ACCEL_LIMIT[_STATIONARY], DRIVE_VX_DECEL_LIMIT[_STATIONARY]
DRIVE_BRAKE_THROTTLE_MIN, DRIVE_BRAKE_REVERSE_SCALE

Requirements

Python 3.11
A Python virtual environment (existing .venv in this repo is preferred)
Upstream Genesis wheel support for your platform

Supported in this repo today:

macOS arm64
Linux x86_64
Linux ARM64 (for example Ubuntu 24 on NVIDIA DGX Spark) via custom compile script

Installation

Install simulator dependencies:

source .venv/bin/activate
python -m pip install -r requirements.txt

Linux ARM64 (NVIDIA DGX Spark / Ubuntu 24)

Since upstream Genesis binaries (quadrants, etc.) are missing for Linux ARM64, you must compile them from source.

Option 1: Build with Docker (Recommended for DGX/HPC clusters) If you do not have sudo privileges on the machine, you can run everything inside a container:

docker build -t gdog-sim .
docker run --rm --ipc=host -p 8000:8000 --gpus all \
  -e NVIDIA_DRIVER_CAPABILITIES=all \
  -e DISPLAY=$DISPLAY \
  -v /tmp/.X11-unix:/tmp/.X11-unix:rw \
  gdog-sim python main.py --render --host 0.0.0.0

Note on Ubuntu 24 & Wayland: Ubuntu 24 defaults to Wayland, even on modern NVIDIA drivers. If echo $XDG_SESSION_TYPE returns wayland, the X11 bridge above relies on Xwayland and might have hardware acceleration penalties. To pass Wayland natively, use:
docker run --rm --ipc=host -p 8000:8000 --gpus all \
  -e NVIDIA_DRIVER_CAPABILITIES=all \
  -e WAYLAND_DISPLAY=$WAYLAND_DISPLAY \
  -e XDG_RUNTIME_DIR=$XDG_RUNTIME_DIR \
  -v $XDG_RUNTIME_DIR/$WAYLAND_DISPLAY:$XDG_RUNTIME_DIR/$WAYLAND_DISPLAY:rw \
  gdog-sim python main.py --render --host 0.0.0.0

(If your cluster uses Apptainer/Singularity or Podman instead of Docker, the same Dockerfile will compile correctly).

Optional WebRTC support:

python -m pip install -r requirements-webrtc-optional.txt

Running

Interactive viewer (continuous until window close or Ctrl+C):

python main.py --render

Deterministic randomized world:

python main.py --render --seed 12345

Deterministic randomized world with a random terrain bone prop:

python main.py --render --seed 12345 --spawn-bone

Custom backend bind host/port:

python main.py --render --host 0.0.0.0 --port 8000

Fixed-length run:

python main.py --render --steps 2000

Unlimited loop explicitly:

python main.py --render --steps 0

Video capture mode:

python main.py --video

Video mode writes wheeled_go2.mp4 (50 FPS) when the run completes.

Cloudflare Quick Tunnel (No API Token)

If you want to use the HTTPS GitHub Pages remote with your local simulator backend, run an ephemeral Cloudflare Quick Tunnel.

Install once (macOS):

brew install cloudflared

Then launch sim with tunnel:

python main.py --render --quick-tunnel

If your network is slow to provision the quick tunnel URL, increase wait time:

python main.py --render --quick-tunnel --quick-tunnel-timeout 60

For restrictive guest/corporate Wi-Fi, force TCP-based edge transport and IPv4:

python main.py --render --quick-tunnel --quick-tunnel-protocol http2 --quick-tunnel-edge-ip-version 4

On startup, the simulator prints:

local backend targets (<ip>:8000)
tunnel backend target (for example https://random-name.trycloudflare.com)
a prefilled remote link (for example https://felipegalind0.github.io/gdog-remote?backend=https://...)
a terminal QR code for that prefilled link

If terminal QR rendering dependency is missing, startup prints a fallback QR image URL instead.

Paste the printed tunnel URL into gdog-remote backend input when using GitHub Pages.

Notes:

Cloudflare Quick Tunnel does not require an API key/token for this flow
URL is temporary and usually changes each run

Useful options:

--remote-url <url>: override remote page URL (default is hardcoded to https://felipegalind0.github.io/gdog-remote)
--no-qr: print link only, disable terminal QR rendering
--quick-tunnel-timeout <seconds>: wait longer for tunnel URL discovery
--quick-tunnel-attempts <n>: retry quick tunnel startup before failing
--quick-tunnel-protocol auto|http2|quic: cloudflared edge transport; use http2 on restrictive networks
--quick-tunnel-edge-ip-version auto|4|6: force IP family; use 4 when guest Wi-Fi has broken IPv6

If quick tunnel URL discovery still times out in the simulator, run cloudflared manually in a second terminal:

cloudflared tunnel --url http://127.0.0.1:8000 --no-autoupdate

Then copy the https://...trycloudflare.com URL from that terminal into gdog-remote backend input.

Step Defaults

--steps provided: exact value is used
--video without --steps: 500 steps
--render without --steps: unlimited
headless without --steps: 10000 steps

Controls

Remote Control Inputs

The backend accepts JSON command payloads with:

vx: linear velocity command
omega: yaw-rate command
pitch_cmd or pitch: normalized pitch setpoint in [-1, 1]
roll_cmd or roll: normalized roll setpoint in [-1, 1]
cam_dx/dx, cam_dy/dy, cam_zoom/zoom: camera delta inputs
cmd or command: text command for suspension console parser

Keyboard Drive Override (Viewer)

When arrow keys are held in render mode, keyboard commands override remote drive commands:

Up/Down: forward/back
Left/Right: yaw

Camera Controls (Viewer)

Camera follows robot XY and yaw automatically
Mouse rotate drag controls orbit latitude
Mouse wheel:
- horizontal scroll changes orbit longitude
- vertical scroll changes orbit latitude
Shift + wheel zooms in/out
Ctrl + wheel adjusts camera target height offset

Other Viewer Hotkeys

/ or t: open command input
Enter: submit command
Esc: cancel command input
h: toggle shadows

Suspension Command Console

Supported control/tuning commands include:

help, status
respawn, reset, pid_reset
pitch PID: kp=..., ki=..., kd=... (also readable via kp?, etc.)
roll PID: rp=..., ri=..., rd=... (also readable via rp?, etc.)
mix sign toggles: p_sign=1|-1, r_sign=1|-1, p_sign=flip
suspension enable: susp=on|off
debug toggles: debug_pitch=on|off, debug_roll=on|off, debug_yaw=on|off, debug_speed=on|off

Unknown keys are reported with suggestions, and invalid values return per-key hints.

HUD/Debug Overlays

Always-on status (top-center):
- suspension ON/OFF
- current pitch and roll (degrees)
Optional debug blocks (top-right) controlled by debug flags:
- pitch PID internals
- roll PID internals
- yaw diagnostics
- speed/wheel command diagnostics

Network API

WebSocket

Endpoint: ws://<host>:8000/ws
Behavior: accepts text JSON frames and updates command state

HTTP Command Fallback

Endpoint: POST http://<host>:8000/command
Behavior: accepts JSON command payloads using the same fields as WebSocket frames
Purpose: lets control commands pass through restrictive networks that block WebSocket upgrades

Capabilities

Endpoint: GET http://<host>:8000/capabilities
Returns JSON such as {"webrtc": false}

WebRTC (Optional)

Endpoint: POST http://<host>:8000/offer
Requires aiortc installed
If aiortc is missing, endpoint returns:

{"error":"WebRTC not installed"}

Important Note

Voice command progress/result events stream over WebSocket or WebRTC data channel. If the remote is in HTTP fallback mode, basic joystick control still works, but voice command telemetry will be limited.

Running With gdog-remote

Start simulator and remote UI in separate terminals.

In this repo:

source .venv/bin/activate
python main.py --render

In sibling repo ../gdog-remote:

npm install
npm run dev

Open the Vite URL (typically http://localhost:5173)

The remote app streams controls at 50 Hz and prefers WebRTC data channel when connected; otherwise it sends over WebSocket.

Dependencies

requirements.txt: aggregate install (runtime + Genesis platform marker)
requirements-runtime.txt: FastAPI/transport + direct Python deps used by this repo
requirements-genesis.txt: Genesis package (skips Linux ARM64)
requirements-genesis-deps.txt: legacy explicit Genesis transitive set (not used by default install)
requirements-webrtc-optional.txt: optional aiortc

Troubleshooting

aiortc not installed. WebRTC disabled. Using WebSockets as primary.
- Expected unless optional WebRTC dependency is installed
Remote shows capability probe warning
- backend may be unreachable, blocked by mixed-content rules, or blocked by network policy
- /capabilities should exist; test with curl http://<host>:8000/capabilities
- if WebSocket is blocked on guest Wi-Fi, remote should automatically switch to HTTP fallback mode
Cloudflare quick tunnel exits with status_code="500 Internal Server Error" and error code: 1101
- quick tunnel API is failing upstream (not a simulator bug)
- retry after a minute or use an alternate egress network/VPN
- consider a named Cloudflare tunnel (account-backed) for higher reliability
Robot does not react to remote controls
- confirm sim is running
- confirm backend reachable at http://localhost:8000/ws
- verify remote status shows WebSocket connected
Startup/import issues
- activate .venv
- reinstall with python -m pip install -r requirements.txt
Linux ARM64 startup exits with Genesis unavailable message
- Run ./scripts/install_ubuntu_arm64.sh to compile the missing binaries.

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
bone_props.py		bone_props.py
camera_controller.py		camera_controller.py
commands.py		commands.py
config.py		config.py
gdog.urdf.jinja		gdog.urdf.jinja
gdog_v1.gif		gdog_v1.gif
gdog_v1.png		gdog_v1.png
longitudinal_control.py		longitudinal_control.py
main.py		main.py
math_utils.py		math_utils.py
network.py		network.py
pid.py		pid.py
procedural_gen.py		procedural_gen.py
requirements-genesis-deps.txt		requirements-genesis-deps.txt
requirements-genesis.txt		requirements-genesis.txt
requirements-runtime.txt		requirements-runtime.txt
requirements-webrtc-optional.txt		requirements-webrtc-optional.txt
requirements.txt		requirements.txt
robot_model.py		robot_model.py
runtime_services.py		runtime_services.py
suspension_control.py		suspension_control.py
voice_tasks.py		voice_tasks.py

Folders and files

Latest commit

History

Repository files navigation

gdog-sim

Quick Start

Download source code, install dependencies, and run the sim

What This Project Contains

Runtime Behavior

Longitudinal Control Architecture

Key Tuning Parameters

Requirements

Installation

Linux ARM64 (NVIDIA DGX Spark / Ubuntu 24)

Running

Cloudflare Quick Tunnel (No API Token)

Step Defaults

Controls

Remote Control Inputs

Keyboard Drive Override (Viewer)

Camera Controls (Viewer)

Other Viewer Hotkeys

Suspension Command Console

HUD/Debug Overlays

Network API

WebSocket

HTTP Command Fallback

Capabilities

WebRTC (Optional)

Important Note

Running With gdog-remote

Dependencies

Troubleshooting

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages