Decentralized Physical AI

Nepher Robotics runs tournaments that produce real, deployable robotics AI — trained policies, complete Isaac Lab External Projects, and standardized OpenUSD environments. Those tournaments are validated through Decentralized Physical AI: a transparent, stake-weighted evaluation network where independent validators score every submission in standardized Isaac Lab environments.

The infrastructure runs on Bittensor Subnet 49, but the product is not the blockchain — it is the AI assets the tournaments create.

What Tournaments Produce

Each completed tournament ships three concrete assets the robotics community can clone, reproduce, and deploy:

Trained policy

Every submission centers on a trained control policy — the learned weights that map observations to robot actions. Winning packages include:

best_policy/best_policy.pt — the required checkpoint validators load during evaluation
Training metadata — params/agent.yaml and params/env.yaml snapshots so you know exactly which hyperparameters and environment config produced the weights
Export-ready formats — top packages may also ship JIT or ONNX exports for faster sim playback or downstream hardware deployment

These are not demo weights. They are policies that survived decentralized evaluation against hidden benchmark scenes — ready to load in Isaac Lab with play.py or integrate into your own deployment stack.

Isaac Lab External Project

Each tournament publishes a full Isaac Lab External Project — the standard NVIDIA pattern for custom robot learning tasks. One repo contains everything needed to retrain, evaluate, and run inference:

Component	What it includes
Task module	`source/<module>/` — robot definitions, MDP rewards, observation/action spaces, and registered Gym environments
Training code	`scripts/rsl_rl/train.py` — RSL-RL training entry point with tournament-specific env IDs (e.g. `Nepher-Spot-WaypointNav-Envhub-v0`)
Inference code	`scripts/rsl_rl/play.py` — load `best_policy.pt` and run the policy in simulation with the same env config validators use
Project scaffolding	`pyproject.toml`, `setup.py`, and `scripts/list_envs.py` for install verification

Reference example: task-spot-waypointnav — a hierarchical Spot navigation task with trainable high-level nav policy and frozen low-level locomotion. When a new tournament opens, its task repo follows this same External Project layout so miners and validators share one reproducible codebase.

Environments (public and benchmark)

Tournaments rely on EnvHub OpenUSD environment bundles so every participant trains and evaluates against the same geometry. Two tiers keep competition fair while preventing overfitting:

Tier	Purpose	Availability
Public environments	Training and self-evaluation — diverse scenes miners use to build generalizable policies (e.g. `waypoint-sample-v1`)	Download anytime via `npcli envhub download`
Benchmark environments	Official evaluation — hidden scenes validators activate only during the evaluation period (e.g. `waypoint-benchmark-v1`)	Validators only; miners never see them before submission

Public scenes let you iterate locally with the EnvHub-integrated task variant (Nepher-*-Envhub-v0). Benchmark scenes are the ground truth: if your policy ranks #1, it generalized to environments it was never trained on. All bundles are OpenUSD-compliant and import directly into Isaac Sim 5.1+ / Isaac Lab 2.3+.

Winning and top-performing packages — policy, External Project, and linked environment configs — are published to SimStore as open-source tiers, accelerating community learning and enterprise PoCs.

Focus on outcomes

If you are a robotics engineer, start with the Tournament Miner Guide and Tournaments overview. You do not need deep Web3 knowledge to compete — rewards are paid in TAO (not subnet alpha tokens), and the CLI handles submission signing.

Why Decentralized Validation?

Robotics benchmarks fail when evaluation is opaque. A centralized leaderboard can hide dataset leakage, inconsistent environments, or unreproducible scoring.

Decentralized Physical AI solves this by design:

Multiple independent validators run the same public eval-nav code on each submission in headless Isaac Sim
Stake-weighted score aggregation — aggregated_score = Σ(score_i × stake_i) / Σ(stake_i) — makes collusion expensive
On-chain eligibility snapshots lock who can be scored before evaluation begins (Subnet 49 registration + agent submission required)
Hidden benchmarks — EnvHub evaluation scenes like waypoint-benchmark-v1 are only activated for validators during the evaluation period
Open-source evaluators let miners self-test with identical configs before submitting

The result: when a policy ranks #1, the community can clone the repo, download the EnvHub scenes, and reproduce the score.

How It Works (High Level)

 Engineers                    Validators                  Output
 ─────────                    ──────────                  ──────

 Train policy  ──submit──►   Download agent ZIP
 in Isaac Lab                 Install task module
       │                      npcli envhub download …
       │                      Run eval-nav (headless)
       ▼                      POST score → tournament-api.nepher.ai
 Tournament task repo                  │
 (USD + Isaac Lab)            Stake-weighted aggregation
                                       │
                                       ▼
                              Winner → TAO payout
                              Assets → SimStore (open tier)

Tournament phases (contest → submit → evaluation → review → reward) are documented in the Incentive Mechanism guide.

Bittensor Subnet 49 — Infrastructure, Not the Story

Bittensor provides the decentralized coordination layer:

Subnet 49 hosts Nepher Robotics validators who evaluate agents and set on-chain weights during reward periods
Wallet registration ties submissions to cryptographic identity (btcli subnet register --netuid 49)
Stake-weighted averaging aggregates validator scores; ties break on earliest submission time
Emission schedule: outside reward periods, all weight goes to UID 0 (burn)

Validators and miners interact with the subnet through nepher-subnet (nepher-miner, nepher-validator) or nepher-cli (npcli tournament submit). Most day-to-day work — training, EnvHub downloads, self-evaluation — happens entirely in Isaac Lab.

For economic details (TAO payouts, emission burning, alpha token model), see Subnet Alphanomics.

Comparison: Hackathon vs Tournament

Both live on the Nepher platform but serve different goals:

	Hackathon	Tournament
Purpose	Innovation — explore new task genres	Optimization — best policy for a fixed benchmark
Scope	Broad, open-ended challenges	Specific robot + environment + metrics
Validation	Community voting + stake weights	Decentralized `eval-nav` evaluation
Output	Ideas, PoCs, new genres	Production policies, code, and assets
Portal	hackathon.nepher.ai	tournament.nepher.ai

Get Involved

Role	Guide
Compete (train & submit policies)	Miner Guide
Validate (run evaluations, set weights)	Validator Guide
Browse assets	SimStore · EnvHub
CLI	Nepher CLI

External Resources

Bittensor documentation: docs.bittensor.com
Bittensor GitHub: github.com/opentensor/bittensor
Nepher subnet repo: github.com/nepher-ai/nepher-subnet
Public evaluation: github.com/nepher-ai/eval-nav
Tournament API: tournament-api.nepher.ai

Decentralized Physical AI ​

What Tournaments Produce ​

Trained policy ​

Isaac Lab External Project ​

Environments (public and benchmark) ​

Why Decentralized Validation? ​

How It Works (High Level) ​

Bittensor Subnet 49 — Infrastructure, Not the Story ​

Comparison: Hackathon vs Tournament ​

Get Involved ​

External Resources ​