Skip to content

Decentralized Physical AI

Nepher Robotics runs tournaments that produce real, deployable robotics AI — trained policies, complete Isaac Lab External Projects, and standardized OpenUSD environments. Those tournaments are validated through Decentralized Physical AI: a transparent, stake-weighted evaluation network where independent validators score every submission in standardized Isaac Lab environments.

The infrastructure runs on Bittensor Subnet 49, but the product is not the blockchain — it is the AI assets the tournaments create.

What Tournaments Produce

Each completed tournament ships three concrete assets the robotics community can clone, reproduce, and deploy:

Trained policy

Every submission centers on a trained control policy — the learned weights that map observations to robot actions. Winning packages include:

  • best_policy/best_policy.pt — the required checkpoint validators load during evaluation
  • Training metadataparams/agent.yaml and params/env.yaml snapshots so you know exactly which hyperparameters and environment config produced the weights
  • Export-ready formats — top packages may also ship JIT or ONNX exports for faster sim playback or downstream hardware deployment

These are not demo weights. They are policies that survived decentralized evaluation against hidden benchmark scenes — ready to load in Isaac Lab with play.py or integrate into your own deployment stack.

Isaac Lab External Project

Each tournament publishes a full Isaac Lab External Project — the standard NVIDIA pattern for custom robot learning tasks. One repo contains everything needed to retrain, evaluate, and run inference:

ComponentWhat it includes
Task modulesource/<module>/ — robot definitions, MDP rewards, observation/action spaces, and registered Gym environments
Training codescripts/rsl_rl/train.py — RSL-RL training entry point with tournament-specific env IDs (e.g. Nepher-Spot-WaypointNav-Envhub-v0)
Inference codescripts/rsl_rl/play.py — load best_policy.pt and run the policy in simulation with the same env config validators use
Project scaffoldingpyproject.toml, setup.py, and scripts/list_envs.py for install verification

Reference example: task-spot-waypointnav — a hierarchical Spot navigation task with trainable high-level nav policy and frozen low-level locomotion. When a new tournament opens, its task repo follows this same External Project layout so miners and validators share one reproducible codebase.

Environments (public and benchmark)

Tournaments rely on EnvHub OpenUSD environment bundles so every participant trains and evaluates against the same geometry. Two tiers keep competition fair while preventing overfitting:

TierPurposeAvailability
Public environmentsTraining and self-evaluation — diverse scenes miners use to build generalizable policies (e.g. waypoint-sample-v1)Download anytime via npcli envhub download
Benchmark environmentsOfficial evaluation — hidden scenes validators activate only during the evaluation period (e.g. waypoint-benchmark-v1)Validators only; miners never see them before submission

Public scenes let you iterate locally with the EnvHub-integrated task variant (Nepher-*-Envhub-v0). Benchmark scenes are the ground truth: if your policy ranks #1, it generalized to environments it was never trained on. All bundles are OpenUSD-compliant and import directly into Isaac Sim 5.1+ / Isaac Lab 2.3+.

Winning and top-performing packages — policy, External Project, and linked environment configs — are published to SimStore as open-source tiers, accelerating community learning and enterprise PoCs.

Focus on outcomes

If you are a robotics engineer, start with the Tournament Miner Guide and Tournaments overview. You do not need deep Web3 knowledge to compete — rewards are paid in TAO (not subnet alpha tokens), and the CLI handles submission signing.

Why Decentralized Validation?

Robotics benchmarks fail when evaluation is opaque. A centralized leaderboard can hide dataset leakage, inconsistent environments, or unreproducible scoring.

Decentralized Physical AI solves this by design:

  • Multiple independent validators run the same public eval-nav code on each submission in headless Isaac Sim
  • Stake-weighted score aggregationaggregated_score = Σ(score_i × stake_i) / Σ(stake_i) — makes collusion expensive
  • On-chain eligibility snapshots lock who can be scored before evaluation begins (Subnet 49 registration + agent submission required)
  • Hidden benchmarks — EnvHub evaluation scenes like waypoint-benchmark-v1 are only activated for validators during the evaluation period
  • Open-source evaluators let miners self-test with identical configs before submitting

The result: when a policy ranks #1, the community can clone the repo, download the EnvHub scenes, and reproduce the score.

How It Works (High Level)

 Engineers                    Validators                  Output
 ─────────                    ──────────                  ──────

 Train policy  ──submit──►   Download agent ZIP
 in Isaac Lab                 Install task module
       │                      npcli envhub download …
       │                      Run eval-nav (headless)
       ▼                      POST score → tournament-api.nepher.ai
 Tournament task repo                  │
 (USD + Isaac Lab)            Stake-weighted aggregation


                              Winner → TAO payout
                              Assets → SimStore (open tier)

Tournament phases (contest → submit → evaluation → review → reward) are documented in the Incentive Mechanism guide.

Bittensor Subnet 49 — Infrastructure, Not the Story

Bittensor provides the decentralized coordination layer:

  • Subnet 49 hosts Nepher Robotics validators who evaluate agents and set on-chain weights during reward periods
  • Wallet registration ties submissions to cryptographic identity (btcli subnet register --netuid 49)
  • Stake-weighted averaging aggregates validator scores; ties break on earliest submission time
  • Emission schedule: outside reward periods, all weight goes to UID 0 (burn)

Validators and miners interact with the subnet through nepher-subnet (nepher-miner, nepher-validator) or nepher-cli (npcli tournament submit). Most day-to-day work — training, EnvHub downloads, self-evaluation — happens entirely in Isaac Lab.

For economic details (TAO payouts, emission burning, alpha token model), see Subnet Alphanomics.

Comparison: Hackathon vs Tournament

Both live on the Nepher platform but serve different goals:

HackathonTournament
PurposeInnovation — explore new task genresOptimization — best policy for a fixed benchmark
ScopeBroad, open-ended challengesSpecific robot + environment + metrics
ValidationCommunity voting + stake weightsDecentralized eval-nav evaluation
OutputIdeas, PoCs, new genresProduction policies, code, and assets
Portalhackathon.nepher.aitournament.nepher.ai

Get Involved

RoleGuide
Compete (train & submit policies)Miner Guide
Validate (run evaluations, set weights)Validator Guide
Browse assetsSimStore · EnvHub
CLINepher CLI

External Resources

Released under the MIT License.