Training is the hardest problem in decentralized AI — it requires coordinating gradient updates across unreliable nodes with heterogeneous hardware over unpredictable networks. Four projects attack it from different angles, and two have already proven it works at frontier scale.
TrainingMainnet (Delphi)
Verifiable distributed ML training
Aggregates idle GPUs worldwide into a single training network with cryptographic proof the work was done correctly. The core innovation is Verde: Probabilistic Proof-of-Learning using RepOps — bitwise-deterministic ML primitives that guarantee identical results across heterogeneous hardware. Cheating solvers get slashed. Delphi mainnet (an AI-settled prediction market) launched April 2026; the $AI token TGE followed on Binance Alpha, KuCoin, and Coinbase.
$43MSeries A, a16z crypto led
10B$AI total supply (Gensyn L2)
Canonical POVInference verification is easy (re-run and compare). Training verification is an unsolved problem because training is non-deterministic across hardware. If Gensyn cracks this at scale, it becomes the trust layer for every distributed training network.
Team: Founded 2020 at Entrepreneur First. Ben Fielding (CEO), Harry Grieve (CTO). Backed by a16z crypto, Galaxy Digital, CoinFund.
Visit Gensyn →
TrainingResearch / Production
Decentralized frontier model training
Builds the tooling for globally distributed, asynchronous training across heterogeneous, permissionless GPUs on regular internet. OpenDiLoCo cuts inter-node communication; PRIME-RL handles async reinforcement learning; TOPLOC verifies inference from untrusted workers. INTELLECT-2 (May 2025) trained a 32B reasoning model via fully async RL across 100+ GPUs on three continents — improving on QwQ-32B — with all code, data, and weights open-sourced.
32BINTELLECT-2 reasoning model
$70.4Mtotal raised
Canonical POVArguably the most technically important project in decentralized AI. Unlike most projects here, it is not a DePIN with a token — it is a research lab producing open-source infrastructure. The INTELLECT series is doing for decentralized training what Bitcoin's whitepaper did for decentralized money: proving the concept works at meaningful scale.
Team: Vincent Weisser (CEO), Johannes Hagemann (CTO, ex-Aleph Alpha). Backed by Founders Fund, CoinFund, Distributed Global.
Visit Prime Intellect →
TrainingResearch
Protocol Learning — unextractable models
Developing Protocol Learning: decentralized, communication-efficient, model-parallel training where no single party ever possesses the full weights. The model "lives" in the protocol, split across geographically distributed nodes. A compression method achieves 95%+ compression with no convergence loss on standard 300 Mbps internet. The model is trainable and usable but unextractable — contributors get paid because no one can copy it and walk away. Node0 (Feb 2026) trained an 8B LLaMA on par with centralized training across four locations.
$7.6Mseed, USV + CoinFund co-led
9Amazon ML PhDs
Canonical POVThe highest-risk, highest-upside project on this entire map. Unextractability creates a digital commons that is usable but not appropriable. If Protocol Learning scales, it breaks the foundation-model oligopoly at the root. No token yet, pure research — the kind of team VCs should track closely even if production is 18-24 months out.
Team: Alexander Long (PhD CS, UNSW, ex-Amazon). Angels include Balaji Srinivasan and Clem Delangue.
Visit Pluralis →
Training + AgentsProduction
The open-source AI lab with a blockchain
Builds frontier open models (Hermes), a decentralized training network (Psyche on Solana), a distributed optimizer (DisTrO), and 2026's fastest-growing agent framework (Hermes Agent). DisTrO reduces inter-GPU communication bandwidth by up to 10,000x, making distributed pre-training practical on consumer internet. Hermes 4.3 was the first model trained end-to-end on Psyche, nearly matching 70B-class performance at half the parameter cost.
~105KHermes Agent GitHub stars in 10 weeks
$50MSeries A (Paradigm), $1B token valuation
Canonical POVThe strongest proof that decentralized training can produce frontier-quality models. What makes Nous singular is the full-stack position: the most popular open-weight models + a working training network + the fastest-growing agent framework + a $1B Paradigm mark. DisTrO's 10,000x bandwidth reduction is a genuine breakthrough on the bottleneck every training project faces.
Team: Jeffrey Quesnelle (CEO, YaRN author), Teknium, Karan Malhotra. Includes Diederik Kingma, co-inventor of Adam. $65M total.
Visit Nous Research →
Canonical POV
Training is the layer where the thesis is most decisively proven and least commoditized. Prime Intellect and Nous have shipped competitive models from permissionless swarms; Gensyn and Pluralis are solving the trust and extractability problems that make those swarms durable. This is research-grade computer science, not product wrappers — underwrite the teams accordingly.