Cluster Protocol x Pundi AI: The First End-to-End Decentralized AI Training Stack
Apr 22, 2026

Cluster Protocol is thrilled to announce a strategic partnership with Pundi AI, the decentralized AI data economy turning human-annotated datasets into tokenized, community-owned assets.
Together, we're closing the last gap in decentralized AI: fueling permissionless compute with permissionless, high-quality data, end to end, without centralized middlemen in the loop.
Introduction to the Partnership
Every serious AI system rests on two pillars: compute and data. The decentralized AI movement has spent the last two years wiring up the compute side, with networks like Cluster Protocol unlocking distributed GPU capacity for builders worldwide. But data, the actual fuel, has quietly remained centralized: scraped, opaque, and controlled by a handful of aggregators who extract nearly all of the value while contributors receive nothing.
Pundi AI has built what the ecosystem has been missing: a full onchain data economy where human-annotated, provenance-tracked datasets become liquid, tradable, and composable assets. With more than 122,000 datasets, 28,000+ contributors, 4.5 trillion+ Dataset Tokens (DTOKs) minted, and over a petabyte of IPFS-backed training data, Pundi AI is the most battle-tested data layer in Web3 AI today. It is also a member of the NVIDIA Inception program, validating its enterprise-grade technical foundations.
This partnership sets the stage for bringing Pundi AI's data economy and Cluster Protocol's compute network into one coherent loop. The ambition is simple: a world where developers can source human-annotated data, train on decentralized GPU compute, and deploy the resulting AI or agent without ever having to touch AWS, GCP, or an opaque data broker. The other half of decentralized AI is finally within reach.

Pillar 1: An End-to-End Decentralized AI Training Stack
The combination is straightforward in concept: human-annotated data from Pundi AI feeding into distributed compute on Cluster Protocol. Training workflows that would otherwise depend on centralized cloud providers and opaque data sources gain an open, onchain alternative, with value flowing back to everyone who contributed along the way, whether they supplied the dataset or the GPUs.
No centralized cloud in the loop. No opaque provenance. The exact mechanics of how dataset access and training execution connect are something both teams will shape together as the integration matures.
Pillar 2: Bringing Tokenized Data into Cluster's Ecosystem
Pundi AI has turned datasets into a tradable, community-backed asset class through its Data Pump and Dataset Tokens (DTOKs). Making that inventory discoverable and usable within Cluster Protocol's open marketplace for GPUs, data, and agents is a natural fit for both sides.
For Pundi AI, it expands DTOK utility into a new builder audience actively searching for training inventory, deepening the token's role beyond speculation. For Cluster Protocol, it adds real, high-quality, provenance-verified datasets to the marketplace that developers on CodeXero can tap into when they go from prompt to protocol. The exact entitlement and distribution model is something we'll design together, with Pundi AI's existing creator-royalty structure ensuring dataset authors keep earning as their work gets used.
Pillar 3: Better Data for Agents Built on Cluster
As agentic AI becomes a bigger part of what builders ship on Cluster Protocol and CodeXero, access to Pundi AI's data layer (along with the quality signals that come with tools like its per-dataset Integrated AI Agent) offers a meaningful edge. For agentic AI, data quality is not a nice-to-have. It is the ceiling on what any agent can become. This collaboration raises it.
About Pundi AI
Pundi AI is a decentralized AI data ecosystem building the infrastructure for an open, community-owned data economy. Through its Data Pump, AI Data Marketplace, and PURSE+ browser plugin, Pundi AI enables anyone to contribute, annotate, tokenize, and trade AI training data, with provenance tracked on the Pundi AIFX Omnilayer and storage backed by IPFS.
The native $PUNDIAI token powers governance, rewards, and transactions across the ecosystem. Pundi AI is a member of the NVIDIA Inception program and is on track to migrate its data economy to Ethereum in 2026, unlocking deeper liquidity and composability for DTOKs across the broader Web3 ecosystem.
Pundi AI's mission is to create one million jobs in AI by ensuring that the people who produce and label training data share in the value it creates.
About Cluster Protocol
Cluster is the decentralized AI infrastructure powering the Liberation Engine for Internet Capital Markets.
It provides verifiable compute, privacy-preserving execution, and modular AI orchestration, enabling developers to turn natural-language prompts into production-grade, tokenized dApps via CodeXero, Cluster's AI-native IDE.
