Decentralized AI: Permissionless LLM Inference on POKT Network

Read original: arXiv:2405.20450 - Published 6/3/2024 by Daniel Olshansky, Ramiro Rodriguez Colmeiro, Bowen Li

Decentralized AI: Permissionless LLM Inference on POKT Network

Overview

This paper explores a decentralized approach to running large language model (LLM) inference on the Pocket Network (POKT), a blockchain-based network for decentralized web3 services.
The key ideas include leveraging POKT's permissionless full nodes to enable anyone to run LLM inference without centralized control, and using token-based incentives to encourage participation.
The proposed system aims to enable more open and accessible AI capabilities, aligned with principles of decentralization and user sovereignty.

Plain English Explanation

The paper describes a way to run powerful AI language models in a decentralized manner, without a single authority controlling access. This is done by using the Pocket Network, a blockchain-based network that allows anyone to contribute computing power and get rewarded.

The key idea is to enable "permissionless" inference, meaning anyone can use these AI models without having to get approval from a central provider. This contrasts with the current model where major tech companies control access to the most advanced AI systems.

By running the AI on the decentralized POKT network, the researchers aim to make these powerful language capabilities more open and accessible to a wider range of users and use cases. The POKT network uses token-based incentives to encourage people to contribute computing power, creating a self-sustaining ecosystem.

This aligns with the broader vision of decentralized AI and web3, where the control and benefits of advanced technologies are more widely distributed rather than concentrated in the hands of a few large players.

Technical Explanation

The paper proposes a system for running large language model (LLM) inference on the Pocket Network (POKT), a decentralized network of full nodes that provide blockchain data access services.

The key components include:

Permissionless LLM Inference: The system allows anyone to run LLM inference without requiring approval or access control from a central provider. This is enabled by leveraging POKT's permissionless full node infrastructure.
POKT Token Incentives: The POKT token is used to incentivize participants to contribute computing resources for running LLM inference. Nodes are rewarded for processing inference requests.
Inference Delegation: Users can delegate their inference requests to POKT full nodes, which will execute the inference task and return the results. This allows users to access LLM capabilities without needing to run the models themselves.
Scalability and Reliability: By distributing inference across the POKT network, the system aims to achieve scalability and reliability, with no single point of failure.

The researchers evaluate the proposed system through simulations and discuss its potential benefits, such as increased accessibility and user sovereignty for advanced AI capabilities, aligned with the principles of decentralized AI.

Critical Analysis

The paper presents a compelling vision for democratizing access to powerful language models by leveraging decentralized infrastructure. However, some potential limitations and areas for further research are worth considering:

Performance and Latency: Running inference on a distributed network may introduce additional latency compared to centralized approaches. The authors acknowledge this challenge and suggest exploring techniques like optical networks to improve performance.
Security and Reliability: While the decentralized nature of the POKT network aims to improve resilience, there may be concerns around the overall security and reliability of the system, especially for mission-critical applications.
Incentive Alignment: Ensuring the long-term sustainability of the token-based incentive model and aligning the incentives of all participants will be crucial for the viability of the system.
Regulatory and Legal Considerations: The implications of a permissionless LLM inference system on issues like data privacy, intellectual property, and regulatory compliance will need to be carefully considered.

Conclusion

This paper presents a novel approach to running large language models in a decentralized, permissionless manner on the Pocket Network. By leveraging blockchain-based incentives and distributed infrastructure, the proposed system aims to increase the accessibility and user sovereignty of advanced AI capabilities, aligning with the principles of decentralized AI.

While the paper raises important technical and practical considerations, the underlying idea of democratizing access to powerful AI models through decentralized architectures is an intriguing direction for further research and development. As the field of AI continues to evolve, exploring innovative approaches that prioritize openness, user control, and equitable access will be crucial for ensuring the responsible and inclusive advancement of these transformative technologies.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Decentralized AI: Permissionless LLM Inference on POKT Network

Daniel Olshansky, Ramiro Rodriguez Colmeiro, Bowen Li

POKT Network's decentralized Remote Procedure Call (RPC) infrastructure, surpassing 740 billion requests since launching on MainNet in 2020, is well-positioned to extend into providing AI inference services with minimal design or implementation modifications. This litepaper illustrates how the network's open-source and permissionless design aligns incentives among model researchers, hardware operators, API providers and users whom we term model Sources, Suppliers, Gateways and Applications respectively. Through its Relay Mining algorithm, POKT creates a transparent marketplace where costs and earnings directly reflect cryptographically verified usage. This decentralized framework offers large model AI researchers a new avenue to disseminate their work and generate revenue without the complexities of maintaining infrastructure or building end-user products. Supply scales naturally with demand, as evidenced in recent years and the protocol's free market dynamics. POKT Gateways facilitate network growth, evolution, adoption, and quality by acting as application-facing load balancers, providing value-added features without managing LLM nodes directly. This vertically decoupled network, battle tested over several years, is set up to accelerate the adoption, operation, innovation and financialization of open-source models. It is the first mature permissionless network whose quality of service competes with centralized entities set up to provide application grade inference.

6/3/2024

Proof of Quality: A Costless Paradigm for Trustless Generative AI Model Inference on Blockchains

Zhenjie Zhang, Yuyang Rao, Hao Xiao, Xiaokui Xiao, Yin Yang

Generative AI models, such as GPT-4 and Stable Diffusion, have demonstrated powerful and disruptive capabilities in natural language and image tasks. However, deploying these models in decentralized environments remains challenging. Unlike traditional centralized deployment, systematically guaranteeing the integrity of AI model services in fully decentralized environments, particularly on trustless blockchains, is both crucial and difficult. In this paper, we present a new inference paradigm called emph{proof of quality} (PoQ) to enable the deployment of arbitrarily large generative models on blockchain architecture. Unlike traditional approaches based on validating inference procedures, such as ZKML or OPML, our PoQ paradigm focuses on the outcome quality of model inference. Using lightweight BERT-based cross-encoders as our underlying quality evaluation model, we design and implement PQML, the first practical protocol for real-world NLP generative model inference on blockchains, tailored for popular open-source models such as Llama 3 and Mixtral. Our analysis demonstrates that our protocol is robust against adversarial but rational participants in ecosystems, where lazy or dishonest behavior results in fewer benefits compared to well-behaving participants. The computational overhead of validating the quality evaluation is minimal, allowing quality validators to complete the quality check within a second, even using only a CPU. Preliminary simulation results show that PoQ consensus is generated in milliseconds, 1,000 times faster than any existing scheme.

5/31/2024

🖼️

LooPIN: A PinFi protocol for decentralized computing

Yunwei Mao, Qi He, Ju Li

Networked computing power is a critical utility in the era of artificial intelligence. This paper presents a novel Physical Infrastructure Finance (PinFi) protocol designed to facilitate the distribution of computing power within networks in a decentralized manner. Addressing the core challenges of coordination, pricing, and liquidity in decentralized physical infrastructure networks (DePIN), the PinFi protocol introduces a distinctive dynamic pricing mechanism. It enables providers to allocate excess computing resources to a dissipative PinFi liquidity pool, distinct from traditional DeFi liquidity pools, ensuring seamless access for clients at equitable, market-based prices. This approach significantly reduces the costs of accessing computing power, potentially to as low as 1% compared to existing services, while simultaneously enhancing security and dependability. The PinFi protocol is poised to transform the dynamics of supply and demand in computing power networks, setting a new standard for efficiency and accessibility.

6/17/2024

Proof-of-Learning with Incentive Security

Zishuo Zhao, Zhixuan Fang, Xuechao Wang, Xi Chen, Yuan Zhou

Most concurrent blockchain systems rely heavily on the Proof-of-Work (PoW) or Proof-of-Stake (PoS) mechanisms for decentralized consensus and security assurance. However, the substantial energy expenditure stemming from computationally intensive yet meaningless tasks has raised considerable concerns surrounding traditional PoW approaches, The PoS mechanism, while free of energy consumption, is subject to security and economic issues. Addressing these issues, the paradigm of Proof-of-Useful-Work (PoUW) seeks to employ challenges of practical significance as PoW, thereby imbuing energy consumption with tangible value. While previous efforts in Proof of Learning (PoL) explored the utilization of deep learning model training SGD tasks as PoUW challenges, recent research has revealed its vulnerabilities to adversarial attacks and the theoretical hardness in crafting a byzantine-secure PoL mechanism. In this paper, we introduce the concept of incentive-security that incentivizes rational provers to behave honestly for their best interest, bypassing the existing hardness to design a PoL mechanism with computational efficiency, a provable incentive-security guarantee and controllable difficulty. Particularly, our work is secure against two attacks to the recent work of Jia et al. [2021], and also improves the computational overhead from $Theta(1)$ to $O(frac{log E}{E})$. Furthermore, while most recent research assumes trusted problem providers and verifiers, our design also guarantees frontend incentive-security even when problem providers are untrusted, and verifier incentive-security that bypasses the Verifier's Dilemma. By incorporating ML training into blockchain consensus mechanisms with provable guarantees, our research not only proposes an eco-friendly solution to blockchain systems, but also provides a proposal for a completely decentralized computing power market in the new AI age.

6/6/2024