ML2SC: Deploying Machine Learning Models as Smart Contracts on the Blockchain

Read original: arXiv:2404.16967 - Published 4/29/2024 by Zhikai Li, Steve Vott, Bhaskar Krishnamachar

🐍

Overview

Importance of trusting computations in machine learning (ML) models
Blockchain technology's potential to provide this trust through transparent and tamper-proof record-keeping
Challenge: Translating ML models written in Python and PyTorch to Solidity smart contracts for deployment on Ethereum Virtual Machine (EVM)-compatible blockchains

Plain English Explanation

As artificial intelligence (AI) and machine learning (ML) become more prevalent, there is a growing need to ensure that the calculations and outputs of these systems can be trusted. Blockchain technology, with its ability to record data and computations in a transparent and tamper-proof manner, can help address this concern.

One significant hurdle in using blockchain for ML is that while ML models are often developed in Python using libraries like PyTorch, the smart contracts that run on Ethereum-based blockchains are written in a different language, Solidity. The researchers in this paper introduce a tool called "ML2SC" that can automatically translate a specific type of ML model, called a multi-layer perceptron (MLP), from PyTorch to Solidity.

The key idea is that ML2SC uses a fixed-point math library to approximate the floating-point computations performed in the original PyTorch model. This allows the Solidity-based smart contract to closely match the behavior of the original ML model. After deploying the smart contract, the researchers can train the model off-chain using PyTorch and then update the model parameters (weights and biases) on the blockchain through a function call.

When users want to use the ML model, they can call a function on the smart contract and provide the input data. The smart contract will then perform the inference computation and return the output, all in a transparent and verifiable manner on the blockchain. The researchers also analyze the gas costs associated with deploying, updating, and running the model on the blockchain, showing that the costs scale linearly with the model complexity.

Overall, this research provides a way to bring the trust and transparency of blockchain technology to ML models, which could be valuable in applications where the reliability and accountability of AI systems is critical, such as explainable Ponzi scheme detection or incentive-compatible security systems.

Technical Explanation

The researchers introduce "Machine Learning to Smart Contract" (ML2SC), a tool that can automatically translate multi-layer perceptron (MLP) models written in PyTorch to Solidity smart contracts. This addresses the challenge of deploying ML models on Ethereum Virtual Machine (EVM)-compatible blockchains, where the smart contracts are written in Solidity rather than the Python-based frameworks used to develop the models.

ML2SC uses a fixed-point math library to approximate the floating-point computations in the original PyTorch model. This allows the Solidity-based smart contract to closely match the behavior of the PyTorch model. After deploying the smart contract, the researchers can train the model off-chain using PyTorch and then update the model parameters (weights and biases) on the blockchain through a function call.

To use the ML model, users can call a function on the smart contract and provide the input data. The smart contract will then perform the inference computation and return the output, all in a transparent and verifiable manner on the blockchain.

The researchers also provide a mathematical model of the gas costs associated with deploying, updating, and running the ML model on the blockchain. They show that the gas costs scale linearly with various parameters of the MLP model, such as the number of layers and the number of neurons per layer. They present empirical results that match their theoretical analysis.

Finally, the researchers evaluate the classification accuracy of the on-chain implementation and show that the outputs are identical to the original off-chain PyTorch implementation, demonstrating the fidelity of their approach.

Critical Analysis

The researchers have addressed an important challenge in bringing the trust and transparency of blockchain technology to machine learning models. By developing a tool to translate PyTorch models to Solidity smart contracts, they have paved the way for deploying ML models on Ethereum-based blockchains.

However, the current work is limited to multi-layer perceptron (MLP) models, which are a relatively simple class of ML models. It would be valuable to extend the approach to other, more complex model architectures, such as deep neural networks or recurrent neural networks. Additionally, the researchers focus on the technical aspects of the translation and deployment process, but do not explore the potential real-world applications and use cases in depth.

Further research could also investigate the performance and scalability of the on-chain ML models, as well as explore techniques to optimize the gas costs associated with deploying and running these models on the blockchain.

Conclusion

This research introduces a novel approach to bringing the trust and transparency of blockchain technology to machine learning models. By developing a tool to translate PyTorch models to Solidity smart contracts, the researchers have opened up new possibilities for deploying ML models on Ethereum-based blockchains.

The ability to run ML models in a transparent and verifiable manner on the blockchain could be valuable in a wide range of applications, from incentive-compatible security systems to explainable Ponzi scheme detection. This research represents an important step towards enhancing the trust and accountability of AI systems, which will be crucial as they become more prevalent in our lives.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🐍

ML2SC: Deploying Machine Learning Models as Smart Contracts on the Blockchain

Zhikai Li, Steve Vott, Bhaskar Krishnamachar

With the growing concern of AI safety, there is a need to trust the computations done by machine learning (ML) models. Blockchain technology, known for recording data and running computations transparently and in a tamper-proof manner, can offer this trust. One significant challenge in deploying ML Classifiers on-chain is that while ML models are typically written in Python using an ML library such as Pytorch, smart contracts deployed on EVM-compatible blockchains are written in Solidity. We introduce Machine Learning to Smart Contract (ML2SC), a PyTorch to Solidity translator that can automatically translate multi-layer perceptron (MLP) models written in Pytorch to Solidity smart contract versions. ML2SC uses a fixed-point math library to approximate floating-point computation. After deploying the generated smart contract, we can train our models off-chain using PyTorch and then further transfer the acquired weights and biases to the smart contract using a function call. Finally, the model inference can also be done with a function call providing the input. We mathematically model the gas costs associated with deploying, updating model parameters, and running inference on these models on-chain, showing that the gas costs increase linearly in various parameters associated with an MLP. We present empirical results matching our modeling. We also evaluate the classification accuracy showing that the outputs obtained by our transparent on-chain implementation are identical to the original off-chain implementation with Pytorch.

4/29/2024

💬

Efficacy of Various Large Language Models in Generating Smart Contracts

Siddhartha Chatterjee, Bina Ramamurthy

This study analyzes the application of code-generating Large Language Models in the creation of immutable Solidity smart contracts on the Ethereum Blockchain. Other works such as Evaluating Large Language Models Trained on Code, Mark Chen et. al (2012) have previously analyzed Artificial Intelligence code generation abilities. This paper aims to expand this to a larger scope to include programs where security and efficiency are of utmost priority such as smart contracts. The hypothesis leading into the study was that LLMs in general would have difficulty in rigorously implementing security details in the code, which was shown through our results, but surprisingly generally succeeded in many common types of contracts. We also discovered a novel way of generating smart contracts through new prompting strategies.

7/17/2024

🔎

Vulnerability Detection in Smart Contracts: A Comprehensive Survey

Christopher De Baets, Basem Suleiman, Armin Chitizadeh, Imran Razzak

In the growing field of blockchain technology, smart contracts exist as transformative digital agreements that execute transactions autonomously in decentralised networks. However, these contracts face challenges in the form of security vulnerabilities, posing significant financial and operational risks. While traditional methods to detect and mitigate vulnerabilities in smart contracts are limited due to a lack of comprehensiveness and effectiveness, integrating advanced machine learning technologies presents an attractive approach to increasing effective vulnerability countermeasures. We endeavour to fill an important gap in the existing literature by conducting a rigorous systematic review, exploring the intersection between machine learning and smart contracts. Specifically, the study examines the potential of machine learning techniques to improve the detection and mitigation of vulnerabilities in smart contracts. We analysed 88 articles published between 2018 and 2023 from the following databases: IEEE, ACM, ScienceDirect, Scopus, and Google Scholar. The findings reveal that classical machine learning techniques, including KNN, RF, DT, XG-Boost, and SVM, outperform static tools in vulnerability detection. Moreover, multi-model approaches integrating deep learning and classical machine learning show significant improvements in precision and recall, while hybrid models employing various techniques achieve near-perfect performance in vulnerability detection accuracy. By integrating state-of-the-art solutions, this work synthesises current methods, thoroughly investigates research gaps, and suggests directions for future studies. The insights gathered from this study are intended to serve as a seminal reference for academics, industry experts, and bodies interested in leveraging machine learning to enhance smart contract security.

7/12/2024

Machine Learning for Blockchain Data Analysis: Progress and Opportunities

Poupak Azad, Cuneyt Gurcan Akcora, Arijit Khan

Blockchain technology has rapidly emerged to mainstream attention, while its publicly accessible, heterogeneous, massive-volume, and temporal data are reminiscent of the complex dynamics encountered during the last decade of big data. Unlike any prior data source, blockchain datasets encompass multiple layers of interactions across real-world entities, e.g., human users, autonomous programs, and smart contracts. Furthermore, blockchain's integration with cryptocurrencies has introduced financial aspects of unprecedented scale and complexity such as decentralized finance, stablecoins, non-fungible tokens, and central bank digital currencies. These unique characteristics present both opportunities and challenges for machine learning on blockchain data. On one hand, we examine the state-of-the-art solutions, applications, and future directions associated with leveraging machine learning for blockchain data analysis critical for the improvement of blockchain technology such as e-crime detection and trends prediction. On the other hand, we shed light on the pivotal role of blockchain by providing vast datasets and tools that can catalyze the growth of the evolving machine learning ecosystem. This paper serves as a comprehensive resource for researchers, practitioners, and policymakers, offering a roadmap for navigating this dynamic and transformative field.

4/30/2024