Training Compute Thresholds: Features and Functions in AI Governance

Read original: arXiv:2405.10799 - Published 8/7/2024 by Lennart Heim, Leonie Koessler

🏋️

Overview

The paper examines the use of training compute thresholds as a tool for governing artificial intelligence (AI) systems.
The authors argue that compute thresholds can serve as a valuable trigger for further evaluation of AI models, rather than being the sole determinant of regulation.
The paper outlines several key advantages of using compute thresholds in AI governance frameworks.

Plain English Explanation

The research paper looks at using the amount of computing power needed to train an AI system as a way to help manage and oversee these advanced technologies. The authors suggest that rather than relying solely on compute thresholds to regulate AI, they can be used as an initial filter to identify models that may be higher-risk and require additional scrutiny.

Some of the key benefits of using compute thresholds that the paper highlights include:

Correlation with model capabilities and risks - The amount of compute needed to train an AI system can indicate how powerful and potentially risky the model may be.
Quantifiability and ease of measurement - Compute can be easily measured and compared across different AI systems.
Robustness to circumvention - It's difficult for AI developers to understate the compute used in training to avoid thresholds.
Knowability before deployment - Compute thresholds can be determined before an AI model is even built.
Potential for external verification - Compute usage can be audited by third parties.
Targeted scope - Thresholds can focus regulation on the most capable and potentially harmful AI systems.

By using compute thresholds alongside other sector-specific regulations and broader governance measures, the authors argue this can provide a practical starting point for identifying and further evaluating high-risk AI models.

Technical Explanation

The paper makes the case that training compute thresholds can serve as a valuable tool within broader AI governance frameworks. The authors note that compute thresholds are correlated with an AI model's capabilities and associated risks, making them a quantifiable and easily measured indicator.

Importantly, compute thresholds are resistant to circumvention, as it is difficult for AI developers to understate the true compute used in training. Additionally, compute requirements can be known before a model is even developed and deployed, enabling proactive regulation. The authors also highlight that compute usage can be externally verified by third parties.

Crucially, the targeted scope of compute thresholds allows regulators to focus their attention on the most capable and potentially harmful AI systems, rather than applying a one-size-fits-all approach. The authors propose that compute thresholds should be used as an initial filter, alongside other sector-specific regulations and broader governance measures, to identify models that may require further evaluation.

Critical Analysis

The paper makes a compelling case for the use of compute thresholds in AI governance, highlighting several practical advantages over other potential regulatory approaches. However, the authors acknowledge that compute thresholds should not be the sole determinant of regulation, and that they should be used in conjunction with other measures.

One potential limitation is that compute thresholds may not capture all relevant risks associated with AI systems. For example, certain AI models may exhibit undesirable behaviors or biases that are not directly correlated with their training compute. Additionally, the rapidly evolving nature of AI technology may require ongoing reassessment of appropriate compute thresholds over time.

Further research could explore ways to reduce barriers to entry for foundation model training, which could potentially undermine the efficacy of compute thresholds. Ongoing monitoring and adaptation of governance frameworks will be crucial to ensure they remain effective in the face of rapid technological change.

Conclusion

The paper presents a compelling case for the use of training compute thresholds as part of a broader AI governance framework. While compute thresholds alone are not a silver bullet, they offer several practical advantages as an initial filter for identifying potentially high-risk AI systems.

By leveraging compute thresholds alongside other sector-specific regulations and broader governance measures, policymakers and regulators can take a targeted approach to managing the risks associated with advanced AI technologies. As the field of AI continues to evolve rapidly, ongoing evaluation and adaptation of governance frameworks will be essential to ensure they remain effective in protecting the public interest.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🏋️

Training Compute Thresholds: Features and Functions in AI Governance

Lennart Heim, Leonie Koessler

Regulators in the US and EU are using thresholds based on training compute--the number of computational operations used in training--to identify general-purpose artificial intelligence (GPAI) models that may pose risks of large-scale societal harm. We argue that training compute currently is the most suitable metric to identify GPAI models that deserve regulatory oversight and further scrutiny. Training compute correlates with model capabilities and risks, is quantifiable, can be measured early in the AI lifecycle, and can be verified by external actors, among other advantageous features. These features make compute thresholds considerably more suitable than other proposed metrics to serve as an initial filter to trigger additional regulatory requirements and scrutiny. However, training compute is an imperfect proxy for risk. As such, compute thresholds should not be used in isolation to determine appropriate mitigation measures. Instead, they should be used to detect potentially risky GPAI models that warrant regulatory oversight, such as through notification requirements, and further scrutiny, such as via model evaluations and risk assessments, the results of which may inform which mitigation measures are appropriate. In fact, this appears largely consistent with how compute thresholds are used today. As GPAI technology and market structures evolve, regulators should update compute thresholds and complement them with other metrics into regulatory review processes.

8/7/2024

On the Limitations of Compute Thresholds as a Governance Strategy

Sara Hooker

At face value, this essay is about understanding a fairly esoteric governance tool called compute thresholds. However, in order to grapple with whether these thresholds will achieve anything, we must first understand how they came to be. To do so, we need to engage with a decades-old debate at the heart of computer science progress, namely, is bigger always better? Does a certain inflection point of compute result in changes to the risk profile of a model? Hence, this essay may be of interest not only to policymakers and the wider public but also to computer scientists interested in understanding the role of compute in unlocking breakthroughs. This discussion is timely given the wide adoption of compute thresholds in both the White House Executive Orders on AI Safety (EO) and the EU AI Act to identify more risky systems. A key conclusion of this essay is that compute thresholds, as currently implemented, are shortsighted and likely to fail to mitigate risk. The relationship between compute and risk is highly uncertain and rapidly changing. Relying upon compute thresholds overestimates our ability to predict what abilities emerge at different scales. This essay ends with recommendations for a better way forward.

7/31/2024

🤖

Risk thresholds for frontier AI

Leonie Koessler, Jonas Schuett, Markus Anderljung

Frontier artificial intelligence (AI) systems could pose increasing risks to public safety and security. But what level of risk is acceptable? One increasingly popular approach is to define capability thresholds, which describe AI capabilities beyond which an AI system is deemed to pose too much risk. A more direct approach is to define risk thresholds that simply state how much risk would be too much. For instance, they might state that the likelihood of cybercriminals using an AI system to cause X amount of economic damage must not increase by more than Y percentage points. The main upside of risk thresholds is that they are more principled than capability thresholds, but the main downside is that they are more difficult to evaluate reliably. For this reason, we currently recommend that companies (1) define risk thresholds to provide a principled foundation for their decision-making, (2) use these risk thresholds to help set capability thresholds, and then (3) primarily rely on capability thresholds to make their decisions. Regulators should also explore the area because, ultimately, they are the most legitimate actors to define risk thresholds. If AI risk estimates become more reliable, risk thresholds should arguably play an increasingly direct role in decision-making.

6/24/2024

Position Paper: Technical Research and Talent is Needed for Effective AI Governance

Anka Reuel, Lisa Soder, Ben Bucknall, Trond Arne Undheim

In light of recent advancements in AI capabilities and the increasingly widespread integration of AI systems into society, governments worldwide are actively seeking to mitigate the potential harms and risks associated with these technologies through regulation and other governance tools. However, there exist significant gaps between governance aspirations and the current state of the technical tooling necessary for their realisation. In this position paper, we survey policy documents published by public-sector institutions in the EU, US, and China to highlight specific areas of disconnect between the technical requirements necessary for enacting proposed policy actions, and the current technical state of the art. Our analysis motivates a call for tighter integration of the AI/ML research community within AI governance in order to i) catalyse technical research aimed at bridging the gap between current and supposed technical underpinnings of regulatory action, as well as ii) increase the level of technical expertise within governing institutions so as to inform and guide effective governance of AI.

6/12/2024