OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code

Read original: arXiv:2405.15568 - Published 5/27/2024 by Maxence Faldor, Jenny Zhang, Antoine Cully, Jeff Clune

OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code

Overview

This paper proposes a novel approach to open-endedness in artificial intelligence systems, using models of human notions of interestingness and environments programmed in code.
The key idea is to move beyond traditional reward-based reinforcement learning towards more open-ended and intrinsically motivated exploration.
The authors introduce a framework called "\ours" that aims to capture aspects of human curiosity and interestingness in a computational model.

Plain English Explanation

The paper explores a new way to create AI systems that can explore and discover things on their own, rather than simply trying to maximize a pre-defined reward signal. The researchers argue that true open-ended learning and discovery requires going beyond traditional reinforcement learning approaches.

Their proposed framework, called "\ours", is designed to model aspects of human curiosity and what we find interesting. Instead of just optimizing for a specific reward, the AI agent in this system tries to find things that are "interesting" based on an internal model of interestingness. This could lead to more open-ended and creative exploration, as the agent seeks out novel and surprising situations, rather than just trying to maximize a single objective.

The goal is to move towards AI systems that can engage in more open-ended, intrinsically motivated learning and discovery, similar to how humans explore the world around us. This could unlock new frontiers in AI capabilities and help us better understand the foundations of human-like intelligence.

Technical Explanation

The key innovation in this paper is the "\ours" framework, which aims to capture models of human notions of interestingness within an AI agent's decision-making process. Rather than simply optimizing for externally defined rewards, the agent tries to find situations that are "interesting" based on an internal model of interestingness.

To implement this, the authors propose programming environments in code that can be freely explored by the agent. The agent then uses a learned model of interestingness to guide its exploration, seeking out novel and surprising situations. This stands in contrast to typical reward-based reinforcement learning, where the agent's behavior is driven by maximizing a pre-defined reward signal.

The paper presents several experiments demonstrating the capabilities of the "\ours" framework. For example, in one environment the agent discovers a wide variety of interesting behaviors, from simple movement patterns to more complex strategies, without any external rewards. The authors argue this shows the potential of their approach to enable more open-ended, intrinsically motivated exploration and learning.

Critical Analysis

The "\ours" framework represents a promising step towards more open-ended AI systems, but there are some important caveats and areas for further research:

The notion of "interestingness" is inherently subjective and can be challenging to define and model computationally. The authors acknowledge this as a key challenge, and more work is needed to refine and validate their interestingness models.
The experiments in the paper are still relatively simple, and it's unclear how well the approach would scale to more complex, real-world environments. Significant additional research would be needed to demonstrate the viability of this approach for practical applications.
The paper does not address how the "\ours" framework could be integrated with other important AI capabilities, such as robust learning, planning, and safety considerations. Combining open-ended exploration with these other critical aspects of intelligence remains an open challenge.

Despite these limitations, the core idea of moving beyond reward-based reinforcement learning towards more intrinsically motivated, open-ended exploration is an important and compelling direction for the field of AI. Further research building on this work could yield significant advances in our understanding of intelligence and how to develop more capable and flexible AI systems.

Conclusion

This paper presents a novel framework called "\ours" that aims to enable more open-ended and intrinsically motivated exploration in AI systems. By incorporating models of human notions of interestingness, the authors seek to move beyond traditional reward-based reinforcement learning towards more creative and discovery-driven behavior.

While the approach has some limitations and requires further research, the core idea of capturing aspects of human curiosity and interestingness in a computational model is a promising direction for the field of AI. Developing AI systems that can engage in open-ended learning and discovery, similar to how humans explore the world, could unlock new frontiers in artificial intelligence and help us better understand the foundations of human-like intelligence.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code

Maxence Faldor, Jenny Zhang, Antoine Cully, Jeff Clune

Open-ended and AI-generating algorithms aim to continuously generate and solve increasingly complex tasks indefinitely, offering a promising path toward more general intelligence. To accomplish this grand vision, learning must occur within a vast array of potential tasks. Existing approaches to automatically generating environments are constrained within manually predefined, often narrow distributions of environment, limiting their ability to create any learning environment. To address this limitation, we introduce a novel framework, OMNI-EPIC, that augments previous work in Open-endedness via Models of human Notions of Interestingness (OMNI) with Environments Programmed in Code (EPIC). OMNI-EPIC leverages foundation models to autonomously generate code specifying the next learnable (i.e., not too easy or difficult for the agent's current skill set) and interesting (e.g., worthwhile and novel) tasks. OMNI-EPIC generates both environments (e.g., an obstacle course) and reward functions (e.g., progress through the obstacle course quickly without touching red objects), enabling it, in principle, to create any simulatable learning task. We showcase the explosive creativity of OMNI-EPIC, which continuously innovates to suggest new, interesting learning challenges. We also highlight how OMNI-EPIC can adapt to reinforcement learning agents' learning progress, generating tasks that are of suitable difficulty. Overall, OMNI-EPIC can endlessly create learnable and interesting environments, further propelling the development of self-improving AI systems and AI-Generating Algorithms. Project website with videos: https://dub.sh/omniepic

5/27/2024

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Raghav Kapoor, Yash Parag Butala, Melisa Russak, Jing Yu Koh, Kiran Kamble, Waseem Alshikh, Ruslan Salakhutdinov

For decades, human-computer interaction has fundamentally been manual. Even today, almost all productive work done on the computer necessitates human input at every step. Autonomous virtual agents represent an exciting step in automating many of these menial tasks. Virtual agents would empower users with limited technical proficiency to harness the full possibilities of computer systems. They could also enable the efficient streamlining of numerous computer tasks, ranging from calendar management to complex travel bookings, with minimal human intervention. In this paper, we introduce OmniACT, the first-of-a-kind dataset and benchmark for assessing an agent's capability to generate executable programs to accomplish computer tasks. Our scope extends beyond traditional web automation, covering a diverse range of desktop applications. The dataset consists of fundamental tasks such as Play the next song, as well as longer horizon tasks such as Send an email to John Doe mentioning the time and place to meet. Specifically, given a pair of screen image and a visually-grounded natural language task, the goal is to generate a script capable of fully executing the task. We run several strong baseline language model agents on our benchmark. The strongest baseline, GPT-4, performs the best on our benchmark However, its performance level still reaches only 15% of the human proficiency in generating executable scripts capable of completing the task, demonstrating the challenge of our task for conventional web agents. Our benchmark provides a platform to measure and evaluate the progress of language model agents in automating computer tasks and motivates future work towards building multimodal models that bridge large language models and the visual grounding of computer screens.

7/23/2024

Open-Endedness is Essential for Artificial Superhuman Intelligence

Edward Hughes, Michael Dennis, Jack Parker-Holder, Feryal Behbahani, Aditi Mavalankar, Yuge Shi, Tom Schaul, Tim Rocktaschel

In recent years there has been a tremendous surge in the general capabilities of AI systems, mainly fuelled by training foundation models on internetscale data. Nevertheless, the creation of openended, ever self-improving AI remains elusive. In this position paper, we argue that the ingredients are now in place to achieve openendedness in AI systems with respect to a human observer. Furthermore, we claim that such open-endedness is an essential property of any artificial superhuman intelligence (ASI). We begin by providing a concrete formal definition of open-endedness through the lens of novelty and learnability. We then illustrate a path towards ASI via open-ended systems built on top of foundation models, capable of making novel, humanrelevant discoveries. We conclude by examining the safety implications of generally-capable openended AI. We expect that open-ended foundation models will prove to be an increasingly fertile and safety-critical area of research in the near future.

6/7/2024

👀

Odyssey: Empowering Agents with Open-World Skills

Shunyu Liu, Yaoru Li, Kongcheng Zhang, Zhenyu Cui, Wenkai Fang, Yuxuan Zheng, Tongya Zheng, Mingli Song

Recent studies have delved into constructing generalist agents for open-world embodied environments like Minecraft. Despite the encouraging results, existing efforts mainly focus on solving basic programmatic tasks, e.g., material collection and tool-crafting following the Minecraft tech-tree, treating the ObtainDiamond task as the ultimate goal. This limitation stems from the narrowly defined set of actions available to agents, requiring them to learn effective long-horizon strategies from scratch. Consequently, discovering diverse gameplay opportunities in the open world becomes challenging. In this work, we introduce ODYSSEY, a new framework that empowers Large Language Model (LLM)-based agents with open-world skills to explore the vast Minecraft world. ODYSSEY comprises three key parts: (1) An interactive agent with an open-world skill library that consists of 40 primitive skills and 183 compositional skills. (2) A fine-tuned LLaMA-3 model trained on a large question-answering dataset with 390k+ instruction entries derived from the Minecraft Wiki. (3) A new open-world benchmark includes thousands of long-term planning tasks, tens of dynamic-immediate planning tasks, and one autonomous exploration task. Extensive experiments demonstrate that the proposed ODYSSEY framework can effectively evaluate the planning and exploration capabilities of agents. All datasets, model weights, and code are publicly available to motivate future research on more advanced autonomous agent solutions.

7/23/2024