Creative Beam Search

2405.00099

Published 5/10/2024 by Giorgio Franceschelli, Mirco Musolesi

Abstract

Large language models are revolutionizing several areas, including artificial creativity. However, the process of generation in machines profoundly diverges from that observed in humans. In particular, machine generation is characterized by a lack of intentionality and an underlying creative process. We propose a method called Creative Beam Search that uses Diverse Beam Search and LLM-as-a-Judge to perform response generation and response validation. The results of a qualitative experiment show how our approach can provide better output than standard sampling techniques. We also show that the response validation step is a necessary complement to the response generation step.

Create account to get full access

Overview

This paper introduces a novel technique called "Creative Beam Search" that aims to improve the creative capabilities of large language models (LLMs).
The key idea is to modify the standard beam search decoding algorithm used in LLMs to encourage more diverse and innovative output.
The authors test their approach on several creative tasks, including generating games via LLMs, generating situated reflection triggers about alternative solutions, and iterative research idea generation.

Plain English Explanation

Large language models (LLMs) like GPT-3 are impressive at generating human-like text, but they can often produce predictable or repetitive outputs, especially when tasked with creative writing or idea generation. The "Creative Beam Search" technique described in this paper aims to address this by modifying the standard beam search decoding algorithm used in LLMs.

Beam search is a common method for generating text one word at a time, keeping track of the most promising partial outputs (the "beams") and expanding them. The key insight of this work is that by encouraging the beam search to explore more diverse possibilities, the model can produce more creative and innovative outputs.

The authors test their approach on several tasks that require creativity, such as generating video game ideas, generating thought-provoking reflections on alternative solutions, and iterative research idea generation. The results suggest that Creative Beam Search can indeed produce more diverse and creative outputs compared to standard beam search.

Technical Explanation

The key innovation of this work is the "Creative Beam Search" algorithm, which modifies the standard beam search decoding process used in many LLMs. In traditional beam search, the algorithm keeps track of the top-k most promising partial outputs (the "beams") and expands them one word at a time, selecting the next word based on the model's probability distribution.

The Creative Beam Search approach introduces two main changes:

Beam Reranking: Instead of solely selecting the top-k most probable beams, the algorithm also considers the diversity of the beams. This encourages the search to explore a wider range of possibilities, rather than focusing only on the most likely outputs.
Iterative Refinement: The algorithm iterates through multiple rounds of beam search, using the outputs from the previous round to inform the search in the current round. This allows the model to gradually refine and improve the creativity of its outputs over time.

The authors evaluate their approach on several creative tasks, including generating video game ideas, generating situated reflection triggers about alternative solutions, and iterative research idea generation. The results show that Creative Beam Search outperforms standard beam search on measures of diversity and creativity, demonstrating the potential of this approach to enhance the creative capabilities of LLMs.

Critical Analysis

The paper provides a thoughtful and well-designed study of the Creative Beam Search algorithm, but there are a few potential limitations and areas for further research:

Task Generalization: The authors focus on a limited set of creative tasks, such as video game idea generation and research idea generation. It would be helpful to see how the approach performs on a wider range of creative tasks to better understand its general applicability.
Human Evaluation: While the authors use automated metrics to assess the creativity of the outputs, it would be valuable to also conduct human evaluations. This could provide deeper insights into how end-users perceive the creativity and usefulness of the generated content.
Computational Efficiency: The iterative refinement process in Creative Beam Search may be computationally more expensive than standard beam search. The authors could explore ways to improve the efficiency of the algorithm, perhaps by incorporating techniques like progressive search or mechanical design agents.

Overall, the Creative Beam Search approach is a promising step towards enhancing the creative capabilities of LLMs, and the paper presents a thoughtful and well-executed study. However, further research is needed to fully understand the strengths, limitations, and broader implications of this technique.

Conclusion

The "Creative Beam Search" algorithm introduced in this paper represents a novel approach to improving the creative capabilities of large language models (LLMs). By modifying the standard beam search decoding process to encourage more diverse and innovative outputs, the authors demonstrate that LLMs can be made more effective at tasks that require creativity, such as generating video game ideas, generating situated reflections on alternative solutions, and iterative research idea generation.

While the paper presents a thoughtful and well-designed study, there are still some areas for further research, such as exploring the generalization of the approach to a wider range of creative tasks, incorporating human evaluation, and improving the computational efficiency of the algorithm. Nonetheless, the Creative Beam Search technique represents a promising step forward in the ongoing effort to enhance the creative capabilities of LLMs and unlock their full potential in a wide range of applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

💬

Characterising the Creative Process in Humans and Large Language Models

Surabhi S. Nath, Peter Dayan, Claire Stevenson

Large language models appear quite creative, often performing on par with the average human on creative tasks. However, research on LLM creativity has focused solely on textit{products}, with little attention on the creative textit{process}. Process analyses of human creativity often require hand-coded categories or exploit response times, which do not apply to LLMs. We provide an automated method to characterise how humans and LLMs explore semantic spaces on the Alternate Uses Task, and contrast with behaviour in a Verbal Fluency Task. We use sentence embeddings to identify response categories and compute semantic similarities, which we use to generate jump profiles. Our results corroborate earlier work in humans reporting both persistent (deep search in few semantic spaces) and flexible (broad search across multiple semantic spaces) pathways to creativity, where both pathways lead to similar creativity scores. LLMs were found to be biased towards either persistent or flexible paths, that varied across tasks. Though LLMs as a population match human profiles, their relationship with creativity is different, where the more flexible models score higher on creativity. Our dataset and scripts are available on href{https://github.com/surabhisnath/Creative_Process}{GitHub}.

6/7/2024

cs.HC cs.AI cs.CL

🤔

Creativity and Machine Learning: A Survey

Giorgio Franceschelli, Mirco Musolesi

There is a growing interest in the area of machine learning and creativity. This survey presents an overview of the history and the state of the art of computational creativity theories, key machine learning techniques (including generative deep learning), and corresponding automatic evaluation methods. After presenting a critical discussion of the key contributions in this area, we outline the current research challenges and emerging opportunities in this field.

5/8/2024

cs.LG cs.AI cs.CY

💬

Creativity Has Left the Chat: The Price of Debiasing Language Models

Behnam Mohammadi

Large Language Models (LLMs) have revolutionized natural language processing but can exhibit biases and may generate toxic content. While alignment techniques like Reinforcement Learning from Human Feedback (RLHF) reduce these issues, their impact on creativity, defined as syntactic and semantic diversity, remains unexplored. We investigate the unintended consequences of RLHF on the creativity of LLMs through three experiments focusing on the Llama-2 series. Our findings reveal that aligned models exhibit lower entropy in token predictions, form distinct clusters in the embedding space, and gravitate towards attractor states, indicating limited output diversity. Our findings have significant implications for marketers who rely on LLMs for creative tasks such as copywriting, ad creation, and customer persona generation. The trade-off between consistency and creativity in aligned models should be carefully considered when selecting the appropriate model for a given application. We also discuss the importance of prompt engineering in harnessing the creative potential of base models.

6/11/2024

cs.CL cs.AI

📉

Automating Creativity

Ming-Hui Huang, Roland T. Rust

Generative AI (GenAI) has spurred the expectation of being creative, due to its ability to generate content, yet so far, its creativity has somewhat disappointed, because it is trained using existing data following human intentions to generate outputs. The purpose of this paper is to explore what is required to evolve AI from generative to creative. Based on a reinforcement learning approach and building upon various research streams of computational creativity, we develop a triple prompt-response-reward engineering framework to develop the creative capability of GenAI. This framework consists of three components: 1) a prompt model for expected creativity by developing discriminative prompts that are objectively, individually, or socially novel, 2) a response model for observed creativity by generating surprising outputs that are incrementally, disruptively, or radically innovative, and 3) a reward model for improving creativity over time by incorporating feedback from the AI, the creator/manager, and/or the customers. This framework enables the application of GenAI for various levels of creativity strategically.

5/14/2024

cs.AI