LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play

Published 5/21/2024 by Li-Chun Lu, Shou-Jen Chen, Tsung-Min Pai, Chan-Hung Yu, Hung-yi Lee, Shao-Hua Sun

Large language models (LLMs) have shown exceptional proficiency in natural language processing but often fall short of generating creative and original responses to open-ended questions. To enhance LLM creativity, our key insight is to emulate the human process of inducing collective creativity through engaging discussions with participants from diverse backgrounds and perspectives. To this end, we propose LLM Discussion, a three-phase discussion framework that facilitates vigorous and diverging idea exchanges and ensures convergence to creative answers. Moreover, we adopt a role-playing technique by assigning distinct roles to LLMs to combat the homogeneity of LLMs. We evaluate the efficacy of the proposed framework with the Alternative Uses Test, Similarities Test, Instances Test, and Scientific Creativity Test through both LLM evaluation and human study. Our proposed framework outperforms single-LLM approaches and existing multi-LLM frameworks across various creativity metrics.

  • This paper explores a framework for enhancing the creativity of large language models (LLMs) through discussion and role-play.
  • The authors propose an approach where LLMs engage in structured discussions and take on different personas to explore new ideas and perspectives.
  • The goal is to leverage the conversational and generative capabilities of LLMs to foster more innovative and creative outputs.

Plain English Explanation

The paper presents a way to make large language models (LLMs) more creative. LLMs are AI systems, trained on vast amounts of text data, that can generate human-like language. However, their outputs can lack originality or stay within familiar patterns.

To address this, the researchers developed a "discussion framework" where the LLM engages in structured conversations, taking on different roles or personas. For example, the LLM might play the part of a scientist, an artist, or a philosopher, and have a back-and-forth discussion with itself in these various guises.

The idea is that by adopting different perspectives through role-play, the LLM can explore new ideas and generate more innovative and creative responses. The researchers believe this approach can unlock the full generative potential of these powerful language models.

Technical Explanation

The paper proposes LLM Discussion, a three-phase "Discussion Framework" intended to enhance the creativity of large language models (LLMs). In this framework, LLMs engage in structured discussions, each taking on a distinct persona or "role" during the conversation so that the exchange is less homogeneous.

The authors conduct experiments where an LLM plays the part of multiple characters, such as a scientist, an artist, and a philosopher. The LLM then has a back-and-forth discussion between these different roles, exploring ideas and generating responses from various perspectives.
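The turn-taking described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: `run_discussion`, the prompt wording, and the `toy_llm` stand-in are all hypothetical, and a real setup would replace `toy_llm` with an actual model call.

```python
from typing import Callable, Dict, List

# Hypothetical stand-in type for a model call: prompt text in, reply text out.
LLM = Callable[[str], str]


def run_discussion(
    llm: LLM, question: str, roles: List[str], rounds: int = 2
) -> List[Dict[str, str]]:
    """Let the model speak in turn as each role, showing it the transcript so far."""
    transcript: List[Dict[str, str]] = []
    for round_index in range(rounds):
        for role in roles:
            # Flatten earlier turns so each speaker can react to them.
            history = "\n".join(f"{t['role']}: {t['text']}" for t in transcript)
            prompt = (
                f"Role: {role}\n"
                f"Question: {question}\n"
                f"Discussion so far:\n{history or '(none)'}\n"
                "Contribute an idea that has not been mentioned yet."
            )
            transcript.append(
                {"role": role, "round": str(round_index + 1), "text": llm(prompt)}
            )
    return transcript


def toy_llm(prompt: str) -> str:
    """Offline placeholder (assumption): echoes the role found on the first prompt line."""
    role = prompt.splitlines()[0].split(": ", 1)[1]
    return f"an idea from the {role}'s perspective"


if __name__ == "__main__":
    turns = run_discussion(
        toy_llm,
        "What are unusual uses for a paperclip?",
        ["scientist", "artist", "philosopher"],
    )
    for turn in turns:
        print(f"[round {turn['round']}] {turn['role']}: {turn['text']}")
```

Passing the model in as a plain callable keeps the loop independent of any particular provider, which also makes it easy to test with a stub.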

The key insight is that by adopting different identities, the LLM can move beyond its typical patterns and outputs, and generate more novel and creative content. The researchers hypothesize that this role-play approach leverages the conversational and generative capabilities of LLMs to foster greater innovation.

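The paper evaluates the framework on four creativity benchmarks (the Alternative Uses Test, Similarities Test, Instances Test, and Scientific Creativity Test), using both LLM-based evaluation and a human study. The authors report that it outperforms single-LLM approaches and existing multi-LLM frameworks across various creativity metrics.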
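As a rough illustration of what comparing output diversity can look like in practice, the sketch below computes the mean pairwise Jaccard distance between the word sets of a batch of responses. This lexical proxy is an assumption made for illustration only. It is not the metric used in the paper, which relies on creativity tests scored through LLM evaluation and a human study. The function names are hypothetical.

```python
from itertools import combinations
from typing import List


def jaccard_distance(a: str, b: str) -> float:
    """Return 1 minus the word-set overlap of two responses (0 = same words, 1 = disjoint)."""
    words_a, words_b = set(a.lower().split()), set(b.lower().split())
    union = words_a | words_b
    if not union:
        return 0.0  # two empty responses are treated as identical
    return 1.0 - len(words_a & words_b) / len(union)


def mean_pairwise_distance(responses: List[str]) -> float:
    """Average Jaccard distance over all unordered pairs of responses."""
    pairs = list(combinations(responses, 2))
    if not pairs:
        return 0.0  # fewer than two responses: no diversity to measure
    return sum(jaccard_distance(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    baseline = ["hold papers together", "hold papers together"]
    discussion = ["hold papers together", "pick a lock", "sculpt a tiny statue"]
    print(mean_pairwise_distance(baseline))
    print(mean_pairwise_distance(discussion))
```

A higher score only means the responses share fewer words. It says nothing about whether the ideas are useful or original, which is why human or model-based judgments are still needed.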

Critical Analysis

The paper presents a promising approach for enhancing the creativity of large language models, but there are some limitations and areas for further research:

  • The experiments are conducted on a single LLM, and it is unclear how well the findings generalize to other LLMs with different architectures or training data. [link to "Exploring Capabilities of Large Language Models for Generating Diverse and Novel Content"]
  • The roles and discussion prompts used in the experiments are relatively simple, and more complex or nuanced persona-shifting may be required to fully unlock the model's creative potential. [link to "Large Language Model-based Situational Dialogues for Second Language Learning"]
  • The paper does not address potential issues around bias, toxicity, or safety that could arise from LLMs taking on different identities and engaging in open-ended discussions. [link to "Multi-Role Consensus Through LLMs' Discussions on Vulnerability"]
  • Further research is needed to understand the cognitive processes and mechanisms underlying the enhanced creativity observed in the Discussion Framework, and how these insights could be applied to improve LLM design and training. [link to "Apprentices to Research Assistants: Advancing Research with Large Language Models"]

Overall, the paper presents an innovative approach to fostering creativity in large language models, but more work is needed to realize its potential and address the challenges listed above.

Conclusion

This paper introduces a Discussion Framework that aims to enhance the creativity of large language models (LLMs) through structured conversations and role-play. By having LLMs adopt different personas and engage in back-and-forth discussions, the researchers demonstrate that the models can generate more diverse and innovative outputs compared to a baseline.

The key insight is that this approach leverages the conversational and generative capabilities of LLMs to move beyond their typical patterns and explore new ideas from multiple perspectives. While the paper presents promising results, there are also limitations and areas for further research, such as exploring more complex role-shifting, addressing potential safety and bias concerns, and understanding the underlying cognitive processes.

Overall, the Discussion Framework represents an interesting step towards unlocking the full creative potential of large language models, with implications for a wide range of applications, from content creation to problem-solving and beyond.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!
