Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions

2406.12216

Published 6/19/2024 by Yongyi Ji, Zhisheng Tang, Mayank Kejriwal

Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions

Abstract

Personality, a fundamental aspect of human cognition, contains a range of traits that influence behaviors, thoughts, and emotions. This paper explores the capabilities of large language models (LLMs) in reconstructing these complex cognitive attributes based only on simple descriptions containing socio-demographic and personality type information. Utilizing the HEXACO personality framework, our study examines the consistency of LLMs in recovering and predicting underlying (latent) personality dimensions from simple descriptions. Our experiments reveal a significant degree of consistency in personality reconstruction, although some inconsistencies and biases, such as a tendency to default to positive traits in the absence of explicit information, are also observed. Additionally, socio-demographic factors like age and number of children were found to influence the reconstructed personality dimensions. These findings have implications for building sophisticated agent-based simulacra using LLMs and highlight the need for further research on robust personality generation in LLMs.

Create account to get full access

Overview

Examines whether using ChatGPT, a large language model, can reconstruct an agent's latent personality from simple descriptions
Explores the relationship between persona and personality in AI agents
Investigates the ability of language models to infer psychological traits and dispositions from limited textual input

Plain English Explanation

This research paper explores whether using a powerful language model like ChatGPT can allow us to reconstruct an AI agent's underlying personality from just a simple description of their persona. The idea is to see if there is a way to go beyond just defining a character's surface-level traits (their "persona") and actually uncover their deeper psychological makeup and dispositions.

The researchers wanted to investigate the relationship between the outward persona we present and our underlying personality. They used ChatGPT to see if it could take a basic description of an agent's persona and then infer their more complex psychological traits and tendencies. This builds on previous research that has looked at the ability of large language models to infer personality and psychological dispositions from limited textual information.

The key question is whether a persona is enough to fully capture an agent's personality, or if there are deeper layers that require more sophisticated modeling to uncover. This research aims to shed light on the limitations of using personas alone to simulate human-like psychology in conversational AI systems.

Technical Explanation

The researchers conducted experiments where they provided ChatGPT with simple persona descriptions for fictional characters and asked it to infer their underlying personality traits. The persona descriptions included basic information like the character's age, occupation, hobbies, and general demeanor.

ChatGPT was then prompted to analyze the persona and generate a detailed profile of the character's personality, including their psychological tendencies, emotional dispositions, and cognitive styles. The researchers compared ChatGPT's inferred personality profiles to ground truth personality assessments for the characters to evaluate the model's ability to reconstruct their latent personality from the limited persona information.

The results suggest that while ChatGPT was able to make some reasonable inferences about the characters' personalities, there were also significant limitations in its ability to fully capture the depth and nuance of their psychological makeup based solely on the persona descriptions. The researchers found that language models have a limited capacity to simulate the full complexity of human personality from constrained textual inputs.

Critical Analysis

One key caveat noted in the paper is that the persona descriptions provided to ChatGPT were relatively simple and straightforward. More complex or ambiguous persona details may have made it more challenging for the language model to infer accurate personality profiles. The researchers acknowledge that their findings may not generalize to more sophisticated persona representations.

Additionally, the ground truth personality assessments used to evaluate ChatGPT's inferences were themselves subjective and potentially biased. There is a concern that the "true" personality of the fictional characters may not have been fully captured, making it difficult to definitively assess the model's performance.

Further research is needed to better understand the relationship between persona and personality in conversational AI systems. Expanding the range of persona descriptions, incorporating more contextual information, and exploring alternative methods for evaluating personality inference could all help shed additional light on the limitations and potential of language models in this domain.

Conclusion

This research suggests that while large language models like ChatGPT can make reasonable inferences about an agent's personality based on limited persona descriptions, there are significant challenges in fully reconstructing their underlying psychological makeup from such constrained inputs. The findings highlight the need for more sophisticated approaches to modeling human-like personality in conversational AI systems, going beyond simple persona definitions.

As the field of conversational AI continues to advance, a deeper understanding of the connection between outward persona and inner personality will be essential for creating agents that can engage in more nuanced, empathetic, and human-like interactions. This paper provides a valuable contribution to this ongoing exploration.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

💬

PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

Hang Jiang, Xiajie Zhang, Xubo Cao, Cynthia Breazeal, Deb Roy, Jad Kabbara

Despite the many use cases for large language models (LLMs) in creating personalized chatbots, there has been limited research on evaluating the extent to which the behaviors of personalized LLMs accurately and consistently reflect specific personality traits. We consider studying the behavior of LLM-based agents which we refer to as LLM personas and present a case study with GPT-3.5 and GPT-4 to investigate whether LLMs can generate content that aligns with their assigned personality profiles. To this end, we simulate distinct LLM personas based on the Big Five personality model, have them complete the 44-item Big Five Inventory (BFI) personality test and a story writing task, and then assess their essays with automatic and human evaluations. Results show that LLM personas' self-reported BFI scores are consistent with their designated personality types, with large effect sizes observed across five traits. Additionally, LLM personas' writings have emerging representative linguistic patterns for personality traits when compared with a human writing corpus. Furthermore, human evaluation shows that humans can perceive some personality traits with an accuracy of up to 80%. Interestingly, the accuracy drops significantly when the annotators were informed of AI authorship.

4/3/2024

cs.CL cs.AI cs.HC

💬

Large Language Models Can Infer Personality from Free-Form User Interactions

Heinrich Peters, Moran Cerf, Sandra C. Matz

This study investigates the capacity of Large Language Models (LLMs) to infer the Big Five personality traits from free-form user interactions. The results demonstrate that a chatbot powered by GPT-4 can infer personality with moderate accuracy, outperforming previous approaches drawing inferences from static text content. The accuracy of inferences varied across different conversational settings. Performance was highest when the chatbot was prompted to elicit personality-relevant information from users (mean r=.443, range=[.245, .640]), followed by a condition placing greater emphasis on naturalistic interaction (mean r=.218, range=[.066, .373]). Notably, the direct focus on personality assessment did not result in a less positive user experience, with participants reporting the interactions to be equally natural, pleasant, engaging, and humanlike across both conditions. A chatbot mimicking ChatGPT's default behavior of acting as a helpful assistant led to markedly inferior personality inferences and lower user experience ratings but still captured psychologically meaningful information for some of the personality traits (mean r=.117, range=[-.004, .209]). Preliminary analyses suggest that the accuracy of personality inferences varies only marginally across different socio-demographic subgroups. Our results highlight the potential of LLMs for psychological profiling based on conversational interactions. We discuss practical implications and ethical challenges associated with these findings.

5/24/2024

cs.HC cs.AI cs.CL cs.CY cs.LG

🏷️

Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis

Nikolay B Petrov, Gregory Serapio-Garc'ia, Jason Rentfrow

The humanlike responses of large language models (LLMs) have prompted social scientists to investigate whether LLMs can be used to simulate human participants in experiments, opinion polls and surveys. Of central interest in this line of research has been mapping out the psychological profiles of LLMs by prompting them to respond to standardized questionnaires. The conflicting findings of this research are unsurprising given that mapping out underlying, or latent, traits from LLMs' text responses to questionnaires is no easy task. To address this, we use psychometrics, the science of psychological measurement. In this study, we prompt OpenAI's flagship models, GPT-3.5 and GPT-4, to assume different personas and respond to a range of standardized measures of personality constructs. We used two kinds of persona descriptions: either generic (four or five random person descriptions) or specific (mostly demographics of actual humans from a large-scale human dataset). We found that the responses from GPT-4, but not GPT-3.5, using generic persona descriptions show promising, albeit not perfect, psychometric properties, similar to human norms, but the data from both LLMs when using specific demographic profiles, show poor psychometrics properties. We conclude that, currently, when LLMs are asked to simulate silicon personas, their responses are poor signals of potentially underlying latent traits. Thus, our work casts doubt on LLMs' ability to simulate individual-level human behaviour across multiple-choice question answering tasks.

5/14/2024

cs.CL cs.AI cs.CY cs.HC

💬

Large Language Models Can Infer Psychological Dispositions of Social Media Users

Heinrich Peters, Sandra Matz

Large Language Models (LLMs) demonstrate increasingly human-like abilities across a wide variety of tasks. In this paper, we investigate whether LLMs like ChatGPT can accurately infer the psychological dispositions of social media users and whether their ability to do so varies across socio-demographic groups. Specifically, we test whether GPT-3.5 and GPT-4 can derive the Big Five personality traits from users' Facebook status updates in a zero-shot learning scenario. Our results show an average correlation of r = .29 (range = [.22, .33]) between LLM-inferred and self-reported trait scores - a level of accuracy that is similar to that of supervised machine learning models specifically trained to infer personality. Our findings also highlight heterogeneity in the accuracy of personality inferences across different age groups and gender categories: predictions were found to be more accurate for women and younger individuals on several traits, suggesting a potential bias stemming from the underlying training data or differences in online self-expression. The ability of LLMs to infer psychological dispositions from user-generated text has the potential to democratize access to cheap and scalable psychometric assessments for both researchers and practitioners. On the one hand, this democratization might facilitate large-scale research of high ecological validity and spark innovation in personalized services. On the other hand, it also raises ethical concerns regarding user privacy and self-determination, highlighting the need for stringent ethical frameworks and regulation.

6/6/2024

cs.CL cs.AI cs.CY cs.HC cs.LG cs.SI