lit-6B

Maintainer: hakurei

Total Score: 63

Last updated 5/27/2024


  • Run this model: Run on HuggingFace
  • API spec: View on HuggingFace
  • Github link: No Github link provided
  • Paper link: No paper link provided


Model Overview

lit-6B is a GPT-J 6B model fine-tuned on a diverse range of light novels, erotica, and annotated literature for the purpose of generating novel-like fictional text. As described by the maintainer hakurei, the model was trained on 2GB of data and can be used for entertainment purposes and as a creative writing assistant for fiction writers.

Similar models include GPT-J 6B, a 6 billion parameter auto-regressive language model trained on The Pile dataset, and OPT-6.7B-Erebus, a 6.7 billion parameter model fine-tuned on various "adult" themed datasets. Another related model is MPT-7B-StoryWriter-65k+, a 7 billion parameter model designed for generating long-form fictional stories.

Model Inputs and Outputs

lit-6B takes in text prompts that can be annotated with tags like [ Title: The Dunwich Horror; Author: H. P. Lovecraft; Genre: Horror; Tags: 3rdperson, scary; Style: Dark ] to guide the generation towards a specific style of fiction. The model then generates new text that continues the story in the specified tone and genre.

Inputs

  • Text prompts, optionally with metadata tags to indicate desired genre, style, and other attributes

Outputs

  • Continuation of the input text, generating novel-like fiction in the specified style
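To make the annotation format concrete, here is a minimal sketch of prompting the model through the Hugging Face transformers API. The repo id hakurei/lit-6B is inferred from the maintainer and model name above, and the sampling settings are illustrative rather than recommendations:

```python
# Minimal sketch: annotated fiction generation with lit-6B.
# The repo id "hakurei/lit-6B" is inferred from the maintainer/model name
# above -- confirm it on the HuggingFace model page before running.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("hakurei/lit-6B")
model = AutoModelForCausalLM.from_pretrained("hakurei/lit-6B")

# Metadata tags steer genre and style; the story text follows on the next line.
prompt = (
    "[ Title: The Dunwich Horror; Author: H. P. Lovecraft; "
    "Genre: Horror; Tags: 3rdperson, scary; Style: Dark ]\n"
    "The night wind howled over the hills as the old house creaked"
)

inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(
    **inputs,
    do_sample=True,      # sample rather than greedy decode for varied prose
    temperature=0.9,     # illustrative setting, tune to taste
    max_new_tokens=200,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

Swapping the bracketed tags (say, Genre: Romance; Style: Light) should steer the continuation toward a different register without changing the story text itself.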

Capabilities

lit-6B is adept at generating fictional narratives across a range of genres, from horror to romance, by leveraging the metadata annotations provided in the input prompt. The model can produce coherent and engaging passages that flow naturally from the initial text, making it a useful tool for creative writing and story development.

What Can I Use it For?

lit-6B is well-suited for various entertainment and creative writing applications. Writers can use the model as a collaborative partner to brainstorm ideas, develop characters and plot lines, or generate passages for their stories. The model's ability to adapt to different genres and styles also makes it potentially useful for interactive fiction, game development, or other narrative-driven applications.

Things to Try

One interesting aspect of lit-6B is its use of annotative prompting to guide generation. Try experimenting with different combinations of genre, style, and other tags to see how the model's output changes. You could also provide longer input prompts to see how the model continues and expands the narrative. Additionally, you may want to explore the model's behavior when generating content for different target audiences or handling more mature themes, while remaining mindful of potential biases and limitations.




Related Models


genji-jp

Maintainer: NovelAI

Total Score: 46

genji-jp is a 6 billion parameter model fine-tuned by NovelAI on a dataset of Japanese web novels. It is based on the GPT-J 6B model, which was trained by EleutherAI on a large corpus of English text. The Genji-JP model inherits GPT-J's architecture, including 28 layers, a model dimension of 4096, and 16 attention heads, with rotary position encodings used to model long-range dependencies. Similar Japanese-focused language models include Lit-6B, a GPT-J 6B model fine-tuned on light novels and erotica, and weblab-10b, a 10 billion parameter multilingual GPT-NeoX model trained on Japanese and English corpora.

Model Inputs and Outputs

Inputs

  • Text prompt: A text prompt the model uses to generate new Japanese text

Outputs

  • Generated text: Japanese text that continues and expands on the given prompt, aiming to be coherent and consistent with it

Capabilities

The Genji-JP model is capable of generating long-form Japanese text in a variety of storytelling styles and genres. It can continue short story prompts, generate synopses or outlines for longer narratives, or produce entirely new creative stories. The model's familiarity with Japanese web novel conventions allows it to generate content that feels natural and in keeping with the style of that genre.

What Can I Use it For?

The Genji-JP model could serve as a creative writing assistant for authors working on Japanese-language fiction, helping to generate ideas, expand outlines, or produce first drafts that the author then refines. Its grasp of Japanese web novel conventions makes it particularly well-suited to that domain. Beyond fiction writing, the model could also generate Japanese text for other applications, such as dialogue in video games, subtitles for anime, or content for Japanese-focused websites and social media.

Things to Try

One interesting aspect of the Genji-JP model is its ability to capture the nuances of Japanese storytelling and web novel conventions. Prompts that leverage these cultural elements, such as introducing a common character archetype or setting a scene in a familiar Japanese locale, may yield particularly compelling and authentic-feeling text. Experimenting with prompt style and length can also be fruitful, as in the sketch below: very short, open-ended prompts give the model more creative freedom, while more detailed prompts tend to produce more coherent, on-topic generations. Finding the right balance between guidance and autonomy is part of the creative process when using language models like Genji-JP.
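As a rough sketch of that kind of experimentation, assuming the checkpoint loads through the standard transformers causal-LM classes and that the repo id is NovelAI/genji-jp (inferred from the maintainer and model name above, so verify before use):

```python
# Sketch: continuing a Japanese story prompt with Genji-JP.
# Repo id "NovelAI/genji-jp" is assumed from the maintainer/model name.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("NovelAI/genji-jp")
model = AutoModelForCausalLM.from_pretrained(
    "NovelAI/genji-jp", torch_dtype=torch.float16
).to("cuda")  # assumes a GPU; drop .to("cuda") and fp16 to run on CPU

prompt = "昔々、あるところに"  # "Once upon a time, in a certain place..."
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
output = model.generate(
    **inputs,
    do_sample=True,
    temperature=0.8,
    top_p=0.95,          # nucleus sampling keeps prose varied but on-topic
    max_new_tokens=150,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```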



gpt-j-6b

Maintainer: EleutherAI

Total Score: 1.4K

The gpt-j-6b is a large language model trained by EleutherAI, a research group dedicated to developing open-source AI systems. The model has 6 billion trainable parameters and uses the same tokenizer as GPT-2 and GPT-3, with a vocabulary size of 50,257. It utilizes Rotary Position Embedding (RoPE) for positional encoding. Similar models include GPT-2B-001 and ChatGLM2-6B, which are also large transformer models trained for language generation tasks; however, the gpt-j-6b model differs in its specific architecture, training data, and intended use cases.

Model Inputs and Outputs

Inputs

  • Text prompts of varying length, up to the model's context window of 2048 tokens

Outputs

  • Human-like text continuation of the provided prompt; the output can be of arbitrary length, though the model is typically used to generate short- to medium-length responses

Capabilities

The gpt-j-6b model is adept at generating coherent and contextually relevant text continuations. It can be used for a variety of language generation tasks, such as creative writing, dialogue generation, and content summarization. However, the model has not been fine-tuned for specific downstream applications like chatbots or commercial use cases.

What Can I Use it For?

The gpt-j-6b model is well-suited for research and experimentation, as it provides a powerful language generation capability that can be further fine-tuned or incorporated into larger AI systems. Potential use cases include:

  • Prototyping conversational AI agents
  • Generating creative writing prompts and story continuations
  • Summarizing long-form text
  • Augmenting existing language models with additional capabilities

However, the model should not be deployed for human-facing applications without appropriate supervision, as it may generate harmful or offensive content.

Things to Try

One interesting aspect of the gpt-j-6b model is its ability to generate long-form text continuations. Researchers could experiment with prompting the model to write multi-paragraph essays or short stories, as sketched below, and analyze the coherence and creativity of the output. The model could also be fine-tuned on specific datasets or tasks to explore its potential for specialized language generation applications.
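A minimal sketch of that kind of open-ended continuation, using the standard transformers API; the repo id EleutherAI/gpt-j-6B is the model's published HuggingFace name, and the prompt is just an example:

```python
# Sketch: open-ended text continuation with GPT-J-6B.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6B")
model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6B")

prompt = "In a shocking finding, scientists discovered a herd of unicorns"
inputs = tokenizer(prompt, return_tensors="pt")

# The context window is 2048 tokens, so the prompt plus generated text
# must stay within that budget.
output = model.generate(
    **inputs,
    do_sample=True,
    temperature=0.9,
    max_new_tokens=100,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```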



pygmalion-6b

Maintainer: PygmalionAI

Total Score: 721

pygmalion-6b is a proof-of-concept dialogue model based on GPT-J-6B, created by PygmalionAI. It has been fine-tuned on 56MB of dialogue data gathered from multiple sources, including both real and partially machine-generated conversations. This model is not suitable for use by minors, as it may output X-rated content under certain circumstances.

Model Inputs and Outputs

The pygmalion-6b model takes in prompts formatted with specific persona and dialogue history information.

Inputs

  • [CHARACTER]'s Persona: A few sentences describing the character the model should portray
  • <START>: A delimiter token separating the persona from the dialogue history
  • [DIALOGUE HISTORY]: Previous messages in the conversation to provide context

Outputs

  • A text response in the voice of the specified character

Capabilities

pygmalion-6b is capable of engaging in open-ended dialogue, roleplaying different characters, and generating fictional conversations. However, due to the training data used, the model may produce socially unacceptable or offensive text at times.

What Can I Use it For?

The pygmalion-6b model can be used for entertainment purposes, such as interactive fiction or chatbots for fictional scenarios. It is not suitable for commercial or high-stakes applications, as its outputs cannot be guaranteed to be safe or accurate. The Gradio UI notebook provided by PygmalionAI offers an easy way to experiment with the model.

Things to Try

Try prompting the model with detailed personas and dialogue histories to see how it responds in character; a sketch follows below. Experiment with different types of fictional scenarios, such as fantasy, sci-fi, or historical settings, and pay attention to how the model's responses change based on the provided context.
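Here is a sketch of the prompt layout described above, with a made-up character ("Aria") for illustration. The repo id PygmalionAI/pygmalion-6b is inferred from the maintainer and model name, and the <START> delimiter follows the input format described in the model's documentation; verify both against the model page:

```python
# Sketch: building a Pygmalion-style persona prompt and generating a reply.
# Character, persona, and dialogue are invented for illustration only.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "PygmalionAI/pygmalion-6b"  # assumed from maintainer/model name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = (
    "Aria's Persona: Aria is a cheerful tavern keeper who loves to gossip "
    "about travelers and always has a story to tell.\n"
    "<START>\n"
    "You: Good evening! Anything interesting happen today?\n"
    "Aria:"
)

inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs, do_sample=True, temperature=0.7, max_new_tokens=80)

# Slice off the prompt tokens so only Aria's reply is printed.
reply = tokenizer.decode(
    output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(reply)
```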



OPT-6.7B-Erebus

Maintainer: KoboldAI

Total Score: 92

The OPT-6.7B-Erebus is a large language model developed by KoboldAI. It is the second generation of the original Shinen model, trained on a dataset drawn from 6 different sources of "Adult"-themed content. The name "Erebus" comes from Greek mythology, representing "darkness", much like the original Shin'en, or "deep abyss". This model is similar to other Erebus models from KoboldAI, such as OPT-13B-Erebus and GPT-NeoX-20B-Erebus, which were also trained on adult-themed datasets; the OPT-6.7B-Erebus simply has a smaller parameter count.

Model Inputs and Outputs

Inputs

  • Text prompts for text generation

Outputs

  • Continuation of the input text, generated autoregressively

Capabilities

The OPT-6.7B-Erebus model is capable of generating coherent, adult-themed text based on provided prompts. It can produce narratives, descriptions, and dialogue in a variety of adult genres and styles. However, as the model was trained on an explicit dataset, the generated output reflects this bias and may not be suitable for all audiences.

What Can I Use it For?

The OPT-6.7B-Erebus model could be used for creative writing projects, erotica, or other adult-oriented content generation. It is important to be aware of the model's biases and limitations and to use it responsibly. The model should not be deployed in public-facing applications without proper moderation and filtering.

Things to Try

Try providing the model with different types of adult-themed prompts, such as romance or sensual description, and see how it responds. You can also experiment with generation parameters like temperature or top-k sampling to adjust the style and content of the generated text, as sketched below. Just be mindful of the model's limitations and potential for inappropriate output.
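A small sketch of that kind of parameter sweep, assuming the repo id KoboldAI/OPT-6.7B-Erebus (inferred from the maintainer and model name above) and the standard transformers API; the specific temperature/top-k pairs are arbitrary starting points:

```python
# Sketch: comparing sampling settings with OPT-6.7B-Erebus.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "KoboldAI/OPT-6.7B-Erebus"  # assumed from maintainer/model name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "The candlelight flickered as she stepped into the room"
inputs = tokenizer(prompt, return_tensors="pt")

# Lower temperature -> more conservative prose; higher top_k -> wider word choice.
for temperature, top_k in [(0.7, 40), (1.0, 40), (1.0, 100)]:
    output = model.generate(
        **inputs,
        do_sample=True,
        temperature=temperature,
        top_k=top_k,
        max_new_tokens=60,
    )
    print(f"--- temperature={temperature}, top_k={top_k} ---")
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```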
