WizardLM-13B-V1-1-SuperHOT-8K-GPTQ

Maintainer: TheBloke

Last updated 9/6/2024

🧠

Property	Value
Run this model	Run on HuggingFace
API spec	View on HuggingFace
Github link	No Github link provided
Paper link	No paper link provided

Create account to get full access

Model overview

WizardLM-13B-V1-1-SuperHOT-8K-GPTQ is a 13 billion parameter large language model that was created by merging WizardLM's WizardLM 13B V1.1 with Kaio Ken's SuperHOT 8K and then quantizing the model to 4-bit precision using GPTQ-for-LLaMa. This experimental model offers an increased context size of up to 8K tokens, which has been tested to work with the ExLlama library and text-generation-webui.

Model inputs and outputs

Inputs

Prompts: The model takes prompts as input, which can be in the form of natural language text, code, or a combination of the two.

Outputs

Text generation: The primary output of the model is generated text, which can be used for a variety of tasks such as language modeling, summarization, translation, and creative writing.

Capabilities

The WizardLM-13B-V1-1-SuperHOT-8K-GPTQ model is capable of generating coherent and contextually relevant text across a wide range of topics. Its increased context size allows it to maintain coherence and consistency over longer stretches of text, making it particularly well-suited for tasks that require sustained reasoning or storytelling.

What can I use it for?

This model can be used for a variety of natural language processing tasks, such as:

Creative writing: The model's ability to generate coherent and contextually relevant text makes it useful for tasks like story writing, dialogue generation, and creative prompt completion.
Task-oriented dialogue: With its increased context size, the model can be used to build interactive conversational agents that can engage in multi-turn dialogues and maintain context over longer exchanges.
Content generation: The model can be used to generate text for a wide range of applications, such as blog posts, articles, product descriptions, and more.

Things to try

One interesting aspect of this model is its ability to leverage the extended 8K context size. By setting the appropriate parameters in tools like text-generation-webui, you can experiment with the model's performance on tasks that require maintaining coherence and consistency over longer stretches of text. Additionally, the model's quantization to 4-bit precision makes it more efficient and accessible for deployment on a variety of hardware platforms.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models

🐍

WizardLM-13B-V1-0-Uncensored-SuperHOT-8K-GPTQ

TheBloke

The WizardLM-13B-V1-0-Uncensored-SuperHOT-8K-GPTQ model is a 13B parameter language model created by combining Eric Hartford's WizardLM 13B V1.0 Uncensored with Kaio Ken's SuperHOT 8K. The model has been quantized to 4-bit using the GPTQ-for-LLaMa tool, which allows for increased context size up to 8K tokens. This model is an experimental new GPTQ that offers expanded context compared to the original WizardLM 13B V1.0 Uncensored. Model inputs and outputs The WizardLM-13B-V1-0-Uncensored-SuperHOT-8K-GPTQ model takes text prompts as input and generates coherent, detailed responses. The model has been trained on a large corpus of online text data, allowing it to understand and converse on a wide range of topics. Inputs Text prompt**: A text prompt provided to the model to initiate the generation of a response. Outputs Generated text**: The model's response to the provided text prompt, which can be up to 8192 tokens in length. Capabilities The WizardLM-13B-V1-0-Uncensored-SuperHOT-8K-GPTQ model is a powerful language model capable of engaging in open-ended conversations, answering questions, and generating human-like text on a variety of subjects. Its expanded context size allows it to maintain coherence and provide more detailed responses compared to models with shorter context. What can I use it for? The WizardLM-13B-V1-0-Uncensored-SuperHOT-8K-GPTQ model can be used for a wide range of natural language processing tasks, such as chatbots, content generation, question answering, and creative writing. The increased context size makes it well-suited for applications that require longer-form, coherent responses. Things to try One interesting aspect of the WizardLM-13B-V1-0-Uncensored-SuperHOT-8K-GPTQ model is its ability to maintain context and narrative structure over longer text generation. Try providing the model with a multi-sentence prompt and see how it continues the story or expands on the initial ideas. The model's large knowledge base and generation capabilities make it well-suited for collaborative storytelling or worldbuilding exercises.

Updated Invalid Date

Text-to-Text

🧠

WizardLM-33B-V1-0-Uncensored-SuperHOT-8K-GPTQ

TheBloke

The WizardLM-33B-V1-0-Uncensored-SuperHOT-8K-GPTQ is a large language model created by TheBloke, a prominent AI researcher and model developer. This model is a variant of the WizardLM-33B model, which has been merged with Kaio Ken's SuperHOT 8K system to extend the context length to 8192 tokens. The model has been quantized to 4-bit precision using GPTQ, resulting in a more compact and efficient model for inference on GPU hardware. Similar models available from TheBloke include the Wizard-Vicuna-13B-Uncensored-SuperHOT-8K-GPTQ, which is a 13B version of the model with a similar architecture and capabilities, and the WizardLM-7B-uncensored-GPTQ and WizardLM-30B-Uncensored-GPTQ models, which are smaller and larger variants respectively. Model inputs and outputs Inputs Text prompts**: The model accepts free-form text prompts as input, which can be used to generate continuations, completions, or responses. Outputs Generated text**: The model outputs generated text, which can be used for a variety of applications such as content creation, dialogue generation, and language modeling. Capabilities The WizardLM-33B-V1-0-Uncensored-SuperHOT-8K-GPTQ model demonstrates impressive language generation capabilities, with the ability to produce coherent and contextually relevant text. The extended 8192 token context length allows the model to maintain continuity and coherence over longer stretches of text, making it particularly well-suited for applications that require sustained dialogue or narrative generation. What can I use it for? This model can be used for a wide range of language-based applications, such as: Content creation**: The model can be used to generate articles, stories, scripts, or other forms of written content. Dialogue systems**: The extended context length makes this model well-suited for building more natural and contextual chatbots or virtual assistants. Summarization**: The model can be used to generate concise summaries of longer text passages. Question answering**: The model can be used to answer questions based on the provided context. Potential commercial applications for this model include creative content generation, customer service automation, and research and development in natural language processing. Things to try One interesting aspect of this model is its ability to maintain coherence and continuity over longer stretches of text, thanks to the extended 8192 token context length. You could try providing the model with a complex or multi-part prompt, and observe how it is able to build upon and expand the initial context to generate a cohesive and engaging response. Another interesting direction to explore would be fine-tuning or further training the model on specialized datasets, in order to adapt its capabilities to more specific use cases or domains. This could involve incorporating domain-specific knowledge or adjusting the model's tone, style, or behavior to better suit the intended application.

Updated Invalid Date

Text-to-Text

👁️

WizardLM-Uncensored-SuperCOT-StoryTelling-30B-SuperHOT-8K-GPTQ

TheBloke

The WizardLM-Uncensored-SuperCOT-StoryTelling-30B-GPTQ is an AI model created by TheBloke that combines the capabilities of several large language models. It is a 30 billion parameter model that has been trained on a diverse dataset to excel at language understanding, reasoning, and creative writing. Similar models include the WizardLM Uncensored SuperCOT Storytelling 30B - GPTQ and the WizardLM-33B-V1-0-Uncensored-SuperHOT-8K-GPTQ, which also leverage the SuperHOT technique to expand the context size. Model inputs and outputs The WizardLM-Uncensored-SuperCOT-StoryTelling-30B-GPTQ is a text-to-text model, meaning it takes in text prompts and generates coherent, contextual responses. Inputs Text prompts of varying lengths, from a few words to several paragraphs Outputs Fluent, human-like text responses that demonstrate strong language understanding, reasoning, and creative writing capabilities Capabilities The WizardLM-Uncensored-SuperCOT-StoryTelling-30B-GPTQ is a highly capable model that can engage in open-ended dialogue, answer questions, and generate creative content like stories and worldbuilding. It has been trained to have in-depth knowledge on a wide range of topics and to provide thoughtful, nuanced responses. What can I use it for? The model's versatility makes it useful for a variety of applications, such as: Chatbots and virtual assistants that can engage in natural conversations Creative writing assistants to help generate stories, dialogue, and worldbuilding Question-answering systems that can provide detailed and informative responses Research and analysis tools that can draw insights from large amounts of text data Things to try An interesting aspect of the WizardLM-Uncensored-SuperCOT-StoryTelling-30B-GPTQ is its ability to generate highly detailed and imaginative responses when prompted with open-ended creative writing tasks. For example, you could try giving it a simple prompt like "Describe a fantasy world" and see the rich, evocative description it produces.

Updated Invalid Date

Text-to-Text

🤷

Pygmalion-13B-SuperHOT-8K-GPTQ

TheBloke

The Pygmalion-13B-SuperHOT-8K-GPTQ model is a merge of TehVenom's Pygmalion 13B and Kaio Ken's SuperHOT 8K, quantized to 4-bit using GPTQ-for-LLaMa. It offers up to 8K context size, which has been tested to work with ExLlama and text-generation-webui. Similar models include the Wizard-Vicuna-13B-Uncensored-SuperHOT-8K-GPTQ, which combines Eric Hartford's Wizard Vicuna 13B Uncensored with Kaio Ken's SuperHOT 8K, and the Llama-2-13B-GPTQ and Llama-2-7B-GPTQ models, which are GPTQ versions of Meta's Llama 2 models. Model inputs and outputs Inputs The model accepts natural language text as input. Outputs The model generates natural language text as output. Capabilities The Pygmalion-13B-SuperHOT-8K-GPTQ model is capable of engaging in open-ended conversations and generating coherent and contextual text. Its extended 8K context size allows it to maintain continuity and coherence over longer passages of text. What can I use it for? This model could be used for a variety of natural language processing tasks, such as: Open-ended chatbots and assistants**: The model's capabilities make it well-suited for building conversational AI assistants that can engage in open-ended dialogue. Content generation**: The model could be used to generate text for creative writing, storytelling, and other content creation purposes. Question answering and knowledge retrieval**: With its large knowledge base, the model could be used to answer questions and retrieve information on a wide range of topics. Things to try One key aspect of this model is its ability to maintain coherence and context over longer passages of text due to the increased 8K context size. This could be particularly useful for applications that require a strong sense of narrative or conversational flow, such as interactive fiction, roleplaying, or virtual assistants. Developers could explore ways to leverage this extended context to create more immersive and coherent experiences for users, such as by allowing the model to maintain character personalities, world-building details, and the progression of a storyline over longer interactions.

Updated Invalid Date

Text-to-Image