Phi-3-mini-128k-instruct-onnx

159

Last updated 5/28/2024

📶

Property	Value
Run this model	Run on HuggingFace
API spec	View on HuggingFace
Github link	No Github link provided
Paper link	No paper link provided

Create account to get full access

Model overview

The Phi-3-mini-128k-instruct-onnx is a lightweight, state-of-the-art open model developed by Microsoft. It belongs to the Phi-3 model family, which was trained on synthetic data and filtered websites with a focus on high-quality, reasoning-dense data. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.

Compared to other similar models, the Phi-3-mini-128k-instruct-onnx is optimized for acceleration with ONNX Runtime, allowing it to run efficiently on a variety of hardware, including CPU, GPU, and mobile devices. This makes it well-suited for memory and compute-constrained environments, as well as latency-bound scenarios. Additionally, the model has demonstrated strong reasoning capabilities, especially in areas like code, math, and logic.

Model inputs and outputs

Inputs

Text: The Phi-3-mini-128k-instruct-onnx model accepts text as input, and it is best suited for prompts using the chat format.

Outputs

Generated text: The model generates text in response to the input, with the goal of following instructions and providing safe, ethical, and accurate information.

Capabilities

The Phi-3-mini-128k-instruct-onnx model has been trained to excel at a variety of tasks, including question answering, code generation, and logical reasoning. For example, when prompted to explain the Fermi paradox, the model provides a concise and informative response, highlighting the key ideas behind this intriguing cosmic puzzle.

What can I use it for?

The Phi-3-mini-128k-instruct-onnx model is well-suited for a range of applications that require strong reasoning capabilities, such as research on language and multimodal models, or the development of generative AI features. The model's optimization for ONNX Runtime also makes it a good choice for use cases that require efficient inference on a variety of hardware platforms, including server, desktop, and mobile environments.

Things to try

One interesting thing to try with the Phi-3-mini-128k-instruct-onnx model is to explore its ability to generate code snippets. While the model has been trained on a range of data sources, including common programming languages and libraries, it's important to carefully validate any generated code before using it in production, as the model may produce inaccurate or unsafe output. Additionally, you could experiment with prompting the model to perform more complex logical reasoning tasks, such as solving mathematical problems or analyzing ethical dilemmas, to see how it responds.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models

✨

Phi-3-mini-4k-instruct-onnx

microsoft

Phi-3-mini-4k-instruct-onnx is a lightweight, state-of-the-art AI model developed by Microsoft that is optimized for inference with ONNX Runtime. It is part of the Phi-3 model family, which includes both 4K and 128K variants. The model was trained on a combination of synthetic data, filtered websites, and high-quality chat format data, undergoing a rigorous enhancement process to ensure precise instruction adherence and robust safety measures. The optimized Phi-3-mini-4k-instruct-onnx model is published in ONNX format to enable accelerated inference on a variety of hardware, including CPU, GPU, and mobile devices. It supports DirectML for hardware acceleration on Windows devices, and can run on different NVIDIA GPU architectures using CUDA. There are also INT4-quantized versions available for improved performance on CPUs and mobile devices. Similar models in the Phi-3 family include the Phi-3-mini-128k-instruct-onnx and the Phi-3-mini-4k-instruct models, which offer different context length support. Model Inputs and Outputs Inputs Text**: The Phi-3-mini-4k-instruct-onnx model is best suited for prompts using a chat format, where the input is formatted as a question or instruction. Outputs Generated Text**: The model generates text in response to the input prompt, following the instruction or answering the question. Capabilities The Phi-3-mini-4k-instruct-onnx model has been trained to demonstrate strong reasoning abilities, including common sense reasoning, logical reasoning, and following instructions precisely. It has been evaluated on a variety of benchmarks, such as MMLU, HellaSwag, and TruthfulQA, where it has shown state-of-the-art performance compared to other models of similar size. What Can I Use It For? The Phi-3-mini-4k-instruct-onnx model is well-suited for use cases that require a lightweight, high-performance model with robust reasoning capabilities. Some potential applications include: Memory/compute-constrained environments**: The model's small size and optimized ONNX format make it suitable for deployment on devices with limited resources, such as mobile phones or edge devices. Latency-bound scenarios**: The model's optimized inference performance can be beneficial in applications that require fast responses, such as chatbots or virtual assistants. Applications requiring strong reasoning**: The model's strong performance on benchmarks testing common sense, math, coding, and logical reasoning makes it a good choice for applications that require these capabilities, such as educational tools or coding assistants. Microsoft has also provided ONNX Runtime integration and support, making it easier to deploy the Phi-3-mini-4k-instruct-onnx model across a range of platforms and hardware. Things to Try One interesting aspect of the Phi-3-mini-4k-instruct-onnx model is its support for different precision levels, including INT4 quantization for improved performance on CPUs and mobile devices. You could try experimenting with these different model configurations to see how they perform on your specific use case and hardware. Additionally, the model's strong reasoning capabilities could be useful for building educational or productivity-focused applications, where users can interact with the model to get assistance with tasks like math, coding, or general knowledge questions. You could explore ways to leverage the model's strengths in these areas. Finally, the availability of the model in ONNX format and the provided ONNX Runtime integration opens up opportunities for cross-platform deployment and hardware acceleration. You could investigate how to take advantage of these features to optimize the model's performance and deployment for your target platforms and devices.

Updated Invalid Date

Text-to-Text

🛠️

Phi-3-small-128k-instruct

microsoft

116

The Phi-3-small-128k-instruct is a 7B parameter, lightweight, state-of-the-art open model trained by Microsoft. It belongs to the Phi-3 family of models, which includes variants with different context lengths such as the Phi-3-small-8k-instruct and Phi-3-mini-128k-instruct. The model was trained on a combination of synthetic data and filtered publicly available websites, with a focus on high-quality and reasoning-dense properties. After initial training, the model underwent a post-training process that incorporated both supervised fine-tuning and direct preference optimization to enhance its ability to follow instructions and adhere to safety measures. When evaluated against benchmarks testing common sense, language understanding, math, code, long context and logical reasoning, the Phi-3-small-128k-instruct demonstrated robust and state-of-the-art performance among models of the same size and next size up. Model inputs and outputs Inputs Text**: The Phi-3-small-128k-instruct model is best suited for prompts using the chat format, where the input is provided as text. Outputs Generated text**: The model generates text in response to the input prompt. Capabilities The Phi-3-small-128k-instruct model showcases strong reasoning abilities, particularly in areas like code, math, and logic. It performs well on benchmarks evaluating common sense, language understanding, and logical reasoning. The model is also designed to be lightweight and efficient, making it suitable for memory/compute-constrained environments and latency-bound scenarios. What can I use it for? The Phi-3-small-128k-instruct model is intended for broad commercial and research use in English. It can be used as a building block for general-purpose AI systems and applications that require strong reasoning capabilities, such as: Memory/compute-constrained environments Latency-bound scenarios AI systems that need to excel at tasks like coding, math, and logical reasoning Microsoft has also released other models in the Phi-3 family, such as the Phi-3-mini-128k-instruct and Phi-3-medium-128k-instruct, which may be better suited for different use cases based on their size and capabilities. Things to try One interesting aspect of the Phi-3-small-128k-instruct model is its strong performance on benchmarks evaluating logical reasoning and math skills. Developers could explore using this model as a foundation for building AI systems that need to tackle complex logical or mathematical problems, such as automated theorem proving, symbolic reasoning, or advanced question-answering. Another area to explore is the model's ability to follow instructions and adhere to safety guidelines. Developers could investigate how the model's instruction-following and safety-conscious capabilities could be leveraged in applications that require reliable and trustworthy AI assistants, such as in customer service, education, or sensitive domains.

Updated Invalid Date

Text-to-Text

🚀

Phi-3-mini-4k-instruct

microsoft

603

The phi-3-mini-4k-instruct is a 3.8B parameter, lightweight, state-of-the-art open model trained with the Phi-3 datasets, as described by the maintainer. It is part of the Phi-3 family of models, which includes other variants like the phi-3-mini-128k-instruct and phi-3-mini-128k-instruct that differ in their context length. The Phi-3 models are designed to be high-performing yet memory/compute-constrained, making them suitable for latency-bound scenarios and environments with limited resources. Model inputs and outputs The phi-3-mini-4k-instruct model takes text as input and generates text as output. It is particularly well-suited for prompts using a chat format, where the input is structured as a conversation between a user and an assistant. Inputs Prompt**: The text that the model will use to generate a response. System Prompt**: An optional system prompt that helps guide the model's behavior, such as instructing it to act as a helpful assistant. Additional parameters**: The model also accepts various parameters to control the generation process, such as temperature, top-k and top-p filtering, and stopping sequences. Outputs Generated Text**: The model's response to the provided prompt, which can be a continuation of the conversation, an answer to a question, or a generated piece of text. Capabilities The phi-3-mini-4k-instruct model has been fine-tuned to excel at tasks that require strong reasoning abilities, such as common sense reasoning, language understanding, math, coding, and logical reasoning. When evaluated on a range of benchmarks, the model has demonstrated state-of-the-art performance among models with less than 13 billion parameters. What can I use it for? The phi-3-mini-4k-instruct model is intended for a variety of commercial and research use cases in English, particularly those that require memory or compute-constrained environments, such as mobile applications, or latency-bound scenarios. It can be used as a building block for developing generative AI-powered features, such as chatbots, question-answering systems, and code generation tools. Things to try One interesting aspect of the phi-3-mini-4k-instruct model is its ability to engage in multi-turn conversations using the provided chat format. You can try prompting the model with a series of related questions or tasks and observe how it maintains context and generates coherent responses. Additionally, the model's strong performance on tasks like math and coding make it a compelling choice for developing educational or productivity-focused applications.

Updated Invalid Date

Text-to-Text

🔍

Phi-3-small-8k-instruct

microsoft

108

The Phi-3-small-8k-instruct is a 7B parameter, lightweight, state-of-the-art open model from Microsoft. It is part of the Phi-3 family of models, which includes variants with different context lengths - 8K and 128K. The Phi-3 models are trained on a combination of synthetic data and filtered public websites, with a focus on high-quality and reasoning-dense properties. The Phi-3-small-8k-instruct model has undergone a post-training process that incorporates both supervised fine-tuning and direct preference optimization to enhance its ability to follow instructions and adhere to safety measures. When evaluated on benchmarks testing common sense, language understanding, math, code, long context, and logical reasoning, the model demonstrated robust and state-of-the-art performance among models of similar size. Model inputs and outputs Inputs Text prompts, best suited for the chat format Outputs Generated text responses to the input prompts Capabilities The Phi-3-small-8k-instruct model excels at tasks that require strong reasoning, such as math, coding, and logical analysis. It can provide detailed and coherent responses across a wide range of topics. What can I use it for? The Phi-3-small-8k-instruct model is intended for broad commercial and research use in English. It can be used in general-purpose AI systems and applications that require memory/compute constrained environments, low-latency scenarios, or robust reasoning capabilities. The model can accelerate research on language and multimodal models, and serve as a building block for generative AI-powered features. Things to try One interesting aspect of the Phi-3-small-8k-instruct model is its ability to provide step-by-step explanations and solutions for math and coding problems. You can try prompting the model with math equations or coding challenges and observe how it breaks down the problem and walks through the solution. Another interesting area to explore is the model's language understanding and common sense reasoning capabilities. You can provide it with prompts that require an understanding of the physical world, social norms, or abstract concepts, and see how it responds.

Updated Invalid Date

Text-to-Text