Triplex

Maintainer: SciPhi

Total Score: 219

Last updated 8/23/2024


  • Run this model: Run on HuggingFace
  • API spec: View on HuggingFace
  • Github link: No Github link provided
  • Paper link: No paper link provided


Model overview

Triplex is a state-of-the-art large language model (LLM) developed by SciPhi.AI for the task of knowledge graph construction. It is a fine-tuned version of the Phi3-3.8B model that excels at extracting triplets (simple statements consisting of a subject, a predicate, and an object) from text or other data sources. Compared to GPT-4, Triplex constructs knowledge graphs at a 98% cost reduction while maintaining strong performance.

Unlike more expensive knowledge graph approaches like Microsoft's Graph RAG, Triplex enables local graph building at a fraction of the cost using SciPhi's R2R system. It outperforms GPT-4 on benchmark tasks related to knowledge graph construction, making it a compelling option for projects that require building knowledge graphs from unstructured data.

Model inputs and outputs

Inputs

  • Unstructured text data: Triplex takes in raw text as input and extracts knowledge graph triplets from it.
  • Entity types and predicates: The model also takes in a list of entity types and predicates that it should focus on when extracting triplets.

Outputs

  • Knowledge graph triplets: The main output of Triplex is a set of extracted triplets representing relationships between entities in the input text.
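To make the input/output shape concrete, here is a minimal sketch of how the entity types, predicates, and raw text might be combined into a single extraction prompt. The template below is an assumption for illustration only; the exact prompt format Triplex was fine-tuned on should be taken from the model card on HuggingFace.

```python
import json

def build_triplet_prompt(entity_types, predicates, text):
    """Combine the target schema (entity types and predicates) with the
    raw input text into one extraction prompt.

    NOTE: this template is a hypothetical sketch; consult the Triplex
    model card on HuggingFace for the format the model actually expects.
    """
    return (
        "Extract knowledge graph triplets (subject, predicate, object) "
        "from the text below.\n"
        f"Entity types: {json.dumps(entity_types)}\n"
        f"Predicates: {json.dumps(predicates)}\n"
        f"Text: {text}"
    )

prompt = build_triplet_prompt(
    entity_types=["PERSON", "ORGANIZATION", "LOCATION"],
    predicates=["WORKS_AT", "LOCATED_IN"],
    text="Ada Lovelace worked with Charles Babbage in London.",
)

# The resulting prompt would then be sent to the Triplex checkpoint,
# e.g. via transformers' AutoModelForCausalLM, and the model's response
# parsed into (subject, predicate, object) triplets.
```

Narrowing the schema to only the entity types and predicates you care about is what lets the extracted triplets stay focused on your use case.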

Capabilities

Triplex excels at the task of knowledge graph construction, outperforming GPT-4 while costing 1/60th as much. It is able to rapidly extract high-quality triplets from text, enabling users to build knowledge graphs at a fraction of the typical cost. This makes it a powerful tool for applications that require structured knowledge extracted from unstructured data sources.

What can I use it for?

Triplex is well-suited for any project that requires building knowledge graphs from text data. This could include applications in areas like:

  • Business intelligence: Extracting insights and relationships from corporate documents, reports, and other internal data sources.
  • Scientific research: Mapping out connections between concepts, entities, and findings in academic papers and other technical literature.
  • Public sector: Aggregating and structuring information from government reports, legislation, and other public documents.

The cost-effectiveness of Triplex makes it an appealing option for organizations that need to build knowledge graphs but have limited budgets or computational resources.

Things to try

One interesting aspect of Triplex is its ability to focus on specific entity types and predicates when extracting knowledge graph triplets. This allows users to tailor the model's output to their particular needs and use cases. For example, you could experiment with different sets of entity types and predicates to see how the extracted triplets vary, and then select the configuration that is most relevant for your project.

Another thing to try is using Triplex in conjunction with SciPhi's R2R system for local knowledge graph building. By leveraging R2R, you can quickly and efficiently construct knowledge graphs from text data without the need for expensive cloud-based infrastructure.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models


Phi-3-medium-128k-instruct

microsoft

Total Score: 295

The Phi-3-medium-128k-instruct is a 14B parameter, lightweight, state-of-the-art open model developed by Microsoft. It was trained on synthetic data and filtered publicly available websites, with a focus on high-quality and reasoning-dense properties. The model belongs to the Phi-3 family, which also includes Phi-3-mini-128k-instruct and Phi-3-mini-4k-instruct, differing in parameter size and context length. The model underwent a post-training process that incorporated supervised fine-tuning and direct preference optimization to enhance its instruction following and safety. When evaluated on benchmarks testing common sense, language understanding, math, code, long context, and logical reasoning, the Phi-3-medium-128k-instruct demonstrated robust and state-of-the-art performance among models of similar and larger sizes.

Model inputs and outputs

Inputs

  • Text: The Phi-3-medium-128k-instruct model is best suited for text-based prompts, particularly those using a chat format.

Outputs

  • Generated text: The model generates relevant and coherent text in response to the input prompt.

Capabilities

The Phi-3-medium-128k-instruct model showcases strong reasoning abilities across a variety of domains, including common sense, language understanding, mathematics, coding, and logical reasoning. For example, it can provide step-by-step solutions to math problems, generate code to implement algorithms, and engage in multi-turn conversations to demonstrate its understanding of complex topics.

What can I use it for?

The Phi-3-medium-128k-instruct model is intended for broad commercial and research use cases that require memory/compute-constrained environments, latency-bound scenarios, and strong reasoning capabilities. It can be used as a building block for developing generative AI-powered features, such as question-answering systems, code generation tools, and educational applications.

Things to try

One interesting aspect of the Phi-3-medium-128k-instruct model is its ability to handle long-form context. Try providing the model with a multi-paragraph prompt and see how it maintains coherence and relevance in its generated response. You can also experiment with using the model for specific tasks, such as translating technical jargon into plain language or generating step-by-step explanations for complex concepts.



Phi-3-medium-4k-instruct

microsoft

Total Score: 135

The Phi-3-medium-4k-instruct is a 14B parameter, lightweight, state-of-the-art open model trained by Microsoft. It is part of the Phi-3 family of models, which come in different sizes and context lengths, including the Phi-3-medium-128k-instruct variant with a 128k context length. The Phi-3 models have undergone a post-training process that incorporates both supervised fine-tuning and direct preference optimization to enhance their instruction-following capabilities and safety measures. When evaluated on benchmarks testing common sense, language understanding, math, code, long context, and logical reasoning, the Phi-3-medium-4k-instruct demonstrated robust and state-of-the-art performance compared to models of similar and larger size.

Model inputs and outputs

Inputs

  • Text: The model is best suited for text-based prompts, particularly in a conversational "chat" format.

Outputs

  • Generated text: The model outputs generated text in response to the input prompt.

Capabilities

The Phi-3-medium-4k-instruct model showcases strong reasoning and language understanding capabilities, particularly in areas like code, math, and logical reasoning. It can be a useful tool for building general-purpose AI systems and applications that require memory/compute-constrained environments, latency-bound scenarios, or advanced reasoning.

What can I use it for?

The Phi-3-medium-4k-instruct model can be leveraged for a variety of commercial and research use cases in English, such as powering AI assistants, generating content, and accelerating language model research. Its compact size and strong performance make it well-suited for applications with limited resources or low-latency requirements.

Things to try

One interesting aspect of the Phi-3 models is their focus on safety and alignment with human preferences. You could experiment with the model's ability to follow instructions and generate content that adheres to ethical guidelines. Additionally, its strong performance on code and math-related tasks suggests it could be a useful tool for building AI-powered programming and educational applications.



Phi-3-mini-128k-instruct

microsoft

Total Score: 1.3K

The phi-3-mini-128k-instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets. It is part of the Phi-3 family, which also includes the Phi-3-mini-4k-instruct model with a shorter context length. The Phi-3 models are designed to be efficient and effective, with a focus on reasoning capabilities like code, math, and logic.

Model inputs and outputs

The phi-3-mini-128k-instruct model takes text as input and generates text in response. It is best suited for prompts using a chat format, where the user provides a prompt and the model generates a relevant response.

Inputs

  • Prompt: The text prompt to send to the model.
  • Max Length: The maximum number of tokens to generate.
  • Temperature: Adjusts the randomness of the outputs, with higher values being more random.
  • Top K: Samples from the top K most likely tokens when decoding text.
  • Top P: Samples from the top P percentage of most likely tokens when decoding text.
  • Repetition Penalty: Penalty for repeated words in the generated text.
  • System Prompt: The system prompt provided to the model.
  • Seed: The seed for the random number generator.

Outputs

  • Generated Text: The text generated by the model in response to the input prompt.

Capabilities

The phi-3-mini-128k-instruct model has demonstrated robust and state-of-the-art performance on a variety of benchmarks, including common sense reasoning, language understanding, mathematics, coding, and logical reasoning. It is designed to be effective in memory/compute-constrained environments and latency-bound scenarios, while providing strong reasoning capabilities.

What can I use it for?

The phi-3-mini-128k-instruct model is intended for commercial and research use in English. It can be used as a building block for generative AI-powered features, such as chatbots, language-generation tools, and code assistants. The model's small size and strong reasoning abilities make it particularly well-suited for applications that require efficient and effective language processing.

Things to try

One interesting aspect of the phi-3-mini-128k-instruct model is its ability to follow instructions and adhere to safety measures. You could try prompting the model with tasks that require following specific instructions or navigating complex scenarios, and see how it responds. Additionally, you could experiment with using the model in combination with other AI tools or datasets to explore new and innovative applications.
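As a concrete sketch of the inputs listed above, the snippet below builds a chat-format request and a dictionary of the sampling parameters (temperature, top-k, top-p, repetition penalty, seed). The parameter values are illustrative, and the actual model load is left commented out because the checkpoint is a multi-gigabyte download.

```python
# Chat-format messages for an instruct model such as
# microsoft/Phi-3-mini-128k-instruct (standard transformers chat schema).
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain what a knowledge graph triplet is."},
]

# Sampling parameters mirroring the inputs listed above; values are
# illustrative, not recommendations.
gen_params = {
    "max_length": 256,         # maximum number of tokens to generate
    "temperature": 0.7,        # higher values -> more random outputs
    "top_k": 50,               # sample from the top-K most likely tokens
    "top_p": 0.95,             # nucleus (top-P) sampling threshold
    "repetition_penalty": 1.1, # penalize repeated words
    "seed": 42,                # seed for the random number generator
}

# Actual generation requires downloading the 3.8B-parameter checkpoint:
# from transformers import pipeline
# pipe = pipeline("text-generation",
#                 model="microsoft/Phi-3-mini-128k-instruct",
#                 trust_remote_code=True)
# out = pipe(messages, max_new_tokens=gen_params["max_length"],
#            temperature=gen_params["temperature"])
# print(out[0]["generated_text"])
```

Lower temperature and top-p values make the output more deterministic, which is usually what you want for tasks like structured extraction rather than creative writing.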



Phi-3-vision-128k-instruct

microsoft

Total Score: 741

Phi-3-vision-128k-instruct is a lightweight, state-of-the-art open multimodal model built upon datasets which include synthetic data and filtered publicly available websites, with a focus on very high-quality, reasoning-dense data for both text and vision. The model belongs to the Phi-3 model family, and the multimodal version supports a 128K context length (in tokens). The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning and direct preference optimization, to ensure precise instruction adherence and robust safety measures. Similar models in the Phi-3 family include the Phi-3-mini-128k-instruct and Phi-3-mini-4k-instruct. These models have fewer parameters (3.8B) than the full Phi-3-vision-128k-instruct but share the same training approach and underlying architecture.

Model inputs and outputs

Inputs

  • Text: The model accepts text input and is best suited for prompts using a chat format.
  • Images: The model can process visual inputs in addition to text.

Outputs

  • Generated text: The model generates text in response to the input, aiming to provide safe, ethical, and accurate information.

Capabilities

The Phi-3-vision-128k-instruct model is designed for broad commercial and research use, with capabilities that include general image understanding, OCR, and chart and table understanding. It can be used to accelerate research on efficient language and multimodal models, and as a building block for generative AI-powered features.

What can I use it for?

The Phi-3-vision-128k-instruct model is well-suited for applications that require memory/compute-constrained environments, latency-bound scenarios, or general image and text understanding. Example use cases include:

  • Visual question answering: Given an image and a text question about the image, the model can generate a relevant response.
  • Image captioning: The model can generate captions describing the contents of an image.
  • Multimodal task automation: Combining text and image inputs, the model can be used to automate tasks like form filling, document processing, or data extraction.

Things to try

To get a sense of the model's capabilities, you can try prompting it with a variety of multimodal tasks, such as:

  • Asking it to describe the contents of an image in detail
  • Posing questions about the objects, people, or activities depicted in an image
  • Requesting a summary of the key information from a document containing both text and figures/tables
  • Asking it to generate steps for a visual instruction manual or recipe

The model's robust reasoning abilities, combined with its understanding of both text and vision, make it a powerful tool for tackling a wide range of multimodal challenges.
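For a visual question answering prompt, the sketch below shows the image-placeholder convention used by the Phi-3-vision model card, where `<|image_1|>` marks the position of the first image; treat the exact token format as something to verify against the current documentation. The inference calls are commented out since they require a large checkpoint download.

```python
# Visual question answering prompt for a model such as
# microsoft/Phi-3-vision-128k-instruct. The <|image_1|> placeholder marks
# where the image is injected; verify this convention against the model card.
question = "What trend does the chart in this image show?"
prompt = f"<|user|>\n<|image_1|>\n{question}<|end|>\n<|assistant|>\n"

# Actual inference (large download, GPU recommended):
# from transformers import AutoModelForCausalLM, AutoProcessor
# from PIL import Image
# model_id = "microsoft/Phi-3-vision-128k-instruct"
# processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
# model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
# inputs = processor(prompt, images=[Image.open("chart.png")],
#                    return_tensors="pt")
# out = model.generate(**inputs, max_new_tokens=128)
```

The same prompt shape extends to image captioning or document understanding by swapping the question text.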
