Entry Trinity Mini with an API

January 28, 2026

12

Find out how to Entry Arcee Trinity Mini through API

TL;DR

Arcee Trinity Mini is a complicated AI mannequin designed to ship sturdy reasoning, coding, and math capabilities whereas being environment friendly with computing assets. It makes use of a mixture-of-experts structure, activating solely about 3 billion of its 26 billion parameters for every process. This strategy makes it sooner and less expensive to run than many bigger fashions.

You may run Trinity Mini instantly on Clarifai utilizing the Playground for fast exams and experimentation or entry the mannequin via Clarifai’s OpenAI-compatible API for seamless integration into your purposes and workflows.

Introduction

After we consider reasoning fashions, top-tier fashions like OpenAI GPT-5.2 and Google Gemini 3 Professional normally come to thoughts. Nonetheless, open-weight fashions provide comparable efficiency whereas giving builders larger management and customization choices.

One such mannequin is Arcee Trinity Mini, a U.S.-built, open-weight mannequin from Arcee AI designed particularly for real-world manufacturing workflows. It excels at multi-step reasoning, coding, and producing structured outputs, making it a superb selection for purposes requiring precision and effectivity.

On this information, you’ll learn the way Trinity Mini works, the right way to entry it through API via Clarifai and the right way to begin utilizing it in your personal software.

What’s Arcee Trinity Mini?

Arcee Trinity Mini is a robust open‑weight language mannequin developed by Arcee AI. It’s a part of the Trinity household of fashions which are constructed for actual‑world purposes resembling multi‑flip conversations, instrument use, structured outputs, and reasoning duties. Trinity Mini is designed to carry out reliably in manufacturing environments, whether or not you run it within the cloud, on‑premises, or via a hosted API. Its constant capabilities make it a robust selection for builders and groups aiming to construct superior AI techniques with predictable efficiency.

Whereas main closed fashions usually dominate the highlight, Trinity Mini offers an open‑weight different that gives builders extra management and suppleness. It allows you to tailor the mannequin in your workflows with out being locked into proprietary ecosystems.

Key Options and Advantages

Trinity Mini fills a rising want for environment friendly and customizable fashions that may be deployed at scale. Listed below are the important thing options that make it worthwhile for each builders and companies:

Multi-step Reasoning and Software Orchestration
Trinity Mini is constructed to handle advanced duties that require a number of reasoning steps and interplay with exterior instruments. This makes it best for constructing agent pipelines the place the mannequin must carry out sequences of actions, resembling querying databases, calling APIs, or producing code dynamically.

Lengthy Context Assist (128K Tokens)
The mannequin helps a context window of as much as 128,000 tokens. This permits it to take care of continuity over lengthy paperwork, multi-turn conversations, or detailed workflows with out dropping observe of related info. Such prolonged context capabilities are worthwhile to be used instances like authorized doc overview, analysis summaries, or any situation that calls for deep understanding over prolonged inputs.

Structured Output with JSON Schema Enforcement
Trinity Mini enforces output codecs via native JSON schema adherence. This implies the responses conform to predefined constructions, minimizing the necessity for advanced parsing or error dealing with on the consumer facet. This function is important for integrating the mannequin’s output instantly into automated techniques and pipelines, bettering reliability and decreasing growth overhead.

Environment friendly Efficiency and Throughput
Because of its sparse Combination-of-Specialists (MoE) structure, Trinity Mini prompts solely a fraction of its whole parameters per token, permitting it to ship reasoning energy akin to a lot bigger dense fashions at a fraction of the compute price. This design permits it to deal with tons of of API requests per second on a single Nvidia A100 GPU, supporting scalable and cost-effective deployment in manufacturing environments.

Accessing Arcee Trinity Mini through Clarifai

Stipulations

Getting began with Arcee Trinity Mini via the Clarifai API is easy. Observe these steps to arrange your surroundings and authenticate.

Clarifai Account: Join at clarifai.com to realize entry to the platform’s AI fashions.
Private Entry Token (PAT): You want a PAT to authenticate your API requests. Get one by navigating to Settings > Secrets and techniques in your Clarifai dashboard and creating or copying your token.
SDKs: Clarifai offers SDKs for Python and Node.js, and likewise helps OpenAI-compatible shoppers. For detailed directions and to put in different SDKs, go to the Clarifai Quickstart Information.
Authentication and Setup: To authenticate your API requests, set your Private Entry Token as an surroundings variable:

API Utilization

Right here’s the right way to make your first API name to the Arcee Trinity Mini mannequin utilizing totally different strategies.

Utilizing Python SDK:

Utilizing Node.js SDK:

Utilizing OpenAI-Suitable Python Shopper

Utilizing the Playground

For fast experimentation and validation, you need to use the Clarifai Playground to work together with Arcee Trinity Mini instantly within the browser. That is helpful for testing prompts, exploring mannequin habits, and verifying outputs with out writing any code.

Screenshot 2026-01-26 at 2.48.46 PM

Benchmark Efficiency of Trinity Mini

Arcee Trinity Mini delivers spectacular reasoning and tool-calling capabilities whereas sustaining excessive effectivity. Right here’s the way it performs throughout a number of difficult benchmarks:

Reasoning Accuracy

MMLU (Zero-Shot): Trinity Mini scores 84.95% throughout 57 topics, together with math, legislation, and science, demonstrating sturdy basic information and reasoning abilities with out task-specific coaching.
Math-500: It achieves 92.10% on this superior math reasoning benchmark, exhibiting strong proficiency in advanced calculations and problem-solving.
GPQA-Diamond: On graduate-level science questions, Trinity Mini reaches 58.55%, reflecting its capability to deal with specialised and technical content material.

Software Calling and Structured Output

BFCL v3 (Operate Calling): With 59.67%, Trinity Mini reliably generates responses that strictly adhere to JSON schema necessities, making it best for agent workflows that depend upon structured information.
MUSR (Multi-Step Reasoning): The mannequin attains 63.49% accuracy on duties requiring sequential, logical steps, highlighting its multi-turn reasoning power.

Throughput and Scalability

Processes over 200 tokens per second on a single A100 GPU utilizing bfloat16 precision.
Prompts solely about 3 billion parameters per token, in comparison with 8–14 billion for related dense fashions, leading to vital compute financial savings.
Helps an prolonged 128,000-token context window with out the reminiscence overhead usually related to lengthy contexts, enabling strong understanding of huge paperwork or conversations.

Benchmark Comparability Desk

Benchmark	Trinity Mini	LLaMA-3.1-8B	Qwen-2.5-7B	Mistral-class	Gemini-class
SimpleQA	8.90	9.10	6.50	10.70	—
MUSR	63.49	64.40	64.47	56.30	—
MMLU (Zero-Shot)	84.95	87.26	85.58	82.30	83.02
Math-500	92.10	95.00	90.20	87.40	95.80
GPQA-Diamond	58.55	70.05	65.40	55.00	60.91
BFCL v3	59.67	53.01	—	48.25	—

Purposes and Use Circumstances

Arcee Trinity Mini is nicely suited to a variety of real-world purposes the place reasoning high quality, lengthy context dealing with, and structured outputs are important.

Conversational AI Purposes

Trinity Mini can energy conversational techniques that transcend easy query answering. Its capability to take care of lengthy context makes it best for multi-turn buyer assist chatbots that want to recollect prior messages, person preferences, or earlier troubleshooting steps. It additionally works nicely for digital assistants that combine with instruments or APIs, resembling fetching information, triggering actions, or returning structured responses. As well as, the mannequin can assist interactive documentation or information base experiences, the place customers discover technical content material via pure language conversations.

Agentic Workflows

For agent-based techniques, Trinity Mini offers sturdy multi-step reasoning and dependable instrument calling. This allows agent workflows that plan actions, invoke exterior instruments, and refine outcomes over a number of steps. It’s significantly helpful for workflow automation, the place the mannequin generates structured outputs that downstream techniques can eat with out additional parsing. Trinity Mini additionally suits naturally into retrieval-augmented technology (RAG) pipelines, the place its prolonged context window permits it to cause over giant retrieved paperwork whereas sustaining coherence.

Enterprise Integration

In enterprise environments, Trinity Mini provides an environment friendly path to manufacturing deployment. Its efficiency traits make it appropriate for cost-conscious, high-throughput purposes accessed via APIs. Groups can use it to construct inner instruments with pure language interfaces, permitting workers to question techniques or generate insights with out specialised coaching. The mannequin can also be nicely suited to doc evaluation and processing pipelines, the place its 128K context assist permits it to deal with lengthy stories, contracts, or technical paperwork in a single move.

Conclusion

Arcee Trinity Mini provides a robust mixture of environment friendly structure, superior reasoning capabilities, and assist for long-context understanding. It is a superb selection for builders and companies seeking to construct subtle AI purposes. Its sparse mixture-of-experts design delivers excessive efficiency on difficult benchmarks whereas preserving compute prices manageable. With native assist for structured outputs and performance calling, Trinity Mini suits naturally into agent workflows, conversational AI, and complicated doc processing pipelines.

By accessing Trinity Mini via Clarifai’s strong API, you’ll be able to rapidly combine these capabilities into your tasks, whether or not you’re constructing chatbots, automation techniques, or information evaluation instruments. Begin experimenting immediately within the Clarifai Playground or dive straight into API integration to unlock the total potential of this versatile mannequin.

To be taught extra and get began: