Changelog

Type

March 2026

Feature

Models Added

Add GLM 5 Turbo

GLM 5 Turbo

GLM-5 Turbo is a high-performance model from Z.ai optimized for fast inference and agent-driven workflows. Designed for real-world environments such as OpenClaw scenarios, it delivers strong performance across long execution chains and complex task pipelines. The model features improved instruction decomposition, tool integration, scheduled and persistent execution, and enhanced stability for extended multi-step tasks, making it well suited for autonomous agents and production automation workflows.

Enjoy it!

Feature

Models Added

Add OpenRouter and NVIDIA models

Nemotron 3 Super (Free)Healer AlphaHunter Alpha

Nemotron 3 Super (Free)

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid Mixture-of-Experts model designed for complex multi-agent and long-horizon reasoning workflows. It activates only 12B parameters per token, enabling high compute efficiency while maintaining strong accuracy on advanced tasks. Built on a hybrid Mamba–Transformer MoE architecture with multi-token prediction (MTP), the model delivers significantly higher token generation throughput than leading open models.

Healer Alpha

Healer Alpha is a frontier omni-modal model that integrates vision, audio understanding, reasoning, and action capabilities within a single system. It can natively perceive visual and auditory inputs, reason across multiple modalities, and execute complex multi-step tasks with precision, enabling advanced real-world agentic applications. Note: Prompts and completions processed by this model are logged by the provider and may be used for model improvement.

Hunter Alpha

Hunter Alpha is a frontier intelligence model with over 1 trillion parameters and a 1M-token context window, designed specifically for agentic applications. It excels at long-horizon planning, complex reasoning, and sustained multi-step task execution, delivering strong reliability and precise instruction following for advanced agent frameworks such as OpenClaw. Note: Prompts and completions processed by this model are logged by the provider and may be used for model improvement.

These models are in free tier.

Enjoy it!

Feature

Models Added

Add Qwen3.5-9B & Seed-2.0-Lite

Qwen3.5-9BSeed-2.0-Lite

Qwen3.5-9B

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, built to deliver strong reasoning, coding, and visual understanding within an efficient 9B-parameter architecture. It adopts a unified vision-language design with early fusion of multimodal tokens, enabling the model to process and reason across text and images within the same context.

With balanced multimodal capability and efficient deployment requirements, Qwen3.5-9B is well suited for applications that combine visual analysis, coding assistance, and general reasoning.

Seed-2.0-Lite

Seed-2.0-Lite is a balanced model designed for high-frequency enterprise workloads, optimizing for both capability and cost efficiency. It surpasses the previous-generation Seed-1.8 in overall performance while maintaining stable, production-ready quality. The model supports long-context processing, multi-source information fusion, multi-step instruction execution, and high-fidelity structured outputs.

It is well suited for enterprise scenarios such as unstructured data processing, content generation, search and recommendation, and data analysis, delivering reliable results while significantly reducing operational cost.

Enjoy it!

Feature

Models Added

Add GPT-5.4 and GPT-5.4 Pro

GPT-5.4

GPT-5.4 is OpenAI's latest frontier model, unifying the GPT and Codex lines into a single system designed for both general intelligence and advanced software engineering workflows. It supports text and image inputs and features a 1M+ token context window (≈922K input, 128K output), enabling high-context reasoning, coding, and multimodal analysis within a single workflow.

The model delivers improved performance in coding, document understanding, tool use, and instruction following, and is designed as a strong default for complex tasks. It can generate production-quality code, synthesize information across large datasets, and execute multi-step workflows with fewer iterations and greater token efficiency.

GPT-5.4 Pro

GPT-5.4 Pro is OpenAI's most advanced model, built on the unified GPT-5.4 architecture with enhanced reasoning capabilities for complex and high-stakes tasks. It supports text and image inputs and features a 1M+ token context window (≈922K input, 128K output) for handling large-scale workflows and long-context analysis.

Optimized for step-by-step reasoning, instruction following, and accuracy, GPT-5.4 Pro excels in agentic coding, long-context problem solving, and complex multi-step workflows, making it well suited for advanced engineering, research, and high-reliability applications.

The discount prices will be updated in few days, stay tuned!

Enjoy it.

Feature

Models Added

✨ New Models Added to All Subscription Plans — Free & Unlimited

Hi there,

Great news! We've just added three new models to every Apertis subscription plan at no extra cost — with unlimited usage.

| Model | Highlights | |-------|-----------| | GLM 4.7 Flash | Zhipu's latest high-speed reasoning model — low latency, high throughput | | GPT-5.1 Codex (Mini) | OpenAI's lightweight code generation model — ideal for everyday dev tasks | | MiniMax M2.1 | MiniMax's next-gen general-purpose model — strong multilingual capabilities |

How to Get Started

These models are available immediately for all subscribers — no configuration changes needed. Simply use the model ID in your API calls:

model: "glm-4.7-flash" model: "gpt-5.1-codex-mini" model: "minimax-m2.1"

Why Free & Unlimited?

We continuously partner with leading AI providers to bring high-value models into your subscription. More models, same price — that's our commitment to maximizing the value of your plan.

Not a subscriber yet? **Explore our plans →**

Questions? Feel free to reach out anytime.

Happy Building.