Changelog

Type

June 2026

Feature

Models Added

Add MiniMax M3

MiniMax M3

MiniMax-M3 is a multimodal foundation model from MiniMax, supporting text, image, and video inputs with text output and a 1M-token context window. It is designed for long-horizon agentic workflows, coding, and tool-driven task execution, enabling sustained reasoning across complex tasks.

Built on MiniMax Sparse Attention (MSA), the model dramatically improves long-context efficiency by replacing full attention with KV-block selection, reducing compute costs at 1M-token contexts while maintaining strong performance. Trained as a native multimodal model and optimized for multi-turn, production-style collaboration, MiniMax-M3 excels at extended, multi-step workflows rather than single-turn interactions.

Enjoy it.

May 2026

Feature

Models Added

Add Claude Opus 4.8 Series

Claude Opus 4.8Claude Opus 4.8 (Fast)

Claude Opus 4.8

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family, designed for highly autonomous agents, long-horizon workflows, and advanced knowledge work. It supports text, image, and file inputs with text output, includes reasoning capabilities, and features a 1M-token context window for maintaining coherence across extended tasks and sessions.

The model excels at multi-step reasoning, complex coding, and end-to-end project orchestration, including large codebases, multi-stage debugging, and long-running asynchronous agent pipelines. Beyond software engineering, it is highly effective for document drafting, presentation creation, data analysis, and memory-driven workflows, delivering consistent quality across very long outputs and complex projects.

Enjoy it.

Feature

Models Added

Add Qwen3.7 Max

Qwen3.7 Max

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series, designed for agent-centric workloads with strong performance in coding, productivity, and long-horizon autonomous execution. It supports text input and output and delivers notable improvements in coding and agentic capabilities over previous Qwen generations.

Optimized for real-world workflows, the model also supports explicit prompt caching for efficient reuse of repeated context, making it well suited for scalable development, office automation, and advanced agent systems.

Enjoy it.

Feature

Models Added

Add Grok Build 0.1

Grok Build 0.1

Grok Build 0.1 is xAI's fast coding model designed specifically for agentic software engineering workflows. It supports text and image inputs with text output, and is optimized for interactive coding agents, tool use, and multi-step development tasks.

Powering the Grok Build CLI, the model features a 256K token context window with effectively no text output limit, making it well suited for long-horizon coding, automation, and continuous development workflows. Currently available in early access.

Enjoy it.

Feature

Feature Added

Apertis Coworker — delegate grunt work to cheaper models

The Apertis MCP server now ships a delegate coworker tool. Connect it to Claude Code (or any MCP client) and Claude can hand off routine, high-volume subtasks — bulk edits, boilerplate, repetitive lookups — to a cheaper model through your Apertis API key, while staying in control as the "manager."

What's new

`delegate` tool — Claude calls a single MCP tool to run a subtask on a

lower-cost model, then reviews the result. You keep premium models for reasoning and spend cheap tokens on the busywork.

Works with any MCP client — Claude Code, Claude Desktop, and other

MCP-compatible agents.

One install — available on npm as @apertis/mcp-server.

Get started

  npx @apertis/mcp-server

Add it to your MCP client config with your Apertis API key, and Claude can start delegating immediately.

See the official documentation here and source code.