Changelog

Type

April 2026

Feature

Models Added

Add Kimi K2.6

Kimi K2.6

Kimi K2.6

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, UI/UX generation, and multi-agent orchestration. It handles complex end-to-end development tasks across languages such as Python, Rust, and Go, and can transform prompts and visual inputs into production-ready interfaces.

Powered by a scalable agent swarm architecture, K2.6 can coordinate hundreds of parallel sub-agents for autonomous task decomposition, enabling the generation of documents, websites, and spreadsheets in a single run without human intervention.

Enjoy it.

Read more

Feature

Feature Added

Skills & MCP Server — `@apertis/mcp-server` v0.3.0

A Model Context Protocol server that lets any MCP-compatible AI assistant call Apertis directly. No setup, no wrapper code — just install once.

Install (Claude Code):

claude mcp add apertis -- npx -y @apertis/mcp-server

Nine tools shipped:

| Tool | What it does | |------|--------------| | list_models | List models with optional free/paid + capability filters | | get_model_info | Detailed info for a specific model (pricing, context, provider) | | compare_models | Side-by-side comparison of 2–5 models | | check_quota | Account balance, subscription status, remaining quota | | get_usage_stats | Usage by model and period (today / week / month) | | list_api_keys | List your keys (masked) with status and quota | | create_api_key | Create a new key with an optional quota limit | | suggest_model | Freeform keyword search over the full catalog | | recommend_model | Curated Apertis pick for a task type with live pricing *(new in v0.3.0)* |

→ Guide: docs.apertis.ai/api/sdks/mcp-server → npm: `@apertis/mcp-server`

Agent Skills — one-command install for 45+ AI tools

Three curated skills that teach your AI assistant how to use Apertis correctly. Install once, works everywhere.

npx skills add theQuert/apertis-skills

Compatible with Claude Code, Cursor, GitHub Copilot, Codex, Gemini CLI, and 45+ other AI coding tools.

| Skill | What your agent learns | |-------|------------------------| | apertis-api | Auth, endpoints, :web suffix, MCP reference — the complete API surface | | apertis-model-picker | Opinionated model picks by task type with reasoning | | apertis-migrate | One-line swap from OpenAI SDK to Apertis |

→ Source: github.com/theQuert/apertis-skills

GET /v1/recommend — dynamic model selection endpoint

Ask Apertis what to use for a task and get back the curated pick with live pricing. Recommendations update as models are added, retired, or re-priced — your code stays the same.

curl "https://api.apertis.ai/v1/recommend?task=coding&budget=medium" \
  -H "Authorization: Bearer $APERTIS_API_KEY"

Task types: coding, long-context, fast-chat, reasoning, vision Budget tiers: low, medium (default), high

Response shape:

{
  "model": "claude-sonnet-4-6",
  "pricing": { "input_per_1m": 2.40, "output_per_1m": 12.00 },
  "why": "Best coding ability per dollar. 200K context.",
  "alternatives": [
    { "model": "deepseek-v3", "note": "3x cheaper, good for simpler coding" },
    { "model": "claude-opus-4-6", "note": "most capable, higher cost" }
  ]
}

Use the returned model ID directly in your next /v1/chat/completions call.

→ Reference: docs.apertis.ai/api/utilities/recommend

---

Docs

  • New: @apertis/mcp-server SDK guide with recommend_model walkthrough
  • New: GET /v1/recommend endpoint reference with Python example
  • Updated: Cursor integration guide with new screenshots and apertis/ prefix convention
  • Updated: Ideas page now publicly browsable at docs.apertis.ai/help/ideas
  • Updated: Timeout documentation — X-Timeout header, 408 status semantics

---

Why this release

We kept seeing two questions in support:

1. *“Which model should I use?”* 2. *“How do I wire Apertis into my agent/IDE?”*

recommend_model + the skills + the MCP server answer both — without asking you to paste the same instructions into every new session. Your agent now picks the right model and knows how to call us, natively.

Read more

System

Subscription Quota Multiplier Update

Quota multiplier adjustments for Lite, Pro, and Plus plans — effective 2026-04-24 00:00 UTC

glm-5.1 multiplier raised on Lite/Pro/Plusclaude-opus-4-6 multiplier rebalanced (raised on Pro/Plus, lowered on Lite)gpt-5.4 multiplier raised on Pro/Pluscode:claude-opus-4-6 multiplier raised on Lite/Proclaude-opus-4-7 multiplier raised on Plusgemini-3-flash-preview & gemini-3.1-pro-preview multipliers adjusted on Lite

Subscription Quota Multiplier Update

Effective 2026-04-24 at 00:00 UTC, quota multipliers across Lite, Pro, and Plus plans will be adjusted in response to recent upstream AI provider pricing changes.

Why

  • Anthropic Claude (upstream routing channel) base rates have trended upward across multiple providers
  • Z.AI GLM-5.1 base rates have trended upward across multiple providers
  • OpenAI GPT-5.4 serving costs have increased on several routes
  • Additionally, claude-opus-4-6 on the Lite plan is being lowered because our review showed the previous multiplier was set higher than current real cost justifies

Changes

Lite Plan ($12/month, 600 quota per cycle)

  • glm-5.1: 0.51 → 1.5
  • code:claude-opus-4-6: 5.00 → 10.0
  • claude-opus-4-6: 5.00 → 2.0 (decrease)
  • gemini-3-flash-preview: 0.10 → 0.2
  • gemini-3.1-pro-preview: 0.43 → 0.77

Pro Plan ($25/month, 900 quota per cycle)

  • glm-5.1: 0.37 → 1.5
  • claude-opus-4-6: 1.00 → 1.5
  • gpt-5.4: 0.62 → 1.0
  • code:claude-opus-4-6: 0.75 → 1.5

Plus Plan ($60/month, 1,500 quota per cycle)

  • claude-opus-4-6: 0.75 → 1.5
  • glm-5.1: 0.25 → 1.0
  • gpt-5.4: 0.45 → 0.8
  • claude-opus-4-7: 3.00 → 4.0

What stays the same

  • Monthly subscription fee
  • Billing cycle and quota allowance per plan
  • Model access and plan tiers
  • Pay-As-You-Go (PAYG) fallback behavior

Your options

If you do not agree with the changes, you may cancel your subscription at any time before 2026-04-24 00:00 UTC at https://apertis.ai/setting. Affected users will also be notified by email.

For questions, contact us at [email protected].

Read more

Feature

Models Added

Add Claude Opus 4.7

Claude Opus 4.7

Claude Opus 4.7

Opus 4.7 is the next generation of Anthropic's Opus family, designed for long-running, asynchronous agent workflows. Building on Opus 4.6, it delivers stronger performance on complex, multi-step tasks and more reliable execution across extended pipelines such as large codebases, multi-stage debugging, and end-to-end project orchestration. Beyond coding, Opus 4.7 enhances knowledge work capabilities, including document drafting, presentation creation, and data analysis. With strong coherence over long outputs and sessions, it is well suited for tasks requiring persistence, judgment, and sustained execution.

Enjoy it.

Read more

Feature

Feature Added

Fallback Timeout Setting for Coding Plan Users

Coding Plan subscribers can now configure the Fallback Timeout directly from the dashboard.

What's new

When Apertis routes your request to an upstream provider, it waits a set amount of time before switching to the next available channel. Previously this was fixed at 30 seconds — fine for most models, but too short for preview and reasoning models processing large context windows.

You can now adjust this in Settings → Subscription Keys → Fallback Timeout (range: 5s–300s).

Who should change this

  • Using gemini-3-flash-preview, claude-opus-4-thinking, or other preview/reasoning models with large prompts? Increase to 120s+
  • Using standard models like gpt-4o, claude-sonnet-4? Default 30s is fine

How it works

1. Go to Settings → Subscription Keys 2. Find Fallback Timeout in the metadata section 3. Enter your preferred value in milliseconds (e.g., 120000 for 120s) 4. Click Save

Changes take effect immediately.

Read more