Changelog

Type

June 2026

Feature

Models Added

Add GLM 5.2

GLM 5.2

GLM-5.2 is Z.AI's flagship model for long-horizon task execution, designed to handle complex, project-scale workflows with high reliability. Featuring a 1M-token context window, it can maintain and reason over extensive engineering context, enabling consistent execution across large, multi-stage tasks.

Optimized for end-to-end software development, GLM-5.2 follows engineering standards reliably and can manage the full workflow from requirements analysis and implementation to testing and multi-platform deployment, making it well suited for advanced coding agents and large-scale autonomous engineering projects.

Enjoy it.

Feature

Models Added

Add Kimi K2.7 Code

Kimi K2.7 Code

Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, designed for long-horizon software engineering and agentic development workflows. Built on a native multimodal Mixture-of-Experts (MoE) architecture, it supports text, image, and video inputs and operates exclusively in thinking mode, preserving reasoning across multi-turn interactions.

With approximately 1T total parameters and 32B activated per token, plus a 256K-token context window, K2.7 Code excels at end-to-end programming tasks, agentic task decomposition, repository-scale reasoning, and extended coding conversations, making it well suited for advanced coding agents and long-context development workflows.

Enjoy it.

Feature

Models Added

Add Claude Fable 5

Claude Fable 5

Claude Fable 5 is Anthropic's Mythos-class model, designed for autonomous knowledge work, coding, and long-running agentic workflows. It supports text, image, and file inputs with text output, includes reasoning capabilities, and features a 1M-token context window for handling complex, high-context tasks.

Optimized for asynchronous and long-horizon execution, Claude Fable 5 excels at end-to-end tasks that would typically require hours, days, or weeks of human effort. It combines strong reasoning, autonomous verification and self-correction loops, and robust safeguards, making it well suited for complex research, software engineering, and large-scale knowledge work.

Enjoy it.

Feature

Models Added

Add Nemotron 3 Ultra & Nemotron 3.5 Content Safety

Nemotron 3 UltraNemotron 3 Ultra (Free)Nemotron 3.5 Content Safety (Free)

Nemotron 3 Ultra

NVIDIA Nemotron 3 Ultra is an open frontier reasoning and orchestration model featuring a 550B-parameter Mixture-of-Experts (MoE) architecture with 55B active parameters per token. Built on a hybrid Transformer–Mamba design, it supports text input and output with a 1M-token context window, enabling large-scale reasoning and long-horizon task execution.

Optimized for agent orchestration, coding agents, deep research, and complex enterprise workflows, the model excels at multi-step reasoning, planning, and sustained execution. With high-throughput inference designed for large-scale agent pipelines, Nemotron 3 Ultra serves as a powerful foundation for advanced agentic AI systems.

Nemotron 3.5 Content Safety

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, designed for content moderation, safety classification, and AI policy enforcement. Supporting text and image inputs with text output, it evaluates both user prompts and model responses, providing safe/unsafe classifications, safety category labels, and optional reasoning traces.

Fine-tuned from Gemma-3-4B and supporting 12 languages with a 128K-token context window, the model is well suited for prompt moderation, response filtering, content classification, and enterprise safety pipelines. As part of the NVIDIA Nemotron family, it offers a configurable reasoning mode and integrates easily into agentic AI systems requiring robust guardrails and compliance controls.

Enjoy them.

Feature

Models Added

Add Qwen3.7 Plus

Qwen3.7 Plus

Qwen3.7-Plus is a cost-effective multimodal model in Alibaba's Qwen3.7 series, supporting text and image inputs with text output. It combines the series' strong language capabilities with significantly enhanced vision-language understanding, while retaining full-stack agent-level intelligence for coding, tool use, and productivity workflows.

Its standout capability is multimodal interactive agency—the ability to perceive real-world scenes, understand screens and graphical interfaces, generate code from visual references, and perform end-to-end navigation within applications. This makes Qwen3.7-Plus well suited for GUI automation, visual coding, productivity agents, and multimodal task execution.

Enjoy it.