Changelog

Type

January 2026

Fix

System Update

Optimization on API requests: 40-80% latency reduction

Before: 30-55ms After: 5-10ms (cache hits) / 15-25ms (cache misses)Estimated improvement: 40-80% latency reduction
  • Before: 30-55ms
  • After: 5-10ms (cache hits) / 15-25ms (cache misses)
  • Estimated improvement: 40-80% latency reduction
Read more

Fix

Feature Added

Python SDK: Add comprehensive SDK features (v0.2.0)

- Vision/Image support with create_with_image() convenience method- Audio input/output support- Video content support- Web Search with create_with_web_search() convenience method- Reasoning Mode support (glm-4.7)- Extended Thinking (Gemini) support

Add comprehensive SDK features (v0.2.0) New Features:

  • Vision/Image support with create_with_image() convenience method
  • Audio input/output support
  • Video content support
  • Web Search with create_with_web_search() convenience method
  • Reasoning Mode support (glm-4.7)
  • Extended Thinking (Gemini) support
  • Models API (list, retrieve)
  • Responses API (OpenAI format)
  • Messages API (Anthropic format)
  • Rerank API (BAAI/bge-reranker-v2-m3, Qwen models)
  • Helper utilities for base64 encoding

Includes 81 unit tests covering all new functionality.

PyPI Package: https://pypi.org/project/apertis/

Read more

Fix

Performance Improvement

Python SDK - Fix API response compatibility issues (v0.1.1)

Fix API response compatibility issues (v0.1.1)

  • Make id, object, created fields optional in ChatCompletion
  • Make id, object, created, model fields optional in ChatCompletionChunk
  • Make Usage fields optional to handle empty usage objects in streaming
  • Fix streaming by using httpx send() with stream=True instead of stream context manager

Tested with real API calls:

  • Chat completions: working
  • Streaming: working
  • Embeddings: working
  • Tool calling: working
Read more