CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-03-21 16:40:22 +00:00

Author	SHA1	Message	Date
Luis Pater	8822f20d17	feat(registry): add GPT 5.4 model definition to static data	2026-03-06 02:23:53 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
hkfires	134f41496d	fix(antigravity): update model configurations and add new models for Antigravity	2026-03-01 10:05:29 +08:00
Luis Pater	1ae994b4aa	fix(antigravity): adjust thinkingBudget default to 64000 and update model definitions for Claude	2026-03-01 09:39:39 +08:00
margbug01	8de0885b7d	fix: support thinking.type="auto" from Amp client for Antigravity Claude models ## Problem When using Antigravity Claude models through CLIProxyAPI, the thinking chain (reasoning content) does not display in the Amp client. ## Root Cause The Amp client sends `thinking: {"type": "auto"}` in its requests, but `ConvertClaudeRequestToAntigravity` only handled `"enabled"` and `"adaptive"` types in its switch statement. The `"auto"` type was silently ignored, resulting in no `thinkingConfig` being set in the translated Gemini request. Without `thinkingConfig`, the Antigravity API returns responses without any thinking content. Additionally, the Antigravity API for Claude models does not support `thinkingBudget: -1` (auto mode sentinel). It requires a concrete positive budget value. The fix uses 128000 as the budget for "auto" mode, which `ApplyThinking` will then normalize to stay within the model's actual limits (e.g., capped to `maxOutputTokens - 1`). ## Changes ### internal/translator/antigravity/claude/antigravity_claude_request.go 1. Add "auto" case to the thinking type switch statement. Sets `thinkingBudget: 128000` and `includeThoughts: true`. The budget is subsequently normalized by `ApplyThinking` based on model-specific limits. 2. Add "auto" to hasThinking check so that interleaved thinking hints are injected for tool-use scenarios when Amp sends `thinking.type="auto"`. ### internal/registry/model_definitions_static_data.go 3. Add Thinking configuration for `claude-sonnet-4-6`, `claude-sonnet-4-5`, and `claude-opus-4-6` in `GetAntigravityModelConfig()` -- these were previously missing, causing `ApplyThinking` to skip thinking config entirely. ## Testing - Deployed to Railway test instance (cpa-thinking-test) - Verified via debug logging that: - Amp sends `thinking: {"type": "auto"}` - CPA now translates this to `thinkingConfig: {thinkingBudget: 128000, includeThoughts: true}` - `ApplyThinking` normalizes the budget to model-specific limits - Antigravity API receives the correct thinkingConfig Amp-Thread-ID: https://ampcode.com/threads/T-019ca511-710d-776d-a07c-4b750f871a93 Co-authored-by: Amp <amp@ampcode.com>	2026-03-01 02:18:43 +08:00
huang_usaki	3b4f9f43db	feat(registry): add gemini-3.1-flash-image support	2026-02-27 10:20:46 +08:00
Luis Pater	8c6c90da74	fix(registry): clean up outdated model definitions in static data	2026-02-26 23:12:40 +08:00
Luis Pater	713388dd7b	Fixed: #1675 fix(gemini): add model definitions for Gemini 3.1 Pro High and Image	2026-02-23 00:12:57 +08:00
Luis Pater	d210be06c2	fix(gemini): update min Thinking value and add Gemini 3.1 Pro Preview model definition	2026-02-22 21:51:32 +08:00
Luis Pater	081cfe806e	fix(gemini): correct `Created` timestamps for Gemini 3.1 Pro Preview model definitions	2026-02-21 20:47:47 +08:00
hkfires	c1c62a6c04	feat(gemini): add Gemini 3.1 Pro Preview model definitions	2026-02-21 20:42:29 +08:00
apparition	1a0ceda0fc	feat: add Gemini 3.1 Pro Preview model definition	2026-02-19 17:43:08 +08:00
Luis Pater	bb86a0c0c4	feat(logging, executor): add request logging tests and WebSocket-based Codex executor - Introduced unit tests for request logging middleware to enhance coverage. - Added WebSocket-based Codex executor to support Responses API upgrade. - Updated middleware logic to selectively capture request bodies for memory efficiency. - Enhanced Codex configuration handling with new WebSocket attributes.	2026-02-19 01:57:02 +08:00
Luis Pater	ae1e8a5191	chore(runtime, registry): update Codex client version and GPT-5.3 model creation date	2026-02-13 12:47:48 +08:00
Franz Bettag	1ce56d7413	Update internal/registry/model_definitions_static_data.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-12 23:37:27 +01:00
Franz Bettag	41a78be3a2	feat(registry): add gpt-5.3-codex-spark model definition	2026-02-12 23:24:08 +01:00
Luis Pater	575881cb59	feat(registry): add new model definition for MiniMax-M2.5	2026-02-12 22:43:01 +08:00
hkfires	f361b2716d	feat(registry): add glm-5 model to iflow	2026-02-12 11:13:28 +08:00
hkfires	349ddcaa89	fix(registry): correct max completion tokens for opus 4.6 thinking	2026-02-10 18:05:40 +08:00
test	f5f26f0cbe	Add Kimi (Moonshot AI) provider support - OAuth2 device authorization grant flow (RFC 8628) for authentication - Streaming and non-streaming chat completions via OpenAI-compatible API - Models: kimi-k2, kimi-k2-thinking, kimi-k2.5 - CLI `--kimi-login` command for device flow auth - Token management with automatic refresh - Thinking/reasoning effort support for thinking-enabled models Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 19:24:46 -05:00
Luis Pater	4b00312fef	Merge pull request #1435 from tianyicui/fix/haiku-4-5-thinking-support fix: Enable extended thinking support for Claude Haiku 4.5	2026-02-06 05:44:14 +08:00
Frank Qing	f870a9d2a7	fix(registry): correct Claude Opus 4.6 model metadata	2026-02-06 05:39:41 +08:00
kvokka	bc78d668ac	feat(registry): register Claude 4.6 static data Add model definition for Claude 4.6 Opus with 200k context length and thinking support capabilities.	2026-02-05 23:13:36 +04:00
Luis Pater	5bd0896ad7	feat(registry): add GPT 5.3 Codex model to static data	2026-02-06 01:52:41 +08:00
Luis Pater	f7d82fda3f	feat(registry): add Kimi-K2.5 model to static data	2026-02-05 19:48:04 +08:00
Tianyi Cui	706590c62a	fix: Enable extended thinking support for Claude Haiku 4.5 Claude Haiku 4.5 (claude-haiku-4-5-20251001) supports extended thinking according to Anthropic's official documentation: https://docs.anthropic.com/en/docs/build-with-claude/extended-thinking The model was incorrectly marked as not supporting thinking in the static model definitions. This fix adds ThinkingSupport with the same parameters as other Claude 4.5 models (Sonnet, Opus): - Min: 1024 tokens - Max: 128000 tokens - ZeroAllowed: true - DynamicAllowed: false	2026-02-05 19:03:23 +08:00
hkfires	c8c27325dc	feat(thinking): enable thinking toggle for qwen3 and deepseek models Fix #1245	2026-01-28 09:54:05 +08:00
hkfires	88a0f095e8	chore(registry): disable gemini 2.5 flash image preview model	2026-01-27 18:33:13 +08:00
hkfires	c65f64dce0	chore(registry): comment out rev19-uic3-1p model config	2026-01-27 18:33:13 +08:00
hkfires	d18cd217e1	feat(api): add management model definitions endpoint	2026-01-27 18:33:12 +08:00

30 Commits