CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-03-29 16:54:41 +00:00

Author	SHA1	Message	Date
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
Luis Pater	8aa2cce8c5	Merge PR #1735 into dev with conflict resolution and fixes	2026-03-02 03:22:51 +08:00
hkfires	134f41496d	fix(antigravity): update model configurations and add new models for Antigravity	2026-03-01 10:05:29 +08:00
Luis Pater	1ae994b4aa	fix(antigravity): adjust thinkingBudget default to 64000 and update model definitions for Claude	2026-03-01 09:39:39 +08:00
margbug01	8de0885b7d	fix: support thinking.type="auto" from Amp client for Antigravity Claude models ## Problem When using Antigravity Claude models through CLIProxyAPI, the thinking chain (reasoning content) does not display in the Amp client. ## Root Cause The Amp client sends `thinking: {"type": "auto"}` in its requests, but `ConvertClaudeRequestToAntigravity` only handled `"enabled"` and `"adaptive"` types in its switch statement. The `"auto"` type was silently ignored, resulting in no `thinkingConfig` being set in the translated Gemini request. Without `thinkingConfig`, the Antigravity API returns responses without any thinking content. Additionally, the Antigravity API for Claude models does not support `thinkingBudget: -1` (auto mode sentinel). It requires a concrete positive budget value. The fix uses 128000 as the budget for "auto" mode, which `ApplyThinking` will then normalize to stay within the model's actual limits (e.g., capped to `maxOutputTokens - 1`). ## Changes ### internal/translator/antigravity/claude/antigravity_claude_request.go 1. Add "auto" case to the thinking type switch statement. Sets `thinkingBudget: 128000` and `includeThoughts: true`. The budget is subsequently normalized by `ApplyThinking` based on model-specific limits. 2. Add "auto" to hasThinking check so that interleaved thinking hints are injected for tool-use scenarios when Amp sends `thinking.type="auto"`. ### internal/registry/model_definitions_static_data.go 3. Add Thinking configuration for `claude-sonnet-4-6`, `claude-sonnet-4-5`, and `claude-opus-4-6` in `GetAntigravityModelConfig()` -- these were previously missing, causing `ApplyThinking` to skip thinking config entirely. ## Testing - Deployed to Railway test instance (cpa-thinking-test) - Verified via debug logging that: - Amp sends `thinking: {"type": "auto"}` - CPA now translates this to `thinkingConfig: {thinkingBudget: 128000, includeThoughts: true}` - `ApplyThinking` normalizes the budget to model-specific limits - Antigravity API receives the correct thinkingConfig Amp-Thread-ID: https://ampcode.com/threads/T-019ca511-710d-776d-a07c-4b750f871a93 Co-authored-by: Amp <amp@ampcode.com>	2026-03-01 02:18:43 +08:00
maplelove	f3c164d345	feat(antigravity): update to v1.19.5 with new models and Claude 4-6 migration	2026-02-27 10:34:27 +08:00
maplelove	4040b1e766	Merge remote-tracking branch 'upstream/dev' into dev # Conflicts: # internal/runtime/executor/antigravity_executor.go	2026-02-27 10:29:50 +08:00
huang_usaki	3b4f9f43db	feat(registry): add gemini-3.1-flash-image support	2026-02-27 10:20:46 +08:00
Luis Pater	8c6c90da74	fix(registry): clean up outdated model definitions in static data	2026-02-26 23:12:40 +08:00
maplelove	8f97a5f77c	feat(registry): expose input modalities, token limits, and generation methods for Antigravity models	2026-02-23 13:33:51 +08:00
Luis Pater	713388dd7b	Fixed: #1675 fix(gemini): add model definitions for Gemini 3.1 Pro High and Image	2026-02-23 00:12:57 +08:00
Luis Pater	d210be06c2	fix(gemini): update min Thinking value and add Gemini 3.1 Pro Preview model definition	2026-02-22 21:51:32 +08:00
Luis Pater	081cfe806e	fix(gemini): correct `Created` timestamps for Gemini 3.1 Pro Preview model definitions	2026-02-21 20:47:47 +08:00
hkfires	c1c62a6c04	feat(gemini): add Gemini 3.1 Pro Preview model definitions	2026-02-21 20:42:29 +08:00
apparition	1a0ceda0fc	feat: add Gemini 3.1 Pro Preview model definition	2026-02-19 17:43:08 +08:00
Luis Pater	bb86a0c0c4	feat(logging, executor): add request logging tests and WebSocket-based Codex executor - Introduced unit tests for request logging middleware to enhance coverage. - Added WebSocket-based Codex executor to support Responses API upgrade. - Updated middleware logic to selectively capture request bodies for memory efficiency. - Enhanced Codex configuration handling with new WebSocket attributes.	2026-02-19 01:57:02 +08:00
Luis Pater	46a6782065	refactor(all): replace manual pointer assignments with `new` to enhance code readability and maintainability	2026-02-15 14:10:10 +08:00
Luis Pater	ae1e8a5191	chore(runtime, registry): update Codex client version and GPT-5.3 model creation date	2026-02-13 12:47:48 +08:00
Franz Bettag	1ce56d7413	Update internal/registry/model_definitions_static_data.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-12 23:37:27 +01:00
Franz Bettag	41a78be3a2	feat(registry): add gpt-5.3-codex-spark model definition	2026-02-12 23:24:08 +01:00
Luis Pater	575881cb59	feat(registry): add new model definition for MiniMax-M2.5	2026-02-12 22:43:01 +08:00
hkfires	f361b2716d	feat(registry): add glm-5 model to iflow	2026-02-12 11:13:28 +08:00
hkfires	349ddcaa89	fix(registry): correct max completion tokens for opus 4.6 thinking	2026-02-10 18:05:40 +08:00
test	f5f26f0cbe	Add Kimi (Moonshot AI) provider support - OAuth2 device authorization grant flow (RFC 8628) for authentication - Streaming and non-streaming chat completions via OpenAI-compatible API - Models: kimi-k2, kimi-k2-thinking, kimi-k2.5 - CLI `--kimi-login` command for device flow auth - Token management with automatic refresh - Thinking/reasoning effort support for thinking-enabled models Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 19:24:46 -05:00
Luis Pater	4b00312fef	Merge pull request #1435 from tianyicui/fix/haiku-4-5-thinking-support fix: Enable extended thinking support for Claude Haiku 4.5	2026-02-06 05:44:14 +08:00
Frank Qing	f870a9d2a7	fix(registry): correct Claude Opus 4.6 model metadata	2026-02-06 05:39:41 +08:00
kvokka	bc78d668ac	feat(registry): register Claude 4.6 static data Add model definition for Claude 4.6 Opus with 200k context length and thinking support capabilities.	2026-02-05 23:13:36 +04:00
Luis Pater	5bd0896ad7	feat(registry): add GPT 5.3 Codex model to static data	2026-02-06 01:52:41 +08:00
Luis Pater	f7d82fda3f	feat(registry): add Kimi-K2.5 model to static data	2026-02-05 19:48:04 +08:00
Tianyi Cui	706590c62a	fix: Enable extended thinking support for Claude Haiku 4.5 Claude Haiku 4.5 (claude-haiku-4-5-20251001) supports extended thinking according to Anthropic's official documentation: https://docs.anthropic.com/en/docs/build-with-claude/extended-thinking The model was incorrectly marked as not supporting thinking in the static model definitions. This fix adds ThinkingSupport with the same parameters as other Claude 4.5 models (Sonnet, Opus): - Min: 1024 tokens - Max: 128000 tokens - ZeroAllowed: true - DynamicAllowed: false	2026-02-05 19:03:23 +08:00
hkfires	c8c27325dc	feat(thinking): enable thinking toggle for qwen3 and deepseek models Fix #1245	2026-01-28 09:54:05 +08:00
hkfires	88a0f095e8	chore(registry): disable gemini 2.5 flash image preview model	2026-01-27 18:33:13 +08:00
hkfires	c65f64dce0	chore(registry): comment out rev19-uic3-1p model config	2026-01-27 18:33:13 +08:00
hkfires	d18cd217e1	feat(api): add management model definitions endpoint	2026-01-27 18:33:12 +08:00
Darley	46c6fb1e7a	fix(api): enhance ClaudeModels response to align with api.anthropic.com	2026-01-24 04:41:08 +03:30
hkfires	e641fde25c	feat(registry): support provider-specific model info lookup	2026-01-20 10:01:17 +08:00
dinhkarate	8734d4cb90	feat(vertex): add Imagen image generation model support Add support for Imagen 3.0 and 4.0 image generation models in Vertex AI: - Add 5 Imagen model definitions (4.0, 4.0-ultra, 4.0-fast, 3.0, 3.0-fast) - Implement :predict action routing for Imagen models - Convert Imagen request/response format to match Gemini structure like gemini-3-pro-image - Transform prompts to Imagen's instances/parameters format - Convert base64 image responses to Gemini-compatible inline data	2026-01-20 01:26:37 +07:00
hkfires	c175821cc4	feat(registry): expand antigravity model config Remove static Name mapping and add entries for claude-sonnet-4-5, tab_flash_lite_preview, and gpt-oss-120b-medium configs	2026-01-19 19:32:00 +08:00
hkfires	2b387e169b	feat(iflow): add iflow-rome model definition	2026-01-15 20:23:55 +08:00
hkfires	e0ffec885c	fix(aistudio): remove levels from model definitions	2026-01-15 16:06:46 +08:00
hkfires	5c40a2db21	refactor(thinking): simplify ModeNone and budget validation logic	2026-01-15 14:03:08 +08:00
hkfires	7f1b2b3f6e	fix(thinking): improve model lookup and validation	2026-01-15 13:06:40 +08:00
hkfires	40ee065eff	fix(thinking): use static lookup to avoid alias issues	2026-01-15 13:06:40 +08:00
hkfires	a75fb6af90	refactor(antigravity): remove hardcoded model aliases	2026-01-15 13:06:39 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00
Luis Pater	e02ceecd35	feat(registry): introduce `ModelRegistryHook` for monitoring model registrations and unregistrations Added support for external hooks to observe model registry events using the `ModelRegistryHook` interface. Implemented thread-safe, non-blocking execution of hooks with panic recovery. Comprehensive tests added to verify hook behavior during registration, unregistration, blocking, and panic scenarios.	2026-01-02 23:18:40 +08:00
hkfires	4fc3d5e935	refactor(iflow): simplify thinking config handling for GLM and MiniMax models	2026-01-01 19:31:08 +08:00
Luis Pater	8d15723195	feat(registry): add `GetAvailableModelsByProvider` method for retrieving models by provider	2025-12-31 23:37:46 +08:00
hkfires	e332419081	feat(registry): add thinking support for gemini-2.5-computer-use-preview model	2025-12-31 17:09:22 +08:00
hkfires	ce7474d953	feat(cliproxy): propagate thinking support metadata to aliased models	2025-12-30 15:16:54 +08:00

1 2 3

134 Commits