CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-04 19:51:18 +00:00

Author	SHA1	Message	Date
hkfires	fee736933b	feat(openai-compat): add per-model thinking support	2026-03-24 14:21:12 +08:00
hkfires	c3d5dbe96f	feat(model_registry): enhance model registration and refresh mechanisms	2026-03-13 10:56:39 +08:00
hkfires	dbd42a42b2	fix(model_updater): clarify log message for model refresh failure	2026-03-12 10:32:04 +08:00
hkfires	dea3e74d35	feat(antigravity): refactor model handling and remove unused code	2026-03-12 09:24:45 +08:00
hkfires	e333fbea3d	feat(updater): update StartModelsUpdater to block until models refresh completes	2026-03-10 14:41:58 +08:00
hkfires	efbe36d1d4	feat(updater): change models refresh to one-time fetch on startup	2026-03-10 14:18:54 +08:00
hkfires	30d5c95b26	feat(registry): refresh model catalog from network	2026-03-10 14:02:54 +08:00
hkfires	d1e3195e6f	feat(codex): register models by plan tier	2026-03-10 11:20:37 +08:00
Luis Pater	631e5c8331	Merge pull request #1922 from shenshuoyaoyouguang/pr/model-registry-safety fix(registry): clone model snapshots and invalidate available-model cache	2026-03-07 23:01:42 +08:00
Luis Pater	ca90487a8c	Merge branch 'main' into feature/add-gemini-3.1-flash-image-preview	2026-03-07 22:16:09 +08:00
chujian	3a18f6fcca	fix(registry): clone slice fields in model map output	2026-03-07 18:53:56 +08:00
chujian	099e734a02	fix(registry): always clone available model snapshots	2026-03-07 18:40:02 +08:00
chujian	97ef633c57	fix(registry): address review feedback	2026-03-07 17:36:57 +08:00
chujian	dae8463ba1	fix(registry): clone model snapshots and invalidate available-model cache	2026-03-07 16:59:23 +08:00
Frad LEE	a8cbc68c3e	feat(registry): add gemini 3.1 flash lite preview - Add model to GetGeminiModels() - Add model to GetGeminiVertexModels() - Add model to GetGeminiCLIModels() - Add model to GetAIStudioModels() - Add to AntigravityModelConfig with thinking levels - Update gemini-3-flash-preview description Registers the new lightweight Gemini model across all provider endpoints for cost-effective high-volume usage scenarios. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 20:52:28 +08:00
zhongnan.rex	242aecd924	feat(registry): add gemini-3.1-flash-image-preview model definition	2026-03-06 10:50:04 +08:00
Luis Pater	9397f7049f	fix(registry): simplify GPT 5.4 model description in static data	2026-03-06 02:32:56 +08:00
Luis Pater	8822f20d17	feat(registry): add GPT 5.4 model definition to static data	2026-03-06 02:23:53 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
Luis Pater	8aa2cce8c5	Merge PR #1735 into dev with conflict resolution and fixes	2026-03-02 03:22:51 +08:00
hkfires	134f41496d	fix(antigravity): update model configurations and add new models for Antigravity	2026-03-01 10:05:29 +08:00
Luis Pater	1ae994b4aa	fix(antigravity): adjust thinkingBudget default to 64000 and update model definitions for Claude	2026-03-01 09:39:39 +08:00
margbug01	8de0885b7d	fix: support thinking.type="auto" from Amp client for Antigravity Claude models ## Problem When using Antigravity Claude models through CLIProxyAPI, the thinking chain (reasoning content) does not display in the Amp client. ## Root Cause The Amp client sends `thinking: {"type": "auto"}` in its requests, but `ConvertClaudeRequestToAntigravity` only handled `"enabled"` and `"adaptive"` types in its switch statement. The `"auto"` type was silently ignored, resulting in no `thinkingConfig` being set in the translated Gemini request. Without `thinkingConfig`, the Antigravity API returns responses without any thinking content. Additionally, the Antigravity API for Claude models does not support `thinkingBudget: -1` (auto mode sentinel). It requires a concrete positive budget value. The fix uses 128000 as the budget for "auto" mode, which `ApplyThinking` will then normalize to stay within the model's actual limits (e.g., capped to `maxOutputTokens - 1`). ## Changes ### internal/translator/antigravity/claude/antigravity_claude_request.go 1. Add "auto" case to the thinking type switch statement. Sets `thinkingBudget: 128000` and `includeThoughts: true`. The budget is subsequently normalized by `ApplyThinking` based on model-specific limits. 2. Add "auto" to hasThinking check so that interleaved thinking hints are injected for tool-use scenarios when Amp sends `thinking.type="auto"`. ### internal/registry/model_definitions_static_data.go 3. Add Thinking configuration for `claude-sonnet-4-6`, `claude-sonnet-4-5`, and `claude-opus-4-6` in `GetAntigravityModelConfig()` -- these were previously missing, causing `ApplyThinking` to skip thinking config entirely. ## Testing - Deployed to Railway test instance (cpa-thinking-test) - Verified via debug logging that: - Amp sends `thinking: {"type": "auto"}` - CPA now translates this to `thinkingConfig: {thinkingBudget: 128000, includeThoughts: true}` - `ApplyThinking` normalizes the budget to model-specific limits - Antigravity API receives the correct thinkingConfig Amp-Thread-ID: https://ampcode.com/threads/T-019ca511-710d-776d-a07c-4b750f871a93 Co-authored-by: Amp <amp@ampcode.com>	2026-03-01 02:18:43 +08:00
maplelove	f3c164d345	feat(antigravity): update to v1.19.5 with new models and Claude 4-6 migration	2026-02-27 10:34:27 +08:00
maplelove	4040b1e766	Merge remote-tracking branch 'upstream/dev' into dev # Conflicts: # internal/runtime/executor/antigravity_executor.go	2026-02-27 10:29:50 +08:00
huang_usaki	3b4f9f43db	feat(registry): add gemini-3.1-flash-image support	2026-02-27 10:20:46 +08:00
Luis Pater	8c6c90da74	fix(registry): clean up outdated model definitions in static data	2026-02-26 23:12:40 +08:00
maplelove	8f97a5f77c	feat(registry): expose input modalities, token limits, and generation methods for Antigravity models	2026-02-23 13:33:51 +08:00
Luis Pater	713388dd7b	Fixed: #1675 fix(gemini): add model definitions for Gemini 3.1 Pro High and Image	2026-02-23 00:12:57 +08:00
Luis Pater	d210be06c2	fix(gemini): update min Thinking value and add Gemini 3.1 Pro Preview model definition	2026-02-22 21:51:32 +08:00
Luis Pater	081cfe806e	fix(gemini): correct `Created` timestamps for Gemini 3.1 Pro Preview model definitions	2026-02-21 20:47:47 +08:00
hkfires	c1c62a6c04	feat(gemini): add Gemini 3.1 Pro Preview model definitions	2026-02-21 20:42:29 +08:00
apparition	1a0ceda0fc	feat: add Gemini 3.1 Pro Preview model definition	2026-02-19 17:43:08 +08:00
Luis Pater	bb86a0c0c4	feat(logging, executor): add request logging tests and WebSocket-based Codex executor - Introduced unit tests for request logging middleware to enhance coverage. - Added WebSocket-based Codex executor to support Responses API upgrade. - Updated middleware logic to selectively capture request bodies for memory efficiency. - Enhanced Codex configuration handling with new WebSocket attributes.	2026-02-19 01:57:02 +08:00
Luis Pater	46a6782065	refactor(all): replace manual pointer assignments with `new` to enhance code readability and maintainability	2026-02-15 14:10:10 +08:00
Luis Pater	ae1e8a5191	chore(runtime, registry): update Codex client version and GPT-5.3 model creation date	2026-02-13 12:47:48 +08:00
Franz Bettag	1ce56d7413	Update internal/registry/model_definitions_static_data.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-12 23:37:27 +01:00
Franz Bettag	41a78be3a2	feat(registry): add gpt-5.3-codex-spark model definition	2026-02-12 23:24:08 +01:00
Luis Pater	575881cb59	feat(registry): add new model definition for MiniMax-M2.5	2026-02-12 22:43:01 +08:00
hkfires	f361b2716d	feat(registry): add glm-5 model to iflow	2026-02-12 11:13:28 +08:00
hkfires	349ddcaa89	fix(registry): correct max completion tokens for opus 4.6 thinking	2026-02-10 18:05:40 +08:00
test	f5f26f0cbe	Add Kimi (Moonshot AI) provider support - OAuth2 device authorization grant flow (RFC 8628) for authentication - Streaming and non-streaming chat completions via OpenAI-compatible API - Models: kimi-k2, kimi-k2-thinking, kimi-k2.5 - CLI `--kimi-login` command for device flow auth - Token management with automatic refresh - Thinking/reasoning effort support for thinking-enabled models Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 19:24:46 -05:00
Luis Pater	4b00312fef	Merge pull request #1435 from tianyicui/fix/haiku-4-5-thinking-support fix: Enable extended thinking support for Claude Haiku 4.5	2026-02-06 05:44:14 +08:00
Frank Qing	f870a9d2a7	fix(registry): correct Claude Opus 4.6 model metadata	2026-02-06 05:39:41 +08:00
kvokka	bc78d668ac	feat(registry): register Claude 4.6 static data Add model definition for Claude 4.6 Opus with 200k context length and thinking support capabilities.	2026-02-05 23:13:36 +04:00
Luis Pater	5bd0896ad7	feat(registry): add GPT 5.3 Codex model to static data	2026-02-06 01:52:41 +08:00
Luis Pater	f7d82fda3f	feat(registry): add Kimi-K2.5 model to static data	2026-02-05 19:48:04 +08:00
Tianyi Cui	706590c62a	fix: Enable extended thinking support for Claude Haiku 4.5 Claude Haiku 4.5 (claude-haiku-4-5-20251001) supports extended thinking according to Anthropic's official documentation: https://docs.anthropic.com/en/docs/build-with-claude/extended-thinking The model was incorrectly marked as not supporting thinking in the static model definitions. This fix adds ThinkingSupport with the same parameters as other Claude 4.5 models (Sonnet, Opus): - Min: 1024 tokens - Max: 128000 tokens - ZeroAllowed: true - DynamicAllowed: false	2026-02-05 19:03:23 +08:00
hkfires	c8c27325dc	feat(thinking): enable thinking toggle for qwen3 and deepseek models Fix #1245	2026-01-28 09:54:05 +08:00
hkfires	88a0f095e8	chore(registry): disable gemini 2.5 flash image preview model	2026-01-27 18:33:13 +08:00
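
Commits 8de0885b7d and 706590c62a above describe two related ideas: a `thinking.type` of "auto" is mapped to a concrete budget of 128000, and that budget is then normalized against per-model limits (Min, Max, ZeroAllowed, DynamicAllowed, and a cap of `maxOutputTokens - 1`). The following is a minimal sketch of that flow as described in the commit messages only. It is not the repository's code: the struct layout, the helper names `budgetForThinkingType` and `normalizeBudget`, the handling of negative budgets, and the 64000 output-token limit in `main` are all assumptions made for illustration, and the "adaptive" type is left out because the log does not say how it is translated.

```go
package main

import "fmt"

// ThinkingSupport carries the four fields named in commit 706590c62a.
// The struct layout is an assumption, not the repository's actual type.
type ThinkingSupport struct {
	Min            int  // smallest non-zero thinking budget
	Max            int  // largest thinking budget
	ZeroAllowed    bool // whether a budget of 0 (thinking off) is accepted
	DynamicAllowed bool // whether the -1 "dynamic" sentinel is accepted
}

// autoThinkingBudget is the value commit 8de0885b7d says is used for "auto".
const autoThinkingBudget = 128000

// budgetForThinkingType is a hypothetical helper: it picks the starting
// budget for a request's thinking.type. Unknown types report ok == false.
func budgetForThinkingType(thinkingType string, requested int) (budget int, ok bool) {
	switch thinkingType {
	case "enabled":
		return requested, true
	case "auto":
		return autoThinkingBudget, true
	default:
		return 0, false
	}
}

// normalizeBudget is a hypothetical stand-in for the normalization step the
// commit attributes to ApplyThinking: keep 0 and -1 only where the model
// allows them, clamp to [Min, Max], then cap at maxOutputTokens - 1.
func normalizeBudget(budget int, ts ThinkingSupport, maxOutputTokens int) int {
	if budget == 0 && ts.ZeroAllowed {
		return 0
	}
	if budget < 0 && ts.DynamicAllowed {
		return -1
	}
	if budget < ts.Min {
		budget = ts.Min // assumption: unsupported small or negative values rise to Min
	}
	if budget > ts.Max {
		budget = ts.Max
	}
	if maxOutputTokens > 0 && budget > maxOutputTokens-1 {
		budget = maxOutputTokens - 1
	}
	return budget
}

func main() {
	// Limits quoted in commit 706590c62a for Claude 4.5 models.
	claude := ThinkingSupport{Min: 1024, Max: 128000, ZeroAllowed: true, DynamicAllowed: false}

	budget, ok := budgetForThinkingType("auto", 0)
	fmt.Println("auto budget:", budget, "recognized:", ok)

	// 64000 is an illustrative output-token limit, not a documented value.
	fmt.Println("normalized:", normalizeBudget(budget, claude, 64000))
}
```

Keeping the type-to-budget mapping separate from the clamping step mirrors the split the commit describes, where the translator sets a generous budget and a later step applies model-specific limits.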

152 Commits