CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-12 17:24:13 +00:00

Author	SHA1	Message	Date
hkfires	d390b95b76	fix(tests): update test cases	2026-04-08 08:53:50 +08:00
Luis Pater	c1818f197b	Merge pull request #1940 from Blue-B/fix/claude-interleaved-thinking-amp-gzip-budget fix(claude): enable interleaved-thinking beta, decode AMP error gzip, fix budget 400	2026-04-06 09:08:23 +08:00
Blue-B	5f58248016	fix(claude): clamp max_tokens to model limit in normalizeClaudeBudget When adjustedBudget < minBudget, the previous fix blindly set max_tokens = budgetTokens+1 which could exceed MaxCompletionTokens. Now: cap max_tokens at MaxCompletionTokens, recalculate budget, and disable thinking entirely if constraints are unsatisfiable. Add unit tests covering raise, clamp, disable, and no-op scenarios.	2026-03-09 22:10:30 +09:00
Blue-B	07d6689d87	fix(claude): add interleaved-thinking beta header, AMP gzip error decoding, normalizeClaudeBudget max_tokens 1. Always include interleaved-thinking-2025-05-14 beta header so that thinking blocks are returned correctly for all Claude models. 2. Remove status-code guard in AMP reverse proxy ModifyResponse so that error responses (4xx/5xx) with hidden gzip encoding are decoded properly — prevents garbled error messages reaching the client. 3. In normalizeClaudeBudget, when the adjusted budget falls below the model minimum, set max_tokens = budgetTokens+1 instead of leaving the request unchanged (which causes a 400 from the API).	2026-03-07 21:31:10 +09:00
chujian	7c1299922e	fix(openai-compat): improve pool fallback and preserve adaptive thinking	2026-03-07 16:54:28 +08:00
hkfires	835ae178d4	feat(thinking): rename isBudgetBasedProvider to isBudgetCapableProvider and update logic for provider checks	2026-03-03 19:49:51 +08:00
hkfires	c80ab8bf0d	feat(thinking): improve provider family checks and clamp unsupported levels	2026-03-03 19:05:15 +08:00
hkfires	0452b869e8	feat(thinking): add HasLevel and MapToClaudeEffort functions for adaptive thinking support	2026-03-03 14:16:36 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
Luis Pater	8599b1560e	Fixed: #1716 feat(kimi): add support for explicit disabled thinking and reasoning effort handling	2026-02-28 05:29:07 +08:00
hkfires	0659ffab75	Revert "Merge pull request #1627 from thebtf/fix/reasoning-effort-clamping"	2026-02-24 19:47:53 +08:00
Kirill Turanskiy	2ea95266e3	fix: clamp reasoning_effort to valid OpenAI-format values CPA-internal thinking levels like 'xhigh' and 'minimal' are not accepted by OpenAI-format providers (MiniMax, etc.). The OpenAI applier now maps non-standard levels to the nearest valid reasoning_effort value before writing to the request body: xhigh → high minimal → low auto → medium	2026-02-18 03:36:42 +03:00
test	f5f26f0cbe	Add Kimi (Moonshot AI) provider support - OAuth2 device authorization grant flow (RFC 8628) for authentication - Streaming and non-streaming chat completions via OpenAI-compatible API - Models: kimi-k2, kimi-k2-thinking, kimi-k2.5 - CLI `--kimi-login` command for device flow auth - Token management with automatic refresh - Thinking/reasoning effort support for thinking-enabled models Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-05 19:24:46 -05:00
hkfires	209d74062a	fix(thinking): ensure includeThoughts is false for ModeNone in budget processing	2026-02-05 10:24:42 +08:00
hkfires	d86b13c9cb	fix(thinking): support user-defined includeThoughts setting with camelCase and snake_case variants Fixes #1378	2026-02-05 10:07:41 +08:00
neavo	6c65fdf54b	fix(gemini): support snake_case thinking config fields from Python SDK Google official Gemini Python SDK sends thinking_level, thinking_budget, and include_thoughts (snake_case) instead of thinkingLevel, thinkingBudget, and includeThoughts (camelCase). This caused thinking configuration to be ignored when using Python SDK. Changes: - Extract layer: extractGeminiConfig now reads snake_case as fallback - Apply layer: Gemini/CLI/Antigravity appliers clean up snake_case fields - Translator layer: Gemini->OpenAI/Claude/Codex translators support fallback - Tests: Added 4 test cases for snake_case field coverage Fixes #1426	2026-02-04 21:12:47 +08:00
hkfires	c8c27325dc	feat(thinking): enable thinking toggle for qwen3 and deepseek models Fix #1245	2026-01-28 09:54:05 +08:00
hkfires	e641fde25c	feat(registry): support provider-specific model info lookup	2026-01-20 10:01:17 +08:00
hkfires	239a28793c	feat(claude): clamp thinking budget to max_tokens constraints	2026-01-19 16:32:20 +08:00
hkfires	c421d653e7	refactor(claude): move max_tokens constraint enforcement to Apply method	2026-01-19 15:50:35 +08:00
hkfires	cb6caf3f87	fix(thinking): update ValidateConfig to include fromSuffix parameter and adjust budget validation logic	2026-01-18 16:37:14 +08:00
hkfires	03005b5d29	refactor(thinking): add Gemini family provider grouping for strict validation	2026-01-18 11:30:53 +08:00
hkfires	c7e8830a56	refactor(thinking): pass source and target formats to ApplyThinking for cross-format validation Update ApplyThinking signature to accept fromFormat and toFormat parameters instead of a single provider string. This enables: - Proper level-to-budget conversion when source is level-based (openai/codex) and target is budget-based (gemini/claude) - Strict budget range validation when source and target formats match - Level clamping to nearest supported level for cross-format requests - Format alias resolution in SDK translator registry for codex/openai-response Also adds ErrBudgetOutOfRange error code and improves iflow config extraction to fall back to openai format when iflow-specific config is not present.	2026-01-18 10:30:15 +08:00
hkfires	2b387e169b	feat(iflow): add iflow-rome model definition	2026-01-15 20:23:55 +08:00
hkfires	4ad6189487	refactor(thinking): extract antigravity logic into a dedicated provider	2026-01-15 19:08:22 +08:00
hkfires	ff4ff6bc2f	feat(thinking): support zero as a valid thinking budget for capable models	2026-01-15 15:41:10 +08:00
hkfires	5c40a2db21	refactor(thinking): simplify ModeNone and budget validation logic	2026-01-15 14:03:08 +08:00
hkfires	ee2976cca0	refactor(thinking): improve logging for user-defined models	2026-01-15 13:06:41 +08:00
hkfires	bcd4d9595f	fix(thinking): refine ModeNone handling based on provider capabilities	2026-01-15 13:06:41 +08:00
hkfires	5a77b7728e	refactor(thinking): improve budget clamping and logging with provider/model context	2026-01-15 13:06:41 +08:00
hkfires	1fbbba6f59	feat(logging): order log fields for improved readability	2026-01-15 13:06:41 +08:00
hkfires	f6a2d072e6	refactor(thinking): refine configuration logging	2026-01-15 13:06:41 +08:00
hkfires	6e4a602c60	fix(thinking): map reasoning_effort to thinkingConfig	2026-01-15 13:06:40 +08:00
hkfires	33d66959e9	test(thinking): remove legacy unit and integration tests	2026-01-15 13:06:40 +08:00
hkfires	7f1b2b3f6e	fix(thinking): improve model lookup and validation	2026-01-15 13:06:40 +08:00
hkfires	40ee065eff	fix(thinking): use static lookup to avoid alias issues	2026-01-15 13:06:40 +08:00
hkfires	72f2125668	fix(executor): properly handle thinking application errors	2026-01-15 13:06:39 +08:00
hkfires	e8f5888d8e	fix(thinking): fix auth matching for thinking suffix and json field conflicts	2026-01-15 13:06:39 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00

39 Commits