CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-15 10:52:03 +00:00

Author	SHA1	Message	Date
rensumo	55c146a0e7	feat: add glm-5.1 to CodeBuddy model list	2026-04-09 14:20:26 +08:00
Luis Pater	e9dc576409	Merge branch 'router-for-me:main' into main	2026-04-09 03:49:09 +08:00
Luis Pater	941334da79	fix(auth): handle OAuth model alias in retry logic and refine Qwen quota handling	2026-04-09 03:44:19 +08:00
Luis Pater	d54f816363	fix(executor): update Qwen user agent and enhance header configuration	2026-04-09 01:45:52 +08:00
Luis Pater	f43d25def1	Merge pull request #496 from kunish/fix/copilot-premium-request-inflation fix(copilot): prevent intermittent context overflow for Claude models	2026-04-08 23:43:15 +08:00
Luis Pater	6a43d7285c	Merge branch 'main' into plus	2026-04-08 23:42:05 +08:00
kunish	578c312660	fix(copilot): lower static Claude context limits and expose them to Claude Code The Copilot API enforces per-account prompt token limits (128K individual, 168K business) that are lower than the total context window (200K). When the dynamic /models API fetch fails or returns no capabilities.limits, the static fallback of 200K exceeds the real enforced limit, causing intermittent "prompt token count exceeds the limit" errors. Two complementary fixes: 1. Lower static Copilot Claude model ContextLength from 200000 to 128000 (the conservative default matching defaultCopilotContextLength). Dynamic API limits override this when available. 2. Add context_length and max_completion_tokens to Claude-format model responses so Claude Code CLI can learn the actual Copilot limit instead of relying on its built-in 1M context configuration.	2026-04-08 17:02:53 +08:00
Luis Pater	12b967118b	Merge pull request #2592 from router-for-me/tests fix(tests): update test cases	2026-04-08 11:57:15 +08:00
rensumo	9a5f142c33	feat(codebuddy): add glm-5v-turbo model and update context lengths	2026-04-08 09:48:25 +08:00
hkfires	d390b95b76	fix(tests): update test cases	2026-04-08 08:53:50 +08:00
Luis Pater	d1f6224b70	Merge pull request #2569 from LucasInsight/fix/record-zero-usage fix: record zero usage	2026-04-08 08:13:11 +08:00
Luis Pater	fcc59d606d	fix(translator): add unit tests to validate output_item.done fallback logic for Gemini and Claude	2026-04-08 03:54:15 +08:00
Luis Pater	91e7591955	fix(executor): add transient 429 resource exhausted handling with retry logic	2026-04-08 02:48:53 +08:00
Luis Pater	4607356333	Merge pull request #491 from Ve-ria/main 修复 CodeBuddy 不支持非流式请求的问题	2026-04-07 18:25:21 +08:00
Luis Pater	5ae38584b8	Merge branch 'main' into plus	2026-04-07 18:23:31 +08:00
Luis Pater	c8b7e2b8d6	fix(executor): ensure empty stream completions use output_item.done as fallback Fixed: #2583	2026-04-07 18:21:12 +08:00
Luis Pater	cad45ffa33	Merge pull request #2578 from LemonZuo/feat_socks5h feat: support socks5h scheme for proxy settings	2026-04-07 09:57:18 +08:00
Lemon	163d68318f	feat: support socks5h scheme for proxy settings	2026-04-07 07:46:11 +08:00
Michael	8b9dbe10f0	fix: record zero usage	2026-04-06 20:19:42 +08:00
rensumo	341b4beea1	Update internal/runtime/executor/codebuddy_executor.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-04-06 14:16:56 +08:00
rensumo	bea13f9724	fix(executor): support non-stream requests for CodeBuddy	2026-04-06 13:59:06 +08:00
Luis Pater	9f5bdfaa31	Merge pull request #2531 from jamestut/openai-vertex-token-usage-fix Fix missing `response.completed.usage` for late-usage OpenAI-compatible streams	2026-04-06 09:30:49 +08:00
Luis Pater	9eabdd09db	Merge pull request #2522 from aikins01/fix/strip-tool-use-signature fix(amp): strip signature from tool_use blocks before forwarding to Claude	2026-04-06 09:30:14 +08:00
Luis Pater	c3f8dc362e	Merge pull request #2491 from mpfo0106/feature/claude-code-safe-alignment-sentinels test(claude): add compatibility sentinels and centralize builtin fallback handling	2026-04-06 09:27:08 +08:00
Luis Pater	b85120873b	Merge pull request #2332 from RaviTharuma/fix/claude-thinking-signature fix: preserve Claude thinking signatures in Codex translator	2026-04-06 09:25:06 +08:00
Luis Pater	000fcb15fa	Merge pull request #2298 from snoyiatk/feat/add-gitstore-branch feat(gitstore): add support for specifying git branch (via GITSTORE_G…	2026-04-06 09:21:03 +08:00
Luis Pater	c1818f197b	Merge pull request #1940 from Blue-B/fix/claude-interleaved-thinking-amp-gzip-budget fix(claude): enable interleaved-thinking beta, decode AMP error gzip, fix budget 400	2026-04-06 09:08:23 +08:00
Aikins Laryea	b0653cec7b	fix(amp): strip signature from tool_use blocks before forwarding to Claude ensureAmpSignature injects signature:"" into tool_use blocks so the Amp TUI does not crash on P.signature.length. when Amp sends the conversation back, Claude rejects the extra field with 400: tool_use.signature: Extra inputs are not permitted strip the proxy-injected signature from tool_use blocks in SanitizeAmpRequestBody before forwarding to the upstream API.	2026-04-05 12:26:24 +00:00
Luis Pater	22a1a24cf5	feat(executor): add tests for preserving key order in cache control functions Added comprehensive tests to ensure key order is maintained when modifying payloads in `normalizeCacheControlTTL` and `enforceCacheControlLimit` functions. Removed unused helper functions and refactored implementations for better readability and efficiency.	2026-04-05 17:58:13 +08:00
Luis Pater	7223fee2de	Merge branch 'pr-488' # Conflicts: # README.md # README_CN.md # README_JA.md	2026-04-05 02:08:45 +08:00
Luis Pater	ada8e2905e	feat(api): enhance proxy resolution for API key-based auth Added comprehensive support for resolving proxy URLs from configuration based on API key and provider attributes. Introduced new helper functions and extended the test suite to validate fallback mechanisms and compatibility cases.	2026-04-05 01:56:34 +08:00
Luis Pater	3774b56e9f	feat(misc): add background updater for Antigravity version caching Introduce `StartAntigravityVersionUpdater` to periodically refresh the cached Antigravity version using a non-blocking background process. Updated main server flow to initialize the updater.	2026-04-04 22:09:11 +08:00
Luis Pater	c2d4137fb9	feat(executor): enhance Qwen system message handling with strict injection and merging rules Closes: #2537	2026-04-04 21:51:02 +08:00
rensumo	8d5e470e1f	feat: dynamically fetch antigravity UA version from releases API Fetch the latest version from the antigravity auto-updater releases endpoint and cache it for 6 hours. Falls back to 1.21.9 if the API is unreachable or returns unexpected data.	2026-04-04 14:52:59 +08:00
James	65e9e892a4	Fix missing `response.completed.usage` for late-usage OpenAI-compatible streams	2026-04-04 05:58:04 +00:00
Luis Pater	088c1d07f4	Merge branch 'main' into plus	2026-04-04 11:40:03 +08:00
Luis Pater	8430b28cfa	Merge pull request #2526 from rensumo/main feat: 升级反重力 (antigravity) UA 版本为 1.21.9	2026-04-04 11:32:16 +08:00
rensumo	f3ab8f4bc5	chore: update antigravity UA version to 1.21.9	2026-04-04 07:35:08 +08:00
Luis Pater	0e4f189c2e	Merge pull request #1302 from dinhkarate/feat(vertex)/add-prefix-field Feat(vertex): add prefix field	2026-04-04 04:17:12 +08:00
Luis Pater	98509f615c	Merge pull request #485 from kunish/fix/copilot-premium-request-inflation fix(copilot): reduce premium request inflation, enable thinking, and use dynamic API limits	2026-04-04 02:19:56 +08:00
Luis Pater	e7a66ae504	Merge branch 'router-for-me:main' into main	2026-04-04 02:18:06 +08:00
Luis Pater	754b126944	fix(executor): remove commented-out code in QwenExecutor	2026-04-04 02:14:48 +08:00
Luis Pater	ae37ccffbf	Merge pull request #2520 from Arronlong/main fix:qwen invalid_parameter_error	2026-04-04 02:13:09 +08:00
Luis Pater	42c062bb5b	Merge pull request #2509 from adamhelfgott/fix-claude-thinking-temperature Normalize Claude temperature when thinking is enabled	2026-04-03 23:55:50 +08:00
kunish	87bf0b73d5	fix(copilot): use dynamic API limits to prevent prompt token overflow The Copilot API enforces per-account prompt token limits (128K individual, 168K business) that differ from the static 200K context length advertised by the proxy. This mismatch caused Claude Code to accumulate context beyond the actual limit, triggering "prompt token count exceeds the limit of 128000" errors. Changes: - Extract max_prompt_tokens and max_output_tokens from the Copilot /models API response (capabilities.limits) and use them as the authoritative ContextLength and MaxCompletionTokens values - Add CopilotModelLimits struct and Limits() helper to parse limits from the existing Capabilities map - Fix GitLab Duo context-1m beta header not being set when routing through the Anthropic gateway (gitlab_duo_force_context_1m attr was set but only gin headers were checked) - Fix flaky parallel tests that shared global model registry state	2026-04-03 23:54:17 +08:00
Arronlong	29dba0399b	Comment out system message check in Qwen executor fix qwen invalid_parameter_error	2026-04-03 23:07:33 +08:00
Luis Pater	a824e7cd0b	feat(models): add GPT-5.3, GPT-5.4, and GPT-5.4-mini with enhanced "thinking" levels	2026-04-03 23:05:10 +08:00
Luis Pater	140faef7dc	Merge branch 'router-for-me:main' into main	2026-04-03 21:48:23 +08:00
Luis Pater	adb580b344	feat(security): add configuration to toggle Gemini CLI endpoint access Closes: #2445	2026-04-03 21:46:49 +08:00
kunish	b849bf79d6	fix(copilot): address code review — SSE reasoning, multi-choice, agent detection - Strip SSE `data:` prefix before normalizing reasoning_text→reasoning_content in streaming mode; re-wrap afterward for the translator - Iterate all choices in normalizeGitHubCopilotReasoningField (not just choices[0]) to support n>1 requests - Remove over-broad tool-role fallback in isAgentInitiated that scanned all messages for role:"tool", aligning with opencode's approach of only detecting active tool loops — genuine user follow-ups after tool use are no longer mis-classified as agent-initiated - Add 5 reasoning normalization tests; update 2 X-Initiator tests to match refined semantics	2026-04-03 20:51:19 +08:00

1 2 3 4 5 ...

2134 Commits