CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-12 17:24:13 +00:00

Author	SHA1	Message	Date
rensumo	bea13f9724	fix(executor): support non-stream requests for CodeBuddy	2026-04-06 13:59:06 +08:00
Luis Pater	7223fee2de	Merge branch 'pr-488' # Conflicts: # README.md # README_CN.md # README_JA.md v6.9.15-0	2026-04-05 02:08:45 +08:00
Luis Pater	ada8e2905e	feat(api): enhance proxy resolution for API key-based auth Added comprehensive support for resolving proxy URLs from configuration based on API key and provider attributes. Introduced new helper functions and extended the test suite to validate fallback mechanisms and compatibility cases.	2026-04-05 01:56:34 +08:00
Luis Pater	4ba10531da	feat(docs): add Poixe AI sponsorship details to README files Added Poixe AI sponsorship information, including referral bonuses and platform capabilities, to README files in English, Japanese, and Chinese. Updated assets to include Poixe AI logo.	2026-04-05 01:20:50 +08:00
Luis Pater	3774b56e9f	feat(misc): add background updater for Antigravity version caching Introduce `StartAntigravityVersionUpdater` to periodically refresh the cached Antigravity version using a non-blocking background process. Updated main server flow to initialize the updater.	2026-04-04 22:09:11 +08:00
Luis Pater	c2d4137fb9	feat(executor): enhance Qwen system message handling with strict injection and merging rules Closes: #2537	2026-04-04 21:51:02 +08:00
Luis Pater	2ee938acaf	Merge pull request #2535 from rensumo/main feat: dynamically fetch the Antigravity User-Agent version number	2026-04-04 21:00:47 +08:00
rensumo	8d5e470e1f	feat: dynamically fetch antigravity UA version from releases API Fetch the latest version from the antigravity auto-updater releases endpoint and cache it for 6 hours. Falls back to 1.21.9 if the API is unreachable or returns unexpected data.	2026-04-04 14:52:59 +08:00
Luis Pater	3882494878	Merge pull request #486 from router-for-me/plus v6.9.14 v6.9.14-0	2026-04-04 11:40:13 +08:00
Luis Pater	088c1d07f4	Merge branch 'main' into plus	2026-04-04 11:40:03 +08:00
Luis Pater	8430b28cfa	Merge pull request #2526 from rensumo/main feat: upgrade the Antigravity UA version to 1.21.9	2026-04-04 11:32:16 +08:00
rensumo	f3ab8f4bc5	chore: update antigravity UA version to 1.21.9	2026-04-04 07:35:08 +08:00
Luis Pater	0e4f189c2e	Merge pull request #1302 from dinhkarate/feat(vertex)/add-prefix-field Feat(vertex): add prefix field	2026-04-04 04:17:12 +08:00
Luis Pater	98509f615c	Merge pull request #485 from kunish/fix/copilot-premium-request-inflation fix(copilot): reduce premium request inflation, enable thinking, and use dynamic API limits v6.9.13-1	2026-04-04 02:19:56 +08:00
Luis Pater	e7a66ae504	Merge branch 'router-for-me:main' into main v6.9.13-0	2026-04-04 02:18:06 +08:00
Luis Pater	754b126944	fix(executor): remove commented-out code in QwenExecutor	2026-04-04 02:14:48 +08:00
Luis Pater	ae37ccffbf	Merge pull request #2520 from Arronlong/main fix:qwen invalid_parameter_error	2026-04-04 02:13:09 +08:00
Luis Pater	42c062bb5b	Merge pull request #2509 from adamhelfgott/fix-claude-thinking-temperature Normalize Claude temperature when thinking is enabled	2026-04-03 23:55:50 +08:00
kunish	87bf0b73d5	fix(copilot): use dynamic API limits to prevent prompt token overflow The Copilot API enforces per-account prompt token limits (128K individual, 168K business) that differ from the static 200K context length advertised by the proxy. This mismatch caused Claude Code to accumulate context beyond the actual limit, triggering "prompt token count exceeds the limit of 128000" errors. Changes: - Extract max_prompt_tokens and max_output_tokens from the Copilot /models API response (capabilities.limits) and use them as the authoritative ContextLength and MaxCompletionTokens values - Add CopilotModelLimits struct and Limits() helper to parse limits from the existing Capabilities map - Fix GitLab Duo context-1m beta header not being set when routing through the Anthropic gateway (gitlab_duo_force_context_1m attr was set but only gin headers were checked) - Fix flaky parallel tests that shared global model registry state	2026-04-03 23:54:17 +08:00
Luis Pater	f389667ec3	Merge pull request #2513 from lonr-6/codex/fix-ws-custom-tool-repair-v2 fix: repair responses websocket custom tool call pairing	2026-04-03 23:45:38 +08:00
Arronlong	29dba0399b	Comment out system message check in Qwen executor fix qwen invalid_parameter_error	2026-04-03 23:07:33 +08:00
Luis Pater	a824e7cd0b	feat(models): add GPT-5.3, GPT-5.4, and GPT-5.4-mini with enhanced "thinking" levels	2026-04-03 23:05:10 +08:00
Luis Pater	140faef7dc	Merge branch 'router-for-me:main' into main v6.9.12-0	2026-04-03 21:48:23 +08:00
Luis Pater	adb580b344	feat(security): add configuration to toggle Gemini CLI endpoint access Closes: #2445	2026-04-03 21:46:49 +08:00
Luis Pater	06405f2129	fix(security): enforce stricter localhost validation for GeminiCLIAPIHandler Closes: #2445	2026-04-03 21:22:03 +08:00
kunish	b849bf79d6	fix(copilot): address code review — SSE reasoning, multi-choice, agent detection - Strip SSE `data:` prefix before normalizing reasoning_text→reasoning_content in streaming mode; re-wrap afterward for the translator - Iterate all choices in normalizeGitHubCopilotReasoningField (not just choices[0]) to support n>1 requests - Remove over-broad tool-role fallback in isAgentInitiated that scanned all messages for role:"tool", aligning with opencode's approach of only detecting active tool loops — genuine user follow-ups after tool use are no longer mis-classified as agent-initiated - Add 5 reasoning normalization tests; update 2 X-Initiator tests to match refined semantics	2026-04-03 20:51:19 +08:00
kunish	59af2c57b1	fix(copilot): reduce premium request inflation and enable thinking This commit addresses three issues with Claude Code through GitHub Copilot: 1. Premium request inflation: Responses API requests were missing Openai-Intent headers and proper defaults, causing Copilot to bill each tool-loop continuation as a new premium request. Fixed by adding isAgentInitiated() heuristic (checks for tool_result content or preceding assistant tool_use), applying Responses API defaults (store, include, reasoning.summary), and local tiktoken-based token counting to avoid extra API calls. 2. Context overflow: Claude Code's modelSupports1M() hardcodes opus-4-6 as 1M-capable, but Copilot only supports ~128K-200K. Fixed by stripping the context-1m-2025-08-07 beta from translated request bodies. Also forwards response headers in non-streaming Execute() and registers the GET /copilot-quota management API route. 3. Thinking not working: Add ThinkingSupport with level-based reasoning to Claude models in the static definitions. Normalize Copilot's non-standard 'reasoning_text' response field to 'reasoning_content' before passing to the SDK translator. Use caller-provided context in CountTokens instead of Background().	2026-04-03 20:24:30 +08:00
Kai Wang	d1fd2c4ad4	fix: repair websocket custom tool calls	2026-04-03 17:11:44 +08:00
Kai Wang	b6c6379bfa	fix: repair websocket custom tool calls	2026-04-03 17:11:42 +08:00
Kai Wang	8f0e66b72e	fix: repair websocket custom tool calls	2026-04-03 17:11:41 +08:00
Adam Helfgott	f63cf6ff7a	Normalize Claude temperature for thinking	2026-04-03 03:45:51 -04:00
Luis Pater	d2419ed49d	feat(executor): ensure default system message in QwenExecutor payload	2026-04-03 11:18:48 +08:00
Luis Pater	516d22c695	Merge pull request #484 from Ve-ria/main Update the CodeBuddy CN model list v6.9.10-1	2026-04-03 11:10:32 +08:00
rensumo	73cda6e836	Update CodeBuddy DeepSeek model description	2026-04-03 11:03:33 +08:00
rensumo	0805989ee5	Update the CodeBuddy CN model list	2026-04-03 10:59:27 +08:00
Luis Pater	75da02af55	Merge branch 'router-for-me:main' into main v6.9.10-0	2026-04-02 22:34:47 +08:00
Luis Pater	ab9ebea592	Merge PR #2474 # Conflicts: # internal/api/modules/amp/response_rewriter.go # internal/api/modules/amp/response_rewriter_test.go	2026-04-02 22:31:12 +08:00
Luis Pater	7ee37ee4b9	feat: add /healthz endpoint and test coverage for health check Closes: #2493	2026-04-02 21:56:27 +08:00
Luis Pater	837afffb31	docs: remove README_JA.md and clean up related links from README files	2026-04-02 21:37:47 +08:00
Luis Pater	03a1bac898	Merge upstream v6.9.9 (PR #483) v6.9.9-0	2026-04-02 21:31:21 +08:00
Luis Pater	3171d524f0	docs: fix duplicated ProxyPal entry in README files	2026-04-02 21:22:40 +08:00
Luis Pater	3e78a8d500	Merge branch 'main' into dev	2026-04-02 21:21:26 +08:00
Luis Pater	fcba912cc4	Merge pull request #2492 from davidwushi1145/main fix(responses): reassemble split SSE event/data frames before streaming	2026-04-02 21:20:31 +08:00
Luis Pater	7170eeea5f	Merge pull request #2454 from buddingnewinsights/add-proxypal-to-readme docs: add ProxyPal to "Who is with us?" section	2026-04-02 21:18:22 +08:00
Luis Pater	e3eb048c7a	Merge pull request #2489 from Soein/upstream-pr fix: strengthen countermeasures against Claude reverse-proxy detection	2026-04-02 21:16:58 +08:00
Luis Pater	a59e92435b	Merge pull request #2490 from router-for-me/logs Refactor websocket logging and error handling	2026-04-02 20:47:31 +08:00
davidwushi1145	108895fc04	Harden Responses SSE framing against partial chunk boundaries Follow-up review found two real framing hazards in the handler-layer framer: it could flush a partial `data:` payload before the JSON was complete, and it could inject an extra newline before chunks that already began with `\n`/`\r\n`. This commit tightens the framer so it only emits undelimited events when the buffered `data:` payload is already valid JSON (or `[DONE]`), skips newline injection for chunks that already start with a line break, and avoids the heavier `bytes.Split` path while scanning SSE fields. The regression suite now covers split `data:` payload chunks, newline-prefixed chunks, and dropping incomplete trailing data on flush, so the original Responses fix remains intact while the review concerns are explicitly locked down. Constraint: Keep the follow-up limited to handler-layer framing and tests Rejected: Ignore the review and rely on current executor chunk shapes \| leaves partial data payload corruption possible Rejected: Build a fully generic SSE parser \| wider change than needed for the identified risks Confidence: high Scope-risk: narrow Reversibility: clean Directive: Do not emit undelimited Responses SSE events unless buffered `data:` content is already complete and valid Tested: /tmp/go1.26.1/go/bin/go test ./sdk/api/handlers/openai -count=1 Tested: /tmp/go1.26.1/go/bin/go test ./sdk/api/handlers -count=1 Tested: /tmp/go1.26.1/go/bin/go vet ./sdk/api/handlers/... Not-tested: Full repository test suite outside sdk/api/handlers packages	2026-04-02 20:39:49 +08:00
davidwushi1145	abc293c642	Prevent malformed Responses SSE frames from breaking stream clients Line-oriented upstream executors can emit `event:` and `data:` as separate chunks, but the Responses handler had started terminating each incoming chunk as a full SSE event. That split `response.created` into an empty event plus a later data block, which broke downstream clients like OpenClaw. This keeps the fix in the handler layer: a small stateful framer now buffers standalone `event:` lines until the matching `data:` arrives, preserves already-framed events, and ignores delimiter-only leftovers. The regression suite now covers split event/data framing, full-event passthrough, terminal errors, and the bootstrap path that forwards line-oriented openai-response streams from non-Codex executors too. Constraint: Keep the fix localized to Responses handler framing instead of patching every executor Rejected: Revert to v6.9.7 chunk writing \| would reintroduce data-only framing regressions Rejected: Patch each line-oriented executor separately \| duplicates fragile SSE assembly logic Confidence: high Scope-risk: narrow Reversibility: clean Directive: Do not assume incoming Responses stream chunks are already complete SSE events; preserve handler-layer reassembly for split `event:`/`data:` inputs Tested: /tmp/go1.26.1/go/bin/go test ./sdk/api/handlers/openai -count=1 Tested: /tmp/go1.26.1/go/bin/go test ./sdk/api/handlers -count=1 Tested: /tmp/go1.26.1/go test ./sdk/api/handlers/... -count=1 Tested: /tmp/go1.26.1/go/bin/go vet ./sdk/api/handlers/... Tested: Temporary patched server on 127.0.0.1:18317 -> /v1/models 200, /v1/responses non-stream 200, /v1/responses stream emitted combined `event:` + `data:` frames Not-tested: Full repository test suite outside sdk/api/handlers packages	2026-04-02 20:26:42 +08:00
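pzy	bb44671845	fix: fix 3 issues in the reverse-proxy detection countermeasures - computeFingerprint uses rune indexes instead of byte indexes, fixing fingerprint mismatches for multi-byte characters - the utls Chrome TLS fingerprint applies only to official Anthropic domains; a custom base_url uses the standard transport - IPv6 addresses use net.JoinHostPort to join the port correctly	2026-04-02 19:12:55 +08:00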
Luis Pater	09e480036a	feat(auth): add support for managing custom headers in auth files Closes #2457	2026-04-02 19:11:09 +08:00

2848 Commits