CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-18 12:23:44 +00:00

Author	SHA1	Message	Date
kunish	578c312660	fix(copilot): lower static Claude context limits and expose them to Claude Code The Copilot API enforces per-account prompt token limits (128K individual, 168K business) that are lower than the total context window (200K). When the dynamic /models API fetch fails or returns no capabilities.limits, the static fallback of 200K exceeds the real enforced limit, causing intermittent "prompt token count exceeds the limit" errors. Two complementary fixes: 1. Lower static Copilot Claude model ContextLength from 200000 to 128000 (the conservative default matching defaultCopilotContextLength). Dynamic API limits override this when available. 2. Add context_length and max_completion_tokens to Claude-format model responses so Claude Code CLI can learn the actual Copilot limit instead of relying on its built-in 1M context configuration.	2026-04-08 17:02:53 +08:00
rensumo	9a5f142c33	feat(codebuddy): add glm-5v-turbo model and update context lengths	2026-04-08 09:48:25 +08:00
Luis Pater	98509f615c	Merge pull request #485 from kunish/fix/copilot-premium-request-inflation fix(copilot): reduce premium request inflation, enable thinking, and use dynamic API limits	2026-04-04 02:19:56 +08:00
Luis Pater	e7a66ae504	Merge branch 'router-for-me:main' into main	2026-04-04 02:18:06 +08:00
Luis Pater	a824e7cd0b	feat(models): add GPT-5.3, GPT-5.4, and GPT-5.4-mini with enhanced "thinking" levels	2026-04-03 23:05:10 +08:00
kunish	59af2c57b1	fix(copilot): reduce premium request inflation and enable thinking This commit addresses three issues with Claude Code through GitHub Copilot: 1. Premium request inflation: Responses API requests were missing Openai-Intent headers and proper defaults, causing Copilot to bill each tool-loop continuation as a new premium request. Fixed by adding isAgentInitiated() heuristic (checks for tool_result content or preceding assistant tool_use), applying Responses API defaults (store, include, reasoning.summary), and local tiktoken-based token counting to avoid extra API calls. 2. Context overflow: Claude Code's modelSupports1M() hardcodes opus-4-6 as 1M-capable, but Copilot only supports ~128K-200K. Fixed by stripping the context-1m-2025-08-07 beta from translated request bodies. Also forwards response headers in non-streaming Execute() and registers the GET /copilot-quota management API route. 3. Thinking not working: Add ThinkingSupport with level-based reasoning to Claude models in the static definitions. Normalize Copilot's non-standard 'reasoning_text' response field to 'reasoning_content' before passing to the SDK translator. Use caller-provided context in CountTokens instead of Background().	2026-04-03 20:24:30 +08:00
rensumo	73cda6e836	Update CodeBuddy DeepSeek model description	2026-04-03 11:03:33 +08:00
rensumo	0805989ee5	Update the CodeBuddy CN model list	2026-04-03 10:59:27 +08:00
Luis Pater	9720b03a6b	Merge pull request #477 from ben-vargas/plus-main fix(copilot): route Gemini preview models to chat endpoint and correct context lengths	2026-04-02 11:36:51 +08:00
Ben Vargas	c1a8adf1ab	feat(registry): add GitHub Copilot gemini-3.1-pro-preview model	2026-04-01 01:25:03 -06:00
Ben Vargas	08e078fc25	fix(openai): route copilot Gemini preview models to chat endpoint	2026-04-01 01:24:58 -06:00
Thai Nguyen Hung	bd09c0bf09	feat(registry): add gpt-5.4-mini model to GitHub Copilot registry	2026-04-01 10:04:38 +07:00
黄姜恒	19c52bcb60	feat: stash code	2026-03-25 10:14:14 +08:00
Luis Pater	7fa527193c	Merge pull request #453 from HeCHieh/fix/github-copilot-gpt54-responses Fix GitHub Copilot gpt-5.4 endpoint routing	2026-03-25 09:45:23 +08:00
Luis Pater	ed0eb51b4d	Merge pull request #450 from lwiles692/feature/add-codebuddy-support feat(auth): add CodeBuddy-CN browser OAuth authentication support	2026-03-25 09:43:52 +08:00
Luis Pater	0e4f669c8b	Merge branch 'router-for-me:main' into main	2026-03-25 09:38:34 +08:00
hkfires	fee736933b	feat(openai-compat): add per-model thinking support	2026-03-24 14:21:12 +08:00
hechieh	e6690cb447	Refine GitHub Copilot endpoint selection Amp-Thread-ID: https://ampcode.com/threads/T-019d14cd-bc90-70ce-b1ae-87bc97332650 Co-authored-by: Amp <amp@ampcode.com>	2026-03-22 19:43:35 +08:00
hechieh	35907416b8	Fix GitHub Copilot gpt-5.4 endpoint routing Amp-Thread-ID: https://ampcode.com/threads/T-019d14cd-bc90-70ce-b1ae-87bc97332650 Co-authored-by: Amp <amp@ampcode.com>	2026-03-22 19:05:44 +08:00
Wei Lee	4022e69651	feat(auth): add CodeBuddy-CN browser OAuth authentication support	2026-03-18 17:50:12 +08:00
Luis Pater	86d5db472a	Merge branch 'main' into plus	2026-03-13 11:28:52 +08:00
hkfires	c3d5dbe96f	feat(model_registry): enhance model registration and refresh mechanisms	2026-03-13 10:56:39 +08:00
hkfires	dbd42a42b2	fix(model_updater): clarify log message for model refresh failure	2026-03-12 10:32:04 +08:00
hkfires	dea3e74d35	feat(antigravity): refactor model handling and remove unused code	2026-03-12 09:24:45 +08:00
Luis Pater	046865461e	Merge PR #424 from router-for-me/main	2026-03-10 19:19:29 +08:00
hkfires	e333fbea3d	feat(updater): update StartModelsUpdater to block until models refresh completes	2026-03-10 14:41:58 +08:00
hkfires	efbe36d1d4	feat(updater): change models refresh to one-time fetch on startup	2026-03-10 14:18:54 +08:00
hkfires	30d5c95b26	feat(registry): refresh model catalog from network	2026-03-10 14:02:54 +08:00
hkfires	d1e3195e6f	feat(codex): register models by plan tier	2026-03-10 11:20:37 +08:00
Luis Pater	182b31963a	Merge branch 'router-for-me:main' into main	2026-03-08 20:48:05 +08:00
Luis Pater	631e5c8331	Merge pull request #1922 from shenshuoyaoyouguang/pr/model-registry-safety fix(registry): clone model snapshots and invalidate available-model cache	2026-03-07 23:01:42 +08:00
Luis Pater	ca90487a8c	Merge branch 'main' into feature/add-gemini-3.1-flash-image-preview	2026-03-07 22:16:09 +08:00
chujian	3a18f6fcca	fix(registry): clone slice fields in model map output	2026-03-07 18:53:56 +08:00
chujian	099e734a02	fix(registry): always clone available model snapshots	2026-03-07 18:40:02 +08:00
chujian	97ef633c57	fix(registry): address review feedback	2026-03-07 17:36:57 +08:00
chujian	dae8463ba1	fix(registry): clone model snapshots and invalidate available-model cache	2026-03-07 16:59:23 +08:00
Luis Pater	0c05131aeb	Merge branch 'router-for-me:main' into main	2026-03-07 09:08:28 +08:00
Frad LEE	a8cbc68c3e	feat(registry): add gemini 3.1 flash lite preview - Add model to GetGeminiModels() - Add model to GetGeminiVertexModels() - Add model to GetGeminiCLIModels() - Add model to GetAIStudioModels() - Add to AntigravityModelConfig with thinking levels - Update gemini-3-flash-preview description Registers the new lightweight Gemini model across all provider endpoints for cost-effective high-volume usage scenarios. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 20:52:28 +08:00
zhongnan.rex	242aecd924	feat(registry): add gemini-3.1-flash-image-preview model definition	2026-03-06 10:50:04 +08:00
Luis Pater	ad5253bd2b	Merge branch 'router-for-me:main' into main	2026-03-06 04:15:55 +08:00
Luis Pater	9397f7049f	fix(registry): simplify GPT 5.4 model description in static data	2026-03-06 02:32:56 +08:00
Luis Pater	a14d19b92c	Merge branch 'router-for-me:main' into main	2026-03-06 02:25:19 +08:00
Luis Pater	8822f20d17	feat(registry): add GPT 5.4 model definition to static data	2026-03-06 02:23:53 +08:00
LMark	9e855f8100	feat(github-copilot): add /responses support for gpt-4o and gpt-4.1	2026-03-05 21:20:21 +08:00
CheesesNguyen	c51851689b	fix: remove SOFT_LIMIT_REACHED logic, tool compression, and fix bugs - Remove SOFT_LIMIT_REACHED marker injection in response path - Remove SOFT_LIMIT_REACHED detection logic in request path - Remove SOFT_LIMIT_REACHED streaming logic in executor - Remove tool_compression.go and related constants - Fix truncation_detector: string(rune(len)) producing Unicode char instead of decimal string - Fix WebSearchToolUseId being overwritten by non-web-search tools - Fix duplicate kiro entry in model_definitions.go comment - Add build output to .gitignore	2026-03-05 10:05:39 +07:00
Luis Pater	179e5434b1	Merge pull request #406 from router-for-me/main v6.8.40	2026-03-03 21:51:48 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
Luis Pater	7547d1d0b3	chore(config): add default OAuth model alias configurations and extend registry with supported API endpoints	2026-03-02 21:36:42 +08:00
Luis Pater	68934942d0	Merge branch 'pr-402-local' # Conflicts: # internal/config/oauth_model_alias_migration.go	2026-03-02 20:45:37 +08:00
Luis Pater	8aa2cce8c5	Merge PR #1735 into dev with conflict resolution and fixes	2026-03-02 03:22:51 +08:00

278 Commits