CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-03-27 22:27:28 +00:00

Author	SHA1	Message	Date
Luis Pater	13c93e8cfd	Merge pull request #414 from CheesesNguyen/fix/remove-soft-limit-and-tool-compression fix: remove SOFT_LIMIT_REACHED logic, tool compression, and fix bugs	2026-03-05 20:12:50 +08:00
Luis Pater	352cb98ff0	Merge branch 'router-for-me:main' into main	2026-03-05 20:08:19 +08:00
CheesesNguyen	7fe1d102cb	fix: don't treat empty input as truncation for tools without required fields Tools like TaskList, TaskGet have no required parameters, so empty input is valid. Previously, the truncation detector flagged all empty inputs as truncated, causing these tools to be skipped and breaking the tool loop. Now only flag empty input as truncation when the tool has required fields defined in RequiredFieldsByTool.	2026-03-05 14:43:45 +07:00
Luis Pater	5850492a93	Fixed: #1548 test(translator): add unit tests for fallback logic in `ConvertCodexResponseToOpenAI` model assignment	2026-03-05 12:11:54 +08:00
Luis Pater	ebef1fae2a	Merge pull request #1511 from stondy0103/fix/responses-nullable-type-array fix(translator): fix nullable type arrays breaking Gemini/Antigravity API	2026-03-05 11:30:09 +08:00
CheesesNguyen	c51851689b	fix: remove SOFT_LIMIT_REACHED logic, tool compression, and fix bugs - Remove SOFT_LIMIT_REACHED marker injection in response path - Remove SOFT_LIMIT_REACHED detection logic in request path - Remove SOFT_LIMIT_REACHED streaming logic in executor - Remove tool_compression.go and related constants - Fix truncation_detector: string(rune(len)) producing Unicode char instead of decimal string - Fix WebSearchToolUseId being overwritten by non-web-search tools - Fix duplicate kiro entry in model_definitions.go comment - Add build output to .gitignore	2026-03-05 10:05:39 +07:00
Luis Pater	4bbeb92e9a	Fixed: #1135 test(translator): add tests for `tool_choice` handling in Claude request conversions	2026-03-04 22:28:26 +08:00
Luis Pater	b436dad8bc	Merge pull request #1822 from sususu98/fix/strip-defer-loading fix(translator): strip defer_loading from Claude tool declarations in Codex and Gemini translators	2026-03-04 20:49:48 +08:00
Luis Pater	7ebd8f0c44	Merge branch 'router-for-me:main' into main	2026-03-04 18:30:45 +08:00
sususu98	d26ad8224d	fix(translator): strip defer_loading from Claude tool declarations in Codex and Gemini translators Claude's Tool Search feature (advanced-tool-use-2025-11-20 beta) adds defer_loading field to tool definitions. When proxying Claude requests to Codex or Gemini, this unknown field causes 400 errors upstream. Strip defer_loading (and cache_control where missing) in all three Claude-to-upstream translation paths: - codex/claude: defer_loading + cache_control - gemini-cli/claude: defer_loading - gemini/claude: defer_loading Fixes #1725, Fixes #1375	2026-03-04 14:21:30 +08:00
hkfires	5c84d69d42	feat(translator): map output_config.effort to adaptive thinking level in antigravity	2026-03-04 13:11:07 +08:00
sususu98	527e4b7f26	fix(antigravity): pass through adaptive thinking effort level instead of always mapping to high	2026-03-04 10:12:45 +08:00
Luis Pater	179e5434b1	Merge pull request #406 from router-for-me/main v6.8.40	2026-03-03 21:51:48 +08:00
Luis Pater	9f95b31158	fix(translator): enhance handling of mixed output content in Claude requests	2026-03-03 21:49:41 +08:00
hkfires	ce87714ef1	feat(thinking): normalize effort levels in adaptive thinking requests to prevent validation errors	2026-03-03 15:10:47 +08:00
hkfires	0452b869e8	feat(thinking): add HasLevel and MapToClaudeEffort functions for adaptive thinking support	2026-03-03 14:16:36 +08:00
hkfires	d2e5857b82	feat(thinking): enhance adaptive thinking support across models and update test cases	2026-03-03 13:00:24 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
Luis Pater	68934942d0	Merge branch 'pr-402-local' # Conflicts: # internal/config/oauth_model_alias_migration.go	2026-03-02 20:45:37 +08:00
hkfires	914db94e79	refactor(headers): streamline User-Agent handling and introduce GeminiCLI versioning	2026-03-02 13:04:30 +08:00
Luis Pater	8aa2cce8c5	Merge PR #1735 into dev with conflict resolution and fixes	2026-03-02 03:22:51 +08:00
Luis Pater	446150a747	Merge branch 'router-for-me:main' into main	2026-03-01 20:35:05 +08:00
hkfires	b148820c35	fix(translator): handle Claude thinking type "auto" like adaptive	2026-03-01 10:30:19 +08:00
Luis Pater	b6ca5ef7ce	Merge branch 'main' into plus	2026-03-01 09:41:52 +08:00
Luis Pater	1ae994b4aa	fix(antigravity): adjust thinkingBudget default to 64000 and update model definitions for Claude	2026-03-01 09:39:39 +08:00
Luis Pater	32e64dacfd	Merge branch 'main' into plus	2026-03-01 02:44:26 +08:00
Luis Pater	5446cd2b02	Merge pull request #1761 from margbug01/fix/thinking-chain-display fix: support thinking.type=auto from Amp client and decouple thinking translation from unsigned history	2026-03-01 02:30:42 +08:00
margbug01	8de0885b7d	fix: support thinking.type="auto" from Amp client for Antigravity Claude models ## Problem When using Antigravity Claude models through CLIProxyAPI, the thinking chain (reasoning content) does not display in the Amp client. ## Root Cause The Amp client sends `thinking: {"type": "auto"}` in its requests, but `ConvertClaudeRequestToAntigravity` only handled `"enabled"` and `"adaptive"` types in its switch statement. The `"auto"` type was silently ignored, resulting in no `thinkingConfig` being set in the translated Gemini request. Without `thinkingConfig`, the Antigravity API returns responses without any thinking content. Additionally, the Antigravity API for Claude models does not support `thinkingBudget: -1` (auto mode sentinel). It requires a concrete positive budget value. The fix uses 128000 as the budget for "auto" mode, which `ApplyThinking` will then normalize to stay within the model's actual limits (e.g., capped to `maxOutputTokens - 1`). ## Changes ### internal/translator/antigravity/claude/antigravity_claude_request.go 1. Add "auto" case to the thinking type switch statement. Sets `thinkingBudget: 128000` and `includeThoughts: true`. The budget is subsequently normalized by `ApplyThinking` based on model-specific limits. 2. Add "auto" to hasThinking check so that interleaved thinking hints are injected for tool-use scenarios when Amp sends `thinking.type="auto"`. ### internal/registry/model_definitions_static_data.go 3. Add Thinking configuration for `claude-sonnet-4-6`, `claude-sonnet-4-5`, and `claude-opus-4-6` in `GetAntigravityModelConfig()` -- these were previously missing, causing `ApplyThinking` to skip thinking config entirely. ## Testing - Deployed to Railway test instance (cpa-thinking-test) - Verified via debug logging that: - Amp sends `thinking: {"type": "auto"}` - CPA now translates this to `thinkingConfig: {thinkingBudget: 128000, includeThoughts: true}` - `ApplyThinking` normalizes the budget to model-specific limits - Antigravity API receives the correct thinkingConfig Amp-Thread-ID: https://ampcode.com/threads/T-019ca511-710d-776d-a07c-4b750f871a93 Co-authored-by: Amp <amp@ampcode.com>	2026-03-01 02:18:43 +08:00
Luis Pater	16243f18fd	Merge branch 'router-for-me:main' into main	2026-03-01 01:46:23 +08:00
Luis Pater	a6ce5f36e6	Fixed: #1758 fix(codex): filter billing headers from system result text and update template logic	2026-03-01 01:45:35 +08:00
maplelove	68dd2bfe82	fix(translator): allow passthrough of custom generationConfig for all Gemini-like providers	2026-02-27 17:13:42 +08:00
Cyrus	030bf5e6c7	feat(kiro): add IDC auth and endpoint improvements, redesign fingerprint system - Add IAM Identity Center (IDC) authentication with CLI flags (--kiro-idc-login, --kiro-idc-start-url, --kiro-idc-region) and login flow - Add ProfileArn auto-fetching in Execute/ExecuteStream for imported IDC accounts - Simplify endpoint preference with map-based alias lookup and getAuthValue helper - Redesign fingerprint as global singleton with external config and per-account deterministic generation - Add StartURL and FingerprintConfig fields to Kiro config - Add AgentContinuationID/AgentTaskType support in Kiro translators - Add comprehensive tests for executor, fingerprint, SSO OIDC, and AWS helpers - Add CLI login documentation to README	2026-02-27 00:58:03 +08:00
Luis Pater	f481d25133	Merge branch 'main' into plus	2026-02-26 23:16:17 +08:00
Luis Pater	816fb4c5da	Merge pull request #1682 from sususu98/fix/tool-result-image-parts fix(antigravity): place tool_result images in functionResponse.parts and unify mimeType	2026-02-25 23:14:35 +08:00
Luis Pater	6bcac3a55a	Merge branch 'router-for-me:main' into main	2026-02-25 22:21:31 +08:00
Luis Pater	d24ea4ce2a	Merge pull request #1664 from ciberponk/pr/responses-compaction-compat feat: add codex responses compatibility for compaction payloads	2026-02-25 01:21:59 +08:00
Luis Pater	77cc4ce3a0	Merge branch 'main' into plus	2026-02-25 01:04:15 +08:00
Luis Pater	37dfea1d3f	Merge pull request #287 from possible055/main fix(kiro): support OR-group field matching in truncation detector	2026-02-25 01:02:49 +08:00
apparition	c785c1a3ca	fix(kiro): support OR-group field matching in truncation detector - Change RequiredFieldsByTool value type from []string to [][]string - Outer slice = AND (all groups required); inner slice = OR (any one satisfies) - Fix Bash entry to accept "cmd" or "command", resolving soft-truncation loop - Update findMissingRequiredFields logic and inline docs accordingly	2026-02-24 22:48:05 +08:00
Luis Pater	c3e12c5e58	Merge pull request #1654 from alexey-yanchenko/feature/pass-file-inputs Pass file input from /chat/completions and /responses to codex and claude	2026-02-24 05:53:11 +08:00
Luis Pater	1825fc7503	Merge pull request #1643 from alexey-yanchenko/fix/gemini-prompt-tokens Fix usage convertation from gemini response to openai format	2026-02-24 05:46:13 +08:00
Darley	6e634fe3f9	fix: filter out orphaned tool results from history and current context	2026-02-23 14:33:59 +08:00
sususu98	4e26182d14	fix(antigravity): place tool_result images in functionResponse.parts and unify mimeType Move base64 image data from Claude tool_result into functionResponse.parts as inlineData instead of outer sibling parts, preventing context bloat. Unify all inlineData field naming to camelCase mimeType across Claude, OpenAI, and Gemini translators. Add comprehensive edge case tests and Gemini-side regression test for functionResponse.parts preservation.	2026-02-23 13:38:21 +08:00
fan	afc8a0f9be	refactor: simplify context_management compatibility handling	2026-02-21 22:20:48 +08:00
ciberponk	d693d7993b	feat: support responses compaction payload compatibility for codex translator	2026-02-21 12:56:10 +08:00
Luis Pater	57d18bb226	Merge branch 'router-for-me:main' into main	2026-02-20 22:42:01 +08:00
DragonBaiMo	7c9c89dace	fix(kiro): keep thinking enabled across request formats	2026-02-20 20:34:40 +08:00
Alexey Yanchenko	0cbfe7f457	Pass file input from /chat/completions and /responses to codex and claude	2026-02-20 10:25:44 +07:00
Kirill Turanskiy	1cc21cc45b	fix: prevent duplicate function call arguments when delta events precede done Non-spark codex models (gpt-5.3-codex, gpt-5.2-codex) stream function call arguments via multiple delta events followed by a done event. The done handler unconditionally emitted the full arguments, duplicating what deltas already streamed. This produced invalid double JSON that Claude Code couldn't parse, causing tool calls to fail with missing parameters and infinite retry loops. Add HasReceivedArgumentsDelta flag to track whether delta events were received. The done handler now only emits arguments when no deltas preceded it (spark models), while delta-based streaming continues to work for non-spark models.	2026-02-19 23:18:14 +03:00
Kirill Turanskiy	07cf616e2b	fix: handle response.function_call_arguments.done in codex→claude streaming translator Some Codex models (e.g. gpt-5.3-codex-spark) send function call arguments in a single "done" event without preceding "delta" events. The streaming translator only handled "delta" events, causing tool call arguments to be lost — resulting in empty tool inputs and infinite retry loops in clients like Claude Code. Emit the full arguments from the "done" event as a single input_json_delta so downstream clients receive the complete tool input.	2026-02-19 23:18:14 +03:00

1 2 3 4 5 ...

478 Commits