CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-24 09:21:02 +00:00

Author	SHA1	Message	Date
Luis Pater	6184c43319	Fixed: #1109 feat(translator): enhance session ID derivation with user_id parsing in Claude	2026-01-20 12:35:40 +08:00
Luis Pater	2cbe4a790c	chore(translator): remove unnecessary whitespace in gemini_openai_response code	2026-01-20 11:47:33 +08:00
Luis Pater	68b3565d7b	Merge branch 'main' into dev (PR #961 )	2026-01-20 11:42:22 +08:00
Luis Pater	d4bb4e6624	refactor(antigravity): remove unused client signature handling in thinking objects	2026-01-20 10:17:55 +08:00
Luis Pater	a7ffc77e3d	Merge branch 'dev' into fix/cross-model-thinking-signature	2026-01-20 10:10:43 +08:00
Luis Pater	08779cc8a8	Merge branch 'router-for-me:main' into main	2026-01-19 21:00:58 +08:00
hkfires	52e46ced1b	fix(translator): avoid forcing RFC 8259 system prompt	2026-01-19 11:33:27 +08:00
hkfires	cf9daf470c	feat(translator): report cached token usage in Claude output	2026-01-19 11:23:44 +08:00
Luis Pater	2d9f6c104c	Merge branch 'main' into plus	2026-01-19 04:03:17 +08:00
Luis Pater	140d6211cc	feat(translator): add reasoning state tracking and improve reasoning summary handling - Introduced `oaiToResponsesStateReasoning` to track reasoning data. - Enhanced logic for emitting reasoning summary events and managing state transitions. - Updated output generation to handle multiple reasoning entries consistently.	2026-01-19 03:58:28 +08:00
hkfires	d5ef4a6d15	refactor(translator): remove registry model lookups from thinking config conversions	2026-01-18 10:30:14 +08:00
Luis Pater	46433a25f8	fix(translator): add check for empty `text` to prevent invalid serialization in `gemini` and `antigravity`	2026-01-18 00:50:10 +08:00
Luis Pater	f8f3ad84fc	Fixed: #1064 feat(translator): improve system message handling and content indexing across translators - Updated logic for processing system messages in `claude`, `gemini`, `gemini-cli`, and `antigravity` translators. - Introduced indexing for `systemInstruction.parts` to ensure proper ordering and handling of multi-part content. - Added safeguards for accurate content transformation and serialization.	2026-01-17 05:40:56 +08:00
Luis Pater	93d7883513	Merge pull request #110 from PancakeZik/fix/system-prompt-reinjection fix: prevent system prompt re-injection on subsequent turns	2026-01-17 05:19:11 +08:00
Luis Pater	015a3e8a83	Merge branch 'router-for-me:main' into main	2026-01-17 05:17:38 +08:00
Joao	6b074653f2	fix: prevent system prompt re-injection on subsequent turns When tool results are sent back to the model, the system prompt was being re-injected into the user message content, causing the model to think the user had pasted the system prompt again. This was especially noticeable after multiple tool uses. The fix checks if there is conversation history (len(history) > 0). If so, it's a subsequent turn and we skip system prompt injection. The system prompt is only injected on the first turn (len(history) == 0). This ensures: - First turn: system prompt is injected - Tool result turns: system prompt is NOT re-injected - New conversations: system prompt is injected fresh	2026-01-16 20:16:44 +00:00
Luis Pater	65b4e1ec6c	feat(codex): enable instruction toggling and update role terminology - Added conditional logic for Codex instruction injection based on configuration. - Updated role terminology from "user" to "developer" for better alignment with context.	2026-01-17 04:12:29 +08:00
Luis Pater	06afa29f2d	Merge branch 'router-for-me:main' into main	2026-01-16 20:01:35 +08:00
Luis Pater	6600d58ba2	feat(codex): enhance input transformation and remove unused `safety_identifier` field - Added logic to transform `inputResults` into structured JSON for improved processing. - Removed redundant `safety_identifier` field in executor payload to streamline requests.	2026-01-16 19:59:01 +08:00
Luis Pater	bca244df67	Merge branch 'router-for-me:main' into main	2026-01-16 11:37:33 +08:00
Luis Pater	cec4e251bd	feat(translator): preserve `text` field in serialized output during chat completions processing	2026-01-16 11:35:34 +08:00
Luis Pater	c29839d2ed	Merge remote-tracking branch 'origin/main' into pr-104 # Conflicts: # config.example.yaml # internal/config/config.go # sdk/cliproxy/auth/model_name_mappings.go	2026-01-16 09:40:07 +08:00
hkfires	199cf480b0	refactor(thinking): remove support for non-standard thinking configurations This change removes the translation logic for several non-standard, proprietary extensions used to configure thinking/reasoning. Specifically, support for `extra_body.google.thinking_config` and the Anthropic-style `thinking` object has been dropped from the OpenAI request translators. This simplification streamlines the translators, focusing them on the standard `reasoning_effort` parameter. It also removes the need to look up model information from the registry within these components. BREAKING CHANGE: Support for non-standard thinking configurations via `extra_body.google.thinking_config` and the Anthropic-style `thinking` object has been removed. Clients should now use the standard `reasoning_effort` parameter to control reasoning.	2026-01-15 19:32:12 +08:00
hkfires	5a77b7728e	refactor(thinking): improve budget clamping and logging with provider/model context	2026-01-15 13:06:41 +08:00
hkfires	ed8b0f25ee	fix(thinking): use LookupModelInfo for model data	2026-01-15 13:06:41 +08:00
hkfires	6e4a602c60	fix(thinking): map reasoning_effort to thinkingConfig	2026-01-15 13:06:40 +08:00
hkfires	0b06d637e7	refactor: improve thinking logic	2026-01-15 13:06:39 +08:00
Luis Pater	a235fb1507	Merge branch 'main' into plus	2026-01-15 03:30:56 +08:00
Luis Pater	b163f8ed9e	Fixed: #1004 feat(translator): add function name to response output item serialization - Included `item.name` in the serialized response output to enhance output item handling.	2026-01-15 03:27:00 +08:00
ZqinKing	83e5f60b8b	fix(kiro): scale description compression by needed size Compute a size-reduction based keep ratio and use it to trim tool descriptions, avoiding forced minimum truncation when the target size already fits. This aligns compression with actual payload reduction needs and prevents over-compression.	2026-01-14 16:22:46 +08:00
ZqinKing	5b433f962f	feat(kiro): 实现动态工具压缩功能 ## 背景当 Claude Code 发送过多工具信息时，可能超出 Kiro API 请求限制导致 500 错误。现有的工具描述截断（KiroMaxToolDescLen = 10237）只能限制单个工具的描述长度，无法解决整体工具列表过大的问题。 ## 解决方案实现动态工具压缩功能，采用两步压缩策略： 1. 先检查原始大小，超过 20KB 才进行压缩 2. 第一步：简化 input_schema，只保留 type/enum/required 字段 3. 第二步：按比例缩短 description（最短 50 字符） 4. 保留全部工具和 skills 可调用，不丢弃任何工具 ## 新增文件 - internal/translator/kiro/claude/tool_compression.go - calculateToolsSize(): 计算工具列表的 JSON 序列化大小 - simplifyInputSchema(): 简化 input_schema，递归处理嵌套 properties - compressToolDescription(): 按比例压缩描述，支持 UTF-8 安全截断 - compressToolsIfNeeded(): 主压缩函数，实现两步压缩策略 - internal/translator/kiro/claude/tool_compression_test.go - 完整的单元测试覆盖所有新增函数 - 测试 UTF-8 安全性 - 测试压缩效果 ## 修改文件 - internal/translator/kiro/common/constants.go - 新增 ToolCompressionTargetSize = 20KB (压缩目标大小阈值) - 新增 MinToolDescriptionLength = 50 (描述最短长度) - internal/translator/kiro/claude/kiro_claude_request.go - 在 convertClaudeToolsToKiro() 函数末尾调用 compressToolsIfNeeded() ## 测试结果 - 70KB 工具压缩至 17KB (74.7% 压缩率) - 所有单元测试通过 ## 预期效果 - 80KB+ tools 压缩至 ~15KB - 不影响工具调用功能	2026-01-14 11:07:07 +08:00
adrenjc	5977af96a0	fix(antigravity): prevent corrupted thought signature when switching models When switching from Claude models (e.g., Opus 4.5) to Gemini models (e.g., Flash) mid-conversation via Antigravity OAuth, the client-provided thinking signatures from Claude would cause "Corrupted thought signature" errors since they are incompatible with Gemini API. Changes: - Remove fallback to client-provided signatures in thinking block handling - Only use cached signatures (from same-session Gemini responses) - Skip thinking blocks without valid cached signatures - tool_use blocks continue to use skip_thought_signature_validator when no valid signature is available This ensures cross-model switching works correctly while preserving signature validation for same-model conversations. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-13 18:24:05 +08:00
Luis Pater	216dafe44b	Merge branch 'router-for-me:main' into main	2026-01-12 00:27:15 +08:00
hkfires	220ca45f74	fix(codex): only override instructions when upstream provides them	2026-01-11 15:52:21 +08:00
hkfires	70a82d80ac	fix(codex): only override instructions in responses for OpenCode UA	2026-01-11 15:19:37 +08:00
hkfires	ac626111ac	feat(codex): add OpenCode instructions based on user agent	2026-01-11 13:36:35 +08:00
extremk	5bb9c2a2bd	Add candidate count parameter to OpenAI request	2026-01-10 18:50:13 +08:00
extremk	0b5bbe9234	Add candidate count handling in OpenAI request	2026-01-10 18:49:29 +08:00
extremk	14c74e5e84	Handle 'n' parameter for candidate count in requests Added handling for the 'n' parameter to set candidate count in generationConfig.	2026-01-10 18:48:33 +08:00
extremk	6448d0ee7c	Add candidate count handling in OpenAI request	2026-01-10 18:47:41 +08:00
extremk	b0c17af2cf	Enhance Gemini to OpenAI response conversion Refactor response handling to support multiple candidates and improve parameter management.	2026-01-10 18:46:25 +08:00
Luis Pater	514b9bf9fc	Merge origin/main into pr-92	2026-01-10 01:12:22 +08:00
Luis Pater	4d7f389b69	Fixed: #941 fix(translator): ensure fallback to valid originalRequestRawJSON in response handling	2026-01-10 01:01:09 +08:00
Luis Pater	1906ebcfce	Merge branch 'main' into plus	2026-01-09 21:52:24 +08:00
hkfires	ee62ef4745	refactor(logging): clean up oauth logs and debugs	2026-01-09 11:20:55 +08:00
Luis Pater	d47b7dc79a	refactor(response): enhance parameter handling for Codex to Claude conversion	2026-01-09 05:20:19 +08:00
Luis Pater	3d01b3cfe8	Merge pull request #553 from XInTheDark/fix/builtin-tools-web-search fix(translator): preserve built-in tools (web_search) to Responses API	2026-01-09 04:40:13 +08:00
Luis Pater	4794645dec	Merge branch 'main' into plus	2026-01-06 23:20:39 +08:00
MohammadErfan Jabbari	fe6043aec7	fix(antigravity): preserve finish_reason tool_calls across streaming chunks When streaming responses with tool calls, the finish_reason was being overwritten. The upstream sends functionCall in chunk 1, then finishReason: STOP in chunk 2. The old code would set finish_reason from every chunk, causing "tool_calls" to be overwritten by "stop". This broke clients like Claude Code that rely on finish_reason to detect when tool calls are complete. Changes: - Add SawToolCall bool to track tool calls across entire stream - Add UpstreamFinishReason to cache the finish reason - Only emit finish_reason on final chunk (has both finishReason + usage) - Priority: tool_calls > max_tokens > stop Includes 5 unit tests covering: - Tool calls not overwritten by subsequent STOP - Normal text gets "stop" - MAX_TOKENS without tool calls gets "max_tokens" - Tool calls take priority over MAX_TOKENS - Intermediate chunks have no finish_reason Fixes streaming tool call detection for Claude Code + Gemini models. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-05 18:45:25 +01:00
Luis Pater	8f8dfd081b	Merge pull request #850 from can1357/main feat(translator): add developer role support for Gemini translators	2026-01-05 11:27:24 +08:00

1 2 3 4 5 ...

365 Commits