CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-03-09 15:25:17 +00:00

Author	SHA1	Message	Date
Luis Pater	ec6068060b	Merge branch 'main' into plus	2026-02-19 04:42:35 +08:00
Kirill Turanskiy	5fa23c7f41	fix: handle tool call argument streaming in Codex→OpenAI translator The OpenAI Chat Completions translator was silently dropping response.function_call_arguments.delta and response.function_call_arguments.done Codex SSE events, meaning tool call arguments were never streamed incrementally to clients. Add proper handling mirroring the proven Claude translator pattern: - response.output_item.added: announce tool call (id, name, empty args) - response.function_call_arguments.delta: stream argument chunks - response.function_call_arguments.done: emit full args if no deltas - response.output_item.done: defensive fallback for backward compat State tracking via HasReceivedArgumentsDelta and HasToolCallAnnounced ensures no duplicate argument emission and correct behavior for models like codex-spark that skip delta events entirely.	2026-02-18 19:09:05 +03:00
Luis Pater	d560c20c26	Merge branch 'router-for-me:main' into main	2026-02-15 14:49:13 +08:00
Alexey Yanchenko	63d4de5eea	Pass cache usage from codex to openai chat completions	2026-02-15 12:04:15 +07:00
Skyuno	632a2fd2f2	refactor: align GenerateSearchIndicatorEvents return type with other event builders Change GenerateSearchIndicatorEvents to return [][]byte instead of []sseEvent for consistency with BuildFallbackTextEvents and other event building functions. Benefits: - Consistent API across all event generation functions - Eliminates intermediate sseEvent type conversion in caller - Simplifies usage by returning ready-to-send SSE byte slices This addresses the code quality feedback from PR #226 review.	2026-02-13 22:04:09 +08:00
Skyuno	5626637fbd	security: remove query content from web search logs to prevent PII leakage - Remove search query from iteration logs (Info level) - Remove query and toolUseId from analysis logs (Info level) - Remove query from non-stream result logs (Info level) - Remove query from tool injection logs (Info level) - Remove query from tool_use detection logs (Debug level) This addresses the security concern raised in PR #226 review about potential PII exposure in search query logs.	2026-02-13 22:04:09 +08:00
Skyuno	587371eb14	refactor: align web search with executor layer patterns Consolidate web search handler, SSE event generation, stream analysis, and MCP HTTP I/O into the executor layer. Merge the separate kiro_websearch_handler.go back into kiro_executor.go to align with the single-file-per-executor convention. Translator retains only pure data types, detection, and payload transformation. Key changes: - Move SSE construction (search indicators, fallback text, message_start) from translator to executor, consistent with streamToChannel pattern - Move MCP handler (callMcpAPI, setMcpHeaders, fetchToolDescription) from translator to executor alongside other HTTP I/O - Reuse applyDynamicFingerprint for MCP UA headers (eliminate duplication) - Centralize MCP endpoint URL via BuildMcpEndpoint in translator - Add atomic Set/GetWebSearchDescription for cross-layer tool desc cache - Thread context.Context through MCP HTTP calls for cancellation support - Thread usage reporter through all web search API call paths - Add token expiry pre-check before MCP/GAR calls - Clean up dead code (GenerateMessageID, webSearchAuthContext fp logic, ContainsWebSearchTool, StripWebSearchTool)	2026-02-13 22:04:09 +08:00
Luis Pater	d02df0141b	Merge pull request #224 from Buywatermelon/fix/kiro-assistant-first-message fix(kiro): prepend placeholder user message when conversation starts with assistant role	2026-02-12 15:11:10 +08:00
y	086d8d0d0b	fix(kiro): prepend placeholder user message when conversation starts with assistant role Kiro/AmazonQ API requires the conversation history to start with a user message. Some clients (e.g., OpenClaw) send conversations starting with an assistant message, which is valid for the native Claude API but causes 'Improperly formed request' (400) on the Kiro endpoint. This fix detects when the first message has role=assistant and prepends a minimal placeholder user message ('.') to satisfy the Kiro API's message ordering requirement. Upstream error: {"message":"Improperly formed request.","reason":null} Verified: original request returns 400, fixed request returns 200.	2026-02-12 11:09:47 +08:00
Darley	55c3197fb8	fix(kiro): merge adjacent assistant messages while preserving tool_calls	2026-02-12 07:30:36 +08:00
Darley	5a2cf0d53c	fix: prevent merging assistant messages with tool_calls Adjacent assistant messages where any message contains tool_calls were being merged by MergeAdjacentMessages, causing tool_calls to be silently dropped. This led to orphaned tool results that could not match any toolUse in history, resulting in Kiro API returning 'Improperly formed request.' Now assistant messages with tool_calls are kept separate during merge, preserving the tool call chain integrity.	2026-02-12 01:53:40 +08:00
Luis Pater	2334a2b174	Merge branch 'router-for-me:main' into main	2026-02-11 21:09:34 +08:00
Luis Pater	7726a44ca2	Merge pull request #212 from Skyuno/fix/orphaned-tool-results fix(kiro): filter orphaned tool_results from compacted conversations	2026-02-11 21:06:20 +08:00
Luis Pater	dc55fb0ce3	Merge pull request #211 from Skyuno/fix/kiro-websearch fix(kiro): fully implement Kiro web search tool via MCP integration	2026-02-11 21:05:21 +08:00
Luis Pater	a146c6c0aa	Merge pull request #1523 from xxddff/feature/removeUserField fix(codex): remove unsupported 'user' field from /v1/responses payload	2026-02-11 20:38:16 +08:00
Luis Pater	cc8c4ffb5f	Merge branch 'router-for-me:main' into main	2026-02-11 15:07:06 +08:00
Luis Pater	1510bfcb6f	fix(translator): improve content handling for system and user messages - Added support for single and array-based `content` cases. - Enhanced `system_instruction` structure population logic. - Improved handling of user role assignment for string-based `content`.	2026-02-11 15:04:01 +08:00
Skyuno	09b19f5c4e	fix(kiro): filter orphaned tool_results from compacted conversations	2026-02-11 00:23:05 +08:00
Skyuno	7b01ca0e2e	fix(kiro): implement web search MCP integration for streaming and non-streaming paths Add complete web search functionality that routes pure web_search requests to the Kiro MCP endpoint instead of the normal GAR API. Executor changes (kiro_executor.go): - Add web_search detection in Execute() and ExecuteStream() entry points using HasWebSearchTool() to intercept pure web_search requests before normal processing - Add 'kiro' format passthrough in buildKiroPayloadForFormat() for pre-built payloads used by callKiroRawAndBuffer() - Implement handleWebSearchStream(): streaming search loop with MCP search -> InjectToolResultsClaude -> callKiroAndBuffer, supporting up to 5 search iterations with model-driven re-search - Implement handleWebSearch(): non-streaming path that performs single MCP search, injects tool results, calls normal Execute path, and appends server_tool_use indicators to response - Add helper methods: callKiroAndBuffer(), callKiroRawAndBuffer(), callKiroDirectStream(), sendFallbackText(), executeNonStreamFallback() Web search core logic (kiro_websearch.go) [NEW]: - Define MCP JSON-RPC 2.0 types (McpRequest, McpResponse, McpResult, McpContent, McpError) - Define WebSearchResults/WebSearchResult structs for parsing MCP search results - HasWebSearchTool(): detect pure web_search requests (single-tool array only) - ContainsWebSearchTool(): detect web_search in mixed-tool arrays - ExtractSearchQuery(): parse search query from Claude Code's tool_use message format - CreateMcpRequest(): build MCP tools/call request with Kiro-compatible ID format - InjectToolResultsClaude(): append assistant tool_use + user tool_result messages to Claude-format payload for GAR translation pipeline - InjectToolResults(): modify Kiro-format payload directly with toolResults in currentMessage context - InjectSearchIndicatorsInResponse(): prepend server_tool_use + web_search_tool_result content blocks to non-streaming response for Claude Code search count display - ReplaceWebSearchToolDescription(): swap restrictive Kiro tool description with minimal re-search-friendly version - StripWebSearchTool(): remove web_search from tools array - FormatSearchContextPrompt() / FormatToolResultText(): format search results for injection - SSE event generation: SseEvent type, GenerateWebSearchEvents() (11-event sequence), GenerateSearchIndicatorEvents() (server_tool_use + web_search_tool_result pairs) - Stream analysis: AnalyzeBufferedStream() to detect stop_reason and web_search tool_use in buffered chunks, FilterChunksForClient() to strip tool_use blocks and adjust indices, AdjustSSEChunk() / AdjustStreamIndices() for content block index offset management MCP API handler (kiro_websearch_handler.go) [NEW]: - WebSearchHandler struct with MCP endpoint, HTTP client, auth token, fingerprint, and custom auth attributes - FetchToolDescription(): sync.Once-guarded MCP tools/list call to cache web_search tool description - GetWebSearchDescription(): thread-safe cached description retrieval - CallMcpAPI(): MCP API caller with retry logic (exponential backoff, retryable on 502/503/504), AWS-aligned headers via setMcpHeaders() - ParseSearchResults(): extract WebSearchResults from MCP JSON-RPC response - setMcpHeaders(): set Content-Type, Kiro agent headers, dynamic fingerprint User-Agent, AWS SDK identifiers, Bearer auth, and custom auth attributes Claude request translation (kiro_claude_request.go): - Rename web_search -> remote_web_search in convertClaudeToolsToKiro() with dynamic description from GetWebSearchDescription() or hardcoded fallback - Rename web_search -> remote_web_search in BuildAssistantMessageStruct() for tool_use content blocks - Add remoteWebSearchDescription constant as fallback when MCP tools/list hasn't been fetched	2026-02-11 00:02:30 +08:00
Skyuno	fe6fc628ed	Revert "fix: filter out web_search/websearch tools unsupported by Kiro API" This reverts commit `5dc936a9a4`.	2026-02-10 22:24:46 +08:00
Skyuno	8192eeabc8	Revert "feat: inject web_search alternative hint instead of silently filtering" This reverts commit `3c7a5afdcc`.	2026-02-10 22:24:46 +08:00
xxddff	bb9fe52f1e	Update internal/translator/codex/openai/responses/codex_openai-responses_request_test.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-10 18:24:58 +09:00
xxddff	afe4c1bfb7	更新internal/translator/codex/openai/responses/codex_openai-responses_request.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-10 18:24:26 +09:00
xxddff	865af9f19e	Implement test for user field deletion Add test to verify deletion of user field in response	2026-02-10 17:38:49 +09:00
xxddff	2b97cb98b5	Delete 'user' field from raw JSON Remove the 'user' field from the raw JSON as requested.	2026-02-10 17:35:54 +09:00
hkfires	938a799263	feat(translator): support Claude thinking type adaptive	2026-02-10 16:20:32 +08:00
Luis Pater	a3dc56d2a0	Merge branch 'router-for-me:main' into main	2026-02-09 02:07:02 +08:00
Luis Pater	63643c44a1	Fixed: #1484 fix(translator): restructure message content handling to support multiple content types - Consolidated `input_text` and `output_text` handling into a single case. - Added support for processing `input_image` content with associated URLs.	2026-02-09 02:05:38 +08:00
Luis Pater	6b83585b53	Merge branch 'router-for-me:main' into main	2026-02-07 08:52:51 +08:00
hkfires	b7e4f00c5f	fix(translator): correct gemini-cli log prefix	2026-02-07 08:40:09 +08:00
Luis Pater	c20507c15e	Merge branch 'router-for-me:main' into main	2026-02-07 06:43:17 +08:00
Luis Pater	80b5e79e75	fix(translator): normalize and restrict `stop_reason`/`finish_reason` usage - Standardized the handling of `stop_reason` and `finish_reason` across Codex and Gemini responses. - Restricted pass-through of specific reasons (`max_tokens`, `stop`) for consistency. - Enhanced fallback logic for undefined reasons.	2026-02-07 02:07:51 +08:00
Luis Pater	d182e893b6	Merge pull request #194 from PancakeZik/fix/assistant-content-parroting fix: replace assistant placeholder text to prevent model parroting	2026-02-07 01:38:58 +08:00
Luis Pater	6abd7d27d9	Merge pull request #190 from taetaetae/fix/kiro-claude-compaction-current-user-empty-content fix(kiro): handle empty content in current user message for compaction	2026-02-07 01:33:01 +08:00
Luis Pater	77586ed7d3	Merge branch 'main' into plus	2026-02-07 01:31:21 +08:00
Joao	98edcad39d	fix: replace assistant placeholder text to prevent model parroting Kiro API requires non-empty content on assistant messages, so CLIProxyAPI injects placeholder text when assistant messages only contain tool_use blocks (no text). The previous placeholders were conversational phrases: - DefaultAssistantContentWithTools: "I'll help you with that." - DefaultAssistantContent: "I understand." In agentic sessions with many tool calls, these phrases appeared dozens of times in conversation history. Opus 4.6 (and likely other models) picked up on this pattern and started parroting "I'll help you with that." before every tool call in its actual responses. Fix: Replace both placeholders with a single dot ".", which satisfies Kiro's non-empty requirement without giving the model a phrase to mimic.	2026-02-06 16:42:21 +00:00
Luis Pater	1187aa8222	feat(translator): capture cached token count in usage metadata and handle prompt caching - Added support to extract and include `cachedContentTokenCount` in `usage.prompt_tokens_details`. - Logged warnings for failures to set cached token count for better debugging.	2026-02-06 21:28:40 +08:00
taetaetae	4e3bad3907	fix(kiro): handle empty content in current user message for compaction Problem: - PR #186 fixed empty content for assistant messages and history user messages - But current user message (isLastMessage == true) was not fixed - When user message contains only tool_result (no text), content becomes empty - This causes 'Improperly formed request' errors from Kiro API - Compaction requests from OpenCode commonly have this pattern Solution: - Move empty content check BEFORE the isLastMessage branch - Apply fallback content to ALL user messages, not just history - Add DefaultUserContentWithToolResults and DefaultUserContent constants Fixes compaction failures for OpenCode + Quotio + CLIProxyAPIPlus + Kiro stack	2026-02-06 11:58:43 +09:00
Luis Pater	e35ffaa925	Merge pull request #186 from taetaetae/fix/kiro-claude-compaction-empty-content fix(kiro): handle empty content in Claude format assistant messages	2026-02-06 05:39:41 +08:00
Luis Pater	74d9a1ffed	Merge branch 'router-for-me:main' into main	2026-02-06 03:27:27 +08:00
Luis Pater	a5a25dec57	refactor(translator, executor): remove redundant `bytes.Clone` calls for improved performance - Replaced all instances of `bytes.Clone` with direct references to enhance efficiency. - Simplified payload handling across executors and translators by eliminating unnecessary data duplication.	2026-02-06 03:26:29 +08:00
taetaetae	14f044ce4f	refactor: extract default assistant content to shared constants Apply code review feedback from gemini-code-assist: - Move fallback strings to kirocommon package as exported constants - Update kiro_claude_request.go to use shared constants - Update kiro_openai_request.go to use shared constants - Improves maintainability and avoids duplication	2026-02-05 23:36:57 +09:00
taetaetae	88872baffc	fix(kiro): handle empty content in Claude format assistant messages Problem: - PR #181 fixed empty content for OpenAI format (kiro_openai_request.go) - But Claude format (kiro_claude_request.go) was not fixed - OpenCode uses Claude format (/v1/messages endpoint) - When assistant messages have only tool_use (no text), content becomes empty - This causes 'Improperly formed request' errors from Kiro API Example of problematic message format: { "role": "assistant", "content": [ {"type": "tool_use", "id": "...", "name": "todowrite", "input": {...}} ] } Solution: - Add empty content fallback in BuildAssistantMessageStruct (Claude format) - Same fix as PR #181 applied to kiro_openai_request.go Fixes compaction failures for OpenCode + Quotio + CLIProxyAPIPlus + Kiro stack	2026-02-05 23:27:35 +09:00
Luis Pater	dbecf5330e	Merge pull request #181 from taetaetae/fix/kiro-compaction-tool-use-content fix(kiro): handle tool_use in content array for compaction requests	2026-02-05 20:17:32 +08:00
Luis Pater	6b6b343922	Merge branch 'main' into plus	2026-02-05 19:51:56 +08:00
Luis Pater	25c6b479c7	refactor(util, executor): optimize payload handling and schema processing - Replaced repetitive string operations with a centralized `escapeGJSONPathKey` function. - Streamlined handling of JSON schema cleaning for Gemini and Antigravity requests. - Improved payload management by transitioning from byte slices to strings for processing. - Removed unnecessary cloning of byte slices in several places.	2026-02-05 19:00:30 +08:00
Chén Mù	7cf9ff0345	Merge pull request #1429 from neavo/fix/gemini-python-sdk-thinking-fields fix(gemini): support snake_case thinking config fields from Python SDK	2026-02-05 14:32:58 +08:00
taetaetae	49ef22ab78	refactor: simplify inputMap initialization logic Apply code review feedback from gemini-code-assist: - Initialize inputMap upfront instead of using nested if blocks - Combine Exists() and IsObject() checks into single condition - Remove redundant nil check	2026-02-05 07:12:42 +09:00
taetaetae	ae4638712e	fix(kiro): handle tool_use in content array for compaction requests Problem: - PR #162 fixed empty string content but missed array content with tool_use - OpenCode's compaction requests send assistant messages with content as array - When content array contains only tool_use (no text), content becomes empty - This causes 'Improperly formed request' errors from Kiro API Example of problematic message format: { "role": "assistant", "content": [ {"type": "tool_use", "id": "...", "name": "todowrite", "input": {...}} ] } Solution: - Extract tool_use from content array (Anthropic/OpenCode format) - This is in addition to existing tool_calls handling (OpenAI format) - The empty content fallback from PR #162 will then work correctly Fixes compaction failures that persisted after PR #162 merge.	2026-02-05 07:08:14 +09:00
neavo	6c65fdf54b	fix(gemini): support snake_case thinking config fields from Python SDK Google official Gemini Python SDK sends thinking_level, thinking_budget, and include_thoughts (snake_case) instead of thinkingLevel, thinkingBudget, and includeThoughts (camelCase). This caused thinking configuration to be ignored when using Python SDK. Changes: - Extract layer: extractGeminiConfig now reads snake_case as fallback - Apply layer: Gemini/CLI/Antigravity appliers clean up snake_case fields - Translator layer: Gemini->OpenAI/Claude/Codex translators support fallback - Tests: Added 4 test cases for snake_case field coverage Fixes #1426	2026-02-04 21:12:47 +08:00

1 2 3 4 5 ...

424 Commits