CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-03-08 06:43:41 +00:00

Author	SHA1	Message	Date
CheesesNguyen	7fe1d102cb	fix: don't treat empty input as truncation for tools without required fields Tools like TaskList, TaskGet have no required parameters, so empty input is valid. Previously, the truncation detector flagged all empty inputs as truncated, causing these tools to be skipped and breaking the tool loop. Now only flag empty input as truncation when the tool has required fields defined in RequiredFieldsByTool.	2026-03-05 14:43:45 +07:00
CheesesNguyen	c51851689b	fix: remove SOFT_LIMIT_REACHED logic, tool compression, and fix bugs - Remove SOFT_LIMIT_REACHED marker injection in response path - Remove SOFT_LIMIT_REACHED detection logic in request path - Remove SOFT_LIMIT_REACHED streaming logic in executor - Remove tool_compression.go and related constants - Fix truncation_detector: string(rune(len)) producing Unicode char instead of decimal string - Fix WebSearchToolUseId being overwritten by non-web-search tools - Fix duplicate kiro entry in model_definitions.go comment - Add build output to .gitignore	2026-03-05 10:05:39 +07:00
Cyrus	030bf5e6c7	feat(kiro): add IDC auth and endpoint improvements, redesign fingerprint system - Add IAM Identity Center (IDC) authentication with CLI flags (--kiro-idc-login, --kiro-idc-start-url, --kiro-idc-region) and login flow - Add ProfileArn auto-fetching in Execute/ExecuteStream for imported IDC accounts - Simplify endpoint preference with map-based alias lookup and getAuthValue helper - Redesign fingerprint as global singleton with external config and per-account deterministic generation - Add StartURL and FingerprintConfig fields to Kiro config - Add AgentContinuationID/AgentTaskType support in Kiro translators - Add comprehensive tests for executor, fingerprint, SSO OIDC, and AWS helpers - Add CLI login documentation to README	2026-02-27 00:58:03 +08:00
Luis Pater	37dfea1d3f	Merge pull request #287 from possible055/main fix(kiro): support OR-group field matching in truncation detector	2026-02-25 01:02:49 +08:00
apparition	c785c1a3ca	fix(kiro): support OR-group field matching in truncation detector - Change RequiredFieldsByTool value type from []string to [][]string - Outer slice = AND (all groups required); inner slice = OR (any one satisfies) - Fix Bash entry to accept "cmd" or "command", resolving soft-truncation loop - Update findMissingRequiredFields logic and inline docs accordingly	2026-02-24 22:48:05 +08:00
Darley	6e634fe3f9	fix: filter out orphaned tool results from history and current context	2026-02-23 14:33:59 +08:00
DragonBaiMo	7c9c89dace	fix(kiro): keep thinking enabled across request formats	2026-02-20 20:34:40 +08:00
Skyuno	632a2fd2f2	refactor: align GenerateSearchIndicatorEvents return type with other event builders Change GenerateSearchIndicatorEvents to return [][]byte instead of []sseEvent for consistency with BuildFallbackTextEvents and other event building functions. Benefits: - Consistent API across all event generation functions - Eliminates intermediate sseEvent type conversion in caller - Simplifies usage by returning ready-to-send SSE byte slices This addresses the code quality feedback from PR #226 review.	2026-02-13 22:04:09 +08:00
Skyuno	5626637fbd	security: remove query content from web search logs to prevent PII leakage - Remove search query from iteration logs (Info level) - Remove query and toolUseId from analysis logs (Info level) - Remove query from non-stream result logs (Info level) - Remove query from tool injection logs (Info level) - Remove query from tool_use detection logs (Debug level) This addresses the security concern raised in PR #226 review about potential PII exposure in search query logs.	2026-02-13 22:04:09 +08:00
Skyuno	587371eb14	refactor: align web search with executor layer patterns Consolidate web search handler, SSE event generation, stream analysis, and MCP HTTP I/O into the executor layer. Merge the separate kiro_websearch_handler.go back into kiro_executor.go to align with the single-file-per-executor convention. Translator retains only pure data types, detection, and payload transformation. Key changes: - Move SSE construction (search indicators, fallback text, message_start) from translator to executor, consistent with streamToChannel pattern - Move MCP handler (callMcpAPI, setMcpHeaders, fetchToolDescription) from translator to executor alongside other HTTP I/O - Reuse applyDynamicFingerprint for MCP UA headers (eliminate duplication) - Centralize MCP endpoint URL via BuildMcpEndpoint in translator - Add atomic Set/GetWebSearchDescription for cross-layer tool desc cache - Thread context.Context through MCP HTTP calls for cancellation support - Thread usage reporter through all web search API call paths - Add token expiry pre-check before MCP/GAR calls - Clean up dead code (GenerateMessageID, webSearchAuthContext fp logic, ContainsWebSearchTool, StripWebSearchTool)	2026-02-13 22:04:09 +08:00
Luis Pater	d02df0141b	Merge pull request #224 from Buywatermelon/fix/kiro-assistant-first-message fix(kiro): prepend placeholder user message when conversation starts with assistant role	2026-02-12 15:11:10 +08:00
y	086d8d0d0b	fix(kiro): prepend placeholder user message when conversation starts with assistant role Kiro/AmazonQ API requires the conversation history to start with a user message. Some clients (e.g., OpenClaw) send conversations starting with an assistant message, which is valid for the native Claude API but causes 'Improperly formed request' (400) on the Kiro endpoint. This fix detects when the first message has role=assistant and prepends a minimal placeholder user message ('.') to satisfy the Kiro API's message ordering requirement. Upstream error: {"message":"Improperly formed request.","reason":null} Verified: original request returns 400, fixed request returns 200.	2026-02-12 11:09:47 +08:00
Darley	55c3197fb8	fix(kiro): merge adjacent assistant messages while preserving tool_calls	2026-02-12 07:30:36 +08:00
Darley	5a2cf0d53c	fix: prevent merging assistant messages with tool_calls Adjacent assistant messages where any message contains tool_calls were being merged by MergeAdjacentMessages, causing tool_calls to be silently dropped. This led to orphaned tool results that could not match any toolUse in history, resulting in Kiro API returning 'Improperly formed request.' Now assistant messages with tool_calls are kept separate during merge, preserving the tool call chain integrity.	2026-02-12 01:53:40 +08:00
Luis Pater	7726a44ca2	Merge pull request #212 from Skyuno/fix/orphaned-tool-results fix(kiro): filter orphaned tool_results from compacted conversations	2026-02-11 21:06:20 +08:00
Skyuno	09b19f5c4e	fix(kiro): filter orphaned tool_results from compacted conversations	2026-02-11 00:23:05 +08:00
Skyuno	7b01ca0e2e	fix(kiro): implement web search MCP integration for streaming and non-streaming paths Add complete web search functionality that routes pure web_search requests to the Kiro MCP endpoint instead of the normal GAR API. Executor changes (kiro_executor.go): - Add web_search detection in Execute() and ExecuteStream() entry points using HasWebSearchTool() to intercept pure web_search requests before normal processing - Add 'kiro' format passthrough in buildKiroPayloadForFormat() for pre-built payloads used by callKiroRawAndBuffer() - Implement handleWebSearchStream(): streaming search loop with MCP search -> InjectToolResultsClaude -> callKiroAndBuffer, supporting up to 5 search iterations with model-driven re-search - Implement handleWebSearch(): non-streaming path that performs single MCP search, injects tool results, calls normal Execute path, and appends server_tool_use indicators to response - Add helper methods: callKiroAndBuffer(), callKiroRawAndBuffer(), callKiroDirectStream(), sendFallbackText(), executeNonStreamFallback() Web search core logic (kiro_websearch.go) [NEW]: - Define MCP JSON-RPC 2.0 types (McpRequest, McpResponse, McpResult, McpContent, McpError) - Define WebSearchResults/WebSearchResult structs for parsing MCP search results - HasWebSearchTool(): detect pure web_search requests (single-tool array only) - ContainsWebSearchTool(): detect web_search in mixed-tool arrays - ExtractSearchQuery(): parse search query from Claude Code's tool_use message format - CreateMcpRequest(): build MCP tools/call request with Kiro-compatible ID format - InjectToolResultsClaude(): append assistant tool_use + user tool_result messages to Claude-format payload for GAR translation pipeline - InjectToolResults(): modify Kiro-format payload directly with toolResults in currentMessage context - InjectSearchIndicatorsInResponse(): prepend server_tool_use + web_search_tool_result content blocks to non-streaming response for Claude Code search count display - ReplaceWebSearchToolDescription(): swap restrictive Kiro tool description with minimal re-search-friendly version - StripWebSearchTool(): remove web_search from tools array - FormatSearchContextPrompt() / FormatToolResultText(): format search results for injection - SSE event generation: SseEvent type, GenerateWebSearchEvents() (11-event sequence), GenerateSearchIndicatorEvents() (server_tool_use + web_search_tool_result pairs) - Stream analysis: AnalyzeBufferedStream() to detect stop_reason and web_search tool_use in buffered chunks, FilterChunksForClient() to strip tool_use blocks and adjust indices, AdjustSSEChunk() / AdjustStreamIndices() for content block index offset management MCP API handler (kiro_websearch_handler.go) [NEW]: - WebSearchHandler struct with MCP endpoint, HTTP client, auth token, fingerprint, and custom auth attributes - FetchToolDescription(): sync.Once-guarded MCP tools/list call to cache web_search tool description - GetWebSearchDescription(): thread-safe cached description retrieval - CallMcpAPI(): MCP API caller with retry logic (exponential backoff, retryable on 502/503/504), AWS-aligned headers via setMcpHeaders() - ParseSearchResults(): extract WebSearchResults from MCP JSON-RPC response - setMcpHeaders(): set Content-Type, Kiro agent headers, dynamic fingerprint User-Agent, AWS SDK identifiers, Bearer auth, and custom auth attributes Claude request translation (kiro_claude_request.go): - Rename web_search -> remote_web_search in convertClaudeToolsToKiro() with dynamic description from GetWebSearchDescription() or hardcoded fallback - Rename web_search -> remote_web_search in BuildAssistantMessageStruct() for tool_use content blocks - Add remoteWebSearchDescription constant as fallback when MCP tools/list hasn't been fetched	2026-02-11 00:02:30 +08:00
Skyuno	fe6fc628ed	Revert "fix: filter out web_search/websearch tools unsupported by Kiro API" This reverts commit `5dc936a9a4`.	2026-02-10 22:24:46 +08:00
Skyuno	8192eeabc8	Revert "feat: inject web_search alternative hint instead of silently filtering" This reverts commit `3c7a5afdcc`.	2026-02-10 22:24:46 +08:00
Luis Pater	d182e893b6	Merge pull request #194 from PancakeZik/fix/assistant-content-parroting fix: replace assistant placeholder text to prevent model parroting	2026-02-07 01:38:58 +08:00
Joao	98edcad39d	fix: replace assistant placeholder text to prevent model parroting Kiro API requires non-empty content on assistant messages, so CLIProxyAPI injects placeholder text when assistant messages only contain tool_use blocks (no text). The previous placeholders were conversational phrases: - DefaultAssistantContentWithTools: "I'll help you with that." - DefaultAssistantContent: "I understand." In agentic sessions with many tool calls, these phrases appeared dozens of times in conversation history. Opus 4.6 (and likely other models) picked up on this pattern and started parroting "I'll help you with that." before every tool call in its actual responses. Fix: Replace both placeholders with a single dot ".", which satisfies Kiro's non-empty requirement without giving the model a phrase to mimic.	2026-02-06 16:42:21 +00:00
taetaetae	4e3bad3907	fix(kiro): handle empty content in current user message for compaction Problem: - PR #186 fixed empty content for assistant messages and history user messages - But current user message (isLastMessage == true) was not fixed - When user message contains only tool_result (no text), content becomes empty - This causes 'Improperly formed request' errors from Kiro API - Compaction requests from OpenCode commonly have this pattern Solution: - Move empty content check BEFORE the isLastMessage branch - Apply fallback content to ALL user messages, not just history - Add DefaultUserContentWithToolResults and DefaultUserContent constants Fixes compaction failures for OpenCode + Quotio + CLIProxyAPIPlus + Kiro stack	2026-02-06 11:58:43 +09:00
taetaetae	14f044ce4f	refactor: extract default assistant content to shared constants Apply code review feedback from gemini-code-assist: - Move fallback strings to kirocommon package as exported constants - Update kiro_claude_request.go to use shared constants - Update kiro_openai_request.go to use shared constants - Improves maintainability and avoids duplication	2026-02-05 23:36:57 +09:00
taetaetae	88872baffc	fix(kiro): handle empty content in Claude format assistant messages Problem: - PR #181 fixed empty content for OpenAI format (kiro_openai_request.go) - But Claude format (kiro_claude_request.go) was not fixed - OpenCode uses Claude format (/v1/messages endpoint) - When assistant messages have only tool_use (no text), content becomes empty - This causes 'Improperly formed request' errors from Kiro API Example of problematic message format: { "role": "assistant", "content": [ {"type": "tool_use", "id": "...", "name": "todowrite", "input": {...}} ] } Solution: - Add empty content fallback in BuildAssistantMessageStruct (Claude format) - Same fix as PR #181 applied to kiro_openai_request.go Fixes compaction failures for OpenCode + Quotio + CLIProxyAPIPlus + Kiro stack	2026-02-05 23:27:35 +09:00
taetaetae	49ef22ab78	refactor: simplify inputMap initialization logic Apply code review feedback from gemini-code-assist: - Initialize inputMap upfront instead of using nested if blocks - Combine Exists() and IsObject() checks into single condition - Remove redundant nil check	2026-02-05 07:12:42 +09:00
taetaetae	ae4638712e	fix(kiro): handle tool_use in content array for compaction requests Problem: - PR #162 fixed empty string content but missed array content with tool_use - OpenCode's compaction requests send assistant messages with content as array - When content array contains only tool_use (no text), content becomes empty - This causes 'Improperly formed request' errors from Kiro API Example of problematic message format: { "role": "assistant", "content": [ {"type": "tool_use", "id": "...", "name": "todowrite", "input": {...}} ] } Solution: - Extract tool_use from content array (Anthropic/OpenCode format) - This is in addition to existing tool_calls handling (OpenAI format) - The empty content fallback from PR #162 will then work correctly Fixes compaction failures that persisted after PR #162 merge.	2026-02-05 07:08:14 +09:00
Luis Pater	e2169e3987	Merge pull request #175 from Skyuno/fix/json-truncation-rework fix(kiro): Rework JSON Truncation Handling with SOFT_LIMIT_REACHED	2026-02-04 01:24:35 +08:00
Luis Pater	3354fae391	Merge pull request #162 from taetaetae/fix/kiro-compaction-empty-content fix(kiro): handle empty content in messages to prevent Bad Request errors	2026-02-04 01:01:48 +08:00
Skyuno	fcdd91895e	Merge remote-tracking branch 'upstream/main' into fix/json-truncation-rework	2026-02-03 20:28:32 +08:00
taetaetae	1f7c58f7ce	refactor: use constants for default assistant messages Apply code review feedback from gemini-code-assist: - Define default messages as local constants to improve maintainability - Avoid magic strings in the empty content handling logic	2026-02-03 07:10:38 +09:00
Skyuno	3c7a5afdcc	feat: inject web_search alternative hint instead of silently filtering	2026-02-02 05:19:06 +08:00
Skyuno	5dc936a9a4	fix: filter out web_search/websearch tools unsupported by Kiro API	2026-02-02 05:19:06 +08:00
Skyuno	ba168ec003	fix(kiro): skip _partial field (may contain hallucinated paths), add pwd hint for retry	2026-02-02 05:17:39 +08:00
Skyuno	a12e22c66f	Revert "Merge pull request #150 from PancakeZik/fix/write-tool-truncation-handling" This reverts commit `fd5b669c87`, reversing changes made to `30d832c9b1`.	2026-02-02 05:17:39 +08:00
taetaetae	b45ede0b71	fix(kiro): handle empty content in messages to prevent Bad Request errors Problem: - OpenCode's /compaction command and auto-compaction (at 80%+ context) sends requests that can result in empty assistant message content - Kiro API strictly requires non-empty content for all messages - This causes 'Bad Request: Improperly formed request' errors - After compaction failure, the malformed message stays in history, breaking all subsequent requests in the session Solution: - Add fallback content for empty assistant messages in buildAssistantMessageFromOpenAI() - Add history truncation (max 50 messages) to prevent oversized requests - This ensures all messages have valid content before sending to Kiro API Fixes issues with: - /compaction command returning Bad Request - Auto-compaction breaking sessions - Conversations becoming unresponsive after compaction failure	2026-02-01 15:47:18 +09:00
Joao	acdfa1c87f	fix: handle Write tool truncation when content exceeds API limits When the Kiro/AWS CodeWhisperer API receives a Write tool request with content that exceeds transmission limits, it truncates the tool input. This can result in: - Empty input buffer (no input transmitted at all) - Missing 'content' field in the parsed JSON - Incomplete JSON that fails to parse This fix detects these truncation scenarios and converts them to Bash tool calls that echo an error message. This allows Claude Code to execute the Bash command, see the error output, and the agent can then retry with smaller chunks. Changes: - kiro_claude_tools.go: Detect three truncation scenarios in ProcessToolUseEvent: 1. Empty input buffer (no input transmitted) 2. JSON parse failure with file_path but no content field 3. Successfully parsed JSON missing content field When detected, emit a special '__truncated_write__' marker tool use - kiro_executor.go: Handle '__truncated_write__' markers in streamToChannel: 1. Extract file_path from the marker for context 2. Create a Bash tool_use that echoes an error message 3. Include retry guidance (700-line chunks recommended) 4. Set hasToolUses=true to ensure stop_reason='tool_use' for agent continuation This ensures the agent continues and can retry with smaller file chunks instead of failing silently or showing errors to the user.	2026-01-29 12:22:55 +00:00
Darley	e3e741d0be	Default Claude tool input schema	2026-01-26 09:15:38 +08:00
Darley	7c7c5fd967	Fix Kiro tool schema defaults	2026-01-26 08:27:53 +08:00
Joao	6b074653f2	fix: prevent system prompt re-injection on subsequent turns When tool results are sent back to the model, the system prompt was being re-injected into the user message content, causing the model to think the user had pasted the system prompt again. This was especially noticeable after multiple tool uses. The fix checks if there is conversation history (len(history) > 0). If so, it's a subsequent turn and we skip system prompt injection. The system prompt is only injected on the first turn (len(history) == 0). This ensures: - First turn: system prompt is injected - Tool result turns: system prompt is NOT re-injected - New conversations: system prompt is injected fresh	2026-01-16 20:16:44 +00:00
ZqinKing	83e5f60b8b	fix(kiro): scale description compression by needed size Compute a size-reduction based keep ratio and use it to trim tool descriptions, avoiding forced minimum truncation when the target size already fits. This aligns compression with actual payload reduction needs and prevents over-compression.	2026-01-14 16:22:46 +08:00
ZqinKing	5b433f962f	feat(kiro): 实现动态工具压缩功能 ## 背景当 Claude Code 发送过多工具信息时，可能超出 Kiro API 请求限制导致 500 错误。现有的工具描述截断（KiroMaxToolDescLen = 10237）只能限制单个工具的描述长度，无法解决整体工具列表过大的问题。 ## 解决方案实现动态工具压缩功能，采用两步压缩策略： 1. 先检查原始大小，超过 20KB 才进行压缩 2. 第一步：简化 input_schema，只保留 type/enum/required 字段 3. 第二步：按比例缩短 description（最短 50 字符） 4. 保留全部工具和 skills 可调用，不丢弃任何工具 ## 新增文件 - internal/translator/kiro/claude/tool_compression.go - calculateToolsSize(): 计算工具列表的 JSON 序列化大小 - simplifyInputSchema(): 简化 input_schema，递归处理嵌套 properties - compressToolDescription(): 按比例压缩描述，支持 UTF-8 安全截断 - compressToolsIfNeeded(): 主压缩函数，实现两步压缩策略 - internal/translator/kiro/claude/tool_compression_test.go - 完整的单元测试覆盖所有新增函数 - 测试 UTF-8 安全性 - 测试压缩效果 ## 修改文件 - internal/translator/kiro/common/constants.go - 新增 ToolCompressionTargetSize = 20KB (压缩目标大小阈值) - 新增 MinToolDescriptionLength = 50 (描述最短长度) - internal/translator/kiro/claude/kiro_claude_request.go - 在 convertClaudeToolsToKiro() 函数末尾调用 compressToolsIfNeeded() ## 测试结果 - 70KB 工具压缩至 17KB (74.7% 压缩率) - 所有单元测试通过 ## 预期效果 - 80KB+ tools 压缩至 ~15KB - 不影响工具调用功能	2026-01-14 11:07:07 +08:00
TinyCoder	c169b32570	refactor(kiro): Remove unused variables in OpenAI translator Remove dead code that was never used: - toolCallIDToName map: built but never read from - seenToolCallIDs: declared but never populated, only suppressed with _	2025-12-24 15:10:35 +07:00
TinyCoder	36a512fdf2	fix(kiro): Handle tool results correctly in OpenAI format translation Fix three issues in Kiro OpenAI translator that caused "Improperly formed request" errors when processing LiteLLM-translated requests with tool_use/tool_result: 1. Skip merging tool role messages in MergeAdjacentMessages() to preserve individual tool_call_id fields 2. Track pendingToolResults and attach to the next user message instead of only the last message. Create synthetic user message when conversation ends with tool results. 3. Insert synthetic user message with tool results before assistant messages to maintain proper alternating user/assistant structure. This fixes the case where LiteLLM translates Anthropic user messages containing only tool_result blocks into tool role messages followed by assistant. Adds unit tests covering all tool result handling scenarios.	2025-12-24 15:10:35 +07:00
Ravens2121	d687ee2777	feat(kiro): implement official reasoningContentEvent and improve metadat	2025-12-18 04:38:22 +08:00
Ravens2121	f3d1cc8dc1	chore: change debug logs from INFO to DEBUG level	2025-12-16 05:32:03 +08:00
Ravens2121	e889efeda7	fix: add signature field to thinking blocks for non-streaming mode - Add generateThinkingSignature() function in kiro_claude_response.go	2025-12-16 05:21:49 +08:00
Ravens2121	0a3a95521c	feat: enhance thinking mode support for Kiro translator Changes:	2025-12-16 05:01:40 +08:00
Ravens2121	c3ed3b40ea	feat(kiro): Add token usage cross-validation and simplify thinking mode handling	2025-12-14 16:40:33 +08:00
Ravens2121	9c04c18c04	feat(kiro): enhance request translation and fix streaming issues English: - Fix <thinking> tag parsing: only parse at response start, avoid misinterpreting discussion text - Add token counting support using tiktoken for local estimation - Support top_p parameter in inference config - Handle max_tokens=-1 as maximum (32000 tokens) - Add tool_choice and response_format parameter handling via system prompt hints - Support multiple thinking mode detection formats (Claude API, OpenAI reasoning_effort, AMP/Cursor) - Shorten MCP tool names exceeding 64 characters - Fix duplicate [DONE] marker in OpenAI SSE streaming - Enhance token usage statistics with multiple event format support - Add code fence markers to constants 中文: - 修复 <thinking> 标签解析：仅在响应开头解析，避免误解析讨论文本中的标签 - 使用 tiktoken 实现本地 token 计数功能 - 支持 top_p 推理配置参数 - 处理 max_tokens=-1 转换为最大值（32000 tokens） - 通过系统提示词注入实现 tool_choice 和 response_format 参数支持 - 支持多种思考模式检测格式（Claude API、OpenAI reasoning_effort、AMP/Cursor） - 截断超过64字符的 MCP 工具名称 - 修复 OpenAI SSE 流中重复的 [DONE] 标记 - 增强 token 使用量统计，支持多种事件格式 - 添加代码围栏标记常量	2025-12-14 11:57:16 +08:00
Ravens2121	01cf221167	feat(kiro): 代码优化重构 + OpenAI翻译器实现	2025-12-14 06:58:50 +08:00

1 2

57 Commits