CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-04 19:51:18 +00:00

Author	SHA1	Message	Date
VooDisss	511b8a992e	fix(codex): restore prompt cache continuity for Codex requests Prompt caching on Codex was not reliably reusable through the proxy because repeated chat-completions requests could reach the upstream without the same continuity envelope. In practice this showed up most clearly with OpenCode, where cache reads worked in the reference client but not through CLIProxyAPI, although the root cause is broader than OpenCode itself. The proxy was breaking continuity in several ways: executor-layer Codex request preparation stripped prompt_cache_retention, chat-completions translation did not preserve that field, continuity headers used a different shape than the working client behavior, and OpenAI-style Codex requests could be sent without a stable prompt_cache_key. When that happened, session_id fell back to a fresh random value per request, so upstream Codex treated repeated requests as unrelated turns instead of as part of the same cacheable context. This change fixes that by preserving caller-provided prompt_cache_retention on Codex execution paths, preserving prompt_cache_retention when translating OpenAI chat-completions requests to Codex, aligning Codex continuity headers to session_id, and introducing an explicit Codex continuity policy that derives a stable continuity key from the best available signal. The resolution order prefers an explicit prompt_cache_key, then execution session metadata, then an explicit idempotency key, then stable request-affinity metadata, then a stable client-principal hash, and finally a stable auth-ID hash when no better continuity signal exists. The same continuity key is applied to both prompt_cache_key in the request body and session_id in the request headers so repeated requests reuse the same upstream cache/session identity. 
The auth manager also keeps auth selection sticky for repeated request sequences, preventing otherwise-equivalent Codex requests from drifting across different upstream auth contexts and accidentally breaking cache reuse. To keep the implementation maintainable, the continuity resolution and diagnostics are centralized in a dedicated Codex continuity helper instead of being scattered across executor flow code. Regression coverage now verifies retention preservation, continuity-key precedence, stable auth-ID fallback, websocket parity, translator preservation, and auth-affinity behavior. Manual validation confirmed prompt cache reads now occur through CLIProxyAPI when using Codex via OpenCode, and the fix should also benefit other clients that rely on stable repeated Codex request continuity.	2026-03-27 17:49:29 +02:00
Luis Pater	e5166841db	Merge pull request #2310 from shellus/fix/claude-openai-system-top-level fix: preserve OpenAI system messages as Claude top-level system	2026-03-25 10:21:18 +08:00
hkfires	528b1a2307	feat(codex): pass through codex client identity headers	2026-03-25 08:48:18 +08:00
GeJiaXiang	09c92aa0b5	fix: keep a fallback turn for system-only Claude inputs	2026-03-24 13:54:25 +08:00
GeJiaXiang	8c67b3ae64	test: verify remaining user message after system merge	2026-03-24 13:47:52 +08:00
GeJiaXiang	000e4ceb4e	fix: map OpenAI system messages to Claude top-level system	2026-03-24 13:42:33 +08:00
Luis Pater	d475aaba96	Fixed: #2274 fix(translator): omit null content fields in Codex OpenAI tool call responses	2026-03-24 01:00:57 +08:00
Luis Pater	97c0487add	Merge pull request #2223 from cnrpman/fix/codex-responses-web-search-preview-compat fix: normalize web_search_preview for codex responses	2026-03-24 00:25:37 +08:00
Luis Pater	5d6cdccda0	Merge pull request #2268 from sususu98/fix/sanitize-tool-names fix(translator): sanitize tool names for Gemini function_declarations compatibility	2026-03-23 21:42:22 +08:00
Luis Pater	1b7f4ac3e1	Merge pull request #2252 from sususu98/fix/antigravity-empty-thought-text fix(antigravity): always include text field in thought parts to prevent Google 500	2026-03-23 21:41:25 +08:00
Luis Pater	afc1a5b814	Fixed: #2281 refactor(claude): centralize usage token calculation logic and add tests for cached token handling	2026-03-23 21:30:03 +08:00
sususu98	e8bb350467	fix: extend tool name sanitization to all remaining Gemini-bound translators Apply SanitizeFunctionName on request and RestoreSanitizedToolName on response for: gemini/claude, gemini/openai/chat-completions, gemini/openai/responses, antigravity/openai/chat-completions, gemini-cli/openai/chat-completions. Also update SanitizedToolNameMap to handle OpenAI format (tools[].function.name) in addition to Claude format (tools[].name).	2026-03-22 14:06:46 +08:00
sususu98	755ca75879	fix: address review feedback - init ToolNameMap eagerly, log collisions, add collision test	2026-03-22 13:24:03 +08:00
sususu98	2398ebad55	fix(translator): sanitize tool names for Gemini function_declarations compatibility Claude Code and MCP clients may send tool names containing characters invalid for Gemini's function_declarations (e.g. '/', '@', spaces). Sanitize on request via SanitizeFunctionName and restore original names on response for both antigravity/claude and gemini-cli/claude translators.	2026-03-22 13:10:53 +08:00
sususu	e005208d76	fix(antigravity): always include text field in thought parts to prevent Google 500 When Claude sends redacted thinking with empty text, the translator was omitting the "text" field from thought parts. Google Antigravity API requires this field, causing 500 "Unknown Error" responses. Verified: 129/129 error logs with empty thought → 500, 0/97 success logs had empty thought. After fix: 0 new "Unknown Error" 500s.	2026-03-20 18:59:25 +08:00
Junyi Du	d1df70d02f	chore: add codex builtin tool normalization logging	2026-03-20 14:08:37 +08:00
Luis Pater	2bd646ad70	refactor: replace `sjson.Set` usage with `sjson.SetBytes` to optimize mutable JSON transformations	2026-03-19 17:58:54 +08:00
Junyi Du	793840cdb4	fix: cover dated and nested codex web search aliases	2026-03-19 03:41:12 +08:00
Junyi Du	8f421de532	fix: handle sjson errors in codex tool normalization	2026-03-19 03:36:06 +08:00
Junyi Du	be2dd60ee7	fix: normalize web_search_preview for codex responses	2026-03-19 03:23:14 +08:00
Darley	9c6c3612a8	fix(claude): read disable_parallel_tool_use from tool_choice	2026-03-17 19:35:41 +08:00
Darley	19e1a4447a	fix(claude): honor disable_parallel_tool_use	2026-03-17 19:17:41 +08:00
Luis Pater	db63f9b5d6	Merge pull request #2162 from enieuwy/fix/responses-api-json-valid-check fix: validate JSON before raw-embedding function call outputs in Responses API	2026-03-16 18:42:31 +08:00
Luis Pater	25f6c4a250	Merge pull request #2158 from sususu98/fix/antigravity-functionresponse-name fix(antigravity): resolve empty functionResponse.name for toolu_* tool_use_id format	2026-03-16 18:39:40 +08:00
enieuwy	b24ae74216	fix: validate JSON before raw-embedding function call outputs in Responses API gjson.Parse() marks any string starting with { or [ as gjson.JSON type, even when the content is not valid JSON (e.g. macOS plist format, truncated tool results). This caused sjson.SetRaw to embed non-JSON content directly into the Gemini API request payload, producing 400 errors. Add json.Valid() check before using SetRaw to ensure only actually valid JSON is embedded raw. Non-JSON content now falls through to sjson.Set which properly escapes it as a JSON string. Fixes #2161	2026-03-16 15:29:18 +08:00
sususu98	ff03dc6a2c	fix(antigravity): resolve empty functionResponse.name for toolu_* tool_use_id format The Claude-to-Gemini translator derived function names by splitting tool_use_id on "-", which produced empty strings for IDs with exactly 2 segments (e.g. toolu_tool-<uuid>). Replace the string-splitting heuristic with a lookup map built from tool_use blocks during the main processing loop, with fallback to the raw ID on miss.	2026-03-16 11:18:29 +08:00
Luis Pater	b1dcff778c	Merge pull request #2141 from Muran-prog/fix/tool-calling-translation-2132 fix: skip empty assistant message in tool call translation (#2132)	2026-03-16 01:42:27 +08:00
Muran-prog	0b94d36c4a	test: use exact match for tool name assertion Address review feedback - drop function.name fallback and strings.Contains in favor of direct == comparison.	2026-03-14 21:45:28 +02:00
Muran-prog	c8cee6a209	fix: skip empty assistant message in tool call translation (#2132) When assistant has tool_calls but no text content, the translator emitted an empty message into the Responses API input array before function_call items. The API then couldn't match function_call_output to its function_call by call_id, returning: No tool output found for function call ... Only emit assistant messages that have content parts. Tool-call-only messages now produce function_call items directly. Added 9 tests for tool calling translation covering single/parallel calls, multi-turn conversations, name shortening, empty content edge cases, and call_id integrity.	2026-03-14 21:01:01 +02:00
Luis Pater	4b1a404fcb	Fixed: #1936 feat(translator): add image type handling in ConvertClaudeRequestToGemini	2026-03-15 02:18:28 +08:00
sususu98	b76b79068f	fix(gemini-cli): sanitize tool schemas and filter empty parts 1. Claude translator: add CleanJSONSchemaForGemini() to sanitize tool input schemas (removes $schema, anyOf, const, format, etc.) and delete eager_input_streaming from tool declarations. Remove fragile bytes.Replace for format:"uri" now covered by schema cleaner. 2. Gemini native translator: filter out content entries with empty or missing parts arrays to prevent Gemini API 400 error "required oneof field 'data' must have one initialized field". Both fixes align gemini-cli with protections already present in the antigravity translator.	2026-03-13 12:37:37 +08:00
Luis Pater	683f3709d6	Merge pull request #2076 from aikins01/fix/backfill-empty-function-response-names fix: backfill empty functionResponse.name from preceding functionCall	2026-03-12 10:35:44 +08:00
Aikins Laryea	a6c3042e34	refactor: remove redundant bounds checks per code review	2026-03-12 00:12:43 +00:00
Aikins Laryea	861537c9bd	fix: backfill empty functionResponse.name from preceding functionCall When Amp or Claude Code sends functionResponse with an empty name in Gemini conversation history, the Gemini API rejects the request with 400 "Name cannot be empty". This fix backfills empty names from the corresponding preceding functionCall parts using positional matching. Covers all three Gemini translator paths: gemini/gemini (direct API key), antigravity/gemini (OAuth), and gemini-cli/gemini (Gemini CLI). Also switches fixCLIToolResponse pending group matching from LIFO to FIFO to correctly handle multiple sequential tool call groups. Fixes #1903	2026-03-12 00:00:38 +00:00
Luis Pater	7b7b258c38	Fixed: #2022 test(translator): add tests for handling Claude system messages as string and array	2026-03-11 10:47:33 +08:00
Kirill Turanskiy	338321e553	fix: use camelCase systemInstruction in OpenAI-to-Gemini translators The Gemini v1internal (cloudcode-pa) and Antigravity Manager endpoints require camelCase "systemInstruction" in request JSON. The current snake_case "system_instruction" causes system prompts to be silently ignored when routing through these endpoints. Replace all "system_instruction" JSON keys with "systemInstruction" in chat-completions and responses request translators.	2026-03-08 15:59:13 +03:00
Luis Pater	4f48e5254a	Merge pull request #1957 from router-for-me/thinking fix(translator): pass through adaptive thinking effort	2026-03-08 20:46:58 +08:00
Luis Pater	38277c1ea6	Merge pull request #1875 from woqiqishi/fix/tool-use-id-sanitize fix: sanitize tool_use.id to comply with Claude API regex ^[a-zA-Z0-9_-]+$	2026-03-07 22:06:36 +08:00
Luis Pater	9cee8ef87b	Merge pull request #1684 from alexey-yanchenko/fix/input-audio-from-openai-to-antigravity fix: preserve input_audio content parts when proxying to Antigravity	2026-03-07 10:12:28 +08:00
Luis Pater	93fb841bcb	Fixed: #1670 test(translator): add unit tests for OpenAI to Claude requests and tool result handling - Introduced tests for converting OpenAI requests to Claude with text, base64 images, and URL images in tool results. - Refactored `convertClaudeToolResultContent` and related functionality to properly handle raw content with images and text. - Updated conversion logic to streamline image handling for both base64 and URL formats.	2026-03-07 09:25:22 +08:00
Luis Pater	2695a99623	fix(translator): conditionally remove `service_tier` from OpenAI response processing	2026-03-06 11:07:22 +08:00
hkfires	ce8cc1ba33	fix(translator): pass through adaptive thinking effort	2026-03-06 09:13:32 +08:00
Xu Hong	553d6f50ea	fix: sanitize tool_use.id to comply with Claude API regex ^[a-zA-Z0-9_-]+$ Add util.SanitizeClaudeToolID() to replace non-conforming characters in tool_use.id fields across all five response translators (gemini, codex, openai, antigravity, gemini-cli). Upstream tool names may contain dots or other special characters (e.g. "fs.readFile") that violate Claude's ID validation regex. The sanitizer replaces such characters with underscores and provides a generated fallback for empty IDs. Fixes #1872, Fixes #1849 Made-with: Cursor	2026-03-06 00:10:09 +08:00
Luis Pater	cc8dc7f62c	Merge branch 'main' into dev	2026-03-05 23:13:21 +08:00
Luis Pater	a3846ea513	Merge pull request #1870 from sususu98/fix/remove-instructions-restore cleanup(translator): remove leftover instructions restore in codex responses	2026-03-05 23:12:31 +08:00
Luis Pater	0e6bb076e9	fix(translator): comment out `service_tier` removal from OpenAI response processing	2026-03-05 22:49:38 +08:00
Luis Pater	4e1d09809d	Fixed: #1741 fix(translator): handle tool name mappings and improve tool call handling in OpenAI and Claude integrations	2026-03-05 22:24:50 +08:00
sususu98	68a6cabf8b	style: blank unused params in codex responses translator	2026-03-05 16:42:48 +08:00
sususu98	ac0e387da1	cleanup(translator): remove leftover instructions restore in codex responses The instructions restore logic was originally needed when the proxy injected custom instructions (per-model system prompts) into requests. Since `ac802a46` removed the injection system, the proxy no longer modifies instructions before forwarding. The upstream response's instructions field now matches the client's original value, making the restore a no-op. Also removes unused sjson import. Closes router-for-me/CLIProxyAPI#1868	2026-03-05 16:34:55 +08:00
Luis Pater	5850492a93	Fixed: #1548 test(translator): add unit tests for fallback logic in `ConvertCodexResponseToOpenAI` model assignment	2026-03-05 12:11:54 +08:00
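
Commit 553d6f50ea above describes replacing characters in tool_use.id that fall outside Claude's ID regex ^[a-zA-Z0-9_-]+$ with underscores, with a generated fallback for empty IDs. The following is only a minimal sketch of that idea using the Go standard library. The function name sanitizeToolID, the "toolu_" fallback prefix, and the random-hex fallback scheme are assumptions made for illustration and are not taken from the repository's util.SanitizeClaudeToolID.

```go
package main

import (
	"crypto/rand"
	"encoding/hex"
	"fmt"
	"regexp"
)

// claudeToolIDPattern is the validation regex quoted in the commit message.
var claudeToolIDPattern = regexp.MustCompile(`^[a-zA-Z0-9_-]+$`)

// invalidToolIDChars matches any single character outside the allowed set.
var invalidToolIDChars = regexp.MustCompile(`[^a-zA-Z0-9_-]`)

// sanitizeToolID is a hypothetical stand-in for the sanitizer the commit
// describes: disallowed characters become underscores, and an empty ID is
// replaced by a generated one (the fallback format here is an assumption).
func sanitizeToolID(id string) string {
	if id == "" {
		buf := make([]byte, 8)
		if _, err := rand.Read(buf); err != nil {
			// Fixed value only if the system random source is unavailable.
			return "toolu_fallback"
		}
		return "toolu_" + hex.EncodeToString(buf)
	}
	return invalidToolIDChars.ReplaceAllString(id, "_")
}

func main() {
	for _, id := range []string{"fs.readFile", "call:1/a b", "ok_id-1", ""} {
		clean := sanitizeToolID(id)
		fmt.Printf("%q -> %q (valid: %t)\n", id, clean, claudeToolIDPattern.MatchString(clean))
	}
}
```

One consequence of this approach is that two distinct upstream IDs can map to the same sanitized value (for example "a.b" and "a/b"), so a real implementation may need to account for collisions.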


378 Commits