CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-03-21 16:40:22 +00:00

Author	SHA1	Message	Date
hkfires	5c84d69d42	feat(translator): map output_config.effort to adaptive thinking level in antigravity	2026-03-04 13:11:07 +08:00
Luis Pater	9f95b31158	fix(translator): enhance handling of mixed output content in Claude requests	2026-03-03 21:49:41 +08:00
hkfires	ce87714ef1	feat(thinking): normalize effort levels in adaptive thinking requests to prevent validation errors	2026-03-03 15:10:47 +08:00
hkfires	0452b869e8	feat(thinking): add HasLevel and MapToClaudeEffort functions for adaptive thinking support	2026-03-03 14:16:36 +08:00
hkfires	d2e5857b82	feat(thinking): enhance adaptive thinking support across models and update test cases	2026-03-03 13:00:24 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
hkfires	914db94e79	refactor(headers): streamline User-Agent handling and introduce GeminiCLI versioning	2026-03-02 13:04:30 +08:00
Luis Pater	8aa2cce8c5	Merge PR #1735 into dev with conflict resolution and fixes	2026-03-02 03:22:51 +08:00
hkfires	b148820c35	fix(translator): handle Claude thinking type "auto" like adaptive	2026-03-01 10:30:19 +08:00
Luis Pater	1ae994b4aa	fix(antigravity): adjust thinkingBudget default to 64000 and update model definitions for Claude	2026-03-01 09:39:39 +08:00
Luis Pater	5446cd2b02	Merge pull request #1761 from margbug01/fix/thinking-chain-display fix: support thinking.type=auto from Amp client and decouple thinking translation from unsigned history	2026-03-01 02:30:42 +08:00
margbug01	8de0885b7d	fix: support thinking.type="auto" from Amp client for Antigravity Claude models ## Problem When using Antigravity Claude models through CLIProxyAPI, the thinking chain (reasoning content) does not display in the Amp client. ## Root Cause The Amp client sends `thinking: {"type": "auto"}` in its requests, but `ConvertClaudeRequestToAntigravity` only handled `"enabled"` and `"adaptive"` types in its switch statement. The `"auto"` type was silently ignored, resulting in no `thinkingConfig` being set in the translated Gemini request. Without `thinkingConfig`, the Antigravity API returns responses without any thinking content. Additionally, the Antigravity API for Claude models does not support `thinkingBudget: -1` (auto mode sentinel). It requires a concrete positive budget value. The fix uses 128000 as the budget for "auto" mode, which `ApplyThinking` will then normalize to stay within the model's actual limits (e.g., capped to `maxOutputTokens - 1`). ## Changes ### internal/translator/antigravity/claude/antigravity_claude_request.go 1. Add "auto" case to the thinking type switch statement. Sets `thinkingBudget: 128000` and `includeThoughts: true`. The budget is subsequently normalized by `ApplyThinking` based on model-specific limits. 2. Add "auto" to hasThinking check so that interleaved thinking hints are injected for tool-use scenarios when Amp sends `thinking.type="auto"`. ### internal/registry/model_definitions_static_data.go 3. Add Thinking configuration for `claude-sonnet-4-6`, `claude-sonnet-4-5`, and `claude-opus-4-6` in `GetAntigravityModelConfig()` -- these were previously missing, causing `ApplyThinking` to skip thinking config entirely. ## Testing - Deployed to Railway test instance (cpa-thinking-test) - Verified via debug logging that: - Amp sends `thinking: {"type": "auto"}` - CPA now translates this to `thinkingConfig: {thinkingBudget: 128000, includeThoughts: true}` - `ApplyThinking` normalizes the budget to model-specific limits - Antigravity API receives the correct thinkingConfig Amp-Thread-ID: https://ampcode.com/threads/T-019ca511-710d-776d-a07c-4b750f871a93 Co-authored-by: Amp <amp@ampcode.com>	2026-03-01 02:18:43 +08:00
Luis Pater	a6ce5f36e6	Fixed: #1758 fix(codex): filter billing headers from system result text and update template logic	2026-03-01 01:45:35 +08:00
maplelove	68dd2bfe82	fix(translator): allow passthrough of custom generationConfig for all Gemini-like providers	2026-02-27 17:13:42 +08:00
Luis Pater	816fb4c5da	Merge pull request #1682 from sususu98/fix/tool-result-image-parts fix(antigravity): place tool_result images in functionResponse.parts and unify mimeType	2026-02-25 23:14:35 +08:00
Luis Pater	d24ea4ce2a	Merge pull request #1664 from ciberponk/pr/responses-compaction-compat feat: add codex responses compatibility for compaction payloads	2026-02-25 01:21:59 +08:00
Luis Pater	c3e12c5e58	Merge pull request #1654 from alexey-yanchenko/feature/pass-file-inputs Pass file input from /chat/completions and /responses to codex and claude	2026-02-24 05:53:11 +08:00
Luis Pater	1825fc7503	Merge pull request #1643 from alexey-yanchenko/fix/gemini-prompt-tokens Fix usage convertation from gemini response to openai format	2026-02-24 05:46:13 +08:00
sususu98	4e26182d14	fix(antigravity): place tool_result images in functionResponse.parts and unify mimeType Move base64 image data from Claude tool_result into functionResponse.parts as inlineData instead of outer sibling parts, preventing context bloat. Unify all inlineData field naming to camelCase mimeType across Claude, OpenAI, and Gemini translators. Add comprehensive edge case tests and Gemini-side regression test for functionResponse.parts preservation.	2026-02-23 13:38:21 +08:00
fan	afc8a0f9be	refactor: simplify context_management compatibility handling	2026-02-21 22:20:48 +08:00
ciberponk	d693d7993b	feat: support responses compaction payload compatibility for codex translator	2026-02-21 12:56:10 +08:00
Alexey Yanchenko	0cbfe7f457	Pass file input from /chat/completions and /responses to codex and claude	2026-02-20 10:25:44 +07:00
Kirill Turanskiy	1cc21cc45b	fix: prevent duplicate function call arguments when delta events precede done Non-spark codex models (gpt-5.3-codex, gpt-5.2-codex) stream function call arguments via multiple delta events followed by a done event. The done handler unconditionally emitted the full arguments, duplicating what deltas already streamed. This produced invalid double JSON that Claude Code couldn't parse, causing tool calls to fail with missing parameters and infinite retry loops. Add HasReceivedArgumentsDelta flag to track whether delta events were received. The done handler now only emits arguments when no deltas preceded it (spark models), while delta-based streaming continues to work for non-spark models.	2026-02-19 23:18:14 +03:00
Kirill Turanskiy	07cf616e2b	fix: handle response.function_call_arguments.done in codex→claude streaming translator Some Codex models (e.g. gpt-5.3-codex-spark) send function call arguments in a single "done" event without preceding "delta" events. The streaming translator only handled "delta" events, causing tool call arguments to be lost — resulting in empty tool inputs and infinite retry loops in clients like Claude Code. Emit the full arguments from the "done" event as a single input_json_delta so downstream clients receive the complete tool input.	2026-02-19 23:18:14 +03:00
TinyCoder	00822770ec	fix(antigravity): prevent invalid JSON when tool_result has no content sjson.SetRaw with an empty string produces malformed JSON (e.g. "result":}). This happens when a Claude tool_result block has no content field, causing functionResponseResult.Raw to be "". Guard against this by falling back to sjson.Set with an empty string only when .Raw is empty.	2026-02-19 17:08:39 +07:00
Alexey Yanchenko	b9ae4ab803	Fix usage convertation from gemini response to openai format	2026-02-19 15:34:59 +07:00
Kirill Turanskiy	5fa23c7f41	fix: handle tool call argument streaming in Codex→OpenAI translator The OpenAI Chat Completions translator was silently dropping response.function_call_arguments.delta and response.function_call_arguments.done Codex SSE events, meaning tool call arguments were never streamed incrementally to clients. Add proper handling mirroring the proven Claude translator pattern: - response.output_item.added: announce tool call (id, name, empty args) - response.function_call_arguments.delta: stream argument chunks - response.function_call_arguments.done: emit full args if no deltas - response.output_item.done: defensive fallback for backward compat State tracking via HasReceivedArgumentsDelta and HasToolCallAnnounced ensures no duplicate argument emission and correct behavior for models like codex-spark that skip delta events entirely.	2026-02-18 19:09:05 +03:00
Alexey Yanchenko	63d4de5eea	Pass cache usage from codex to openai chat completions	2026-02-15 12:04:15 +07:00
Luis Pater	a146c6c0aa	Merge pull request #1523 from xxddff/feature/removeUserField fix(codex): remove unsupported 'user' field from /v1/responses payload	2026-02-11 20:38:16 +08:00
Luis Pater	1510bfcb6f	fix(translator): improve content handling for system and user messages - Added support for single and array-based `content` cases. - Enhanced `system_instruction` structure population logic. - Improved handling of user role assignment for string-based `content`.	2026-02-11 15:04:01 +08:00
xxddff	bb9fe52f1e	Update internal/translator/codex/openai/responses/codex_openai-responses_request_test.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-10 18:24:58 +09:00
xxddff	afe4c1bfb7	更新internal/translator/codex/openai/responses/codex_openai-responses_request.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-10 18:24:26 +09:00
xxddff	865af9f19e	Implement test for user field deletion Add test to verify deletion of user field in response	2026-02-10 17:38:49 +09:00
xxddff	2b97cb98b5	Delete 'user' field from raw JSON Remove the 'user' field from the raw JSON as requested.	2026-02-10 17:35:54 +09:00
hkfires	938a799263	feat(translator): support Claude thinking type adaptive	2026-02-10 16:20:32 +08:00
Luis Pater	63643c44a1	Fixed: #1484 fix(translator): restructure message content handling to support multiple content types - Consolidated `input_text` and `output_text` handling into a single case. - Added support for processing `input_image` content with associated URLs.	2026-02-09 02:05:38 +08:00
hkfires	b7e4f00c5f	fix(translator): correct gemini-cli log prefix	2026-02-07 08:40:09 +08:00
Luis Pater	80b5e79e75	fix(translator): normalize and restrict `stop_reason`/`finish_reason` usage - Standardized the handling of `stop_reason` and `finish_reason` across Codex and Gemini responses. - Restricted pass-through of specific reasons (`max_tokens`, `stop`) for consistency. - Enhanced fallback logic for undefined reasons.	2026-02-07 02:07:51 +08:00
Luis Pater	1187aa8222	feat(translator): capture cached token count in usage metadata and handle prompt caching - Added support to extract and include `cachedContentTokenCount` in `usage.prompt_tokens_details`. - Logged warnings for failures to set cached token count for better debugging.	2026-02-06 21:28:40 +08:00
Luis Pater	a5a25dec57	refactor(translator, executor): remove redundant `bytes.Clone` calls for improved performance - Replaced all instances of `bytes.Clone` with direct references to enhance efficiency. - Simplified payload handling across executors and translators by eliminating unnecessary data duplication.	2026-02-06 03:26:29 +08:00
Luis Pater	25c6b479c7	refactor(util, executor): optimize payload handling and schema processing - Replaced repetitive string operations with a centralized `escapeGJSONPathKey` function. - Streamlined handling of JSON schema cleaning for Gemini and Antigravity requests. - Improved payload management by transitioning from byte slices to strings for processing. - Removed unnecessary cloning of byte slices in several places.	2026-02-05 19:00:30 +08:00
Chén Mù	7cf9ff0345	Merge pull request #1429 from neavo/fix/gemini-python-sdk-thinking-fields fix(gemini): support snake_case thinking config fields from Python SDK	2026-02-05 14:32:58 +08:00
neavo	6c65fdf54b	fix(gemini): support snake_case thinking config fields from Python SDK Google official Gemini Python SDK sends thinking_level, thinking_budget, and include_thoughts (snake_case) instead of thinkingLevel, thinkingBudget, and includeThoughts (camelCase). This caused thinking configuration to be ignored when using Python SDK. Changes: - Extract layer: extractGeminiConfig now reads snake_case as fallback - Apply layer: Gemini/CLI/Antigravity appliers clean up snake_case fields - Translator layer: Gemini->OpenAI/Claude/Codex translators support fallback - Tests: Added 4 test cases for snake_case field coverage Fixes #1426	2026-02-04 21:12:47 +08:00
dannycreations	3f9c9591bd	feat(gemini-cli): support image content in Claude request conversion - Add logic to handle `image` content type during request translation. - Map Claude base64 image data to Gemini's `inlineData` structure. - Support automatic extraction of `media_type` and `data` for image parts.	2026-02-04 11:00:37 +07:00
Luis Pater	259f586ff7	Fixed: #1398 fix(translator): use model group caching for client signature validation	2026-02-03 22:04:52 +08:00
Luis Pater	d885b81f23	Fixed: #1403 fix(translator): handle "input" field transformation for OpenAI responses	2026-02-03 21:49:30 +08:00
Luis Pater	fe6bffd080	fixed: #1407 fix(translator): adjust "developer" role to "user" and ignore unsupported tool types	2026-02-03 21:41:17 +08:00
hkfires	354f6582b2	fix(codex): convert system role to developer for codex input	2026-02-01 15:37:37 +08:00
hkfires	fe3ebe3532	docs(translator): update Codex Claude request transform docs	2026-02-01 14:55:41 +08:00
hkfires	ac802a4646	refactor(codex): remove codex instructions injection support	2026-02-01 14:33:31 +08:00

1 2 3 4 5 ...

321 Commits