CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-06 21:08:39 +00:00

Author	SHA1	Message	Date
Luis Pater	b468ca79c3	Merge branch 'dev' of github.com:router-for-me/CLIProxyAPI into dev	2026-04-01 03:09:03 +08:00
Luis Pater	d2c7e4e96a	refactor(runtime): move executor utilities to `helps` package and update references	2026-04-01 03:08:20 +08:00
Luis Pater	1c7003ff68	Merge pull request #2452 from Lucaszmv/fix-qwen-cli-v0.13.2 fix(qwen): update CLI simulation to v0.13.2 and adjust header casing	2026-04-01 02:44:27 +08:00
Lucaszmv	1b44364e78	fix(qwen): update CLI simulation to v0.13.2	2026-03-31 15:19:07 -03:00
Luis Pater	51fd58d74f	fix(codex): use normalizeCodexInstructions to set default instructions	2026-03-31 12:16:57 +08:00
Luis Pater	faae9c2f7c	Merge pull request #2422 from MonsterQiu/fix/codex-compact-instructions fix(codex): add default instructions for /responses/compact	2026-03-31 12:14:20 +08:00
Luis Pater	bc3a6e4646	Merge pull request #2434 from MonsterQiu/fix/codex-responses-null-instructions fix(codex): normalize null instructions for /responses requests	2026-03-31 12:01:21 +08:00
Luis Pater	b09b03e35e	Merge pull request #2424 from possible055/fix/websocket-transcript-replacement fix(openai): handle transcript replacement after websocket v2 compaction	2026-03-31 11:00:33 +08:00
Luis Pater	16231947e7	Merge pull request #2426 from xixiwenxuanhe/feature/antigravity-credits feat(antigravity): add AI credits quota fallback	2026-03-31 10:51:40 +08:00
MonsterQiu	39b9a38fbc	fix(codex): normalize null instructions across responses paths	2026-03-31 10:32:39 +08:00
MonsterQiu	bd855abec9	fix(codex): normalize null instructions for responses requests	2026-03-31 10:29:02 +08:00
Luis Pater	7c3c2e9f64	Merge pull request #2417 from CharTyr/fix/amp-streaming-thinking-regression fix(amp): 修复流式响应中 thinking block 被错误抑制导致的 TUI 空白回复	2026-03-31 10:12:13 +08:00
Luis Pater	c10f8ae2e2	Fixed: #2420 docs(readme): remove ProxyPal section from all README translations	2026-03-31 07:23:02 +08:00
xixiwenxuanhe	a0bf33eca6	fix(antigravity): preserve fallback and honor config gate	2026-03-31 00:14:05 +08:00
xixiwenxuanhe	88dd9c715d	feat(antigravity): add AI credits quota fallback	2026-03-30 23:58:12 +08:00
apparition	a3e21df814	fix(openai): avoid developer transcript resets - Narrow websocket transcript replacement detection to assistant outputs and function calls - Preserve existing merge behavior for follow-up developer messages without previous_response_id - Add a regression test covering mid-session developer message updates	2026-03-30 23:33:16 +08:00
MonsterQiu	d3b94c9241	fix(codex): normalize null instructions for compact requests	2026-03-30 22:58:05 +08:00
apparition	c1d7599829	fix(openai): handle transcript replacement after websocket compaction - Add shouldReplaceWebsocketTranscript() to detect historical model output in input - Add normalizeResponseTranscriptReplacement() for full transcript reset handling - Prevent duplicate stale turn-state when clients replace local history post-compaction - Avoid orphaned function_call items from incremental append on compact transcripts - Add unit tests for transcript replacement detection and state reset behavior	2026-03-30 22:44:58 +08:00
MonsterQiu	d11936f292	fix(codex): add default instructions for /responses/compact	2026-03-30 22:44:46 +08:00
Luis Pater	17363edf25	fix(auth): skip downtime for request-scoped 404 errors in model state management	2026-03-30 22:22:42 +08:00
CharTyr	279cbbbb8a	fix(amp): don't suppress thinking blocks in streaming mode Reverts the streaming thinking suppression introduced in `b15453c`. rewriteStreamEvent should only inject signatures and rewrite model names — suppressing thinking blocks in streaming mode breaks SSE index alignment and causes the Amp TUI to render empty responses on the second message onward (especially with model-mapped non-Claude providers like GPT-5.4). Non-streaming responses still suppress thinking when tool_use is present via rewriteModelInResponse.	2026-03-30 20:09:32 +08:00
Luis Pater	486cd4c343	Merge pull request #2409 from sususu98/fix/tool-use-pairing-break fix(antigravity): reorder model parts to prevent tool_use↔tool_result pairing breakage	2026-03-30 16:59:46 +08:00
sususu98	25feceb783	fix(antigravity): reorder model parts to prevent tool_use↔tool_result pairing breakage When a Claude assistant message contains [text, tool_use, text], the Antigravity API internally splits the model message at functionCall boundaries, creating an extra assistant turn between tool_use and the following tool_result. Claude then rejects with: tool_use ids were found without tool_result blocks immediately after Fix: extend the existing 2-way part reordering (thinking-first) to a 3-way partition: thinking → regular → functionCall. This ensures functionCall parts are always last, so Antigravity's split cannot insert an extra assistant turn before the user's tool_result. Fixes #989	2026-03-30 15:09:33 +08:00
Luis Pater	d26752250d	Merge pull request #2403 from CharTyr/clean-pr fix(amp): 修复Amp CLI 集成缺失/无效 signature 导致的 TUI 崩溃与上游 400 问题	2026-03-30 12:54:15 +08:00
CharTyr	b15453c369	fix(amp): address PR review - stream thinking suppression, SSE detection, test init - Call suppressAmpThinking in rewriteStreamEvent for streaming path - Handle nil return from suppressAmpThinking to skip suppressed events - Narrow looksLikeSSEChunk to line-prefix detection (HasPrefix vs Contains) - Initialize suppressedContentBlock map in test	2026-03-30 00:42:04 -04:00
CharTyr	04ba8c8bc3	feat(amp): sanitize signatures and handle stream suppression for Amp compatibility	2026-03-29 22:23:18 -04:00
Luis Pater	6570692291	Merge pull request #2400 from router-for-me/revert-2374-codex-cache-clean Revert "fix(codex): restore prompt cache continuity for Codex requests"	2026-03-29 22:19:39 +08:00
Luis Pater	13aa5b3375	Revert "fix(codex): restore prompt cache continuity for Codex requests"	2026-03-29 22:18:14 +08:00
Luis Pater	6d8de0ade4	feat(auth): implement weighted provider rotation for improved scheduling fairness	2026-03-29 13:49:01 +08:00
Luis Pater	1587ff5e74	Merge pull request #2389 from router-for-me/claude fix(claude): add default max_tokens for models	2026-03-29 13:03:20 +08:00
hkfires	f033d3a6df	fix(claude): enhance ensureModelMaxTokens to use registered max_completion_tokens and fallback to default	2026-03-29 13:00:43 +08:00
hkfires	145e0e0b5d	fix(claude): add default max_tokens for models	2026-03-29 12:46:00 +08:00
Luis Pater	9b7d7021af	docs(readme): update LingtrueAPI link in all README translations	2026-03-29 12:30:24 +08:00
Luis Pater	e41c22ef44	docs(readme): add LingtrueAPI sponsorship details to all README translations	2026-03-29 12:23:37 +08:00
Luis Pater	55271403fb	Merge pull request #2374 from VooDisss/codex-cache-clean fix(codex): restore prompt cache continuity for Codex requests	2026-03-28 21:16:51 +08:00
Luis Pater	36fba66619	Merge pull request #2371 from RaviTharuma/docs/provider-specific-routes docs: clarify provider-specific routing for aliased models	2026-03-28 21:11:29 +08:00
Luis Pater	b9b127a7ea	Merge pull request #2347 from edlsh/fix/codex-strip-stream-options fix(codex): strip stream_options from Responses API requests	2026-03-28 21:03:01 +08:00
Luis Pater	2741e7b7b3	Merge pull request #2346 from pjpjq/codex/fix-codex-capacity-retry fix(codex): Treat Codex capacity errors as retryable	2026-03-28 21:00:50 +08:00
Luis Pater	1767a56d4f	Merge pull request #2343 from kongkk233/fix/proxy-transport-defaults Preserve default transport settings for proxy clients	2026-03-28 20:58:24 +08:00
Luis Pater	779e6c2d2f	Merge pull request #2231 from 7RPH/fix/responses-stream-multi-tool-calls fix: preserve separate streamed tool calls in Responses API	2026-03-28 20:53:19 +08:00
Luis Pater	73c831747b	Merge pull request #2133 from DragonFSKY/fix/2061-stale-modelstates fix(auth): prevent stale runtime state inheritance from disabled auth entries	2026-03-28 20:50:57 +08:00
Luis Pater	10b824fcac	fix(security): validate auth file names to prevent unsafe input	2026-03-28 04:48:23 +08:00
VooDisss	e5d3541b5a	refactor(codex): remove stale affinity cleanup leftovers Drop the last affinity-related executor artifacts so the PR stays focused on the minimal Codex continuity fix set: stable prompt cache identity, stable session_id, and the executor-only behavior that was validated to restore cache reads.	2026-03-27 20:40:26 +02:00
VooDisss	79755e76ea	refactor(pr): remove forbidden translator changes Drop the chat-completions translator edits from this PR so the branch complies with the repository policy that forbids pull-request changes under internal/translator. The remaining PR stays focused on the executor-level Codex continuity fix that was validated to restore cache reuse.	2026-03-27 19:34:13 +02:00
VooDisss	35f158d526	refactor(pr): narrow Codex cache fix scope Remove the experimental auth-affinity routing changes from this PR so it stays focused on the validated Codex continuity fix. This keeps the prompt-cache repair while avoiding unrelated routing-policy concerns such as provider/model affinity scope, lifecycle cleanup, and hard-pin fallback semantics.	2026-03-27 19:06:34 +02:00
VooDisss	6962e09dd9	fix(auth): scope affinity by provider Keep sticky auth affinity limited to matching providers and stop persisting execution-session IDs as long-lived affinity keys so provider switching and normal streaming traffic do not create incorrect pins or stale affinity state.	2026-03-27 18:52:58 +02:00
VooDisss	4c4cbd44da	fix(auth): avoid leaking or over-persisting affinity keys Stop using one-shot idempotency keys as long-lived auth-affinity identifiers and remove raw affinity-key values from debug logs so sticky routing keeps its continuity benefits without creating avoidable memory growth or credential exposure risks.	2026-03-27 18:34:51 +02:00
VooDisss	26eca8b6ba	fix(codex): preserve continuity and safe affinity fallback Restore Claude continuity after the continuity refactor, keep auth-affinity keys out of upstream Codex session identifiers, and only persist affinity after successful execution so retries can still rotate to healthy credentials when the first auth fails.	2026-03-27 18:27:33 +02:00
VooDisss	62b17f40a1	refactor(codex): align continuity helpers with review feedback Align websocket continuity resolution with the HTTP Codex path, make auth-affinity principal keys use a stable string representation, and extract small helpers that remove duplicated continuity and affinity logic without changing the validated cache-hit behavior.	2026-03-27 18:11:57 +02:00
VooDisss	511b8a992e	fix(codex): restore prompt cache continuity for Codex requests Prompt caching on Codex was not reliably reusable through the proxy because repeated chat-completions requests could reach the upstream without the same continuity envelope. In practice this showed up most clearly with OpenCode, where cache reads worked in the reference client but not through CLIProxyAPI, although the root cause is broader than OpenCode itself. The proxy was breaking continuity in several ways: executor-layer Codex request preparation stripped prompt_cache_retention, chat-completions translation did not preserve that field, continuity headers used a different shape than the working client behavior, and OpenAI-style Codex requests could be sent without a stable prompt_cache_key. When that happened, session_id fell back to a fresh random value per request, so upstream Codex treated repeated requests as unrelated turns instead of as part of the same cacheable context. This change fixes that by preserving caller-provided prompt_cache_retention on Codex execution paths, preserving prompt_cache_retention when translating OpenAI chat-completions requests to Codex, aligning Codex continuity headers to session_id, and introducing an explicit Codex continuity policy that derives a stable continuity key from the best available signal. The resolution order prefers an explicit prompt_cache_key, then execution session metadata, then an explicit idempotency key, then stable request-affinity metadata, then a stable client-principal hash, and finally a stable auth-ID hash when no better continuity signal exists. The same continuity key is applied to both prompt_cache_key in the request body and session_id in the request headers so repeated requests reuse the same upstream cache/session identity. The auth manager also keeps auth selection sticky for repeated request sequences, preventing otherwise-equivalent Codex requests from drifting across different upstream auth contexts and accidentally breaking cache reuse. To keep the implementation maintainable, the continuity resolution and diagnostics are centralized in a dedicated Codex continuity helper instead of being scattered across executor flow code. Regression coverage now verifies retention preservation, continuity-key precedence, stable auth-ID fallback, websocket parity, translator preservation, and auth-affinity behavior. Manual validation confirmed prompt cache reads now occur through CLIProxyAPI when using Codex via OpenCode, and the fix should also benefit other clients that rely on stable repeated Codex request continuity.	2026-03-27 17:49:29 +02:00

1 2 3 4 5 ...

2109 Commits