CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-18 04:14:28 +00:00

Author	SHA1	Message	Date
VooDisss	511b8a992e	fix(codex): restore prompt cache continuity for Codex requests Prompt caching on Codex was not reliably reusable through the proxy because repeated chat-completions requests could reach the upstream without the same continuity envelope. In practice this showed up most clearly with OpenCode, where cache reads worked in the reference client but not through CLIProxyAPI, although the root cause is broader than OpenCode itself. The proxy was breaking continuity in several ways: executor-layer Codex request preparation stripped prompt_cache_retention, chat-completions translation did not preserve that field, continuity headers used a different shape than the working client behavior, and OpenAI-style Codex requests could be sent without a stable prompt_cache_key. When that happened, session_id fell back to a fresh random value per request, so upstream Codex treated repeated requests as unrelated turns instead of as part of the same cacheable context. This change fixes that by preserving caller-provided prompt_cache_retention on Codex execution paths, preserving prompt_cache_retention when translating OpenAI chat-completions requests to Codex, aligning Codex continuity headers to session_id, and introducing an explicit Codex continuity policy that derives a stable continuity key from the best available signal. The resolution order prefers an explicit prompt_cache_key, then execution session metadata, then an explicit idempotency key, then stable request-affinity metadata, then a stable client-principal hash, and finally a stable auth-ID hash when no better continuity signal exists. The same continuity key is applied to both prompt_cache_key in the request body and session_id in the request headers so repeated requests reuse the same upstream cache/session identity. The auth manager also keeps auth selection sticky for repeated request sequences, preventing otherwise-equivalent Codex requests from drifting across different upstream auth contexts and accidentally breaking cache reuse. To keep the implementation maintainable, the continuity resolution and diagnostics are centralized in a dedicated Codex continuity helper instead of being scattered across executor flow code. Regression coverage now verifies retention preservation, continuity-key precedence, stable auth-ID fallback, websocket parity, translator preservation, and auth-affinity behavior. Manual validation confirmed prompt cache reads now occur through CLIProxyAPI when using Codex via OpenCode, and the fix should also benefit other clients that rely on stable repeated Codex request continuity.	2026-03-27 17:49:29 +02:00
Tam Nhu Tran	ea3e0b713e	fix: harden pooled model-support fallback state	2026-03-18 13:19:20 -04:00
Tam Nhu Tran	5135c22cd6	fix: fall back on model support errors during auth rotation	2026-03-18 12:43:45 -04:00
Luis Pater	b5701f416b	Fixed: #2102 fix(auth): ensure unique auth index for shared API keys across providers and credential identities	2026-03-15 02:48:54 +08:00
Luis Pater	ce53d3a287	Fixed: #1997 test(auth-scheduler): add benchmarks and priority-based scheduling improvements - Added `BenchmarkManagerPickNextMixedPriority500` for mixed-priority performance assessment. - Updated `pickNextMixed` to prioritize highest ready priority tiers. - Introduced `highestReadyPriorityLocked` and `pickReadyAtPriorityLocked` for better scheduling logic. - Added unit test to validate selection of highest priority tiers in mixed provider scenarios.	2026-03-09 22:27:15 +08:00
Luis Pater	f5941a411c	test(auth): cover scheduler refresh regression paths	2026-03-09 09:27:56 +08:00
DragonFSKY	90afb9cb73	fix(auth): new OAuth accounts invisible to scheduler after dynamic registration When new OAuth auth files are added while the service is running, `applyCoreAuthAddOrUpdate` calls `coreManager.Register()` (which upserts into the scheduler) BEFORE `registerModelsForAuth()`. At upsert time, `buildScheduledAuthMeta` snapshots `supportedModelSetForAuth` from the global model registry — but models haven't been registered yet, so the set is empty. With an empty `supportedModelSet`, `supportsModel()` always returns false and the new auth is never added to any model shard. Additionally, when all existing accounts are in cooldown, the scheduler returns `modelCooldownError`, but `shouldRetrySchedulerPick` only handles `Error` types — so the `syncScheduler` safety-net rebuild never triggers and the new accounts remain invisible. Fix: 1. Add `RefreshSchedulerEntry()` to re-upsert a single auth after its models are registered, rebuilding `supportedModelSet` from the now-populated registry. 2. Call it from `applyCoreAuthAddOrUpdate` after `registerModelsForAuth`. 3. Make `shouldRetrySchedulerPick` also match `modelCooldownError` so the full scheduler rebuild triggers when all credentials are cooling down — catching any similar stale-snapshot edge cases.	2026-03-09 03:11:47 +08:00
Luis Pater	2b134fc378	test(auth-scheduler): add unit tests and scheduler implementation - Added comprehensive unit tests for `authScheduler` and related components. - Implemented `authScheduler` with support for Round Robin, Fill First, and custom selector strategies. - Improved tracking of auth states, cooldowns, and recovery logic in scheduler.	2026-03-08 05:52:55 +08:00
chujian	a52da26b5d	fix(auth): stop draining stream pool goroutines after context cancellation	2026-03-07 18:30:33 +08:00
chujian	522a68a4ea	fix(openai-compat): retry empty bootstrap streams	2026-03-07 18:08:13 +08:00
chujian	a02eda54d0	fix(openai-compat): address review feedback	2026-03-07 17:39:42 +08:00
chujian	7c1299922e	fix(openai-compat): improve pool fallback and preserve adaptive thinking	2026-03-07 16:54:28 +08:00
Luis Pater	79009bb3d4	Fixed: #797 test(auth): add test for preserving ModelStates during auth updates	2026-03-04 02:06:24 +08:00
hkfires	532107b4fa	test(auth): add global model registry usage to conductor override tests	2026-03-03 09:18:56 +08:00
Luis Pater	cc1d8f6629	Fixed: #1747 feat(auth): add configurable max-retry-credentials for finer control over cross-credential retries	2026-03-01 02:42:36 +08:00
Luis Pater	27c68f5bb2	fix(auth): replace MarkResult with hook OnResult for result handling	2026-02-27 20:47:46 +08:00
Luis Pater	74bf7eda8f	Merge pull request #1686 from lyd123qw2008/fix/auth-refresh-concurrency-limit fix(auth): limit auto-refresh concurrency to prevent refresh storms	2026-02-27 05:59:20 +08:00
Luis Pater	aa1da8a858	Merge pull request #1685 from lyd123qw2008/fix/auth-auto-refresh-interval fix(auth): respect configured auto-refresh interval	2026-02-25 01:13:47 +08:00
Luis Pater	7cb398d167	Merge pull request #1663 from rensumo/main feat: implement credential-based round-robin for gemini-cli	2026-02-24 06:02:50 +08:00
Luis Pater	48732ba05e	Merge pull request #1527 from HEUDavid/feat/auth-hook feat(auth): add post-auth hook mechanism	2026-02-24 05:33:13 +08:00
lyd123qw2008	0aaf177640	fix(auth): limit auto-refresh concurrency to prevent refresh storms	2026-02-23 22:28:41 +08:00
lyd123qw2008	450d1227bd	fix(auth): respect configured auto-refresh interval	2026-02-23 22:07:50 +08:00
rensumo	5936f9895c	feat: implement credential-based round-robin for gemini-cli virtual auths Changes the RoundRobinSelector to use two-level round-robin when gemini-cli virtual auths are detected (via gemini_virtual_parent attr): - Level 1: cycle across credential groups (parent accounts) - Level 2: cycle within each group's project auths Credentials start from a random offset (rand.IntN) for fair distribution. Non-virtual auths and single-credential scenarios fall back to flat RR. Adds 3 test cases covering multi-credential grouping, single-parent fallback, and mixed virtual/non-virtual fallback.	2026-02-21 12:49:48 +08:00
Luis Pater	2789396435	fix: ensure connection-scoped headers are filtered in upstream requests - Added `connectionScopedHeaders` utility to respect "Connection" header directives. - Updated `FilterUpstreamHeaders` to remove connection-scoped headers dynamically. - Refactored and tested upstream header filtering with additional validations. - Adjusted upstream header handling during retries to replace headers safely.	2026-02-19 13:19:10 +08:00
Luis Pater	61da7bd981	Merge PR #1626 into codex/pr-1626	2026-02-19 04:49:14 +08:00
Luis Pater	252f7e0751	Merge pull request #1625 from thebtf/feat/tool-prefix-config feat: add per-auth tool_prefix_disabled option	2026-02-19 04:07:22 +08:00
Luis Pater	bb86a0c0c4	feat(logging, executor): add request logging tests and WebSocket-based Codex executor - Introduced unit tests for request logging middleware to enhance coverage. - Added WebSocket-based Codex executor to support Responses API upgrade. - Updated middleware logic to selectively capture request bodies for memory efficiency. - Enhanced Codex configuration handling with new WebSocket attributes.	2026-02-19 01:57:02 +08:00
Kirill Turanskiy	1f8f198c45	feat: passthrough upstream response headers to clients CPA previously stripped ALL response headers from upstream AI provider APIs, preventing clients from seeing rate-limit info, request IDs, server-timing and other useful headers. Changes: - Add Headers field to Response and StreamResult structs - Add FilterUpstreamHeaders helper (hop-by-hop + security denylist) - Add WriteUpstreamHeaders helper (respects CPA-set headers) - ExecuteWithAuthManager/ExecuteCountWithAuthManager now return headers - ExecuteStreamWithAuthManager returns headers from initial connection - All 11 provider executors populate Response.Headers - All handler call sites write filtered upstream headers before response Filtered headers (not forwarded): - RFC 7230 hop-by-hop: Connection, Transfer-Encoding, Keep-Alive, etc. - Security: Set-Cookie - CPA-managed: Content-Length, Content-Encoding	2026-02-18 00:16:22 +03:00
Kirill Turanskiy	9261b0c20b	feat: add per-auth tool_prefix_disabled option Allow disabling the proxy_ tool name prefix on a per-account basis. Users who route their own Anthropic account through CPA can set "tool_prefix_disabled": true in their OAuth auth JSON to send tool names unchanged to Anthropic. Default behavior is fully preserved — prefix is applied unless explicitly disabled. Changes: - Add ToolPrefixDisabled() accessor to Auth (reads metadata key "tool_prefix_disabled" or "tool-prefix-disabled") - Gate all 6 prefix apply/strip points with the new flag - Add unit tests for the accessor	2026-02-17 21:48:19 +03:00
Luis Pater	46a6782065	refactor(all): replace manual pointer assignments with `new` to enhance code readability and maintainability	2026-02-15 14:10:10 +08:00
HEUDavid	65debb874f	feat/auth-hook: refactor RequstInfo to preserve original HTTP semantics	2026-02-12 07:11:17 +08:00
HEUDavid	3caadac003	feat/auth-hook: add post auth hook [CR]	2026-02-12 07:11:17 +08:00
HEUDavid	94563d622c	feat/auth-hook: add post auth hook	2026-02-12 07:11:17 +08:00
hkfires	b7e4f00c5f	fix(translator): correct gemini-cli log prefix	2026-02-07 08:40:09 +08:00
Luis Pater	f410dd0440	Merge pull request #1390 from sususu98/fix/400-invalid-request-no-retry fix(auth): 400 invalid_request_error 立即返回不再重试	2026-02-07 03:14:25 +08:00
Luis Pater	eb5582c17c	Merge pull request #1386 from shenshuoyaoyouguang/sync-auth-changes fix(auth): normalize model key for thinking suffix in selectors	2026-02-07 03:12:01 +08:00
LTbinglingfeng	fc7b6ef086	fix(kimi): add OAuth model-alias channel support and cover OAuth excluded-models with tests	2026-02-07 01:16:39 +08:00
sususu98	233be6272a	fix(auth): 400 invalid_request_error 立即返回不再重试当上游返回 400 Bad Request 且错误消息包含 invalid_request_error 时，表示请求本身格式错误，切换账户不会改变结果。修改： - 添加 isRequestInvalidError 判定函数 - 内层循环遇到此错误立即返回，不遍历其他账户 - 外层循环不再对此类错误进行重试	2026-02-02 17:35:51 +08:00
chujian	47cb52385e	sdk/cliproxy/auth: update selector tests	2026-02-02 05:26:04 +08:00
Luis Pater	e93e05ae25	refactor: consolidate channel send logic with context-safe handlers Optimize channel operations by introducing reusable context-aware send functions (`send` and `sendErr`) across `wsrelay`, `handlers`, and `cliproxy`. Ensure graceful handling of canceled contexts during stream operations.	2026-01-28 10:58:35 +08:00
Luis Pater	70897247b2	feat(auth): add support for request_retry and disable_cooling overrides Implement `request_retry` and `disable_cooling` metadata overrides for authentication management. Update retry and cooling logic accordingly across `Manager`, Antigravity executor, and file synthesizer. Add tests to validate new behaviors.	2026-01-26 21:59:08 +08:00
Luis Pater	9c341f5aa5	feat(auth): add skip persistence context key for file watcher events Introduce `WithSkipPersist` to disable persistence during Manager Update/Register calls, preventing write-back loops caused by redundant file writes. Add corresponding tests and integrate with existing file store and conductor logic.	2026-01-26 18:20:19 +08:00
Luis Pater	c32e2a8196	fix(auth): handle context cancellation in executor methods	2026-01-24 04:56:55 +08:00
Chén Mù	f353a54555	Merge pull request #1171 from router-for-me/auth refactor(auth): remove unused provider execution helpers	2026-01-23 19:43:42 +08:00
Chén Mù	1d6e2e751d	Merge pull request #1140 from sxjeru/main fix(auth): handle quota cooldown in retry logic for transient errors	2026-01-23 19:43:17 +08:00
hkfires	cc50b63422	refactor(auth): remove unused provider execution helpers	2026-01-23 19:12:55 +08:00
hkfires	81b369aed9	fix(auth): include requested model in executor metadata	2026-01-23 18:30:08 +08:00
sxjeru	30a59168d7	fix(auth): handle quota cooldown in retry logic for transient errors	2026-01-21 21:48:23 +08:00
hkfires	fe5b3c80cb	refactor(config): rename oauth-model-mappings to oauth-model-alias	2026-01-15 18:03:26 +08:00
hkfires	8bc6df329f	fix(auth): apply API key model mapping to request model	2026-01-15 13:06:41 +08:00

1 2 3

119 Commits