CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-07 21:38:21 +00:00

Author	SHA1	Message	Date
VooDisss	62b17f40a1	refactor(codex): align continuity helpers with review feedback Align websocket continuity resolution with the HTTP Codex path, make auth-affinity principal keys use a stable string representation, and extract small helpers that remove duplicated continuity and affinity logic without changing the validated cache-hit behavior.	2026-03-27 18:11:57 +02:00
VooDisss	511b8a992e	fix(codex): restore prompt cache continuity for Codex requests Prompt caching on Codex was not reliably reusable through the proxy because repeated chat-completions requests could reach the upstream without the same continuity envelope. In practice this showed up most clearly with OpenCode, where cache reads worked in the reference client but not through CLIProxyAPI, although the root cause is broader than OpenCode itself. The proxy was breaking continuity in several ways: executor-layer Codex request preparation stripped prompt_cache_retention, chat-completions translation did not preserve that field, continuity headers used a different shape than the working client behavior, and OpenAI-style Codex requests could be sent without a stable prompt_cache_key. When that happened, session_id fell back to a fresh random value per request, so upstream Codex treated repeated requests as unrelated turns instead of as part of the same cacheable context. This change fixes that by preserving caller-provided prompt_cache_retention on Codex execution paths, preserving prompt_cache_retention when translating OpenAI chat-completions requests to Codex, aligning Codex continuity headers to session_id, and introducing an explicit Codex continuity policy that derives a stable continuity key from the best available signal. The resolution order prefers an explicit prompt_cache_key, then execution session metadata, then an explicit idempotency key, then stable request-affinity metadata, then a stable client-principal hash, and finally a stable auth-ID hash when no better continuity signal exists. The same continuity key is applied to both prompt_cache_key in the request body and session_id in the request headers so repeated requests reuse the same upstream cache/session identity. The auth manager also keeps auth selection sticky for repeated request sequences, preventing otherwise-equivalent Codex requests from drifting across different upstream auth contexts and accidentally breaking cache reuse. To keep the implementation maintainable, the continuity resolution and diagnostics are centralized in a dedicated Codex continuity helper instead of being scattered across executor flow code. Regression coverage now verifies retention preservation, continuity-key precedence, stable auth-ID fallback, websocket parity, translator preservation, and auth-affinity behavior. Manual validation confirmed prompt cache reads now occur through CLIProxyAPI when using Codex via OpenCode, and the fix should also benefit other clients that rely on stable repeated Codex request continuity.	2026-03-27 17:49:29 +02:00
hkfires	528b1a2307	feat(codex): pass through codex client identity headers	2026-03-25 08:48:18 +08:00
Luis Pater	0906aeca87	Merge pull request #2254 from clcc2019/main refactor: streamline usage reporting by consolidating record publishi…	2026-03-24 00:39:31 +08:00
Luis Pater	a000eb523d	Merge pull request #2213 from TTTPOB/ua-fix feat(claude): stabilize device fingerprint across mixed Claude Code and cloaked clients	2026-03-23 22:53:51 +08:00
dslife2025	0ed2d16596	Merge branch 'router-for-me:main' into main	2026-03-23 09:50:43 +08:00
clcc2019	c1bf298216	refactor: streamline usage reporting by consolidating record publishing logic - Introduced a new method `buildRecord` in `usageReporter` to encapsulate record creation, improving code readability and maintainability. - Added latency tracking to usage records, ensuring accurate reporting of request latencies. - Updated tests to validate the inclusion of latency in usage records and ensure proper functionality of the new reporting structure.	2026-03-20 19:44:26 +08:00
Luis Pater	2bd646ad70	refactor: replace `sjson.Set` usage with `sjson.SetBytes` to optimize mutable JSON transformations	2026-03-19 17:58:54 +08:00
tpob	52c1fa025e	fix(claude): learn official fingerprints after custom baselines	2026-03-19 13:59:41 +08:00
tpob	680105f84d	fix(claude): refresh cached fingerprint after baseline upgrades	2026-03-19 13:28:58 +08:00
tpob	f7069e9548	fix(claude): pin stabilized OS arch to baseline	2026-03-19 13:07:16 +08:00
tpob	8179d5a8a4	fix(claude): avoid racy fingerprint downgrades	2026-03-19 01:03:41 +08:00
tpob	6fa7abe434	fix(claude): keep configured baseline above older fingerprints	2026-03-19 01:02:04 +08:00
tpob	dd64adbeeb	fix(claude): preserve legacy user agent overrides	2026-03-19 00:03:09 +08:00
tpob	616d41c06a	fix(claude): restore legacy runtime OS arch fallback	2026-03-19 00:01:50 +08:00
tpob	e0e337aeb9	feat(claude): add switch for device profile stabilization	2026-03-18 19:31:59 +08:00
tpob	d52839fced	fix: stabilize claude device fingerprint	2026-03-18 18:46:54 +08:00
Zhenyu Qi	aec65e3be3	fix(openai_compat): add stream_options.include_usage for streaming usage tracking	2026-03-13 00:48:17 -07:00
Luis Pater	817cebb321	Merge pull request #2082 from router-for-me/antigravity Refactor Antigravity model handling and improve logging	2026-03-12 10:39:13 +08:00
hkfires	dea3e74d35	feat(antigravity): refactor model handling and remove unused code	2026-03-12 09:24:45 +08:00
Luis Pater	89d7be9525	Merge branch 'dev' into codex/custom-useragent-request	2026-03-11 22:55:50 +08:00
lang-911	70988d387b	Add Codex websocket header defaults	2026-03-11 00:34:57 -07:00
Luis Pater	ddaa9d2436	Fixed: #2034 feat(proxy): centralize proxy handling with `proxyutil` package and enhance test coverage - Added `proxyutil` package to simplify proxy handling across the codebase. - Refactored various components (`executor`, `cliproxy`, `auth`, etc.) to use `proxyutil` for consistent and reusable proxy logic. - Introduced support for "direct" proxy mode to explicitly bypass all proxies. - Updated tests to validate proxy behavior (e.g., `direct`, HTTP/HTTPS, and SOCKS5). - Enhanced YAML configuration documentation for proxy options.	2026-03-11 11:08:02 +08:00
hkfires	d1e3195e6f	feat(codex): register models by plan tier	2026-03-10 11:20:37 +08:00
Dominic Robinson	a1e0fa0f39	test(executor): cover string system prompt handling in checkSystemInstructionsWithMode	2026-03-09 12:40:27 +00:00
Dominic Robinson	5c9997cdac	fix: Preserve system prompt when sent as a string instead of content block array	2026-03-09 07:38:11 +00:00
hkfires	424711b718	fix(executor): use aiplatform base url for vertex api key calls	2026-03-08 20:13:12 +08:00
Luis Pater	1042489f85	Merge pull request #1893 from thebtf/fix/normalize-ttl-byte-preservation-mainline fix: preserve original JSON bytes in normalizeCacheControlTTL	2026-03-07 22:14:13 +08:00
Luis Pater	5ebc58fab4	refactor(executor): remove legacy `connCreateSent` logic and standardize `response.create` usage for all websocket events - Simplified connection logic by removing `connCreateSent` and related state handling. - Updated `buildCodexWebsocketRequestBody` to always use `response.create`. - Added unit tests to validate `response.create` behavior and beta header preservation. - Dropped unsupported `response.append` and outdated `response.done` event types.	2026-03-07 09:07:23 +08:00
Kirill Turanskiy	97fdd2e088	fix: preserve original JSON bytes in normalizeCacheControlTTL when no TTL change needed normalizeCacheControlTTL unconditionally re-serializes the entire request body through json.Unmarshal/json.Marshal even when no TTL normalization is needed. Go's json.Marshal randomizes map key order and HTML-escapes <, >, & characters (to \u003c, \u003e, \u0026), producing different raw bytes on every call. Anthropic's prompt caching uses byte-prefix matching, so any byte-level difference causes a cache miss. This means the ~119K system prompt and tools are re-processed on every request when routed through CPA. The fix adds a bool return to normalizeTTLForBlock to indicate whether it actually modified anything, and skips the marshal step in normalizeCacheControlTTL when no blocks were changed.	2026-03-05 22:28:01 +03:00
Luis Pater	8d44be858e	Merge pull request #1834 from DragonFSKY/fix/sse-streaming-accept-encoding fix(claude): extend gzip fix to SSE success path and header-absent compression (#1763)	2026-03-05 22:57:27 +08:00
Luis Pater	ac135fc7cb	Fixed: #1815 test(executor): add unit tests for prompt cache key generation in OpenAI `cacheHelper`	2026-03-05 22:49:23 +08:00
Luis Pater	fdbd4041ca	Fixed: #1531 fix(gemini): add `deprecated` to unsupported schema keywords Add `deprecated` to the list of unsupported schema metadata fields in Gemini and update tests to verify its removal.	2026-03-05 11:48:15 +08:00
DragonFSKY	419bf784ab	fix(claude): prevent compressed SSE streams and add magic-byte decompression fallback - Set Accept-Encoding: identity for SSE streams; upstream must not compress line-delimited SSE bodies that bufio.Scanner reads directly - Re-enforce identity after ApplyCustomHeadersFromAttrs to prevent auth attribute injection from re-enabling compression on the stream path - Add peekableBody type wrapping bufio.Reader for non-consuming magic-byte inspection of the first 4 bytes without affecting downstream readers - Detect gzip (0x1f 0x8b) and zstd (0x28 0xb5 0x2f 0xfd) by magic bytes when Content-Encoding header is absent, covering misbehaving upstreams - Remove if-Content-Encoding guard on all three error paths (Execute, ExecuteStream, CountTokens); unconditionally delegate to decodeResponseBody so magic-byte detection applies consistently to all response paths - Add 10 tests covering stream identity enforcement, compressed success bodies, magic-byte detection without headers, error path decoding, and auth attribute override prevention	2026-03-05 06:38:38 +08:00
hkfires	c44793789b	feat(thinking): add adaptive thinking support for Claude models Add support for Claude's "adaptive" and "auto" thinking modes using `output_config.effort`. Introduce support for new effort level "max" in adaptive thinking. Update thinking logic, validate model capabilities, and extend converters and handling to ensure compatibility with adaptive modes. Adjust static model data with supported levels and refine handling across translators and executors.	2026-03-03 09:05:31 +08:00
hkfires	9229708b6c	revert(executor): re-apply PR #1735 antigravity changes with cleanup	2026-03-02 19:30:32 +08:00
hkfires	914db94e79	refactor(headers): streamline User-Agent handling and introduce GeminiCLI versioning	2026-03-02 13:04:30 +08:00
hkfires	b907d21851	revert(executor): revert antigravity_executor.go changes from PR #1735	2026-03-02 12:54:15 +08:00
Luis Pater	d6cc976d1f	chore(executor): remove unused header scrubbing function	2026-03-02 03:40:54 +08:00
Luis Pater	8aa2cce8c5	Merge PR #1735 into dev with conflict resolution and fixes	2026-03-02 03:22:51 +08:00
Luis Pater	77b42c6165	fix(claude): handle `X-CPA-CLAUDE-1M` header and ensure proper beta merging logic	2026-03-01 21:39:33 +08:00
Luis Pater	1cbc4834e1	Merge pull request #1771 from edlsh/fix/claude-cache-control-1769 Fix Claude OAuth cache_control regressions and gzip error decoding	2026-03-01 20:17:22 +08:00
edlsh	76aa917882	Optimize cache-control JSON mutations in Claude executor	2026-02-28 22:47:04 -05:00
edlsh	6ac9b31e4e	Handle compressed error decode failures safely	2026-02-28 22:43:46 -05:00
edlsh	0ad3e8457f	Clarify cloaking system block cache-control comments	2026-02-28 22:34:14 -05:00
edlsh	444a47ae63	Fix Claude cache-control guardrails and gzip error decoding	2026-02-28 22:32:33 -05:00
hkfires	134f41496d	fix(antigravity): update model configurations and add new models for Antigravity	2026-03-01 10:05:29 +08:00
exe.dev user	b45343e812	fix(cloak): align outgoing requests with real Claude Code 2.1.63 fingerprint Captured and compared outgoing requests from CLIProxyAPI against real Claude Code 2.1.63 and fixed all detectable differences: Headers: - Update anthropic-beta to match 2.1.63: replace fine-grained-tool-streaming and prompt-caching-2024-07-31 with context-management-2025-06-27 and prompt-caching-scope-2026-01-05 - Remove X-Stainless-Helper-Method header (real Claude Code does not send it) - Update default User-Agent from "claude-cli/2.1.44 (external, sdk-cli)" to "claude-cli/2.1.63 (external, cli)" - Force Claude Code User-Agent for non-Claude clients to avoid leaking real client identity (e.g. curl, OpenAI SDKs) during cloaking Body: - Inject x-anthropic-billing-header as system[0] (matches real format) - Change system prompt identifier from "You are Claude Code..." to "You are a Claude agent, built on Anthropic's Claude Agent SDK." - Add cache_control with ttl:"1h" to match real request format - Fix user_id format: user_[64hex]_account_[uuid]_session_[uuid] (was missing account UUID) - Disable tool name prefix (set claudeToolPrefix to empty string) TLS: - Switch utls fingerprint from HelloFirefox_Auto to HelloChrome_Auto (closer to Node.js/OpenSSL used by real Claude Code) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-28 09:19:06 +00:00
maplelove	2baf35b3ef	fix(executor): bump antigravity UA to 1.19.6 and align image_gen payload	2026-02-27 14:09:37 +08:00
maplelove	846e75b893	feat(gemini): route gemini-3.1-flash-image identically to gemini-3-pro-image	2026-02-27 13:32:06 +08:00
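
The continuity-key resolution order described in commit 511b8a992e can be sketched roughly as follows. This is only an illustration of the stated precedence, not the repository's code: the `continuitySignals` type, its field names, the `resolveContinuityKey` and `stableHash` helpers, and the choice of SHA-256 for the hashed fallbacks are all assumptions made for this example.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// continuitySignals is a hypothetical container for the signals named in the
// commit message. Field names are illustrative, not taken from the project.
type continuitySignals struct {
	PromptCacheKey   string // explicit prompt_cache_key from the caller
	ExecutionSession string // execution session metadata
	IdempotencyKey   string // explicit idempotency key
	RequestAffinity  string // stable request-affinity metadata
	ClientPrincipal  string // identity of the calling client
	AuthID           string // identity of the selected upstream auth
}

// stableHash derives a deterministic key from a value, so the same input
// always maps to the same continuity key. SHA-256 is an assumption here.
func stableHash(prefix, value string) string {
	sum := sha256.Sum256([]byte(prefix + ":" + value))
	return hex.EncodeToString(sum[:16])
}

// resolveContinuityKey returns the first available signal in the order the
// commit message lists, plus a label naming which signal was used.
func resolveContinuityKey(s continuitySignals) (key, source string) {
	switch {
	case s.PromptCacheKey != "":
		return s.PromptCacheKey, "prompt_cache_key"
	case s.ExecutionSession != "":
		return s.ExecutionSession, "execution_session"
	case s.IdempotencyKey != "":
		return s.IdempotencyKey, "idempotency_key"
	case s.RequestAffinity != "":
		return s.RequestAffinity, "request_affinity"
	case s.ClientPrincipal != "":
		return stableHash("principal", s.ClientPrincipal), "client_principal"
	case s.AuthID != "":
		return stableHash("auth", s.AuthID), "auth_id"
	}
	return "", "none"
}

func main() {
	// An explicit prompt_cache_key wins over every other signal.
	key, source := resolveContinuityKey(continuitySignals{
		PromptCacheKey: "pk",
		AuthID:         "auth-1",
	})
	fmt.Println(key, source)

	// With only an auth ID, the key is a hash that repeats across requests.
	first, _ := resolveContinuityKey(continuitySignals{AuthID: "auth-1"})
	second, _ := resolveContinuityKey(continuitySignals{AuthID: "auth-1"})
	fmt.Println(first == second)
}
```

Per the commit message, the resolved key would then be written to both `prompt_cache_key` in the request body and `session_id` in the request headers, so the two never disagree.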


399 Commits