CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-04-05 12:13:25 +00:00

Files

kunish 59af2c57b1 fix(copilot): reduce premium request inflation and enable thinking

This commit addresses three issues with Claude Code through GitHub
Copilot:

1. **Premium request inflation**: Responses API requests were missing
   Openai-Intent headers and proper defaults, causing Copilot to bill
   each tool-loop continuation as a new premium request. Fixed by adding
   isAgentInitiated() heuristic (checks for tool_result content or
   preceding assistant tool_use), applying Responses API defaults
   (store, include, reasoning.summary), and local tiktoken-based token
   counting to avoid extra API calls.

2. **Context overflow**: Claude Code's modelSupports1M() hardcodes
   opus-4-6 as 1M-capable, but Copilot only supports ~128K-200K.
   Fixed by stripping the context-1m-2025-08-07 beta from translated
   request bodies. Also forwards response headers in non-streaming
   Execute() and registers the GET /copilot-quota management API route.

3. **Thinking not working**: Add ThinkingSupport with level-based
   reasoning to Claude models in the static definitions. Normalize
   Copilot's non-standard 'reasoning_text' response field to
   'reasoning_content' before passing to the SDK translator. Use
   caller-provided context in CountTokens instead of Background().

2026-04-03 20:24:30 +08:00

handlers/management

Merge upstream v6.9.9 (PR #483 )

2026-04-02 21:31:21 +08:00

middleware

Refactor websocket logging and error handling

2026-04-02 17:30:51 +08:00

modules

Merge branch 'router-for-me:main' into main

2026-04-02 22:34:47 +08:00

server_test.go

feat: add /healthz endpoint and test coverage for health check

2026-04-02 21:56:27 +08:00

server.go

fix(copilot): reduce premium request inflation and enable thinking

2026-04-03 20:24:30 +08:00