CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-03-22 00:50:26 +00:00

Author	SHA1	Message	Date
Luis Pater	e3d8d726e6	Merge branch 'router-for-me:main' into main	2025-12-28 15:09:33 +08:00
Luis Pater	457924828a	Merge pull request #757 from ben-vargas/fix-thinking-toolchoice-conflict Fix: disable thinking when tool_choice forces tool use	2025-12-28 14:04:30 +08:00
Ben Vargas	aca2ef6359	Fix: disable thinking when tool_choice forces tool use Anthropic API does not allow extended thinking when tool_choice is set to "any" or a specific tool. This was causing 400 errors when using features like Amp's /handoff command which forces tool_choice. Added disableThinkingIfToolChoiceForced() that removes thinking config when incompatible tool_choice is detected, applied to both streaming and non-streaming paths. Fixes router-for-me/CLIProxyAPI#630	2025-12-27 16:31:37 -07:00
Luis Pater	0f51e73baa	Merge branch 'router-for-me:main' into main	2025-12-28 03:07:58 +08:00
Luis Pater	3a436e116a	feat(cliproxy): implement model aliasing and hashing for Codex configurations, enhance request routing logic, and normalize Codex model entries	2025-12-28 03:06:51 +08:00
Luis Pater	d06e2dc83c	Merge branch 'router-for-me:main' into main	2025-12-28 02:10:16 +08:00
leaph	6403ff4ec4	feat(iflow): add model-specific thinking configs for GLM-4.7 and MiniMax-M2.1 - GLM-4.7: Uses extra_body={"thinking": {"type": "enabled"}, "clear_thinking": false} - MiniMax-M2.1: Uses reasoning_split=true for OpenAI-style reasoning separation - Added preserveReasoningContentInMessages() to support re-injection of reasoning content in assistant message history for multi-turn conversations - Added ThinkingSupport to MiniMax-M2.1 model definition	2025-12-27 18:39:15 +01:00
Luis Pater	d35152bbef	Merge branch 'router-for-me:main' into main	2025-12-27 22:03:50 +08:00
Luis Pater	c281f4cbaf	Fixed: #747 fix(translators): rename and integrate `usageMetadata` as `cpaUsageMetadata` in Claude processing logic	2025-12-27 22:02:11 +08:00
Luis Pater	05f249d77f	Merge branch 'main' into plus	2025-12-26 12:14:35 +08:00
Luis Pater	3ce0d76aa4	feat(usage): add import/export functionality for usage statistics and enhance deduplication logic	2025-12-26 11:49:51 +08:00
Luis Pater	7551faff79	Merge branch 'main' into plus	2025-12-24 22:27:12 +08:00
Luis Pater	d3f4783a24	Merge pull request #57 from PancakeZik/my-idc-changes feat: add AWS Identity Center (IDC) authentication support	2025-12-24 17:20:01 +08:00
NguyenSiTrung	969c1a5b72	refactor: extract parseGeminiFamilyUsageDetail helper to reduce duplication	2025-12-24 10:22:31 +07:00
NguyenSiTrung	872339bceb	feat: add cached token parsing for Gemini API responses	2025-12-24 10:20:11 +07:00
Luis Pater	e592a57458	Merge branch 'router-for-me:main' into main	2025-12-24 04:25:06 +08:00
Luis Pater	7569320770	Merge branch 'dev' into fix/antigravity-prompt-caching	2025-12-24 03:49:46 +08:00
Luis Pater	f0365f0465	Merge branch 'main' into plus	2025-12-23 22:34:08 +08:00
Luis Pater	6d1e20e940	fix(claude_executor): update header logic for API key handling Refined header assignment to use `x-api-key` for Anthropic API requests, ensuring correct authorization behavior based on request attributes and URL validation.	2025-12-23 22:30:25 +08:00
Joao	98db5aabd0	feat: persist refreshed IDC tokens to auth file Add persistRefreshedAuth function to write refreshed tokens back to the auth file after inline token refresh. This prevents repeated token refreshes on every request when the token expires. Changes: - Add persistRefreshedAuth() to kiro_executor.go - Call persist after all token refresh paths (401, 403, pre-request) - Remove unused log import from sdk/auth/kiro.go	2025-12-23 10:00:14 +00:00
Joao	7fd98f3556	feat: add IDC auth support with Kiro IDE headers	2025-12-23 08:18:10 +00:00
Luis Pater	b1aecc2bf1	Merge branch 'router-for-me:main' into main	2025-12-23 02:49:37 +08:00
Luis Pater	83b90e106f	refactor(antigravity): add sandbox URL constant and update base URLs routine	2025-12-23 02:47:56 +08:00
Evan Nguyen	24e8e20b59	Merge branch 'main' into fix/antigravity-prompt-caching	2025-12-21 19:43:24 +07:00
Luis Pater	e755e567ea	Merge branch 'router-for-me:main' into main	2025-12-21 19:54:13 +08:00
Evan Nguyen	a87f09bad2	feat(antigravity): add session ID generation and mutex for random source	2025-12-21 17:50:41 +07:00
Luis Pater	63908869f6	Merge pull request #611 from soilSpoon/feature/antigravity feat(antigravity): Improve Claude model compatibility	2025-12-21 16:27:29 +08:00
이대희	4070c9de81	Remove interleaved-thinking header from requests Removes the addition of the "anthropic-beta: interleaved-thinking-2025-05-14" header for Claude thinking models when building HTTP requests. This prevents sending an experimental/feature flag header that is no longer required and avoids potential compatibility or routing issues with downstream services. Keeps request headers simpler and more standard.	2025-12-21 15:29:36 +09:00
이대희	1e9e4a86a2	Improve thinking/tool signature handling for Claude and Gemini requests Prefer cached signatures and avoid injecting dummy thinking blocks; instead remove unsigned thinking blocks and add a skip sentinel for tool calls without a valid signature. Generate stable session IDs from the first user message, apply schema cleaning only for Claude models, and reorder thinking parts so thinking appears first. For Gemini, remove thinking blocks and attach a skip sentinel to function calls. Simplify response handling by passing raw function args through (remove special Bash conversion). Update and add tests to reflect the new behavior. These changes prevent rejected dummy signatures, improve compatibility with Antigravity’s signature validation, provide more stable session IDs for conversation grouping, and make request/response translation more robust.	2025-12-21 15:15:50 +09:00
Luis Pater	5418bbc338	Merge branch 'router-for-me:main' into main	2025-12-20 23:40:09 +08:00
Ben Vargas	1231dc9cda	feat(antigravity): add payload config support to Antigravity executor Add applyPayloadConfig calls to all Antigravity executor paths (Execute, executeClaudeNonStream, ExecuteStream) to enable config.yaml payload overrides for Antigravity/Gemini-Claude models. This allows users to configure thinking budget and other parameters via payload.override in config.yaml for models like gemini-claude-opus-4-5*.	2025-12-19 22:30:44 -07:00
Luis Pater	843316ea7a	Merge branch 'router-for-me:main' into main	2025-12-19 22:24:26 +08:00
hkfires	2039062845	fix(gemini): add optional skip for gemini3 thinking conversion	2025-12-19 22:07:43 +08:00
evann	404546ce93	refactor(antigravity): regarding production endpoint caching	2025-12-19 16:36:54 +07:00
evann	9058d406a3	feat(antigravity): enhance prompt caching support and update agent version	2025-12-19 16:33:41 +07:00
이대희	b6ba15fcbd	fix(runtime/executor): Antigravity executor schema handling and Claude-specific headers	2025-12-19 10:28:23 +09:00
Luis Pater	0f646800f6	Merge branch 'router-for-me:main' into main	2025-12-18 08:36:59 +08:00
Luis Pater	13eb5268de	Merge pull request #582 from ben-vargas/fix-gemini-3-thinking-level feat: use thinkingLevel for Gemini 3 models per Google documentation	2025-12-18 07:19:37 +08:00
Ben Vargas	598f0af19b	fix: apply thinkingLevel from model suffix metadata for Gemini 3 The previous commit added thinkingLevel support but didn't apply it when the reasoning effort came from model name suffix (e.g., model(minimal)). This was because ResolveThinkingConfigFromMetadata returns nil for level-based models, bypassing the metadata application. Changes: - Add ApplyGemini3ThinkingLevelFromMetadata for standard Gemini API - Add ApplyGemini3ThinkingLevelFromMetadataCLI for CLI API format - Update gemini_cli_executor to apply Gemini 3 thinkingLevel from metadata - Update antigravity_executor to apply Gemini 3 thinkingLevel from metadata - Update aistudio_executor to apply Gemini 3 thinkingLevel from metadata - Add comprehensive test coverage for Gemini 3 thinkingLevel functions	2025-12-17 16:08:38 -07:00
Ben Vargas	a33f5d31fc	feat: use thinkingLevel for Gemini 3 models per Google documentation Per Google's official documentation, Gemini 3 models should use thinkingLevel (string) instead of thinkingBudget (number) for optimal performance. From Google's Gemini Thinking docs: "Use the thinkingLevel parameter with Gemini 3 models. While thinkingBudget is accepted for backwards compatibility, using it with Gemini 3 Pro may result in suboptimal performance." Changes: - Add model family detection functions (IsGemini3Model, IsGemini25Model, IsGemini3ProModel, IsGemini3FlashModel) - Add ApplyGeminiThinkingLevel and ApplyGeminiCLIThinkingLevel functions for applying thinkingLevel config - Add ValidateGemini3ThinkingLevel for model-specific level validation - Add ThinkingBudgetToGemini3Level for backward compatibility conversion - Update NormalizeGeminiThinkingBudget to convert budget to level for Gemini 3 models - Update ApplyDefaultThinkingIfNeeded to not set a default level for Gemini 3 (lets API use its dynamic default "high") - Update ConvertThinkingLevelToBudget to preserve thinkingLevel for Gemini 3 models - Add Levels field to all Gemini 3 model definitions: - Gemini 3 Pro: ["low", "high"] - Gemini 3 Flash: ["minimal", "low", "medium", "high"] Backward compatibility: - Gemini 2.5 models continue to use thinkingBudget as before - If thinkingBudget is provided for Gemini 3, it's converted to the appropriate thinkingLevel - Existing configurations continue to work	2025-12-17 15:28:20 -07:00
Ravens2121	54acd69e9d	Merge branch 'router-for-me:main' into master	2025-12-18 04:39:17 +08:00
Ravens2121	d687ee2777	feat(kiro): implement official reasoningContentEvent and improve metadat	2025-12-18 04:38:22 +08:00
Luis Pater	f7b17ee6ec	Merge pull request #36 from rezhajulio/feat/gpt-5.2 Add GPT-5.2 model support for GitHub Copilot	2025-12-18 03:16:25 +08:00
Luis Pater	408614c74c	Merge branch 'router-for-me:main' into main	2025-12-18 03:13:48 +08:00
Luis Pater	68a27772b3	feat(antigravity): enable token counting via API with resilient routing Introduces the capability to count tokens for Antigravity-backed requests. This implementation leverages the `countTokens` endpoint of the Antigravity API, replacing the prior unsupported stub. Key aspects of this update include: - API Integration: Direct integration with the Antigravity `countTokens` API, including necessary request payload translation and authentication. - Resilient Infrastructure: A fallback mechanism has been established, allowing the system to attempt connections across multiple Antigravity base URLs to ensure request success even in the event of temporary service interruptions. - Model Aliasing: Added mappings for `gemini-3-flash` and `gemini-3-flash-preview` to ensure compatibility with the latest model variants. - Robust Error Handling: Comprehensive error handling and logging are in place to manage failures during API interactions.	2025-12-18 03:12:46 +08:00
Luis Pater	10e0ea1309	Merge main into pr-39	2025-12-18 00:36:51 +08:00
Luis Pater	5fda6f8ef3	feat(antigravity): implement non-streaming execution for Claude model requests	2025-12-17 23:17:11 +08:00
Luis Pater	09923f654c	feat(antigravity): add streaming support for Claude model requests	2025-12-17 22:16:57 +08:00
이대희	1b8e538a77	feature: Improves Gemini JSON schema compatibility Enhances compatibility with the Gemini API by implementing a schema cleaning process. This includes: - Centralizing schema cleaning logic for Gemini in a dedicated utility function. - Converting unsupported schema keywords to hints within the description field. - Flattening complex schema structures like `anyOf`, `oneOf`, and type arrays to simplify the schema. - Handling streaming responses with empty tool names, which can occur in subsequent chunks after the initial tool use.	2025-12-17 17:10:53 +09:00
Rezha Julio	92c62bb2fb	Add GPT-5.2 model support for GitHub Copilot	2025-12-17 02:15:10 +07:00
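
Commit aca2ef6359 above describes removing the thinking config when tool_choice forces tool use ("any" or a specific tool). The Go sketch below illustrates that idea only; it is not the repository's disableThinkingIfToolChoiceForced(). The function name stripThinkingIfForced, the top-level "thinking" and "tool_choice" field names, and the "tool" type value for a specific tool are assumptions made for this example.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// stripThinkingIfForced is a hypothetical helper (not the project's code).
// It drops the top-level "thinking" field when tool_choice.type is "any"
// or "tool", and returns the payload unchanged otherwise.
func stripThinkingIfForced(payload []byte) ([]byte, error) {
	// RawMessage keeps every other field byte-for-byte, so numbers and
	// nested objects are not re-encoded.
	var req map[string]json.RawMessage
	if err := json.Unmarshal(payload, &req); err != nil {
		return nil, err
	}

	raw, ok := req["tool_choice"]
	if !ok {
		return payload, nil
	}

	var tc struct {
		Type string `json:"type"`
	}
	// A tool_choice that is not an object is treated as "not forced".
	if err := json.Unmarshal(raw, &tc); err != nil {
		return payload, nil
	}
	if tc.Type != "any" && tc.Type != "tool" {
		return payload, nil
	}

	delete(req, "thinking")
	return json.Marshal(req)
}

func main() {
	in := []byte(`{"model":"m","thinking":{"type":"enabled","budget_tokens":1024},"tool_choice":{"type":"any"}}`)
	out, err := stripThinkingIfForced(in)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(out))
}
```

According to the commit message, the real fix is applied on both the streaming and non-streaming paths; a helper like this would need to run before the request body is sent in each of them.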

298 Commits