CLIProxyAPIPlus

mirror of https://github.com/router-for-me/CLIProxyAPIPlus.git synced 2026-03-21 16:40:22 +00:00

Author	SHA1	Message	Date
Luis Pater	6a4cff6699	Merge branch 'router-for-me:main' into main	2025-12-14 17:28:28 +08:00
Luis Pater	d5310a3300	Merge pull request #531 from AoaoMH/feature/auth-model-check feat: add API endpoint to query models for auth credentials	2025-12-14 16:46:43 +08:00
Luis Pater	b80c2aabb0	Merge branch 'router-for-me:main' into main	2025-12-14 16:19:29 +08:00
Luis Pater	f0a3eb574e	fix(registry): update DeepSeek model definitions with new IDs and descriptions	2025-12-14 16:17:11 +08:00
Test	bb15855443	feat: add API endpoint to query models for auth credentials	2025-12-14 15:16:26 +08:00
Ravens2121	db80b20bc2	feat(kiro): enhance thinking support and fix truncation issues - Thinking Support: - Enabled thinking support for all Kiro Claude models, including Haiku 4.5 and agentic variants. - Updated `model_definitions.go` with thinking configuration (Min: 1024, Max: 32000, ZeroAllowed: true). - Fixed `extended_thinking` field names in `model_registry.go` (from `min_budget`/`max_budget` to `min`/`max`) to comply with Claude API specs, enabling thinking control in clients like Claude Code. - Kiro Executor Fixes: - Fixed `budget_tokens` handling: explicitly disable thinking when budget is 0 or negative. - Removed aggressive duplicate content filtering logic that caused truncation/data loss. - Enhanced thinking tag parsing with `extractThinkingFromContent` to correctly handle interleaved thinking/text blocks. - Added EOF handling to flush pending thinking tag characters, preventing data loss at stream end. - Performance: - Optimized Claude stream handler (v6.2) with reduced buffer size (4KB) and faster flush interval (50ms) to minimize latency and prevent timeouts.	2025-12-13 03:57:13 +08:00
Ravens2121	fdeb84db2b	Merge branch 'router-for-me:main' into master	2025-12-12 13:44:07 +08:00
Ravens2121	204bba9dea	refactor(kiro): update Kiro executor to use CodeWhisperer endpoint and improve tool calling support	2025-12-12 09:27:30 +08:00
Luis Pater	35fdd7bc05	Merge branch 'router-for-me:main' into main	2025-12-12 08:54:36 +08:00
Ben Vargas	b09e2115d1	fix(models): add "none" reasoning effort level to gpt-5.2 Per OpenAI API documentation, gpt-5.2 supports reasoning_effort values of "none", "low", "medium", "high", and "xhigh". The "none" level was missing from the model definition. Reference: https://platform.openai.com/docs/api-reference/chat/create#chat_create-reasoning_effort	2025-12-11 15:26:23 -07:00
Luis Pater	218dc17713	Merge branch 'router-for-me:main' into main	2025-12-12 03:03:36 +08:00
Luis Pater	cd2da152d4	feat(models): add GPT 5.2 model definition and prompts	2025-12-12 03:02:27 +08:00
Luis Pater	4ce7c61a17	Merge branch 'main' into plus	2025-12-11 21:33:49 +08:00
hkfires	007572b58e	fix(util): do not strip thinking suffix on registered models NormalizeThinkingModel now checks ModelSupportsThinking before removing "-thinking" or "-thinking-<ver>", avoiding accidental parsing of model names where the suffix is part of the official id (e.g., kimi-k2-thinking, qwen3-235b-a22b-thinking-2507). The registry adds ThinkingSupport metadata for several models and propagates it via ModelInfo (e.g., kimi-k2-thinking, deepseek-r1, qwen3-235b-a22b-thinking-2507, minimax-m2), enabling accurate detection of thinking-capable models and correcting base model inference.	2025-12-11 15:52:14 +08:00
hkfires	a03d514095	feat(registry): add thinking metadata for models	2025-12-11 11:28:44 +08:00
Luis Pater	4360ed8a7b	Merge branch 'router-for-me:main' into main	2025-12-11 03:17:55 +08:00
Luis Pater	423ce97665	feat(util): implement dynamic thinking suffix normalization and refactor budget resolution logic - Added support for parsing and normalizing dynamic thinking model suffixes. - Centralized budget resolution across executors and payload helpers. - Retired legacy Gemini-specific thinking handlers in favor of unified logic. - Updated executors to use metadata-based thinking configuration. - Added `ResolveOriginalModel` utility for resolving normalized upstream models using request metadata. - Updated executors (Gemini, Codex, iFlow, OpenAI, Qwen) to incorporate upstream model resolution and substitute model values in payloads and request URLs. - Ensured fallbacks handle cases with missing or malformed metadata to derive models robustly. - Refactored upstream model resolution to dynamically incorporate metadata for selecting and normalizing models. - Improved handling of thinking configurations and model overrides in executors. - Removed hardcoded thinking model entries and migrated logic to metadata-based resolution. - Updated payload mutations to always include the resolved model.	2025-12-11 03:10:50 +08:00
fuko2935	a594338bc5	fix(registry): remove unstable kiro-auto model - Removes kiro-auto from static model registry - Removes kiro-auto mapping from executor - Fixes compatibility issues reported in #7 Fixes #7	2025-12-09 19:14:40 +03:00
Luis Pater	1fd1ccca17	Merge branch 'router-for-me:main' into main	2025-12-09 21:13:08 +08:00
hkfires	3cfe7008a2	fix(registry): update gpt 5.1 model names	2025-12-09 17:55:21 +08:00
hkfires	e5312fb5a2	feat(antigravity): support canonical names for antigravity models	2025-12-09 16:54:13 +08:00
Luis Pater	ab9e9442ec	v6.5.56 (#12) * feat(api): add comprehensive ampcode management endpoints Add new REST API endpoints under /v0/management/ampcode for managing ampcode configuration including upstream URL, API key, localhost restriction, model mappings, and force model mappings settings. - Move force-model-mappings from config_basic to config_lists - Add GET/PUT/PATCH/DELETE endpoints for all ampcode settings - Support model mapping CRUD with upsert (PATCH) capability - Add comprehensive test coverage for all ampcode endpoints * refactor(api): simplify request body parsing in ampcode handlers * feat(logging): add upstream API request/response capture to streaming logs * style(logging): remove redundant separator line from response section * feat(antigravity): enforce thinking budget limits for Claude models * refactor(logging): remove unused variable in `ensureAttempt` and redundant function call --------- Co-authored-by: hkfires <10558748+hkfires@users.noreply.github.com>	2025-12-08 22:32:29 +08:00
hkfires	a283545b6b	feat(antigravity): enforce thinking budget limits for Claude models	2025-12-08 20:36:17 +08:00
Luis Pater	0d4f32a881	Merge branch 'router-for-me:main' into main	2025-12-08 10:20:08 +08:00
hkfires	9c09128e00	feat(registry): add explicit thinking support config for antigravity models	2025-12-07 19:12:55 +08:00
Luis Pater	f77c22e6ff	Merge branch 'main' into feature/kiro-integration	2025-12-06 11:52:59 +08:00
Mansi	02d8a1cfec	feat(kiro): add AWS Builder ID authentication support - Add --kiro-aws-login flag for AWS Builder ID device code flow - Add DoKiroAWSLogin function for AWS SSO OIDC authentication - Complete Kiro integration with AWS, Google OAuth, and social auth - Add kiro executor, translator, and SDK components - Update browser support for Kiro authentication flows	2025-12-05 22:46:24 +03:00
Luis Pater	6b60bdd139	Merge branch 'router-for-me:main' into main	2025-12-03 21:55:29 +08:00
Luis Pater	897c40bed8	feat(registry): add DeepSeek-V3.2-Chat model definition Add new DeepSeek-V3.2-Chat model to the registry with standard chat configuration, positioned before the experimental variant for better organization.	2025-12-03 21:34:50 +08:00
Luis Pater	43cac7b5f6	Merge branch 'main' into v6.5.32	2025-12-02 11:46:05 +08:00
Luis Pater	1434bc38e5	refactor(registry): remove Qwen3-Coder from model definitions	2025-12-02 11:34:38 +08:00
Luis Pater	5b01eba943	Merge branch 'router-for-me:main' into main	2025-11-30 21:30:49 +08:00
hkfires	75e278c7a5	feat(registry): add thinking support to gemini models	2025-11-30 20:56:29 +08:00
Ernesto Martínez	3a9ac7ef33	feat(auth): add GitHub Copilot authentication and API integration Add complete GitHub Copilot support including: - Device flow OAuth authentication via GitHub's official client ID - Token management with automatic caching (25 min TTL) - OpenAI-compatible API executor for api.githubcopilot.com - 16 model definitions (GPT-5 variants, Claude variants, Gemini, Grok, Raptor) - CLI login command via -github-copilot-login flag - SDK authenticator and refresh registry integration Enables users to authenticate with their GitHub Copilot subscription and use it as a backend provider alongside existing providers.	2025-11-27 20:14:30 +01:00
Luis Pater	d2e4639b2a	feat(registry): add context length and update max tokens for Claude model configurations - Added `ContextLength` field with a value of 200,000 to all applicable Claude model definitions. - Standardized `MaxCompletionTokens` values across models for consistency and alignment.	2025-11-27 16:13:25 +08:00
nestharus	e73cdf5cff	fix(claude): ensure max_tokens exceeds thinking budget for thinking models Fixes an issue where Claude thinking models would return 400 errors when the thinking.budget_tokens was greater than or equal to max_tokens. Changes: - Add MaxCompletionTokens: 128000 to all Claude thinking model definitions - Add ensureMaxTokensForThinking() function in claude_executor.go that: - Checks if thinking is enabled with a budget_tokens value - Looks up the model's MaxCompletionTokens from the registry - Ensures max_tokens is set to at least the model's MaxCompletionTokens - Falls back to budget_tokens + 4000 buffer if registry lookup fails This ensures Anthropic API constraint (max_tokens > thinking.budget_tokens) is always satisfied when using extended thinking features. Fixes: #339 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-26 22:31:05 -08:00
Luis Pater	36755421fe	Merge pull request #343 from router-for-me/misc style(amp): tidy whitespace in proxy module and tests	2025-11-26 19:03:07 +08:00
hkfires	6c17dbc4da	style(amp): tidy whitespace in proxy module and tests	2025-11-26 18:57:26 +08:00
Luis Pater	ee6429cc75	feat(registry): add Gemini 3 Pro Image Preview model and remove Claude Sonnet 4.5 Thinking - Added new `Gemini 3 Pro Image Preview` model with detailed metadata and configuration. - Removed outdated `Claude Sonnet 4.5 Thinking` model definition for cleanup and relevance.	2025-11-26 18:22:40 +08:00
nestharus	d0e694d4ed	feat(claude): add thinking model variants and beta headers support - Add Claude thinking model definitions (sonnet-4-5-thinking, opus-4-5-thinking variants) - Add Thinking support for antigravity models with -thinking suffix - Add injectThinkingConfig() for automatic thinking budget based on model suffix - Add resolveUpstreamModel() mappings for thinking variants to actual Claude models - Add extractAndRemoveBetas() to convert betas array to anthropic-beta header - Update applyClaudeHeaders() to merge custom betas from request body Closes #324	2025-11-25 03:33:05 -08:00
Ben Vargas	0895533400	fix(registry): correct Claude Opus 4.5 created timestamp Update epoch from 1730419200 (2024-11-01) to 1761955200 (2025-11-01).	2025-11-24 12:27:23 -07:00
Ben Vargas	43f007c234	feat(registry): add Claude Opus 4.5 model definition Add support for claude-opus-4-5-20251101 with 200K context window and 64K max output tokens.	2025-11-24 12:26:39 -07:00
Luis Pater	db81331ae8	refactor(middleware): extract request logging logic and optimize condition checks - Added `shouldLogRequest` helper to simplify path-based request logging logic. - Updated middleware to skip management endpoints for improved security. - Introduced an explicit `nil` logger check for minimal overhead. - Updated dependencies in `go.mod`. feat(auth): add handling for 404 response with retry logic - Introduced support for 404 `not_found` status with a 12-hour backoff period. - Updated `manager.go` to align state and status messages for 404 scenarios. refactor(translator): comment out debug logging in Gemini responses request	2025-11-20 23:20:40 +08:00
Luis Pater	371324c090	feat(registry): expand Gemini model definitions and support Vertex AI	2025-11-20 18:16:26 +08:00
Luis Pater	0586da9c2b	refactor(registry): move Gemini 3 Pro Preview model definition to base set	2025-11-20 10:51:16 +08:00
Ben Vargas	782bba0bc4	feat(registry): enable gemini-3-pro-preview for gemini-cli provider Add gemini-3-pro-preview model to GetGeminiCLIModels() to make it available for OAuth-based Gemini CLI users, matching the model already available in AI Studio provider. Model spec: - ID: gemini-3-pro-preview - Version: 3.0 - Input: 1M tokens - Output: 64K tokens - Thinking: 128-32K tokens (dynamic)	2025-11-19 12:47:39 -07:00
Luis Pater	bf116b68f8	feat(registry): add GPT-5.1 Codex Max model definitions and support - Introduced `gpt-5.1-codex-max` variants to model definitions (`low`, `medium`, `high`, `xhigh`). - Updated executor logic to map effort levels for Codex Max models. - Added `lastCodexMaxPrompt` processing for `gpt-5.1-codex-max` prompts. - Defined instructions for `gpt-5.1-codex-max` in a new file: `codex_instructions/gpt-5.1-codex-max_prompt.md`.	2025-11-20 03:12:22 +08:00
Luis Pater	17016ae6a5	feat(registry): add Gemini 3 Pro Preview model definition	2025-11-18 23:48:21 +08:00
Luis Pater	01b7b60901	feat(registry): add Gemini 3 Pro Preview model definition	2025-11-18 23:46:58 +08:00
Luis Pater	23a7633e6d	fix(registry): update Thinking parameters and replace Gemini-3 Preview with Gemini-2.5 Flash Lite	2025-11-18 11:51:52 +08:00
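
Commit e73cdf5cff describes a guard that keeps `max_tokens` above `thinking.budget_tokens` for Claude thinking models. The rule can be sketched roughly as below. This is a minimal illustration based only on the commit message, not the code in `claude_executor.go`: the function name `ensureMaxTokens`, its integer parameters, and the final safety bump are assumptions made for the example.

```go
package main

import "fmt"

// thinkingBuffer is the fallback headroom mentioned in commit e73cdf5cff.
const thinkingBuffer = 4000

// ensureMaxTokens is a hypothetical, simplified stand-in for the
// ensureMaxTokensForThinking() helper named in the log. It returns a
// max_tokens value strictly greater than the thinking budget.
//
// registryMax models the registry's MaxCompletionTokens for the model;
// pass 0 to simulate a failed registry lookup.
func ensureMaxTokens(maxTokens, budgetTokens, registryMax int) int {
	if budgetTokens <= 0 {
		// Thinking is not enabled, so the request is left untouched.
		return maxTokens
	}

	// Prefer the registry value; fall back to budget + buffer.
	floor := budgetTokens + thinkingBuffer
	if registryMax > 0 {
		floor = registryMax
	}
	if maxTokens < floor {
		maxTokens = floor
	}

	// Assumption (not stated in the commit): if the registry value itself
	// does not exceed the budget, bump once more so the constraint holds.
	if maxTokens <= budgetTokens {
		maxTokens = budgetTokens + thinkingBuffer
	}
	return maxTokens
}

func main() {
	fmt.Println(ensureMaxTokens(8192, 32000, 128000)) // registry hit: 128000
	fmt.Println(ensureMaxTokens(8192, 32000, 0))      // registry miss: 36000
	fmt.Println(ensureMaxTokens(8192, 0, 128000))     // thinking off: 8192
}
```

In every branch where thinking is enabled, the returned value exceeds `budgetTokens`, which is the constraint the commit says caused 400 errors when violated.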

92 Commits
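
Commit 007572b58e says that `NormalizeThinkingModel` consults `ModelSupportsThinking` before stripping a `-thinking` or `-thinking-<ver>` suffix, so that official IDs such as `kimi-k2-thinking` are left intact. A rough sketch of that check follows. The map-backed `supportsThinking` lookup, the lowercase function names, the return shape, and the `example-model` IDs are all hypothetical. The real registry and signatures may differ, and `<ver>` is assumed here to be any non-empty trailing segment.

```go
package main

import (
	"fmt"
	"strings"
)

const thinkingSuffix = "-thinking"

// knownThinkingModels is a hypothetical stand-in for registry metadata.
// The two IDs are the examples given in the commit message.
var knownThinkingModels = map[string]bool{
	"kimi-k2-thinking":              true,
	"qwen3-235b-a22b-thinking-2507": true,
}

// supportsThinking mimics a registry lookup by exact model ID.
func supportsThinking(id string) bool {
	return knownThinkingModels[id]
}

// normalizeThinkingModel returns the base model name and whether a
// thinking suffix was stripped. Registered IDs are returned unchanged.
func normalizeThinkingModel(model string) (string, bool) {
	if supportsThinking(model) {
		// The suffix is part of the official ID; do not parse it.
		return model, false
	}
	idx := strings.LastIndex(model, thinkingSuffix)
	if idx <= 0 {
		return model, false
	}
	rest := model[idx+len(thinkingSuffix):]
	// Accept "-thinking" or "-thinking-<ver>" with a non-empty <ver>.
	if rest == "" || (strings.HasPrefix(rest, "-") && len(rest) > 1) {
		return model[:idx], true
	}
	return model, false
}

func main() {
	for _, m := range []string{
		"kimi-k2-thinking",
		"example-model-thinking",
		"example-model-thinking-16000",
		"example-model",
	} {
		base, stripped := normalizeThinkingModel(m)
		fmt.Printf("%s -> %s (stripped=%v)\n", m, base, stripped)
	}
}
```

Checking the registry first is what prevents `kimi-k2-thinking` from being misread as a base model `kimi-k2` plus a thinking request.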