From 54d4fd7f84c1c187aea4d3de013b5cdd807fc36d Mon Sep 17 00:00:00 2001 From: Luis Pater Date: Fri, 12 Dec 2025 16:10:15 +0800 Subject: [PATCH] remove PR_DOCUMENTATION.md --- PR_DOCUMENTATION.md | 49 --------------------------------------------- 1 file changed, 49 deletions(-) delete mode 100644 PR_DOCUMENTATION.md diff --git a/PR_DOCUMENTATION.md b/PR_DOCUMENTATION.md deleted file mode 100644 index 6b830af6..00000000 --- a/PR_DOCUMENTATION.md +++ /dev/null @@ -1,49 +0,0 @@ -# PR Title / 拉取请求标题 - -`feat(kiro): Add Thinking Mode support & enhance reliability with multi-quota failover` -`feat(kiro): 支持思考模型 (Thinking Mode) 并通过多配额故障转移增强稳定性` - ---- - -# PR Description / 拉取请求描述 - -## 📝 Summary / 摘要 - -This PR introduces significant upgrades to the Kiro (AWS CodeWhisperer/Amazon Q) module. It adds native support for **Thinking/Reasoning models** (similar to OpenAI o1/Claude 3.7), implements a robust **Multi-Endpoint Failover** system to handle rate limits (429), and optimizes configuration flexibility. - -本次 PR 对 Kiro (AWS CodeWhisperer/Amazon Q) 模块进行了重大升级。它增加了对 **思考/推理模型 (Thinking/Reasoning models)** 的原生支持(类似 OpenAI o1/Claude 3.7),实现了一套健壮的 **多端点故障转移 (Multi-Endpoint Failover)** 系统以应对速率限制 (429),并优化了配置灵活性。 - -## ✨ Key Changes / 主要变更 - -### 1. 🧠 Thinking Mode Support / 思考模式支持 -- **OpenAI Compatibility**: Automatically maps OpenAI's `reasoning_effort` parameter (low/medium/high) to Claude's `budget_tokens` (4k/16k/32k). - - **OpenAI 兼容性**:自动将 OpenAI 的 `reasoning_effort` 参数(low/medium/high)映射为 Claude 的 `budget_tokens`(4k/16k/32k)。 -- **Stream Parsing**: Implemented advanced stream parsing logic to detect and extract content within `...` tags, even across chunk boundaries. - - **流式解析**:实现了高级流式解析逻辑,能够检测并提取 `...` 标签内的内容,即使标签跨越了数据块边界。 -- **Protocol Translation**: Converts Kiro's internal thinking content into OpenAI-compatible `reasoning_content` fields (for non-stream) or `thinking_delta` events (for stream). - - **协议转换**:将 Kiro 内部的思考内容转换为兼容 OpenAI 的 `reasoning_content` 字段(非流式)或 `thinking_delta` 事件(流式)。 - -### 2. 🛡️ Robustness & Failover / 稳健性与故障转移 -- **Dual Quota System**: Explicitly defined `kiroEndpointConfig` to distinguish between **IDE (CodeWhisperer)** and **CLI (Amazon Q)** quotas. - - **双配额系统**:显式定义了 `kiroEndpointConfig` 结构,明确区分 **IDE (CodeWhisperer)** 和 **CLI (Amazon Q)** 的配额来源。 -- **Auto Failover**: Implemented automatic failover logic. If one endpoint returns `429 Too Many Requests`, the request seamlessly retries on the next available endpoint/quota. - - **自动故障转移**:实现了自动故障转移逻辑。如果一个端点返回 `429 Too Many Requests`,请求将无缝在下一个可用端点/配额上重试。 -- **Strict Protocol Compliance**: Enforced strict matching of `Origin` and `X-Amz-Target` headers for each endpoint to prevent `403 Forbidden` errors due to protocol mismatches. - - **严格协议合规**:强制每个端点严格匹配 `Origin` 和 `X-Amz-Target` 头信息,防止因协议不匹配导致的 `403 Forbidden` 错误。 - -### 3. ⚙️ Configuration & Models / 配置与模型 -- **New Config Options**: Added `KiroPreferredEndpoint` (global) and `PreferredEndpoint` (per-key) settings to allow users to prioritize specific quotas (e.g., "ide" or "cli"). - - **新配置项**:添加了 `KiroPreferredEndpoint`(全局)和 `PreferredEndpoint`(单 Key)设置,允许用户优先选择特定的配额(如 "ide" 或 "cli")。 -- **Model Registry**: Normalized model IDs (replaced dots with hyphens) and added `-agentic` variants optimized for large code generation tasks. - - **模型注册表**:规范化了模型 ID(将点号替换为连字符),并添加了针对大型代码生成任务优化的 `-agentic` 变体。 - -### 4. 🔧 Fixes / 修复 -- **AMP Proxy**: Downgraded client-side context cancellation logs from `Error` to `Debug` to reduce log noise. - - **AMP 代理**:将客户端上下文取消的日志级别从 `Error` 降级为 `Debug`,减少日志噪音。 - -## ⚠️ Impact / 影响 - -- **Authentication**: **No changes** to the login/OAuth process. Existing tokens work as is. -- **认证**:登录/OAuth 流程 **无变更**。现有 Token 可直接使用。 -- **Compatibility**: Fully backward compatible. The new failover logic is transparent to the user. -- **兼容性**:完全向后兼容。新的故障转移逻辑对用户是透明的。 \ No newline at end of file