mirror of
https://github.com/0xSojalSec/airllm.git
synced 2026-03-07 22:33:47 +00:00
Update README.md
This commit is contained in:
@@ -70,14 +70,14 @@ DPO是最新的最高效的RLHF训练方法。RLHF一直是生成式AI训练的
|
||||
|
||||
扫码:
|
||||
|
||||

|
||||

|
||||
|
||||
|
||||
## 微信群
|
||||
|
||||
扫码进群:
|
||||
|
||||

|
||||

|
||||
|
||||
|
||||
|
||||
|
||||
Reference in New Issue
Block a user