This website requires JavaScript.
Explore
Help
Register
Sign In
LLM
/
airllm
Watch
1
Star
0
Fork
0
You've already forked airllm
mirror of
https://github.com/0xSojalSec/airllm.git
synced
2026-03-07 22:33:47 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
eb03f4756fa6c948878a678775d375156f8fdf07
airllm
/
rlhf
History
Yu Li
eb03f4756f
RLHF graph
2023-06-29 17:49:06 -05:00
..
qlora_dpo.py
init dpo based rlhf
2023-06-29 16:08:59 -05:00
RLHF.png
RLHF graph
2023-06-29 17:49:06 -05:00
run_dpo_training.sh
init dpo based rlhf
2023-06-29 16:08:59 -05:00