From 16e0942e2e3a2930dfabd1e52454e93065af0b99 Mon Sep 17 00:00:00 2001
From: Yu Li
Date: Fri, 1 Dec 2023 21:22:56 -0600
Subject: [PATCH] update readme

---
 air_llm/README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/air_llm/README.md b/air_llm/README.md
index bdbf903..f2fa689 100644
--- a/air_llm/README.md
+++ b/air_llm/README.md
@@ -93,7 +93,7 @@ We just added model compression based on block-wise quantization based model com

 ```python
 model = AirLLMLlama2("garage-bAInd/Platypus2-70B-instruct",
-                     compression='4bit' # specify '8bit' for 8-bit block-wise quantization
+                     compression='4bit' # specify '8bit' for 8-bit block-wise quantization
                      )
 ```