update readme

This commit is contained in:
Yu Li
2023-12-01 21:22:56 -06:00
parent 2bacae2b71
commit 16e0942e2e

View File

@@ -93,7 +93,7 @@ We just added model compression based on block-wise quantization based model com
```python
model = AirLLMLlama2("garage-bAInd/Platypus2-70B-instruct",
compression='4bit' # specify '8bit' for 8-bit block-wise quantization
compression='4bit' # specify '8bit' for 8-bit block-wise quantization
)
```