Replies: 1 comment
A 2x increase from 13 GB to 26 GB usually means the converted weights were saved in a wider dtype, most likely float32 instead of float16, so the output may still be correct. The main thing to check is the --dtype used during conversion. If the output was saved as float32, the doubling is expected, because each float32 value takes 4 bytes while each float16 value takes 2. If you want a smaller file, try converting with --dtype float16 or using quantization.
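One way to confirm this is to open the converted file and look at the dtypes it actually contains. The following is a minimal sketch using NumPy, assuming the .npz is a plain NumPy archive; the helper names `summarize_npz` and `expected_size_ratio` are made up for illustration and are not part of convert.py.

```python
from collections import Counter

import numpy as np


def summarize_npz(path):
    """Return {dtype_name: total_bytes} over all arrays in an .npz archive.

    Hypothetical helper for illustration. Note that this loads each array
    into memory one at a time, so it can be slow on multi-GB files.
    """
    totals = Counter()
    with np.load(path) as archive:
        for name in archive.files:
            array = archive[name]
            # nbytes is the in-memory size: element count * bytes per element
            totals[str(array.dtype)] += array.nbytes
    return dict(totals)


def expected_size_ratio(src_dtype, dst_dtype):
    """Size ratio expected when every value is re-saved from src_dtype to dst_dtype."""
    return np.dtype(dst_dtype).itemsize / np.dtype(src_dtype).itemsize
```

If `summarize_npz` reports that nearly all bytes are float32, the 26 GB file is consistent with a float16 checkpoint that was widened on save, since `expected_size_ratio("float16", "float32")` is 2.0.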
I used convert.py to convert LLaMA's .pth file into an .npz file, but the size went from 13 GB to 26 GB.
I don't know whether this is correct.