Confusion surrounding bnb_4bit_compute_dtype, torch_dtype, and prepare_model_for_kbit_training #1516
xiaobingbuhuitou asked this question in Q&A
I want to implement QLoRA fine-tuning with PEFT for a base model whose dtype is float32. I load the base model like this:

```python
import torch
from transformers import AutoModel, BitsAndBytesConfig

model = AutoModel.from_pretrained(
    "PATH",
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_use_double_quant=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.bfloat16,
    ),
)
```

Without setting torch_dtype, the model's dtype changes to float16, and the last_hidden_state it returns is also float16. However, when I set torch_dtype=torch.float32, both the model's dtype and last_hidden_state stay float32.
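For reference, here is roughly how I am checking those dtypes (a minimal sketch that continues the snippet above; the dummy `input_ids` tensor just stands in for a real tokenized batch):

```python
# Dtype of the non-quantized weights: float16 without torch_dtype,
# float32 when torch_dtype=torch.float32 is passed.
print(model.dtype)

with torch.no_grad():
    inputs = {"input_ids": torch.tensor([[1, 2, 3]], device=model.device)}
    out = model(**inputs)

# Tracks the same pattern: float16 in the first case, float32 in the second.
print(out.last_hidden_state.dtype)
```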
But when I wrap the quantized model with prepare_model_for_kbit_training(), everything changes back to float32. I would like to know whether calling prepare_model_for_kbit_training() makes bnb_4bit_compute_dtype and torch_dtype ineffective. I would also like to ask when it is necessary to call prepare_model_for_kbit_training(). Finally, what ultimately determines the dtype of the base model and of last_hidden_state? Thank you.
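Concretely, this is the before/after comparison I am running (a minimal sketch; `dtype_histogram` is just a hypothetical helper of mine for inspecting parameter dtypes, not part of PEFT):

```python
from collections import Counter

from peft import prepare_model_for_kbit_training

def dtype_histogram(model):
    # Count parameter dtypes so any fp16 -> fp32 upcast is visible.
    return Counter(p.dtype for p in model.parameters())

print("before:", dtype_histogram(model))
model = prepare_model_for_kbit_training(model)
# After wrapping, the non-quantized parameters all report float32,
# regardless of the bnb_4bit_compute_dtype / torch_dtype I set at load time.
print("after:", dtype_histogram(model))
```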