@ansh-info
Summary

Issue: #472

  • Prevent vLLM from computing a zero KV-cache budget by aligning engine defaults with Unsloth init defaults.
  • Set gpu_memory_utilization and max_model_len in engine_args to the corresponding Unsloth init_args values so large-GPU setups don’t stall.

Motivation

  • Training Qwen2.5-14B with Unsloth/vLLM on an H100 stalled at step 0; logs showed “vLLM’s KV Cache can use up to 0.0 GB” despite ample VRAM. Because the engine defaults were missing, vLLM sized the KV cache to zero.

Details

  • In get_model_config, initialize vLLM engine_args with:
    • gpu_memory_utilization = init_args["gpu_memory_utilization"]
    • max_model_len = init_args["max_seq_length"]
  • Still allow user overrides via _internal_config["engine_args"].

Co-authored-by: Apoorva Gupta <[email protected]>