feat: add multi-GPU–friendly default for vLLM/Unsloth engine setup #483
Summary
Default vLLM/Unsloth tensor_parallel_size to the number of visible GPUs (respects CUDA_VISIBLE_DEVICES) so multi-GPU setups don’t start with an unset/invalid TP world size.
Still allows explicit overrides via _internal_config["engine_args"].
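For instance, an explicit setting keeps taking precedence over the computed default (the config shape below is hypothetical; only the engine_args key is named in this PR):

```python
# Force TP=2 regardless of how many GPUs are visible; the GPU-count
# default only applies when tensor_parallel_size is not set here.
_internal_config = {"engine_args": {"tensor_parallel_size": 2}}
```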
Motivation
On multi-GPU H100 machines, vLLM was crashing during initialization when tensor parallelism wasn't configured; a sensible default prevents starting with TP=0/None.
Details
In get_model_config, set engine_args["tensor_parallel_size"] = torch.cuda.device_count() (or 1 if CUDA unavailable) before merging user overrides.
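A minimal sketch of the intended defaulting logic, assuming a get_model_config shape like the one below (the signature and return structure are illustrative; only engine_args and tensor_parallel_size come from this PR):

```python
import torch

def get_model_config(model_name: str, _internal_config: dict | None = None) -> dict:
    _internal_config = _internal_config or {}

    # torch.cuda.device_count() already respects CUDA_VISIBLE_DEVICES,
    # so this counts only the GPUs visible to this process. Fall back
    # to 1 when CUDA is unavailable (device_count() == 0) so vLLM never
    # starts with TP=0/None.
    visible_gpus = torch.cuda.device_count()
    engine_args = {"tensor_parallel_size": visible_gpus if visible_gpus > 0 else 1}

    # Merge user overrides last so an explicit tensor_parallel_size in
    # _internal_config["engine_args"] still wins over the default.
    engine_args.update(_internal_config.get("engine_args", {}))

    return {"model": model_name, "engine_args": engine_args}
```

Applying user overrides after the default is set keeps the change backward compatible: existing configs that pin tensor_parallel_size behave exactly as before.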