[post training] support save hf safetensor format checkpoint #845
context
Currently, llama stack only supports inference / eval of a finetuned checkpoint with meta-reference as the inference provider. This is sub-optimal since meta-reference is pretty slow.
Our vision is that developers can run inference / eval on a finetuned checkpoint produced by the post training APIs with any inference provider on the stack. To achieve this, we'd like to define a unified output checkpoint format for post training providers, so that every inference provider can respect that format when serving customized models.
By spot-checking how ollama and fireworks run inference on a customized model, we defined the output checkpoint format as /adapter/adapter_config.json and /adapter/adapter_model.safetensors (since we only support LoRA post training today, we start with an adapter-only checkpoint).
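For reference, a minimal sketch of what writing that layout could look like on the provider side (the function name and config contents here are illustrative assumptions, not the actual implementation; it assumes `safetensors` is installed and the state dict holds only the LoRA weights):

```python
import json
from pathlib import Path

import torch
from safetensors.torch import save_file


def save_hf_adapter_checkpoint(
    adapter_state_dict: dict[str, torch.Tensor],
    adapter_config: dict,
    output_dir: Path,
) -> None:
    """Write a LoRA adapter in the layout described above:
    <output_dir>/adapter/adapter_config.json and
    <output_dir>/adapter/adapter_model.safetensors."""
    adapter_dir = output_dir / "adapter"
    adapter_dir.mkdir(parents=True, exist_ok=True)

    # adapter_config.json carries the LoRA hyperparameters (rank, alpha,
    # target modules, base model id) so downstream loaders can rebuild the adapter.
    with open(adapter_dir / "adapter_config.json", "w") as f:
        json.dump(adapter_config, f, indent=2)

    # adapter_model.safetensors holds only the LoRA weights, in safetensors format.
    save_file(adapter_state_dict, str(adapter_dir / "adapter_model.safetensors"))
```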
test
We kick off a post training job with the checkpoint format configured as 'hf'. Output files:
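To sanity-check those output files, something like the following can confirm the adapter layout and list the stored LoRA tensors (the checkpoint path below is a placeholder; the real location depends on the job configuration):

```python
from pathlib import Path

from safetensors import safe_open

# Placeholder path: the real location comes from the post training job's
# configured output / checkpoint directory.
adapter_dir = Path("/path/to/job/output/adapter")

assert (adapter_dir / "adapter_config.json").exists()

# Inspect the LoRA tensors stored in the safetensors file.
with safe_open(str(adapter_dir / "adapter_model.safetensors"), framework="pt") as f:
    for name in f.keys():
        print(name, f.get_tensor(name).shape)
```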
We did a proof of concept with ollama to see if it can run inference on our finetuned checkpoint.
This is just a proof of concept with the ollama command line. As a next step, we'd like to wrap the customized-model loading / inference logic into the inference provider implementations.
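As an illustration of what consuming this format could look like outside of ollama, here is a sketch that loads the adapter with transformers + peft (the base model id and adapter path are placeholders, and this is not the actual provider code):

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE_MODEL = "meta-llama/Llama-3.2-3B-Instruct"  # placeholder base model
ADAPTER_DIR = "/path/to/job/output/adapter"      # dir with adapter_config.json + adapter_model.safetensors

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
base = AutoModelForCausalLM.from_pretrained(BASE_MODEL, torch_dtype=torch.bfloat16)

# PeftModel reads adapter_config.json and adapter_model.safetensors from the
# directory, so any provider that understands this layout can load the adapter.
model = PeftModel.from_pretrained(base, ADAPTER_DIR)

inputs = tokenizer("Hello", return_tensors="pt")
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=32)[0]))
```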