
feat: Add support for phi4 #764

Open
wants to merge 11 commits into base: main

Conversation


@jlonge4 jlonge4 commented Jan 18, 2025

This PR adds support for Microsoft's Phi-4 model by adapting the existing LLaMA implementation.

The Phi-4 architecture follows the LLaMA architecture closely; the main difference is how the weights are stored (fused qkv_proj and gate_up tensors versus separate projections).
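For readers following along, a minimal sketch of the weight-splitting this adaptation implies (the tensor names follow the convention above; the dimensions are illustrative, not the real Phi-4 config, and the sketch assumes q, k, and v all have the same output width, which grouped-query attention would change):

```python
import numpy as np

hidden_size = 8          # illustrative only; Phi-4's real hidden size is much larger
intermediate_size = 16   # illustrative MLP width

# Phi-4-style checkpoint stores the attention weights fused:
# q, k and v stacked along the output dimension.
qkv_proj = np.zeros((3 * hidden_size, hidden_size), dtype=np.float32)
q_proj, k_proj, v_proj = np.split(qkv_proj, 3, axis=0)

# The MLP gate and up projections are likewise fused into one tensor.
gate_up_proj = np.zeros((2 * intermediate_size, hidden_size), dtype=np.float32)
gate_proj, up_proj = np.split(gate_up_proj, 2, axis=0)
```

Each split tensor can then be assigned to the corresponding separate projection in the LLaMA-style module.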

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@dacorvo
Collaborator

dacorvo commented Feb 4, 2025

@jlonge4 thank you very much for this pull request: adding support for phi4 would be awesome.

We are, however, heavily refactoring the export mechanism to remove the dependency on transformers-neuronx and simplify the contribution of new models.

Can you take a look at that pull request and see if it would make it easier for you to add support for phi4 based on the new HLO backend?

@jlonge4
Author

jlonge4 commented Feb 5, 2025

Hi there @dacorvo, I just took a look at the difference and it certainly seems a lot slimmer! I think my effort would be about the same for the most important part of this, the load_weights function, but it would obviously get rid of a lot of boilerplate. I am happy to rework this PR and merge it into the add_hlo branch if you prefer.
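As an illustration of the kind of translation a reworked load_weights would center on, here is a purely hypothetical helper (the function name, key suffixes, and equal-width q/k/v assumption are all illustrative, not the actual optimum-neuron API) that maps fused Phi-4 checkpoint keys onto separate LLaMA-style keys:

```python
import numpy as np

def split_fused_state_dict(state_dict):
    """Translate fused Phi-4-style checkpoint keys into separate
    LLaMA-style keys. Hypothetical sketch, not a real library API."""
    out = {}
    for name, tensor in state_dict.items():
        if name.endswith("qkv_proj.weight"):
            # Assumes q, k and v are equal-width thirds of the fused tensor.
            prefix = name[: -len("qkv_proj.weight")]
            q, k, v = np.split(tensor, 3, axis=0)
            out[prefix + "q_proj.weight"] = q
            out[prefix + "k_proj.weight"] = k
            out[prefix + "v_proj.weight"] = v
        elif name.endswith("gate_up_proj.weight"):
            prefix = name[: -len("gate_up_proj.weight")]
            gate, up = np.split(tensor, 2, axis=0)
            out[prefix + "gate_proj.weight"] = gate
            out[prefix + "up_proj.weight"] = up
        else:
            # Non-fused tensors pass through unchanged.
            out[name] = tensor
    return out
```

With a translation like this, the rest of the LLaMA-style loading code can stay untouched, which is where most of the boilerplate savings would come from.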
