Add IP-adapter support for stable diffusion #766

JingyaHuang · 2025-01-23T16:12:12Z

What does this PR do?

Fixes #718

Export of sd models loaded with loaded IP adapter weights + image encoder
Ensure the caching works
Inference: add image encoder to the pipelines

Export via CLI

optimum-cli export neuron --model stable-diffusion-v1-5/stable-diffusion-v1-5 --ip_adapter_ids h94/IP-Adapter --ip_adapter_subfolders models --ip_adapter_weight_names ip-adapter-full-face_sd15.bin --ip_adapter_scales 0.5 --batch_size 1 --height 512 --width 512 --num_images_per_prompt 1 --auto_cast matmul --auto_cast_type bf16 ip_adapter_neuron/

Export via NeuronModel API

from optimum.neuron import NeuronStableDiffusionPipeline

model_id = "runwayml/stable-diffusion-v1-5"
compiler_args = {"auto_cast": "matmul", "auto_cast_type": "bf16"}
input_shapes = {"batch_size": 1, "height": 512, "width": 512}

stable_diffusion = NeuronStableDiffusionPipeline.from_pretrained(
    model_id, 
    export=True, 
    ip_adapter_ids="h94/IP-Adapter",
    ip_adapter_subfolders="models",
    ip_adapter_weight_names="ip-adapter-full-face_sd15.bin",
    ip_adapter_scales=0.5,
    **compiler_args, 
    **input_shapes,
)

# Save locally or upload to the HuggingFace Hub
save_directory = "ip_adapter_neuron/"
stable_diffusion.save_pretrained(save_directory)

Inference

With ip_adapter_image as input

from optimum.neuron import NeuronStableDiffusionPipeline

model_id = "runwayml/stable-diffusion-v1-5"
compiler_args = {"auto_cast": "matmul", "auto_cast_type": "bf16"}
input_shapes = {"batch_size": 1, "height": 512, "width": 512}

stable_diffusion = NeuronStableDiffusionPipeline.from_pretrained(
    model_id, 
    export=True, 
    ip_adapter_ids="h94/IP-Adapter",
    ip_adapter_subfolders="models",
    ip_adapter_weight_names="ip-adapter-full-face_sd15.bin",
    ip_adapter_scales=0.5,
    **compiler_args, 
    **input_shapes,
)

# Save locally or upload to the HuggingFace Hub
save_directory = "ip_adapter_neuron/"
stable_diffusion.save_pretrained(save_directory)

With ip_adapter_image_embeds as input (encode the image first)

image_embeds = stable_diffusion.prepare_ip_adapter_image_embeds(
    ip_adapter_image=image,
    ip_adapter_image_embeds=None,
    device=None,
    num_images_per_prompt=1,
    do_classifier_free_guidance=True,
)
torch.save(image_embeds, "image_embeds.ipadpt")

image_embeds = torch.load("image_embeds.ipadpt")
images = stable_diffusion(
    prompt="a polar bear sitting in a chair drinking a milkshake",
    ip_adapter_image_embeds=image_embeds,
    negative_prompt="deformed, ugly, wrong proportion, low res, bad anatomy, worst quality, low quality",
    num_inference_steps=100,
    generator=generator,
).images[0]

image.save("polar_bear.png")

Next steps

Support multiple IP adapters
Ensure it works for sdxl
Extend the support for diffusion transformers
Documentation along with refactoring

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2025-01-27T16:57:32Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

JingyaHuang added 3 commits January 23, 2025 16:07

pre-export prep done

1b82b06

fix doc build

147c30e

step

fa7e2e8

JingyaHuang added 6 commits January 30, 2025 16:45

export done

eec1a7f

Merge branch 'main' into add-ip-adapter

b17e229

make style

c1ab701

fix validation & start api export

0ec30f7

api export done

d2c6821

inference done

0d828b5

JingyaHuang changed the title ~~Add IP-adapter support for stable diffusion (XL)~~ Add IP-adapter support for stable diffusion Feb 5, 2025

JingyaHuang and others added 4 commits February 5, 2025 16:11

fix style

fcd8393

fix

4b14646

Merge branch 'main' into add-ip-adapter

fe04adc

solve conflict

34b40ba

JingyaHuang requested review from dacorvo, michaelbenayoun and tengomucho February 7, 2025 09:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add IP-adapter support for stable diffusion #766

Add IP-adapter support for stable diffusion #766

JingyaHuang commented Jan 23, 2025 •

edited

Loading

HuggingFaceDocBuilderDev commented Jan 27, 2025

Add IP-adapter support for stable diffusion #766

Are you sure you want to change the base?

Add IP-adapter support for stable diffusion #766

Conversation

JingyaHuang commented Jan 23, 2025 • edited Loading

What does this PR do?

Next steps

Before submitting

HuggingFaceDocBuilderDev commented Jan 27, 2025

JingyaHuang commented Jan 23, 2025 •

edited

Loading