feat: use model name as adapter id in chat endpoints #2128
Merged
Conversation
Narsil
reviewed
Jul 1, 2024
top_n_tokens: None,
grammar: None,
..Default::default()
adapter_id: model.as_ref().filter(|m| *m != "tgi").map(String::from),
Contributor
Can we remove this special value? It shouldn't exist in the first place.
If OpenAI requires a model id, then we should actually match it against the actual deployed model id, no?
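For illustration only, here is a rough sketch of that suggestion, assuming a hypothetical `resolve_adapter_id` helper and a `deployed_model_id` value known to the router (neither is code from this PR):

```rust
// Hypothetical helper, not code from this PR: compare the requested model
// against the deployed base model id instead of a magic "tgi" literal.
fn resolve_adapter_id(requested: Option<&str>, deployed_model_id: &str) -> Option<String> {
    match requested {
        // No model given: no adapter to load.
        None => None,
        // The base model itself was requested: no adapter either.
        Some(m) if m == deployed_model_id => None,
        // Anything else is treated as a lora adapter id.
        Some(m) => Some(m.to_string()),
    }
}
```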
Narsil
approved these changes
Jul 8, 2024
Contributor
Narsil
left a comment
Merging.
"tgi" is used throughout our docs. Just for backward compat we should support it somehow.
Definitely not great though.
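As a hedged sketch of that backward-compat idea (again with assumed names, not the actual router code), the existing filter could simply let both the historical "tgi" value and the deployed model id pass through as "no adapter":

```rust
// Assumed names, not the actual router code: keep accepting the historical
// "tgi" value from the docs and the real deployed model id, and treat every
// other value as a lora adapter id.
fn resolve_adapter_id(requested: Option<&str>, deployed_model_id: &str) -> Option<String> {
    requested
        .filter(|m| *m != "tgi" && *m != deployed_model_id)
        .map(String::from)
}
```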
This PR allows users to specify lora adapter ids as the `model` value in the `/v1/chat/completions` and `/v1/completions` endpoints. This feature aligns with other lora implementations and was mentioned here during the initial lora PR #2010 (comment).
curl localhost:3000/v1/chat/completions -s \
    -H 'Content-Type: application/json' \
    -X POST \
    -d '{
  "model": "predibase/customer_support",
  "messages": [
    { "role": "system", "content": "You are a helpful assistant." },
    { "role": "user", "content": "What is deep learning?" }
  ],
  "stream": false,
  "max_tokens": 20,
  "seed": 42
}' | jq .
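For readers who prefer a typed client, a minimal Rust equivalent of the curl call above might look like the following; it assumes the `reqwest` crate (with the `blocking` and `json` features) and `serde_json`, and is only a usage sketch, not part of this PR:

```rust
// Minimal client-side sketch, assuming the `reqwest` (blocking + json
// features) and `serde_json` crates; not part of this PR.
use serde_json::json;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let body = json!({
        // The lora adapter id goes in the standard OpenAI "model" field.
        "model": "predibase/customer_support",
        "messages": [
            { "role": "system", "content": "You are a helpful assistant." },
            { "role": "user", "content": "What is deep learning?" }
        ],
        "stream": false,
        "max_tokens": 20,
        "seed": 42
    });

    let response = reqwest::blocking::Client::new()
        .post("http://localhost:3000/v1/chat/completions")
        .json(&body)
        .send()?
        .text()?;

    println!("{response}");
    Ok(())
}
```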