Updating the doc (we keep the list actually).

huggingface · Oct 14, 2024 · 406725e
1 parent 3d5f107

Showing 3 changed files with 18 additions and 2 deletions.
diff --git a/docs/openapi.json b/docs/openapi.json
@@ -2094,4 +2094,4 @@
       "description": "Hugging Face Text Generation Inference API"
     }
   ]
-}
+}
diff --git a/docs/source/supported_models.md b/docs/source/supported_models.md
@@ -34,4 +34,18 @@ Text Generation Inference enables serving optimized models. The following sectio
 - [Idefics](https://huggingface.co/HuggingFaceM4/idefics-9b) (Multimodal)
 
 
-If the above list lacks the model you would like to serve, depending on the model's pipeline type, you can try to initialize and serve the model anyways to see how well it performs, but performance isn't guaranteed for non-optimized models. Read more about [Non-core Model Serving](../basic_tutorials/non_core_models).
+
+If the above list lacks the model you would like to serve, depending on the model's pipeline type, you can try to initialize and serve the model anyways to see how well it performs, but performance isn't guaranteed for non-optimized models:
+
+```python
+# for causal LMs/text-generation models
+AutoModelForCausalLM.from_pretrained(<model>, device_map="auto")
+# or, for text-to-text generation models
+AutoModelForSeq2SeqLM.from_pretrained(<model>, device_map="auto")
+```
+
+If you wish to serve a supported model that already exists on a local folder, just point to the local folder.
+
+```bash
+text-generation-launcher --model-id <PATH-TO-LOCAL-BLOOM>
+```
diff --git a/update_doc.py b/update_doc.py
@@ -9,6 +9,8 @@
 
 Text Generation Inference enables serving optimized models. The following sections list which models (VLMs & LLMs) are supported.
 
+SUPPORTED_MODELS
+
 
 If the above list lacks the model you would like to serve, depending on the model's pipeline type, you can try to initialize and serve the model anyways to see how well it performs, but performance isn't guaranteed for non-optimized models:
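
Note on the `update_doc.py` hunk: the added `SUPPORTED_MODELS` line appears to act as a placeholder in the doc template that is later swapped for the generated model list. The sketch below illustrates that kind of substitution only. The names `TEMPLATE` and `render_supported_models` are hypothetical and do not come from `update_doc.py`, and the actual script may build or insert the list differently.

```python
# Hypothetical sketch: TEMPLATE and render_supported_models are illustrative
# names, not taken from update_doc.py.
TEMPLATE = """Text Generation Inference enables serving optimized models.

SUPPORTED_MODELS

If the above list lacks the model you would like to serve, ...
"""


def render_supported_models(template: str, models: list[tuple[str, str]]) -> str:
    """Replace the placeholder line with a markdown bullet list of models."""
    bullets = "\n".join(f"- [{name}]({url})" for name, url in models)
    # Only a line consisting solely of the placeholder is replaced, so a
    # mention of SUPPORTED_MODELS inside a sentence would be left alone.
    lines = [
        bullets if line.strip() == "SUPPORTED_MODELS" else line
        for line in template.split("\n")
    ]
    return "\n".join(lines)


rendered = render_supported_models(
    TEMPLATE,
    [("Idefics", "https://huggingface.co/HuggingFaceM4/idefics-9b")],
)
print(rendered)
```

Keeping the placeholder on its own line means the surrounding prose in the template stays hand-written while the list itself is regenerated.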