| Location folder of user documents. In case it's missing, the taxonomy is navigated to look for updated knowledge documents.||`--input`|`ILAB_PROCESS_INPUT`|
+ | Location folder of processed documents. ||`--output`|`ILAB_PROCESS_OUTPUT`|
| Base directories where models are stored. |`$HOME/.cache/instructlab/models`|`--model-dir`|`ILAB_MODEL_DIR`|
| Name of the embedding model. |**TBD**|`--embedding-model`|`ILAB_EMBEDDING_MODEL_NAME`|
@@ -137,7 +143,7 @@ ilab data ingest
For the Taxonomy or Plug-and-Play RAG paths:
```
- ilab data ingest /path/to/processed/folder
+ ilab data ingest --input path/to/processed/folder
```
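
The options table above pairs each flag with an environment variable (for example `--input` with `ILAB_PROCESS_INPUT`). As a rough illustration of how such a pairing is commonly resolved, the sketch below uses Python's standard `argparse` rather than the actual `ilab` implementation. The function name `resolve_options` and the precedence order (command-line flag, then environment variable, then default) are assumptions for illustration, not something this design states.

```python
import argparse
import os


def resolve_options(argv, env=None):
    """Resolve ingest options from CLI flags, falling back to env vars.

    Hypothetical helper: the precedence (flag > env var > default) is an
    assumption, not taken from the design document.
    """
    env = os.environ if env is None else env
    parser = argparse.ArgumentParser(prog="ingest-sketch")
    # Flag and variable names are the ones listed in the options table.
    parser.add_argument("--input", default=env.get("ILAB_PROCESS_INPUT"))
    parser.add_argument(
        "--model-dir",
        default=env.get(
            "ILAB_MODEL_DIR",
            os.path.expanduser("~/.cache/instructlab/models"),
        ),
    )
    # The table lists this default as TBD, so None stands in for it here.
    parser.add_argument(
        "--embedding-model",
        default=env.get("ILAB_EMBEDDING_MODEL_NAME"),
    )
    return vars(parser.parse_args(argv))


if __name__ == "__main__":
    # With no flags, the environment variable supplies the input folder.
    print(resolve_options([], env={"ILAB_PROCESS_INPUT": "path/to/processed/folder"}))
```

Passing `env` explicitly keeps the helper easy to exercise without touching the real process environment.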
#### Working Assumption
@@ -170,20 +176,24 @@ context for RAG-based chat pipelines.
### 2.5 Embedding Ingestion Pipeline Options
```bash
% ilab data ingest --help
- Usage: ilab data ingest [OPTIONS] INPUT_DIR
+ Usage: ilab data ingest [OPTIONS]

The embedding ingestion pipeline

Options:
  --document-store-type TEXT      The document store type, one of:
-                                 `milvuslite`, `milvus`.
+                                 `milvuslite`.
  --document-store-uri TEXT       The document store URI
  --document-store-collection-name TEXT
                                  The document store collection name
  --model-dir TEXT                Base directories where models are stored.
                                  [default: (The default system model location
                                  store, located in the data directory.)]
  --embedding-model TEXT          The embedding model name
+ --output-dir TEXT               Directory where generated datasets are
+                                 stored.
+ --input DIRECTORY               Directory where pre-processed documents are