521 changes: 521 additions & 0 deletions docs/assets/recipes/code_generation/infinibyte.py


528 changes: 528 additions & 0 deletions docs/assets/recipes/model_usability/prompt_sensitivity.py


434 changes: 434 additions & 0 deletions docs/assets/recipes/model_usability/structured_data.py


54 changes: 54 additions & 0 deletions docs/recipes/cards.md
@@ -172,6 +172,60 @@ Each recipe is a self-contained example that can be run independently.
[:material-book-open-page-variant: View Recipe](mcp_and_tooluse/search_agent.md){ .md-button }
[Download Code :octicons-download-24:](../assets/recipes/mcp_and_tooluse/search_agent.py){ .md-button download="search_agent.py" }

- :material-code-json:{ .lg .middle } **Nemotron Nano Structured Data**

Generate multi-format structured data (JSON, YAML, XML, Markdown) with controlled schema complexity, conversational grounding, and best-of-3 candidate generation.

---

**Demonstrates:**

- Multi-format structured output generation
- Conditional topic sampling with SubcategorySamplerParams
- Multi-stage LLM pipeline (schema, prompt, conversation, output)
- Best-of-3 candidate generation

---

[:material-book-open-page-variant: View Recipe](model_usability/structured_data.md){ .md-button }
[Download Code :octicons-download-24:](../assets/recipes/model_usability/structured_data.py){ .md-button download="structured_data.py" }

- :material-format-list-text:{ .lg .middle } **Nemotron Nano Prompt Sensitivity**

Generate diverse prompt preambles for RL training with regex-verified output format specifications, LLM paraphrasing, placement order variants, and four LLM judges.

---

**Demonstrates:**

- Seed dataset with format templates and preambles
- LLM paraphrasing with diversity controls
- Regex-aligned format instruction generation
- 4 LLM judges (format compliance, regex alignment, order coherence, preamble quality)

---

[:material-book-open-page-variant: View Recipe](model_usability/prompt_sensitivity.md){ .md-button }
[Download Code :octicons-download-24:](../assets/recipes/model_usability/prompt_sensitivity.py){ .md-button download="prompt_sensitivity.py" }

- :material-infinity:{ .lg .middle } **Nemotron Nano InfiniByte**

Generate more diverse and complex problems by cross-breeding two HuggingFace datasets: each paired problem is augmented through obfuscation or complication, and the result is scored with structured LLM evaluation.

---

**Demonstrates:**

- Cross-source dataset seeding from HuggingFace
- LLMStructuredColumnConfig with Pydantic models
- Multi-stage structured generation (candidates, selection, evaluation)
- Expression columns for score extraction

---

[:material-book-open-page-variant: View Recipe](code_generation/infinibyte.md){ .md-button }
[Download Code :octicons-download-24:](../assets/recipes/code_generation/infinibyte.py){ .md-button download="infinibyte.py" }

- :material-file-document-multiple:{ .lg .middle } **Markdown Section Seed Reader**

Define a custom `FileSystemSeedReader` inline and turn Markdown files into one seed row per heading section.
9 changes: 9 additions & 0 deletions docs/recipes/code_generation/infinibyte.md
@@ -0,0 +1,9 @@
# Nemotron Nano InfiniByte

Generate more diverse and complex training problems by cross-breeding two source datasets. The recipe cross-joins coding problems with math/science problems, then uses an LLM to create augmented problems through obfuscation or complication, followed by structured evaluation and solution generation. It is used for Nemotron Nano post-training data.

[Download Code :octicons-download-24:](../../assets/recipes/code_generation/infinibyte.py){ .md-button download="infinibyte.py" }

```python
--8<-- "assets/recipes/code_generation/infinibyte.py"
```
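
The cross-join step described above can be illustrated with a minimal standalone sketch. It uses only the standard library and is not the recipe's implementation; the function name `cross_join`, the column names, and the sample problems are hypothetical.

```python
import itertools
import random


def cross_join(coding_problems, science_problems, sample_size=None, seed=0):
    """Pair every coding problem with every math/science problem.

    Hypothetical helper: the real recipe seeds from two datasets, while this
    sketch works on plain lists of strings.
    """
    pairs = [
        {"coding_problem": c, "science_problem": s}
        for c, s in itertools.product(coding_problems, science_problems)
    ]
    # Optionally down-sample, since a full cross-join grows as len(a) * len(b).
    if sample_size is not None and sample_size < len(pairs):
        pairs = random.Random(seed).sample(pairs, sample_size)
    return pairs


coding = ["Reverse a linked list.", "Implement binary search."]
science = [
    "Compute the period of a pendulum.",
    "Balance a combustion reaction.",
    "Find the roots of a quadratic.",
]

pairs = cross_join(coding, science)
print(len(pairs))  # 2 coding problems x 3 science problems = 6 pairs
```

Each resulting pair would then be handed to an LLM prompt that asks for an obfuscated or more complicated combined problem.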
9 changes: 9 additions & 0 deletions docs/recipes/model_usability/prompt_sensitivity.md
@@ -0,0 +1,9 @@
# Nemotron Nano Prompt Sensitivity

Generate diverse prompt preambles for RL training that pair natural-language instructions with regex-verified output format specifications. The recipe combines seed format templates, LLM paraphrasing, placement order variants, and four LLM judges to produce high-quality, format-compliant preambles. It is used for Nemotron Nano prompt sensitivity training.

[Download Code :octicons-download-24:](../../assets/recipes/model_usability/prompt_sensitivity.py){ .md-button download="prompt_sensitivity.py" }

```python
--8<-- "assets/recipes/model_usability/prompt_sensitivity.py"
```
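
To make "regex-verified output format specification" concrete, here is a minimal sketch of the idea: a natural-language format instruction is paired with a regular expression that a compliant response must match. The `FORMAT_SPEC` dictionary, its keys, and `extract_answer` are hypothetical and are not taken from the recipe.

```python
import re

# Hypothetical format spec: an instruction shown to the model, plus the regex
# used to verify (and extract from) the model's response.
FORMAT_SPEC = {
    "instruction": (
        "End your response with a line of the form 'Answer: <letter>', "
        "where <letter> is A, B, C, or D."
    ),
    "regex": r"Answer: ([A-D])\s*$",
}


def extract_answer(response, spec=FORMAT_SPEC):
    """Return the captured answer if the response follows the format, else None."""
    match = re.search(spec["regex"], response)
    return match.group(1) if match else None


print(extract_answer("The gas expands on heating.\nAnswer: B"))  # B
print(extract_answer("I think the answer is B."))  # None
```

Because the regex is the ground truth for compliance, any paraphrase of the instruction still has to describe a format that the same regex accepts.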
9 changes: 9 additions & 0 deletions docs/recipes/model_usability/structured_data.md
@@ -0,0 +1,9 @@
# Nemotron Nano Structured Data

Generate multi-format structured data (JSON, YAML, XML, Markdown) with controlled schema complexity, conversational grounding, and best-of-3 candidate generation. It is used for Nemotron Nano structured-output SFT training.

[Download Code :octicons-download-24:](../../assets/recipes/model_usability/structured_data.py){ .md-button download="structured_data.py" }

```python
--8<-- "assets/recipes/model_usability/structured_data.py"
```
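
As a rough illustration of best-of-3 candidate generation, the sketch below picks the first of several candidate outputs that parses as JSON and contains the required keys. The selection rule and the name `pick_best_candidate` are assumptions made for this example; the recipe's actual selection criterion may differ, and only the JSON format is covered here.

```python
import json


def pick_best_candidate(candidates, required_keys):
    """Return the first candidate that is a JSON object with all required keys.

    Hypothetical selection rule: validity first, in generation order.
    Returns None when no candidate qualifies.
    """
    for text in candidates:
        try:
            parsed = json.loads(text)
        except json.JSONDecodeError:
            continue  # Malformed output: skip this candidate.
        if isinstance(parsed, dict) and set(required_keys).issubset(parsed):
            return parsed
    return None


candidates = [
    '{"name": "Ada"',              # truncated, not valid JSON
    '{"name": "Ada"}',             # valid JSON, but missing "age"
    '{"name": "Ada", "age": 36}',  # valid and complete
]

print(pick_best_candidate(candidates, {"name", "age"}))
# {'name': 'Ada', 'age': 36}
```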
4 changes: 4 additions & 0 deletions mkdocs.yml
@@ -46,6 +46,10 @@ nav:
- Text to Python: recipes/code_generation/text_to_python.md
- Text to SQL: recipes/code_generation/text_to_sql.md
- "Nemotron Super Text to SQL": recipes/code_generation/enterprise_text_to_sql.md
- "Nemotron Nano InfiniByte": recipes/code_generation/infinibyte.md
- Model Usability:
- "Nemotron Nano Structured Data": recipes/model_usability/structured_data.md
- "Nemotron Nano Prompt Sensitivity": recipes/model_usability/prompt_sensitivity.md
- Plugin Development:
- Markdown Section Seed Reader Plugin: recipes/plugin_development/markdown_seed_reader.md
- QA and Chat: