Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md
csv-sum.md	csv-sum.md
debounce.md	debounce.md
deep-clone.md	deep-clone.md
email-validation.md	email-validation.md
group-by.md	group-by.md
infinite-scroll.md	infinite-scroll.md
modal-dialog.md	modal-dialog.md
number-formatting.md	number-formatting.md
rate-limit.md	rate-limit.md
react-countdown.md	react-countdown.md
url-params.md	url-params.md

Name

Last commit message

Last commit date

csv-sum.md

Examples

Real model output, verbatim from benchmark runs, the same task answered by the same model with no skill (## Without Ponytail) and with ponytail (## With Ponytail), so you can compare side by side. Model: Claude Haiku 4.5, temperature 1, source benchmarks/output.json.

These are not hand-written. Reproduce them yourself: npx promptfoo@latest eval -c benchmarks/promptfooconfig.yaml. Method, all three models, and median-of-10 numbers: ../benchmarks/.

Example	Without (LOC)	With (LOC)
Email Validation	75	3
Debounce	116	10
CSV Sum	20	3
Countdown Timer	267	9
Rate Limiting	128	10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

Examples

Uh oh!

FilesExpand file tree

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Examples