[BUG] Running with TransformersModel does not work #414

Open
danielkorat opened this issue Jan 29, 2025 · 6 comments

Labels
bug Something isn't working

@danielkorat
Describe the bug
When replacing HfApiModel with TransformersModel in examples/benchmark.ipynb, the eval results for meta-llama/Llama-3.1-8B-Instruct (and various other published models) are far worse than the published numbers (scores below 5).
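For context, the swap in question looks roughly like this (a simplified sketch, not the exact notebook code; the prompt is an arbitrary example):

```python
from smolagents import CodeAgent, HfApiModel, TransformersModel

model_id = "meta-llama/Llama-3.1-8B-Instruct"

# Original notebook: remote inference via the HF Inference API
# model = HfApiModel(model_id)

# This issue: local inference via transformers
model = TransformersModel(model_id)

agent = CodeAgent(tools=[], model=model)
print(agent.run("How many seconds are there in a leap year?"))
```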

Code to reproduce the error
https://github.com/danielkorat/smolagents/blob/transformers/examples/benchmark-transformers.ipynb

Error logs (if any)
A big part of the problem seems to be the parsing of the LLM output (specifically the assistant role):

(screenshot of the malformed assistant-role output omitted)

Also, the regex parsing error arises in nearly all examples.

Expected behavior
The results for meta-llama/Llama-3.1-8B-Instruct should be reproducible as published in the original notebook:

(screenshot of the published benchmark results omitted)

Package versions:

>>> smolagents.__version__
'1.5.0.dev'

Additional context

accelerate==1.3.0
datasets==3.1.0
matplotlib==3.10.0
matplotlib-inline==0.1.7
numpy==1.26.4
seaborn==0.13.2
sentence-transformers==3.3.0
sympy==1.13.1
transformers==4.48.1
danielkorat added the bug label on Jan 29, 2025
@danielkorat
Author

danielkorat commented Jan 29, 2025

Hi @aymeric-roucher 👋
Note that this means smolagents cannot currently be used with local deployments.

danielkorat changed the title from "[BUG] benchmarking with TransformersModel does not work" to "[BUG] Running with TransformersModel does not work" on Jan 29, 2025
@ryantzr1

hi @aymeric-roucher @danielkorat 👋

I'm also facing this issue when trying the text_to_sql.py example with TransformersModel() instead of HfApiModel(). The agent fails with the same regex error when generating the SQL query.

Code to reproduce the error
https://github.com/ryantzr1/smolagents/blob/test-sql-example/examples/text_to_sql.py
Error Log
(screenshot of the regex parsing error omitted)

I'm currently trying to build a LlamaCppModels class to allow users to work with llama.cpp models via llama-cpp-python. If I find a fix for TransformersModel(), I'll update.

@nickvdw

nickvdw commented Jan 30, 2025

Quoting @ryantzr1's comment above (the same regex error with text_to_sql.py and TransformersModel()):

I've seen such code parsing errors before while using TransformersModel(). For me, passing a max_new_tokens argument (e.g., 4096) to TransformersModel() seemed to help, as suggested in #201 (comment). However, I'm not sure whether it will also help in your case.
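For reference, a minimal sketch of that workaround (a hedged example; the model ID and task are assumptions, not taken from the benchmark):

```python
from smolagents import CodeAgent, TransformersModel

# Assumption: any locally runnable instruct model works here.
model = TransformersModel(
    model_id="meta-llama/Llama-3.1-8B-Instruct",
    max_new_tokens=4096,  # raise the low transformers default, which truncates agent output
)

agent = CodeAgent(tools=[], model=model)
agent.run("Which number is larger, 9.11 or 9.8?")
```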

As a side note, it'd be great to have a LlamaCppModels class among others (e.g., vLLM, ONNX Runtime, etc.).

@aymeric-roucher
Collaborator

Thank you folks for reporting!
I agree with @nickvdw that the generation was interrupted, which a higher max_new_tokens parameter should prevent!

@danielkorat
Author

Regarding the suggestion to raise max_new_tokens: FYI, I tried that, but it still did not solve the issue.

@matfrei

matfrei commented Feb 6, 2025

I encountered the same issue today. After a little digging, the problem seems to be that TransformersModel does not set a default value for max_new_tokens, so the transformers default of 20 tokens is used, which is far too low for any agentic task.

Explicitly passing max_new_tokens to the TransformersModel() constructor, as @aymeric-roucher and @nickvdw have suggested, certainly helps, but to keep other people from running into this, it might be nice to set a default value of, say, 4096 tokens here (or maybe in self.kwargs["max_tokens"] in the constructor, though that's a bit less transparent), as sketched below.
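A rough illustration of what that default could look like (a simplified sketch, not the actual smolagents source; the attribute names are assumptions):

```python
class TransformersModel(Model):
    def __init__(self, model_id: str, max_new_tokens: int = 4096, **kwargs):
        # Give max_new_tokens a generous default instead of inheriting the
        # transformers default of 20, which truncates agent steps.
        super().__init__(**kwargs)
        self.model_id = model_id
        self.max_new_tokens = max_new_tokens
```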

Happy to submit a small pull request if needed.
