[BUG] SpeechToTextTool not working #473

ismael-dm · 2025-02-02T22:53:47Z

Describe the bug
When I try to use the SpeechToTextTool, it fails

Code to reproduce the error

from smolagents import (
    CodeAgent,
    OpenAIServerModel,
    SpeechToTextTool,
)

model = OpenAIServerModel(
    api_base="http://localhost:8000/v1",
    model_id="Qwen/Qwen2.5-3B-Instruct",
    api_key="-",
)

agent = CodeAgent(
    tools=[SpeechToTextTool()],
    model=model,
    add_base_tools=True,
)

agent.run(
    "What does this audio say?",
    additional_args={
        "mp3_sound_file_url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/recording.mp3"
    },
)

Error logs (if any)

Traceback (most recent call last):
  File "/home/ubuntu/smolagents/attachments.py", line 14, in <module>
    tools=[SpeechToTextTool()],
  File "/home/ubuntu/.local/lib/python3.10/site-packages/smolagents/default_tools.py", line 270, in __new__
    return super().__new__()
TypeError: object.__new__(): not enough arguments

Expected behavior
I expect it to work as other tools work

Packages version:
smolagents==1.7.0

Additional context
No required additional context

The text was updated successfully, but these errors were encountered:

ismael-dm added the bug Something isn't working label Feb 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] SpeechToTextTool not working #473

[BUG] SpeechToTextTool not working #473

ismael-dm commented Feb 2, 2025

[BUG] SpeechToTextTool not working #473

[BUG] SpeechToTextTool not working #473

Comments

ismael-dm commented Feb 2, 2025