MLX model support #300

g-eoj · 2025-01-21T20:35:34Z

The goal of this PR is to enable users to run smolagents with models loaded onto Apple silicon with mlx-lm. The mlx-community has made available many models for experimentation. Personally I find running locally to be a convenient way to learn and experiment with the smolagents library, so I made this PR for a possible new feature.

Example usage:

from smolagents.models import MLXModel

mlx_model = MLXModel("mlx-community/Qwen2.5-Coder-32B-Instruct-4bit", max_tokens=10000)
messages = [{"role": "user", "content": "Explain quantum mechanics in simple terms."}]
print(mlx_model(messages))

~~Some questions:~~

~~tests won't work for CICD due to hardware requirements, what is the preferred way to handle that?~~
~~anything needed for docs and if so where should it go?~~

g-eoj and others added 3 commits January 21, 2025 12:09

Add MLX model support

d4bb38f

Add MLX model test

aa6f075

Merge main and refactor

662a481

g-eoj force-pushed the g-eoj/mlx-model-support branch from b193f27 to 662a481 Compare January 22, 2025 16:50

g-eoj added 5 commits January 22, 2025 09:19

Skip mlx tests if not on macOS

d52ea23

Fix accidental reformat

8f909a0

Add docs

ea960fc

Fix typos

fb2fb1f

Pass kwargs to base class

8a36c23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MLX model support #300

MLX model support #300

g-eoj commented Jan 21, 2025 •

edited

Loading

MLX model support #300

Are you sure you want to change the base?

MLX model support #300

Conversation

g-eoj commented Jan 21, 2025 • edited Loading

g-eoj commented Jan 21, 2025 •

edited

Loading