
Add Llama.cpp model capability #450

Open · wants to merge 3 commits into main

Conversation

@ryantzr1 commented Jan 31, 2025

Closes #449

Description

This pull request introduces the LlamaCppModel class to the smolagents library, enabling integration with llama.cpp models. This lets users run large language models locally through llama.cpp's optimized inference while keeping the full smolagents agent workflow.

Features Added:

  • LlamaCppModel Class:
    A new class to interact with llama.cpp models, supporting both local model loading and Hugging Face repository integration.

  • Parameter Handling:
    Comprehensive parameter management, including GPU layers, context size, and maximum token generation.

  • Conditional Tool Integration:
    Tool definitions are passed to the model only when tools are explicitly provided, avoiding unnecessary prompt overhead.

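To illustrate the parameter handling described above, here is a minimal, hypothetical sketch of how the constructor might route its arguments. The names mirror llama-cpp-python's Llama API (Llama(model_path=...) and Llama.from_pretrained(repo_id=..., filename=...)), but the backend is stubbed out so only the routing logic is shown; this is not the actual implementation from the PR.

```python
class LlamaCppModelSketch:
    """Hypothetical sketch: route constructor args to a local file or a HF repo."""

    def __init__(self, model_path=None, repo_id=None, filename=None,
                 n_gpu_layers=0, n_ctx=8192, max_tokens=1024):
        if model_path is None and (repo_id is None or filename is None):
            raise ValueError("Provide either model_path or repo_id + filename.")
        # Cap on tokens generated per call, passed through to the backend.
        self.max_tokens = max_tokens
        # Loading options shared by both code paths (GPU offload, context size).
        self.load_kwargs = {"n_gpu_layers": n_gpu_layers, "n_ctx": n_ctx}
        if model_path is not None:
            # Local GGUF file: llama-cpp-python would use Llama(model_path=...)
            self.source = ("local", model_path)
        else:
            # Hub download: Llama.from_pretrained(repo_id=..., filename=...)
            self.source = ("hub", f"{repo_id}/{filename}")


model = LlamaCppModelSketch(
    repo_id="bartowski/Qwen2.5-7B-Instruct-1M-GGUF",
    filename="Qwen2.5-7B-Instruct-1M-IQ2_M.gguf",
    n_ctx=8192,
)
print(model.source[0])  # hub
```

The key design point is that exactly one model source is accepted, so a misconfigured call fails fast at construction time rather than deep inside the backend loader.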
Motivation and Context

Integrating llama.cpp models into smolagents addresses the demand for efficient, resource-conscious local inference. This addition lets users benefit from llama.cpp's quantized GGUF models while keeping the flexibility and tooling that smolagents offers.

How Has This Been Tested?

  • Integration Tests:
    Tested the LlamaCppModel within a CodeAgent to ensure seamless interaction and tool usage.

Example Usage from text_to_sql.py:

from smolagents import LlamaCppModel, CodeAgent
from smolagents.tools import SQLTool  # Assume SQLTool is predefined

# Initialize the SQL tool
sql_engine = SQLTool(...)

# Initialize the LlamaCppModel
model = LlamaCppModel(
    repo_id="bartowski/Qwen2.5-7B-Instruct-1M-GGUF",
    filename="Qwen2.5-7B-Instruct-1M-IQ2_M.gguf",
    n_ctx=8192,
    max_tokens=8192,
)

# Create the CodeAgent with the SQL tool and LlamaCppModel
agent = CodeAgent(
    tools=[sql_engine],
    model=model,
)

# Run the agent with a prompt; CodeAgent.run returns the final answer directly
response = agent.run("Can you give me the name of the client who got the most expensive receipt?")
print(response)
# Output: "The client with the most expensive receipt is Woodrow Wilson."

Successfully merging this pull request may close these issues.

Feature Request: Add LlamaCppModel Support to smolagents