
deepseek-r1:32b or 14b almost cannot work in Roo Cline because it never calls the tools, leading to continuous errors #28

Open
DoiiarX opened this issue Jan 21, 2025 · 5 comments

Comments

@DoiiarX

DoiiarX commented Jan 21, 2025

deepseek-r1:32b and 14b almost cannot work in Roo Cline because the model never calls the tools, leading to continuous errors.

@LYouC

LYouC commented Jan 22, 2025

I also ran into this issue; it may be a limitation of the model's capability. I tried qwen2-coder-32b and it worked fine in Roo Cline, whereas deepseek-r1:32b would report an error after several rounds of invalid conversation.

@leobarcellos

I could not make it work even with deepseek-r1:70b.

@jp-gorman

I have the same issue using Cline with the R1-32B model running locally on Ollama, using the Ollama-published R1 model.

@Zodaztream

Zodaztream commented Jan 22, 2025

For me the issue was context length; Ollama's default context length is around 4K. You can fix this by creating a Modelfile with the following:

FROM deepseek-r1:14b
PARAMETER num_ctx 32768

and then create a model from it with a unique name using ollama create. Choose this model in Cline and it should start working.
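
For reference, a minimal sketch of that step (the model name deepseek-r1-14b-32k is just an example, and the Modelfile is assumed to be the one above, saved in the current directory):

# Build a new model from the Modelfile, then select it in Cline.
ollama create deepseek-r1-14b-32k -f Modelfile
# Optional sanity check from the terminal:
ollama run deepseek-r1-14b-32k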

You can also use LM Studio, which lets you increase the context length by holding Alt while hovering over the model before loading it.

These two approaches will make it work.

@jwadow

jwadow commented Jan 22, 2025

> For me the issue was context length; Ollama's default context length is around 4K. You can fix this by creating a Modelfile with the following:
>
> FROM deepseek-r1:14b
> PARAMETER num_ctx 32768
>
> and then create a model from it with a unique name using ollama create. Choose this model in Cline and it should start working.

Yes, thank you, it worked and got much better. For 14B I set num_ctx to 25000 and it can complete a simple task, albeit after a long time.
7B, and especially 1.5B, cannot complete the simple task even with num_ctx 32768.

The R1 models seem not to understand that they need to work with the open active file and replace one variable there (in "Code" mode).

But there are still some technical shortcomings in Roo Cline:

  1. The model constantly emits visible tags like <tool_use> and <ask_followup_question>.
  2. For some reason, the beginning of the answer takes a very long time. I suspect the cause is a <think> block that is invisible to the user; if so, it would be nice to show a "Thinking" indicator (see the sketch after this list).
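
As a stopgap until the UI hides reasoning, here is a minimal shell sketch (my own workaround, not part of Roo Cline) that strips a well-formed <think>...</think> block from raw model output:

# Assumes the reasoning block is properly closed; pipe the model output through perl.
ollama run deepseek-r1:14b "your prompt" | perl -0777 -pe 's/<think>.*?<\/think>\s*//s'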

Oh, I just noticed that this is not the Roo Cline repo but the model creator's repo.
