Skip to content

[BUG] (Some) Tool calls fail with Qwen3-Coder-30B-A3B-Instruct on llama.cpp #10780

@robertjmcintyre

Description

@robertjmcintyre

Problem (one or two sentences)

I started a new task, and tried to run this message:

What version of dotnet is installed?

The request fails the tool call (see the message below in the repro)

However (!), without restarting VSCode, the Roo extension, or my llama-server, if I run some other command that interacts with the model (needs to read a file, or check git history, for example) then in the same task session ask the dotnet version question it works as expected.

I'm testing tool calling with this model (Qwen3-Coder-30B-A3B-Instruct-UD-Q8_K_XL.gguf) which is how I came across it; I just wanted a very simple command to try.

However (!), I can do the same test with GPT-OSS-120B and GLM-4.5-Air (both are unsloth gguf's) and it works completely as expected. So it seems to be something with Qwen3-Coder.

Context (who is affected and when)

Happens to users running qwen3-coder locally when they start a new task and make requests that need to execute_command.

Reproduction steps

  1. System information:

VSCode Info:
Version: 1.108.0
Commit: 94e8ae2b28cb5cc932b86e1070569c4463565c37
Date: 2026-01-08T13:53:10.781Z
Electron: 39.2.7
ElectronBuildId: 12953945
Chromium: 142.0.7444.235
Node.js: 22.21.1
V8: 14.2.231.21-electron.0
OS: Linux x64 6.18.3-200.fc43.x86_64

Roo Code Info:
Version: 3.41.0 (0401c25c)
  1. Start llama-server loading unsloth Qwen3-Coder-30B-A3B-Instruct-UD-Q8_K_XL.gguf
  2. Open a workspace/folder and start a new Roo task
  3. Run the request what version of dotnet is installed
  4. Get these responses
I'll check what version of .NET is installed on the system using execute_command.

<function=execute_command>
<parameter=command>
dotnet --version
</parameter>
<parameter=cwd>
/home/robert/repos/AgentFramework
</parameter>

And details:

Date/time: 2026-01-16T16:13:39.589Z
Extension version: 3.41.0
Provider: openai (proxy)
Model: /models/unsloth_Qwen3-Coder-30B-A3B-Instruct/Qwen3-Coder-30B-A3B-Instruct-UD-Q8_K_XL.gguf

The model provided text/reasoning but did not call any of the required tools. This usually indicates the model misunderstood the task or is having difficulty determining which tool to use. The model has been automatically prompted to retry with proper tool usage.
  1. Start a new Roo task and run the request what files have been edited but not committed?
  2. Watch the request succeed
  3. Try the same thing with other models and watch it not repro

Expected result

The command runs successfully

Actual result

the tool call fails

Variations tried (optional)

Tried different models, and they seem to work as expected.

App Version

Version: 3.41.0 (0401c25)

API Provider (optional)

OpenAI Compatible

Model Used (optional)

Qwen3-Coder-30B-A3B-Instruct-UD-Q8_K_XL.gguf

Roo Code Task Links (optional)

No response

Relevant logs or errors (optional)

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions