feat: Add support for qwen3-thinking models via Function Calling #59

antonshalin76 · 2025-11-07T10:26:26Z

Add Support for Qwen3-Thinking Models via Function Calling

Overview

This PR introduces comprehensive support for qwen3-thinking models, which do not support Structured Output (SO) but work excellently through Function Calling (FC). The implementation maintains full compatibility with the SGR framework while enabling the use of thinking models that provide intermediate reasoning in their outputs.

Key Features

1. Universal Pydantic to Function Calling Converter

File: sgr_deep_research/core/utils/pydantic_convert.py
Automatic JSON Schema generation from Pydantic models
Support for complex types: Literal, Optional, Union, List, Dict
Handles nested models and constraints (min/max, length, pattern)

2. Qwen3-Thinking Response Adapter

File: sgr_deep_research/core/adapters/qwen3_thinking_adapter.py
Three-strategy extraction from "dirty" thinking model outputs:
- Strategy 1: tool_calls with JSON in arguments
- Strategy 2: content with <tool_call>...</tool_call> tags (priority)
- Strategy 3: content with raw JSON (fallback)
Robust parsing with detailed error diagnostics

3. SGRQwen3ThinkingAgent

File: sgr_deep_research/core/agents/sgr_qwen3_thinking_agent.py
Full-featured SGR agent adapted for thinking models
Uses Function Calling instead of Structured Output
Modified system prompt with thinking model instructions
Maintains complete SGR architecture: reasoning → action → evaluation

Documentation & Examples

Comprehensive Documentation: docs/QWEN3_THINKING_SUPPORT.md
- Detailed component descriptions
- Configuration examples
- Usage patterns
- Troubleshooting guide
- Performance comparison with instruct models
Practical Examples: examples/qwen3_thinking_example.py
- Basic usage
- Clarification handling
- Configuration loading

️ Architecture

SGRQwen3ThinkingAgent
│
├── Pydantic → FC Conversion (pydantic_convert.py)
│   └── Auto-generates OpenAI Function Calling schema
│
├── Modified System Prompt
│   └── Includes thinking model instructions + dynamic schema
│
├── Reasoning Phase (Function Calling)
│   └── LLM generates reasoning + tool call
│
├── Response Extraction (qwen3_thinking_adapter.py)
│   └── Extracts structured data from mixed output
│
└── Action & Evaluation
    └── Standard SGR flow

Design Decisions

Function Calling over Structured Output: Thinking models don't support SO, but FC provides equivalent functionality while preserving reasoning visibility
Multi-Strategy Extraction: Thinking models can output in various formats depending on vLLM configuration - the adapter handles all cases gracefully
Modified System Prompt: Incorporates base prompt from config + schema + thinking-specific instructions
Full SGR Compatibility: Maintains all SGR agent features (clarifications, planning, tool selection, etc.)

Backward Compatibility

This PR:

Does not modify existing agents or tools
Adds new optional components
Works alongside standard SGR agents
Uses existing configuration system from agents-config-definitions branch

Configuration

config.yaml

llm:
  model: "Qwen/Qwen3-14B-thinking"
  temperature: 0.3
  max_tokens: 12000

agents.yaml

agents:
  - name: "qwen3_thinking_agent"
    base_class: "SGRQwen3ThinkingAgent"
    llm:
      model: "Qwen/Qwen3-14B-thinking"
    tools:
      - "WebSearchTool"
      - "CreateReportTool"
      - "FinalAnswerTool"

virrius and others added 16 commits October 28, 2025 23:13

khe

8b940b9

valera moment

948747f

Update requirements.txt

d28f7e6

Refactor services and registry, improve agent config

e906c6c

Remove display_name and description from AgentDefinition

56fe1ab

Refactor agent config to support generic LLM clients

ee23ed8

Update agent_factory.py

1635350

Update models.py

85571f5

Update agent_factory.py

68bb7dd

Delete agents.yaml

28a7db9

Refactor config system and agent initialization

137b853

Refactor prompt handling and update config structure

467aa51

Refactor agent config, prompt loading, and MCP tool handling

21dff2e

Update agent config merging

dd3c8a8

Update endpoints.py

bcb0186

feat: Add support for qwen3-thinking models via Function Calling

6c286d0

EvilFreelancer requested review from vakovalskii and virrius November 11, 2025 10:34

virrius force-pushed the agents-config-definitions branch from b2e2591 to a831db1 Compare November 11, 2025 14:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add support for qwen3-thinking models via Function Calling #59

feat: Add support for qwen3-thinking models via Function Calling #59

Uh oh!

antonshalin76 commented Nov 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: Add support for qwen3-thinking models via Function Calling #59

Are you sure you want to change the base?

feat: Add support for qwen3-thinking models via Function Calling #59

Uh oh!

Conversation

antonshalin76 commented Nov 7, 2025

Add Support for Qwen3-Thinking Models via Function Calling

Overview

Key Features

1. Universal Pydantic to Function Calling Converter

2. Qwen3-Thinking Response Adapter

3. SGRQwen3ThinkingAgent

Documentation & Examples

️ Architecture

Design Decisions

Backward Compatibility

Configuration

config.yaml

agents.yaml

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants