Qwythos-9B v3 Released: Optimizing Chat Templates for Agentic Reasoning

EmperoAI has released version 3 of Qwythos-9B, specifically addressing critical failures in agentic harnesses by refining preserved and adaptive thinking mechanisms within the model's chat template.

Addressing Agentic Harness Performance

The release of Qwythos-9B v3 comes as a direct response to observed instabilities in agentic workflows. According to the developers, previous iterations exhibited significant issues when deployed in agentic harnesses, where the model's ability to maintain consistent reasoning chains was compromised.

Improvements in Adaptive Thinking

The core of the v3 update focuses on the chat template, specifically targeting "preserved and adaptive thinking." In the context of Large Language Models (LLMs), these mechanisms are crucial for the model's ability to maintain a coherent internal monologue or "Chain of Thought" (CoT) while adapting its strategy based on environmental feedback—a requirement for successful autonomous agent behavior.

The developers describe the improvement as a "night and day difference," suggesting that the updated template significantly enhances the model's reliability and reasoning capabilities during complex task execution.

Deployment and Availability

To implement these fixes, users are advised to redownload the model weights. The update is available in both GGUF (for quantized deployment via llama.cpp) and Safetensors (for full-precision or HF-based loading) formats.

Note: Due to the brevity of the source announcement, specific architectural changes beyond the chat template adjustments were not disclosed.

Original Source

LLM Agentic AI Qwythos-9B Chat Templates Reasoning

Techyon

Qwythos-9B v3 released! We have noticed some issues in agentic harnesses due to issues with preserved and adaptive thinking in the chat template. Its a night and day difference, please redownload the GGUF / Safetensor.

Qwythos-9B v3 Released: Optimizing Chat Templates for Agentic Reasoning

Addressing Agentic Harness Performance

Improvements in Adaptive Thinking

Deployment and Availability

Qwythos-9B v3 released! We have noticed some issues in agentic harnesses due to issues with preserved and adaptive thinking in the chat template. Its a night and day difference, please redownload the GGUF / Safetensor.

Qwythos-9B v3 Released: Optimizing Chat Templates for Agentic Reasoning

Addressing Agentic Harness Performance

Improvements in Adaptive Thinking

Deployment and Availability

Related Articles

Liquid AI Ships LFM2.5-230M with llama.cpp, MLX, vLLM, SGLang, and ONNX Support for On-Device Inference

Qwen3.5-9B on RTX 5060 8GB VRAM: The llama.cpp settings + quants that finally made reliable local agents work

AI Technology's Moat Crisis: Why Anthropic's $1T Bet Is Leaking Through Its Own API

Asian AI startups launch Mythos-like models

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies