| Hardware: 3060 / 12 GB | Qwen 3.5 9B I've tried, making the system prompt smaller. Obviously, the paradox of thinking when it's not worth thinking is in effect but anyway. I've hijacked the prompt to create a reasoning within the reasoning to force immediate response but it's still not working as it takes 39.8 for a Hey and 2.5 seconds for the Stein or Quantum Mechanics. I've read to put in the system prompt that it is confident, but does anyone have any other way. [link] [comments] |
Qwen 3.5 Thinking Anxiety
Reddit r/LocalLLaMA / 3/15/2026
💬 OpinionTools & Practical UsageModels & Research
Key Points
- A Reddit post documents running Qwen 3.5 (9B) on a system with a 3060 12 GB GPU and describes the model's behavior under attempts to induce thinking.
- The author reports experimenting with system prompt tweaks and even attempts to hijack the prompt to force internal reasoning, but response latency remains high.
- The user asks for other methods to reduce thinking anxiety and mentions trying to make the system prompt declare it is confident.
- The post links to related content and credits the author.
Related Articles

Manus、AIエージェントをデスクトップ化 ローカルPC上でファイルやアプリを直接操作可能にのサムネイル画像
Ledge.ai

The programming passion is melting
Dev.to

Best AI Tools for Property Managers in 2026
Dev.to

Building “The Sentinel” – AI Parametric Insurance at Guidewire DEVTrails
Dev.to

Maximize Developer Revenue with Monetzly's Innovative API for AI Conversations
Dev.to