• 11 min read 11 min • 2345 words 2,345 words
gpumod Mode Switches, Driver Hangs, and Landing on Qwen3.6 MTP for Hermes
How a series of silent freezes during gpumod mode switches led me through CUDA pinned-memory debugging, four layers of defense, and ultimately to Qwen3.6-35B-A3B MTP with preserve_thinking as the new Hermes-agent default.