Cluster1 sources· last seen 2h ago· first seen 2h ago

Qwen3.6-27B at 72 tok/s on RTX 3090 on Windows using native vLLM (no WSL, no Docker), portable launcher and installer

The angle here is native Windows, no WSL. Simple installation, open source, no telemetry. Not selling or promoting anything: https://github.com/devnen/qwen3.6-windows-server **Numbers (RTX 3090, Windows 10):** - 72 tok/s short prompt - 64.5 tok/s long prompt (~25k tokens) - 53.4 tok/s at 127k ctx (

Lead: r/LocalLLaMABigness: 23qwen36-27btokrtx3090

Open primary source

📡 Coverage

1 news source

🟠 Hacker News

🔴 Reddit

47 upvotes across 1 sub

📈 Google Trends

Full methodology: How scoring works

Receipts (all sources)

Qwen3.6-27B at 72 tok/s on RTX 3090 on Windows using native vLLM (no WSL, no Docker), portable launcher and installer

REDDIT · r/LocalLLaMA · 2h ago · ⬆ 47 · 💬 21

score 122