Cluster · 1 source · last seen 3h ago · first seen 3h ago

Running SmolLM2‑360M on a Samsung Galaxy Watch 4 (380MB RAM) – 74% RAM reduction in llama.cpp

I’ve got SmolLM2‑360M running on a Samsung Galaxy Watch 4 Classic (about 380MB free RAM) by tweaking llama.cpp and the underlying ggml memory model. By default, the model was being loaded twice in RAM: once via the APK’s mmap page cache and again via ggml’s tensor allocations, peaking at 524MB for a…
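The excerpt cuts off before the fix, but the mechanism it names is clear: with mmap the GGUF’s pages sit in the OS page cache, and if ggml then copies tensor data into its own buffers, the weights exist in RAM twice. A 74% cut from the 524MB peak would land near 136MB, comfortably inside the watch’s ~380MB of free RAM. Below is a minimal sketch, not the OP’s actual patch, of the relevant llama.cpp knob: `use_mmap` keeps tensor data backed by the mapped file instead of duplicating it into ggml allocations. The model filename, quantization, and context size are assumptions, and exact API names vary between llama.cpp versions.

```c
// Minimal sketch, not the OP's patch: load a GGUF via llama.cpp's C API
// so tensor data stays backed by the mmap'd file (shared with the OS
// page cache) instead of being duplicated into ggml-allocated buffers.
// Filename, quant, and n_ctx are assumptions; API names follow recent llama.h.
#include <stdio.h>
#include "llama.h"

int main(void) {
    llama_backend_init();

    struct llama_model_params mp = llama_model_default_params();
    mp.use_mmap  = true;   // read weights straight from the mapped file
    mp.use_mlock = false;  // don't pin pages; let the kernel evict under pressure

    struct llama_model *model =
        llama_model_load_from_file("smollm2-360m-instruct-q4_k_m.gguf", mp);
    if (!model) {
        fprintf(stderr, "failed to load model\n");
        return 1;
    }

    struct llama_context_params cp = llama_context_default_params();
    cp.n_ctx = 512;  // small context keeps the KV cache tiny on a ~380MB budget

    struct llama_context *ctx = llama_init_from_model(model, cp);
    if (!ctx) {
        llama_model_free(model);
        return 1;
    }

    // ... tokenize, decode, sample ...

    llama_free(ctx);
    llama_model_free(model);
    llama_backend_free();
    return 0;
}
```

A side effect worth noting: with `use_mmap` the weights are read-only and file-backed, so under memory pressure the kernel can simply drop and re-fault those pages rather than killing the process, which is part of what makes a ~380MB budget workable at all.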

Lead: r/LocalLLaMA · Bigness: 19 · running-smollm2-360m-samsung-galaxy
📡 Coverage: 10 (1 news source)
🟠 Hacker News: 0
🔴 Reddit: 40 (19 upvotes across 1 sub)
📈 Google Trends: 0
Full methodology: How scoring works

Receipts (all sources)
