Cluster1 sources· last seen 2h ago· first seen 2h ago
I ran 8 open-weight models as agents in a persistent MMO for 10 days. Here's the 93k event dataset and some things that I learned
Howdy everyone! Quick disclosure: I work on this - it's a project my studio created called the Null Epoch. I wasn't really happy with testing my agents with the usual static benchmarks and I wanted to learn more about how models and agents handle long-horizon planning, resource contention, and adve
Lead: r/LocalLLaMABigness: 23ranopen-weightagentspersistentmmo
📡 Coverage
10
1 news source
🟠 Hacker News
0
🔴 Reddit
52
45 upvotes across 1 sub
📈 Google Trends
0
Full methodology: How scoring works
Receipts (all sources)
I ran 8 open-weight models as agents in a persistent MMO for 10 days. Here's the 93k event dataset and some things that I learned
REDDIT · r/LocalLLaMA · 2h ago · ⬆ 45 · 💬 17
score 121
Howdy everyone! Quick disclosure: I work on this - it's a project my studio created called the Null Epoch. I wasn't really happy with testing my agents with the usual static benchmarks and I wanted to learn more about how models and agents handle long-horizon planning, resource contention, and adve