reddit/r/localllama test Shard - getting to 10× KV cache compression u/Thrumpwart 2026-05-26 · 04:04 UTC