reddit/r/localllm test Output Length Constrained Summarization using GRPO on tiny LLMs | smolcluster u/East-Muffin-6472 2026-05-26 · 10:59 UTC