I was testing my agent built on Koog, JetBrains' open-source framework for building AI agents in Kotlin. I fed it a task from SWE-bench-Verified, a benchmark of real-world GitHub issues that tests whether AI can actually write code.
For the first 100 messages, everything looked promising. The agent methodicall…