The JetBrains Blog

The latest updates on all JetBrains products and topics

Igor Slinko

In training agents, we toss the whole run if the final outcome is imperfect, missing out on valuable info. To fix this, we developed Step Rejection Fine-Tuning.

The JetBrains Blog

Igor Slinko

Step Rejection Fine-Tuning: Squeezing More Signal from Noisy Agent Trajectories