
news
SWE-bench Is Dead. Here's What Your AI Coding Tool Actually Competes On.
10,000 developers confirm benchmark scores don't predict satisfaction. The real differentiator — context strategy — has no leaderboard at all.

news
OpenAI Didn't Win the AI Race — It Bought the Scoreboard
In seven weeks, OpenAI discredited SWE-bench, acquired Promptfoo, and wrapped every rival model in its SDK. Three defensible moves that add up to vertical integration of the entire AI evaluation stack.