On a 2.0 terminal benchmark, OpenAI’s model scores about 10% higher, guiding users toward stronger results on long, complex ...
Add Decrypt as your preferred source to see more of our stories on Google. Anthropic released Claude Sonnet 4.5, calling it the best coding model yet. The model scored 77.2% on SWE-bench Verified, ...