New Delhi, April 26 -- OpenAI launched its GPT-5.5 model earlier this week with the aim of taking on Anthropic's recently launched Claude Opus 4.7 and Gemini 3.1 Pro models. The new model is claimed to come with massive leaps in coding capabilities along with improved agentic abilities and scientific research.
OpenAI's GPT-5.5 leads the benchmarks for agentic use and efficiency but the new model still lags behind Claude on benchmarks that require precision coding while Gemini 3.1 Pro maintains a lead in areas around academic reasoning.
Across the various benchmarks, GPT-5.5 (including its Pro variant) took the top spot in 15 categories, while Claude Opus 4.7 led in 7 evaluations, and Gemini 3.1 Pro secured 2 wins.
On Terminal-Bench 2.0...
Click here to read full article from source
To read the full article or to get the complete feed from this publication, please
Contact Us.