4 days ago · 2 min read
GPT-5.5 outperforms and hallucinates
OpenAI’s latest flagship model seems to tell two stories at once. On the one hand, it pushes the frontier again, setting new state-of-the-art results across important benchmarks for knowledge work, agentic coding, computer use, and abstract visual reasoning. On the other hand, it appears to struggle with one of the most important skills for real-world AI systems: knowing when not to answer. GPT-5.5 is clearly powerful. It tops the Artificial Analysis Intelligence Ind















