AI Anthropic's Claude Opus 4.8 outpaces GPT-5.5 on benchmarks while making AI agents smarter and more honest 5 min 397 views