3 points | by dcre 6 hours ago
1 comment
A new Mythos checkpoint improves significantly on the previous one (and beats GPT-5.5-Cyber) on this benchmark.