News

"Through extensive experimentation across diverse puzzles, we show that frontier LRMs face a complete accuracy collapse ...