OpenAI’s DeepResearch can complete 26% of ‘Humanity’s Last Exam’ — a benchmark for the frontier of human knowledge
OpenAI’s o1 and DeepSeek’s R1 models, which previously sat atop the leaderboard, could only get through roughly 9% of the exam.
![OpenAI’s DeepResearch can complete 26% of ‘Humanity’s Last Exam’ — a benchmark for the frontier of human knowledge](https://fortune.com/img-assets/wp-content/uploads/2025/02/GettyImages-2198379368-e1739310956573.jpg?w=2048#)
Feb 12, 2025