Artificial Intelligence in Higher Education: Learning and Pedagogy (AIHELP) Network

YouTube: AI Explained – o3 wow

December 21, 2024

https://www.youtube.com/@aiexplained-official

o3 – wow

o3 is one of the biggest developments in AI in 2+ years, not because it beats a particular benchmark, but because it demonstrates a reusable technique through which almost any benchmark could fall, and at short notice. I’ll cover all the highlights, the benchmarks broken, and what comes next. Plus, the costs OpenAI didn’t want us to know, Genesis, ARC-AGI 2, Gemini-Thinking, and much more.

00:00 – Introduction
01:19 – What is o3?
03:18 – FrontierMath
05:15 – o4, o5
06:03 – GPQA
06:24 – Coding, Codeforces + SWE-verified, AlphaCode 2
08:13 – 1st Caveat
09:03 – Compositionality?
10:16 – SimpleBench?
13:11 – ARC-AGI, Chollet
20:25 – Safety Implications

aihelpnetwork_admin
