NEW Grok 4.3 TESTED: Needs Multiple Iterations

Discover AI · Advanced ·📰 AI News & Updates ·2w ago
In this video I perform my causal reasoning test (an elevator test) on the newly released Grok 4.3 to evaluate its reasoning capabilities for unpublished complex reasoning and scientific tasks. Complete YouTube playlist of my test available here https://www.youtube.com/playlist?list=PLgy71-0-2-F0Rla8lu5ZldpYQUfXM_5bT 00:00 New Grok 4.3 01:33 Live test (arena.ai) 05:46 Grok 4.3 FAILS 07:47 2nd run Grok 4.3 12:44 First result by Grok 4.3 14:00 Validation run 15:43 Optimization run Grok 4.3 #grok #grokai #nextgenai #aitesting
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

AI, Actually #2: Shall Mediocre Inherit the Earth?
Researchers studied AI's impact on work with BCG consultants, revealing potential future job market changes
Medium · AI
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Big Tech firms are investing heavily in AI, driving growth and transformation, while prioritizing safety and responsible adoption
Dev.to AI
Navigating the AI Revolution with Tom Resing
Learn how AI impacts UX design and content creation with expert Tom Resing
Medium · UX Design
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Big Tech firms are investing billions in AI, driving growth and transformation, while prioritizing safety and responsible adoption
Dev.to AI

Chapters (7)

New Grok 4.3
1:33 Live test (arena.ai)
5:46 Grok 4.3 FAILS
7:47 2nd run Grok 4.3
12:44 First result by Grok 4.3
14:00 Validation run
15:43 Optimization run Grok 4.3
Up next
SoftBank’s $60 Billion OpenAI Bet Sparks Concerns
Bloomberg Technology
Watch →