Is GPT-5.1 Really an Upgrade? But Models Can Auto-Hack Govts, so … there’s that

AI Explained · Beginner ·📰 AI News & Updates ·6mo ago

Skills: Staying Current in AI90%AI Alignment Basics80%

A lot just got released in the last 36 hours, and it will all affect hundreds of millions of people. 10 details you would miss if you just read the headlines, from GPT 5.1 regressions, to how Claude hacked Govt Agencies, to SIMA 2, and Musical Turing Tests. https://assemblyai.com/aiexplained Chapters: 00:00 - Introduction 00:56 - GPT 5.1 Smarter? 01:47 - Some Regressions 03:22 - Sycophancy? 05:22 - Claude Auto-Hacking 06:16 - Jailbreaking through Granularity 08:22 - This Will be Re-used 09:30 - Hallucinating Hacker 09:57 - Surprisingly Neutral Tone 12:18 - SIMA 2 14:10 - Alpha Parallels 17:24 - AI Music AI Insiders ($9!): https://www.patreon.com/AIExplained GPT 5.1 Announcement: https://openai.com/index/gpt-5-1/ System Card: https://cdn.openai.com/pdf/4173ec8d-1229-47db-96de-06d87147e07e/5_1_system_card.pdf Benchmarks: https://openai.com/index/gpt-5-1-for-developers/ Simple Bench: https://lmcouncil.ai/benchmarks Auto-Hacking: https://x.com/AnthropicAI/status/1989033793190277618 https://www.anthropic.com/news/disrupting-AI-espionage Report: https://assets.anthropic.com/m/ec212e6566a0d47/original/Disrupting-the-first-reported-AI-orchestrated-cyber-espionage-campaign.pdf Sima 2 Announcement: https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d-worlds/ https://x.com/amoufarek/status/1988986075331858693 Scepticism: https://www.technologyreview.com/2025/11/13/1127921/google-deepmind-is-using-gemini-to-train-agents-inside-goat-simulator-3/ Voyager: https://voyager.minedojo.org/ Reuters Music: https://www.reuters.com/legal/litigation/are-you-listening-bots-survey-shows-ai-music-is-virtually-undetectable-2025-11-12/ https://lmcouncil.ai Non-hype Newsletter: https://signaltonoise.beehiiv.com/ Podcast: https://aiexplainedopodcast.buzzsprout.com/

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: Staying Current in AI

View skill →

The biggest mistake developers make in their resumes

The biggest mistake developers make in their resumes

THIS is why a CS degree won't get you a coding job

THIS is why a CS degree won't get you a coding job

Recon-ng - Introduction And Installation

Recon-ng - Introduction And Installation

The Ultimate Home Assistant Backup Guide (Google Drive, OneDrive, Dropbox & Cloudflare R2)

The Ultimate Home Assistant Backup Guide (Google Drive, OneDrive, Dropbox & Cloudflare R2)

Recon-ng - Generating Reports

Recon-ng - Generating Reports

How can I be notified when my name is mentioned on the web?

How can I be notified when my name is mentioned on the web?

Google Search Central

Related AI Lessons

Structure, Not Prophecy

Gemini for Science reveals the importance of structure in AI research, making complex concepts visible in days

4 Things You Must Start Doing With AI To Stay Relevant

Stay relevant in the AI era by adopting 4 key strategies

Medium · ChatGPT

Meta Is in Crisis, Google Search’s Makeover, and AI Gets Booed by Graduates

Learn about Meta's crisis, Google Search's makeover, and AI's backlash from graduates, and how these events impact the tech industry

What I Do Between Biotech Jobs, Part 1: The 20-Line Script That Outsmarted an AI

Learn how a 20-line script outsmarted an AI in biotech, and discover the potential of creative problem-solving in the industry

Chapters (12)

Introduction

0:56 GPT 5.1 Smarter?

1:47 Some Regressions

3:22 Sycophancy?

5:22 Claude Auto-Hacking

6:16 Jailbreaking through Granularity

8:22 This Will be Re-used

9:30 Hallucinating Hacker

9:57 Surprisingly Neutral Tone

12:18 SIMA 2

14:10 Alpha Parallels

17:24 AI Music

OpenAI: $2M in tokens to every YC company in the spring and summer batches.