Newsletter archive

Google I/O 2026: The Agentic Era Is Here

Google didn't just show off new models at I/O 2026, it announced a full-stack shift toward AI that acts, not just answers. Gemini 3.5, Gemini Spark, smart glasses, a new OS approach, and the biggest Search upgrade in 30 years. Here's everything that dropped and why it matters.

May 20, 2026 7 min →

OpenAI's Voice Leap: GPT-Realtime-2 and the End of Call-and-Response AI

OpenAI didn't just release a new voice model today, it released three. Together, they move realtime audio from simple Q&A toward voice interfaces that can reason, translate, and act. Here's everything announced and why it changes the game for developers.

May 7, 2026 6 min →

Code with Claude 2026: The Agentic Era Is Now Infrastructure

Anthropic's developer event wasn't about a new model. It was about shipping the scaffolding that makes long-running, self-improving AI agents actually work in production. Here's everything revealed and why it matters.

May 7, 2026 7 min →

GTC 2026: The Agentic AI Era Is No Longer a Vision, It's Infrastructure

NVIDIA's biggest GTC yet wasn't really about GPUs. It was about the full stack being assembled to make autonomous AI agents production-ready. Here's what that means and why it matters.

Mar 22, 2026 8 min →

GLM-OCR Is Beating Models 10x Its Size - Here's How It Actually Works

A 0.9B model from Zhipu AI just posted state-of-the-art document OCR scores. I read the paper to understand the architecture, the training tricks, and why this is a bigger deal than the benchmarks suggest.

Mar 16, 2026 9 min →

GPT-5.4 Is Here: OpenAI's Most Capable Model Yet Just Crossed a New Threshold

OpenAI dropped GPT-5.4 on March 5, 2026, and it's not just a bump in the version number. This is the first general-purpose AI that can operate your computer, reason through a million tokens of context, and outperform office workers at their own jobs 83% of the time.

Mar 6, 2026 8 min →

The Bullshit Benchmark V2: Why Teaching AI to Say 'This Makes No Sense' Is a Breakthrough

Most AI benchmarks test what models know. The Bullshit Benchmark V2 tests something more important, whether they know when a question deserves no answer at all.

Mar 4, 2026 6 min →

Seedance 2.0 Is Going Viral - Here's How AI Video Models Actually Work

Seedance 2.0 is everywhere this week. I read the 1.0 technical report to understand how these models are built - and how they compare to LLMs.

Feb 25, 2026 7 min →