Issue #34 2026-05-08 2 min read

AI Engineering Signal #34

Anthropic published Natural Language Autoencoders

Signals

Anthropic published Natural Language Autoencoders

a method that extracts Claude's internal representations as readable text, making model internals directly interpretable without manual feature labeling or guesswork.

Web

OpenAI ships GPT-Realtime-2, Translate, and Whisper APIs

realtime voice intelligence now callable outside ChatGPT, with SOTA latency.

TechCrunch

Multi-Token Prediction lands in LLaMA.cpp

Gemma 4 inference on local hardware sees 40% speedup, usable today.

Web

Skymizer HTX301 inference card announced

384GB memory at ~240W on PCIe, squarely targeting on-prem LLM deployment.

Web

Firefox used Claude Mythos for bug hunting

April security fixes spiked dramatically, production evidence that LLMs find real vulns at scale.

Simon Willison

ZAYA1-8B matches DeepSeek-R1 on math with under 1B active parameters

MoE routing that actually delivers on the sparse-efficiency promise.

Web

CRISPR-Cas12a2 programmed to destroy cancer cells selectively

tumor volume halved after single treatment in mice, with healthy cells spared.

Web

Get signals like this in your inbox

Daily AI engineering intelligence. No noise.

[ Subscribe ]

The Take

Interpretability stopped being a research toy and started producing readable internal traces from a production model. Simultaneously, local inference got faster, cheaper hardware arrived, and LLMs proved they can harden real infrastructure. The frontier is shifting from "build bigger" to "understand, optimize, and deploy what we have."

Related Signals

2026-05-09 · general web, simon willison, tech press

AI Engineering Weekly Digest #7

2026-04-03 · simon willison, general web, tech press, github, research, community

AI Engineering Weekly #6