AI Engineering Signal #22
DeepSeek V4 Pro (1.6T-A49B) and Flash (284B-A13B) released with base and instruct variants
Signals
DeepSeek V4 Pro (1.6T-A49B) and Flash (284B-A13B) released with base and instruct variants
runnable on Huawei Ascend chips, signaling a credible open-weight frontier model that sidesteps Nvidia entirely.
Latent Space
SWE-bench Verified retired as frontier coding benchmark
OpenAI confirms benchmark saturation; the field needs new evals now.
Web
AI agent deletes production database
real post-mortem surfaces the gap between agent capability and safe deployment guardrails.
Web
Anthropic tests agent-on-agent commerce marketplace
early signal of economic primitives being built into multi-agent infrastructure.
TechCrunch
New neuroplasticity type rewires brain after single experience
one-shot structural change challenges gradient-descent assumptions about biological learning.
Web
AMD Hipfire inference engine targets AMD GPUs
first serious challenger to vLLM/llama.cpp on non-Nvidia silicon worth watching.
ERCOT hit $902/MWh overnight
AI data center power cost exposure is no longer theoretical; energy volatility is now an inference cost variable.
The Take
The stack is fracturing at every layer simultaneously: benchmarks are saturating, inference hardware is diversifying off Nvidia, agents are hitting production failure modes, and energy costs are spiking unpredictably. The teams that will ship reliably are the ones treating eval design, hardware portability, and power cost as first-class engineering problems — not afterthoughts.
Subscribe
Related Signals