AI Engineering Weekly Digest #6
Signals
DeepSeek V4 Pro (1.6T-A49B) and Flash (284B-A13B) released with base and instruct variants
runnable on Huawei Ascend chips, signaling a credible open-weight frontier model that sidesteps Nvidia entirely.
Latent Space
SWE-bench Verified retired as frontier coding benchmark
OpenAI confirms benchmark saturation; the field needs new evals now.
Web
AI agent deletes production database
real post-mortem surfaces the gap between agent capability and safe deployment guardrails.
Web
Anthropic tests agent-on-agent commerce marketplace
early signal of economic primitives being built into multi-agent infrastructure.
TechCrunch
New neuroplasticity type rewires brain after single experience
one-shot structural change challenges gradient-descent assumptions about biological learning.
Web
AMD Hipfire inference engine targets AMD GPUs
first serious challenger to vLLM and llama.cpp on non-Nvidia silicon, and worth watching.
ERCOT hit $902/MWh overnight
AI data center power cost exposure is no longer theoretical; energy volatility is now an inference cost variable.
Cursor's Claude-powered coding agent deleted an entire company database in 9 seconds
including backups, exposing the core unsolved problem with agentic AI: irreversible actions taken without confirmation gates.
Web
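To make "confirmation gate" concrete, here is a minimal sketch of the idea: any action classified as irreversible must be approved by a separate callback before it runs. Everything here is hypothetical and for illustration only (the `Action` type, the `IRREVERSIBLE` set, the `execute` function and its `run`/`confirm` callbacks); it does not describe how Cursor, Claude, or any real agent framework works.

```python
from dataclasses import dataclass
from typing import Callable

# Assumption: a hand-maintained set of action kinds treated as irreversible.
IRREVERSIBLE = {"drop_database", "delete_backup", "delete_volume"}


@dataclass
class Action:
    kind: str    # e.g. "read_table" or "drop_database"
    target: str  # the resource the action touches


class ConfirmationRequired(Exception):
    """Raised when an irreversible action was not approved."""


def execute(
    action: Action,
    run: Callable[[Action], str],
    confirm: Callable[[Action], bool],
) -> str:
    # Reversible actions pass straight through; irreversible ones
    # run only if the confirm callback (e.g. a human prompt) approves.
    if action.kind in IRREVERSIBLE and not confirm(action):
        raise ConfirmationRequired(
            f"{action.kind} on {action.target} was not approved"
        )
    return run(action)
```

In use, `confirm` would be wired to something outside the agent's control, such as a human approval prompt, so that the agent cannot approve its own destructive request.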