Anthropic Managed Agents + OpenAI buys Ona
ZeroFS mounts any S3‑compatible bucket as a POSIX filesystem via NFS, 9P, FUSE or NBD. Its log‑structured engine writes immutable objects, compresses and encrypts data, and compacts deletions, while a local cache gives microsecond reads. It passes xfstests, Jepsen failover and kernel‑build stress suites, proving production‑grade reliability.
Anthropic launched Managed Agents, a hosted service that runs long‑horizon Claude agents via stable, interchangeable components, session logs, harness loops, and sandboxes. By abstracting these layers, the platform avoids pet‑style containers and lets agents survive model upgrades and failures, cutting engineering overhead for autonomous AI workflows.
Chrome 151 ships the declarative <usermedia> element, letting sites request camera and microphone via HTML instead of getUserMedia scripts. It trims code, hardens security, and helps users reclaim denied access, with early trials showing permission recovery rates jump from 10% to over 65% and error reductions for Zoom and Google Meet.
OpenAI is buying Ona, the cloud dev‑environment startup behind Gitpod, to give its Codex agents persistent, secure sandboxes. This moves AI‑assisted work from short, on‑device tasks to multi‑day, production‑grade workflows, letting organizations keep data and execution under their own control.
Current AI models excel at generating isolated code snippets, yet they falter when a solution demands a global view of the program, leading to over‑cautious checks and state explosion. The post argues that programming‑language features that enable reliable local reasoning could give developers the missing global guarantees, a direction worth exploring as AI matures.
At AIE World’s Fair 2026 Geoffrey Litt argued that developers must maintain deep, up‑to‑date understanding of their codebases, or risk becoming passive reviewers to increasingly autonomous AI coding agents. The "understand to participate" framing warns that losing this insight creates cognitive debt and hampers effective collaboration.
Claude Sonnet 5 doubles the context window to 1 million tokens and ups the max output to 128 k, but its new tokenizer inflates token counts by ~30% for English, raising effective cost. Adaptive thinking is now on by default and older sampling knobs are gone, streamlining prompts for developers.
The author pits Fable against ten other LLMs to refactor a sprawling LangGraph "god node", a 350‑line monolith that blocks debugging and testing. By having models propose rewrites and then critique each other's suggestions, the study reveals which models excel as generators versus evaluators, giving concrete guidance for AI‑assisted code cleanup.
Snorkel’s Senior SWE-Bench shifts AI code‑agent evaluation from junior‑level tasks to realistic senior‑engineer challenges, featuring under‑specified feature specs and runtime‑heavy bug fixes. Early results show even top‑tier models miss senior‑grade correctness more than three‑quarters of the time, setting a new performance target for LLM‑driven development.
Meta rewrote its storage stack, adding a tiered caching layer and rebuilding metadata to cut GPU stalls on massive AI models. The new BLOB‑storage architecture runs on top of its Tectonic fabric, delivering faster data ingest and cross‑region access, which slashes training costs and speeds up research cycles.
Zalando’s team rewrote internal routing for its Product Read API with a pure client‑side load balancer that now sustains over a million requests per second. By pulling routing decisions out of Skipper and adding N‑ring fade‑in, occupancy‑based load limits and AZ‑aware latency health, they cut latency, lowered cost and hardened the service against infrastructure failures.
A developer rewrote a production CakePHP app in Laravel just because it felt better, without any business benefit. The piece shows how rewrites erase hidden bug‑fix history, force teams to re‑encounter old issues, and often serve engineer vanity over real value. Skip the rewrite unless the code truly blocks progress.
GitHub faced over 20,000 secret‑scanning alerts across 15,000 repos and reduced them to zero in nine months by triaging noise, focusing on active credentials, and building cross‑team remediation workflows that spanned code, tickets, and bug‑bounty reports. The playbook offers concrete steps any org can adopt to tame secret‑leakage.
As AI agents gain the ability to run code autonomously, they can inadvertently expose private files, fetch malicious content, and exfiltrate data. Sandboxing gives each agent a disposable, isolated environment with strict resource limits, ensuring any code it generates runs safely and disappears afterward.
Subscribe free