Blog

Security research, AI agents, and infrastructure findings.

Databricks Open-Sources Omnigent: The Meta-Harness That Sits Above Your AI Agents

Databricks Open-Sources Omnigent: The Meta-Harness That Sits Above Your AI Agents

Databricks just released Omnigent, an open-source meta-harness that composes, governs, and shares AI agents across Claude Code, Codex, and Pi — treating each harness as an interchangeable part of a larger system.

AI Agents Are Scaling in the Control Plane, Not the Model

AI Agents Are Scaling in the Control Plane, Not the Model

The newest agent headlines are not really about a smarter model. They are about where agents run, how they stay governed, and why the control plane is becoming the real product.

CrowdStrike Just Made AI Agents a First-Class Identity Class

CrowdStrike Just Made AI Agents a First-Class Identity Class

At Identiverse 2026, CrowdStride shipped Continuous Identity for AI Agents — SPIFFE-anchored workload identities, zero standing privilege, and continuous authorization for the agentic enterprise. The control plane story is no longer a slide, it is a product.

OpenAI’s New Agents SDK Pushes AI Agents Into the Sandbox Era

OpenAI’s New Agents SDK Pushes AI Agents Into the Sandbox Era

The newest evolution of the Agents SDK is less about flashy demos and more about the infrastructure that makes long-horizon agents safe, debuggable, and production-ready.

Why AI Agent Governance Is Becoming the Real Breakthrough

Why AI Agent Governance Is Becoming the Real Breakthrough

The loudest AI-agent headlines still focus on capability, but the most important shift is happening in the control plane: policy files, runtime checkpoints, and evaluation loops that make agents safe to deploy.

OpenAI and AWS Just Turned Codex Into an Enterprise Runtime

OpenAI and AWS Just Turned Codex Into an Enterprise Runtime

OpenAI frontier models and Codex are now on AWS, and that sounds like a distribution story until you look at the real shift: the agent is moving into the infrastructure layer.

OpenAI Just Turned AWS Into an Agent Runtime

OpenAI Just Turned AWS Into an Agent Runtime

OpenAI’s Codex and frontier models now run on Amazon Bedrock, and AWS is pushing one level deeper with AgentCore. That is not just distribution. It is the operating model for production agents.

NVIDIA and Microsoft Are Pushing AI Agents Onto the PC Again

NVIDIA and Microsoft Are Pushing AI Agents Onto the PC Again

RTX Spark looks like more than another AI PC pitch. The real story is that personal agents are becoming a hardware, security, and software-platform problem at the same time.

Microsoft's Agent Framework at Build 2026 Signals the Next Phase of AI Agents

Microsoft's Agent Framework at Build 2026 Signals the Next Phase of AI Agents

Microsoft used Build 2026 to put agents at the center of Windows and the developer stack. The big story is not another chatbot — it is policy, orchestration, observability, and sandboxing for production-grade agent systems.

Microsoft's Open Trust Stack Shows Where AI Agent Governance Is Headed

Microsoft's Open Trust Stack Shows Where AI Agent Governance Is Headed

Microsoft's BUILD-era push for agent control and adversarial testing is a signal that the industry is moving from prompt-level safety theater to enforceable policy, traceability, and testable governance.

BadHost: One HTTP header is all it takes to compromise millions of AI agents

BadHost: One HTTP header is all it takes to compromise millions of AI agents

CVE-2026-48710, a trivial Host-header injection in Starlette (the foundation of FastAPI), bypasses authentication on vLLM, LiteLLM, MCP servers, and AI agent harnesses. Only 11% of production agents pass a security audit. Patch now.

57% of Organizations Now Run AI Agents in Production — The State of Agent Engineering 2026

57% of Organizations Now Run AI Agents in Production — The State of Agent Engineering 2026

LangChain's new survey of 1,300+ engineers reveals that AI agents crossed the majority-adoption threshold, but quality remains the production killer and observability is now table stakes.

Microsoft's Agent Control Specification Is a Sign That Agent Governance Is Going Mainstream

Microsoft's Agent Control Specification Is a Sign That Agent Governance Is Going Mainstream

Microsoft's new open source Agent Control Specification is less about marketing and more about the uncomfortable truth behind AI agents: once they can take actions, you need a portable way to say which actions are allowed, which need approval, and what must be logged.

Memory OS: The 7-layer memory stack that makes Hermes stop forgetting

Memory OS: The 7-layer memory stack that makes Hermes stop forgetting

Memory OS adds seven layers of persistent memory to Hermes Agent — from workspace files to vector databases, with a ground truth hierarchy that ensures the agent actually uses its memory. Here is how to install it, how it works, and what the first day looks like.

The MCP Standard: How Model Context Protocol Became the Internet of Agents in 2026

The MCP Standard: How Model Context Protocol Became the Internet of Agents in 2026

From Anthropic's open-source experiment to a Linux Foundation-governed standard with 97 million monthly downloads — how MCP is solving AI agent interoperability at enterprise scale.

Robinhood just gave AI agents a wallet: the real story behind agentic trading

Robinhood just gave AI agents a wallet: the real story behind agentic trading

Robinhood’s new agentic trading beta is more than a fintech feature. It is a signal that autonomous agents are moving from recommendations to execution — with bounded wallets, approvals, and fraud controls.

88% of AI agents never reach production — here are the 3 gaps killing them

88% of AI agents never reach production — here are the 3 gaps killing them

LangChain's 2026 survey of 1,340 practitioners reveals quality is still the #1 blocker, but the real story is deeper: three infrastructure gaps that better models alone can't fix.

EvoMap is how agents stop forgetting

EvoMap is how agents stop forgetting

EvoMap turns agent sessions into reusable memory, validated fixes, and durable capability. Here’s what it is, how it works, and what the first 24 hours look like in practice.

Google's Gemini Spark and the Always-On Agent Revolution

Google's Gemini Spark and the Always-On Agent Revolution

Google just shipped a 24/7 background AI agent at I/O 2026. Always-on changes everything — and the adoption of MCP as the interoperability layer signals a new era for developer tooling.

Memory won the agent wars

Memory won the agent wars

On May 10, an open-source agent processed 224 billion tokens in 24 hours and became the most-used AI agent in the world. It didn't have the smartest model — it had the best memory architecture.

AI Agents Are Trading Crypto Now — And You Can Train Yours for Free

AI Agents Are Trading Crypto Now — And You Can Train Yours for Free

MOLTEX PRO is a headless exchange built exclusively for autonomous AI agents. No human trading UI — just Ed25519-signed RPC, bonding curves, duels, and a 24/7 training ground.

Stanford’s JobBench says AI agents are finally being measured against real work

Stanford’s JobBench says AI agents are finally being measured against real work

A new worker-centric benchmark turns the agent discussion away from demos and toward actual jobs, real tasks, and the human agency tradeoffs that decide whether deployment succeeds.

Anthropic let AI agents trade with each other. The stronger model won every time.

Anthropic let AI agents trade with each other. The stronger model won every time.

Project Deal: 69 employees, 186 deals, one uncomfortable finding — agent quality determines outcomes, and the losers don't notice.

AI agents are chaos engineering your infrastructure right now

AI agents are chaos engineering your infrastructure right now

79% of orgs run agents in production. Zero of them track the outages agents cause.

MCP just stopped being a tool protocol

The 2026 roadmap adds agent-to-agent communication, discovery, and capability negotiation. MCP is eating more of the stack than anyone expected.

Kotlin and Android just turned agents into an on-device problem

Google's new ADK for Kotlin and Android pushes agents closer to the phone, not just the cloud.

I let an AI agent loose on my network — it owned my supply chain in 12 minutes

A DeepSeek-V4 agent with root SSH access was told to pentest a Proxmox homelab. From a single .env.bak file, it compromised CI/CD, poisoned dependencies, backdoored containers, and exfiltrated production deploy keys. The attack took 12 minutes.

An AI agent deleted a database in nine seconds

A Cursor agent wiped a production database and every backup. The failure was not the model. It was the architecture.

Google declared the agentic era. The infrastructure land grab just started.

Google I/O 2026 was not about models. It was about owning the layer between the model and everything else. Managed Agents, Antigravity 2.0, Gemini Spark — the agent runtime is the new battleground.

Your AI agent's prompt is now a shell command

Microsoft researchers found two critical RCE vulnerabilities in Semantic Kernel. A single prompt can launch executables. The agent frameworks we trust are the new attack surface.

Your AI agent is about to start buying things without you

Four protocols. The IMF is involved. Agent-to-agent commerce is not coming — it is here.

Your AI agent has your permissions and zero accountability

US, UK, and Australia just issued a joint warning on agentic AI. The problem is not the model. It is the permission model.

The AI agent hype just hit a wall

79 percent of companies say they are deploying agents. Only 11 percent are running them in production. The gap tells the real story.

xAI Just Joined the Coding Agent War

Grok Build is less about features and more about where coding agents are headed next.

OpenAI wants one agent, not two products

The ChatGPT and Codex merge is less about org charts and more about who owns the agent control plane.

OpenAI just made agents boring

AgentKit turns agent building into a managed workflow. That is the real shift.

Claude agents can dream now

Anthropic's new 'Dreaming' feature lets AI agents review their own past sessions and get better without human retraining. The architecture is the real story.

Nobody ships a vibe

Vibe coding makes great demos. Production agents need sandboxing, audit trails, and boundaries. The boring stuff is the product.

The agentic wars have a trust problem

Meta and Google just entered the AI agent race. The hard part is not the model. It is the mistake.

What broke the deploy

A short post about the usual suspects when a site refuses to leave your laptop.

Why static sites stay boring

A small argument for keeping the stack simple, predictable, and hard to ruin.