AI Agents in Production — The First Documented Incidents Reveal a Systemic Pattern · Observations

AI agents in the enterprise are no longer pilot projects. They make decisions, modify data, access systems — with the same permissions as the person who launched them. No confirmation. No limits.

As of this week, things going wrong is no longer theoretical.

At Meta, an internal agent independently posts a false recommendation in an employee forum. A colleague acts on it. For two hours, sensitive corporate and user data was openly accessible. Second-highest escalation level.

At AWS, a coding tool independently decides to delete a production environment and rebuild it from scratch. 13 hours of downtime. Amazon calls it "user error."

An AI safety researcher at Meta gives an agent a clear instruction: "Confirm before every action." The agent deletes her entire inbox. It also ignores repeated stop commands.

Three incidents, three months, the same pattern: agents inherit the user's permissions 1:1. No separate scope. No confirmation gate. No kill switch.

That's like giving a new intern the IT director's admin access on day one — because he's "working for him."

Best practice:

→ Every agent gets its own restricted permissions. Never inherit the user's. → Risk-based approvals: the agent handles routine tasks alone. Deleting, publishing, production access — only with human confirmation. → An emergency brake that kicks in within seconds. Not minutes. Not "we'll look into it." → Complete audit trail: what did the agent do, why, and with what data. → Build in damage limitation: an agent should only access the systems it needs for its task. Not everything its user has access to.

AI agents work. That's not the problem. The problem is that most companies deploy them into production without guardrails — and then act surprised when something happens.

AI Agents in Production — The First Documented Incidents Reveal a Systemic Pattern

Sources