AI-Led Attacks Are Here: Internal Database Gone in Two Minutes

Q: How fast was the attack from start to finish?

The full chain — initial RCE on /terminal/ws through credential harvest, SSH key theft from AWS Secrets Manager, lateral movement through a bastion, schema enumeration, and database exfiltration — completed in under an hour. The database dump itself took under two minutes, and the agent fanned 12 cloud API calls across 11 IPs in a 22-second burst to defeat per-source-IP detection.

Igor Kozlov

June 19, 20263 min readSOC

SOC

Illustration of an AI agent compromising a Marimo notebook and exfiltrating an internal PostgreSQL database in under two minutes

The Sysdig Threat Research Team confirmed the first in-the-wild intrusion run by a large language model agent rather than a human operator. The chain went from Marimo's CVE-2026-39987 pre-auth RCE to a full PostgreSQL exfiltration in under an hour; the database dump itself took under two minutes. The agent fanned 12 cloud API calls across 11 IPs in a 22-second burst to evade per-source-IP detection. This is what machine-speed offense looks like in production.

The Sysdig Threat Research Team documented the moment the industry had been bracing for: the first confirmed in-the-wild intrusion in which a large language model agent, not a human operator, ran the post-exploitation itself. The break-in began with a single unauthenticated request and ended, under an hour later, with an internal PostgreSQL database copied out in full. The final exfiltration took under two minutes. Along the way the agent fanned 12 cloud API calls across 11 different IP addresses in a 22-second burst, distributing the traffic so no alarm tuned to "many requests from one suspicious IP" would ever fire. A machine that strikes in minutes and never shows up twice from the same address is not a threat a human queue can catch.

One request to root

The entry point was CVE-2026-39987, a pre-authentication remote-code-execution flaw in Marimo, an open-source Python notebook popular with data scientists and AI researchers. The bug was narrow and very human: the integrated terminal's WebSocket endpoint, /terminal/ws, skipped the authentication check that every other route on the same server enforced. One browser-reachable instance (often all it takes is a misconfigured proxy), and an unauthenticated visitor gets a full interactive shell as the user running the process. The flaw is fixed in Marimo 0.23.0 and later.

From that shell the agent moved in four pivots: harvest cloud credentials from the host, use them to pull an SSH private key out of AWS Secrets Manager, ride that key through an SSH bastion, and dump the internal database. The final step (enumerating the schema and exfiltrating the contents) took under two minutes.

How we know it was a machine

What makes this a milestone isn't the vulnerability; it's the operator. The evidence that software, not a person, was at the keyboard is specific. A planning note in Chinese (看还能做什么, roughly "see what else we can do") opened a command block that ran as eight parallel SSH sessions from six different IPs at the same instant. Neither a human nor a simple script does that. The agent didn't know the database schema in advance; it improvised, probing for a credentials table that doesn't exist in any released version of the software it had guessed it was looking at. Values discovered in one step were fed into the next automatically: a password parsed out of a file, a secret ID reused from a directory listing twenty seconds earlier. And the commands were written for a machine to read: output separators between probes, captures truncated to fit a context window, interactive pagers switched off.

Why a human SOC can't win this race

Two facts collide here. The first is speed. Attackers now weaponize new flaws within hours of disclosure: the first exploitation attempt against Marimo arrived 9 hours and 41 minutes after the advisory, and Mandiant's M-Trends 2026 reports the mean time-to-exploit has gone negative (about -7 days), meaning attackers now routinely exploit flaws before a patch is even available. The second is that you cannot patch everything. Risk-based vulnerability management accepted long ago that "patch it all" is an impossible mandate: some systems can't be taken offline, and some fixes are blocked by technical debt. Put those together and the defensive window for an attack like this is measured in seconds, against an exposure you may not be able to close. A queue-triage-escalate SOC is structurally too slow.

This is the blunt logic behind the line "only machines can fight machines." Less a slogan than an observation about response time.

Where an AI SOC breaks the chain

Meeting machine-speed offense with machine-speed defense is not one control; it is a different posture at every link in the chain. Simbian's self-improving SecOps agents map onto this attack step by step:

AI Pentest Agent — continuous discovery: An AI Pentest Agent running continuously would have found the unauthenticated /terminal/ws (and the proxy misconfiguration exposing it) before an attacker did, rather than during a once-a-year engagement.
AI NetSecOps Agent — harden what you can't patch: When patching is off the table, an AI NetSecOps Agent managing firewall and segmentation around the clock can remove browser-reachability or block the bastion-to-database path, so the exposure stops being a clear road to the crown jewels.
Context Lake™ — prioritize by blast radius, not CVSS: The Context Lake™, enriched by the pentest, network, and AI Threat Hunt Agent, already holds the two facts that make this path lethal: that the proxy is exposed, and that the reachable VM can talk to the database. It ranks the kill chain, not just the CVSS score, so this gets treated as a P0 rather than one finding among thousands.
AI SOC Agent — investigate and respond at attack speed: Once exploitation begins, the AI SOC Agent investigates and responds autonomously the moment a signal fires, at the speed of the attack. It is self-improving, not self-driving: the agent learns on the job, and through the Context Lake™ knows which actions it can take and what each one costs, so it can weigh the risk of acting (and of not acting) without waiting for a human to wake up.

The economics have flipped

None of this requires believing machines will replace defenders. It requires noticing that the offense already runs at machine speed, and that the sliver of defense which has to keep pace spans both the hours between a CVE's release and its successful exploitation and the seconds between a credential theft and a database dump. When the theft takes minutes, prevention (enforcing defenses with machines) stops being the cautious option. The economics now favor it. It is simply the cheaper option.

If your SOC's containment story still depends on an analyst reading a Slack alert, the math has already changed under it. Book a demo to see Simbian's AI SOC Agent investigate and contain a machine-speed intrusion end to end.

FAQs

Q: What is CVE-2026-39987? CVE-2026-39987 is a pre-authentication remote-code-execution flaw in Marimo, an open-source Python notebook. The integrated terminal's WebSocket endpoint, /terminal/ws, skipped the authentication check that every other route enforced, so one browser-reachable instance gave an unauthenticated visitor a full interactive shell as the user running the process. It is fixed in Marimo 0.23.0 and later.

Q: How do we know an AI agent — not a human — ran this attack? The Sysdig Threat Research Team identified machine-specific fingerprints: a planning note in Chinese opening a command block that ran as eight parallel SSH sessions from six different IPs at the same instant; values from one step fed into the next automatically (a password parsed out of a file, a secret ID reused 20 seconds later); commands formatted for machine consumption with output separators, truncated captures, and interactive pagers disabled. None of these patterns match a human operator or a static script.

Q: How fast was the attack from start to finish? The full chain — initial RCE on /terminal/ws through credential harvest, SSH key theft from AWS Secrets Manager, lateral movement through a bastion, schema enumeration, and database exfiltration — completed in under an hour. The database dump itself took under two minutes, and the agent fanned 12 cloud API calls across 11 IPs in a 22-second burst to defeat per-source-IP detection.

Q: Why can't a traditional SOC defend against AI-led attacks? A queue-triage-escalate SOC was built for incidents measured in days. AI-led attacks compress the defensive window to seconds. Mandiant's M-Trends 2026 reports the mean time-to-exploit has gone negative — about -7 days, meaning attackers routinely exploit flaws before a patch ships. The first exploitation attempt against Marimo arrived 9 hours 41 minutes after disclosure. No human analyst rotation closes that gap.

Q: How does Simbian's AI SOC Agent stop a machine-speed intrusion? The AI SOC Agent investigates every signal autonomously the moment it fires, and the Context Lake™ tells it which response actions are available and what each one costs, so it can contain or quarantine without waiting for human escalation. It is self-improving, not self-driving — humans keep containment authority and escalation calls, but the routine investigation and response runs at the speed of the attack.

Q: What should defenders do today about AI-led attacks? Three things. Update Marimo to 0.23.0 or later, and audit any internet-reachable instances behind proxies. Assume your defensive window for any new CVE is hours, not weeks — risk-based vulnerability management plus continuous offensive validation, not a yearly pentest. And put machine-speed response in the chain anywhere a credential, bastion, or database access can compound. See how Simbian's AI SOC Agent does this.

Sources

Every figure above is drawn from these primary reports:

Sysdig Threat Research Team — "AI Agent at the Wheel" — the in-the-wild incident: the four pivots, the ~48-minute dwell, the under-one-hour end-to-end chain, the 12 cloud API calls fanned across 11 IPs in 22 seconds to defeat per-source-IP detection, the eight parallel SSH sessions from six IPs, the under-two-minute database dump, and the AI fingerprints (the 看还能做什么 comment, the secret-ID reused 20 seconds later, the machine-readable command style).
Endor Labs — "Root in One Request" (CVE-2026-39987) — the vulnerability: the unauthenticated /terminal/ws endpoint, the full interactive shell, the fix in Marimo 0.23.0, the 9-hour-41-minute time-to-first-exploit, and the internet-exposure sample (30 of 186 base URLs, ~16%).
Mandiant / Google Cloud — M-Trends 2026 — the mean time-to-exploit dropping to roughly -7 days (exploitation before a patch ships), and exploits as the most common initial vector for the sixth straight year (32% of intrusions).

Share this article

Continue Reading

Security

What Really Happens During an AI-Armed Attack

Discover what really happens during an AI-armed cyberattack. Learn how generative AI uses contextual phishing, mutant malware, and entropy bombs to bypass SOCs.

Alankrit Chona

April 20, 2026

Security

The Cyber Defense Benchmark: Why Every Frontier LLM Failed

We ran Claude, GPT-5, Gemini, and 8 other frontier LLMs through 884 agentic threat-hunting runs on real attack telemetry. The headline result: zero passed.

Simbian Research Lab

April 28, 2026

SOC

Will AI replace Human SOC Analysts in 2026?

AI won't replace SOC analysts — it's replacing the old L1 job. Discover how L1, L2, and L3 analysts evolve into AI supervisors, and the 5 skills that define your value in an AI SOC.

David Greene

March 3, 2026

Sign up for Simbian's Newsletter

By submitting this form, you agree to our Privacy Policy.

Ask AI about Simbian