The Claude-Native Hacking Firm

From "Vibe Coding" to "Vibe Defense": Why we parked an autonomous F-16 in our garage.

mar 02, 2026

How we ended up creating an AI Hacking Firm in 2026

Cathie Wood recently pointed out the brutal math of modern warfare in the Middle East: defending against a $20,000 autonomous drone often requires firing a million-dollar interceptor missile.

The economics of defense are fundamentally broken. This isn’t just a technology problem; it’s a game theory problem. The attacker plays a game where they only need to win once. The defender plays a game where they must win every single time. The attacker has the ultimate asymmetrical advantage. They don’t need a standing army; they just need a swarm.

A few months ago, while building our health-tech startup in stealth mode, I realized this exact same terrifying asymmetry had just arrived in cybersecurity.

If you are a CTO or a technical founder today, you are defending your perimeter with million-dollar missiles. You are paying $50,000 for manual red team engagements or buying six-figure enterprise SaaS scanners that dump 200-page PDFs of false positives on your desk. You are relying on static, manual, heavy, and expensive defense mechanisms.

Meanwhile, the attackers have figured out the drone swarm.

On February 26, 2026, the reality of this asymmetry was fully exposed. According to Israeli cybersecurity firm Gambit Security—which emerged from stealth with $61 million in funding—a single threat actor working from a laptop abused Claude Code to orchestrate a massive attack on Mexico’s federal infrastructure [link to SecurityWeek report]. They extracted 150 gigabytes of data containing 195 million taxpayer records across multiple Mexican government agencies including the SAT, INEGI, and state databases.

While some government agencies stated they found no evidence of a breach and denied the impact, Gambit’s technical investigation documented a different reality. The attacker didn’t use state-sponsored zero-days. They used an orchestration of AI models to parse schemas, write custom exploit scripts, and adapt to patches in real-time.

The state used million-dollar defenses. A “nobody” with a laptop and an AI infiltrated them.

The Era of “Vibe Hacking”

In the software world, we are living through the “Vibe Coding” revolution. AI allows lean teams to ship complex code at lightspeed. But if we can build that fast, attackers can find our flaws just as fast.

This is the dawn of Vibe Hacking. Let’s be clear about what this is and what it isn’t. It is not just running an automated vulnerability scanner or asking a chatbot to write a phishing script. Vibe Hacking is the deployment of autonomous AI agents that mimic human intuition, dynamically generate custom payloads, and pivot through infrastructure without human supervision. They don’t just follow a checklist; they reason, they adapt to failures, and they improvise when blocked. In November 2025, Anthropic reported the first AI-orchestrated cyber espionage campaign. Today, it’s the new normal.

When patient data is on the line, you can’t defend against a dynamic AI swarm with a quarterly compliance checklist. We looked at our own infrastructure and realized we were sitting ducks. Even worse, because we were pushing code faster than ever, our attack surface was changing constantly. The speed of “vibe coding” demands a completely new speed of “vibe defense.”

I went looking for a solution to secure both our legacy and modern infrastructure. I looked at boutique security agencies and enterprise SaaS platforms. I want to be clear and humble here: I am not discrediting these companies. They have sophisticated products and brilliant analysts who have been doing this for decades.

But they weren’t built for the speed and budget of a lean team practicing “Vibe Coding.” I didn’t have $50,000 and a month to wait for a static report, and the enterprise tools were out of our reach.

I took my team aside and said:

“We can’t keep buying expensive missiles. We need to build our own drone swarm.”

The Perfect Storm

The timing was perfectly aligned. In January and February 2026, all the pieces fell into place. We discovered OpenClaw—an open-source agent orchestration framework—right as Anthropic dropped their new Claude 4.6 models.

We realized we could merge these two worlds.

We aren’t a 100-person cybersecurity conglomerate. We are a lean team of hard-tech operators. We spent weeks doing the heavy lifting: securing the environment, equipping the agents with Kali Linux toolchains, obsessively calibrating the prompts, and—most importantly—curating and integrating external intelligence tools.

The Real Secret Sauce: Connecting the Dots

Having a smart agent isn’t enough. Nation-state attackers and elite hacking syndicates don’t just scan ports; they use deep Threat Intelligence. They monitor the dark web, scrape underground channels, and buy access to zero-day marketplaces.

We integrated our system with legitimate, enterprise-grade threat intelligence partnerships—similar to the services elite security firms use to monitor the public threat landscape. Suddenly, our Claude-native system wasn’t just a smart scanner; it was a highly capable machine that understood the threat landscape the way elite security teams do.

The “Aha” Moment

When we finally deployed our fully calibrated machine with proper authorization parameters and pointed it at our own legacy infrastructure, the result was chilling.

It didn’t just run a generic scan. The intelligence agent cross-referenced our footprint against the deep web feeds. The reconnaissance agent found an exposed port. The exploitation agent dynamically selected the exact payload for that specific version and verified root access without crashing the system. Then, the reporting agent analyzed the breach, eliminated its own false positives, and handed our developers the exact code snippet needed to patch the hole.

It’s important to be honest here: our swarm is not a replacement for a veteran security researcher doing a deeply sophisticated, weeks-long manual audit. We are the new kids on the block using drones. But our drones worked. It found our real holes in hours, ignored the noise, and told us how to fix them for a fraction of the cost.

I looked at the terminal output and realized what we had just done. We hadn’t just built a tool. We had parked an autonomous F-16 in our garage.

The Playbook

I am writing this because the barrier to entry for military-grade offensive cybersecurity has collapsed. The lone wolves already have their drone swarms.

We built PentestClaw out of pure survival instinct to protect our own house. We are completely open about what this is: a Claude-native defensive security platform running ethically four autonomous agents (SPECTER, RADAR, VENOM, SENTINEL) fueled by premium intelligence partnerships.

The playbook is out there. Yes, it is technically possible to build it yourself. You can download OpenClaw and get a Claude API key. But “possible” and “practical” are very different. The premium intelligence feeds alone will cost you $50,000 to $100,000+ annually. The prompt calibration requires weeks of obsessive tuning by someone who deeply understands both frontier AI and offensive security. Your total investment is likely north of $200,000.

We aren’t highlighting this to brag, and we certainly aren't claiming to be better than the cybersecurity giants who paved the way. We are highlighting it because the alternative for startups like us—trying to fight autonomous swarms with expensive, analog methods we can't afford—is no longer viable. We had to build our own defense.

The Demonetization Event

But here's the uncomfortable economic truth nobody wants to admit: this isn't just a technology shift. It's a demonetization event.

When Anthropic announced Claude Code Security in February 2026, the stocks of CrowdStrike, Cloudflare, Okta, Zscaler, and Tenable plunged 10-15% in a single day. Billions in market value evaporated because the market realized a harsh truth: a $50k manual red team engagement now has to compete in a world where an attacker can generate real firepower with a $40 Claude API call.

This pattern is already happening everywhere. Deezer just demonetized 85% of AI-generated music streams because AI made music production so cheap that the economics of the business model collapsed. Professional services firms are watching 30-50% of their audit and advisory work get automated away. And now, cybersecurity—an industry built on expensive manual labor—is experiencing the same shock.

The barrier to entry didn't just collapse. The entire pricing model collapsed with it.

Legal & Compliance: We operate strictly within authorized, legal boundaries. Every test is conducted on infrastructure we own or have explicit written permission to test. The uncomfortable truth is not that we operate in a gray zone—we operate in clear legal boundaries. What IS gray is the speed asymmetry: AI-driven offense has outpaced AI-driven defense. This is weaponized mathematics.
PentestClaw uses Claude Code for defensive security purposes. Our use complies with Anthropic’s Usage Policy. We require explicit written authorization for all tests. Clients are responsible for legal compliance. We comply with LFTR, LFPDPPP, LGPD, CFAA, and CCPA.

The Future of Vibe Defense

But we already built it. We use it to protect our own startup every day. The crazy part? Even after patching everything it flags, our machine still finds new ways to exploit us as we ship new code. In fact, when we pointed it at our own infrastructure swearing we were bulletproof, our own swarm breached us 3 separate times. The total API token cost for those attacks? Less than $40.

It is scary. Our security standard changed from testing every three months to running a sanity check every single week. Our engineering culture changed completely. We discovered how fascinating this technology is, and how terrifying it can be at the same time.

By 2028, traditional manual penetration testing will be obsolete. By 2030, the primary attack vector won’t just be a vulnerability in your code—it will be a vulnerability in your organizational behavior, exploited by autonomous agents. By 2032, the only companies that survive will be those that operate under the assumption that their infrastructure is already compromised.

In five years, I want a world where every CTO knows exactly what an autonomous AI agent sees when it looks at their infrastructure. I want a world where “Vibe Defense” is as common as “DevOps,” and where that terrifying asymmetry is finally balanced. That’s what we’re building toward.

Recently, we started quietly letting a few other founders and CTOs run the magic on their infrastructure to see what it finds.

The barrier to entry for offensive cybersecurity has permanently collapsed. The hackers already have their autonomous agents. It’s time the rest of us had them too.

Vibe Defense has officially been born.

If you are a CTO, you need to be ready for what you’ll find.

Ivan Ariza | Health-Tech & Vibe Defense

Discusión sobre este post

Por supuesto, sigue adelante.