An estimated 89% of developers now use AI coding tools in their workflow. The same research shows that 45% of the code those tools produce contains OWASP Top 10 vulnerabilities, and that number hasn't improved in over two years. Welcome to the vibe coding security crisis.

Remember when "move fast and break things" was controversial? Vibe coding, the practice of describing software in natural language and letting an AI generate the code, has accelerated development speeds beyond anything the industry has seen. It has also introduced a class of security vulnerabilities that most organisations aren't equipped to handle.

This article breaks down the real data on vibe coding security risks, explains why AI-generated code consistently fails security benchmarks, and provides a practical framework for making vibe-coded applications production-ready.

Let's talk Build with confidence

Ready to transform your business?

Tell us what's blocked, what you're building, or where AI is slowing you down. We reply within one working day.

We reply within one working day.

The Scale of the Problem

The data on AI-generated code security is clear. A thorough 2026 study titled "Is Vibe Coding Safe?" (arXiv:2512.03262) evaluated multiple coding agents against a benchmark of 200 real-world engineering tasks. The finding? Only 10.5% of agent-generated code was secure, despite a 61% functional correctness rate. The paper's authors noted that "all agents perform poorly in terms of software security" and that "our findings raise serious concerns about the widespread adoption of vibe-coding, particularly in security-sensitive applications."

This isn't an isolated finding. Veracode's 2026 GenAI in Software Security report tested over 100 different large language models against OWASP Top 10 benchmarks and found that 45% of AI-generated code contained exploitable vulnerabilities, a failure rate that remained flat throughout 2025 and into 2026, despite rapid improvements in model capabilities.

The gap between what developers *think* and what *actually happens* is particularly troubling. Snyk's 2025 research found that 80% of developers believe AI-generated code is more secure than human-written code, a dangerous perception gap that directly contradicts every security benchmark in the literature.

Why Vibe-Coded Code Is Insecure

Understanding why AI-generated code is vulnerable requires looking at the underlying mechanics. Large language models are trained to produce *statistically likely* code, not *secure* code. This difference drives several recurring failure modes.

Training Data Amplifies Insecure Patterns

AI models are trained on public code repositories, including all the insecure patterns, bad practices, and known vulnerabilities present in that data. A 2023 study published at USENIX Security titled "Lost at C" (Sandoval et al.) demonstrated that developers using AI assistants actually *submitted less secure code* than those writing code manually, while simultaneously being *more confident* in its security. The AI amplifies existing insecure patterns rather than correcting them.

Functional Correctness ≠ Security

The SUSVIBES benchmark revealed the difference: code that passes functional tests can still be insecure. An AI-generated web application might render perfectly, accept user input, and store data, while being vulnerable to SQL injection, cross-site scripting, and insecure direct object references. The model optimises for what it can measure (does the function return the right output?) not what it can't (is this endpoint properly rate-limited and sanitised?).

Hallucinated and Weaponised Dependencies

One of the most dangerous failure modes of AI code generation is package hallucination. Research presented at USENIX Security 2025 ("We Have a Package for You!", Spracklen et al.) found that AI models hallucinate package names at high rates, suggesting dependencies that don't exist. A separate study (arXiv:2410.06462) demonstrated that these hallucinations can be weaponised: attackers can register the hallucinated package names on public repositories, and AI-driven developer tools will automatically install them.

The Lasso Security research team found that 34% of AI-suggested package names are hallucinations, more than one in three. This creates a software supply chain attack vector that didn't exist before AI coding tools became widespread.

Real Incidents and Documented Exploits

The theoretical risks have already materialised. By March 2026, Georgia Tech's Vibe Security Radar had catalogued 74 confirmed CVEs attributed to AI coding tools, with estimates suggesting 400-700 actual exploitable vulnerabilities in observable repositories. The rate of disclosure is accelerating, March 2026 alone saw 35 new AI-related CVEs, a six-fold increase over two months.

Notable incidents include:

Why Traditional Security Approaches Fall Short

Conventional application security relies on predictable patterns: known vulnerability classes, established secure coding standards, and human review processes. Vibe coding breaks all three assumptions.

Code review breaks down when the reviewer doesn't fully understand the generated code. A 2025 study by researchers at Stanford found that developers using AI assistants accepted 39% of insecure code suggestions, because the generated code looked plausible, and the cognitive effort to fully audit it was higher than writing it from scratch.

Static analysis tools trained on human-written code miss AI-specific vulnerability patterns. Traditional SAST scanners detect SQL injection in hand-written code but may not detect the same vulnerability when it's generated by a different distribution of patterns.

The speed mismatch is perhaps the hardest problem. AI can generate an entire application in hours. Security review cycles that take weeks simply can't keep pace. The result is a growing backlog of vibe-coded applications deployed without meaningful security review.

Making Vibe-Coded Applications Production-Ready

None of this means we should abandon AI-assisted development. The productivity gains are real and game changing. But organisations need a structured approach to making vibe-coded code production-ready.

1. Treat All AI-Generated Code as Draft Code

The single most important mindset shift is treating AI output as a first draft, not a finished product. Every function, every endpoint, every database query generated by an AI should be subject to the same security review standards as code written by a junior developer. This is not about distrust; it's about compensating for the model's known weaknesses.

2. Implement AI-Specific Security Testing

Standard security testing needs to be supplemented with AI-specific checks:

3. Establish a Human-in-the-Loop Security Gate

Automated testing catches known patterns. Human review catches everything else. The most effective model is a staged review process where AI-generated code passes through automated security gates, then receives targeted human review focused on:

4. Pin Dependencies and Verify Supply Chains

The hallucinated dependency problem requires a structural solution. All dependencies suggested by AI tools should be pinned to specific versions and verified against package provenance data. Organisations should maintain an internal allowlist of approved packages that AI tools are permitted to reference.

5. Invest in AI Governance

Organisations that succeed with AI-assisted development don't leave it to individual developers to make security decisions. They implement AI governance frameworks that define:

The Path Forward

The vibe coding revolution isn't going away, and it shouldn't. The ability to describe software in natural language and have functional code appear is game changing. But the security industry is in a race against time: AI code generation is growing exponentially, the vulnerability rate isn't improving, and the attack surface expands with every vibe-coded application deployed to production.

The organisations that will thrive in this new paradigm aren't the ones that ban AI coding tools. They're the ones that implement the governance, testing, and review processes necessary to make AI-generated code production-ready.

At Confuse The Machine, we've spent years working at the intersection of AI development and security. We understand how these models think, where they fail, and how to build safety into AI-assisted workflows, not as an afterthought, but as a fundamental part of the process.