ResearchApril 20268 min read

50 sites, 444 findings: the 2026 state of shipping-fast security

TL;DR

Two failure modes, one web. Vibe-coded AI apps fail at the client/server boundary — keys in bundles, RLS misconfigured, client-only auth. Legacy WordPress fails at the platform-default level — exposed login pages, version disclosure, enumerable users.
Same outcome for the user. Different root cause, same customer-data leak. 88% SHIP / 12% BLOCK, average 10.8 findings per site, 444 total across 41 scored sites.
Security belongs in CI, not in a post-launch audit. Teams that treat security as a ship gate catch the 30-point drop before merge. Teams that treat it as an occasional audit read about their incident on HN.

There are roughly two ways a web app ships in April 2026.

The first is "vibe-coded": a developer — sometimes not a developer at all — types a prompt into an AI-coding IDE, and a Next.js app lands on a modern serverless host 40 minutes later with a working managed-backend (Supabase, Firebase, or similar) attached. The second is the long tail: a five-year-old WordPress site on cheap shared hosting that has not been touched since someone's nephew built it. Both are real. Both are in our scan queue every day.

We took 50 production URLs covering both cohorts plus a handful of cloud-platform and auth-SaaS homepages, pointed our engine at them, and wrote down exactly what came back. The numbers below are real. We are not naming sites — the patterns matter more than the brands, and responsible disclosure matters more than clickbait.

The numbers

sites submitted

scored (9 WAF-held)

got BLOCK

444

total findings

On the 41 sites that scored: 88% SHIP, 12% BLOCK. Average score 65.6, median 76. Lowest 0, highest 98. Every single site had findings — the cleanest had 2, the worst had 24, mean was 10.8 per scan.

Grade (auto-computed from score)	Sites	Share of scored
A (90–100)	5	12%
B (75–89)	16	39%
C (60–74)	11	27%
D (40–59)	4	10%
F (0–39) — BLOCK	5	12%

A word on the 9 that didn't score: every one of them bounced the scanner with a WAF challenge (Cloudflare Turnstile, Kasada, or a tenant-specific anti-bot layer). Those are all well-known SaaS domains that have invested heavily in bot protection. The scanner stopped, logged "target unreachable — WAF/anti-bot respected," and emitted no score. We treat that as a positive signal about the target. Sites that can detect and block our scanner are doing real defensive work. We never fabricate a number for a site we cannot see.

Vibe-coded AI-built apps: the surface looks fine, the machinery often does not

About two-thirds of our batch were AI-built developer tools and infrastructure products. These sites are almost uniformly hosted on modern managed hosting providers, use Next.js or Remix, and inherit baseline security headers from the platform. That platform floor is real, and it shows up in the data: average score for this cohort sat above 70, and a clear majority shipped with SHIP verdict.

The failure mode is not missing security headers. It is what happens inside the application once you look past them.

Client-side keys in production bundles. We consistently flagged API-key-shaped strings in /_next/static/chunks/*.js. Some are public-by-design (Google Analytics, Algolia search keys), but a non-trivial share of them are AI-provider keys that the developer put into a NEXT_PUBLIC_* variable to "make it work from the frontend." A single leaked OpenAI or Anthropic key, without a spend cap set in the provider dashboard, is a five-figure problem by the second morning.
Source maps in production. Next.js ships source maps by default. Any visitor can recover near-complete original TypeScript from /_next/static/chunks/*.js.map. Fixable with one line in next.config.ts. Rare to see it set.
Client-side auth guards with no server enforcement. Multiple apps in our batch had routes that router.push("/login") when a local session check failed, while the underlying API endpoint had no auth middleware at all. Two browser dev-tools clicks bypass this pattern.
Open infrastructure on adjacent ports. Our network provider probes 23 common service ports (Postgres, Redis, Elasticsearch, Docker API, kubelet, etcd, …) and flags any that respond from the public internet. We found a small but real number of these — usually Redis or PostgreSQL left on the default port without a firewall because the deploy template skipped that step.

None of this is new in 2026 as a class of problem — people have been leaking keys in JS bundles for a decade. What is new is the volume. When anyone can ship a Next.js + Supabase app in under an hour without reading a single line of the generated code, the average production app gets further from "somebody checked the security posture" every quarter. The floor is rising (platforms give you HSTS and a reasonable CSP for free). The ceiling of what teams actually verify before hitting deploy is not.

Example report — sensitive fields redacted

https://ontusti●●●●●●●●●.kz

Basic scan · 20 findings · 2m 28s

BLOCK

CRITICAL WordPress brute-force attack chain: admin username + exposed login
HIGH WordPress user enumeration (slug: ●●●●●●●●)
HIGH Generic API key in client bundle (value: AIzaSy●●●●●●●●●●●●●)
MEDIUM WordPress login page publicly accessible (/wp-login.php → 200 OK)
MEDIUM WordPress version 6.9.4 disclosed in HTML
MEDIUM No Content-Security-Policy header

Legacy WordPress on cheap shared hosting. Only the hostname and the leaked admin slug are redacted — the CRITICAL chain and the verdict are the whole point of the picture.

Legacy CMS: "SHIP" can hide a ready-made brute-force target

A smaller share of our batch were small-business WordPress sites deployed on cheap shared hosting. The pattern here is completely different from the vibe-coded cohort, and more sobering.

On one site in particular (anonymized), three things were true simultaneously:

WordPress version disclosed in the HTML via the default <meta name="generator">. Anyone can look up CVEs for that specific version.
The admin username was publicly enumerable via the /?author=1 permalink trick — a decade-old WordPress issue. The site redirected /?author=1 straight to /author/<real-admin-slug>/, giving away the exact login name.
/wp-login.php was publicly accessible, with no IP allowlist, no 2FA, no rate limit.

Individually, each of those is a known-but-ignored WordPress default. Together, they are a complete brute-force attack chain with the username already handed to the attacker. Our engine now escalates that combination to a CRITICAL finding with a BLOCK verdict, because calling it "medium severity" would be lying about the practical risk.

This is not a WordPress-hate post. WordPress is fine when you harden it. The point is: these three defaults ship together, and they combine into something an opportunistic script can hit in under a minute. The industry data tracks this exactly — WordPress has topped every "most compromised platform" list for years running, because it is also the long tail of the web, and because the failure modes above are the ones that come for free if you do not go out of your way to close them.

Example report — sensitive fields redacted

https://●●●●●●●●.ai

Basic scan · 16 findings · 41s

BLOCK

CRITICAL Unprotected admin endpoint returning user data (path: /api/admin/●●●●●●)
MEDIUM CSP report-only mode provides zero XSS protection
MEDIUM Missing CSRF tokens on state-changing endpoints
MEDIUM Session cookie missing Secure flag on subdomain (●●●●.example.ai)
MEDIUM Generic API key in client bundle

AI inference platform. Score 24, BLOCK driven by the single CRITICAL on an unprotected admin route — the four mediums alone would have been SHIP with a lower grade. Only the hostname, exact admin path, and subdomain are redacted.

The three patterns that actually cause incidents in 2026

Three classes of finding account for a disproportionate share of the realistic-risk alerts in our dataset. If you do nothing else after reading this post, do these three things.

Check every NEXT_PUBLIC_* variable. AI-provider keys (OpenAI, Anthropic, Google AI) in client bundles are the fastest-moving class of incident we see. One leaked key, no spend cap in the provider dashboard, one opportunistic scraper — that is a four-to-five-figure bill by the next morning. Move the call server-side. Always.
Audit your Supabase RLS and Firebase rules with an actual probe. The dashboard saying "RLS enabled" means the toggle is on. It does not mean your data is protected. A USING (true) policy returns every row to the anon key that ships in your frontend. A Firestore allow read, write: if true does the same for Firebase. This is the most common class of data leak we see in vibe-coded apps — and it is invisible to any URL-only scanner.
Probe your own database ports from outside your VPC. Redis, PostgreSQL, Elasticsearch, Docker API on 2375, Kubelet on 10250 — every one of these reachable from the public internet is a full compromise waiting for an opportunistic scanner. A small but non-zero fraction of our batch leaked at least one. Every one of those was a template deploy where someone skipped the firewall step.

What 2026 actually changed

Three things are new this year that old security-research posts miss:

MCP servers are a fresh attack surface. Model Context Protocol servers give LLMs tool-use capabilities. They also expose filesystem reads, shell commands, and API bridges that are often shipped with zero auth because "it's just local." A hostile prompt (via document, web page, or email) can hijack an agent's tool calls through tool-poisoning and exfiltration flows. Our deep scan added MCP-specific checks in Q1 2026; they fire more often than we expected.
The vibe-coding floor is higher than it was in 2024. Modern web frameworks ship stricter CSP defaults. Major managed hosting providers now set HSTS by default on new projects. Managed-backend dashboards in 2025 started nagging about permissive RLS. The baseline improvements are real — it is why our median score of 76 is higher than the 60s we saw a year ago. The ceiling has not moved; apps that wanted to be insecure still manage to be.
Modern anti-bot is everywhere. 9 of our 50 sites bounced the scanner with Cloudflare Turnstile, Kasada, or a similar challenge. A year ago that number was closer to 2. When your homepage can tell a scraper from a human with zero user friction, a whole class of 2021-era scanner assumptions stop applying. Our engine explicitly respects these blocks — we never guess at a score behind a WAF.

Our conclusions

Security can't be the last step anymore. It has to be a ship requirement. The average app we scan has 11 real findings. Most are low severity. Almost none are fixed in the day or two after we flag them, because nobody owns "fix the blog-post-worthy stuff." Teams that treat security as a gate — a thing that has to pass before the deploy button works — end up with a fraction of the findings of teams that treat it as a periodic audit. We see this in our own rescans: the gap between first scan and third scan, for teams that actually fix things, is roughly 30 points of trust score.

The "vibe-coded" vs "legacy CMS" divide is a red herring. Both produce real incidents. Vibe-coded apps fail at the boundary between client and server — keys in bundles, client-side auth, misconfigured integrations. Legacy CMS apps fail at the platform-default level — exposed login pages, enumerable users, out-of-date versions. Different failure modes, identical outcomes for the user whose data leaks.

Header scanning alone is not enough. Our free tier catches platform misconfiguration — which is real, and which is where ~70% of our findings live. The other 30%, and a much bigger share of the critical ones, only show up with a deep scan: RLS policy audit, Firebase rules audit, active DAST, network port probing, JS deep analysis, MCP server checks. That split is not a marketing construction, it is what the data forces.

Make it a ship gate, not an audit

Teams that catch these issues before merge all do the same thing: wire a scanner into CI. The 30-point rescan gap we see between first and third scan on fixing teams? That's compressed to zero when every PR gets a red check for a newly-introduced CRITICAL. Same data, earlier in the pipeline.

GitHub Action — SARIF upload to Code Scanning + PR comments + configurable fail-on severity.
GitLab CI template — same flow as the Action, native GitLab syntax.
Scheduled rescans — already in Pro. Re-scan every morning; the first regression surfaces before the user does.

Methodology

50 URLs submitted, 41 completed, 9 held by WAF. One engine, same rules, same scoring. Scans ran from a single cloud instance, no hand-tuning per target, no manual finding edits. Free scans run 9 of 15 providers; deep scans with credentials run all 15. Every finding is tagged with a severity, a confidence level, and a CWE class. BLOCK is reserved for confirmed criticals, a score below 40, or a compound attack chain. Unauthenticated perimeter scanning of public endpoints, consistent with standard threat-intelligence methodology.

The full list of what we check is at /docs. The free scan is at /scan. If you want the raw per-site data from this research (with the same redactions we published here), email research@sekrd.com and we'll send the CSV.

We run this research quarterly. The next batch will double the sample size and add scheduled re-scans of the BLOCK cohort, so we can measure which patterns actually get fixed.

Don't ship until you're sekrd

Run a free scan to find the vulnerabilities your AI missed.

Scan Your App Free

Back to Blog

ResearchApril 20268 min read

50 sites, 444 findings: the 2026 state of shipping-fast security

TL;DR

Two failure modes, one web. Vibe-coded AI apps fail at the client/server boundary — keys in bundles, RLS misconfigured, client-only auth. Legacy WordPress fails at the platform-default level — exposed login pages, version disclosure, enumerable users.
Same outcome for the user. Different root cause, same customer-data leak. 88% SHIP / 12% BLOCK, average 10.8 findings per site, 444 total across 41 scored sites.
Security belongs in CI, not in a post-launch audit. Teams that treat security as a ship gate catch the 30-point drop before merge. Teams that treat it as an occasional audit read about their incident on HN.

There are roughly two ways a web app ships in April 2026.

The numbers

sites submitted

scored (9 WAF-held)

got BLOCK

444

total findings

Grade (auto-computed from score)	Sites	Share of scored
A (90–100)	5	12%
B (75–89)	16	39%
C (60–74)	11	27%
D (40–59)	4	10%
F (0–39) — BLOCK	5	12%

Vibe-coded AI-built apps: the surface looks fine, the machinery often does not

The failure mode is not missing security headers. It is what happens inside the application once you look past them.

Client-side keys in production bundles. We consistently flagged API-key-shaped strings in /_next/static/chunks/*.js. Some are public-by-design (Google Analytics, Algolia search keys), but a non-trivial share of them are AI-provider keys that the developer put into a NEXT_PUBLIC_* variable to "make it work from the frontend." A single leaked OpenAI or Anthropic key, without a spend cap set in the provider dashboard, is a five-figure problem by the second morning.
Source maps in production. Next.js ships source maps by default. Any visitor can recover near-complete original TypeScript from /_next/static/chunks/*.js.map. Fixable with one line in next.config.ts. Rare to see it set.
Client-side auth guards with no server enforcement. Multiple apps in our batch had routes that router.push("/login") when a local session check failed, while the underlying API endpoint had no auth middleware at all. Two browser dev-tools clicks bypass this pattern.
Open infrastructure on adjacent ports. Our network provider probes 23 common service ports (Postgres, Redis, Elasticsearch, Docker API, kubelet, etcd, …) and flags any that respond from the public internet. We found a small but real number of these — usually Redis or PostgreSQL left on the default port without a firewall because the deploy template skipped that step.

Example report — sensitive fields redacted

https://ontusti●●●●●●●●●.kz

Basic scan · 20 findings · 2m 28s

BLOCK

CRITICAL WordPress brute-force attack chain: admin username + exposed login
HIGH WordPress user enumeration (slug: ●●●●●●●●)
HIGH Generic API key in client bundle (value: AIzaSy●●●●●●●●●●●●●)
MEDIUM WordPress login page publicly accessible (/wp-login.php → 200 OK)
MEDIUM WordPress version 6.9.4 disclosed in HTML
MEDIUM No Content-Security-Policy header

Legacy WordPress on cheap shared hosting. Only the hostname and the leaked admin slug are redacted — the CRITICAL chain and the verdict are the whole point of the picture.

Legacy CMS: "SHIP" can hide a ready-made brute-force target

A smaller share of our batch were small-business WordPress sites deployed on cheap shared hosting. The pattern here is completely different from the vibe-coded cohort, and more sobering.

On one site in particular (anonymized), three things were true simultaneously:

WordPress version disclosed in the HTML via the default <meta name="generator">. Anyone can look up CVEs for that specific version.
The admin username was publicly enumerable via the /?author=1 permalink trick — a decade-old WordPress issue. The site redirected /?author=1 straight to /author/<real-admin-slug>/, giving away the exact login name.
/wp-login.php was publicly accessible, with no IP allowlist, no 2FA, no rate limit.

Example report — sensitive fields redacted

https://●●●●●●●●.ai

Basic scan · 16 findings · 41s

BLOCK

CRITICAL Unprotected admin endpoint returning user data (path: /api/admin/●●●●●●)
MEDIUM CSP report-only mode provides zero XSS protection
MEDIUM Missing CSRF tokens on state-changing endpoints
MEDIUM Session cookie missing Secure flag on subdomain (●●●●.example.ai)
MEDIUM Generic API key in client bundle

The three patterns that actually cause incidents in 2026

Three classes of finding account for a disproportionate share of the realistic-risk alerts in our dataset. If you do nothing else after reading this post, do these three things.

Check every NEXT_PUBLIC_* variable. AI-provider keys (OpenAI, Anthropic, Google AI) in client bundles are the fastest-moving class of incident we see. One leaked key, no spend cap in the provider dashboard, one opportunistic scraper — that is a four-to-five-figure bill by the next morning. Move the call server-side. Always.
Audit your Supabase RLS and Firebase rules with an actual probe. The dashboard saying "RLS enabled" means the toggle is on. It does not mean your data is protected. A USING (true) policy returns every row to the anon key that ships in your frontend. A Firestore allow read, write: if true does the same for Firebase. This is the most common class of data leak we see in vibe-coded apps — and it is invisible to any URL-only scanner.
Probe your own database ports from outside your VPC. Redis, PostgreSQL, Elasticsearch, Docker API on 2375, Kubelet on 10250 — every one of these reachable from the public internet is a full compromise waiting for an opportunistic scanner. A small but non-zero fraction of our batch leaked at least one. Every one of those was a template deploy where someone skipped the firewall step.

What 2026 actually changed

Three things are new this year that old security-research posts miss:

MCP servers are a fresh attack surface. Model Context Protocol servers give LLMs tool-use capabilities. They also expose filesystem reads, shell commands, and API bridges that are often shipped with zero auth because "it's just local." A hostile prompt (via document, web page, or email) can hijack an agent's tool calls through tool-poisoning and exfiltration flows. Our deep scan added MCP-specific checks in Q1 2026; they fire more often than we expected.
The vibe-coding floor is higher than it was in 2024. Modern web frameworks ship stricter CSP defaults. Major managed hosting providers now set HSTS by default on new projects. Managed-backend dashboards in 2025 started nagging about permissive RLS. The baseline improvements are real — it is why our median score of 76 is higher than the 60s we saw a year ago. The ceiling has not moved; apps that wanted to be insecure still manage to be.
Modern anti-bot is everywhere. 9 of our 50 sites bounced the scanner with Cloudflare Turnstile, Kasada, or a similar challenge. A year ago that number was closer to 2. When your homepage can tell a scraper from a human with zero user friction, a whole class of 2021-era scanner assumptions stop applying. Our engine explicitly respects these blocks — we never guess at a score behind a WAF.

Our conclusions

Make it a ship gate, not an audit

GitHub Action — SARIF upload to Code Scanning + PR comments + configurable fail-on severity.
GitLab CI template — same flow as the Action, native GitLab syntax.
Scheduled rescans — already in Pro. Re-scan every morning; the first regression surfaces before the user does.

Methodology

We run this research quarterly. The next batch will double the sample size and add scheduled re-scans of the BLOCK cohort, so we can measure which patterns actually get fixed.

Don't ship until you're sekrd

Run a free scan to find the vulnerabilities your AI missed.

Scan Your App Free