What 30 Days of Real Scans Revealed: 730 Submissions, 9 Threats, 3 Surprises (June 2026)

Published: 6 June 2026 9 min read By ScanTotal Security Team
Last reviewed: 6 June 2026 by Kumari Rajapaksha — Founder, ScanTotal

This is the first time ScanTotal has published a transparency report. The numbers below are the unedited 30-day data from our scanners for the window ending 6 June 2026 — total scan volume, threat detections, daily trends, and the geographic distribution of who actually submitted scans. The data lives in our public dashboard at /threat-stats/; this post interprets what it actually means.

The dashboard view shows the headline figures clearly enough. What it doesn’t show is the texture — the three findings that make the data more interesting than the totals alone, including one that genuinely surprised us. This post walks through those.

The headline numbers

Total scans
730
past 30 days
Threats caught
9
URL only, no test sigs
Detection rate
1.2%
% of scans flagged

730 scans across 31 days is an average of ~24 scans per day. The actual daily range was 6 to 47 — meaning some days got 4× the traffic of others, with no obvious weekday/weekend pattern. URL submissions dominated (441 scans, 60% of volume); file submissions came second (289 scans, 40%). The other tools — email, SMS, QR, threat-intel search — logged smaller volumes in this period.

Nine threats from 730 scans is a detection rate of 1.2%. We’ll come back to what that number actually means — it’s neither a weakness nor a brag — in the methodology note below.

Daily scan volume & threats

Daily scan volume and threats caught, 7 May to 6 June 2026 A line chart showing scans per day (blue) and threats caught per day (orange dashed) over 31 days. The scan volume varies between roughly 6 and 47 per day with no clear pattern. The threat line is at zero on 27 of 31 days, with spikes on 4 specific days. scans/day & threats/day 0 12 24 36 48 7 May 22 May 6 Jun Scans/day Threats/day
Daily scan volume vs threats caught, 7 May – 6 June 2026. Note that the orange threat line is at zero for 27 of 31 days — the 9 confirmed threats clustered into just 4 specific days.

Finding 1 — The file scanner caught zero threats

Across 289 file scans in the window, our file scanner reported zero confirmed threats. Across 441 URL scans in the same window, it caught nine. Same audience, same site, same 30 days — one tool finds the threats, the other doesn’t.

The reflex interpretation is "the file scanner is broken" or "the database needs more signatures". Neither matches what we see in the dataset. The likely explanation is upstream — the kind of submission each tool receives is fundamentally different.

People upload files they suspect are clean. Someone receives a PDF from a known contact and isn’t sure whether to open it. They upload it “just to check”. The base rate of actual malware in that population is genuinely low, because the user’s intuition has already filtered out most obviously dangerous files.

People paste URLs they suspect are dangerous. Someone gets a phishing SMS at 9pm. They paste the link into our scanner specifically because the surrounding context (urgency, unknown sender, claim of unpaid bill) has already triggered suspicion. The base rate of actual threats in this population is higher, because the user has self-selected for "something feels wrong about this".

The two tools are seeing two genuinely different sample distributions. The URL detection rate (2.0%) reflects "things people are already suspicious of, half-confirmed by us". The file detection rate (0%) reflects "things people thought were probably fine, confirmed by us as actually fine". Both numbers are correct; they’re measuring different upstream selections.

This matters for what we ship next. If we wanted to drive up file-scan detection, we’d have to change the user behaviour that brings files to us — e.g. promoting the file scanner specifically for "I just downloaded this and I’m not sure if it’s real" use cases rather than "double-check my legitimate download". That’s a positioning decision, not a detection-engine improvement.

The lesson generalises: low detection rate can indicate a detection problem, but more often it indicates an upstream-selection problem. The fix lives in marketing, not signatures.

Finding 2 — Threats arrived in 4 days, not 30

If you read "9 threats in 30 days", the intuition is roughly "one threat every 3-4 days". The actual distribution is nothing like that. Twenty-seven of the 31 days in our window saw zero threats. The other four days carried all nine. Two days were tightly clustered in late May, two more in early June.

This isn’t random scatter; it’s a real pattern. Threats arrive in waves, not as a steady drip. The most likely explanation: active phishing campaigns hit during specific windows — an attacker sends a million SMS in a few hours and the resulting traffic shows up at scanners within that same wave. The clean days aren’t the scanner missing things; they’re actually quiet days for the threat population our users encounter.

For users reading "low detection rate this month" anywhere in the security industry, the right framing isn’t "the scanner isn’t catching things". It’s “the population of submissions this month happened to be cleaner”. A scanner that reports 0 threats on a Tuesday and 4 threats on a Wednesday is most likely correct on both days, reflecting real differences in the threat campaigns active in each window.

This also means the right time horizon for evaluating a scanner’s catch rate is months, not days. Sampling a single quiet day proves nothing about the engine; sampling a single noisy day overstates it.

Finding 3 — The global spread was wider than the content strategy implies

ScanTotal’s content strategy in 2026 has tilted heavily toward Australian-Indian-diaspora scam coverage. Most of the blog posts target India-specific scams (UPI, EPFO, IRCTC, electricity, fake recruiters); the rest focus on Australia (Centrelink, Linkt, myGov). The implicit assumption is that’s where users are.

The actual scan-submission data shows a more globally distributed audience than the content would predict.

Scan submissions by country, top 15 A horizontal bar chart of the top 15 countries by scan-submission volume during the 30-day window. United States leads at 129 scans (27.8%); Australia is second at 50 (10.8%); Germany, France, the UK, India, Switzerland, the Netherlands, Brazil, Poland, Colombia, Singapore, the UAE and Spain follow. The spread is genuinely global rather than Australian-Indian-diaspora dominated. United States129 (27.8%) Australia50 (10.8%) Germany38 (8.2%) France31 (6.7%) United Kingdom26 (5.6%) India25 (5.4%) Switzerland24 (5.2%) Netherlands21 (4.5%) Brazil20 (4.3%) Poland19 (4.1%) Colombia19 (4.1%) Singapore17 (3.7%) UAE17 (3.7%) Spain11 (2.4%) Plus 60+ more countries below the top 15, each at <1% of volume. Top 5 = 59% of volume. Top 15 = 80%. Geo data via Cloudflare edge IP. No per-user IP storage.
Top 15 countries by scan submissions, 7 May – 6 June 2026. The US leads (27.8%); Australia, Germany, France, and the UK round out a Western-skewed top 5 that accounts for 59% of volume.

Three observations:

The US lead is the biggest non-surprise. The US dominates web traffic for almost every English-language site; we’re not exceptional. 27.8% is in the expected range for a free English-language consumer security tool.

The continental European representation was unexpected. Germany, France, the UK, Switzerland, the Netherlands, Poland, and Spain combined for 22.7% of scans — nearly equal to the US share alone. We have published almost no Europe-specific content. The presence of these users is driven by organic discovery of our general scam-protection content, plus the AI Scam Library entries published in late May which seem to have travelled across European tech-aware audiences fast.

The non-traditional markets are real. Colombia at 4.1%, Singapore at 3.7%, UAE at 3.7%, Brazil at 4.3% — these are not regions our content explicitly targets. They’re showing up because the underlying scams (UPI fraud is universal, AI voice-clone scams are universal, BEC emails are universal) cross borders even when the content was written for one geography.

The implication for our roadmap: at least some of our planned India-focused content might pull cross-traffic from other emerging markets too, particularly Colombia and the broader Latin-America audience. Worth tracking in the next 30-day window whether that pattern persists.

What this changes for what we ship next

Reading the three findings together, two things change:

The file-scanner positioning needs rethinking. Promoting the file scanner as “double-check your downloads” produces submissions where the base rate of malware is genuinely near zero. Promoting it as “you got a file from someone you don’t fully trust — check it before opening” would change the upstream sample. We don’t plan to ship a positioning change immediately, but it’s on the list.

We should track the European audience more deliberately. The 23% European share is too large to leave un-served. Whether that means English-language Europe-specific content (the GDPR-era phishing patterns, EU-specific scam categories like fake DHL Germany / La Poste France notices) or simply ensuring our existing content’s schema and translations work for European search engines — either way, this audience exists and we haven’t deliberately served it yet.

We’ll publish another report at the next 30-day window (~6 July 2026) and a 90-day rollup at the start of September. If patterns hold, the European observation moves from "interesting" to "act on it".

What the 1.2% detection rate actually means

For context on the 1.2% number. A consumer scanner serving a self-selected suspicious-input population should sit in the 1-3% confirmed-threat range. Higher than 5% generally means the engine is producing false positives; lower than 0.5% generally means it’s under-detecting or the upstream sample is genuinely cleaner than expected. Our 1.2% sits squarely in the expected band.

For comparison, when major-platform browser blocklist services report on their detection rates, they typically sit somewhere between 0.5% and 2% across all queried URLs — remembering that their query set includes everything users navigate to, including their own logged-in sessions, where the base rate is much lower. Our 1.2% reflects a more suspicious-skewed input distribution because our users come to us specifically to check something, not as part of their normal browsing.

Methodology & transparency notes

Data source. All numbers are read directly from ScanTotal’s Cloudflare D1 database. The public dashboard at /threat-stats/ exposes the same aggregations refreshed hourly. No external analytics platform receives any of this data.

Privacy. The database does not contain URLs, file contents, file hashes, IP addresses, or any other personally identifying information. What it stores per scan: scan type (URL/file/SMS/etc.), threat-found boolean, threat-name string when applicable, Cloudflare-edge country code, and a timestamp. No data of any kind that could identify an individual user.

Test signatures excluded. Industry test signatures (EICAR for files; AMTSO for URLs) are filtered out of both the dashboard and this report. Including them would inflate the "threats caught" figure with deliberate testing activity rather than real threats.

Window. The 30-day rolling window for this report ends at 6 June 2026 — specifically the 31 days from 7 May through 6 June 2026 inclusive. The dashboard always shows the most recent 30 rolling days; this post is a snapshot.

Licence. The aggregated data in this report is published under CC0 1.0 — reuse with or without attribution.

We expect to publish reports like this monthly, with a longer 90-day analysis at the end of every quarter. If you spot something in the data we should have noticed, or have questions about methodology, the contact form reaches Kumari directly.

See the live dashboard

Same aggregations, refreshed hourly from the same D1 database.

Open Threat Stats Dashboard

Sources & Further Reading

Related

Live Threat Stats Dashboard
Same source data, refreshed hourly.
Recent Scans Feed
Anonymised live feed of the past 24 hours of scans.
How to Read a Security Scan Report
What a detection-count number actually means.
AI Scam Pattern Library
Catalogue of AI-generated scam patterns.