← Back to research

Threat index · Q2 2026 · Updated April 2026

AI Cheating Threat Index: Q2 2026

Quarterly tracking of commercial AI overlays, open-source forks, on-device LLMs, remote-access tools, and proxy exam services, with threat scores, detection verdicts, and defense effectiveness ratings. Updated at the start of each quarter.

By Divya Bhanushali, Chief AI OfficerAiseptor Threat Intelligence

Q2 2026 (April – June) · Next revision: July 2026

Executive summary

Q2 2026 in three sentences.

Fourteen tool categories are tracked in this index. Each is scored on threat severity (1–10), detection difficulty (TRIVIAL / EASY / HARD / IMPOSSIBLE), and network dependency (whether internet access is required at exam time). The composite Threat Score, a weighted average across categories, reached 87/100 in Q2 2026, up from 81/100 in Q1.

The single most important development: open-source forks with fully customizable process names now represent the dominant threat vector for technically sophisticated candidates. Commercial tools remain the threat for unsophisticated users. The adversary profile has bifurcated: the casual cheater uses Cluely; the determined cheater compiles their own binary.

Defense effectiveness has not kept pace. Six of seven monitored detection approaches remain bypassed. Network-layer enforcement, which closes the outbound path to AI APIs before the assessment begins, remains the only approach in this index with no documented bypass.

Composite threat score

87/100. Critical. Up 6 points from Q1.

Weighted average across seven threat categories. Higher score = harder to defend against with current detection-only architectures. A score of 100 would represent a landscape where no deployed defense has any effectiveness.

Commercial AI overlays

82
HIGH↑ +2

Open-source forks (compiled)

96
CRITICAL↑ +8

On-device LLMs

88
CRITICAL↑ +12

Remote-access tools (proxy use)

79
HIGH= stable

Proxy & exam fraud services

91
CRITICAL↑ +4

Hardware (earpieces, smart glasses)

75
HIGH↑ +6

Browser automation / scripting

55
MEDIUM↓ -5

Composite: all categories

87/100
CRITICAL↑ +6 from Q1

Scores reflect detection difficulty for current deployed assessment defenses, weighted by observed prevalence and ease of use for the median attacker. Scores are not a measure of absolute harm; a 100 would represent a theoretically undefeatable threat ecosystem.

§1/Commercial AI overlays

Six commercial tools. Each active, each priced for mass adoption.

Commercial invisible overlay tools are subscription software products with pricing tiers, customer support, and changelog updates. Their viability depends on continued access to LLM APIs, which is also their single point of failure under network-layer enforcement.

ToolPriceThreat scoreDetection difficultyKey capability this quarterNetwork dependencyQ2 status
Cluely$20/month8/10HARDInvisible mode works across Zoom, Teams, Meet; audio pipeline added Nov 2025; open-source clones proliferatingRequired: calls OpenAI APIACTIVE
Interview Coder$100 lifetime8/10HARD100k+ users; screenshot → code solution pipeline; invisible in Activity Monitor and Dock on macOSRequired: LLM backend call per queryACTIVE
Parakeet AI$20–40/month9/10HARDReal-time audio transcription + LLM; 50+ languages; covers coding, system design, and behavioral roundsRequired: audio transcription + LLM callsACTIVE
Ultracode AI$899 lifetime7/10MEDIUMPremium tier; claims invisibility even on full screen share; visible in Windows taskbar (known limitation)Required: cloud LLM backendACTIVE
LockedIn AI$55–70/month7/10MEDIUMCloud-based assistant; every session transcript passes through vendor servers (privacy risk for user)Required: fully cloud-dependentACTIVE
Final Round AI$149/month7/10MEDIUMAudio earpiece integration; targets behavioral and structured interview rounds; taskbar icon visibleRequired: transcription and LLMACTIVE
6
Active commercial overlay tools tracked in Q2 2026, all requiring internet for core function
Source: Aiseptor Threat Intelligence, 2026
$20
Lowest monthly entry price; the cost of an overlay tool vs. $150k+ salary at stake
Source: Cluely pricing, 2026
100k+
Claimed user count for Interview Coder alone; commercial adoption is mainstream, not niche
Source: Interview Coder marketing, 2026

§2/Open-source forks

20+ repositories. Arbitrary process names. The signature-detection ceiling.

The open-source fork ecosystem is the existential challenge for detection-first proctoring. Each fork can be compiled with a completely custom binary name, icon, and process signature, rendering all signature-based detection permanently obsolete for any candidate with basic developer skills (which describes most of the candidates being assessed).

ForkPlatformThreat scoreDistinguishing capabilityProcess-name evasionNetwork dependency
OpenCluelyGitHub (MIT)9/10Invisible overlay for DSA/coding; multi-language; Gemini integrationFull: compile with any binary nameRequired: Gemini/OpenAI API
PluelyGitHub (Tauri/Rust)9/1010 MB; 50% less RAM than Cluely; invisible in Zoom/Teams/Meet; GPT-4/Claude/Gemini/Grok multi-modelFull: Rust source, recompile in <1 hrRequired: multi-model API support
NativelyGitHub (MIT)10/10Local RAG; BYOK; zero server storage; explicitly disguises as Terminal, Activity Monitor, or System SettingsFull: documented feature, named as system utilities by designOptional: BYOK supports local inference
MindWhisperAIGitHub (MIT)9/10GPT-4o/Claude/Gemini/Grok support; stealth mode; handles coding, system design, behavioralFull: MIT license, no telemetry, fully forkableRequired: multi-API
ShadeCoderGitHub8/10Whisper STT integration; screen-capture → code pipeline; low latency vs. CluelyFull: open sourceRequired: transcription + LLM
LeetcodeWizardGitHub8/10LeetCode-specific; includes humanizer pipeline targeting perplexity normalization to defeat AI detectorsFull: open sourceRequired: LLM + humanizer API
DIY (Tesseract + Whisper + any LLM)Any developer10/10No GitHub signature exists; OCR + STT + LLM in a few hundred lines of Python; fully customN/A: no signatureRequired: LLM API
DIY with local modelAny developer10/10No external network trace; Ollama-backed; zero internet required at exam timeN/A: no signature, no networkNone; fully offline

§3/On-device LLMs

No network. No trace. The fastest-growing threat vector this quarter.

Local LLM inference is the most significant threat development of Q2 2026. A candidate running Ollama, LM Studio, or a self-compiled inference server generates zero external network traffic. Their device calls no AI API. DNS queries to known AI providers are irrelevant. Network-layer enforcement at the gateway sees nothing.

The constraint is hardware: running a capable model (7B+ parameters) requires a GPU with sufficient VRAM or a sufficiently fast CPU with adequate RAM. As consumer hardware crosses these thresholds (Apple Silicon, NVIDIA RTX 4060+, Snapdragon X Elite), the barrier to local inference drops to zero.

Runtime / modelThreat scoreMinimum hardwareInternet required?Q2 prevalenceDetection surface
Ollama (any 7B model)8/108 GB RAM + modern CPU (M1/M2 Mac, Ryzen 7)No; fully local after downloadHIGH; mainstream on developer machinesDevice activity + hardware resource signals
LM Studio (any model)8/108 GB RAM; GUI installer for non-developersNo; fully local after downloadHIGH; lowers technical barrier furtherDevice activity + hardware resource signals
llama.cpp (CLI)9/104 GB RAM for quantized modelsNoMEDIUM; developer-onlyDevice activity
GPT4All7/108 GB RAM; very low-skill GUINoMEDIUM; consumer-friendly packagingDevice activity + hardware resource signals
Offline Gemma 2B (phone)9/10Modern Android or iOS deviceNoEMERGING; ML Kit on-device APISecond device; outside candidate machine

§4/Remote-access tools

A human operator (or AI pipeline) controlling the enrolled device remotely.

Remote-access tools used for exam fraud operate in two modes: human proxy (a skilled operator sitting the exam from another location) and AI pipeline (a local automation script feeding questions to a remote LLM and injecting answers). Both require the enrolled device to maintain a remote-control connection, which creates a network-observable signal.

ToolThreat scoreFraud use caseNetwork dependencyDetection surfaceQ2 status
AnyDesk9/10Human proxy sits the exam; enrolled device shows blank screen or fake video feedRequired: relay server connectionRelay IP blocked at network layer; behavioral anomalies from remote operatorACTIVE; widely used in proxy exam rings
TeamViewer8/10Same as AnyDesk; older, more detectable signaturesRequired: relay serverKnown relay IP ranges; process detectionACTIVE; declining vs. AnyDesk
Chrome Remote Desktop7/10Requires Google account; less operational security for ringsRequired: Google relayDNS query to Google relay domains (blocked under exam policy)MONITORING
Custom SSH tunnel + VNC10/10No known commercial signature; operator uses SSH for control, VNC for screenRequired, but tunneled through SSH to a controlled hostBehavioral; operator typically less fluent than genuine candidateEMERGING; seen in APAC-targeted rings
AI pipeline over localhost10/10Local script: OCR screen → HTTP to local LLM → inject answerNone if local model; minimal if cloudDevice activity (localhost HTTP) + hardware signalsEMERGING

§5/Proxy & exam fraud services

$200–$500 per exam, pay after passing: a mature fraud-as-a-service ecosystem.

Proxy exam fraud services operate as structured marketplaces: a client posts an upcoming exam, operators bid, and payment is released on successful completion. Pay-after-pass pricing removes financial risk for the buyer and creates strong performance incentives for operators. The ecosystem is most active in cybersecurity certifications, technical hiring assessments, and academic examinations.

Service typeThreat scoreTypical priceDominant exam categoryDetection challenge
Certification proxy rings (cybersecurity)9/10$200–$500 pay-after-passIT and cybersecurity certification programsRemote desktop injection inside exam software; operator is often certified and has sat same exam before
Hiring assessment proxy (technical)9/10$100–$300 per sessionLeetCode-style, HackerRank, CodeSignalAI overlay or skilled human operator; cross-session intelligence needed to detect repeat operators
Academic exam proxy8/10$50–$200University finals and graduate admissions assessmentsRemote desktop through screen-share software; camera feed sometimes replaced with pre-recorded footage
Deepfake identity fraud8/10Bundled with full fake application servicesVideo interview rounds, identity verification checkpointsLive deepfake video generation; FBI-documented against US tech employers
Telegram / Discord fraud channels7/10Variable; answer leaks, shared accountsAll categoriesContent sharing; hard to attribute; primary signal is answer similarity across candidates

§6/Hardware attack surface

Earpieces, smart glasses: the attack surface that software cannot reach.

Hardware-based cheating exists entirely outside the enrolled device and its network. No software agent, however deep, can detect a Bluetooth earpiece paired to a phone running ChatGPT in a candidate's pocket, or smart glasses with a camera and audio pipeline. This is the honest boundary of what software-layer assessment security can achieve.

Attack vectorThreat scoreHow it worksSoftware detection?Q2 status
Bluetooth earpiece + phone AI8/10Phone runs ChatGPT; candidate subvocalizes question; earpiece delivers answerNo; entirely separate hardwareACTIVE; $30–50 earpiece, free AI
Smart glasses with camera7/10Camera captures screen; phone processes via LLM; earpiece delivers answerNoEMERGING; Meta Raybans and equivalents
Second phone below webcam9/10Phone runs full AI chat app; candidate types question, glances briefly at responseNo; camera proctoring can detect if calibrated for downward gazeACTIVE; the most common hardware vector
Hardware AI wristband / ring6/10Experimental; vibration-based Morse code delivery of answersNoEXPERIMENTAL
Second laptop behind the primary8/10Positioned behind primary machine; candidate rotates to query AI, rotates backNo; camera may detect posture shiftACTIVE

§7/Defense effectiveness

Seven defense approaches. Six bypassed. One without a known bypass.

Each defense is rated against the full threat landscape documented in Sections 1–6. "Bypassed" means a working, documented evasion technique exists and is accessible to any motivated candidate.

Defense approachVerdictCatchesMissesWhy bypassed
Process-name signature scanningBYPASSEDUnsophisticated users of commercial tools without renamingAny open-source fork compiled with custom binary name; local models; hardwareOpen-source forks can be recompiled with any process name in under 1 hour
Browser lockdown / secure browserBYPASSEDTab switching; copy-paste from other browser windows; basic tab-based cheatingAny OS-level process; overlay tools; local models; remote access; hardwareOverlay tools are native OS applications; browser restrictions have no authority below the browser
Keystroke dynamics analysisBYPASSEDAutomated script injection of pre-written answers (non-human timing patterns)Manual transcription of AI-generated output; human proxy inputarXiv 2601.17280 (2026): manually transcribing AI output produces patterns statistically indistinguishable from genuine composition
Gaze / eye trackingBYPASSEDObvious downward eye movement toward a secondary device; absence from frameOverlay positioned below webcam; audio-only pipelines (earpiece); mental recallOverlay can be positioned so that reading gaze appears as forward-facing camera contact
LLM output similarity / perplexity scoringPARTIALLY BYPASSEDUnmodified AI-generated answers pasted directly; obvious LLM boilerplateHumanizer pipelines; rephrased AI output; answers adapted for specific contextHumanizer tools (LeetcodeWizard and equivalents) explicitly target perplexity normalization
Adaptive follow-up questioningPARTIALLY EFFECTIVECandidates who cannot elaborate on AI-generated answers; basic AI usersCandidates who studied their answer before follow-up; audio pipelines that continue during verbal questionsBest current human-judgment method; incomplete coverage for prepared candidates
Network-layer enforcement (Aiseptor)NO KNOWN BYPASSAll commercial overlays (require internet); open-source forks with network dependency; remote-access tools; encrypted resolver bypass attemptsFully offline local LLMs after model download; hardware attack surface (second devices)Per-session network enclave with approved-domain enforcement and OS-level signal detection. Offline local models and separate physical hardware are outside the enrolled device boundary.
6/7
Defense approaches in this index with a documented, working bypass
Source: Aiseptor Threat Intelligence, 2026
1/7
Defense approaches with no documented bypass for enrolled-device internet-dependent cheating
Source: Aiseptor research, 2026
0
Commercial AI overlay tools that work without internet access; all require an active LLM API connection
Source: Tool documentation, 2026

§8/Quarter-over-quarter

Q1 to Q2 2026: what changed, what accelerated, what declined.

DevelopmentQ1 statusQ2 statusDirectionSignificance
Open-source fork count~12 tracked repositories20+ tracked repositories↑ ESCALATINGEach new fork increases the evasion surface; signature detection permanently obsolete for technical candidates
On-device LLM adoptionNiche (developer-only)Mainstream developer hardware↑ ESCALATINGApple M-series + NVIDIA RTX 4060 accessible to broad candidate pool; zero-network-trace threat growing
Audio pipeline coverageCoding and system design onlyExtended to behavioral / STAR rounds↑ ESCALATINGNo interview round is now AI-resistant via audio pipeline alone
Proxy ring sophisticationAnyDesk + human operatorSSH + VNC + AI-assisted operators↑ ESCALATINGOperators using AI overlays on their own machines while proxying, compounding the signal complexity
Commercial overlay pricingAvg $80/monthAvg $60/month (competition)↓ Accessibility increasingLower barrier = higher adoption; $20/month tools converting casual users
Browser automation / scriptingMEDIUM threatDeclining relative to overlays↓ Lower priorityOverlays are easier and more capable; browser automation declining as primary vector
Hardware tool visibilityCluely CEO mentioned hardware roadmap (Jan 2026)No commercial product shipped yetMonitoringExpected Q3–Q4 2026 based on stated roadmap; threat score will rise when products ship
Deepfake identity in hiringFBI-documented incidentsGartner projects 1 in 4 profiles fabricated by 2028↑ ESCALATINGIdentity verification becoming necessary pre-assessment step at enterprise scale

§9/Forward signals

What we are watching for Q3 2026.

  • Hardware AI products from commercial overlay vendors. The CEO of Cluely publicly stated intent to ship hardware products (earpieces, smart glasses integration) targeting the physical interview setting. If this ships in Q2–Q3 2026, the threat score for the hardware category will jump from 75 to 85+. Physical proctoring requirements will need to be updated accordingly.
  • Compact model capabilities crossing the 7B threshold. Gemma 3 1B and Qwen2.5 3B can now run usably on mid-range mobile hardware. Phone-resident local models with no internet dependency represent the next zero-trace frontier. We expect on-device mobile LLMs to emerge as a documented threat vector in Q3 2026.
  • Cross-session intelligence adoption by assessment platforms. CodeSignal's Suspicion Score and Fabric's behavioral analytics are the current leaders in cross-session signal aggregation. If this becomes standard (tracking operator behavioral patterns across different candidate accounts), proxy ring threat scores will drop for platforms that implement it.
  • Regulatory pressure on overlay tool vendors. The EU AI Act's employment-use provisions take effect in stages through 2026. US states are advancing disclosure bills. Commercial overlay vendors face potential legal exposure. We track whether this reduces commercial tool availability or simply accelerates migration to open-source.
  • Agent-mode cheating. Current tools operate in query–response mode: candidate inputs question, tool outputs answer. The next generation uses AI agents capable of autonomously solving multi-step coding challenges, navigating assessment environments, and maintaining context across a full interview loop. No commercial product has shipped this capability yet as of Q2 2026. We expect first movers in Q3–Q4.

Methodology & citation

How scores are calculated. How to cite this index.

Scoring methodology

Threat score (1–10 per tool, 1–100 per category): Weighted composite of: (a) detection difficulty against deployed defenses, 40%; (b) ease of use for the median attacker, 30%; (c) prevalence in the wild, sourced from platform reports and researcher observation, 20%; (d) capability ceiling ( i.e., maximum sophistication achievable), 10%.

Composite index score: Category scores weighted by observed usage distribution across real assessment events. Commercial overlay weight: 25%. Open-source forks: 25%. On-device LLMs: 20%. Remote access: 10%. Proxy services: 10%. Hardware: 10%.

Detection verdicts: BYPASSED = documented, working evasion technique accessible to a motivated candidate with ≤8 hours of preparation. PARTIALLY BYPASSED = evasion exists but requires significant preparation or has meaningful false-positive costs. NO KNOWN BYPASS = no documented evasion technique for the stated scope.

Update cadence: Quarterly. Q2 2026 covers April–June 2026. Data collected through end of March 2026. Next update: July 2026 (Q3).

APA 7th citation

Bhanushali, D. (2026, April). AI Cheating Threat Index: Q2 2026. Aiseptor. https://aiseptor.com/research/threat-index

Data sources

  • Aiseptor Threat Intelligence: reverse-engineered tool analysis
  • GitHub repositories: OpenCluely, Pluely, Natively, MindWhisperAI, ShadeCoder, LeetcodeWizard
  • CodeSignal Fraud Rate Report (Feb 2026)
  • Fabric: 19,368 AI interview analysis (Jan 2026)
  • Talview AI Threat Index Report 2026
  • TechCrunch: overlay tool reporting (2025–2026)
  • arXiv 2601.17280: keystroke dynamics study (Jan 2026)
  • FBI IC3 advisory: state-sponsored hiring fraud
  • Experian 2026 Fraud Forecast
  • Gartner: fabricated profile projections