CAPTCHAs can still detect AI agents

CAPTCHAs are great. Exploiters get around them with proprietary anti-detect browsers and unethical residential proxies, while privacy browsers and affordable privacy VPNs get blocked and shadowbanned to death.

Fingerprint.com, while not a CAPTCHA, gives you +3 suspicious score just for using privacy settings like adblock on your browser. This makes it harder to sign up for any sites that use fingerprint.com.

https://github.com/CloakHQ/CloakBrowser is a good anti-detect browser as well as CAPTCHA bypass.

I think it's just a game of cat and mouse. It might be easier to catch naive AI agents that are not fine-tuned for specific CAPTCHA tasks with human behavior, can't recognize new challenges, don't know when to stop and ask a human, and just want to brute force their way with limited or no specialized harness and tools available.

Apparently CloudFlare’s turnstile can’t, as evidenced by several public-facing CRUD and mail routines we maintain that no longer are warding off the spam.

I’ve been using Claude Opus 4.7 with Chrome MCP, and it has worked successfully about 95% of the time. However, I’ve failed various hCaptcha challenges.

Until they learn to do that. So cat and mouse. So nothing new.

https://github.com/CloakHQ/CloakBrowser is a good anti-detect browser as well as CAPTCHA bypass.

Apparently CloudFlare’s turnstile can’t, as evidenced by several public-facing CRUD and mail routines we maintain that no longer are warding off the spam.

Yeah, we benchmarked against a few bot detection provides end of last year (https://research.roundtable.ai/bot-benchmarking/), and Turnstile didn't do great when it came to AI agent detection. We hypothesized that Turnstile primarily focuses on device/network characteristics, which AI agents can bypass

Meanwhile the moment I (a human, of which I'm reasonably confident) see a Cloudflare captcha I nope immediately out of the site and block it forevermore in Kagi. It's not worth the waiting game. "Verifying..." lasts ages.

The anime girl captcha works fine and provides no such annoyance.

Until they learn to do that. So cat and mouse. So nothing new.

think the point is that they can't just "learn to do that", because to do so would mean solving human mind (that famously hasn't been going well)

I’ve been using Claude Opus 4.7 with Chrome MCP, and it has worked successfully about 95% of the time. However, I’ve failed various hCaptcha challenges.

The thing many people miss is that the challenge itself isn't the primary signal. The challenge creates an opportunity to observe user activity. You're browser is also fingerprinted.

This is relatively close to our conclusion from the paper: unless agents are specifically trained for the task and know all the information ahead of time, they're not able to generalize from one cognitive CAPTCHA to another

The anime girl captcha works fine and provides no such annoyance.

The thing many people miss is that the challenge itself isn't the primary signal. The challenge creates an opportunity to observe user activity. You're browser is also fingerprinted.

think the point is that they can't just "learn to do that", because to do so would mean solving human mind (that famously hasn't been going well)

until Google trains an AI model off that data, too

Well no, the idea is a tradeoff between interfaces and telemetry.

OK, the agents don't click in the same way as humans. You learn that, what about mouse hovering telemetry, time spent, etc. And one of the most extreme is to force biometrics - a lot of telemetry, breaks the interface a lot - but hey, you have assurance.

And none of these tradeoffs require understanding the deep processes of the human mind. Just, map is not the territory, how you do game the map harder and harder and how do the mapmakers respond to that?

Well no, the idea is a tradeoff between interfaces and telemetry.

did you look at the paper? they specifically look at mini tasks with cognitive processes (Eg what dictates the strategy of how people solve tasks)

until Google trains an AI model off that data, too

I mean, their CAPTCHAs presumably have tons of data collected over the years, and they can't detect a pretty clear AI agent here: https://www.youtube.com/watch?v=UeTpCdUc4Ls

did you look at the paper? they specifically look at mini tasks with cognitive processes (Eg what dictates the strategy of how people solve tasks)

LLMs can solve original math problems at the IMO level and beyond, and you might be talking to one now. I don't think they are going to have problems with any CAPTCHA short of separate device attestation.

Whatever mechanism the paper proposes, rest assured it can be trained on.

This is a ~1000 word overview of our recent machine learning conference paper submission. To read the full preprint, click here.

"CAPTCHAs are broken these days." AI can easily identify all the traffic lights in a static grid. So CAPTCHAs don't provide a valuable human signal, right?

Yes and no.

Yes, because vision language models (VLMs) can recognize images like chimneys, fire hydrants, and traffic lights. Deep learning "solved" CAPTCHA-style image classification in the early 2010s.

No, because AI does not complete CAPTCHAs like humans. If you look across all the data of humans and AI completing CAPTCHAs, you start noticing differences in features like error patterns. Our recent paper found statistically significant differences across sequential click patterns, direction changes, and overselection behavior - features that define how a participant, agent or human, would solve the CAPTCHA problem. In other words, AI can solve CAPTCHAs, but they don't solve them like humans.

CAPTCHA results

Figure 1: Humans and Claude/GPT/Gemini perform at similar task performance levels on the classic CAPTCHA, but there are statistically significant process differences across features like sequential score, direction change, and overselection.

The Turing Test - originally proposed in 1950 by Alan Turing - offers a simple criterion for machine intelligence. If a judge cannot reliably distinguish a machine's responses from a human's, the machine can be considered intelligent.

Turing understood this behavioral criterion was a concession and not the end-all-be-all of human vs. machine intelligence. He had to concede: the question is too difficult, abstract, and loaded. Behavioral indistinguishability provided a more tractable condition, and one that seemed like a good North Star in the 1950s.

Following Turing's footsteps of defining an adversarially robust discriminator that can separate humans from bots, we designed CogCAPTCHA30. This goes one level deeper than the Turing Test, from exploring output (what humans and agents can do) to process (how it can do it). CogCAPTCHA30 combines the original CAPTCHA with 29 classic cognitive psychology tasks for a 30-task battery.

CogCAPTCHA30 summary

Figure 2: CogCAPTCHA30 measures humans and agentic process behavior across decision-making, memory, perception, and reasoning.

We recruited human participants and also deployed AI agents to perform these tasks. The CAPTCHA experiment demonstrated that humans and agents can perform at similar performance (output) levels, but with different processes. We then measured output equivalence - how (how similar their answers were) and_process equivalence_ (how they arrived at their answers) across the whole 30-task paradigm and found that they were uncorrelated:

Process vs Performance

Figure 3: We measured how similar humans and agents are across output (Cohen's d) and process (AUC). Across the task set, these measures are uncorrelated, suggesting output equivalence does not equal process equivalence.

While the classic Turing test measures whether a machine produces output indistinguishable from a human, we propose a Process Turing Test measuring whether machines produce a process indistinguishable from humans.

Our results raise two questions: what types of language models - if any - are like humans, and how adversarially robust is this discrimination process?

To answer the first question, we compared the distance between humans and state-of-the-art frontier models (OpenAI's GPT, Anthropic's Claude, Google DeepMind's Gemini) as well as Qwen (an open-source 1.5B foundation model) and Centaur (an open-source 70B-parameter foundation model of human cognition).

Model Results

Figure 4: State-of-the-art frontier models (Claude, GPT, Gemini) have less similar human process features compared to smaller models (Qwen, Centaur).

We found that state-of-the-art frontier models (Claude, GPT, Gemini) have less similar human process features compared to smaller models (Qwen, Centaur). As we argued in AI Capability isn't Humanness, while frontier models are becoming more powerful over time, they are not necessarily becoming more human. Contemporary progress in artificial intelligence is independent of progress in human simulation.

Qwen, a smaller open-source model, is more humanlike than the larger Claude, GPT, and Gemini. And, as a nice validation, Centaur outperforms the other models in similarity to human process feature space. We hypothesize this is due to large-scale output fine-tuning, specifically 10M+ human choices across 160 cognitive experiments.

This introduces the second question: how adversarially robust is the process to discriminate humans from agents? Any behavioral feature used to distinguish the two may itself become a target for optimization. Accordingly, a detector that succeeds against off-the-shelf agents establishes a behavioral gap only under the current attacker model - how AI exists and operates now. It's to be seen whether it can become a durable human-verification signal for the future technologies. This motivates a stronger test: can an agent close the process gap - between how humans and agents complete tasks - when given increasingly direct access to human data?

Model Performance

Figure 5: Direct process-level fine-tuning (P-SFT) makes AI more humanlike, but this advantage is reduced when some features are excluded and completely disappears when asked to cross-task generalize.

We fine-tuned a Qwen2.5 Instruct model to bring it closer to humans. When given full information - the observed features and the discriminator's objective function - the gap between humans and agents disappears. However, the gap reappears when parts of the feature space are left out and fully returns when agents have to generalize cross-task. In other words, the Process Turing Test is robust when the AI does not have full access to the discriminator and the feature set (i.e., the model does not know how it will be evaluated).

The challenge the Process Turing Test poses is whether AI can continuously replicate all of human cognitive psychology. Despite the anxiety that models are becoming more capable over time, they are empirically not becoming more humanlike. Compared to one-time checks like passwords, CAPTCHAs, document identification, and device fingerprinting, the Process Turing Test provides a step-up function in human verification. Simulating human cognitive psychology is an exponentially more challenging task.

Hacker Times

Hacker Times

CAPTCHAs can still detect AI agents

Discussion

Discussion