I still think it is just a matter of time until scrapers catch up. There are mor...

kstrauser · 2026-02-11T20:12:46 1770840766

It seems inevitable, but in the mean time, that's vastly more expensive than running curl in a loop. In fact, it may be expensive enough that it cuts bot traffic down to a level I no longer care about defending against. Like GoogleBot had been crawling my stuff for years without breaking the site. If every bot were like that, I wouldn't care.

raw_anon_1111 · 2026-02-12T03:56:36 1770868596

Serious question, in 2026 you can actually have a successful crawler with just curl? I just had to create one for a customer - for their own site - and nothing would have worked without using Chromium.

kstrauser · 2026-02-12T04:23:07 1770870187

Probably not for most sites. Example of a site where it'd likely work: a blog made with a static site generator. Example of one where it wouldn't: darn near anything made with React.

franga2000 · 2026-02-12T17:00:05 1770915605

It works for the majority of things a text mining scraper would care to scrape. It's not just static sites but also any CMS like wordpress, as well as many JS apps that have server-side rendering. SPA-only sites aren't that common anymore, especially for things like blogs, news and text-based social media.

solid_fuel · 2026-02-11T23:23:46 1770852226

Cool, if they're running full blown chromium maybe the next step can be mining bitcoin on any pages served to bots.

hxtk · 2026-02-11T22:04:35 1770847475

Even that functions as a sort of proof of work, requiring a commitment of compute resources that is table stakes for individual users but multiplies the cost of making millions of requests.

gruez · 2026-02-11T22:58:56 1770850736

AFAIK you can bypass it with curl because there's an explicit whitelist for it, no need for a headful browser.

cantalopes · 2026-02-11T22:56:49 1770850609

Well it's a race, just like security. And as long as anubis is in the front, all looks bright