this post was submitted on 14 Jan 2026
335 points (98.0% liked)

Open Source

45438 readers
862 users here now

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

founded 6 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] RedBauble@sh.itjust.works 2 points 2 months ago (2 children)

Because the 1000 requests/10 minutes on my server are done by AmazonBot, mostly. Followed by ASNs from Huawei, Azure and the like.

[–] Dave@lemmy.nz 2 points 2 months ago

If big tech are the issue, then try this robots.txt (yes on github...): https://github.com/ai-robots-txt/ai.robots.txt

My issue is with the scrapers pretending to be something they aren't. Tens of thousands of requests, spread over IPs, mostly from China and Singapore but increasingly from South America.

[–] FishFace@piefed.social 1 points 2 months ago

AmazonBot follows robots.txt. I don't so what Huawei and Azure ASNs have to do with it - that sounds like those requests simply come from inside a Huawei and an Azure network, respectively, but could otherwise be anything.