Selfhosted
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.
Rules:
- Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
- No spam posting.
- Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.
- Don't duplicate the full text of your blog or github here. Just post the link for folks to click.
- Submission headline should match the article title (don't cherry-pick information from the title to fit your agenda).
- No trolling.
- No low-effort posts. This is subjective and will largely be determined by the community member reports.
Resources:
- selfh.st Newsletter and index of selfhosted software and apps
- awesome-selfhosted software
- awesome-sysadmin resources
- Self-Hosted Podcast from Jupiter Broadcasting
Any issues on the community? Report it using the report flag.
Questions? DM the mods!
In my experience, even a site with little legitimate traffic will eventually buckle under the torrent of bots and scrapers once it's been up long enough to get indexed by search engines, so the longer my stuff is out there, the more I expect to need DDoS protection.
I've got bot detection set up in Nginx on my VPS. It used to return 444 (Nginx's "close the connection and waste no more resources processing it" code), but I recently started piping that traffic to Nepenthes to return gibberish data for the bots to train on.
I documented a rough guide in a comment here. Of relevance to you are the two .conf files at the bottom. In deny-disallowed.conf, change the "return 301 ..." line to "return 444".
I also use the firewall and fail2ban on the VPS to block bad actors, overly aggressive scrapers, password brute-forcing, and so on, and the link between the VPS and my homelab equipment never sees that traffic.
In the case of a DDoS, I've done the following:
Granted, I'm not running anything mission-critical, just some services for friends and family, so I can deal with a little downtime.
I have something similar with fail2ban plus hidden buttons. If a requester goes and clicks one of the hidden buttons on the main site, it falls into a rabbit hole, and after 3 requests it gets banned for a bit. That usually stops the worst offenders. OpenAI and some of the other scrapers are the worst.
Google and Bing I do actually see hit robots.txt and then back off, which is what they should be doing.
Oooooh. That's smart. I mostly host apps, but in theory, I should be able to dynamically modify the response body and tack on some HTML for a hidden button and do that.
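One hedged way to do that, assuming Nginx is already fronting the apps as a reverse proxy, is the sub_filter module (requires ngx_http_sub_module): inject a hidden link into proxied HTML and log anything that follows it, so a fail2ban filter can ban the client. The /trap path, log file, and backend address below are made-up placeholders, not anyone's actual config.

```nginx
# Hypothetical sketch: inject a hidden trap link into proxied pages and log
# hits on it separately for fail2ban (or similar) to act on.

server {
    listen 80;
    server_name apps.example.com;

    location / {
        proxy_pass http://127.0.0.1:8080;

        # sub_filter only works on uncompressed responses, so ask the
        # backend not to gzip.
        proxy_set_header Accept-Encoding "";

        # Tack a link onto every HTML page that humans will never see or click.
        sub_filter '</body>' '<a href="/trap" style="display:none" rel="nofollow">do not click</a></body>';
        sub_filter_once on;
    }

    location = /trap {
        # Log trap hits to their own file; a fail2ban jail watching this
        # file can ban IPs after a few hits.
        access_log /var/log/nginx/trap.log;
        return 403;
    }
}
```

The fail2ban side (a filter matching trap.log and a jail with a small maxretry) isn't shown here and would need to be written to taste.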
I used to disallow everything in robots.txt but the worst crawlers just ignored it. Now my robots.txt says all are welcome and every bot gets shunted to the tarpit 😈
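If you want to mirror that in config, a tiny sketch (an assumption, not the poster's actual setup) of serving a welcome-all robots.txt straight from Nginx looks like this; the tarpit shunt itself is the same map/rewrite pattern sketched earlier in the thread.

```nginx
# Hypothetical: serve a permissive robots.txt directly from Nginx.
# An empty Disallow means everything is allowed -- honest bots obey it,
# and the dishonest ones end up in the tarpit anyway.
location = /robots.txt {
    default_type text/plain;
    return 200 "User-agent: *\nDisallow:\n";
}
```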
Nice! That's another way to do it. 😀
I know others use Arabis(?), I think that's what it's called. The anime girl one that does a calculation on top. I've never had good luck with it. I think bots are using something to get around it, and it messes with my requests. Might also be my own fiddling.
You probably mean Anubis.
Whoops, yes!
I've run a publicly accessible low-legitimate-traffic website that has been indexed by Google and others from my home network for >20 years without anything buckling so far. I don't even have a great connection (30 Mbps upstream).
Maybe I'm just lucky?
Consider what a DDoS attack looks like to Cloudflare, then consider what your home server can actually handle.
There's likely a very large gap between those two points.
My server will start to suffer long before traffic reaches the level of a modern DDoS attack.