submitted 11 months ago by yournamehere@lemm.ee to c/selfhosted@lemmy.world

Hi, I'm kind of out of ideas. Maybe you know a self-hostable solution.

I want to monitor some of my pages for GDPR compliance and get reports on that.

So far I have tried a Selenium container with k6, Selenium locally with some Python automation, and I even gave PhantomJS with webXray a try. In every case something is missing: Selenium won't grab the third-party cookies, the webXray code relies on outdated Python, and so on.

Do you have any suggestions on how I could self-host my own automated toolset to check entire webpages for all cookies and third-party scripts? People make changes that might affect GDPR compliance, and I don't want to check manually. How would you do it self-hosted?
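For the third-party script part of the question, here is a minimal sketch using only the Python standard library. It assumes you already have the page HTML as a string, and the names `ScriptCollector`, `third_party_scripts` and `first_party_domain` are made up for illustration. It only sees `<script src=...>` tags present in the static HTML, so scripts injected at runtime would still need a real browser.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class ScriptCollector(HTMLParser):
    """Collects the src attribute of every <script> tag in the HTML."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            src = dict(attrs).get("src")
            if src:  # inline scripts have no src and are skipped
                self.sources.append(src)


def third_party_scripts(html, page_url, first_party_domain):
    """Return absolute script URLs whose host lies outside first_party_domain.

    first_party_domain is a hypothetical parameter: the domain you treat
    as your own, e.g. "example.org". Subdomains count as first party.
    """
    collector = ScriptCollector()
    collector.feed(html)
    found = []
    for src in collector.sources:
        # Resolve relative and protocol-relative URLs against the page URL.
        absolute = urljoin(page_url, src)
        host = (urlparse(absolute).hostname or "").lower()
        same_site = host == first_party_domain or host.endswith("." + first_party_domain)
        if not same_site:
            found.append(absolute)
    return found


if __name__ == "__main__":
    sample = (
        '<script src="/static/app.js"></script>'
        '<script src="//cdn.tracker.example/t.js"></script>'
    )
    print(third_party_scripts(sample, "https://example.org/page", "example.org"))
```

Comparing against an explicitly passed domain avoids needing a public suffix list, at the cost of having to name each first-party domain yourself.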

top 7 comments
[-] morras@links.hackliberty.org 6 points 11 months ago
[-] yournamehere@lemm.ee 2 points 11 months ago* (last edited 11 months ago)

Oh wow... how did Google not suggest this to me? Thanks, mate.

Edit: got it running... works like a charm. Just great. Thanks so much.

[-] morras@links.hackliberty.org 3 points 11 months ago

You are welcome :)

[-] steersman2484@sh.itjust.works 4 points 11 months ago

You could try Puppeteer, but I don't know if you can check third-party cookies with it.

Also, I can recommend using Apache Airflow for automation. It is like a fancy crontab with a web UI, email notifications, and lots of other stuff.

[-] yournamehere@lemm.ee 2 points 11 months ago

Thanks. I was thinking about Airflow or n8n for automation, but I still can't find a way to check the third-party cookies. Someone wrote that the Firefox WebDriver could do it, but you'd need to extract them from the profile and so on... nothing really worked.
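One possible angle on the third-party cookie problem, as a sketch only: Selenium's `get_cookies()` returns just the cookies visible to the current page, but with a Chromium-based driver the DevTools call `driver.execute_cdp_cmd("Network.getAllCookies", {})` should, as far as I know, return cookies for all domains under a `"cookies"` key. Treat that call as an assumption to verify against your Selenium and browser versions. Once you have a list of cookie dicts with a `domain` field, splitting them is plain Python. The helper name `split_cookies` and the sample data below are invented for illustration.

```python
def split_cookies(cookies, first_party_domain):
    """Split cookie dicts into (first_party, third_party) lists.

    cookies: list of dicts with at least a "domain" key, in the shape
    browser automation tools usually return.
    first_party_domain: hypothetical parameter, the domain you treat
    as your own; its subdomains also count as first party.
    """
    first, third = [], []
    for cookie in cookies:
        # Cookie domains often carry a leading dot (".example.org").
        domain = cookie.get("domain", "").lstrip(".").lower()
        same_site = domain == first_party_domain or domain.endswith("." + first_party_domain)
        (first if same_site else third).append(cookie)
    return first, third


if __name__ == "__main__":
    # Made-up sample data, not output from a real crawl.
    sample = [
        {"name": "sid", "domain": "example.org"},
        {"name": "ads", "domain": ".tracker.example"},
    ]
    own, foreign = split_cookies(sample, "example.org")
    print("third-party cookies:", [c["name"] for c in foreign])
```

The list of third-party cookie names per page could then be written to a report by whatever scheduler you end up using.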

[-] 0xD@infosec.pub 2 points 11 months ago
[-] yournamehere@lemm.ee 2 points 11 months ago

Those look exciting! Thank you very much.

this post was submitted on 13 Oct 2023
27 points (93.5% liked)
