Yeah I was using it before I realized I might need a scraper.

What is the USTR blacklist? how do we preserve this data before its lost?

Very little. I know basic html + css but I am trying to work with python

I test with IDLE for python + use selenium for driver directory (geckodrive)

I could send it to you privately if you let me know ur discord or something

I don't like to touch js so ive being going python only. (besides basic html & Css) but I found puppeteer and didn't really get it.

The discord thing is a no-go since I don't really know how to make my issue palatable. That's why I used lemmy. Thanks again!

I don't want a point and click scraper, just a guide that isn't assuming I have background + simple mans terms for easier reading. Thanks for believing in me to be able to build the basic skills necessary! Much appreciated :3

I recommend talking to a LLM

Any recommendations? Not chat-GPT

Also thanks for the help so far!

52
submitted 3 days ago* (last edited 2 days ago) by MaggotInfested@lemmy.dbzer0.com to c/piracy@lemmy.dbzer0.com

I have been trying for hours to figure this out. From a building tutorial to just trying to find prebuilt ones, I can't seem to make it click.

For context I am trying to scrape books myself that I can't seem to find elsewhere so I can use and post them for others.

The scraper tutorial

Hackernoon tutorial by Ethan Jarell

I initially tried to follow this but I kept having a "couldn't find module" error. Since I have never touched python prior to this, I am unaware how to fix this and the help links are not exactly helpful. If there's someone who could guide me through this tutorial that would be great.

Selenium

Selenium Homepage

I don't really get what this is but I think its some sort of python pack and it tells me to download using the pip command but that doesn't seem to work (syntax error). I don't know how to manually add it in because, again, I have little idea of what I'm doing.

Scrapy

Scrapy Homepage

This one seemed like it'd be an out-of-box deal but not only does it need the pip command to download but it has like 5 other dependencies it needs to function which complicates it more for me.

I am not criticizing these wares, I am just asking for help and if someone could help with the simplification of it all or maybe even point me to an easier method that would be amazing!


Updates

  • Figured out that I am supposed to run the command for pip in the command prompt thing on my computer, not the python runner. py -m followed by the pip request

  • Got the Ethan Jarrell tutorial to work and managed to add in selenium, which made me realize that selenium isn't really helpful with the project. rip xP

  • Spent a bunch of time trying to workshop the basic scraper to work with dynamic sites, unsuccessful

  • Online self-help doesn't go in as much as I would like, probably due to the legal grey area


use the megathread and go by what has a 🐐 beside it

116

I was watching on it this morning but I just tried to go on it and this came up. The megathread needs to be updated.

https://goodbye.braflix.is/

41
What is oalinst.exe? (lemmy.dbzer0.com)
submitted 3 weeks ago* (last edited 3 weeks ago) by MaggotInfested@lemmy.dbzer0.com to c/piracy@lemmy.dbzer0.com

I see it sometimes when I download games and I usually avoid it but I want to play the game online so I want to know if its a genuine concern. Couldn't find anyone else talking about it on here :P

EDIT: Thanks for the info everyone! Makes much more sense now.

11
submitted 1 month ago* (last edited 1 month ago) by MaggotInfested@lemmy.dbzer0.com to c/css@programming.dev

Making a site JavaScript-less with bootstrap but the CSS is kicking my ass- I do the code directly as it is meant to be, then I try to add one thing and it breaks. I'm gripping on w3schools for dear life and I just can't seem to wrap my head around anything other than the basics. CSS is pain ESPECIALLY when I'm doing it on an external sheet. (I don't want to do internal because all the text gets overwhelming.) Anyone have some ideas to help with this?

Edit: So I realized the browser tool thing is really easy for visuals + that BOOTSTRAP IS INSANELY VAST. For just about every CSS element theirs another 1.5k sub rules which is great for getting specific but not great when you are basically creating a rule for a already ruled element that you have no way of finding easily. Bootstrap is just a functionality CSS sheet I think and not the equivalent to a HTML DLC

(Image is my CSS sheet compared to the crazy amount of CSS sub sheets that exist in bootstrap. My measly little 16 rules look pathetic)

80

I keep seeing in forums and sites like these that say it's frowned upon to not seed torrents that you use/used. I saw a post on here or Reddit (I don't remember) with a guy ecstatic that someone started seeding his download he had been trying to get done for months. I know seeding lets someone download something using your computer but how is it helpful if someone doesn't have a site and/or isn't "in-range" ?

If you can't tell, I don't know much about how torrenting works other than how to download something using one. I hope that you all can just explain or point me in the right direction because I would like to support the community.

view more: next ›

MaggotInfested

joined 2 months ago