2140
you are viewing a single comment's thread
view the rest of the comments
[-] diyrebel@lemmy.dbzer0.com 6 points 1 year ago* (last edited 1 year ago)

It was coded 8 years ago in Tcl¹ for a one-off project in Belgium. Would you really be interested?

The APIs would have changed dramatically by now & some of the real estate sites no longer exist. Some of the sites brought in CAPTCHAs. It was coded to use Tor & the public transport site has become Tor-hostile and also changed their API. It’s also very user unfriendly.. a collection of scripts & variety of hacks because I was my only user.

I didn’t publish the code at the time because I worried that it would trigger the target sites to become bot-hostile.

① Also note that I use #Tcl for personal use but I resist publishing any Tcl code because I would rather not promote the Tcl language. Why? Because the Tcl folks have jailed a large portion of their docs in Cloudflare’s walled garden. I believe programming language docs should be openly public.

[-] SIGSEGV@waveform.social 1 points 1 year ago

I wouldn't be trying to use or adapt it; it was more of an intellectual curiosity. I haven't written scrapers or used Tcl, so, yeah, I would be interested to take a peek :-)

I'd like to write something similar, but I'd probably use Lisp or Python.

this post was submitted on 26 Jul 2023
2140 points (98.5% liked)

Technology

59773 readers
4400 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS