this post was submitted on 25 Jun 2026
21 points (100.0% liked)

TechTakes

2602 readers
18 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 3 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Soyweiser@awful.systems 3 points 2 days ago (2 children)

So, anybody know the regular linux commands to turn a pdf into markdown? I assume there is a simple command that does that for you, if there isn't already a pdf2markdown.

[–] flaviat@awful.systems 3 points 21 hours ago (1 children)

There cannot be such a thing since pdf does not structure its data. There is an extension to the standard that would let a program do it for you but nobody uses it (PDF/UA-1). (also pandoc is vibe coded now)

[–] froztbyte@awful.systems 2 points 11 hours ago (1 children)

yeah, my answer to this also used to be pandoc until they took the prompt unto their soul

it's deeply fucking frustrating

[–] Soyweiser@awful.systems 2 points 6 hours ago

That sucks so much. But thanks anyway everybody, my post was half shitpost, half serious. (And I know some things can't be easily converted, (but my regexp to match xhtml script is almost complete).

I'm a bit surprised (but not totally) there actually was a proper tool for it a bit, even if it is vibe corrupted now.