223
submitted 11 months ago* (last edited 11 months ago) by empireOfLove@lemmy.one to c/datahoarder@lemmy.ml

a TorrentFreak article got me spooked so I fired up the ol' yt-dlp. Got the entire channel, including comments, description metadata, and thumbnail images.

A significant number of videos were actually unavailable because of an odd YouTube bug where 15+ year old videos were listed as "currently being processed". I may re-run this later (since I ran it in archive file mode) to get the missing videos, as it seems there may be about 300 out of 4911 videos missing.

you are viewing a single comment's thread
view the rest of the comments
[-] notasandwich1948@sh.itjust.works 3 points 11 months ago

makes me wonder how the whole thing is sustainable for them, on average it seems about 6gb per 100 videos

this post was submitted on 08 Sep 2023
223 points (96.3% liked)

datahoarder

6497 readers
4 users here now

Who are we?

We are digital librarians. Among us are represented the various reasons to keep data -- legal requirements, competitive requirements, uncertainty of permanence of cloud services, distaste for transmitting your data externally (e.g. government or corporate espionage), cultural and familial archivists, internet collapse preppers, and people who do it themselves so they're sure it's done right. Everyone has their reasons for curating the data they have decided to keep (either forever or For A Damn Long Time). Along the way we have sought out like-minded individuals to exchange strategies, war stories, and cautionary tales of failures.

We are one. We are legion. And we're trying really hard not to forget.

-- 5-4-3-2-1-bang from this thread

founded 4 years ago
MODERATORS