submitted 8 months ago by abobla@lemm.ee to c/hardware@lemmy.ml
[-] ALostInquirer@lemm.ee 40 points 8 months ago* (last edited 8 months ago)

From the article:

As intriguing as the idea is, we have to admit it smacks of a publicity stunt more than an earnest act of preservation. Even if the data is secure, are the robots the new points of failure? What’s to protect them from fires, floods, EMPs, and all the other threats? What about the readers, which are delicate lasers driven by algorithms? In all likelihood, any explorers in the year 12,000 that might stumble onto the remains of the Global Music Vault would just display it in a museum as a collection of crystal coasters.

I was asking myself similar questions to these, alongside even more basic details like, "What if the future computer systems simply aren't compatible with the old filesystems, thus indicating nothing as being present on the storage media (if it's even recognized as storage media to test)?" It's the deeply fascinating problem all long-term information storage/transmission faces regarding future comprehensibility.

[-] pixxelkick@lemmy.world 83 points 8 months ago

It's possible to reverse engineer data you have forgotten how to read.

It's impossible to read data that you know how to read, but it has become annihilated by time.

The former is far more valuable.

[-] bitsplease@lemmy.ml 31 points 8 months ago

“What if the future computer systems simply aren’t compatible with the old filesystems, thus indicating nothing as being present on the storage media (if it’s even recognized as storage media to test)?”

We've reconstructed archaic languages that no living person speaks from fragments of written records, so I find it unlikely that we'll be completely unable to reverse engineer an ancient file system architecture - especially since the most likely scenario for someone actually reading one of these thousands of years in the future is that the reader comes from a more technologically advanced civilization.

Think of what modern archeologists would give to have the equivalent of a wikipedia archive from 10,000 years ago - imagine the colossal amounts of grant funding that would be thrown at the problem if we even suspected such a thing was within reach.

Of course all the other issues about keeping the actual system safe for 10k years are totally valid, but you have to start somewhere, and getting a data storage system that can last that long even in perfect conditions is the necessary first step.

[-] ALostInquirer@lemm.ee 7 points 8 months ago

We’ve reconstructed archaic languages that no living person speaks from fragments of written records, I find it unlikely that we’ll be completely unable to reverse engineer an ancient file system architecture - especially since the most likely course for someone actually reading one of these 1000’s of years in the future is for the reader to be from a more technologically advanced civilization.

I saw another reply make a similar point, and I see where you're both coming from, but seeing a second reply in this vein has encouraged me to ask the question the first one inspired: what if you lack the fragments needed to reverse engineer/reconstruct a means to access the information?

Chances are slim, and to be clear here, I'm by no means knocking this development, as I find it really exciting, but I also enjoy thinking through some of the different potential points of failure. Not from a cynical/pessimistic perspective, but because it's a compelling challenge and puzzle. How much else alongside this specific media may need to survive so that it may remain accessible, directly or indirectly, y'know?

That's as cool and fun to consider as the new storage media itself to me! Come to think of it, maybe I really should look into some kind of archival/museum jobs considering that...

[-] pixxelkick@lemmy.world 8 points 8 months ago

what if you lack the fragments needed to reverse engineer/reconstruct a means to access the information?

In this case the "fragment" isn't even a fragment; it would be a completely intact, start-to-finish, monstrous amount of data.

The larger the "fragment" is, and the more complete it is, the easier it becomes to decode.

And since this data is being purposefully stored in a manner intended for future use, it's very likely it will be encoded in a manner to facilitate and make it as easy as possible to decode in an intuitive manner.

I'd strongly suspect every individual "glass" would have some form of "clue" or "how to" at the start of it that serves as a guide to help the consumer know they are decoding it right.

Off the top of my head, one example would be encoding a bunch of digits of the Fibonacci sequence at the start as character literals (so text form), which even in binary form, when inspected physically with a microscope, would make any scientist go "oh hey, that's Fibonacci!"

Then after that a large blank, followed by perhaps in order the entire ANSI character set from 0 to whatever it goes to now. Or perhaps Unicode.

The whole thing is only like a megabyte or two, so it would be less than 0.1% of the storage data, but having those 2 items at the start of every disk would be an easy way for the consumer to sanity check they are "reading" the data right, and clue them into "yo there's data stored on here" very fast
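
A minimal sketch of that preamble idea in Python. Everything here is made up for illustration: the names `build_preamble` and `looks_like_preamble`, the 64-byte gap, and the plain 0-255 byte table standing in for "the character set" are assumptions, not anything the actual glass format is known to use.

```python
def fibonacci(count):
    """Return the first `count` Fibonacci numbers, starting 1, 1."""
    values, a, b = [], 1, 1
    for _ in range(count):
        values.append(a)
        a, b = b, a + b
    return values


GAP = bytes(64)  # hypothetical "large blank": 64 zero bytes


def build_preamble(fib_count=20):
    """Fibonacci numbers as text, a blank gap, then every byte value in order."""
    fib_text = " ".join(str(n) for n in fibonacci(fib_count)).encode("ascii")
    charset = bytes(range(256))
    return fib_text + GAP + charset


def looks_like_preamble(data, fib_count=20):
    """Sanity check a reader could run: does the disk start as expected?"""
    return data.startswith(build_preamble(fib_count))


disk = build_preamble() + b"...archived payload goes here..."
print(looks_like_preamble(disk))      # read aligned with the start of the disk
print(looks_like_preamble(disk[1:]))  # read that started one byte off
```

The second check fails because the read is misaligned, which is the point: a known pattern at the front gives the reader a quick way to tell whether they are decoding correctly.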

[-] pomodoro_longbreak@sh.itjust.works 1 points 8 months ago* (last edited 8 months ago)

any scientist would go “oh hey thats Fibonacci!”

Agreed, except in my crunchy post-pedal glitter punk opera they would say, "oh hey that's the numbers my screensaver uses!"


Although seriously, what would dictate the "start" of the disk - the top, left, foremost block? I think we can assume they would try to read the data contiguously, but that's about it. I guess you could have some kind of visual indicator, like it's in a different colour...

Interesting problem!

[-] bitsplease@lemmy.ml 5 points 8 months ago

what if you lack the fragments needed to reverse engineer/reconstruct a means to access the information?

Well that's a different question, because now it sounds like you're assuming that significant data loss will occur before it's read. If the storage unit itself is damaged in the meantime to the point where its data is corrupted beyond recovery, then yes - that's a potential total loss scenario. Assuming however that the storage unit remains intact, I don't see how a dedicated team of smart individuals couldn't handle it, unless their technology is somehow inferior to ours.

It's also worth considering that this storage unit probably won't be their very first interaction with modern data storage systems. This may or may not be their first interaction with a data storage system that was actually written in modern times, but unless we have a total technological collapse in the intervening 10,000 years, chances are they'll have records from our time that have been copied over however many thousands of times to make it there. After all, to use a much less extreme example, I don't need to get my hands on a CD-ROM or floppy disk burned in 1991 to get a copy of Linux 0.01; it's been copied over and over through the years and is now available for download online. Data will surely degrade over time, and large chunks will get lost as people stop copying things they think are no longer important, but I feel pretty confident in the idea that enough pieces will make it that far that these scientists (techno-archeologists?) won't be starting from scratch.

[-] ALostInquirer@lemm.ee 1 points 8 months ago

Data will surely degrade over time, and large chunks will get lost as people stop copying things they think are no longer important, but I feel pretty confident in the idea that enough pieces will make it that far that these scientists (techno-archeologists?) won’t be starting from scratch

Right, that's what I was trying to refer to in my reply: not damage to this new storage media itself, but to the surrounding data/storage media that would help in reverse engineering it. Sorry I wasn't clearer about that! I was thinking of a case where you didn't have, say, a Rosetta Stone kind of artifact (or artifacts) to help in translating/reconstructing/reverse engineering.

That's why I wrote that I think it's really unlikely, like yourself, but it's interesting to consider.

[-] spiderkle@lemmy.ca 11 points 8 months ago* (last edited 8 months ago)

A similar problem facing humanity is what to write on the outside of storage sites for spent nuclear fuel rods. Failing to properly pass on potentially lifesaving information to the future of humanity can be deadly. Let's hope there won't be any vital information on Microsoft's new glass storage without Corning Gorilla certification.

[-] Cenzorrll@beehaw.org 7 points 8 months ago

I would think that you could leave a Rosetta Stone with directions on how the data is stored and read. It wouldn't take much, I think. "These glass things contain information, here's how it is encoded. Here's the requirements on reading these". You could start off simple and have a rudimentary one that can be deciphered by hand that describes how to make a device that can quickly pull information from a few others that give directions on how to build another device to read the high capacity ones. You don't need a specific filesystem or computer to read it, you just need to know how to decipher it and that it IS data stored in a certain way, not just cool looking glass art.

[-] ech@lemm.ee 7 points 8 months ago

Might as well ask what's indicative of stone tablets from millennia ago being data to us now? These things aren't discovered and studied in a vacuum. They operate within context - where the items were found, their similarity to other better understood things, known history of data storage, etc etc.

Given enough time and disruption, sure, all context could be lost, but if that's the case, I'd assume figuring out what the weird glass cube thing is would be the least of their problems.

[-] gus@beehaw.org 3 points 8 months ago

More important than the filesystem formats: for media, I hope they're using codecs that are as simple and as close to raw as possible, e.g. PCM and BMP. Chances are pretty high that with something like PCM data, even if nobody had any idea what it was, at some point somebody would stumble upon turning it into audio. I can't imagine ever successfully decoding HEVC data without a specification.
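
To illustrate why raw PCM is so forgiving, here is a rough Python sketch. The sample rate, tone, and helper names (`encode_pcm`, `decode_pcm`, `count_zero_crossings`) are assumptions picked for the example, not part of any archive spec. "Decoding" is nothing more than reading each pair of bytes as a signed integer, and the tone's pitch can be estimated from the numbers alone.

```python
import math
import struct

SAMPLE_RATE = 8000  # samples per second (assumed for this example)
FREQUENCY = 440     # tone in Hz (assumed for this example)


def encode_pcm(seconds=0.1):
    """Encode a sine tone as raw 16-bit signed little-endian PCM."""
    count = int(SAMPLE_RATE * seconds)
    samples = [
        int(32767 * math.sin(2 * math.pi * FREQUENCY * i / SAMPLE_RATE))
        for i in range(count)
    ]
    return struct.pack(f"<{count}h", *samples)


def decode_pcm(raw):
    """Reinterpret each pair of bytes as one signed 16-bit integer."""
    return list(struct.unpack(f"<{len(raw) // 2}h", raw))


def count_zero_crossings(samples):
    """Count sign changes; a steady tone crosses zero about twice per cycle."""
    return sum(
        1 for prev, cur in zip(samples, samples[1:])
        if (prev < 0) != (cur < 0)
    )


raw = encode_pcm()
samples = decode_pcm(raw)
crossings = count_zero_crossings(samples)
estimated_hz = crossings / 2 / (len(samples) / SAMPLE_RATE)
print(len(raw), "bytes ->", len(samples), "samples, roughly", round(estimated_hz), "Hz")
```

A compressed codec offers no such shortcut, since the bytes only make sense after running the codec's full decoding procedure.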

[-] Drusenija@lemmy.world 39 points 8 months ago

"Check it out, I've got terabytes on this small sheet of glass!"

proceeds to drop the glass

"Well... shit."

[-] DerpyPlayz18@lemm.ee 15 points 8 months ago

proceeds to drop the hard drive

"Well... shit."

[-] Drusenija@lemmy.world 8 points 8 months ago

I mean a solid state drive is probably going to have a better chance of surviving a drop than one made of glass. A platter drive, yeah, fair.

[-] andrew@lemmy.stuart.fun 6 points 8 months ago* (last edited 8 months ago)

proceeds to drop the off site data center

"Well... shit. There goes my 3-2-1 backup strategy."

[-] Dogyote@slrpnk.net 35 points 8 months ago* (last edited 8 months ago)

"So they told me that, according to the most advanced theories and techniques in every field, based on extensive theoretical research and experimentation, through analysis and comparison of multiple proposals, they did find a way to preserve information for about one hundred million years. And they emphasized that this was the only method known to be practicable: carving words into stone"

[-] LordKitsuna@lemmy.world 11 points 8 months ago

What is glass except just a bunch of really tiny stones melted together?

[-] Alexstarfire@lemmy.world 24 points 8 months ago

Good, people 10k years from now can discover everyone has been making the same mistakes forever.

[-] Wahots@pawb.social 4 points 8 months ago

And the best porn humanity has to offer, handpicked for preservation.

[-] ramble81@lemm.ee 21 points 8 months ago

Guess I’ll have to buy the ‘White’ album again.

[-] ThePantser@lemmy.world 20 points 8 months ago

Sadly most will be filled with 10,000 year old porn

[-] shasta@lemm.ee 16 points 8 months ago
[-] Grimy@lemmy.world 9 points 8 months ago

I've seen that anime and I don't care how old she is, she still looks way too young to be dressed like that.

[-] spiderkle@lemmy.ca 8 points 8 months ago
[-] pixxelkick@lemmy.world 17 points 8 months ago

This is awesome. I was talking about this with some friends, debating what is truly the best way to store data for the long term (on the scale of thousands of years).

Backing up all of human knowledge and history onto such plates actually seems like a worthy endeavor.

Imagine if we had such detailed records about civilizations thousands of years ago!

We have demonstrated time and time again that if you have a bunch of data unencrypted, it is actually quite trivial to reverse engineer it and decipher it.

Dead sea scrolls, Rosetta stone, etc.

This would be terabytes of data, and likely organized in a way to make it very intuitive to reverse engineer even by someone who has no idea how it works.

We could even case study this. Give a loaded one of these slates to some scientists who have no idea how the data on them is stored and have them try and decipher it.

If they can reasonably succeed quickly with no knowledge on how it works, then it should be easy for someone thousands of years from now too.
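
One rough way to see why unencrypted data is more approachable, sketched in Python: plain text has obvious statistical structure, while encrypted data looks like noise. The sample text and the seeded pseudo-random bytes (a stand-in for ciphertext) are assumptions for the example, and `shannon_entropy` is just a hypothetical helper name.

```python
import math
import random
from collections import Counter


def shannon_entropy(data):
    """Bits of entropy per byte, computed from the byte-frequency histogram."""
    counts = Counter(data)
    total = len(data)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


# Stand-in for plain, unencrypted text on the medium.
plain = ("Backing up all of human knowledge and history onto such plates "
         "actually seems like a worthy endeavor. " * 40).encode("ascii")

# Stand-in for encrypted data: seeded pseudo-random bytes of the same length.
rng = random.Random(0)
noise = bytes(rng.randrange(256) for _ in range(len(plain)))

plain_entropy = shannon_entropy(plain)
noise_entropy = shannon_entropy(noise)
print(f"plain text: {plain_entropy:.2f} bits/byte")
print(f"noise:      {noise_entropy:.2f} bits/byte")
```

The text scores well below the 8 bits/byte maximum and the noise sits close to it. That gap is the kind of foothold a decoder with no prior knowledge could start from.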

[-] Squizzy@lemmy.world 3 points 8 months ago

I've always wondered what if there are multiple paths forward. Like the next people will be completely and unrecognisably different. We could be being bombarded with information from a former civilisation and we don't know because we are just completely different.

Like if we never figured out wireless but had everything else, the history of the world could be being broadcast on the radio and we wouldn't know it.

[-] electrorocket@lemmy.ml 1 points 8 months ago

The Voyager records came with the player and instructions for the hopeful alien discoverer to use.

[-] chemicalwonka@discuss.tchncs.de 15 points 8 months ago

So this new storage technology will probably outlast humanity since there will be no future with global warming and the late stage of capitalism.

[-] pastermil@sh.itjust.works 5 points 8 months ago

Great! Now release it to market at a reasonable price!

[-] chemicalwonka@discuss.tchncs.de 2 points 8 months ago

Will we finally have a data storage technology that will surpass M-Discs?

this post was submitted on 19 Oct 2023
208 points (96.0% liked)