this post was submitted on 14 Jun 2023
355 points (100.0% liked)

Technology

41094 readers
248 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 3 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] emstuff@lemmy.blahaj.zone 1 points 2 years ago (9 children)

honestly we should have collectively realized way earlier that putting all the useful, readable, un-touched-by-SEO help content for basically every niche hobby fandom and ideology in the hands of one for-profit entity was not very wisdom-pilled of us

[–] Lanthanae@lemmy.blahaj.zone 0 points 2 years ago* (last edited 2 years ago) (2 children)

One thing the FOSS world really needs to get on right now is some form of search engine accessible distributed content archival. We need a way to store useful content from the past in a way that no one individual or group of individuals is capable of deleting it.

load more comments (2 replies)
[–] withersailor@aussie.zone 0 points 2 years ago

Yes. When everyone enters info on corporate sites, sooner or later they'll decide to monetize it.

Reddit going evil on charges and showing their colours in the AMA has been a wake up.

[–] LunarticBot@beehaw.org 0 points 2 years ago

I just can't agree more with you. Like wow this reddit blackout has truthfully opened my eyes to the massive, giant and incredibly amount of useful information that is currently resting on reddit servers.

[–] twack@lemmy.world 0 points 2 years ago (9 children)

I agree, but I also have serious concerns about this being the replacement strategy. It could be because of my ignorance of how this all works though. Like many of you, I am new and here because of the reddexodus.

These servers are going to cost money, and for many of them the money will run out. Is there a function to preserve the collective content of an entire server once it goes dark? I know that you can migrate your own account to another server, but what happens to everything Google has indexed at Lemmy.world if the worst happens? Is it all just dead links? What if many of the users do not migrate? Is it just gone?

I am concerned that in the current state we are setting up to burn everything that loses a couple admins or becomes too old to economically host.

[–] notroot 0 points 2 years ago

These are certainly possibilities! It's happened elsewhere in the Fediverse... but already we can export most of our data and migrate to a different instance. Getting these base features right is important before enhancing their functionality. Planning for the future is important too. So far I've been impressed by Lemmy, though it's not nearly as portable as Mastodon or Calckey or Pleroma etc. Part of that is that in Lemmy/kbin we don't follow other users... we subscribe to groups (subs/communities/magazines).

Still, with the nature of ActivityPub, it's inevitable that migration tools for Reddit-like federated apps will get built quick-like

[–] railsdev@programming.dev 0 points 2 years ago (1 children)

It would be nice to have some sort of IPFS + Lemmy (or other federated network) witchcraft going.

[–] TerryTPlatypus@beehaw.org 0 points 2 years ago

Makes a lot of sense, especially due to the drama earlier on with Imgur and its image policy

[–] spaduf@lemmy.blahaj.zone 0 points 2 years ago

Before reddit removed them most of this compiled knowledge was in the subreddit wikis. I honestly believe a return to communities with wikis is the long term replacement.

[–] jeena@jemmy.jeena.net 0 points 2 years ago

In practice the content is distributed to all the other servers, so people who have been reading it before will still be able to on their own instance, but you're right the indexed domain is gone and so are the results in Google.

But there is one difference, one instance of lemmy only stores a very small fraction of the content. And it's much easier to fuck up one reddit compared to fuck up thousands of lemmy instances simultaneously. So if one instance goes down, the rest of the fediverse is still up and running.

[–] kotton@beehaw.org 0 points 2 years ago (2 children)

I was on a mastodon server and the owner decided it was not worth his money to keep running. He did not inform anyone on the server or allow any account backups and all was lost.

load more comments (2 replies)
[–] jamesravey@lemmy.nopro.be 0 points 2 years ago

I think it's a fair concern. We've seen other parts of the fediverse successfully implement crowd sourced funding via patron and similar to keep mastodon servers running and I suspect if Lemmy remains "the place to be" admins will have reasonable success with a similar model. Lemmy is super efficient and can support 100s of users on a single box so I think if 1% of users paid like $5 a month you could probably still support 99% of users "for free".

load more comments (3 replies)
[–] jeena@jemmy.jeena.net 0 points 2 years ago (1 children)

That is the main reason why I've been blogging on my own website since 2004 https://paradies.jeena.net/weblog/2004/apr/ersteintrag (and switched to English in 2010 https://jeena.net/posts )

load more comments (1 replies)
[–] noodlejetski@beehaw.org 0 points 2 years ago (3 children)

we should have collectively realized way earlier

some people have, but whenever you'd mention it, you'd be met with "lol take the tinfoil hat off", "but we're already using [for-profit platform] why would we move when everyone's here" and "but it's haaaaaaard".

load more comments (3 replies)
[–] boonhet@lemm.ee 0 points 2 years ago

I've said it numerous times over the years, the Internet has been centralizing rapidly and it benefits none of us.

In 2005 you'd wander around, going from peoples' personal pages to forums to whatever else people linked. In 2015 half of those websites were dead because everyone got their content on reddit anyway.

[–] fiah@discuss.tchncs.de 0 points 2 years ago* (last edited 2 years ago) (4 children)

we can still easily fall into this trap if there isn't a good way to migrate communities between instances. And even if we could just take /c/technology@beehaw.org and move the whole thing to /c/technology@feddit.de or something, that would still break all the indexers' links

load more comments (4 replies)
load more comments (1 replies)
[–] railsdev@programming.dev 0 points 2 years ago (1 children)

Just commented a second ago about IPFS. I think federated networks should be utilizing it somehow.

[–] jamesravey@lemmy.nopro.be 0 points 2 years ago (5 children)

I think IPFS often unfairly gets lumped in with crypto bro shite but it seems to me like a pretty useful technology in many other contexts too.

load more comments (5 replies)
[–] monkeytennis@beehaw.org 0 points 2 years ago (3 children)

Tacking "Reddit" onto search queries almost became a prerequisite. Never imagined I'd have to replace that with "-Reddit".

It's made researching a media centre setup very difficult this week...

load more comments (3 replies)
[–] yads@lemmy.world 0 points 2 years ago (6 children)

Had this happen today. Was searching for some programming related stuff and top pages are all inaccessible Reddit posts.

[–] Olgratin_Magmatoe@lemmy.world 0 points 2 years ago (1 children)

Same. Had some things I needed to look up for my 3D printer and much of the results were inaccessible.

Was a pain.

load more comments (1 replies)
[–] lwaxana_katana@beehaw.org 0 points 2 years ago

Hopefully it will help people realise that a profit motive being attached to everything is actually counterproductive societally.

[–] ghost_in_the_code@lemmynsfw.com 0 points 2 years ago

Same. I found it funny though. Showed that if we tried we can cause some chaos

[–] worfamerryman@beehaw.org 0 points 2 years ago (1 children)

Same, but it’s just growing pains.

We should start rewriting posts in lemmy with the correct information.

load more comments (1 replies)
[–] Jagermo@feddit.de 0 points 2 years ago

Same with Pathfinder 2 questions.

load more comments (1 replies)
[–] thejml@lemm.ee 0 points 2 years ago (3 children)

Definitely saw this coming… can’t imagine what will happen if Stack Overflow pulls something similar. All WebDev/DevOps work will halt overnight.

I’ve been trying to put my issues/solutions in a personal blog or wiki, but there’s so much old info out there in sites like Reddit/SO/medium/etc, it’d be a huge loss when it goes away.

[–] PenguinTD@lemmy.ca 0 points 2 years ago (4 children)

Maybe it really is time to get open sourced AI and bots to archive useful information so they don't get monopolized.

load more comments (4 replies)
[–] schnapsidee@feddit.de 0 points 2 years ago (1 children)

We're going to have to actually read official documentation instead of relying on some greybeard's wisdom on SO 🥲

[–] TheTrueLinuxDev@beehaw.org 0 points 2 years ago

Well, at least stack overflow database dump is available.

load more comments (1 replies)
[–] lastrogue@lemmy.einval.net 0 points 2 years ago

I mean if you do hit this, like I have. You can just use google's webcached view. or sometimes the internet archive.

I found this covers most of my needs: https://cachedview.com/

[–] amber@lemmy.blahaj.zone 0 points 2 years ago (2 children)

I just add “forum” to the back of my search

[–] boonhet@lemm.ee 0 points 2 years ago (1 children)

If you've ever owned an older car, you know that this is the absolute best approach.

Good luck getting exceptionally niche advice for things like that on reddit. Forums get so much more specific, you get an entire forum dedicated to one car model that was only built for 5 years and a bunch of people there know literally everything about it, like the fact that you're better off getting an aftermarket PCV valve because those are built a bit better and don't fail early, or the fact that the shifter cable has the tendency to get water in it so you better be careful shifting out of park on a really cold morning, you might just snap the cable if it's old.

load more comments (1 replies)
load more comments (1 replies)
[–] ghostalmedia@beehaw.org 0 points 2 years ago

100% has this happen today. Wanted and answer, the only answer was on Reddit, and the Google link was busted.

[–] gabuwu@beehaw.org 0 points 2 years ago

People rely far too heavily on reddit for public resources. Here's hoping that changes now.

[–] Dave_r@reddthat.com 0 points 2 years ago (6 children)

So... How is Lemmy set for SEO?

load more comments (6 replies)
[–] ryuko@lemmy.ml 0 points 2 years ago (2 children)

This also highlights the problem with a lot of communities moving to Discord, which inevitably ends up as repositories for critical information, but can't be indexed by Google. Reddit is still valuable as a problem solving resource, and I hope they fix this API fiasco.

[–] j4k3@lemmy.world 0 points 2 years ago (1 children)

I'm willing to bet the lack of api access going forward will make all reddit posts disappear from crawler results anyways. I'm no expert, but I imagine the crawler is picking up on all of the interconnected references to reddit that are all due to free api access. As soon as those connections disappear, so dies the value to the entire community. It will be just like the garbage results we get from every single source now. This is the path of neo digital feudalism.

[–] jamesravey@lemmy.nopro.be 0 points 2 years ago (1 children)

API calls are almost always private between the caller and the endpoint (think telegram bots or mobile apps). There isn't really a technically feasible way for a crawler to somehow "infer" any kind of knowledge of how api calls are being used unless the result has some kind of publically visible side effect (E. G. The program using the api is generating a web page and uploading it somewhere crawlable). Google et Al go by how many links from other pages to the page of interest exist (inbound links) and multiply by a smattering of other things like quality of keywords, length of content etc.

That said, if you're implying that the api changes mean that:

  • people are less likely to use reddit because they can't access it via RIF/Apollo
  • less useful content is added to the site to be indexed,
  • fewer inbound links will be generated that point to existing posts
  • pages stagnate and drop in ranking

That is a plausible concern.

[–] j4k3@lemmy.world 0 points 2 years ago* (last edited 2 years ago) (1 children)

fewer inbound links will be generated that point to existing posts

pages stagnate and drop in ranking

This is what I mean, the external references people had in the periphery will dry up. Like if I'm not using Infinity to generate better refined search results, now I don't post the link to Stack Exchange, and this reference fails to cascade across various copy paste blog resources. Now the original reddit post is a dead end source with no external weighted reference value. It's all of these advanced features implemented in the periphery using the free API that create the usefulness in the first place.

Searching reddit will be just like YouTube searches now. No matter what technical wording you use, you'll never find technical references again. I can type the title of a video on YT verbatim and still won't get the correct results, but I can log into an old account and find the content in my hundreds of playlists I kept as references. It is still there, it is still public.

load more comments (1 replies)
load more comments (1 replies)
[–] mkwarman@beehaw.org 0 points 2 years ago

You can prepend a link with "cache:" to view Google's cached version of the site. This works automatically with the url bar in at least Firefox and Chrome (likely other browsers as well). If your browser doesn't support that you can enter it in the google search bar and the result will be the cached version of the site (if available)

[–] Linuturk@lemmy.onitato.com 0 points 2 years ago

I was looking up traefik labels for my new Lemmy docker-compose setup and had to reference reddit.

[–] spaduf@lemmy.blahaj.zone 0 points 2 years ago (3 children)

Before reddit removed them most of this compiled knowledge was in the subreddit wikis. I honestly believe a return to communities with wikis is the long term replacement.

[–] TerryTPlatypus@beehaw.org 0 points 2 years ago (1 children)

Honestly, not a bad opinion, when the wikis were done well, they did have some extremely useful information. I wonder if we could do something like that in Lemmy...

[–] Deebster@beehaw.org 0 points 2 years ago

That was my first thought - if reddit doesn't want that feature, we'll take it!

load more comments (2 replies)
[–] thisjustin@beehaw.org 0 points 2 years ago (5 children)

It is - but you can still access via archive.org and similar resources.

load more comments (5 replies)
[–] yankeebobo@lemmy.world 0 points 2 years ago

Wait until Google bots catch up and drop many of the links back to Reddit.

load more comments
view more: next ›