GamingChairModel

joined 2 years ago
[–] GamingChairModel@lemmy.world 4 points 10 hours ago* (last edited 10 hours ago) (1 children)

Because it obviously was.

The dashes, the short sentences, the bullet points, the overly familiar tone that seems LinkedIn-ish. All of it sounds like AI.

[–] GamingChairModel@lemmy.world 1 points 17 hours ago

At least they moved onto year-based versioning. That was probably the best part about the 26/Tahoe release.

[–] GamingChairModel@lemmy.world 1 points 17 hours ago

It's called decoding and encoding.

But the big data centers doing all the video processing for the big video services (including both permanent videos from a library and things like live streaming) are encoding the videos with settings that require less computational power to decode. The idea is to be able to let even old budget smartphones still be able to display the video with very low power requirements on the client device. There's no universe where consumers decoding digital video will be a high-power computational task.

Restaurants have sharp knives in the kitchen, but generally serve food that requires only minimal cutting effort from the table knives set out with the rest of the table settings. Dining will always be easier than cooking, by a margin that makes the difficulty of dining not worth mentioning, so it would be bizarre to criticize a knife as being only good for cooking and eating food, when plenty of dining tableware knives out there would be insufficient for kitchen work.

You've made the mistake of lumping decoding and encoding together based on the algorithmic/mathematical similarity of those tasks, when everyone else is more inclined to discuss the very different end user use cases of those computing needs.

[–] GamingChairModel@lemmy.world 8 points 2 days ago (1 children)

Don't they have their CFO not even reporting directly to be CEO? I would bet that there's a ton of internal dissent about timing and strategy of how to cash out.

Yes. But major differences:

The dot com buildout of physical communications infrastructure involved basically 3 things:

  1. Switches/routers at the nodes for sending signals down the right route.
  2. Fiber optic cables connecting the nodes.
  3. Legal rights of way and easements for the legal right to keep the physical assets in that physical place, and to maintain/replace the stuff as needed.

Category number 1? That stuff went obsolete quickly, and wasn't really reused after the crash.

Category number 2 was better. Turns out, fiber optics can carry signals on a lot more channels than those fibers were originally designed for. And they're designed for useful lives measured in decades. So even if they sat dark from being unused for 5-10 years, eventually they could be used again.

Category 3 is super important. That legal right is basically permanent, and so long as communications equipment needs to physically go from one place to another, having that legal right can be built on and profited on (including the ability to sell or lease those rights).

What's that gonna look like for the AI infrastructure? The servers full of GPUs are the bulk of the cost, and the GPUs are replaced with a new generation every 1-2 years, seem to require all new power and cooling infrastructure every 1-2 generations or so.

Plus the AI buildout looks to be several trillion dollars. Even adjusting for inflation, that's so much more than the tens of billions that each telecom company built out that infrastructure.

And it's hard to see how the servers themselves will be useful for regular businesses, much less consumers. A Blackwell 72-GPU server is $3 million and takes 130 kW to run. A residential electrical line maxes out at about 48kW. The newest Vera Rubin servers are projected to be up to 600kW, with all the power and cooling management that comes with that, plus all the ultra high end networking stuff built into that rack. Even deep pocketed businesses will have trouble finding a use for that server rack worth millions, requiring a ton of supporting infrastructure that not even normal pre-2025 data centers have.

I don't think government funding can actually offset the crash in consumer and business demand being insufficient to cover the cost of the most expensive models on the most expensive GPUs. But if you look through my comment history I've made the comparison to supersonic flight, because I genuinely believe there's a possibility that governments fund the expensive branch of this technology for their own military or surveillance or law enforcement purposes without the benefits necessarily actually spilling out into normal commercial applications.

We've hit the point where training a model (both pre training and post training) isn't the expensive part, and the expensive part is actual inference, which makes it hard to scale the most expensive models to where it's useful for a lot of people. So it might be that the companies and governments that can afford to operate an expensive model might be the only ones to do it. And they'll be able to, without necessarily the public being able to have access to the same tech.

Do you mean the actual packaging of silicon dies and putting them into DIMMs? Yeah, they had to revert back, but that's because a lot of the memory silicon that's only good for DDR4 never shut down, and any silicon memory that is good for DDR5 is also getting claimed up for non-DIMM memory (e.g., memory packaged with logic chips rather than sitting on its own package in a DIMM or even soldered to the board).

Basically, previous generations' silicon fabrication tech is still going, and there are still buyers of that last generation product.

Plenty of examples of companies spending more than they earn for decades. Before OpenAI and Anthropic, though, nobody has ever needed to raise more than $100 billion from investors before turning a profit, though. The scale is immense, enough to where it affects the liquidity of the investors that have funded their rise.

The business model should be that with economies of scale they could provide compute much cheaper than average consumer can buy to run locally.

That business model assumes that the huge cloud models will always maintain a gap worth paying for, compared to the local models. I'm just not convinced that the average consumer will need cloud models for summarizing their emails or the news of the day.

And for actual costs of their data centers, there literally aren't enough humans in the world where $20/month AI spending per person will help them break even. They'll need to sell big accounts (many businesses spending billions per year) in order to break even.

[–] GamingChairModel@lemmy.world 10 points 3 days ago (4 children)

There's just no way to pay for the cost of these services, though.

When someone constructs a 100 MW data center (now considered a smaller one for new construction), that's about $2 billion in total costs to outfit the whole operation. And then once it's on, we're talking something like $10-20 million/month in electricity alone, and a few million in other costs. How many $20 subscriptions do you need to sell just to break even with your operating expenses? How many $100/month subscriptions do you need to sell to make a dent on your interest payments on the construction? Will there be a market for $1000/month subscriptions from millions of customers? If not, how's this all going to be paid for?

[–] GamingChairModel@lemmy.world 16 points 3 days ago (3 children)

Once you get into things with useful generation and large context windows, or things like video generation, suddenly you need one or more $10,000+ pieces of hardware to run it.

A Blackwell server with 72 GPUs costs about $3 million, plus requires 130 kW of power (about 3 residential homes' max rated power through a residential 200A circuit box, for about $600-$1000/day in electricity cost).

You're gonna need to sell a lot of $20/month subscriptions to get that paid for, assuming that the server is good for 5 years. If it's only good for 3 years, the economics are basically impossible.

[–] GamingChairModel@lemmy.world 2 points 5 days ago (2 children)

Because the factories are already set up to make DDR4. Retooling to make DDR5 will cost a lot of money and take a lot of downtime for which the factory isn't making anything. So the companies are extending the life cycle of the DDR4 production lines, without needing to upgrade things or retrain workers. As long as people are buying it, then there's money to be made by staying open.

It's like being the burger restaurant next to the steak restaurant when the line for the steak restaurant is 3 hours long. You'll get a lot of spillover from people who don't want to wait, and you can benefit from that without necessarily turning into a steak restaurant yourself.

 

Curious what everyone else is doing with all the files that are generated by photography as a hobby/interest/profession. What's your working setup, how do you share with others, and how are you backing things up?

view more: next ›