The West is its own enemy. Not only are our oligarchs against us, they are dimwitted don’t plan ahead for anything but earnings reports.
Technology
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
Obviously this is good for RAM prices but does the NAND part affect SSD prices as well?
I'd skip spinning platters if I could get 4 TB of SSD for vheapt
Assuming we'd let China sell chips in the US. Gotta block competition and keep prices high.
I sincerely hope this fucks up the market. I hope all the other manufacturers get fucked right up their greedy fucking asses.
With an unlubricated horse cock
Say, are you an engineer at Boeing in Washington?
Remember he was from Germany, even if he ultimately fell in Washington.
Damn you Mr Xi
Except maybe in America if MAGA dreams up some tariffs because Freedom (Chinese EVs have entered the chat).
MAGA would rationalize huge price increases in the name of "america first!"
Nah, there won't be any tariffs. Nvidia wouldn't like them
But think about Samsung, SK Hynix and Micron shareholders
Whew, glad I dumped my Micron shares a little While ago.
Oh dear...
I do think of them.
The Chinese AI labs are really trying to pop the bubble, too.
How?
Well lemme ask you this. What if models 80-90% as good as Claude, with weights just thrown out there for any provider (or homelab) to host, flood the market? What if they're so dirt cheap to run, they're almost free, and don't even need Nvidia GPUs? What they need fewer resources to run with each update, instead of more?
...What if this already happened, and Big Tech is maddly lobbying to ban/censor them before people realize it, and that the "infinite scaling" thing is a big fat lie?
That's the state of things.
It turns out that off-shoring your economy to a political rival is a really dumb thing for a capitalist to do.
But, but, this quarter profits.
I wish, I wish we would bring out the guillatine for these greedy treasonous capitalist fucks.
We've lost so much because of them
Yep, the Chinese models are already up 10 times cheaper and now that Anthropic, OpenAI, and Google, all are increasing prices up to 10 more for models like Opus, it will make Chinese models anywhere from 50 to 100 times cheaper.
American corps. are betting that since people have their workflow already established they won't switch to other providers, but that's not the case. There's already a mass move to Chinese models.
People keep talking about Chinese models, where are they? How do I used them instead of Claude? Are they safe?
Chinese models are really good. How you use them depends on what your goals are. If you want something on par with Claude or ChatGPT, you need to go to Deepseek or Qwen's website. You can also find these models on openrouter. If you want a local/small model, then get ollama and find a model in the repository. You could also get anythingllm or LM Studio and get models from within the app. There's also huggingface.
Since you asked about safety, deepseek on the official website does collect info and there was a time some of that info was leaked but you can get around that using something like openrouter. Note similar things happened to ChatGPT and Meta AI. There is also the potential for bias (there was a time people were spamming their Deepseek Tiananmen Square responses -- personally, it just would not process my query) but Grok has that same issue.
Look into zen.ai which is opencode's sister company that provides llm access. "At cost"
You can see just how cheap they are. I use Augment Code at work and they have kimi 2.6. It's really solid. Opus/GPT are still better, but for many tasks, kimi works great and doesn't make me cringe at the price.
Qwen 3.6 is supposed to be really good too. I haven't used it that much.
ollama or llama.cpp to self host if you have a good mac or good video card. this is perfectly safe.
there are a bazillion hosted inference providers to choose from https://huggingface.co/docs/inference-providers/en/index be aware that you are sending your code to fuck knows who and they are sending back fuck knows what. ymmv, yolo.
hook one of them up to opencode.ai or pi.dev or one of the bazillion other 'harneses' or whatever we are calling it this week and try not to rm -r anything important.
for a good time try and get a chinese models to say something about tibet, or taiwan... its like having your own virtual tankie tamagochi!
be aware that you are sending your code to fuck knows who and they are sending back fuck knows what
So literally the same as Western-made AI?
Yeah only the Chinese government is currently far better at working behind the scenes with companies than any other government in the world?
Incompetence is a feature of governments at times.
inference providers could be anyone from anywhere, there are even proxy resellers. some are harvesting and reselling your data.
if you send your code to claude/openai/google there is certainly a much higher degree of confidence in who you are sending your data to. yes they to harvest your data and can send you malicious commands (esp if you have a promp injection attack).
its like buying a cheap vps, if the stakes are low its fine, if it important then you need to consider about the consequences of your actions.
nb: i am no expert, just fucking around.
The most famous is Deepseek. It's not even made by "AI" company, it was a side hustle from stock trading company. They released it for free just to flex.
In a way it has actually.
Deepseek was big because not only did they publish the full model for everyone to use, but the MoE structure significantly brought down the hardware requirements in terms of processing power. As long as you have enough VRAM, you can run it on older hardware with no need for the latest Nvidia stuff.
Now they got v4 which many have found to be within a 10% margin of Claude and ChatGPT.
On top of that, China has cheapo VRAM GPUs available or soon to be released, like the MTT S80. Yeah it sucks as a Graphics card because the chip is behind, but you get 16Gb of GDDR6 for much cheaper than anything else.
But its not a conspiracy to fight China. The infinite scaling was just Nvidia solidifying themselves as the monopoly because they want all AI infrastructure to be dependent on them, which is why they still illegally export to China, despite an export ban attempting to reduce their potential competition.
Moore Threads (MTT) already has their own CUDA like system called MUSA, and I'm sure they'll be happy to put in proper hardware support for new stuff like Bf16 and FP8/4. It'll take a few years, but eventually China will catch up to the point where Nvidia gets shanked by cheaper hardware.