this post was submitted on 22 Feb 2026
10 points (100.0% liked)

Technology

1378 readers
45 users here now

A tech news sub for communists

founded 3 years ago
MODERATORS
top 2 comments
sorted by: hot top controversial new old
[โ€“] SouffleHuman@lemmy.ml 2 points 1 day ago (1 children)

Llama 3.1 8B Seems like a pretty weird choice to me, given that it's already pretty outdated at this point. I know the Qwen team will be launching a new 9B model soon, so maybe they'll switch to that soonish.

[โ€“] yogthos@lemmygrad.ml 4 points 1 day ago

I'm guessing they implemented it as proof of concept because it's a well known model, and has simple architecture. I'm really looking forward to full blown 600+ bln param chips. That's where shit gets real. And just imagine this stuff applied to robotics. Those Unitree robots with a DeepSeek chip would basically be Star Wars droids.