overview for morrowind

Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages in c/localllama@sh.itjust.works

morrowind@lemm.ee 1 point 5 hours ago

Technically it supports fewer languages than Whisper: 40 vs 99.

The main problem isn't "bother", it's training data. You need hundreds of thousands of hours of high-quality transcripts to train models like these, and that just doesn't exist for languages like Zulu.

7

Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages (arxiv.org)

submitted 21 hours ago by morrowind@lemm.ee to c/localllama@sh.itjust.works

2 comments

Sentence transformers v4 in c/localllama@sh.itjust.works

morrowind@lemm.ee 5 points 5 days ago (1 children)

I want to clarify something. Reranker is a general term that can refer to any model used for reranking. It is independent of implementation.

What you refer to:

"because reranker models look at the two pieces of content simultaneously and can be fine tuned to the domain in question. They shouldn't be used for the initial retrieval because the evaluation time is O(n²) as each combination of input"

is a specific implementation known as a CrossEncoder, which is common for reranking models but not for retrieval ones, for the reasons you described. But a reranker can also use any other architecture.
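To make the distinction concrete, here is a minimal sketch of "reranker" as a role rather than an architecture. Everything in it is a toy assumption for illustration: `bi_encoder_score`, `cross_encoder_score`, and `rerank` are hypothetical names, and the scorers are simple word-overlap stand-ins, not real models and not the sentence-transformers API. The only point is that `rerank` accepts any function that scores a (query, document) pair.

```python
from collections import Counter
from math import sqrt
from typing import Callable, List, Tuple

# Any function mapping (query, document) to a relevance score can act as a reranker.
Scorer = Callable[[str, str], float]


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def bi_encoder_score(query: str, doc: str) -> float:
    """Bi-encoder style: encode each text independently, then compare by cosine."""
    q, d = embed(query), embed(doc)
    dot = sum(q[t] * d[t] for t in q)
    norm = sqrt(sum(v * v for v in q.values())) * sqrt(sum(v * v for v in d.values()))
    return dot / norm if norm else 0.0


def cross_encoder_score(query: str, doc: str) -> float:
    """Cross-encoder style: look at both texts together and emit one score.

    Toy rule (an assumption, not a real model): fraction of query words found
    in the document, plus a bonus when the query appears verbatim as a phrase.
    """
    q_terms = query.lower().split()
    if not q_terms:
        return 0.0
    d_terms = set(doc.lower().split())
    overlap = sum(t in d_terms for t in q_terms) / len(q_terms)
    phrase_bonus = 1.0 if query.lower() in doc.lower() else 0.0
    return overlap + phrase_bonus


def rerank(query: str, candidates: List[str], scorer: Scorer,
           top_k: int = 3) -> List[Tuple[str, float]]:
    """Reorder already-retrieved candidates by whatever scorer is plugged in."""
    scored = [(doc, scorer(query, doc)) for doc in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_k]


if __name__ == "__main__":
    docs = [
        "music generation",
        "recognition of speech patterns model",
        "speech recognition model",
    ]
    for scorer in (bi_encoder_score, cross_encoder_score):
        print(scorer.__name__, rerank("speech recognition", docs, scorer))
```

Either scorer can be passed to `rerank`. In a real pipeline the pairwise scorer is usually only run on the short candidate list returned by a cheaper retrieval step, since it has to be evaluated once per (query, document) pair.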

Zoomers & Boomers are the same in c/memes@lemmy.world

morrowind@lemm.ee 2 points 6 days ago

On god

28

Sentence transformers v4 (lemm.ee)

submitted 6 days ago by morrowind@lemm.ee to c/localllama@sh.itjust.works

3 comments

Link to bluesky https://bsky.app/profile/tomaarsen.com/post/3llc2jvwah22f

Some more details https://huggingface.co/blog/train-reranker

Some updates on community changes and future goals (03-28-2025) in c/localllama@sh.itjust.works

morrowind@lemm.ee 2 points 6 days ago (1 children)

Thumbnail looks a little odd when small. You may want to go for a more digital llama aesthetic

12

NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms (electricalexis.github.io)

submitted 1 week ago by morrowind@lemm.ee to c/localllama@sh.itjust.works

0 comments

StarVector - a foundation model for generating svgs in c/localllama@sh.itjust.works

morrowind@lemm.ee 1 point 1 week ago (1 children)

Autotracers can't generate SVGs from text.

StarVector - a foundation model for generating svgs in c/localllama@sh.itjust.works

morrowind@lemm.ee 3 points 1 week ago

Claude frequently draws SVGs to illustrate things for me (I'm guessing it's in the prompt), but even though it's better at it than all the other models, it still kinda sucks. It's just a fundamentally dumb task for a purely language model, similar to the ARC-AGI benchmark. It just makes more sense for a vision model, and trying to get an LLM to do it is a waste.