morrowind

joined 1 month ago
[–] morrowind@lemm.ee 5 points 5 days ago (1 children)

I want to clarify something. Reranker is a general term that can refer to any model used for reranking. It is independent of implementation.

What you refer to

because reranker models look at the two pieces of content simultaneously and can be fine tuned to the domain in question. They shouldn't be used for the initial retrieval because the evaluation time is O(n²) as each combination of input

Is a specific implementation known as CrossEncoder that is common for reranking models but not retrieval ones for the reasons you described. But you can also use any other architecture

[–] morrowind@lemm.ee 2 points 5 days ago (1 children)

Thumbnail looks a little odd when small. You may want to go for a more digital llama aesthetic

[–] morrowind@lemm.ee 1 points 1 week ago (1 children)

autotracers can't generate svgs from text

[–] morrowind@lemm.ee 3 points 1 week ago

Claude frequently draws svgs to illustrate things for me (I'm guessing it's in the prompt) but even though it's better at it than all the other models, it still kinda sucks. It's just fudamentally dumb task to do for a purely language model, similar to the arc-agi benchmark , just makes more sense for a vision model and trying to get an llm to do is a waste

[–] morrowind@lemm.ee 1 points 2 weeks ago (1 children)

what is the license? The link on hf just 404s

[–] morrowind@lemm.ee 2 points 2 weeks ago

Very similar to chain of draft but seems more thorough

 
[–] morrowind@lemm.ee 3 points 4 weeks ago

It matches R1 in the given benchmarks. R1 has 671B params (36 activated) while this only has 32

[–] morrowind@lemm.ee 2 points 4 weeks ago (2 children)

insane, absolutely insane

[–] morrowind@lemm.ee 6 points 1 month ago (2 children)

good luck trying to run a video model locally

Unless you have top tier hardware

view more: next ›