Free and Open Source Software

18167 readers

1 users here now

If it's free and open source and it's also software, it can be discussed here. Subcommunity of Technology.

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago

MODERATORS

Gaywallet@beehaw.org

alyaza@beehaw.org

FOSS voice to text (feddit.de)

submitted 1 year ago by Iapar@feddit.de to c/foss@beehaw.org

6 comments fedilink hide all child comments

Does anybody know about a good voice-to-text tool? I improvise song lyrics a lot and often thought about how useful it would be to have it written down so I can salvage the good parts.

you are viewing a single comment's thread
view the rest of the comments

[–] TheHobbyist@lemmy.zip 2 points 1 year ago (2 children)

AMD has ROCm which is available on And Radeon Instinct GPUs (server GPUs) and some consumer GPUs. You'd need to double check whether your GPU supports ROCm.

It seems there is some discussion happening here on the use of ROCm with Whisper: https://github.com/openai/whisper/discussions/105 And here (showing it might be possible?): https://github.com/openai/whisper/discussions/55

[–] TheHobbyist@lemmy.zip 2 points 1 year ago

I also found this which could be of interest:

MLC-LLM, which "Enable everyone to develop, optimize and deploy AI models natively on everyone's devices."

Here used to deploy Llama-2-13B on the RX 7900 XTX:

https://blog.mlc.ai/2023/08/09/Making-AMD-GPUs-competitive-for-LLM-inference?ref=upstract.com

[–] doodimus@beehaw.org 2 points 1 year ago

Thanks for that, I've been able to get Stable Diffusion running locally with ROCm so it looks like it should be possible then.