this post was submitted on 23 Jun 2026
60 points (92.9% liked)

Selfhosted

top 4 comments
[–] zackhow@programming.dev 13 points 4 days ago (1 children)

I’m sure this works, but from my recent experience you need your STT on a machine more powerful than a Pi at the moment. Tiny models are just not accurate enough, and the bigger ones need more than the Pi has to give any sort of decent response time. Compared to where this was two years ago, I look forward to where it will be in two more.

One of the largest improvements imo has been microWakeWord and the ability to run the hotword detection “on device”, but I believe it only runs on ESP32 devices, so it’s not an option if you want everything on a Pi.

I spent a little bit of time getting a fully local voice pipeline set up in Home Assistant last month, and I’d say it is near perfect (after adding a few additional community integrations), with the exception of the microphones on current hardware. I look forward to the next HA voice device from Nabu Casa.

[–] fabio@lemmy.manganiello.tech 6 points 4 days ago (1 children)

OpenWakeWord (the solution I also use in my Platypush setup) is actually very good at hotword detection, especially when paired with libspeex for noise suppression.

The STT models from Vosk are quite good too. The 180 MB English model is light enough to run on a RPi, and it's quite decent with clean accents. They also have a 1.8 GB model that is very accurate and can still run fine on a RPi 5 with 8 GB of RAM.
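
For anyone who wants to try a Vosk model on their own hardware, here is a minimal sketch of offline transcription of a WAV file with the `vosk` Python package. The function names `result_text` and `transcribe_wav` and the `model_dir` argument are placeholders for illustration; the sketch assumes a model has already been downloaded and unpacked locally and that the audio is 16-bit mono PCM. It is not taken from the linked article or from any Platypush configuration.

```python
import json
import wave


def result_text(result_json: str) -> str:
    """Pull the recognised text out of a Vosk JSON result string."""
    return json.loads(result_json).get("text", "").strip()


def transcribe_wav(wav_path: str, model_dir: str) -> str:
    """Transcribe a 16-bit mono PCM WAV file with a local Vosk model.

    model_dir is a hypothetical path to an unpacked Vosk model directory.
    """
    # Imported lazily so result_text() stays usable without Vosk installed.
    from vosk import Model, KaldiRecognizer

    model = Model(model_dir)
    pieces = []
    with wave.open(wav_path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM audio")
        rec = KaldiRecognizer(model, wav.getframerate())
        while True:
            data = wav.readframes(4000)
            if not data:
                break
            # AcceptWaveform returns True once an utterance is complete.
            if rec.AcceptWaveform(data):
                pieces.append(result_text(rec.Result()))
        # Flush whatever is still buffered at the end of the file.
        pieces.append(result_text(rec.FinalResult()))
    return " ".join(p for p in pieces if p)
```

Timing a call to `transcribe_wav` with the smaller and the larger model on the same recording is a quick way to see the accuracy versus response-time trade-off on a given board.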

[–] zackhow@programming.dev 4 points 4 days ago

Don’t misunderstand. OpenWakeWord works great. I just think it’s awesome that the hotword detection can run so well directly on low-powered devices now.

About STT, I’ll agree that the models can run well; I just found the experience a lot better with a heavier model running on a beefier machine. The small ones do well in a silent space but struggle when you add background noise.

[–] realitaetsverlust@piefed.zip 2 points 4 days ago

I'd be very surprised if this was actually as good as written in that article.