Ask HN: What API or software are people using for transcription?

https://news.ycombinator.com/rss Hits: 1

Summary

I use whisperfile[1] directly. The whisper-large-v3 model seems good with non-English transcription, which is my main use-case.I am also eyeing whisperX[2], because I want to play some more with speaker diarization.Your use-case seems to be batch transcription, so I'd suggest you go ahead and just use whisperfile, it should work well on an M4 mini, and it also has an HTTP API if you just start it without arguments.If you want more interactivity, I have been using Vibe[3] as an open-source replacement of SuperWhisper[4], but VoiceInk from a sibling comment seems better.Aside: It seems that so many of the mentioned projects use whisper at the core, that it would be interesting to explicitly mark the projects that don't use whisper, so we can have a real fundamental comparison.[1] https://huggingface.co/Mozilla/whisperfile[2] https://github.com/m-bain/whisperX[3] https://github.com/thewh1teagle/vibe/[4] https://superwhisper.com/

First seen: 2025-06-09 17:19

Last seen: 2025-06-09 17:19

Read Full Article More from this Source

Ask HN: What API or software are people using for transcription?

Summary

Related News

Radio pulses detected coming from ice in Antarctica

Frequent reauth doesn't make you more secure

Kyber (YC W23) Is Hiring a Technical Account Manager

The Claude Bliss Attractor

Simulink (Matlab) Copilot