I’ve been using Speech Note (github link) for months, but it often gets things wildly wrong.
I thought it was my mic, so I got one that’s crystal clear. I also tried a ton of different models, and apart from some being slower or faster than others, their accuracy is pretty similar.
But I still spend a lot of time editing the results, and I wonder if there’s something I should be doing differently to get better output.
On other speech-to-text platforms (like FUTO Keyboard on Android), the results are fast and very accurate. I have a hard time believing that Speech Note can’t be as good.
Can any other users share their experience?
I’ve used it for a short while to test it out. Accuracy was pretty good, and so was the punctuation. Response time was also good.
It’s using my Nvidia GPU to run the model, so that may be the difference.