Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Just tried a few languages and there seems to be a problem with German speech to text. When I say "Hallo" it just thinks I said "Untertitel der Amara.org-Community" :-)

English and Spanish seemed to be functioning OK though, but I really wanted to test my German.



The reason for this is quite interesting and is due to the data that Whisper (the transcription service) was trained on: https://github.com/openai/whisper/discussions/928


Thanks for the comment. Sometimes if you toggle the microphone quickly without saying anything, it will halucinate something. If you could, is there any chance you could try again in German and let me know if it's still not working?


Curious about this, why do you think it hallucinates? Isn't that supposed to be the direct output from speech recognition?

PS: Just happened with Spanish now as well, which was working fine. (You: Subtítulos realizados por la comunidad de Amara.org)


No, still not working. Also tried a bunch of other languages and they all seemed to work except Turkish, which also spits out something related to subtitles...


Ah darn - okay, thanks for the feedback. I'll find a way to stop it from doing this.


It would be amazing if the microphone detected say 15 second silence and responded automatically for a fully hands free experience


Yeah - this is a good shout




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: