

biomcgary on Sept 21, 2022 | on: Whisper – open source speech recognition by OpenAI

Since you work on podcasts, do any open source transcription tools currently identify the speaker in the output? This would be particularly helpful for interviews.

nico on Sept 22, 2022

Not sure about open source, but in general, automated transcription systems need a separate track for each speaker. For a phone call with one person on each end, for example, you need two separate channels (recording systems usually put them on the left and right channels of a single stereo file).
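As a rough illustration of the left/right split described above, here is a minimal sketch that separates a stereo WAV file into two mono files, one per speaker, using only Python's standard `wave` module. It assumes uncompressed PCM audio with exactly two channels; the function names `split_stereo` and `split_stereo_wav` and the file paths are made up for this example and do not come from any particular transcription tool.

```python
import wave


def split_stereo(frames, sample_width):
    """Deinterleave stereo PCM bytes into (left, right) mono byte strings.

    Stereo PCM stores samples as L, R, L, R, ... where each sample
    occupies `sample_width` bytes.
    """
    frame_size = 2 * sample_width  # one left sample plus one right sample
    left = bytearray()
    right = bytearray()
    # Walk whole frames only; a trailing partial frame is ignored.
    for i in range(0, len(frames) - frame_size + 1, frame_size):
        left += frames[i:i + sample_width]
        right += frames[i + sample_width:i + frame_size]
    return bytes(left), bytes(right)


def split_stereo_wav(src_path, left_path, right_path):
    """Write the left and right channels of a stereo WAV to two mono WAVs."""
    with wave.open(src_path, "rb") as src:
        if src.getnchannels() != 2:
            raise ValueError("expected a stereo (2-channel) WAV file")
        sample_width = src.getsampwidth()
        frame_rate = src.getframerate()
        frames = src.readframes(src.getnframes())

    left, right = split_stereo(frames, sample_width)

    for path, data in ((left_path, left), (right_path, right)):
        with wave.open(path, "wb") as dst:
            dst.setnchannels(1)
            dst.setsampwidth(sample_width)
            dst.setframerate(frame_rate)
            dst.writeframes(data)
```

A call such as `split_stereo_wav("call.wav", "caller.wav", "callee.wav")` (hypothetical file names) would then give one mono file per side of the call, and each could be transcribed separately and labeled by channel.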
