A tool for transcribing and translating a batch of audio files using the FasterWhisper library.
Voice messages are great for capturing thoughts, but terrible for searching them later. I built this pipeline to automatically process my recordings, transcribe them using Whisper, and translate/summarize them using a local LLM. It turns audio chaos into structured XML data.
Live execution on local machine: