A high-quality speech-to-text model based on OpenAI’s Whisper, providing accurate multilingual transcription and translation. Optimized for robust real-world audio.
12.4K Pulls 1 Tag Updated 6 months ago