chore(whisper-cpp): Convert to Purego and add VAD #6087

richiejp · 2025-08-18T18:18:28Z

Description

Converts the Whisper backend to use Purego, similar to the stablediffusion backend. Also adds some features that are not in the upstream CGO bindings.

Notes for Reviewers

We could upstream the Purego bindings, but I'm not sure what that would look like, so I'll just try it here first.
Initially I've added just a new VAD backend for testing; I'll then convert the rest.

Signed commits

Yes, I signed my commits.

TODO:

fix VAD end time (speech segments are detected, but the RT API is not submitting audio for transcription after a period of silence; possibly the time units on the segments are wrong)
convert rest of whisper backend to purego
fix transcription failed bug
use transcription's built-in VAD mode

netlify · 2025-08-18T18:18:33Z

✅ Deploy Preview for localai ready!

Name	Link
🔨 Latest commit	`f740b48`
🔍 Latest deploy log	https://app.netlify.com/projects/localai/deploys/68ad79dbf559d1000855c37e
😎 Deploy Preview	https://deploy-preview-6087--localai.netlify.app


richiejp · 2025-08-22T09:57:08Z

Ah, now I realise that the VAD model can be combined with the transcribe model, so we can just call transcribe: it runs VAD first and short-circuits if no speech is detected. This changes a lot.
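
The "VAD first, short-circuit if no speech" control flow described in this comment can be sketched in Go as follows. This is an illustration only: in the PR the check happens inside the transcribe call itself, whereas here `transcribeWithVAD`, `detectFunc`, `transcribeFunc` and the toy `energyVAD` detector are hypothetical stand-ins that make the behaviour visible from the caller's side.

```go
package main

import "fmt"

// detectFunc and transcribeFunc are hypothetical stand-ins for the VAD model
// and the transcription model; they are not APIs from this PR.
type detectFunc func(samples []float32) bool
type transcribeFunc func(samples []float32) (string, error)

// transcribeWithVAD runs the detector first and skips transcription entirely
// when no speech is found. The bool result reports whether transcription ran.
func transcribeWithVAD(samples []float32, hasSpeech detectFunc, transcribe transcribeFunc) (string, bool, error) {
	if !hasSpeech(samples) {
		return "", false, nil // short-circuit: nothing to transcribe
	}
	text, err := transcribe(samples)
	return text, true, err
}

// energyVAD is a toy detector: it reports speech when the mean absolute
// amplitude exceeds threshold. A real VAD model is far more robust.
func energyVAD(threshold float32) detectFunc {
	return func(samples []float32) bool {
		if len(samples) == 0 {
			return false
		}
		var sum float32
		for _, s := range samples {
			if s < 0 {
				s = -s
			}
			sum += s
		}
		return sum/float32(len(samples)) > threshold
	}
}

func main() {
	// fake stands in for the expensive model call.
	fake := func([]float32) (string, error) { return "hello", nil }

	silence := make([]float32, 160)
	text, ran, err := transcribeWithVAD(silence, energyVAD(0.01), fake)
	fmt.Printf("%q %v %v\n", text, ran, err)

	loud := []float32{0.5, -0.5, 0.5, -0.5}
	text, ran, err = transcribeWithVAD(loud, energyVAD(0.01), fake)
	fmt.Printf("%q %v %v\n", text, ran, err)
}
```

The practical consequence is that a caller no longer needs a separate VAD pass before deciding whether to transcribe; an empty result with no error simply means no speech was detected.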

richiejp force-pushed the chore/whisper-purego branch from 8ec66db to 4b179ea · August 19, 2025 16:24

github-actions bot added area/ai-model dependencies labels Aug 20, 2025

richiejp added 7 commits August 22, 2025 10:55

whisper-vad purego test

503005f

fix(ci): Avoid matching wrong backend with the same prefix

a734d9a

whisper-vad purego test

bf1e9a4

add vad

a6d05db

fix vad

546a436

add transcription to vad

850bf26

combine VAD with transcribe bckends

0345dfb

richiejp force-pushed the chore/whisper-purego branch from 206a71d to 0345dfb · August 22, 2025 09:55

richiejp added 2 commits August 26, 2025 10:06

fix transcription

d50d8c9

rm leading space

f740b48
