[Cin] Haiku OS (offtop? )

Stefan de Konink stefan at konink.de
Fri Apr 30 16:34:42 CEST 2021


On Friday, April 30, 2021 3:34:00 PM CEST, W P wrote:
> Following Stefan's offtopic: do you know what kdenlive uses for 
> speech recognition? Have you tried it? From what I know speech 
> recognition is very tricky to perform accurately.

After I found the post I researched it. It is Kaldi. 
<https://kaldi-asr.org/>


> I know vosk (https://github.com/alphacep/vosk-api) is quite 
> good but I have no idea if it can be used in Cinelerra (and the 
> training models can be quite heavy so probably better if user 
> downloads them separately but then I am not sure how user 
> friendly it is..)

I think having an actually (even partial correct) annotation what is 
happening on the timeline could give a great non-visual hint on what to 
edit. This kind of stuff should be in audacity too. From my phonetic 
background I know that presenting it is tricky too, because you want to 
have 'grouped tracks'. praat.org has some visuals how they have implemented 
this for academic research.

-- 
Stefan


More information about the Cin mailing list