Speaking aloud to a phone is not always socially appropriate.
What STT model, runnable on a midrange phone, is good at recognizing whispered speech?
Could an existing STT model be fine-tuned to recognize whispered speech better?
Thank you.



