Is it possible to set the Windows.Media.SpeechSynthesis stream format, as in SAPI 5.3?


I'm using Windows.Media.SpeechSynthesis (C++/WinRT) to convert text to an audio file. Previously I used SAPI, where it was possible to set the audio format when binding to a file via SPBindToFile(...) before speaking.

Is there a similar method in Windows.Media.SpeechSynthesis? It seems that it is only possible to get a 16 kHz, 16-bit, mono wave stream. Is that correct?

Does SpeechSynthesisStream already contain a real audio stream after speech synthesis, or does it hold some precalculated raw data, and does actual encoding happen when accessing its data (playback on a device or copying to another not-speech-specific stream)?

Thank you!

I think it should be possible to control the speech synthesis stream format somehow.

There is 1 answer
Eric Brown

The WinRT synthesis engines output 16 kHz, 16-bit mono data. There isn't any resampling layer to change the format.
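
Since the engine offers no resampling layer, one possible workaround is to convert the samples yourself after synthesis. The following is only a minimal sketch in plain C++, not something the answer above proposes: `ResampleLinear` is a hypothetical helper (not part of WinRT or SAPI), and it assumes you have already copied the raw 16-bit mono samples out of the stream, without the WAV header, into a `std::vector<int16_t>`.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not a WinRT or SAPI API): resamples 16-bit mono PCM
// from srcRate to dstRate using simple linear interpolation.
// Assumption: `input` holds raw samples only, with any WAV header removed.
std::vector<int16_t> ResampleLinear(const std::vector<int16_t>& input,
                                    uint32_t srcRate, uint32_t dstRate)
{
    assert(srcRate > 0 && dstRate > 0);
    if (input.empty()) return {};

    // Number of output samples that cover the same duration as the input.
    const std::size_t outCount = static_cast<std::size_t>(
        (static_cast<uint64_t>(input.size()) * dstRate) / srcRate);
    std::vector<int16_t> output(outCount);

    for (std::size_t i = 0; i < outCount; ++i) {
        // Position of output sample i on the input timeline,
        // split into an integer index and a fractional offset.
        const uint64_t num = static_cast<uint64_t>(i) * srcRate;
        const std::size_t idx = static_cast<std::size_t>(num / dstRate);
        const double frac = static_cast<double>(num % dstRate) / dstRate;

        // Interpolate between the two neighbouring input samples;
        // the last sample is repeated at the end of the buffer.
        const int32_t a = input[idx];
        const int32_t b = input[idx + 1 < input.size() ? idx + 1 : idx];
        output[i] = static_cast<int16_t>(std::lround(a + (b - a) * frac));
    }
    return output;
}
```

For example, `ResampleLinear(samples, 16000, 44100)` would produce a 44.1 kHz buffer of the same duration. Upsampling this way does not add any audio detail beyond what the 16 kHz source contains, and downsampling without a low-pass filter can introduce aliasing, so a proper resampling library is probably a better fit when quality matters.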