Microsoft Speech Platform - sampling rate and bit depth

Question

Microsoft Speech Platform - sampling rate and bit depth

724 views Asked by Icarus At 09 August 2013 at 08:07

Recognition results are best if sampling rate and bit depth of the audio match the training data of the system.

So, does anyone know the exact sampling rate and/or bit depth (and/or stereo/mono) that is used in Microsoft Speech Platform (newest, if that's important)? And if so, do you remember where you got this information?

Please note that I am using the MS Speech Platform, not the SAPI. Unless both are using the same training data, that's not the same AFAIK. To be precise - I use this: http://msdn.microsoft.com/en-us/library/microsoft.speech.recognition.speechrecognitionengine.setinputtowavefile%28v=office.14%29.aspx

My first try is based upon the C++ code example given on the page.

Original Q&A

There are 2 answers

M57 On 03 January 2018 at 11:13

I couldn't find any information regarding sample rate, but it seems the bit depth is actually 8-bit (maybe this has changed since Eric Brown's answer).

Quoted from this page listing supported audio formats:

The Speech Platform downsamples audio that is of greater than 8-bit resolution.

You should be fine providing any bit-depth which is a multiple of 8-bits (which is always the case anyway), since there will be no precision loss due to rounding (and there is no aliasing for resolution, unlike sample rate).

**Eric Brown** · Accepted Answer · 2013-08-10T16:35:29+00:00

Eric Brown On 10 August 2013 at 16:35 BEST ANSWER

The Microsoft.Speech SR engine doesn't need training (unlike the System.Speech SR engine), and is relatively insensitive to sampling rate (will work with anything > 8 KHz sampling rate). 16 bit audio is preferred, but I believe that it will work with 8 bit audio.

TechQA.

Microsoft Speech Platform - sampling rate and bit depth

There are 2 answers

Related Questions in SPEECH-RECOGNITION

Related Questions in SAMPLING

Related Questions in WAVE

Related Questions in MICROSOFT-SPEECH-PLATFORM

Popular Questions

Popular Tags

Trending Questions