Microsoft Translator Speech missing punctuation

Question

Microsoft Translator Speech missing punctuation

384 views Asked by shelll At 20 September 2018 at 08:03

I am using MS Translator Speech WebSocket API for real-time speech recognition and translation. The problem is that sometimes the recognised text does not have punctuation (commas, full stops, etc.). The transcribed text looks good otherwise. I also receive an MP3 with synthesised translation.

It looks completely random, I can send the same audio multiple times and some responses have punctuation and some do not. I am sending the audio in correct format and in near real-time rate e.g. I send 100ms samples every ~100ms. The recognised language is Spanish.

Is this a common issue or is there some other catch?

Original Q&A

There are 2 answers

Chris Wendt On 21 September 2018 at 21:34

There are different response types for partial recognitions and the final recognition. You receive partial recognitions as the speech continues to come in, and one final one at the end of the utterance. The partial results may be missing punctuation and casing, the final one will have casing and punctuation. If you want to ignore the responses without casing and punctuation, you want to filter to only see the final responses.

**shelll** · Accepted Answer · 2018-09-24T12:09:47+00:00

shelll On 24 September 2018 at 12:09 BEST ANSWER

Switching to the Speech Preview API solved the missing punctuation. For now there are SDK's only and the raw WebSocket API is not yet documented. I have managed to connect to and use the WS API, more info in another SO question.

TechQA.

Microsoft Translator Speech missing punctuation

There are 2 answers

Related Questions in AZURE

Related Questions in AZURE-COGNITIVE-SERVICES

Related Questions in MICROSOFT-TRANSLATOR

Related Questions in MICROSOFT-SPEECH-API

Popular Questions

Popular Tags

Trending Questions