In the private preview, Azure Speech diarization set the speaker tag to "Unknown" until it had recognised a long statement (about 7 seconds) from a speaker. Now that the API is in public preview, it tags speakers as Guest-N straight away, which raises an accuracy concern: even after Guest-1 has been detected, their short sentences get tagged as Guest-2 until Guest-2 speaks a long sentence, and the same happens for later speakers.
Is there a solution to get the private preview behaviour back?
According to the documentation, shorter sentences should still be marked as Unknown.
SDK version used (Gradle): implementation group: 'com.microsoft.cognitiveservices.speech', name: 'client-sdk', version: '1.34.0'
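To make the behaviour I am after concrete, here is a minimal client-side sketch of it. It is not an SDK setting: `ShortUtteranceRelabeler` and its 7000 ms threshold are hypothetical names and values of my own. It assumes each transcribed result gives a speaker ID string and a duration in 100-nanosecond ticks, which is how I understand the SDK reports duration. A speaker tag is reported as "Unknown" until that speaker has produced at least one utterance of the minimum length.

```java
import java.util.HashSet;
import java.util.Set;

/** Hypothetical helper (not part of the Speech SDK) that hides unconfirmed speaker tags. */
class ShortUtteranceRelabeler {
    // Assumption: durations arrive in 100 ns ticks, so 10,000 ticks = 1 ms.
    private static final long TICKS_PER_MS = 10_000L;

    private final long minDurationMs;
    // Speakers that have already produced at least one sufficiently long utterance.
    private final Set<String> confirmedSpeakers = new HashSet<>();

    ShortUtteranceRelabeler(long minDurationMs) {
        this.minDurationMs = minDurationMs;
    }

    /** Returns the speaker ID to display for one utterance. */
    String relabel(String speakerId, long durationTicks) {
        long durationMs = durationTicks / TICKS_PER_MS;
        if (durationMs >= minDurationMs) {
            // A long utterance confirms this speaker from now on.
            confirmedSpeakers.add(speakerId);
            return speakerId;
        }
        // Short utterance: trust the tag only if the speaker was confirmed earlier.
        return confirmedSpeakers.contains(speakerId) ? speakerId : "Unknown";
    }
}
```

In use, one instance would be created per conversation, for example `new ShortUtteranceRelabeler(7_000)`, and `relabel` would be called with the speaker ID and duration of each final result. This only hides tags the service has not yet backed with a long utterance; it does not correct a short sentence that is attributed to the wrong, already confirmed speaker.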
Diarization is described as the process of segmenting audio containing multiple speakers into discrete speech segments based on the identity of the speaker during each segment.
Note: Real-time diarization is currently in public preview.
Output: