Can I extract who is speaking in a gmeet/teams using websocket?

144 views Asked by At

I am working on a bot to record gmeet, teams meeting and zoom. I am currently doing diarization using deep learning open-source libraries, but the results are far from good. I wanted to know if I could extract from the websocket information like when someone is talking, so I could make my diarization without using deep learning. Do you know if I could extract that information? With what kind of tool? My bots are made using puppeteer. In the best case scenario, I could extract audio by user, in an ok one I can extract it by timeframe. Thanks for your help!

0

There are 0 answers