For mms-tts-eng model I am getting ushort format error

Question

For mms-tts-eng model I am getting ushort format error

252 views Asked by popstick At 10 October 2023 at 20:28

from transformers import VitsModel, AutoTokenizer
import torch

model = VitsModel.from_pretrained("facebook/mms-tts-eng")
tokenizer = AutoTokenizer.from_pretrained("facebook/mms-tts-eng")

text = "some example text in the English language"
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
  output = model(**inputs).waveform

import scipy
scipy.io.wavfile.write("techno.wav", rate=model.config.sampling_rate, 
data=output.cpu().float().numpy())

I am getting this:

error: ushort format requires 0 <= number <= (0x7fff * 2 + 1)

Original Q&A

There are 1 answers

**Salad** · Answer 1 · 2023-10-15T15:40:46+00:00

This answer here solved this issue for me. So simply transposing your waveform output should fix this.

So instead of doing this:

import scipy
scipy.io.wavfile.write("techno.wav", rate=model.config.sampling_rate, 
data=output.cpu().float().numpy())

I did:

import scipy
scipy.io.wavfile.write("techno.wav", rate=model.config.sampling_rate, 
data=output.cpu().float().numpy().T)

TechQA.

For mms-tts-eng model I am getting ushort format error

There are 1 answers

Related Questions in MACHINE-LEARNING

Related Questions in PYTORCH

Related Questions in HUGGINGFACE-TRANSFORMERS

Related Questions in HUGGINGFACE

Popular Questions

Trending Questions