What padding values should be used for huggingface tokenizers?

57 views Asked by Ryan Marr At 05 December 2023 at 01:58

I am using Mbart50 to convert Nepalese to English and am not sure what values I should use for padding. I need to figure out the padding values for both English and Nepalese. My tokenizer code:

tokenizer = MBart50TokenizerFast.from_pretrained(model_name)
tokenizer.src_lang = "ne_NP"
tokenizer.tgt_lang = "en_XX"

Original Q&A

TechQA.

What padding values should be used for huggingface tokenizers?

There are 0 answers

Related Questions in TOKENIZE

Related Questions in HUGGINGFACE

Related Questions in BART

Popular Questions

Trending Questions