NLP - Specify custom vocabulary / word list for text generation

Question

NLP - Specify custom vocabulary / word list for text generation

386 views Asked by philipkd At 05 June 2020 at 00:31

I'm experimenting with text generators, like OpenAI's GPT-2, Hugging Face's transformers, and Facebook's ParlAI, and I'm wondering if I can limit or weight the output to a specified list of words? For example, how can I limit the output to only words that start with the letter 'a'?

One obvious idea is to train on a dataset that is limited by that vocabulary, but I only have a laundry list of words, not a natural corpus that only has those words.

Original Q&A

There are 1 answers

**Minions** · Answer 1 · 2022-09-30T21:55:17+00:00

Minions On 30 September 2022 at 21:55

yes, for instance if you're using huggingface, have a look at force_words_ids (Generation). In this way, the model will generate using only the list of token ids that you've created.

TechQA.

NLP - Specify custom vocabulary / word list for text generation

There are 1 answers

Related Questions in NLP

Related Questions in HUGGINGFACE-TRANSFORMERS

Related Questions in PARLAI

Popular Questions

Popular Tags

Trending Questions