Can I add a layer of meta data in a text classification model?

Question

Can I add a layer of meta data in a text classification model?

859 views Asked by Dror M At 05 October 2020 at 13:00

I am trying to create a multiclass classifier to identify topics of Facebook posts from a group of parliament members.

I'm using SimpleTransformers to put together an XML-RoBERTa-based classification model. Is there any way to add an embedding layer with metadata to improve the classifier? (For example, adding the political party to each Facebook post, together with the text itself.)

Original Q&A

There are 1 answers

**Jindřich** · Answer 1 · 2020-10-06T07:57:53+00:00

If you have a lot of training data, I would suggest adding the meta data to the input string (probably separated with [SEP] as another sentence) and just train the classification. The model is certainly strong enough to learn how the metadata interract with the input sentence, given you have enough training examples (my guess is tens of thousands might be enough).

If you do not have enough data, I would suggest running the XLM-RoBERTa only to get the features, independently embed your metadata, concatenate the features, and classify using a multi-layer perceptron. This is proably not doable SimpleTransformers, but it should be quite easy with Huggingface's Transformers if you write the classification code directly in PyTorch.

TechQA.

Can I add a layer of meta data in a text classification model?

There are 1 answers

Related Questions in PYTHON

Related Questions in DEEP-LEARNING

Related Questions in NLP

Related Questions in TEXT-CLASSIFICATION

Related Questions in BERT-LANGUAGE-MODEL

Popular Questions

Popular Tags

Trending Questions