I have successfully installed the multi-language NER model from DeepPavlov(ner_ontonotes_bert_mult). I want to retrain this model with new data(in the same format as they suggest in the documentation page) that are in the Albanian language.Is this possible(to retrain the multi-language NER model from DeepPavlov with data in a different language), or the retrain works only if we have English data??

1

There are 1 answers

0
yurak On BEST ANSWER

Yes, you can fine-tune the model on any language that was used for Multilingual BERT training https://github.com/google-research/bert/blob/master/multilingual.md#list-of-languages.

It is also possible to fine-tune on languages that are not from the list above if multilingual vocabulary has a good coverage for your language.