TechQA.

dbpedia NLP dataset used for Named entity extraction

1.2k views Asked by Tilney At 02 December 2014 at 11:34

I went through their github files as well as the official site, I can't find the named entity tagging training corpus they used in splotlight.

How Can I found the dataset instead of a trained model?

There are 1 answers

Gunjan

Gunjan On 12 December 2014 at 07:01

see This link https://github.com/dbpedia-spotlight/dbpedia-spotlight/wiki/Web-service

In here, method for setting up dbpedia lookup offline is explained. Also they have given 4 tar files which are

redirects_en.nt
short_abstracts_en.nt
instance_types_en.nt
article_categories_en.nt

these are supposed to be training data for it.