Importing previously annotated document to IBM Knowledge Studio

156 views Asked by At

I am doing some research on building custom models for Entity extraction. For this i have used some of the public dataset and wanted to see how it performs in IBM Knowledge Studio. But i am finding it difficult to find a way to load the public dataset (which is already annotated) to the Knowledge Studio.

There document says, previously annotated documents can be imported, but it doesn't specify about the format https://console.bluemix.net/docs/services/watson-knowledge-studio/create-project.html#create-project

Document also says, it can be from a UIMA analysis engine, i don't find any good examples which show the format of the file

Can anyone help on this?

2

There are 2 answers

0
i0707 On BEST ANSWER

Hope this helps other.

There is no clear documentation on the input format when you are trying to import existing annotated data in to Knowledge Studio.

We did a work around, by manually annotating few files from Knowledge Studio and exported this data. We did analysis on this exported data and wrote custom programs which will transform the existing annotated text to the format that Knowledge Studio accepts.

Then imported this data back to Knowledge Studio

0
Masashi Miyazaki On

Watson Knowledge Studio can handle XMI files that are exported from Watson Explorer Content Analytics, Content Analytics Studio or Apache UIMA. You can find some information in below document.

https://console.bluemix.net/docs/services/watson-knowledge-studio/preannotation.html#preannotation