I created a cTAKES custom dictionary from the UMLS database with ICD10 codes.
Right now I am able to analyze text by, for example, a disease name such as Asthma,
and the annotation index will contain the ICD10 code for the match: code = "J45.90".
Is it possible to configure cTAKES to reverse this process, so that it looks for occurrences of ICD10 codes in the text instead?
The XML output contains the start and end offsets of each matched concept in the original corpus. I personally find it easier to convert the XML to a simple JSON format and then loop through it as needed. I have been working on an open source solution for parsing out the data and displaying the corpus with its matches in HTML: https://github.com/GoTeamEpsilon/ctakes-friendly-web-ui#demonstration - let me know if you'd like to contribute.
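To illustrate the XML-to-JSON idea, here is a minimal sketch in Python. It is not the code from the project linked above, and the sample XML is a made-up stand-in: the element names, the code attribute, and the sample values are assumptions for illustration only. Real cTAKES output has a richer schema and may name the offset attributes differently, so the attribute names are passed in as parameters.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical sample, NOT real cTAKES output: element and attribute
# names here are assumptions chosen to keep the example small.
SAMPLE_XML = """<annotations>
  <concept start="12" end="18" code="J45.90"/>
  <concept start="32" end="40" code="E11.9"/>
  <sentence id="1"/>
</annotations>"""

CORPUS = "Patient has asthma and also has diabetes."


def xml_to_matches(xml_text, start_attr="start", end_attr="end"):
    """Collect every element that carries both offset attributes.

    Returns a list of plain dicts, which serializes directly to JSON.
    Adjust start_attr / end_attr to whatever your output actually uses.
    """
    root = ET.fromstring(xml_text)
    matches = []
    for elem in root.iter():
        if start_attr in elem.attrib and end_attr in elem.attrib:
            record = dict(elem.attrib)
            # Offsets arrive as strings; convert them so they can slice text.
            record[start_attr] = int(record[start_attr])
            record[end_attr] = int(record[end_attr])
            record["tag"] = elem.tag
            matches.append(record)
    return matches


matches = xml_to_matches(SAMPLE_XML)
print(json.dumps(matches, indent=2))

# Loop through the matches and recover the matched text from the corpus.
for m in matches:
    print(m.get("code"), CORPUS[m["start"]:m["end"]])
```

Once the matches are plain dictionaries, the offsets can be used to slice the original text, which is also the starting point for highlighting matches in an HTML view.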