Hi All

Having created a new dictionary from the 2020AA UMLS and added Genes and
Receptors to the dictionary-creator's default selections, I have a curious
problem where cTakes now assigns the most bizarre acronyms to ordinary
words used in POS contexts where it shouldn't  find <XXX>Mentions.

Here are two examples:

1.   soft (in "soft tissue...")
becomes   "SHORT STATURE, ONYCHODYSPLASIA, FACIAL DYSMORPHISM, AND
HYPOTRICHOSIS SYNDROME",

2.   bed in ("The wound bed was...")
becomes  "BORNHOLM EYE DISEASE"

I have not changed the TermConsumer type in the descriptor XML.

Are the DictionaryCreator's defaults, the equivalent to the default sno_rx
that's delivered with the app?

Attached is the vocab subsets list I used


Peter

Reply via email to