Hi all,

Having created a new dictionary from the 2020AA UMLS and added Genes and Receptors to the dictionary creator's default selections, I have a curious problem: cTAKES now assigns the most bizarre acronyms to ordinary words, in POS contexts where it shouldn't find <XXX>Mentions.
Here are two examples:

1. "soft" (in "soft tissue...") becomes "SHORT STATURE, ONYCHODYSPLASIA, FACIAL DYSMORPHISM, AND HYPOTRICHOSIS SYNDROME"
2. "bed" (in "The wound bed was...") becomes "BORNHOLM EYE DISEASE"

I have not changed the TermConsumer type in the descriptor XML.

Are the DictionaryCreator's defaults equivalent to the default sno_rx that's delivered with the app?

Attached is the vocab subsets list I used.

Peter