poses. 2 characters is the minimum – you cannot lookup 1 character
terms with the default dictionary. You can do so with a custom dictionary if
you like – which might be useful if you just have 1 or 2 single-character terms.
Sean
From: britt fitch [mailto:britt.fi...@wiredinformatics.com]
stom dictionary if
you like – which might be useful if you just have 1 or 2 single-character terms.
Sean
From: britt fitch [mailto:britt.fi...@wiredinformatics.com]
Sent: Monday, June 22, 2015 9:24 AM
To: dev@ctakes.apache.org
Subject: Re: The fast dictionary pipeline vs. the regular one
Regarding th
-character terms.
Sean
From: britt fitch [mailto:britt.fi...@wiredinformatics.com]
Sent: Monday, June 22, 2015 9:24 AM
To: dev@ctakes.apache.org
Subject: Re: The fast dictionary pipeline vs. the regular one
Regarding the miss on “cm” in #2, you might want to check out the dictionary
xml descriptor or
Regarding the miss on “cm” in #2, you might want to check out the dictionary
xml descriptor or uimafit wiring, depending on which you are using, for the
parameter “minimumSpan”. If I recall correctly the default minimum span is 3
characters, however you can reduce it to 2 if desired.
Cheers,
B
Sean wrote the fast version and may be able to answer your specific questions.
But in general, the fast dictionary does not match performance exactly -- it is
not implementing an equivalent search and it has different indexing methods. We
are happy to receive reports of what seem like bugs, thou