RE: The fast dictionary pipeline vs. the regular one

2015-06-29 Thread Finan, Sean
poses. 2 characters is the minimum – you cannot lookup 1 character terms with the default dictionary. You can do so with a custom dictionary if you like – which might be useful if you just have 1 or 2 single-character terms. Sean From: britt fitch [mailto:britt.fi...@wiredinformatics.com]

RE: The fast dictionary pipeline vs. the regular one

2015-06-29 Thread Oranit Dror
stom dictionary if you like – which might be useful if you just have 1 or 2 single-character terms. Sean From: britt fitch [mailto:britt.fi...@wiredinformatics.com] Sent: Monday, June 22, 2015 9:24 AM To: dev@ctakes.apache.org Subject: Re: The fast dictionary pipeline vs. the regular one Regarding th

RE: The fast dictionary pipeline vs. the regular one

2015-06-22 Thread Finan, Sean
-character terms. Sean From: britt fitch [mailto:britt.fi...@wiredinformatics.com] Sent: Monday, June 22, 2015 9:24 AM To: dev@ctakes.apache.org Subject: Re: The fast dictionary pipeline vs. the regular one Regarding the miss on “cm” in #2, you might want to check out the dictionary xml descriptor or

Re: The fast dictionary pipeline vs. the regular one

2015-06-22 Thread britt fitch
Regarding the miss on “cm” in #2, you might want to check out the dictionary xml descriptor or uimafit wiring, depending on which you are using, for the parameter “minimumSpan”. If I recall correctly the default minimum span is 3 characters, however you can reduce it to 2 if desired. Cheers, B

RE: The fast dictionary pipeline vs. the regular one

2015-06-21 Thread Miller, Timothy
Sean wrote the fast version and may be able to answer your specific questions. But in general, the fast dictionary does not match performance exactly -- it is not implementing an equivalent search and it has different indexing methods. We are happy to receive reports of what seem like bugs, thou