[ 
https://issues.apache.org/jira/browse/CTAKES-341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15029027#comment-15029027
 ] 

britt fitch commented on CTAKES-341:
------------------------------------

output before update: 
{code}
26 Nov 2015 10:52:38  INFO ContextDependentTokenizerAnnotator - process(JCas)
4.5 3.5
4.5
4.7
{code}

output after update:
{code}
26 Nov 2015 10:53:42  INFO ContextDependentTokenizerAnnotator - process(JCas)
4.5
3.5
4.7
{code}

Checked in: 
* FractionFSM - update as mentioned above
* AggregateAE - update for test case
* TestContextDependentTokenizerAnnotator - new test case

> FractionFSM annotates incorrect span
> ------------------------------------
>
>                 Key: CTAKES-341
>                 URL: https://issues.apache.org/jira/browse/CTAKES-341
>             Project: cTAKES
>          Issue Type: Bug
>          Components: ctakes-context-tokenizer
>    Affects Versions: 3.2.0
>            Reporter: britt fitch
>            Assignee: britt fitch
>             Fix For: 3.2.3
>
>
> It appears that when a decimal is followed by a range that the FractionFSM 
> incorrectly annotates the FractionToken
> given:
> {code}
> FOO 4.5 3.5-4.7
> {code}
> produces the following FractionTokens:
> * "4.5"
> * "4.5 3.5"
> * "4.7"
> after fsm.reset we need to also add the following in order to move the start 
> position and allow sequential END states to be handled correctly: 
> {code} tokenStartMap.put(fsm, tokenStartIndex); {code}
> i will create a test case and verify this solution before committing it back 
> to trunk.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to