Hi there,

I am using Ctakes to process 5000K free text  records  where each record has 
several medications.
This is the fixed flow that it goes through:

                                                               
<node>SimpleSegmentAnnotator</node>
                                                                
<node>SentenceDetectorAnnotator</node>
                                                                
<node>TokenizerAnnotator</node>
                                                                
<node>LvgAnnotator</node>
                                                                
<node>ContextDependentTokenizerAnnotator</node>
                                                                
<node>POSTagger</node>
                                                                
<node>Chunker</node>
                                                                
<node>LookupWindowAnnotator</node>
                                                                
<node>DictionaryLookupAnnotatorDB</node>
                                                                
<node>DependencyParser</node>
                                                                
<node>AssertionAnnotator</node>
                                                                
<node>ExtractionPrepAnnotator</node>

But it takes very very long time to process that many data( maybe a week or so) 
when I use SimpleSegmentAnnotator.  By eliminating SimpleSegmentAnnotator the 
process is very fast but no medication is being anotated.  Do you guys have any 
suggestion?

Thanks,
Nick

Reply via email to