Hi there, I am using cTAKES to process 5000K free-text records, where each record contains several medications. This is the fixed flow that each record goes through:
<node>SimpleSegmentAnnotator</node>
<node>SentenceDetectorAnnotator</node>
<node>TokenizerAnnotator</node>
<node>LvgAnnotator</node>
<node>ContextDependentTokenizerAnnotator</node>
<node>POSTagger</node>
<node>Chunker</node>
<node>LookupWindowAnnotator</node>
<node>DictionaryLookupAnnotatorDB</node>
<node>DependencyParser</node>
<node>AssertionAnnotator</node>
<node>ExtractionPrepAnnotator</node>

However, with SimpleSegmentAnnotator in the flow, it takes a very long time to process that much data (maybe a week or so). If I remove SimpleSegmentAnnotator, processing is very fast, but no medications are annotated. Do you have any suggestions?

Thanks,
Nick