Hi Greg,

It looks like the offending class might be 
org.apache.ctakes.core.fsm.adapters.DecimalTokenAdapter, which seems to be the 
only NumberTokenAdapter without a try/catch around the parse(..).
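
For illustration only, here is a minimal sketch of the kind of guard I mean.  
This is not the actual cTAKES source: the class name SafeDecimalParse, the 
method parseOrDefault, and the fallback value are all hypothetical, and what an 
adapter should do on a bad token may differ.

```java
// Hypothetical stand-alone example; not the real DecimalTokenAdapter code.
final class SafeDecimalParse {

    // Parse text as a double, returning the fallback instead of letting
    // NumberFormatException propagate up into the annotator.
    static double parseOrDefault(String text, double fallback) {
        try {
            return Double.parseDouble(text);
        } catch (NumberFormatException e) {
            // A token such as "f" contains no digits and cannot be parsed.
            return fallback;
        }
    }

    public static void main(String[] args) {
        System.out.println(parseOrDefault("3.14", 0.0)); // prints 3.14
        System.out.println(parseOrDefault("f", 0.0));    // prints 0.0
    }
}
```

Without the try/catch, the second call would throw the same 
NumberFormatException that shows up in your stack trace.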

I made a fix that passes my tests.  Since you say that you cannot build 
trunk/4.0.1 locally (custom changes?), I can do one of two things:

1.  Send you a copy of the compiled DecimalTokenAdapter class, or
2.  Send you a copy of the complete ctakes-core jar.

The advantage of #1 is that any local custom changes that you have made to the 
ctakes-core module won't be overwritten.  If you haven't updated from trunk in 
a while, then a full jar will most likely contain other changes as well.
The disadvantage of #1 is minor: you will need to manually insert the 
DecimalTokenAdapter.class file into your local ctakes-core.jar file.  Since a 
jar file is just a zip file with a different extension, this is pretty easy to 
do, and you can find plenty of how-to guides on the web.
The advantage of #2 is that you simply replace the jar file in your bin/ 
directory with the new copy.  The disadvantage is the flip side of #1: the new 
jar would overwrite any local custom changes and bring in any other trunk 
changes.
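
If you go with #1 and would rather script the insertion than use a zip tool 
(the JDK's "jar uf" command can also update an entry), here is a rough sketch 
using the JDK's built-in zip file system.  The class name JarClassPatcher and 
the argument layout are my own invention for this example, and the entry path 
assumes the class sits in the jar under its package name.  Back up the jar 
first, since this modifies it in place.

```java
import java.io.IOException;
import java.net.URI;
import java.nio.file.FileSystem;
import java.nio.file.FileSystems;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;
import java.util.Collections;

// Hypothetical helper: replaces (or adds) one entry inside an existing jar.
final class JarClassPatcher {

    static void replaceEntry(Path jar, Path newFile, String entryName) throws IOException {
        // A jar is a zip archive, so the zip file system provider can open it.
        URI uri = URI.create("jar:" + jar.toAbsolutePath().toUri());
        try (FileSystem zipFs = FileSystems.newFileSystem(uri, Collections.<String, Object>emptyMap())) {
            Path target = zipFs.getPath(entryName);
            if (target.getParent() != null) {
                Files.createDirectories(target.getParent());
            }
            // Overwrite the old entry; the archive is rewritten when zipFs closes.
            Files.copy(newFile, target, StandardCopyOption.REPLACE_EXISTING);
        }
    }

    public static void main(String[] args) throws IOException {
        // args[0]: path to the jar, args[1]: path to the new .class file
        replaceEntry(Paths.get(args[0]), Paths.get(args[1]),
                "org/apache/ctakes/core/fsm/adapters/DecimalTokenAdapter.class");
    }
}
```

You would run it with the path to your jar and the path to the class file as 
the two arguments.  Your jar may have a version number in its file name.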

I recommend #1.

Sean

________________________________________
From: Greg Silverman <g...@umn.edu.INVALID>
Sent: Wednesday, February 10, 2021 10:25 AM
To: dev@ctakes.apache.org; Himanshu Shekhar Sahoo
Subject: error: CRITICAL [EXTERNAL]

We're running version 4.0.0.1 on ~12K notes. The first time we ran it I got
a heap space error at ~10.5k notes processed (at about ~38 hours).

I increased the heap space params and then reran. This time it died at the
same place, but with a different error (see below):

SEVERE: Exception occurred
org.apache.uima.analysis_engine.AnalysisEngineProcessException
        at org.apache.ctakes.contexttokenizer.ae.ContextDependentTokenizerAnnotator.process(ContextDependentTokenizerAnnotator.java:105)
        at org.apache.uima.analysis_component.JCasAnnotator_ImplBase.process(JCasAnnotator_ImplBase.java:48)
        at org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl.callAnalysisComponentProcess(PrimitiveAnalysisEngine_impl.java:396)
        at org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl.processAndOutputNewCASes(PrimitiveAnalysisEngine_impl.java:314)
        at org.apache.uima.analysis_engine.asb.impl.ASB_impl$AggregateCasIterator.processUntilNextOutputCas(ASB_impl.java:570)
        at org.apache.uima.analysis_engine.asb.impl.ASB_impl$AggregateCasIterator.<init>(ASB_impl.java:412)
        at org.apache.uima.analysis_engine.asb.impl.ASB_impl.process(ASB_impl.java:344)
        at org.apache.uima.analysis_engine.impl.AggregateAnalysisEngine_impl.processAndOutputNewCASes(AggregateAnalysisEngine_impl.java:265)
...
Caused by: java.lang.NumberFormatException: For input string: "f"
        at sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:2043)
        at sun.misc.FloatingDecimal.parseDouble(FloatingDecimal.java:110)
        at java.lang.Double.parseDouble(Double.java:538)
--------

Thus, it looks like a string is being detected as a float. This had worked
in version 4.0.1, so it must have been fixed at some point. Even after I
made changes for the new NLM authentication for UMLS and tested it in 4.0.1
based on Peter's authentication solution, it stopped working after January
15th.  Unfortunately, we're not set up to compile 4.0.1.

That being said, does someone have a working version of 4.0.1 built from
the trunk? If so, could you please send me a copy?

If not, how can I find the offending file?

This is kind of critical, since we're in the middle of an experiment and
another side effect of reverting to 4.0.0.1 is it is a LOT slower than
4.0.1.

Thanks very much in advance!

Greg--

--
Greg M. Silverman
Senior Systems Developer
NLP/IE 
<https://urldefense.proofpoint.com/v2/url?u=https-3A__healthinformatics.umn.edu_research_nlpie-2Dgroup&d=DwIBaQ&c=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFU&r=fs67GvlGZstTpyIisCYNYmQCP6r0bcpKGd4f7d4gTao&m=-u760Da1zvrINW3S87TjC6FqWdNhNselgSQIfTTg4-c&s=oKao5Bz-_QkhcdgBtI7kF7CYmF1jl3v-7oaijC0jwZ4&e=>
Department of Surgery
University of Minnesota
g...@umn.edu
