Sweet! That was definitely it! It's flying now (granted, our files are not
in the > 1 mb realm, like it the jira issue - just in the nnn.kb realm, but
still!).

Mahalo nui loa!



On Tue, Sep 24, 2019 at 6:29 PM Finan, Sean <
sean.fi...@childrens.harvard.edu> wrote:

> Hi Greg,
>
> Check your log to see what component is taking all the time.
>
> There is a known problem with the cleartk assertion annotators:
>
> https://issues.apache.org/jira/browse/CTAKES-449
>
> A partial fix was made in the "windowed" sub-package of ctakes-assertion:
> org.apache.ctakes.assertion.medfacts.cleartk.windowed.
>
> Each of the normal assertion engines has a replacement in the windowed
> package.
>
> If you are using a piper file that contains "load AttributeCleartkSubPipe"
> as the Default clinical pipeline does, just replace it with "load
> WindowedAttributeCleartkSubPipe".
>
> It isn't a full fix for the problem, and I don't know if it will make your
> processing faster, but  you can give it a try.
>
> Sean
>
> ________________________________________
> From: Greg Silverman <g...@umn.edu>
> Sent: Tuesday, September 24, 2019 6:47 PM
> To: dev@ctakes.apache.org
> Subject: Large files taking forever to process [EXTERNAL]
>
> Any suggestions on how to speed up processing large clinical text notes
> approaching 13K lines? This is a very old corpus culled from EPIC notes
> back in 2009. I thought about splitting the notes into smaller chunks, but
> then I would have to deal with the offsets when analyzing system output
> against manual annotations that had been done.
>
> As is, I've tried different garbage collection options (this seemed to have
> worked well with CLAMP on the same set of notes).
>
> TIA!
>
> Greg--
>
> --
> Greg M. Silverman
> Senior Systems Developer
> NLP/IE <
> https://urldefense.proofpoint.com/v2/url?u=https-3A__healthinformatics.umn.edu_research_nlpie-2Dgroup&d=DwIFaQ&c=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFU&r=fs67GvlGZstTpyIisCYNYmQCP6r0bcpKGd4f7d4gTao&m=kVCVyGR2m-zb7CsPmrrCeBL1N-9Z6tXZOp869xqkcBQ&s=TEirYUPMXTOjZ1PoJMxTXt7M8I5axwQI9zzNrvLmGRo&e=
> >
> Department of Surgery
> University of Minnesota
> g...@umn.edu
>
>  ›  evaluate-it.org  ‹
>


-- 
Greg M. Silverman
Senior Systems Developer
NLP/IE <https://healthinformatics.umn.edu/research/nlpie-group>
Department of Surgery
University of Minnesota
g...@umn.edu

 ›  evaluate-it.org  ‹

Reply via email to