You need to unpack the cTakes resources zip onto your classpath.
See
https://cwiki.apache.org/confluence/display/CTAKES/cTAKES+4.0+User+Install+Guide,
points 4 and 5. That's got lvg.properties in it, but it needs the
rest of the database unpacked too. It doesn't work with just the jar or zip,
i
I'm looking at SentenceDetector from ctakes-core. It has a surprising
idea of what counts as a "sentence". Before I delve any deeper,
I wanted to ask whether there is a reason for what it's doing, in particular
whether there's anything in the clinical pipeline that's depending on its
behavior spe
And if someone with access rights wants to put that on takes.apache.org,
there's a ticket for it:
https://issues.apache.org/jira/browse/CTAKES-499
Ewan.
On Tue, Apr 03, 2018 at 06:10:46PM +, Gandhi Rajan Natarajan wrote:
> Hi Sean,
>
> Please find the response from Sean Finan for the simil
ence splitter model, here:
> org/apache/ctakes/core/ae/SentenceDetectorAnnotatorBIO.java
>
> it operates at the character level and splits a document into sentences. For
> some people it works better. For data where there are potentially
> mid-sentence newlines (like MIMIC), it is
There is a point in LvgCmdApiResourceImpl where it changes the working
directory so that LVG can find the config file. I have no idea how this
would be supposed to work on Spark, but I guess that using relative paths
in your config is going to be a problem.
There is also a point in LvgCmdApiResou