Re: Getting CTakes to work with Windows

2018-04-03 Thread Ewan Mellor
You need to unpack the cTakes resources zip onto your classpath. See https://cwiki.apache.org/confluence/display/CTAKES/cTAKES+4.0+User+Install+Guide, points 4 and 5. That's got lvg.properties in it, but it needs the rest of the database unpacked too. It doesn't work with just the jar or zip, i

SentenceDetector

2018-04-06 Thread Ewan Mellor
I'm looking at SentenceDetector from ctakes-core. It has a surprising idea of what counts as a "sentence". Before I delve any deeper, I wanted to ask whether there is a reason for what it's doing, in particular whether there's anything in the clinical pipeline that's depending on its behavior spe

Re: consequences of change to typesystem [EXTERNAL]

2018-04-06 Thread Ewan Mellor
And if someone with access rights wants to put that on takes.apache.org, there's a ticket for it: https://issues.apache.org/jira/browse/CTAKES-499 Ewan. On Tue, Apr 03, 2018 at 06:10:46PM +, Gandhi Rajan Natarajan wrote: > Hi Sean, > > Please find the response from Sean Finan for the simil

Re: SentenceDetector [EXTERNAL] [SUSPICIOUS]

2018-04-17 Thread Ewan Mellor
ence splitter model, here: > org/apache/ctakes/core/ae/SentenceDetectorAnnotatorBIO.java > > it operates at the character level and splits a document into sentences. For > some people it works better. For data where there are potentially > mid-sentence newlines (like MIMIC), it is

Re: cTakes on Apache Spark - Error

2018-05-01 Thread Ewan Mellor
There is a point in LvgCmdApiResourceImpl where it changes the working directory so that LVG can find the config file. I have no idea how this would be supposed to work on Spark, but I guess that using relative paths in your config is going to be a problem. There is also a point in LvgCmdApiResou