I presume Joern was suggesting that if he supports newlines in the OpenNLP
SentenceDetector (either as part of the trained models or as post-processing
with some rules?), cTAKES will be able to use it out of the box and we should be
able to remove any additional custom logic that we currently have- whic
The OpenNLP sentence detection model used by cTAKES does not split
sentences at newlines - there is additional logic in the cTAKES sentence
splitter that does this (and an alternative impl that doesn't is in the
ytex branch). AFAIK no retraining / change to the feature representation is
necessary.
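
To make the post-processing idea concrete, here is a rough sketch of what "split at newlines after the model has run" could look like. This is not the actual cTAKES code: the class NewlineSentenceSplitter and the method splitAtNewlines are hypothetical names, and plain int[] {start, end} pairs (end exclusive) stand in for whatever span type the detector returns.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for illustration; not part of OpenNLP or cTAKES. */
final class NewlineSentenceSplitter {

    /**
     * Takes sentence spans as {start, end} offsets (end exclusive) and
     * splits each one further wherever a line break occurs.
     */
    static List<int[]> splitAtNewlines(String text, List<int[]> sentenceSpans) {
        List<int[]> result = new ArrayList<>();
        for (int[] span : sentenceSpans) {
            int segmentStart = span[0];
            for (int i = span[0]; i <= span[1]; i++) {
                boolean atEnd = (i == span[1]);
                // Close the current segment at a line break or at the span end.
                if (atEnd || text.charAt(i) == '\n' || text.charAt(i) == '\r') {
                    addTrimmed(text, segmentStart, i, result);
                    segmentStart = i + 1;
                }
            }
        }
        return result;
    }

    /** Adds the span after trimming whitespace; drops it if nothing is left. */
    private static void addTrimmed(String text, int start, int end, List<int[]> out) {
        while (start < end && Character.isWhitespace(text.charAt(start))) {
            start++;
        }
        while (end > start && Character.isWhitespace(text.charAt(end - 1))) {
            end--;
        }
        if (start < end) {
            out.add(new int[] {start, end});
        }
    }
}
```

The input spans would presumably come from something like SentenceDetectorME.sentPosDetect, converted via getStart() / getEnd(). Since this only touches the offsets after detection, it fits with the point that the model itself would not need retraining for this behaviour.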
Hi all,
currently I have quite a bit of time to work on OpenNLP, and would like
to help you out with this issue.
Here is the follow-up issue for this change:
https://issues.apache.org/jira/browse/OPENNLP-602
I am still trying to figure out what would be the best option to
implement this.
In