Tonic!

-----Original Message-----
From: Joern Kottmann [mailto:kottm...@gmail.com]
Sent: Thursday, January 26, 2017 5:15 AM
To: dev@ctakes.apache.org
Subject: Re: Update to OpenNLP 1.7.1

Yes, the version should be updated to 1.7.1 and the opennlp maxent
dependency should be removed. I prepared a patch yesterday and will
open a Jira issue tonight and attach it there. Once the patch is
attached, I will ask for a review and, if you don't mind, I would then
commit it myself.

Jörn

On Thu, Jan 26, 2017 at 5:42 AM, Andrey Kurdumov <kant2...@googlemail.com> wrote:

> Hello Joern,
>
> What changes are needed to update to OpenNLP 1.7.1? Should I change only
> pom.xml or something else?
> I would like to try it out. I maintain a kind of cTAKES fork,
> exactly for the purpose of keeping up with dependency updates, until they
> land in cTAKES itself.
>
> 2017-01-26 4:00 GMT+06:00 Joern Kottmann <kottm...@gmail.com>:
>
> > Hello,
> >
> > yes, I did a lot of testing to make sure that 1.7.0 and 1.7.1 are as
> > good as any other OpenNLP release.
> >
> > The OpenNLP eval tests use all the data we currently support to
> > ensure it all works. If you have some specific corpora you need to
> > be sure OpenNLP performs well on, we can try to add them to the
> > evaluation tests.
> >
> > Looks like to update we need to change the version and remove the
> > maxent dependencies. The maxent part is now integrated into
> > opennlp-tools.
> >
> > The existing models dating back to 1.5.0 are supported and should
> > work just as well. In some rare cases we fixed obvious bugs which
> > can affect the performance, but this should be really minor.
> >
> > In case you have the data, I would like to encourage you to retrain
> > with 1.7.1; otherwise you are also fine with the 1.5.x models.
> >
> > Also, I can dedicate some time to ensure the corpora you are using
> > are supported by the OpenNLP formats package.
> >
> > Your POS Tagger readme mentions GENIA, do you still use that?
> >
> > The POS Tagger these days can also be trained directly on the Penn
> > Treebank, but you could also use OntoNotes.
> >
> > Should I send you a patch to update your poms?
> >
> > I also noticed you are not using the opennlp-uima integration, is
> > that not working for cTAKES? We could try to find a solution which
> > works for you without duplicating our effort.
> >
> > Jörn
> >
> >
> > On Wed, 2017-01-25 at 18:43 +0000, Finan, Sean wrote:
> > > Hi Jorn,
> > >
> > > Great idea! Have you done any testing with the latest release? I
> > > think that cTAKES has 3 dependencies on OpenNLP libraries. We
> > > should try to get our integration with that update (and others)
> > > solidified for the upcoming release. The Boston Children's
> > > Hospital NLP team just gained a new member who will be devoting a
> > > significant amount of time to a 3.2.3 (or 3.3.0) release of
> > > cTAKES. He is James Masanz, one of the original developers of
> > > cTAKES, and we are extremely happy to have him onboard.
> > >
> > > Thank you for the notification,
> > > Sean
> > >
> > >
> > > -----Original Message-----
> > > From: Joern Kottmann [mailto:kottm...@gmail.com]
> > > Sent: Tuesday, January 24, 2017 4:52 AM
> > > To: dev@ctakes.apache.org
> > > Subject: Update to OpenNLP 1.7.1
> > >
> > > Hello all,
> > >
> > > the OpenNLP team just released a new version and I would like to
> > > propose to update cTAKES to it.
> > >
> > > The release should be fully backward compatible. If you don't mind,
> > > I am happy to send you a patch to make it easier to migrate.
> > >
> > > The OpenNLP project also ships a UIMA integration which, as far as
> > > I can tell, could be used by cTAKES directly.
> > >
> > > BR,
> > > Jörn
> > >