Re: OpenNLP dictionary-based lemmatizer memory issue

2021-11-17 Thread Spyros Kapnissis
's > see if they have any insight and in a couple of weeks we merge! > Cheers > -- > Alessandro Benedetti > Apache Lucene/Solr Committer > Director, R&D Software Engineer, Search Consultant > > www.sease.io > > > On Mon, 15 Nov 2021 at

OpenNLP dictionary-based lemmatizer memory issue

2021-11-15 Thread Spyros Kapnissis
Hi all, We recently identified and fixed an issue with the OpenNLP dictionary-based lemmatizer that seems to affect all versions. It caused high memory usage and random OOM exceptions, high server load during both indexing and querying, and overall unstable performance. It

Re: MultipleAdditiveTreeModel

2021-08-03 Thread Spyros Kapnissis
> Apache Lucene/Solr Committer > Director, R&D Software Engineer, Search Consultant > > www.sease.io > > > On Wed, 28 Jul 2021 at 21:00, Spyros Kapnissis wrote: > > > Hi Alessandro, Roopa, I created the ticket here: > > https://issues.apache.org/jira/browse/SOLR

Re: MultipleAdditiveTreeModel

2021-07-28 Thread Spyros Kapnissis
-- > Alessandro Benedetti > Apache Lucene/Solr Committer > Director, R&D Software Engineer, Search Consultant > > www.sease.io > > > On Mon, 26 Jul 2021 at 21:29, Spyros Kapnissis wrote: > > > Hi Alessandro, Roopa, I also agree that this issue should b

Re: MultipleAdditiveTreeModel

2021-07-26 Thread Spyros Kapnissis
> > Alessandro Benedetti > > Apache Lucene/Solr Committer > > Director, R&D Software Engineer, Search Consultant > > > > www.sease.io > > > > > > On Fri, 16 Jul 2021 at 20:29, Roopa Rao wrote: > > > > >

Re: MultipleAdditiveTreeModel

2021-07-16 Thread Spyros Kapnissis
Hello, Just to verify this, we came across the exact same issue when converting an XGBoost model to MultipleAdditiveTrees. This was an issue specifically with the categorical features that take on integer values. We ended up subtracting 0.5 from the threshold value on any such split point on t
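
The threshold adjustment mentioned in the message above can be illustrated with a small sketch. It rests on assumptions that are not stated in the message itself: that the source model sends a sample left when `value < threshold`, that the target model sends it left when `value <= threshold`, and that thresholds on integer-valued features fall on whole or half numbers. The tree layout (`feature`, `threshold`, `left`, `right`, `value` keys) and the feature names are hypothetical and simplified, not the actual XGBoost dump or Solr model format.

```python
def xgboost_goes_left(value, threshold):
    # Assumption: the source model uses a strict comparison.
    return value < threshold


def inclusive_goes_left(value, threshold):
    # Assumption: the target model uses an inclusive comparison.
    return value <= threshold


def shift_integer_splits(node, integer_features, offset=0.5):
    """Return a copy of a (hypothetical) tree dict in which every split on an
    integer-valued feature has its threshold lowered by `offset`."""
    if "threshold" not in node:
        # Leaf node: nothing to adjust, just copy it.
        return dict(node)
    threshold = node["threshold"]
    if node["feature"] in integer_features:
        threshold -= offset
    return {
        "feature": node["feature"],
        "threshold": threshold,
        "left": shift_integer_splits(node["left"], integer_features, offset),
        "right": shift_integer_splits(node["right"], integer_features, offset),
    }


# Hypothetical tree: "category_id" takes integer values, "price" does not.
tree = {
    "feature": "category_id",
    "threshold": 3.0,
    "left": {
        "feature": "price",
        "threshold": 10.0,
        "left": {"value": 0.1},
        "right": {"value": 0.2},
    },
    "right": {"value": 0.3},
}

adjusted = shift_integer_splits(tree, {"category_id"})

# Without the shift, value 3 is routed differently by the two comparisons;
# with the shift, integer values 0..5 are routed identically.
mismatch_before = xgboost_goes_left(3, 3.0) != inclusive_goes_left(3, 3.0)
match_after = all(
    xgboost_goes_left(v, 3.0) == inclusive_goes_left(v, adjusted["threshold"])
    for v in range(6)
)
```

Under these assumptions, only splits on the integer-valued feature are shifted; the split on the continuous feature keeps its original threshold.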