Re: Text dependent analyzer

2015-04-20 Thread Shay Hummel
Hi Rich,

Thank you very much. I understand your solution and will try to do something in that spirit.

Shay

On Fri, Apr 17, 2015 at 8:35 PM Rich Cariens wrote:
> Ahoy, ahoy!
>
> I was playing around with something similar for indexing multi-lingual
> documents, Shay. The code is up on github
> <

Re: Changing analyzer in an indexwriter

2015-04-20 Thread Anna Elisabetta Ziri
Thank you everyone!

Could I suggest that this change be mentioned in the migration guide? In our case it requires a complete reindex and code changes (in search too), so others may be in the same situation.

Thank you again for the prompt help,
Lisa

On Mon, 20 Apr 2015 at 10:4

Re: Changing analyzer in an indexwriter

2015-04-20 Thread Michael McCandless
See the discussion on https://issues.apache.org/jira/browse/LUCENE-6212 about this?

It's best to isolate different languages into different fields...

Mike McCandless
http://blog.mikemccandless.com

On Sun, Apr 19, 2015 at 8:37 AM, Ahmet Arslan wrote:
> Hi Lisa,
>
> I think AnalyzerWrapper
> ht
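
The advice above to keep each language in its own field can be illustrated with a small language-neutral sketch. It does not use Lucene (where PerFieldAnalyzerWrapper offers per-field dispatch of this kind). The field names `body_en` and `body_it`, the toy stop-word lists, and the helper `analyze_field` are all hypothetical and chosen only for illustration.

```python
# Toy "analyzers": lowercase, split on whitespace, drop a few stop words.
# The stop-word sets are illustrative assumptions, not real analyzer configs.
def analyze_english(text):
    stop = {"the", "a", "of"}
    return [t for t in text.lower().split() if t not in stop]


def analyze_italian(text):
    stop = {"il", "la", "di"}
    return [t for t in text.lower().split() if t not in stop]


def analyze_default(text):
    return text.lower().split()


# One analyzer per field; the field names are hypothetical.
ANALYZERS = {
    "body_en": analyze_english,
    "body_it": analyze_italian,
}


def analyze_field(field, text):
    """Pick the analyzer by field name, falling back to a default."""
    return ANALYZERS.get(field, analyze_default)(text)
```

Because the analyzer is chosen by field name, the same dispatch can be applied at index time and at query time, which is presumably why separate fields are easier to manage than swapping the analyzer per document.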

Re: SortingAtomicReader alternate to Tim-Sort...

2015-04-20 Thread Adrien Grand
I like these ideas; the int[] arrays we are using today are wasteful. My only concern about using FixedBitSet is that it would make sorting each postings list run in O(maxDoc), but maybe we can do better by using SparseFixedBitSet (added in 5.0; given your code snippets I assume you are still on 4
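
The cost concern above can be made concrete with a rough sketch. This is not Lucene's FixedBitSet or SparseFixedBitSet code: it is a simplified model, assuming 64-bit words and a plain dictionary as the sparse structure, that counts how many words each approach visits when sorting a short list of doc IDs.

```python
WORD_BITS = 64


def _bits_of(word_index, word):
    """Yield the doc IDs of the set bits in one word, in ascending order."""
    while word:
        low = word & -word  # isolate the lowest set bit
        yield word_index * WORD_BITS + low.bit_length() - 1
        word ^= low


def sort_with_dense_bitset(doc_ids, max_doc):
    """Mark each doc ID in a dense bitset, then scan every word."""
    words = [0] * ((max_doc + WORD_BITS - 1) // WORD_BITS)
    for doc in doc_ids:
        words[doc // WORD_BITS] |= 1 << (doc % WORD_BITS)
    out, visited = [], 0
    for i, word in enumerate(words):
        visited += 1  # every word is touched, even if empty
        out.extend(_bits_of(i, word))
    return out, visited


def sort_with_sparse_bitset(doc_ids):
    """Same idea, but only words that hold at least one bit are stored."""
    words = {}
    for doc in doc_ids:
        idx = doc // WORD_BITS
        words[idx] = words.get(idx, 0) | (1 << (doc % WORD_BITS))
    out, visited = [], 0
    for i in sorted(words):  # only the non-empty words are visited
        visited += 1
        out.extend(_bits_of(i, words[i]))
    return out, visited
```

With four doc IDs spread over a one-million-document index, the dense version scans every word of the bitset while the sparse version visits only the words that contain a bit. The sparse model pays for sorting its word indices instead, so it is only a win when the postings list is short relative to maxDoc.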