Re: How to prevent WordDelimiterFilter tokenize the string with underscore?

2016-06-15 Thread Ahmet Arslan
Hi, You can supply custom types. please see WordDelimiterFilterFactory and wdfftypes.txt for an example. ahmet On Wednesday, June 15, 2016 10:32 PM, Xiaolong Zheng wrote: Hi, How can I prevent WordDelimiterFilter tokenize the string with underscore, e.g. word_with_underscore. I am using Wo

How to prevent WordDelimiterFilter tokenize the string with underscore?

2016-06-15 Thread Xiaolong Zheng
Hi, How can I prevent WordDelimiterFilter tokenize the string with underscore, e.g. word_with_underscore. I am using WordDelimiterFilter to create my own Camel Case analyzer, I was using the configuration flag: flags |= GENERATE_WORD_PARTS; flags |= SPLIT_ON_CASE_CHANGE; flags |= PRESERVE_ORIGIN

Re: How to get the index for a document after a search over multiple indexes

2016-06-15 Thread Mark Shapiro
Thanks, I appreciate the useful info. I can go with option 1. Mark

Getting exception while initializing FSDirectory

2016-06-15 Thread Mukul Ranjan
Hi, I'm getting below exception while initializing FSDirectory- Caused by: java.lang.IllegalAccessError: tried to access method org.apache.lucene.store.MMapDirectory.unmapHackImpl()Ljava/lang/Object; from class org.apache.lucene.store.MMapDirectory$$dtt <@> at java.lang.invoke.MethodHandleNativ

Facet

2016-06-15 Thread Marcio Napoli
Hey! The Lucene facets module uses integer encoding using the method " FacetsConfig.dedupAndEncode " . It would be convenient to use the IntPoint ? Thanks! Marcio Napoli

Using Lucene to model ownership of documents

2016-06-15 Thread Geebee Coder
Hi there, I would like to use Lucene to solve the following problem: 1.We have about 100k customers and we have 25 millions of documents. 2.When a customer performs a text search on the document space, we want to return only documents that the customer has access to. 3.The # of documents a custo