Split / Concatenation of search term

2016-11-30 Thread hariram ravichandran
Is it possible to handle split and concatenated words, i.e. when a space has been inserted inside a word or removed between two words? For example, "entert ainment" should match "entertainment" and "smartwatch" should match "smart watch".
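
To make the question concrete, here is a minimal sketch of the behaviour being asked for, written in plain Java with no Lucene dependency. The class name `SplitConcatSketch`, the method `variants`, and the in-memory dictionary are all hypothetical and exist only for illustration; in a real index the dictionary would presumably come from the indexed terms. Lucene's spellchecker module also ships a `WordBreakSpellChecker` that targets this kind of word-break problem and may be worth checking before building anything by hand.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class SplitConcatSketch {

    /**
     * Returns query variants obtained by joining two adjacent tokens or by
     * splitting a single token in two. A variant is kept only if the new
     * word(s) it introduces are present in the (hypothetical) dictionary.
     */
    public static Set<String> variants(String query, Set<String> dictionary) {
        String[] tokens = query.trim().split("\\s+");
        Set<String> out = new LinkedHashSet<>();

        // Concatenation: "entert ainment" -> "entertainment"
        for (int i = 0; i + 1 < tokens.length; i++) {
            String joined = tokens[i] + tokens[i + 1];
            if (dictionary.contains(joined)) {
                out.add(rebuild(tokens, i, 2, joined));
            }
        }

        // Split: "smartwatch" -> "smart watch"
        for (int i = 0; i < tokens.length; i++) {
            String t = tokens[i];
            for (int cut = 1; cut < t.length(); cut++) {
                String left = t.substring(0, cut);
                String right = t.substring(cut);
                if (dictionary.contains(left) && dictionary.contains(right)) {
                    out.add(rebuild(tokens, i, 1, left + " " + right));
                }
            }
        }
        return out;
    }

    /** Replaces `count` tokens starting at `start` with `replacement`. */
    private static String rebuild(String[] tokens, int start, int count, String replacement) {
        List<String> parts = new ArrayList<>();
        for (int i = 0; i < tokens.length; i++) {
            if (i == start) {
                parts.add(replacement);
                i += count - 1; // skip the tokens that were replaced
            } else {
                parts.add(tokens[i]);
            }
        }
        return String.join(" ", parts);
    }

    public static void main(String[] args) {
        Set<String> dict = Set.of("entertainment", "smart", "watch");
        System.out.println(variants("entert ainment", dict));
        System.out.println(variants("smartwatch", dict));
    }
}
```

The variants could then be OR-ed with the original query at search time; whether that is cheaper than handling the problem at index time (for example with shingles) depends on the index and is not settled by this sketch.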

Save the date: ApacheCon Miami, May 15-19, 2017

2016-11-30 Thread Rich Bowen
Dear Apache enthusiast, ApacheCon and Apache Big Data will be held at the Intercontinental in Miami, Florida, May 16-18, 2017. Submit your talks, and register, at http://apachecon.com/ Talks aimed at the Big Data section of the event should go to http://events.linuxfoundation.org/events/apache-bi

Re: commit frequency guideline?

2016-11-30 Thread Rob Audenaerde
Thanks for the quick reply! >What do you mean by "Lucene complain about too-many uncommitted docs"? --> good question, I was thoughtlessly echoing words from my colleague. I asked him and he said that it was about taking very long to commit and memory issues. So maybe this wasn't the best opening

Re: commit frequency guideline?

2016-11-30 Thread Michael McCandless
What do you mean by "Lucene complain about too-many uncommitted docs"? Lucene does not really care how frequently you commit... How frequently you commit is really your choice, i.e. what risk you see of power loss / OS crash vs the cost (not just in CPU/IO work for the computer, but in the users

commit frequency guideline?

2016-11-30 Thread Rob Audenaerde
Hi all, Currently we call commit() many times on our index (about 5M docs, with some 10,000-100,000 modifications during the day). The commit times typically get more expensive as the index grows, up to several seconds, so we want to reduce the number of calls. (Historically, we had Lucene com

Re: Query expansion

2016-11-30 Thread hariram ravichandran
I am overriding getFieldQuery(String field, String fieldText, boolean quoted). In the case of a phrase query, getFieldQuery(String field, String queryText, int slop) will be called. Prefix queries are not part of my use case, so we can ignore them. Assume this is my only case. Sequence of

Re: Faceting : what are the limitations of Taxonomy (Separate index and hierarchical facets) and SortedSetDocValuesFacetField ( flat facets and no sidecar index) ?

2016-11-30 Thread Chitra R
Thank you so much, Shai... Chitra On Wed, Nov 30, 2016 at 2:17 PM, Shai Erera wrote: > This feature is not available in Lucene currently, but it shouldn't be hard to add it. See Mike's comment here: > http://blog.mikemccandless.com/2013/05/dynamic-faceting-with-lucene.html?showComment=1412

Re: Faceting : what are the limitations of Taxonomy (Separate index and hierarchical facets) and SortedSetDocValuesFacetField ( flat facets and no sidecar index) ?

2016-11-30 Thread Shai Erera
This feature is not available in Lucene currently, but it shouldn't be hard to add it. See Mike's comment here: http://blog.mikemccandless.com/2013/05/dynamic-faceting-with-lucene.html?showComment=1412777154420#c363162440067733144 One more tricky (yet nicer) feature would be to have it all in one