Re: [blog post] Automatically Acquiring Synonym Knowledge from Wikipedia

2013-05-28 Thread Tommaso Teofili
2013/5/29 Koji Sekiguchi > Hi Rajesh, > > Thanks! > I'm planning to open an NLP tool kit for Lucene, and the tool kit will > include > the following synonym library. > sounds nice, looking forward to it. Tommaso > > koji > > > (13/05/28 14:12), Rajesh Nikam wrote: > >> Hello Koji, >> >> This

Re: [blog post] Automatically Acquiring Synonym Knowledge from Wikipedia

2013-05-28 Thread Rajesh Nikam
Hi Koji, Great news! I am looking forward to this OpenNLP toolkit. Thanks a lot! Rajesh On Wed, May 29, 2013 at 4:12 AM, Koji Sekiguchi wrote: > Hi Rajesh, > > Thanks! > I'm planning to open an NLP tool kit for Lucene, and the tool kit will > include > the following synonym library. > > k

Re: Difference between Group and Facet

2013-05-28 Thread raj
I will update here if we finally decide to go with this. It looks like our requirement can be accomplished with normal String fields. On Wed, May 29, 2013 at 7:55 AM, Brendan Grainger < brendan.grain...@gmail.com> wrote: > Great, did it work? > > By the way, I found this article: > http://blog.thetap

Re: Lucene 4.2 DocValues

2013-05-28 Thread Arun Kumar K
Adrien, Thanks for taking the time to explain things clearly. I understand it correctly now. Thanks, Arun On 29-May-2013, at 2:13 AM, Adrien Grand wrote: > On Tue, May 28, 2013 at 8:55 PM, Arun Kumar K wrote: >> Thanks for clarifying things. >> I have some doubts regarding so

Re: Difference between Group and Facet

2013-05-28 Thread Brendan Grainger
Great, did it work? By the way, I found this article: http://blog.thetaphi.de/2012/02/is-your-indexreader-atomic-major.html very helpful in understanding the difference between AtomicReaders and CompositeReaders. Brendan On Tue, May 28, 2013 at 2:30 AM, raj wrote: > Thanks a lot Elisabeth >

Re: PrefixQuery with short prefix does not match documents

2013-05-28 Thread Steven Schlansker
Hi Mike, Thank you for the pointer, that is indeed the cause here. The reason I added the rewrite was to preserve the boost of the field on matches. Specifically, some results have a field boost of log(popularity) and others have a field boost of 100 to float them to the top. Without the rewrit

Re: [blog post] Automatically Acquiring Synonym Knowledge from Wikipedia

2013-05-28 Thread Koji Sekiguchi
Hi Rajesh, Thanks! I'm planning to open an NLP tool kit for Lucene, and the tool kit will include the following synonym library. koji (13/05/28 14:12), Rajesh Nikam wrote: Hello Koji, This seems a pretty useful post on how to create a synonyms file. Thanks a lot for sharing this! Have you sh

Re: Lucene 4.2 DocValues

2013-05-28 Thread Adrien Grand
On Tue, May 28, 2013 at 8:55 PM, Arun Kumar K wrote: > Thanks for clarifying things. > I have some doubts regarding sorting: >> >> While you can do that, I don't recommend it. For example, if you have >> 5 fields, loading all fields from stored fields requires at most 1 >> disk seek while loa

Re: Lucene 4.2 DocValues

2013-05-28 Thread Arun Kumar K
Hi Adrien, Thanks for clarifying things. I have some doubts regarding sorting: > > While you can do that, I don't recommend it. For example, if you have > 5 fields, loading all fields from stored fields requires at most 1 > disk seek while loading all fields from doc values requires at leas

Re: Lucene 4.2 DocValues

2013-05-28 Thread Adrien Grand
On Tue, May 28, 2013 at 4:48 PM, Arun Kumar K wrote: > Hi Guys, Hi, > I have been trying to understand DocValues and get some hands-on experience, and I have > observed a few things. > > I have added a LongDocValuesField to the documents like this: > doc.add(new LongDocValuesField("id", 1)); > > 1> In 4.0 I saw that th

Infinispan & JGroups migrating to Apache License

2013-05-28 Thread Sanne Grinovero
Hello all, as some of you already know the Infinispan project includes several integration points with the Apache Lucene project, including a Directory implementation, but so far we had a separate community because of the license incompatibility. I'm very happy to announce now that both Infinispan

Lucene 4.2 DocValues

2013-05-28 Thread Arun Kumar K
Hi Guys, I have been trying to understand DocValues and get some hands-on experience, and I have observed a few things. I have added a LongDocValuesField to the documents like this: doc.add(new LongDocValuesField("id", 1)); 1> In 4.0 I saw that there are two versions of doc values, RAM resident (using Sources.getSO

Re: Faceted Search: count direct matches/member für result nodes

2013-05-28 Thread Shai Erera
Hi, I think that building the result tree is a good decision. And I think that you can accomplish that by implementing a FacetResultsHandler which takes the FacetArrays (which contain the counts for all categories) and builds the result FacetResult including all tree sub-nodes. You can find somewh

Re: Faceted Search: count direct matches/member für result nodes

2013-05-28 Thread Schimke, Danny
Hi, I finished 3 of my 4 requirements. * Indirect membership for list and hierarchical * Direct membership for flat list. I accomplished the flat list for direct memberships by indexing a second category using another category path prefix, and "id" in this case does not contain any slash "/" e

Re: Faceted Search: count direct matches/member für result nodes

2013-05-28 Thread Schimke, Danny
Hi Shai, you understood correctly, fantastic! You described my data representation properly. Yes, one document is a person, and for each community there is a referenced category, currently with a hierarchical String representation delimited by slash "/", in the taxonomy index. Yes, communities

Re: Faceted Search: count direct matches/member für result nodes

2013-05-28 Thread Shai Erera
What represents a document in the index, the persons? And each person is associated with 0..N communities? And are communities hierarchical? So if I have these two documents: *P1 * name: John Doe communities: C1/C2, C4/C5/C6 *P2 * name: John Smith communities: C1/C3, C4/C7 Does that represent th
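
A minimal sketch of the distinction this thread is about, assuming the two example persons above (P1 and P2) and slash-delimited community paths. It is plain Java and does not use the Lucene facet API; the class name `CommunityFacetCounts` and its methods are hypothetical and only illustrate the difference between counting direct members of a node and counting members rolled up from all of its sub-nodes.

```java
import java.util.*;

public class CommunityFacetCounts {

    // Direct membership: a person counts only for the exact paths it is assigned to.
    static Map<String, Integer> directCounts(Map<String, List<String>> persons) {
        Map<String, Integer> counts = new TreeMap<>();
        for (List<String> paths : persons.values()) {
            // A set guards against the same path being listed twice for one person.
            for (String path : new HashSet<>(paths)) {
                counts.merge(path, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Rolled-up (indirect) membership: a person counts for each assigned path
    // and for every ancestor of that path, but only once per node.
    static Map<String, Integer> rolledUpCounts(Map<String, List<String>> persons) {
        Map<String, Integer> counts = new TreeMap<>();
        for (List<String> paths : persons.values()) {
            Set<String> nodes = new HashSet<>();
            for (String path : paths) {
                int idx = path.indexOf('/');
                while (idx >= 0) {
                    nodes.add(path.substring(0, idx)); // ancestor prefix
                    idx = path.indexOf('/', idx + 1);
                }
                nodes.add(path); // the assigned node itself
            }
            for (String node : nodes) {
                counts.merge(node, 1, Integer::sum);
            }
        }
        return counts;
    }

    // The two example documents from the message above.
    static Map<String, List<String>> samplePersons() {
        Map<String, List<String>> persons = new LinkedHashMap<>();
        persons.put("P1", Arrays.asList("C1/C2", "C4/C5/C6"));
        persons.put("P2", Arrays.asList("C1/C3", "C4/C7"));
        return persons;
    }

    public static void main(String[] args) {
        System.out.println("direct:    " + directCounts(samplePersons()));
        System.out.println("rolled up: " + rolledUpCounts(samplePersons()));
    }
}
```

In this sketch C1 has no direct members but two rolled-up members, which is the kind of gap between the hierarchical view and the flat view that the thread discusses.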

Re: Faceted Search: count direct matches/member für result nodes

2013-05-28 Thread Schimke, Danny
Hello Shai, here is my scenario: There are persons. Each person can be a member of 0..n groups. On the frontend there are 2 different views. One is hierarchical, the other one is a list (flat). These groups should be shown as facets (how many persons are members of a group). A further option (on

Re: Faceted Search: count direct matches/member für result nodes

2013-05-28 Thread Shai Erera
Danny, can you try to describe your problem again? I don't think overriding OrdinalPolicy is something that you want to do in general. If you can, please provide a short example (even in text) with 1-2 documents, with their facets and what you're trying to achieve. Shai On Tue, May 28, 2013 at 9