Re: How to create document objects in our case

2011-05-22 Thread Michael McCandless
-user@lucene.apache.org > Subject: Re: How to create document objects in our case > > Norms is how Lucene records what the apriori boost is for each > docXfield.  This boost is the product of per-field boost, per-doc > boost (both of which your app would set when it creates the doc),

RE: How to create document objects in our case

2011-05-22 Thread zhoucheng2008
Great, thanks Mike. -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Sunday, May 22, 2011 8:09 PM To: java-user@lucene.apache.org Subject: Re: How to create document objects in our case Norms is how Lucene records what the apriori boost is for each

Re: How to create document objects in our case

2011-05-22 Thread Michael McCandless
ted document will be applicable? > > Cheng > > > -Original Message- > From: Michael McCandless [mailto:luc...@mikemccandless.com] > Sent: Sunday, May 22, 2011 6:58 PM > To: java-user@lucene.apache.org > Subject: Re: How to create document objects in our case > > 3

RE: How to create document objects in our case

2011-05-22 Thread zhoucheng2008
ent will be applicable? Cheng -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Sunday, May 22, 2011 6:58 PM To: java-user@lucene.apache.org Subject: Re: How to create document objects in our case 30 fields is fine, but if they are all indexed you should w

Re: How to create document objects in our case

2011-05-22 Thread Michael McCandless
30 fields is fine, but if they are all indexed you should watch out for memory usage. Ie, norms require 1 byte per doc per indexed field. If you don't need norms (don't boost, lengths don't vary much or you don't care to have field length impact scoring) you can omit norms. The relationship b/w

How to create document objects in our case

2011-05-20 Thread Cheng Zhou
Hi, I have a large number of XML files to be indexed by Lucene. All the files share similar structure as below: .. Things to be noted are: The root element of Group has 30 or so attributes, and it usually has over 2000 Subgroup elements, which in turn also have more than 20