Re: Jason format exception for stats component facet results | stddev value of NaN

2010-11-18 Thread Chris Hostetter
: I am getting a mal-formatted json response when using the
: stats component with a facet that returns a stddev value of NaN, e.g.
I'm not a JSON expert, but I suspect NaN just isn't legal JSON, and the JSON response writer has a bug. Quick google search... http://stackoverflow.com/questions/14
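The suspicion that NaN is not legal JSON can be checked outside Solr. The sketch below uses Python's standard `json` module, not Solr's JSON response writer; the `stats` payload and the `nan_to_null` helper are hypothetical names made up for illustration. It shows a lenient encoder emitting a bare `NaN` token, a strict encoder refusing it, and one possible workaround (mapping non-finite values to `null`).

```python
import json
import math

# Hypothetical stats payload containing a NaN standard deviation.
stats = {"stddev": float("nan")}

# By default Python's encoder emits the bare token NaN, which the JSON
# grammar does not allow, so strict parsers will reject this string.
lenient = json.dumps(stats)

# With allow_nan=False the encoder raises instead of emitting invalid JSON.
try:
    json.dumps(stats, allow_nan=False)
    strict_error = None
except ValueError as exc:
    strict_error = exc


def nan_to_null(value):
    """Recursively replace NaN/Infinity floats with None (JSON null)."""
    if isinstance(value, float) and not math.isfinite(value):
        return None
    if isinstance(value, dict):
        return {key: nan_to_null(item) for key, item in value.items()}
    if isinstance(value, list):
        return [nan_to_null(item) for item in value]
    return value


# One possible workaround: sanitize the payload before strict encoding.
safe = json.dumps(nan_to_null(stats), allow_nan=False)
```

Whether a server should emit `null`, a quoted string, or omit the field entirely is a design choice; this only illustrates why a raw `NaN` breaks strict JSON consumers.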

Re: Solr 1.4.1 stats component count not matching facet count for multi valued field

2010-11-18 Thread Chris Hostetter
1) I would suggest you use the solr-user mailing list for questions like this.
2) Stats faceting has known bugs with multivalued *and* non-string fields: https://issues.apache.org/jira/browse/SOLR-1782
-Hoss

Re: IndexWriter.close() performance issue

2010-11-18 Thread Mark Kristensson
I finally bucked up and made the change to CheckIndex to verify that I do not, in fact, have any fields with norms in this index. The result is below - the largest segment currently is #3, which has 300,000+ fields but no norms.
-Mark
Segments file=segments_acew numSegments=9 version=FORMAT_DIAGN

Re: uncorrect results

2010-11-18 Thread Jan
Hmm, OK, I tried it but to no avail. It would have confused me even more, to be honest. Actually, I would not have used a Document Collector at all, because I was supposed to give all results even when queried "the". What I mean is that I would not need the score at all. I just didn't know how ;) Anyw

Re: uncorrect results

2010-11-18 Thread Pulkit Singhal
Briefly looked at your code and there is no way that I'm right about this but I'll say it anyway: Every single field you index doesn't have any NORMS, so how will the scoring happen? It probably happens based on the matches at query time, but it's not like you are specifying any boosts in your query. L

Re: uncorrect results

2010-11-18 Thread Pulkit Singhal
Wow, you live in a really great country and attend an awesome university where they have classes like "Text Analytics". I'm gonna send my kid there to study :) In all seriousness, I think the problem may be with how you are collecting your results. I find this very amusing: > 80. 896889 phrase occu

Re: Deleted File Handles - Index Writer

2010-11-18 Thread Michael McCandless
On Thu, Nov 18, 2010 at 10:10 AM, Thomas Rewig wrote:
> Hi Michael,
>
> Thanks for your answer and sorry for my late reply.
>>
>> Are you using compound file format (the default)?
>>
> Yes I am using the compound file format as default.
>>
>> If you turn that off (just for this test) do you still

How to combine QueryParser and Wildcard search

2010-11-18 Thread Pulkit Singhal
Hello, I was wondering if there is any API call in Lucene that allows something like the following:
Step 1: Take the user input "hello world" you are beautiful
Step 2: QueryParser does its thing defaultField:hello world defaultField:you defaultField:are defaultField:beautiful
Step 3: And someho

Re: Deleted File Handles - Index Writer

2010-11-18 Thread Thomas Rewig
Hi Michael,
Thanks for your answer and sorry for my late reply.
> Are you using compound file format (the default)?
Yes I am using the compound file format as default.
> If you turn that off (just for this test) do you still see that IndexWriter is holding open the files (35 in your example) aft

Custom similarity calculation ignoring fieldnorm

2010-11-18 Thread Philippe
Dear Lucene group, I wrote my own Scorer by extending Similarity. The scorer works quite well, but I would like to ignore the fieldnorm value. Is this somehow possible during search time? Or do I have to add a field indexed with no_norm? Best, Philippe

Re: KeywordAnalyzer and Boosting

2010-11-18 Thread Pulkit Singhal
Thanks Ian, Yup that would do the trick for me, it seems. Also I would like to say that the following also worked, I only realized it after I went through the scores coming from my results step by step: KeywordAnalyzer + Index.ANALYZED (index-time norms were present) Cheers! On Thu, Nov 18, 20

Re: lucene anchor-distance based search

2010-11-18 Thread yang Yang
BTW, for some condition-required searches I can make the condition a filter and then filter the result. I can also build a BooleanQuery from the condition, just like the code in the range search. I wonder which is better?
2010/11/18 yang Yang
> Thank you very much!!! :)
>
> I will have a

Re: KeywordAnalyzer and Boosting

2010-11-18 Thread Ian Lea
Have you tried explicitly setting norms on/off the way you want with Field.setOmitNorms(boolean)? -- Ian. On Thu, Nov 18, 2010 at 12:54 AM, Pulkit Singhal wrote: > Based on my experimentation and what it says in the Lucene 2nd edition book: > "Using a KeywordAnalyzer on special fields during in