Re: asking about index verification tools

2010-11-15 Thread Anshum
Hi, One way to do that would be to iterate the terms and then reconstruct the document or just check for the terms one after the other. Though Luke also reconstructs the document and you could use the reconstruction logic to do the same and compare, it is not guaranteed that the reconstruction wou

asking about index verification tools

2010-11-15 Thread Yakob
hello all, I would like to ask about lucene index. I mean I created a simple program that created lucene indexes and stored it in a folder. also I had use a diagnostic tools name Luke to be able to lurk inside lucene index and find out its content. and I know that lucene is a standard framework whe

Re: What is the best Analyzer and Parser for this type of question?

2010-11-15 Thread Lance Norskog
First, to understand what your query looks like, go to admin/analysis.jsp. It lets you see what happens to your queries when they go in. Then, do the query with debugQuery=true. This will add some complex junk to the end of the XML page that describes in painful detail exactly how each document

Re: What is the best Analyzer and Parser for this type of question?

2010-11-15 Thread Ahmet Arslan
> Example of Question: > - What is the role of PrnP in mad cow disease? First thing is do not directly query questions. Manually formulate queries: remove 'what' 'is' 'the' 'of' '?' etc. For example i would convert this question into: "mad cow"^5 "cow disease"^3 "mad cow disease"^15 "role PrnP"

Re: What is the best Analyzer and Parser for this type of question?

2010-11-15 Thread Celso Fontes
Hi Erick, My queries going from a list of Genomic TREC 2006... What the operator you recommend to me? Thanks, Celso 2010/11/15 Erick Erickson > First question: What's the default operator? Out of > the box, its OR. See QueryParser.setDefaultOperator... > > Second, how are you forming your q

Re: What is the best Analyzer and Parser for this type of question?

2010-11-15 Thread Erick Erickson
First question: What's the default operator? Out of the box, its OR. See QueryParser.setDefaultOperator... Second, how are you forming your query? Just running it at the query parser? Query.toString() may be your friend. Best Erick On Mon, Nov 15, 2010 at 2:20 PM, Celso Fontes wrote: > I am u

Solr 1.4.1 stats component count not matching facet count for multi valued field

2010-11-15 Thread Johannes Goll
Hi, I have a facet field called option which may be multi-valued and a weight field which is single-valued. When I use the Solr 1.4.1 stats component with a facet field, i.e. q=*:*&version=2.2&stats=true&stats.field=weight&stats.facet=option I get conflicting results for the stats count result

What is the best Analyzer and Parser for this type of question?

2010-11-15 Thread Celso Fontes
I am using this code, with SnowBall and TopDocScore the code: http://pastebin.com/3X3gbpXE Example of Question: - What is the role of PrnP in mad cow disease? I am running in 11.638 documents and the result is 10410 docs for this question (loww precision) How optimize this? Thanks, Celso. -

Re: Deleted File Handles - Index Writer

2010-11-15 Thread Michael McCandless
OK I opened https://issues.apache.org/jira/browse/LUCENE-2762 for this issue. Mike On Fri, Nov 12, 2010 at 5:21 PM, Michael McCandless wrote: > Are you using compound file format (the default)? > > If you turn that off (just for this test) do you still see that > IndexWriter is holding open the

NoVA/DC - Lucene/Solr Meetup - Wednesday, Nov. 17

2010-11-15 Thread Erik Hatcher
We still have some open spots for the meetup we're hosting this Wednesday night in DC. Come on out, it'll be a great time. Erik -