Re: Lucene Challenge - sum, count, avg, etc.

2010-04-01 Thread Jason Eacott
Thanks for the ref - didn't know about Pig before. The language and approach look useful, so now I'm wondering whether it could be used across Lucene over Hadoop too. If data were indexed in Lucene and Pig knew that, it could make for an interesting alternative Lucene query language. Could this w

fastest way to gather simple terms that match documents?

2010-03-31 Thread Jason Eacott
Hi all, after I've run a query I need to know which terms matched each result document (i.e. doc term frequency > 0). The only way I know to do this is by calling explain on each document, which the documentation claims is almost the equivalent of a new query for each call, so I'm keen to avoid th
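
To make the question concrete, here is a minimal sketch of the check being asked for: given the query terms and each hit's term frequencies, keep the terms whose frequency in that document is greater than zero. It deliberately uses no Lucene API. The class name MatchedTerms, the method matchedTerms, and the in-memory frequency maps are all hypothetical; in a real index the per-document frequencies would have to come from Lucene itself (for example from term vectors, assuming they were stored at index time).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: plain Java collections stand in for the index.
public class MatchedTerms {

    // For each document, return the query terms whose term frequency is > 0.
    // termFreqsByDoc maps a document id to (term -> frequency in that document).
    static Map<String, List<String>> matchedTerms(
            List<String> queryTerms,
            Map<String, Map<String, Integer>> termFreqsByDoc) {
        Map<String, List<String>> result = new LinkedHashMap<>();
        for (Map.Entry<String, Map<String, Integer>> doc : termFreqsByDoc.entrySet()) {
            List<String> matched = new ArrayList<>();
            for (String term : queryTerms) {
                // A term absent from the map counts as frequency 0.
                if (doc.getValue().getOrDefault(term, 0) > 0) {
                    matched.add(term);
                }
            }
            result.put(doc.getKey(), matched);
        }
        return result;
    }

    public static void main(String[] args) {
        // Assumed sample data: two hits with made-up term frequencies.
        Map<String, Integer> doc1 = new HashMap<>();
        doc1.put("a", 2);
        doc1.put("c", 1);
        Map<String, Integer> doc2 = new HashMap<>();
        doc2.put("a", 1);
        doc2.put("b", 3);

        Map<String, Map<String, Integer>> hits = new LinkedHashMap<>();
        hits.put("doc1", doc1);
        hits.put("doc2", doc2);

        System.out.println(matchedTerms(Arrays.asList("a", "b"), hits));
    }
}
```

The cost here is one map lookup per query term per hit, which is the kind of work the poster hopes to do instead of one explain call per document. Whether the frequencies can be obtained that cheaply from the index is exactly the open question in the post.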

Re: is it possible to make lucene searches match based on per doc field:termcount?

2009-11-06 Thread Jason Eacott
s too) Cheers Jason. Grant Ingersoll wrote: On Nov 5, 2009, at 4:31 PM, Jason Eacott wrote: Hi All, I hope someone can offer some advice. I want to extend Lucene to search in a particular way (if it can't already): I want to index docs, each with a field containing several terms something

is it possible to make lucene searches match based on per doc field:termcount?

2009-11-05 Thread Jason Eacott
Hi All, I hope someone can offer some advice. I want to extend Lucene to search in a particular way (if it can't already): I want to index docs, each with a field containing several terms, something like: doc1=>myfield:a doc2=>myfield:a,b doc3=>myfield:a,b,c doc4=>myfield:a,b,c,d So far nothing new.

lucene 1.4 + needs spaces problem

2005-04-06 Thread Jason Eacott
Hi, I recently upgraded from Lucene 1.3 final to 1.4 and discovered some things which no longer seem to work right. I am using: Analyzer analyzer = new StandardAnalyzer(); QueryParser parser = new QueryParser("terms", analyzer); parser.setOperator(QueryParser.DEFAULT_OPERATOR_AND); q