Re: Omitting term frequencies while preserving positions

2013-08-05 Thread Simon Willnauer
The reason you can't omit it today is that $num_position == $term_frequency, i.e. we need to store it anyway. Yet I kind of agree that this is an implementation detail, so we could in theory return 1 as the TF from the DocsAndPositionsEnum, but this would break our APIs as well since DocsAndPositionsEnum require
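
To illustrate the point that the number of stored positions equals the term frequency, here is a minimal toy sketch in plain Java. It is not Lucene's postings format or API; the class `ToyPostings` and its methods are hypothetical names invented for this illustration. It only shows that once positions are kept per document, the frequency can be read off as the length of the position list, so dropping it would save nothing in this model.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical toy model of positional postings; NOT Lucene's real data structure.
public class ToyPostings {
    // term -> (docId -> list of token positions within that document)
    private final Map<String, Map<Integer, List<Integer>>> postings = new LinkedHashMap<>();

    // Tokenize on whitespace and record the position of every token.
    public void addDocument(int docId, String text) {
        String[] tokens = text.trim().toLowerCase().split("\\s+");
        for (int pos = 0; pos < tokens.length; pos++) {
            if (tokens[pos].isEmpty()) {
                continue; // happens only for an empty document
            }
            postings.computeIfAbsent(tokens[pos], t -> new LinkedHashMap<>())
                    .computeIfAbsent(docId, d -> new ArrayList<>())
                    .add(pos);
        }
    }

    // Positions of the term in the document, or an empty list if absent.
    public List<Integer> positions(String term, int docId) {
        return postings.getOrDefault(term, Collections.emptyMap())
                       .getOrDefault(docId, Collections.emptyList());
    }

    // In this model the frequency is never stored separately:
    // it is simply the number of recorded positions.
    public int freq(String term, int docId) {
        return positions(term, docId).size();
    }

    public static void main(String[] args) {
        ToyPostings index = new ToyPostings();
        index.addDocument(1, "to be or not to be");
        System.out.println(index.positions("to", 1)); // prints [0, 4]
        System.out.println(index.freq("to", 1));      // prints 2
    }
}
```

In a model like this, ignoring frequency for scoring would have to happen at scoring time rather than at indexing time, since the count comes for free with the positions.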

Omitting term frequencies while preserving positions

2013-08-05 Thread Ivan Brusic
As the subject says, is it possible to omit the term frequencies for a field, but still keep positions? Term frequencies are omitted for better scoring under our model, but positions are required for span queries. Are the two concepts related? Are they indexed in the same data structure? One optio

Re: How to Index each file and then each Line for Complete Phrase Match. Sample Data shown.

2013-08-05 Thread Michael McCandless
Why not use one of the suggesters under lucene/suggest/*? Mike McCandless http://blog.mikemccandless.com On Mon, Aug 5, 2013 at 4:49 AM, Ankit Murarka wrote: > Hello. > > 1. What I am trying to implement is "Complete Suggestion Match-Did You Mean > feature for a phrase. I did it for Single Wor

RE: PhraseQuery Search

2013-08-05 Thread Allison, Timothy B.
Try: http://lucene.apache.org/core/4_4_0/queryparser/org/apache/lucene/queryparser/complexPhrase/ComplexPhraseQueryParser.html -----Original Message----- From: raghavendra.k@barclays.com [mailto:raghavendra.k@barclays.com] Sent: Friday, August 02, 2013 3:17 PM To: java-user@lucene.apach

Re: How to Index each file and then each Line for Complete Phrase Match. Sample Data shown.

2013-08-05 Thread Ankit Murarka
Hello. 1. What I am trying to implement is "Complete Suggestion Match-Did You Mean feature for a phrase. I did it for Single Word. I want to do it now for Sentence." 2. My understanding of indexing each line as a valid phrase in a particular file is as follows: a. Instead of providing