Re: How do we reverse sort on the docid ?

2011-05-16 Thread shrinath.m
idden email]<http://user/SendEmail.jtp?type=node&node=2949011&i=1> > > > > > > > -Original Message- > > > From: shrinath.m [mailto:[hidden > > > email]<http://user/SendEmail.jtp?type=node&node=2949011&i=2>] > > > &g

What is the relation between n-grams/edge n-grams with prefix query ?

2011-05-16 Thread shrinath.m
I was under an assumption that edge n-grams speed up the prefix queries, but now it seems that it is not so... Am I right in thinking so ? Are there any relation between n-grams and prefix queries ? -- View this message in context: http://lucene.472066.n3.nabble.com/What-is-the-relation-betwee

Re: How do we reverse sort on the docid ?

2011-05-16 Thread shrinath.m
Erick Erickson wrote: > > At any rate, I think all you need to do is specify > the reverse boolean in the SortField c'tor??? > > Best > Erick > > The "true" in `Sort srt = new Sort(new SortField(null,SortField.DOC,true)); ` actually is the reverse boolean sort field. So it is turned on, but

Re: How do we reverse sort on the docid ?

2011-05-16 Thread shrinath.m
Erick Erickson wrote: > > Why do you want to do this? the internal doc ids are > transient. If you update a document by delete/add, the > internal id will now be different. What I'm getting at is > that I'd like to be sure the use case here does what > you think it will because this smells like a

How do we reverse sort on the docid ?

2011-05-16 Thread shrinath.m
What I want: docids that matched the search string are : 5 4 3 2 1 What I am getting : docids that matched the search string are : 999 998 997 996 995 I am lucene sort object, and it is constructed like this : Sort srt = new Sort(new SortField(null,SortField.DOC,true)); How do I get wh

switching between Query parsers

2011-04-14 Thread shrinath.m
Consider this case : Lucene index contains documents with these fields : title author publisher I have coded my app to use MultiFieldQueryParser so that it queries all fields. Now if user types something like "author:tom" in search box, how do I make it go with only that field ? -- View thi

Re: Highlighting a phrase with "Single"

2011-04-06 Thread shrinath.m
Thats right :) Thanks Koji :) On Wed, Apr 6, 2011 at 3:31 PM, Koji Sekiguchi [via Lucene] < ml-node+2784321-1329059645-376...@n3.nabble.com> wrote: > (11/04/06 14:01), shrinath.m wrote: > > > If there is a phrase in search, the highlighter highlights every word > > se

Highlighting a phrase with "Single"

2011-04-05 Thread shrinath.m
If there is a phrase in search, the highlighter highlights every word separately.. Like this : I love Lucene Instead what I want is like this : I love Lucene Is there a way to ask Lucene do this ? I know we could ask css or jquery to do the task, but whats the point ? right ? So, is

Re: Difference between regular Highlighter and Fast Vector Highlighter ?

2011-04-01 Thread shrinath.m
Got it :) Thanks for the link. [closed] On Sat, Apr 2, 2011 at 6:14 AM, Koji Sekiguchi [via Lucene] < ml-node+2765616-1923995541-376...@n3.nabble.com> wrote: > (11/04/01 21:32), shrinath.m wrote: > > I was wondering whats the difference between the Lucene's 2

Difference between regular Highlighter and Fast Vector Highlighter ?

2011-04-01 Thread shrinath.m
I was wondering whats the difference between the Lucene's 2 implementation of highlighters... I saw the javadoc of FVH, but it only says "another implementation of Lucene Highlighter" ... Can someone throw some more light on this ? -- View this message in context: http://lucene.472066.n3.nabbl

Re: Is it possible to update only selected fields in a document ?

2011-03-22 Thread shrinath.m
ind indexing it, but I wouldn't want it to pop up in search results or Autocomplete :) I'd say we can hide/block it from search in our app as Michael suggests, again, is that how everyone does it ? Is that the "best" way ? -- Regards Shrinath.M -- View this message in context:

Re: Is it possible to update only selected fields in a document ?

2011-03-22 Thread shrinath.m
the fields, but I wanted to do that without making the *id* field searchable. ( I mean, *not indexed* => Field.Index.NO) Is that possible ? -- Regards Shrinath.M -- View this message in context: http://lucene.472066.n3.nabble.com/Is-it-possible-to-update-only-selected-fields-in-a-document-

Re: Is it possible to update only selected fields in a document ?

2011-03-22 Thread shrinath.m
ique id into a HashMap, - replace the field which I want to update - call update on the index with the map. Is there any efficient way of doing this ? -- Regards Shrinath.M -- View this message in context: http://lucene.472066.n3.nabble.com/Is-it-possible-to-update-only-selected-fields

Is it possible to update only selected fields in a document ?

2011-03-22 Thread shrinath.m
I am asking for partial update in Lucene, where I want to update only a selected field of all fields in the document. Does Lucene provide any way to do this ? How to approach this ? -- View this message in context: http://lucene.472066.n3.nabble.com/Is-it-possible-to-update-only-selected-fie

Re: Re: How to do an all field search without using a "catchall" fieldor "MultiFieldQueryParser" ?

2011-03-17 Thread shrinath.m
7;t know what fields are in document) -- Regards Shrinath.M -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-do-an-all-field-search-without-using-a-catchall-field-or-MultiFieldQueryParser-tp2681163p2693635.html Sent from the Lucene - Java Users mailing list archive at Nabble.com.

Re: Re: How to do an all field search without using a "catchall" fieldor "MultiFieldQueryParser" ?

2011-03-17 Thread shrinath.m
FieldQueryParser.parse(Version.LUCENE_CURRENT, s, fields, flags, new StandardAnalyzer(Version.LUCENE_CURRENT, Collections.emptySet())); :( -- Regards Shrinath.M -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-do-an-all-field-search-without-using-a-catchall-field-or-Mult

Re: Re: How to do an all field search without using a "catchall" fieldor "MultiFieldQueryParser" ?

2011-03-17 Thread shrinath.m
ope that the 2_3_1 links don't indicate that you are using that version. > > > yeah, my bad... I am using 3.0.3... http://lucene.apache.org/java/3_0_3/api/all/org/apache/lucene/queryParser/MultiFieldQueryParser.html <http://lucene.apache.org/java/3_0_3/api/all/org/apache/lucene/query

Re: Re: How to do an all field search without using a "catchall" fieldor "MultiFieldQueryParser" ?

2011-03-17 Thread shrinath.m
i/org/apache/lucene/queryParser/MultiFieldQueryParser.html <http://lucene.apache.org/java/2_3_1/api/org/apache/lucene/queryParser/MultiFieldQueryParser.html>Look at the parse method. -- Regards Shrinath.M -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-do-an-all-field-sea

RE:Re: How to do an all field search without using a "catchall" fieldor "MultiFieldQueryParser" ?

2011-03-15 Thread shrinath.m
Thanks Ian, I'd have to create a similarly sized array of Boolean.Occur too, isn't it? 1. Is that how SIREn implements it? 2. Is that optimal solution if I'm going to have an index of a billion docs with varying number of fields? -- View this message in context: http://lucene.472066.n3.na

How to do an all field search without using a "catchall" field or "MultiFieldQueryParser" ?

2011-03-15 Thread shrinath.m
Catchall field has its own disadvantages of increasing index size. MultiFieldQueryParser has to know the field names before hand. How do I do a multi field search - without knowing fields in the index - without having a CatchAll field PS : I went through the Lucene in Action book, and in cha

Re: Which is the +best +fast HTML parser/tokenizer that I can use with Lucene for indexing HTML content today ?

2011-03-14 Thread shrinath.m
Earl Hood wrote: > > Looks like Jericho does what you want already: > http://jericho.htmlparser.net/docs/javadoc/net/htmlparser/jericho/TextExtractor.html > > --ewh > I went through their feature list and found that out :) http://jericho.htmlparser.net/docs/index.html Thanks Earl :) This i

Re: Which is the +best +fast HTML parser/tokenizer that I can use with Lucene for indexing HTML content today ?

2011-03-14 Thread shrinath.m
I started trying out all your suggestions one by one, thanks to all who helped. I used Jericho and found it extremely simple to start with ... Just wanted to clarify one thing though. Is there some tool that does extract text from HTML without creating the DOM ? -- Regards Shrinath.M

Re: Which is the +best +fast HTML parser/tokenizer that I can use with Lucene for indexing HTML content today ?

2011-03-11 Thread shrinath.m
t; understands HTML, HTML-aware operations are outside Solr's purview. > > This is how Solr achieve it : http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.HTMLStripStandardTokenizerFactory -- Regards Shrinath.M -- View this message in context: http://lucene.472066.n3.na

Re: Which is the +best +fast HTML parser/tokenizer that I can use with Lucene for indexing HTML content today ?

2011-03-11 Thread shrinath.m
ML pages, no parsing while crawling, now what ? Any tokenizer someone has built for this ? How does Solr do it ? -- Regards Shrinath.M -- View this message in context: http://lucene.472066.n3.nabble.com/Which-is-the-best-fast-HTML-parser-tokenizer-that-I-can-use-with-Lucene-for-inde

Re: Which is the +best +fast HTML parser/tokenizer that I can use with Lucene for indexing HTML content today ?

2011-03-11 Thread shrinath.m
, Li Li [via Lucene] < ml-node+2664327-2139887543-376...@n3.nabble.com> wrote: > http://java-source.net/open-source/html-parsers > > 2011/3/11 shrinath.m <[hidden > email]<http://user/SendEmail.jtp?type=node&node=2664327&i=0&by-user=t>> > > > >

Re: spell checker across multiple fields

2011-03-09 Thread shrinath.m
Hi, can I know, if there is any improvement since 4 years ? Can we do that now in Lucene 3.0.3 ? -- View this message in context: http://lucene.472066.n3.nabble.com/spell-checker-across-multiple-fields-tp539392p2654706.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. -

Re: phrase query highlighter spans matching

2011-03-08 Thread shrinath.m
Koji Sekiguchi wrote: > > SpanScorer implementation has replaced QueryScorer at LUCENE-1685. > From contrib/CHANGES.txt: > > * LUCENE-1685: The position aware SpanScorer has become the default > scorer > for Highlighting. The SpanScorer implementation has replaced > QueryScorer > a