Re: Phrase query vs span query

2006-02-21 Thread Chris Hostetter
: Rank 3: Documents containing atleast n (n < N, where N is total number of : query terms) in the same section and in order that's a non-trivial goal in itself -- even without the "in the same section" restriction, i can't think of a way to do that off the top of my head other then a Span query

Re: How can I get a term's frequency?

2006-02-21 Thread Daniel Noll
sog wrote: > > I search the index with a group of terms. I want to get every term's > frequency in each document of the search result. Are you looking for this? TermFreqVector vector = IndexReader.getTermFreqVector(docNum, "field"); That gives you the frequency of every term, but you can just

How can I get a term's frequency?

2006-02-21 Thread sog
I search the index with a group of terms. I want to get every term's frequency in each document of the search result. How can I? thx, sog - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMA

Searching/sorting strategy for many properties for semantic web app

2006-02-21 Thread David Pratt
Hi there. I am new to Lucene and I have been developing a semantic application for a while and it appears to me Lucene could help me to get a much needed search with reasonable speed. I have some general question to start: 1) Since my app is virtually all metadata, what should I store in the

Re: Phrase query vs span query

2006-02-21 Thread Rajesh Munavalli
On 2/21/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > > your "Aim of the Query formation" got truncated, so it's not entirely > clear what you are looking for, but if the general idea of what you are Documents should be ordered as follows Rank 1: Documents containing section containing all t

Re: Boolean Precedence

2006-02-21 Thread Erik Hatcher
On Feb 21, 2006, at 5:39 PM, Eric Jain wrote: Daniel Noll wrote: http://tinyurl.com/hzsna Thanks! There is some mention of "open issues" with this parser. Anyone know what these are, and if anyone is still working on this? I worked on it to a point, but I don't recall what open issues th

Re: Open an IndexWriter in parallel with an IndexReader on the same index.

2006-02-21 Thread Chris Hostetter
: It would have been nice if someone wrote something like indexModifier, : but with a cache, similar to what Yonik suggested above: deletions will : not be done immediately, but rather cached and later done in batches. : Of course, batched deletions should not remember the term to delete, : but ra

Re: Phrase query vs span query

2006-02-21 Thread Chris Hostetter
your "Aim of the Query formation" got truncated, so it's not entirely clear what you are looking for, but if the general idea of what you are looking for is that you want searches for phrase like "quick brown fox" to only match if/when the words "quick" "brown" and "fox" all appear in the same sec

Phrase query vs span query

2006-02-21 Thread Rajesh Munavalli
I am trying to adopt lucene for a special IR system. The following scenario is an approximation of what I am trying to do. Please bear with me if some things doesnt make sense. I need some suggestions on formulating queries for the following scenario Each document consists of a set of fields (stan

Re: Boolean Precedence

2006-02-21 Thread Eric Jain
Daniel Noll wrote: http://tinyurl.com/hzsna Thanks! There is some mention of "open issues" with this parser. Anyone know what these are, and if anyone is still working on this? - To unsubscribe, e-mail: [EMAIL PROTECTED] For

Re: Boolean Precedence

2006-02-21 Thread Daniel Noll
Eric Jain wrote: Searched the mailing list, found something about a "PrecedenceQueryParser", but this seems to have disappeared? It's moved: http://tinyurl.com/hzsna Daniel -- Daniel Noll Nuix Australia Pty Ltd Suite 79, 89 Jones St, Ultimo NSW 2007, Australia Phone: (02) 9280 0699 Fax:

Re: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread Mufaddal Khumri
Hi, Thats exactly what I am doing currently. Was just wondering if there is a lucene way to do what I am doing using QueryFilter etc. -Thanks. Dan Armbrust wrote: Mufaddal Khumri wrote: When I do a search for example on "batteries" i get 1200+ results. I would like to show the user lets s

Re: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread Mufaddal Khumri
Currently I am doing exactly that. I am boosting relevant docs and I am sorting in java to get the desired effect. I just was trying to see if I can do something using QueryFilter or Sorts and do what I am doing. -Thanks. John Powers wrote: Also, if you don't like the tag solution, you could

RE: webserverless search with lucene on offline HTML doc

2006-02-21 Thread Trieschnigg, R.B. \(Dolf\)
> Wouldn't this be a good case for the JarDirectory implementation > somebody asked for? > The index could then be statically written in a jar file downloaded > with the applet (the original mail refers to static offline HTML > files). I wrote a quick and dirty implementation of a JarDirectory - i

Re: fitler vs query

2006-02-21 Thread Chris Hostetter
: Hoss, come on, where are you - Filters! ;) Otis my man, cut me some slack! .. I'm in the Pacific/USA Timezone, I don't get to work untill 10AM at the earliest, and I'm a little wacked out from having a really delayed flight last night. : John, filters are handy for situations where you want to

RE: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread John Powers
Also, if you don't like the tag solution, you could borrow something right from LIA... boost the documents that are "significant" products with 1.5 (or whatever higher then 1), and the support/ancillary products boot with .1 If there is nothing relavent in the "significant" products, at least yo

RE: fitler vs query

2006-02-21 Thread John Powers
So by me building a new query instead of a filter, I'm missing this caching part?Seems pretty fast to me right now. But sounds like I should use the filter anyway. Thanks. -Original Message- From: Otis Gospodnetic [mailto:[EMAIL PROTECTED] Sent: Tuesday, February 21, 2006 1:04 P

Re: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread Dan Armbrust
Mufaddal Khumri wrote: When I do a search for example on "batteries" i get 1200+ results. I would like to show the user lets say 300. I can do that by only extracting the first 300 hits (sorted by decreasing relevance by default) and displaying those to the user. If you are only talking ab

Re: fitler vs query

2006-02-21 Thread Otis Gospodnetic
Hoss, come on, where are you - Filters! ;) John, filters are handy for situations where you want to run your query multiple times against the same sub-set of your indexed documents. For instance, if you have Documents that span several years, and want to be able to search individual years, you

Re: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread Otis Gospodnetic
It sounds like this is a webapp. I'd consider playing with HTML DOM a little bit - come up with a system where I get top N matches by relevance, store them somewhere, and then just re-sort them using users' criteria, without going back to the Lucene index. For instance, you could store this data

RE: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread John Powers
It seems to me that the "X" then is a problem for you.. like you said "coming up with the X" is hard. Maybe you want another tag then. We call those ones you are trying to not return "related items" on our search system.So you do a search that has +related:0 or something like that. T

Re: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread Mufaddal Khumri
So yes, if the xth + 1 item happens to be a camera and if its price happens to be lower than the previous x cameras it wont be included in this view and that is exactly what we want. Mufaddal Khumri wrote: In my case when we search for lets say cameras , my top x results are all sorts of came

Re: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread Mufaddal Khumri
In my case when we search for lets say cameras , my top x results are all sorts of cameras and then i get documents that match camera casings etc. As a company we want to show as many cameras as possible and not other camera related products for this one web view on a specific page we have. On

RE: get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread John Powers
I'm sure you've taken care of this, but I am curious myself: If the 301 document only has a single term "batteries" (and thus is so far low on the Hits), but has a price of seven cents, then the sort of all the documents with "batteries" would put this near the top, but by eliminating all document

get results by relevance, limiting results and then sort the results by some criterion

2006-02-21 Thread Mufaddal Khumri
When I do a search for example on "batteries" i get 1200+ results. I would like to show the user lets say 300. I can do that by only extracting the first 300 hits (sorted by decreasing relevance by default) and displaying those to the user. Now on the search results page, I have a drop down bo

Re: Open an IndexWriter in parallel with an IndexReader on the same index.

2006-02-21 Thread Paul . Illingworth
I have a set of classes similar in function to IndexModifier but a little more advanced. The idea is to keep the IndexReaders and IndexWriters open as long as possible only closing them when absolutely necessary. Using the concurrency package allows for me to have multiple readers and a singl

Re: Open an IndexWriter in parallel with an IndexReader on the same index.

2006-02-21 Thread Nadav Har'El
"Yonik Seeley" <[EMAIL PROTECTED]> wrote on 21/02/2006 05:13:52 PM: > On 2/21/06, Pierre Luc Dupont <[EMAIL PROTECTED]> wrote: > > is it possible to open an IndexWriter and an IndexReader on the same > > index, at the same time, > > to do deleteTerm and addDocument? > > No, it's not possible.

fitler vs query

2006-02-21 Thread John Powers
Hello, Before I learned about filters in lucene I was building my initial query as a stringbuffer and then I use that with a queryparser.Is there any difference/advantage to separating out the "filter" part of my query into a proper filter in lucene or does it just add requirements the same to

RE: Open an IndexWriter in parallel with an IndexReader on the same index.

2006-02-21 Thread Pierre Luc Dupont
Ok, thanks. That is what I was thinking. Pierre-Luc -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: 2006-02-21 10:14 To: java-user@lucene.apache.org Subject: Re: Open an IndexWriter in parallel with an IndexReader on the same index. On 2/21/06, Pierre Luc Dupont

Re: exact match ..

2006-02-21 Thread Robert Watkins
The way I have solved the problem of allowing exact matches is, for each field in which it is possible for an exact match to be requested, a parallel field is created at index time that is unstemmed and has a specific prefix: if (fieldData.isSearched() && tokenize && usingStemmingAnalyzer

Re: Open an IndexWriter in parallel with an IndexReader on the same index.

2006-02-21 Thread Yonik Seeley
On 2/21/06, Pierre Luc Dupont <[EMAIL PROTECTED]> wrote: > is it possible to open an IndexWriter and an IndexReader on the same > index, at the same time, > to do deleteTerm and addDocument? No, it's not possible. You should batch things: do all your deletions, close the IndexReader, then ope

Open an IndexWriter in parallel with an IndexReader on the same index.

2006-02-21 Thread Pierre Luc Dupont
Hi, is it possible to open an IndexWriter and an IndexReader on the same index, at the same time, to do deleteTerm and addDocument? Thanks! Pierre-Luc

Search through WML/WAP

2006-02-21 Thread artdivin
Hello, I'm trying to find a search engine allowing to search through WML (WAP). Do you know if Lucene enables to do it ? Did you already try it ? thanx for your answers ! artdivin - To unsubscribe, e-mail: [EMAIL PROTECTED] For ad

AccumulativeFieldAnalyzer

2006-02-21 Thread Oskar Berger
Hello all, Blatantly trying to take advantage of previous mistakes and expertise experience among all of you, I hereby ask for help in a small issue dealing with indexing into multiple fields I've run into. Objective: To index multiple fields derived from one source field. e.g. CASE, REVERSE, REV

Tools to save click through data in a search engine

2006-02-21 Thread rrshwrk
hi all, I am using lucene + tomcat to host a web search engine. I want to save the queries posted to the search engine and the clicked results returned by the search engine. However, i am unsure if they are any tools readily available to do that (prefererably open source tools) I am sure atleast s

Boolean Precedence

2006-02-21 Thread Eric Jain
I was wondering: Is there any good reason why x AND y OR z is interpreted as +(+x y z) rather than +(+(+x +y) z) ? If yes, any suggestions how this could be accomplished most easily? Searched the mailing list, found something about a "PrecedenceQueryParser", but this seems to have