Boosting a subquery

2005-09-29 Thread Anand Kishore
Hi, LIA has info about boosting terms and/or fields but how do I go about boosting an entire bracketed subquery. For example: if I have a query consisting of 2 subqueries like: (xyz:AAA OR qwe:BBB) OR (xyz:CCC OR zxc:DDD) I want to boost subquery1 so that documents are ranked accordingly. -- - A

lucene and UTF-8

2005-09-29 Thread John Cherouvim
Hello I'm having some problems indexing my UTF-8 html pages. I am running lucene on Linux and I cannot understand why does the index generated depends on the locale of my operating system. If I do set | grep LANG I get: LANG=el_GR which is Greek. If I set this to en_US the index generated will

Mirroring a remote index using only metadata

2005-09-29 Thread Murat Yakici
Hi, Let's assume that there is one remote index and one local index. I would like to create a mirror of the remote locally. I'm using a kind of protocol in between (which is not important) to only transfer each document ID, the unique terms in the document and the frequencies. Right now I'm n

Re: lucene and UTF-8

2005-09-29 Thread John Haxby
John Cherouvim wrote: I'm having some problems indexing my UTF-8 html pages. I am running lucene on Linux and I cannot understand why does the index generated depends on the locale of my operating system. If I do set | grep LANG I get: LANG=el_GR which is Greek. If I set this to en_US the inde

Re: lucene and UTF-8

2005-09-29 Thread Andrzej Bialecki
John Cherouvim wrote: Hello I'm having some problems indexing my UTF-8 html pages. I am running lucene on Linux and I cannot understand why does the index generated depends on the locale of my operating system. If I do set | grep LANG I get: LANG=el_GR which is Greek. If I set this to en_US t

Lucene vs SQL database

2005-09-29 Thread Eugeny N Dzhurinsky
Is there any way to use the contents of the SQL database as source for Lucene search engine? Is there any example applications or workarounds to do that? -- Eugene N Dzhurinsky - To unsubscribe, e-mail: [EMAIL PROTECTED] For add

Re: Lucene vs SQL database

2005-09-29 Thread George Abraham
Eugene, You could grab all the fields for a record in a SQL database, mash it all together and transfer it into one indexing field in Lucene. Use some scripting tools (or even JDBC and Java) to do this. However if you are asking if Lucene can go and look over a SQL database and return results, that

Re: Lucene vs SQL database

2005-09-29 Thread Nader Henein
You can use something like hibernate to load the database tables into java objects and then load them into Lucene Documents, fast and dirty will take you a few hours to code, but if you're going the distance a couple of days should do the trick. Nader Henein George Abraham wrote: Eugene, Yo

Re: Lucene vs SQL database

2005-09-29 Thread Eugeny N Dzhurinsky
On Thu, Sep 29, 2005 at 08:39:53AM -0400, George Abraham wrote: > Eugene, > You could grab all the fields for a record in a SQL database, mash it all > together and transfer it into one indexing field in Lucene. Use some > scripting tools (or even JDBC and Java) to do this. However if you are > ask

Re: A very technical question.

2005-09-29 Thread Dawid Weiss
Thanks for all the responses, guys. I'll analyze them and post my results if any. Doug's suggestion was closest to what I tentatively felt it could look like. I'll see if I can make it work. D. - To unsubscribe, e-mail: [EMA

Re: Lucene vs SQL database

2005-09-29 Thread Erik Hatcher
On Sep 29, 2005, at 8:46 AM, Eugeny N Dzhurinsky wrote: On Thu, Sep 29, 2005 at 08:39:53AM -0400, George Abraham wrote: Eugene, You could grab all the fields for a record in a SQL database, mash it all together and transfer it into one indexing field in Lucene. Use some scripting tools (or

luke start problem

2005-09-29 Thread Dirk Hennig
Hello, I downloaded lukeall.jar, put it in my classpath and tried to start it: > java org.getopt.luke.Luke and I get: -- Exception in thread "main" java.lang.SecurityException: class "org.apache.lucene.store.IndexInput"'s signer information does not match signer information of other classe

Re: Boosting a subquery

2005-09-29 Thread Erik Hatcher
On Sep 29, 2005, at 5:42 AM, Anand Kishore wrote: Hi, LIA has info about boosting terms and/or fields but how do I go about boosting an entire bracketed subquery. For example: if I have a query consisting of 2 subqueries like: (xyz:AAA OR qwe:BBB) OR (xyz:CCC OR zxc:DDD) I want to boost subqu

Re: Lucene vs SQL database

2005-09-29 Thread Mag Gam
Check this link outI am trying to do the same http://marc.theaimsgroup.com/?l=lucene-user&m=100556272928584&w=2 I am using Apache Derby and trying to integrate that with lucene Its tough to find a very very simple example for this online. goodluck On 9/29/05, Erik Hatcher <[EMAIL PRO

Re: Lucene vs SQL database

2005-09-29 Thread Eugeny N Dzhurinsky
On Thu, Sep 29, 2005 at 09:29:56AM -0400, Mag Gam wrote: > Check this link outI am trying to do the same > > http://marc.theaimsgroup.com/?l=lucene-user&m=100556272928584&w=2 > I am using Apache Derby and trying to integrate that with lucene > Its tough to find a very very simple example f

TermDocs.freq()

2005-09-29 Thread Tricia Williams
I am finding that TermDocs.freq() method is returning an incorrect value. I was wondering if anyone else had experienced this problem. I am using tp = IndexReader.termPositions( queryTerm ) to return a object which implements TermPositions. I then use tp.skipTo( docid ) to go directly to the docu

Re: TermDocs.freq()

2005-09-29 Thread Jérôme BENOIS
Hello everybody, I would like implement a "Google Suggest" (http://www.google.com/webhp?complete=1&hl=en) like but how to get similar criteria and number of results. Are you an idea ? Thanks, Jérôme. signature.asc Description: This is a digitally signed message part

Re: TermDocs.freq()

2005-09-29 Thread Greg Gershman
Save user queries in a database along with number of results from last time queried, use that as suggestion base. Notice that Google's result count in Suggest differs from the actual result count. They are not computing results on the fly. Greg --- Jérôme BENOIS <[EMAIL PROTECTED]> wrote: > He

Revisting FieldCacheImpl

2005-09-29 Thread Greg Gershman
Our search engine updates frequently, adding and removing documents from the index. After an index update, we create a new Searcher in the background, and execute a search against it to "prime" the sorting by fields. The new Searcher is swapped for the old. >From my understanding, this is a fair

Re: Lucene vs SQL database

2005-09-29 Thread Chris Lu
We have a product, DBSight, just to extract database content and render search results. Many features can be found on the website. Chris Lucene RAD on Any Databases http://www.dbsight.net On 9/29/05, Eugeny N Dzhurinsky <[EMAIL PROTECTED]> wrote: > Is there any way to use

Re: lucene and UTF-8

2005-09-29 Thread Chris Hostetter
: I'm having some problems indexing my UTF-8 html pages. I am running : lucene on Linux and I cannot understand why does the index generated : depends on the locale of my operating system. : If I do set | grep LANG I get: LANG=el_GR which is Greek. If I set this : to en_US the index generated will

Re: Revisting FieldCacheImpl

2005-09-29 Thread Chris Hostetter
: I'm playing around with making the caching work at the : field name/type level, and getting rid of cacheing by : Reader. What this would mean is that all searchers : would use the same sorting; under certain : circumstances, a new sorting could be created using : data from an old, cached sortin

Lucene lock with new io

2005-09-29 Thread M å n i s h
Hi All, Lucene write-lock is making my life difficult, Some times there is no lock file in temp folder but still the tomcat log says write lock exception and Clearing the lock files is not helping me any more, I saw in mail-archive that increasing lock time out property in lock.java file