Re:..

2012-04-04 Thread sol myr
hello! http://downloads.supportandmore.de/amrv-10-11.html?jhID=8aus2 - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

"AND" Query "under the hood" ?

2011-10-25 Thread sol myr
Hi, Could I please ask another question regarding Lucene "under the hood" / performance. I wondered how "AND" queries are implemented? Say we query for "+hello +world". Would Lucene simply find 2 lists of documents ("documents containing HELLO",  and "documents containing WORLD"), and then in

Re: performance question - number of documents

2011-10-24 Thread sol myr
Thanks again. - Original Message - From: Erick Erickson To: java-user@lucene.apache.org; sol myr Cc: Sent: Sunday, October 23, 2011 7:18 PM Subject: Re: performance question - number of documents "Why would it matter...top 5 matches" Because Lucene has to calculate the

performance question - number of documents

2011-10-23 Thread sol myr
Hi, We've noticed some Lucene performance phenomenon, and would appreciate an explanation from anyone familiar with Lucene internals (I know Lucene as a user, but haven't looked under its hood). We have a Lucene index of about 30 million records. We ran 2 queries: "AND" and "OR" ("+john +doe" v

Using BlockJoinQuery?

2011-10-11 Thread sol myr
Hi, I noticed that the new Lucene 3.4 supports "BlockJoinQuery" (allowing for 'join' or 'relation' between documents). I understand the documented limitations on the feature (nowhere near the power of SQL join), but it's still very useful for me :) My question: does anyone please have a code ex

Re: [Lucene] Frequencies and positions - are they stored per field?

2011-10-10 Thread sol myr
Thanks so much, this helped a lot :) - Original Message - From: Uwe Schindler To: java-user@lucene.apache.org; 'sol myr' Cc: Sent: Tuesday, October 4, 2011 12:14 PM Subject: RE: [Lucene] Frequencies and positions - are they stored per field? Hi, Term Vectors are somehow

Re: [Lucene] Frequencies and positions - are they stored per field?

2011-10-04 Thread sol myr
Thanks a lot. But then what's the added value of Field.TermVector? Can't it be deduced from the overall Lucene index? Or is it just inefficient to deduce? Thanks again :) - Original Message - From: Uwe Schindler To: java-user@lucene.apache.org; 'sol myr' Cc: Se

[Lucene] Frequencies and positions - are they stored per field?

2011-10-04 Thread sol myr
Hi, I use Lucene, but an not familiar with its internals. I'd appreciate help understanding whether Term Frequences and Positions - are stored  per Document of per Field? On the one hand, I never ask for "Field.TermVector" because I read it's only required for "MoreLikeThis" (which I don't ne

Large index merging/optimization?

2011-06-15 Thread sol myr
Hi, Our Lucene index grew to about 4 GB . Unfortunately it brought up a performance problem of slow file merging. We have: 1. A writer thread: once an Hour it looks for modified documents, and updates the Lucene index. Usually there are only few modifications, but sometimes we switch the entire co

PhraseQuery with huge "slop"?

2011-04-07 Thread sol myr
Hi, I need to run and "AND" query with a twist: give higher ranking for "exact match". So when searching for BIG BEN - Give high rank for the Document "BIG BEN is in London" - Lower rank for "It's a BIG day for my dear friend BEN" Following good advice from this list, I combined 2 separate que

Performance and index size (rephrased question)

2011-03-31 Thread sol myr
Hi, I'm rephrasing a previous performance question, in light of new data... I have a Lucene index of about 0.5 GB. Currently performance is good - up to 200 milliseconds per search (with complex boolean queries, but never retrieving more than 200 top results). The question: how much can the index

Re: Distributing a Lucene application?

2011-03-31 Thread sol myr
--- > Instant Scalable Full-Text Search On Any Database/Application > site: http://www.dbsight.net > demo: http://search.dbsight.com > Lucene Database Search in 3 minutes: > > http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes > > &g

Should I use MultiSearcher?

2011-03-24 Thread sol myr
Hi, I need to search a Catalog. Most users search *this* year's catalog, but on rare occasions they may ask for old products (from previous years). I'm trying to select between 2 options: 1) Keep huge big index for all years (where documents have a "year" field, so I can filter out the current ye

Re: Distributing a Lucene application?

2011-03-23 Thread sol myr
l-Text Search On Any Database/Application > site: http://www.dbsight.net > demo: http://search.dbsight.com > Lucene Database Search in 3 minutes: > http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes > > > > On 3/22/2011 1:30 AM, sol myr wrote: &g

Distributing a Lucene application?

2011-03-22 Thread sol myr
Hi, What are my options for distributing an application that uses Lucene? Our current application works against a database of INVENTORY. We schedule hourly checks for modified items (timestamp-based), and update a single Lucene index. Now we want to distribute out application, to a Grid, with fail

"shared fields"?

2011-03-09 Thread sol myr
Hi, I have several documents that share the same (large) searchable data. For example, say my Documents represent movies, and  2 movies share the same actorBiography of Brad Pitt (assuming I want to search movies by actorBiography words, far-fetched as it might seem): Document1:  - movieName=

Re: [Lucene] custom Query, and Stop Words

2011-02-10 Thread sol myr
other complications. -- Ian. On Wed, Feb 9, 2011 at 8:43 AM, sol myr wrote: > Hi, > > I'm building my own BooleanQuery (rather than using Query Parser). That's > because I need different defaults from my users: > If a user types:  java program > I need to run the query: +java* +

[Lucene] custom Query, and Stop Words

2011-02-09 Thread sol myr
Hi, I'm building my own BooleanQuery (rather than using Query Parser). That's because I need different defaults from my users: If a user types:  java program I need to run the query: +java* +program* (namely AND search, with Prefix so as to hit "programS", "programMER"). So naively I split the

Question on writer optimize() / file merging?

2011-01-16 Thread sol myr
Hi, I'm trying to understand the behavior of file merging / optimization. I see that whenever my IndexWriter calls 'commit()', it creates a new file (or fileS). I also see these files merged when calling 'optimize()' , as much as allowed by the parameter 'NoCFSRatio' . But I'm still trying to f

Re: Newbie: "Life span" of IndexWriter / IndexSearcher?

2011-01-16 Thread sol myr
newReader; searcher = new IndexSearcher(reader); } instead of reader.reopen(true); Bye. *Raf* On Sun, Jan 16, 2011 at 11:06 AM, sol myr wrote: > Hi, > > Thank you kindly for replying. > Unfortunately, reopen() doesn't help me see the changes. > Here's my test: > First

RE: Newbie: "Life span" of IndexWriter / IndexSearcher?

2011-01-16 Thread sol myr
Hi, Thank you kindly for replying. Unfortunately, reopen() doesn't help me see the changes. Here's my test: First I write & commit a document, and run a search - which correctly finds this document. Then I write & commit another document, re-open the reader and run another search - this should

Newbie: "Life span" of IndexWriter / IndexSearcher?

2011-01-13 Thread sol myr
Hi, We're writing a web application, which naturally needs - "IndexSearcher" when users use our search screen - "IndexWriter" in a background process that periodically updates and optimizes our index. Note our writer is exclusive - no other applications/threads ever write to our index files. Wh

Re: Maintaining index for "flattened" database tables

2011-01-13 Thread sol myr
option 2 can automatically record create/update/delete changes cleanly in a separate table. Either of these options help you to "remember to reindex" just the changed items. Cheers Mark - Original Message From: sol myr To: java-user@lucene.apache.org Sent: Thu, 13 Jan

Maintaining index for "flattened" database tables

2011-01-13 Thread sol myr
Hi, Our main data storage is MySql tables. We index it on Lucene in order to improve the search (boosting, proximate spelling, etc). We naturally maintain it - for example, to insert a new "Contract" entity, we have: addContract(Contract cont){ // INSERT into MySQL: hibernateSession.sa

Re: Newbie question: optimized files?

2011-01-12 Thread sol myr
/changes/Changes.html#3.0.3.changes_in_runtime_behavior so its a feature not a bug :) Simon On Tue, Jan 11, 2011 at 7:46 AM, sol myr wrote: > Hi, > > Continuing my question - I now suspect a bug in Lucene 3.0.3, because I ran > the test with Lucene 3.0.0 and it worked okay (no junk fil

Re: Newbie question: optimized files?

2011-01-10 Thread sol myr
Hi, Continuing my question - I now suspect a bug in Lucene 3.0.3, because I ran the test with Lucene 3.0.0 and it worked okay (no junk files)... could anyone please confirm? --- On Mon, 1/10/11, sol myr wrote: From: sol myr Subject: Newbie question: optimized files? To: java-user

Newbie question: optimized files?

2011-01-10 Thread sol myr
Hi, I'm new to Lucene (using 3.0.3), and just started to check out the behavior of the 'optimize()' method (which is quite important for our application). Could it be that 'optimize' cancels out the 'compoundFile' mode? Or am I doing something wrong? Here's my test: I create an indexWriter wi