Re: Adding large files to index

2007-04-25 Thread Daniel Noll
David Xiao wrote: Consider reduce size of per file. Split them into smaller pieces will definitely help indexer working faster. A 50M pure text file is amazing size, very few text files reach that size: 50M. It must be very reasonable if you have to keep all information in such one big file. Wh

Re: Straigtforward stemming example? Dictionary needed?

2007-04-25 Thread Daniel Noll
[EMAIL PROTECTED] wrote: I guess there are a few points - it is impossible to stem with total accuracy using rules alone - combining a rule based stemmer with a dictionary could also be error prone. Unrelated words can have the same stem - consider the past tense of see and the stem of sawing (

Sorting with custom SortComparator

2007-04-25 Thread Theodan
Hello. I am trying to sort my query results on a String field called "AssetType" and then on the relevancy score, but I need a particular ordering of the possible values in "AssetType" (i.e. first "Video", then "Article", etc.). I have tried doing this with a custom SortComparator that returns I

Re: How often to optimize an Index?

2007-04-25 Thread Grant Ingersoll
Related discussion at http://www.gossamer-threads.com/lists/lucene/ java-dev/47895?search_string=optimize;#47895 Also, search this archive and the java-dev archive for optimize. On Apr 25, 2007, at 3:03 PM, Andreas Guther wrote: What are best practices for optimizing an Index? We have sev

Index Update Strategies

2007-04-25 Thread Andreas Guther
Hi We have an index of several GB in size which is updated very frequently-about every 2 seconds. Though it is desired to have changes updated to the index as soon as possible I wonder if this frequent updates can have negative affect on the search and data retrieval performance. Would it make m

Re[2]: Out of memory exception for big indexes

2007-04-25 Thread Artem
Hello Ivan, That was cool news! Thanks! :) The timings are surprisingly good. 10 mln docs sorted in 20s.. cool! Also it looks like sorting algorithm employed by Lucene is quite memory-economic. Not supporting multiple fields is in fact another limitation of my patch. I don't need it so I didn't i

Re: DBSight Turns Free! Instant Lucene Search on Database!

2007-04-25 Thread Chris Lu
Hi, Yes. You can easily create your own search, customize the display based on existing templates. You can put in your own analyzer, adjust your search ranking(e.g. based on price). I think the real effort to create a good search is collecting quality data into your database. Chris --- Laxmi

Re: DBSight Turns Free! Instant Lucene Search on Database!

2007-04-25 Thread Laxmilal Menaria
Hi, If I purchase the DBSight, then I will able to create my searchapplication same as DBSight ?, all the display is same.. ie. narrow by categoty, search speed, images... Please suggest me.. Thanks. Laxmilal Menaria On 4/24/07, Chris Lu <[EMAIL PROTECTED]> wrote: For those who may be inter

Customizing scoring

2007-04-25 Thread Diego Manilla Suárez
We have an index with fields than can be multiple (i.e. a document has two fields called “pets” that contain “dog” and “cat”). The problem comes when we calculate the score: If a BooleanQuery is created as the addition of two TermQuery TermQuery1 “pets:dog” TermQuery2 “pets:cat” BooleanQuery

Re: how to load index

2007-04-25 Thread karl wettin
25 apr 2007 kl. 11.58 skrev avishkar: i would like to know that what are the different ways to load the index created in lucene. Is it FSDirectory.getDirectory(.. you are looking for? -- karl

Re: Out of memory exception for big indexes

2007-04-25 Thread Ivan Vasilev
Hi Artem, Thank you very much for your mails :) So first I have to tell you that your patch works perfectly even with very big indexes - 40 GB (you can see the results bellow). The reason I to have bad test results last time is that I made a bit change (but I can not understand why this change

how to load index

2007-04-25 Thread avishkar
Hello, i would like to know that what are the different ways to load the index created in lucene. -- View this message in context: http://www.nabble.com/how-to-load-index-tf3644363.html#a10177720 Sent from the Lucene - Java Users mailing list archive at Nabble.com.

Re: Clustering in MultiSearcher Searchables

2007-04-25 Thread Chris Hostetter
: Suppose user makes a search for "X" and he get 200 hits from s[0], 150 hits : from s[1] and 300 hits from s[2]. Now we need to cluster/group the results : according to each searcher. it sounds like you don't really want a multisearcher .. it sounds like you just want to search each of the indiv