Filter documents with query

2007-06-17 Thread Monica Lestari Paramita
Hello everyone, I am currently using Lucene to search for documents. I'm having problems when I want to search for a query Q, but only in documents that has term T1 or T2 in it. At first I tried the query "Q AND (T1 OR T2)", but it affects the sorting of the results. For example: query: "Q", r

Re: negative queries

2007-06-17 Thread Daniel Noll
On Saturday 16 June 2007 11:39:35 Chris Hostetter wrote: > : The mailing list has already answered this question dozens of times. > : I've been wondering lately, does this list have a FAQ? If so, is this > : question on it? > > The wiki is open to editing by all. The FAQ page claims to be immuta

Re: Lucene index performance

2007-06-17 Thread Andreas Guther
Searching on multiple index files is incredible fast. We have 10 different index folders with different sizes. All folders together have a size of 7 GB. Results come back usual within less than 50 ms. Getting results out of the index i.e. reading documents is expensive and you will have to spe

Re: Lucene for chinese search

2007-06-17 Thread Chris Lu
There are three things to watch out for chinese or CJK languages: 1. The content source or database need to be encoded in UTF-8. 2. StandardAnalyzer doesn't support chinese words well. Use either ChineseAnalyzer or CJKAnalyzer. My experience is that CJKAnalyzer is a little better. 3. The user's q

Re: Several questions about scoring/sorting + random sorting in an image/related application

2007-06-17 Thread Chris Hostetter
: Thats what i discovered. The question is : Is the ValueSourceQuery : strong and fast enough to be : used confidently in a production environment? I looked at the source as i mentioned, i'm not intimately familiar with the new ValueSourceQuery, but the FunctionQuery it's based on is certainly pr

Re: Using Lucene to search Multiple Databases

2007-06-17 Thread Andreas Guther
Rajat, I don't know about the Web Interface you are mentioning but the task can be done with a little bit coding from your side. I would suggest indexing each database in its own index which allows to keep the access easily controlled. To find matches you will need to use a Multi Searcher. All

Re: Several questions about scoring/sorting + random sorting in an image/related application

2007-06-17 Thread Antoine Baudoux
Hi chris, I've really only had a chnce to skim this thread so far, but if i understand correctly, the goal is to get documents back in a "blended" order based on: 1) textual relevancy to the search input 2) recentness 3) a mapping of field values to arbitrary numeric weights which need

Lucene for chinese search

2007-06-17 Thread leelb
Hi, I would like to know whether Standard Analyzer allows searching of chinese words? And in order to support chinese searching, is there any encoding needed in order to develop the application? I'm currently using Jetty as web server, jsp as application, and search results will be saved in xml

Re: Lucene index performance

2007-06-17 Thread Mark Miller
Lee Li Bin wrote: Hi, I would like to know how's the performance during indexing and searching of results on a large index files would be like. Fast. And is it possible to create multiple index files and search across multiple index files? Yes. If possible, may I know how could it be d

Lucene index performance

2007-06-17 Thread Lee Li Bin
Hi, I would like to know how's the performance during indexing and searching of results on a large index files would be like. And is it possible to create multiple index files and search across multiple index files? If possible, may I know how could it be done? Thanks a lot. ---