Vedr. Re: Design question [too many fields?]

2005-06-30 Thread Naimdjon Takhirov
Hi Chris, Isn't that going to be too many fields anyway? Days of year for the whole year ahead? The fromDate and toDate can span two months, and the customer wants the data to be available for one year. Naimdjon --- Chris Lu <[EMAIL PROTECTED]> wrote: > Mark, your suggestion will incur a

Re: Design question [too many fields?]

2005-06-30 Thread Chris Lu
Mark, your suggestion will incur another trip to the database. And if the search result set is large, filtering in the DB by pk is not really good. Erik, your original "date" field is good when there are not many dates (<1024) in the database. Otherwise, Range Query cannot handle it. My suggestion is

Nested Boolean Query

2005-06-30 Thread Yousef Ourabi
Hello: I have two fields in my document a title, term and content. I want to programmatically create a query that will return all documents that have both title AND content. I created a boolean query that looks like this. The logical return I want are any documents that Must have the alias passed to

Re: Does highlighter highlight phrases only?

2005-06-30 Thread markharw00d
Hi Erik, Yes I was thinking that code could form the basis of a new highlighter. I've just attached a QuerySpansExtractor to the bugzilla entry for the new highlighter. This class produces Spans from queries other than SpanXxxxQueries eg phrase, term and booleans. I'm thinking you can throw the

Re: Does highlighter highlight phrases only?

2005-06-30 Thread Erik Hatcher
On Jun 30, 2005, at 4:12 PM, Bohl, David wrote: FYI, I made an update to this (I'm the author) to handle span queries. I uploaded the updated code to the same bugzilla link. The code doesn't treat span queries like phrases - it just highlights the terms individually (if someone can find

RE: Does highlighter highlight phrases only?

2005-06-30 Thread Bohl, David
FYI, I made an update to this (I'm the author) to handle span queries. I uploaded the updated code to the same bugzilla link. The code doesn't treat span queries like phrases - it just highlights the terms individually (if someone can find a way to highlight it as a phrase, feel free to update it

Re: No.of Files in Directory

2005-06-30 Thread jian chen
Hi, My second suggestion is basically to store the user documents (word docs) directly in lucene index. 1) If you are using Lucene 1.4.3, you can do something like this: // suppose the word docs are now in byte array byte[] wordDoc = getUploadedWordDoc(); // add the byte array to lucene index D

Re: A query in multipleFields

2005-06-30 Thread Erik Hatcher
On Jun 30, 2005, at 7:41 AM, Daniel Cortes wrote: I have 2 indexes whose elements have the same structure, except that in one index I have a field.Keyword "type" with value "forum" and in the other index with the value "blog". How do I search both indexes at the same time? Use M

A query in multipleFields

2005-06-30 Thread Daniel Cortes
I have 2 indexes whose elements have the same structure, except that in one index I have a field.Keyword "type" with value "forum" and in the other index with the value "blog". How do I search both indexes at the same time? How can I restrict the results with only "forum" or "blog" t

Re: No.of Files in Directory

2005-06-30 Thread Erik Hatcher
On Jun 30, 2005, at 2:23 AM, Karthik N S wrote: From my experience with Lucene since 2004, I can say that you need to update the index once a day (rather than doing it for every upload), but if your requirement says you have to make the doc available on the run then you may do so. Karthik -

Re: lucene query

2005-06-30 Thread Paul Libbrecht
'not treating xml file as text file' is quite fuzzy; it doesn't say how you treat the files that you wish to perform retrieval on. It may be of interest to you that I am in the process of allowing search for mathematical formulae based on a Lucene index built from OMDoc-encoded documents (an XML sy

Re: Index Replication / Clustering

2005-06-30 Thread Stephane Bailliez
Nader Henein wrote: Considerations that you may want to think about when sanitizing your clustered indices: 1) Number of documents available vs. number of documents in the persistent store. 2) Are all the documents up to date (involves comparing the existence and the last date updated of Luce