Can we just update one field of a document in a Lucene index, and leave other fields alone?

2010-09-08 Thread fulin tang
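If we could update only one field of a document in the index, a lot of jobs could be done here! The dream begins struggling at the edge of the city; the heart's far horizon holds fast in the instant of a footstep; my fate has buried an eternity of loneliness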

Re: Largest Lucene installation?

2010-09-08 Thread fulin tang
We now have 0.15 billion documents, with a source size of 1.5 TB, on 16 shards. I am very interested in how you get your job done. 2010/8/26 Nigel : > I'm curious about what the largest Lucene installations are, in terms of: > > - Greatest number of documents (

Re: How to do facet search in lucene, NOT solr ?

2010-09-08 Thread fulin tang
ocs into a > DocSet(bitset). Also getting facet fields into memory(doc id->field > value) then loop the DocSet to count it. > > 2010/8/30 fulin tang : >> we are building a search system on top of lucene, and we are now >> looking for a facet feature >> >> So
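
The approach quoted above (collect the matching docs into a bitset, hold the facet field's per-document values in memory, then loop over the set and count) can be sketched in plain Java. This is only an illustration under stated assumptions, not Solr's or Lucene's actual code: `FacetCounter`, `countFacets`, and the sample `genre` data are hypothetical names, and the field is assumed to be single-valued and already loaded into an array indexed by doc id.

```java
import java.util.BitSet;
import java.util.Map;
import java.util.TreeMap;

public class FacetCounter {

    /**
     * Counts how often each field value occurs among the matching documents.
     *
     * @param docSet      bit i is set when document i matched the query
     * @param fieldValues in-memory "doc id -> field value" array (assumed
     *                    single-valued; null means the doc has no value)
     */
    static Map<String, Integer> countFacets(BitSet docSet, String[] fieldValues) {
        Map<String, Integer> counts = new TreeMap<>();
        // Visit only the set bits, i.e. only the documents that matched.
        for (int doc = docSet.nextSetBit(0); doc >= 0; doc = docSet.nextSetBit(doc + 1)) {
            if (doc >= fieldValues.length) {
                break; // no field data beyond this doc id
            }
            String value = fieldValues[doc];
            if (value != null) {
                counts.merge(value, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Hypothetical sample data: one facet value per doc id.
        String[] genre = {"rock", "pop", "rock", null, "jazz", "pop"};
        BitSet hits = new BitSet(genre.length);
        hits.set(0);
        hits.set(2);
        hits.set(3);
        hits.set(5);
        System.out.println(countFacets(hits, genre)); // prints {pop=1, rock=2}
    }
}
```

The cost is one pass over the matching documents per facet field, so it stays cheap as long as the value array fits in memory; multi-valued fields would need a different in-memory layout.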

Re: Combine data from index and db before sorting and pagination

2010-09-08 Thread fulin tang
That is exactly what I am looking for now! Our mail search system has a field named flags (read/unread, etc.), and it changes after the email is indexed, so we need an update. But we only update one field, more exactly, one Field.Index.NOT_ANALYZED and Field.Store.YES field. How can we av

How to do facet search in lucene, NOT solr ?

2010-08-30 Thread fulin tang
We are building a search system on top of Lucene, and we are now looking for a facet feature. So is there an easy way to do this? By the way, we do not want to switch to Solr just for this!

Re: How to do alias(Pinyin) search in Lucene

2009-12-17 Thread fulin tang
Another way to do this: pinyin4j. You can convert all Chinese words to pinyin form first and index the pinyin form as a field; then you can search on it. See http://www.slideshare.net/tangfl/ss-2364878, in which we implement a pinyin search for our music search. 2009/12/16 Weiwei Wang : > Thanks

Re: To exit the while loop if match is found

2009-11-26 Thread fulin tang
Make sure your analyzer has called strToLower when indexing the data. 2009/11/26 Savvas-Andreas Moysidis : > hi, > > maybe you could not break but rather collect them in an ordered collection. > If it's a web app you are having you can then pass it up your view layer > wrapped in json? > > savvas > >

Re: Need help regarding implementation of autosuggest using jquery

2009-11-26 Thread fulin tang
By the way, we search Chinese words, so a trie does not look ideal for us either. 2009/11/27 fulin tang : > We have the same needs in our music search, and we found this is not a > good approach for performance reason . > > Did any one have experience of implement the autosuggestio

Re: Need help regarding implementation of autosuggest using jquery

2009-11-26 Thread fulin tang
We have the same needs in our music search, and we found this is not a good approach for performance reasons. Does anyone have experience implementing autosuggestion in a heavy production environment? Any suggestions? 2009/11/26 Anshum : > Try this, > Change the code as required: > -

Re: Is it a lucene bug?

2009-11-26 Thread fulin tang
Maybe you should take a look at the Scorer and Similarity families of classes; they will show you how the score is calculated. Make some changes to them and you will get what you want. We had the same problem and solved it by writing subclasses of DefaultSimilarity and BooleanScorer. 2009/11

Re: Is Lucene a good choice for PB scale mailbox search?

2009-11-25 Thread fulin tang
Thanks all for the good suggestions! But any ideas about storage? How can we make the indexes as small as possible? We know compression is the only way, but when and where is it best to compress for search? Thanks all again! 2009/11/24 Kay Kay : > fulin tang wrote: >> >> We a

Is Lucene a good choice for PB scale mailbox search?

2009-11-23 Thread fulin tang
We are going to add full-text search to our mailbox service. The problem is that we have more than 1 PB of mail there, and obviously we don't want to add another PB of storage for the search service, so we hope the index data will be small enough to store while search stays fast. The lucky thing is that