For what it's worth, AOL uses a Solr cluster to handle searches for @aol users. Each user has his own index.
Otis -- Sematext is hiring -- http://sematext.com/about/jobs.html?mls Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR ----- Original Message ---- > From: fulin tang <tangfu...@gmail.com> > To: java-user@lucene.apache.org > Sent: Mon, November 23, 2009 9:35:57 PM > Subject: Is Lucene a good choice for PB scale mailbox search? > > We are going to add full-text search for our mailbox service . > > The problem is we have more than 1 PB mails there , and obviously we > don't want to add another PB storage for search service , so we hope > the index data will be small enough for storage while the search keeps > fast . > > The lucky is that every user just search with mails of their own , so > we can split the data into a lot of indexes instead of keeping them in > a big one . > > So, after all these concerns , the question is , is lucene a good > choice for this ? or which is the right way to do this ? Does anyone > have done this before ? > > All opinions and comments are welcome ! > > fulin > > > -- > 梦的开始挣扎于城市的边缘 > 心的远方执着在脚步的瞬间 > 我的宿命埋藏了寂寞的永远 > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org