I'm investigating possible alternatives for indexing/searching a very large dataset (2TB) of xml data from the pubmed database[1]. Does anyone have any experience working with indexes of this size? Granted the actual index size would be smaller than the source files, but I'm just curious how big the largest known lucene indexes are, and what sort of hardware they run on...assuming they're not behind closed doors at the Dept of Homeland Security ;-)

//Ed

[1] http://www.ncbi.nlm.nih.gov/entrez

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to