Lotka's law and Lucene

2005-11-22 Thread aurora
20% of contributors does 80% of work. That's how Lotka describes the productivity of scientific authors back in 1926. An analysis on open source software development show this is equally applicable (http://www.javarants.com/B1823453972/C1460559707/E20051119163857/index.html). Lucene is used

create a single or multiple index?

2005-08-10 Thread aurora
I have two sources of data, let's say one is a set of articles and one is from forum messages. I'd like see the opinion on whether to create one single index or separate index for each kind of document. The user interface is not yet finalized. The search result may be presented as separated

O'Reilly on Native XML Databases

2005-04-07 Thread aurora
I was reading an interesting article on O'Reilly about Native XML Databases. http://www.xml.com/pub/a/2005/03/30/native.html My initial reaction is someone is trying to take on relational database again and this time it is a resurrection of hierarchical database. But as I read on, I find th