Al Boldi wrote:
Sam Liddicott wrote:
Tom Gregorovic wrote:
Hi,
it crossed my mind that it would be convenient to have one search
service for all the information about Lazarus and Free Pascal, which
are in forums, mail list archives, documentation reference, wiki,
Mantis, SVN repository, SVN log, blog ...
One possibility is Google Custom Search Engine
(http://www.google.com/coop/cse/):
* it is free for noncommercial usage, but I think it parses not all
resources The alternative is to use some web search engine running on
own server (e.g. http://www.mnogosearch.org/ ).
I recommend xapian; www.xapian.org

I've worked on it, I've used it and I've followed it's progress and when
I worked at Orange we hired one of the developers to enhance it.

It is based on a once-commercial search engine.

It's not bad,
you've clearly not used it much
but it's huge.
I don't think xapian is huge, although it can deal with huge data sets.
It's being used to index gmane.
I'm currently using swish-e, which is small and fast.
It is a probabilistic search engine that also uses boolean terms
The btree index for xapian has been re-written from scratch 3 times for maximum speed.

I don't know fully how swish-e and xapian compare but I can't have anyone maligning xapian :-)

in 2005 (by the looks of this message:
http://lists.tartarus.org/pipermail/xapian-devel/2005-September/000254.html
) the swish-e guys were willing to ditch their backend db for xapian. I don't know how mych better it has become since ethen.

I'd also suggest reading from this thread to see how xapian works and could work for lazarus:
http://lists.xapian.org/pipermail/xapian-discuss/2005-January/000681.html

Sam

Reply via email to