Re: Search correction

2009-12-15 Thread Chris Were
I recall reading Google does it based on statistical analysis of what words users type. For example if I search for "googl" and then my next search is for "google" that is stored. Next time someone types "googl", "google" is suggested. Sorry, I don't have a source to link you to on that though. C

Re: Indexing domain names?

2009-11-08 Thread Chris Were
Thanks for the tips guys, got it working now. Cheers, Chris On Sun, Nov 8, 2009 at 6:43 AM, Erick Erickson wrote: > << indexed > are free form text. >>> > > If you try Ahmet's suggestion, PerFieldAnalyzerWrapper is your friend. The > snippet > above makes me wonder if you've seen this class.

Indexing domain names?

2009-11-07 Thread Chris Were
Hi, How do I go about indexing domain names? I currently index the domain, but it only works if I put the exact full domain in. For example: site:www.youtube.com (this works) site:youtube.com (this doesn't work) I am using the StandardAnalyzer as most of the other fields being indexed are free f

Re: How to setup a scalable deployment?

2009-10-08 Thread Chris Were
wrote: > > > On Thu, Oct 8, 2009 at 9:32 PM, Chris Were wrote: > >> Zoie looks very close to what I'm after, however my whole app is written >> in >> Python and uses PyLucene, so there is a non-trivial amount of work to make >> things work with Zoie. >> &

Re: How to setup a scalable deployment?

2009-10-08 Thread Chris Were
> > In this case, I'd say that if you have a reliable, scalable queueing system > for > getting indexing events distributed to all of your servers, then indexing > on > all replicas simultaneously can be the best way to have maximally realtime > search, either using the very new feature of "near re

Re: How to setup a scalable deployment?

2009-10-07 Thread Chris Were
Thanks for all the excellent replies. Lots of great of software mentioned that I'd never heard of -- and I thought I'd Google'd this subject to death already! Cheers, Chris.

How to setup a scalable deployment?

2009-10-06 Thread Chris Were
Hi, I've been using lucene for a project and it works great on the one dev. machine. Next step is to investigate the best method of deploying lucene so that multiple web servers can access the lucene directory of indexes. I see four potential options: 1) Each web server indexes the content separa