Re: Indexing Names in Lucene -- Thomas = Tom, etc

2008-12-04 Thread Grant Ingersoll
I believe these lists exists out on the Internet, just google for something like "most common first names" or "common nicknames" (yields: http://www.cc.kyoto-su.ac.jp/~trobb/nicklist.html for instance) If you want to dig deeper, you might look into named entity recognition research, and a

Re: Indexing Names in Lucene -- Thomas = Tom, etc

2008-12-03 Thread Khawaja Shams
Hi, Yes that is pretty obvious that I would have to index Tom, but I think you missed the point. I don't have a list of names with their nick names, and this is pretty common: Mike being Michael, Richard being Rich or Dick, William could be Bill or Will, etc. I thought I would check if there was

Re: Indexing Names in Lucene -- Thomas = Tom, etc

2008-12-03 Thread Ganesh
If you want to query for Tom, then you need to index the value Tom. Create one more field as Alias or add alias name as part of name field. Regards Ganesh - Original Message - From: "Khawaja Shams" <[EMAIL PROTECTED]> To: Sent: Wednesday, December 03, 2008 11:46 AM Subject: Indexing

Re: Indexing Names in Lucene -- Thomas = Tom, etc

2008-12-03 Thread Ian Lea
Hi To get from Thomas to Tom you'll need to use synonyms. For Thom you would have been able to use prefixes or wild cards. If you google for lucene synonyms you'll find loads of stuff. Also, I believe that Solr has built in support for synonyms. -- Ian. On Wed, Dec 3, 2008 at 6:16 AM, Khaw