RE: intra-word delimiters

2005-08-15 Thread Rajesh Munavalli
. Hope it helps... Rajesh Munavalli -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Mon 8/15/2005 7:47 PM To: java-user@lucene.apache.org Subject: Re: intra-word delimiters That was the plan, but step (4) really seems problematic. - term expansion this way can

Re: intra-word delimiters

2005-08-15 Thread Marvin Humphrey
On Aug 15, 2005, at 8:53 PM, Marvin Humphrey wrote: "Create a phrase query that when it encounters ab => { tokenlength => 2 } knows to look for something at position 3." Fencepost error! That should have been "position 2". Not that correcting the error makes the algo any more practical. ;)

Re: intra-word delimiters

2005-08-15 Thread Marvin Humphrey
On Aug 15, 2005, at 7:47 PM, Yonik Seeley wrote: That was the plan, but step (4) really seems problematic. - term expansion this way can lead to a lot of false matches - phrase queries with many bordering words break - setting term positions such that phrase queries work on all combos of subw

Re: intra-word delimiters

2005-08-15 Thread Yonik Seeley
That was the plan, but step (4) really seems problematic. - term expansion this way can lead to a lot of false matches - phrase queries with many bordering words break - setting term positions such that phrase queries work on all combos of subwords is non-trivial. It seems like a better approach

Re: intra-word delimiters

2005-08-15 Thread Marvin Humphrey
On Aug 15, 2005, at 3:16 PM, Yonik Seeley wrote: Another example: Source Text contains "Canon Powershot SD500 7MP Digital Elph" And I want to be able to match the following user queries: Power Shot SD 500 CanonPowerShotSD500 SD 500 7 MP digitalelph Canon-Powershot-SD 500 Any ideas? How abou
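
For readers who want to experiment with the example above, here is a rough sketch in plain Python. It is not taken from any message in this thread and does not use the Lucene API. It assumes that subword boundaries fall at case changes, letter/digit transitions, and any non-alphanumeric character, and that a query matches when its squashed, lowercased form lines up with subword boundaries in the squashed source text. The names subwords() and matches() are made up for illustration, and only ASCII letters and digits are handled.

```python
import re

# Assumed subword rule (not from the thread): a run of digits, a run of
# capitals not followed by a lowercase letter, or a lowercase run with an
# optional leading capital. Anything else acts as a delimiter.
_SUBWORD = re.compile(r"[0-9]+|[A-Z]+(?![a-z])|[A-Z]?[a-z]+")


def subwords(text):
    """Split text into lowercased subwords, e.g. 'SD500' -> ['sd', '500']."""
    return [m.group(0).lower() for m in _SUBWORD.finditer(text)]


def matches(source, query):
    """True if the squashed query starts and ends on subword boundaries
    of the squashed source."""
    parts = subwords(source)
    squashed = "".join(parts)

    # Offsets in the squashed source where a subword starts or ends.
    boundaries = {0}
    pos = 0
    for part in parts:
        pos += len(part)
        boundaries.add(pos)

    needle = "".join(subwords(query))
    if not needle:
        return False

    # Try every occurrence of the needle, not just the first one.
    start = squashed.find(needle)
    while start != -1:
        if start in boundaries and start + len(needle) in boundaries:
            return True
        start = squashed.find(needle, start + 1)
    return False


source = "Canon Powershot SD500 7MP Digital Elph"
for query in ["Power Shot SD 500", "CanonPowerShotSD500", "SD 500 7 MP",
              "digitalelph", "Canon-Powershot-SD 500"]:
    print(query, matches(source, query))
```

All five queries from the example match under these assumptions. The boundary check is only a partial guard against false matches: a query such as "shot SD" is rejected because "Powershot" in the source has no internal boundary, which may or may not be the behaviour a user wants. The sketch also says nothing about term positions or phrase queries inside an index.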