On Fri, 2003-10-17 at 09:04, Paul D. Kraus wrote:
> Ok here is the scenioro I have two price lists that contain itemcode 
> description cost list .... The important two are itemcode and description.
> One is our list the other is the price list of our major competitior. Of 
> course they use different itemcode to identify there items then we do. I 
> need a way to search via the description that will let find the most 
> relevant item in our system that matches there descriptions.
> 
> Ideas i had. Seperate out each word of there description. Ignore common 
> words /and the of .../ . Then have it search our description then based on 
> how many of those words are found togeather in our description give the 
> item a rating. Then print out there item and descr followed by the top 5 
> most likely canidates.
> 
> This would be step one anyways. I have never done anything even remotely 
> close to this so any insights, modules, or ideas that you may have will be 
> most welcome.

Paul,

Can you post a few lines from your price list and then a few of your
competitors so we can see how your data looks.

Also, for example purposes, say the item you want to match on is a
"Black Kettle" and your description is "Black Kettle", but their
description is "BLK KTTL".  Would there be occurrences like this?

Any chance your product numbers are contained in theirs or vice versa?

For example, your product number is "0-832450-12345-9" and theirs is
"12345".  12345 is contained in both and can be obtained from the first
with a simple regular expression.

Also, any similarities in any piece of the data would help solve this,
if there are any to point out.


Hope this helps,
Kevin
-- 
Kevin Old <[EMAIL PROTECTED]>


-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to