-----Original Message-----
From: Paul D. Kraus [mailto:[EMAIL PROTECTED]
Sent: Friday, October 17, 2003 3:32 PM
To: [EMAIL PROTECTED]
Subject: Re: Description Search


On Fri, 17 Oct 2003 15:24:09 +0200, <[EMAIL PROTECTED]> wrote:

> This seems my University project.
>
> Some years ago I did something more or less similar.
>
> Insted of removing some words, I would just rate witch word. For instance
> count each word in your description and give a coeficient to it like 1 /
> (count * count). After for similarities you can consider to sum for each
> common word in the two text.
>
> Like this the most common words will not count in your rating and the not
> common words (product name, code, utility, ... for instance) would be 
> very
> hight rated.
>
>
> Marcos Rebelo
>
> -----Original Message-----
> From: Paul D. Kraus [mailto:[EMAIL PROTECTED]
> Sent: Friday, October 17, 2003 3:05 PM
> To: [EMAIL PROTECTED]
> Subject: Description Search
>
>
> Ok here is the scenioro I have two price lists that contain itemcode
> description cost list .... The important two are itemcode and 
> description.
> One is our list the other is the price list of our major competitior. Of
> course they use different itemcode to identify there items then we do. I
> need a way to search via the description that will let find the most
> relevant item in our system that matches there descriptions.
>
> Ideas i had. Seperate out each word of there description. Ignore common
> words /and the of .../ . Then have it search our description then based 
> on
> how many of those words are found togeather in our description give the
> item a rating. Then print out there item and descr followed by the top 5
> most likely canidates.
>
> This would be step one anyways. I have never done anything even remotely
> close to this so any insights, modules, or ideas that you may have will 
> be
> most welcome.
>
> Thanks in advance.
>
> Paul Kraus
> =-=-=-=-=-=-=-=-=-=-=
> PEL Supply Company
> Network Administrator
> 216.267.5775 Voice
> 216.267.6176 Fax
> 800.321.1263 Toll Free
> www.pelsupply.com
> =-=-=-=-=-=-=-=-=-=-=
>

Ok this could work but then I would need to know before hand what words 
are important and which ones arent. We carry over 40 thousand products 
having to do with orthotics, prosthetics, ect. Unless I misunderstood 
you.  Thanks agian!

-- 
Paul Kraus
=-=-=-=-=-=-=-=-=-=-=
PEL Supply Company
Network Administrator
216.267.5775 Voice
216.267.6176 Fax
800.321.1263 Toll Free
www.pelsupply.com
=-=-=-=-=-=-=-=-=-=-=


The most important words to one product shall not appear frequently in many
descriptions. The important words will be well rated. You have to reated
previously.

MArcos

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to