> -----Original Message----- > From: Kipp, James [mailto:[EMAIL PROTECTED]] > Sent: Tuesday, February 11, 2003 4:25 PM > To: '[EMAIL PROTECTED]'; 'Perl' > Subject: RE: Finding Duplicates. > > > > I have to find duplicate customers in are customer file > (around 60,000 > > customers). The file has been exported into a pipe delimited file. > > > > CustCode|Ship2Code|Name|Addr1|Addr2|City|State|ZipCode|Phone|F > > ax|Country > > > The problem is the duplicates can be misspelled meaning you > can't just > > do an exact search. My thinking was a couple of passes. > Phone Numbers, > > Addresses, then address digits & City. > > So you are looking for duplicates on any field? Wouldn't you > just be looking for a dup on the Name? Shouldn't the CustCode > field be unique? > Has your company thought about importing these files into a > relational database. Would be much faster and easier.
This will not solve the problem of Paul, since it's possible by accident to insert the "same customer" with different customer code in the database. The problem here if I well understood, is to have a tool that will check such duplication. José. > > > > > -- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > **** DISCLAIMER **** "This e-mail and any attachment thereto may contain information which is confidential and/or protected by intellectual property rights and are intended for the sole use of the recipient(s) named above. Any use of the information contained herein (including, but not limited to, total or partial reproduction, communication or distribution in any form) by other persons than the designated recipient(s) is prohibited. If you have received this e-mail in error, please notify the sender either by telephone or by e-mail and delete the material from any computer". Thank you for your cooperation. For further information about Proximus mobile phone services please see our website at http://www.proximus.be or refer to any Proximus agent. -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]