Chris, How large is large? How may columns?
"Duplicate" across all columns of just some? Henrique gave you simple R answer. Perhaps doing in SQL is more efficient? eg SELECT DISTINCT <stuff> FROM <somewhere>; HTH, Jim Porzak TGN.com San Francisco, CA www.linkedin.com/in/jimporzak use R! Group SF: www.meetup.com/R-Users/ On Fri, Jun 5, 2009 at 9:38 AM, Chris Anderson <chris6...@netzero.net>wrote: > I have a large dataset that contain duplicate records. How do I identify > and remove duplicate records? > > > Chris Anderson > 707.315.8486 > www.sassydeals4u.com > ____________________________________________________________ > Free info for small business owners. Click here to find great products > geared for your business. > > http://thirdpartyoffers.netzero.net/TGL2241/fc/BLSrjpYWIQYSqjUJ0P8Kjx22OUKmSdbeF2JnHh5X1EZsmlucvn6niiClhHS/ > [[alternative HTML version deleted]] > > ______________________________________________ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.