Dear Josh, Try this: dataset1[,colnames(dataset1) %in% colnames(dataset2)]
Take a look at ?colnames and ?"%in%" for more information. HTH, Jorge On Fri, Feb 27, 2009 at 12:27 PM, Josh B <josh...@yahoo.com> wrote: > Hello all, > > I hope some of you can come to my rescue, yet again. > > I have two genetic datasets, and I want one of the datasets to have only > the columns that are in common with the other dataset. > Here is a toy example (my real datasets have hundreds of columns): > > Dataset 1: > > Individual SNP1 SNP2 SNP3 SNP4 SNP5 > 1 A G T C A > 2 T C A G T > 3 A C T C A > > Dataset 2: > > Individual SNP1 SNP3 SNP5 SNP6 SNP7 > 4 A T T G C > 5 T A A G G > 6 A A T C G > > I want Dataset1 to have only columns that are also represented in Dataset > 2, i.e., I want to generate a new Dataset 3 that looks like this: > > Individual SNP1 SNP3 SNP5 > 1 A T A > 2 T A T > 3 A T A > > Does anyone know how I could do this? Keep in mind that this is not a > simple merge, as in the "merge" function. > > Thanks very much for your help everyone. > Josh B. > > > > > [[alternative HTML version deleted]] > > ______________________________________________ > R-help@r-project.org mailing list > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.