So you want the data that is in Dataset 1 but only the column names that are also in Dataset 2:

How about:

 subset(DS1, select = names(DS1) %in% names(DS2) )

> DS1 <-read.table(textConnection("Individual SNP1 SNP2 SNP3 SNP4 SNP5
+ 1    A    G    T    C    A
+ 2    T    C    A    G    T
+ 3    A    C    T    C    A"),header=TRUE)
> DS2 <-read.table(textConnection("Individual SNP1 SNP3 SNP5 SNP6 SNP7
+ 4    A    T    T    G    C
+ 5    T    A    A    G    G
+ 6    A    A    T    C    G"),header=TRUE)

> subset(DS1, select= names(DS1) %in% names(DS2) )
  Individual SNP1 SNP3 SNP5
1          1    A    T    A
2          2    T    A    T
3          3    A    T    A

Tested!
--
David Winsemius
Heritage Labs

On Feb 27, 2009, at 12:27 PM, Josh B wrote:

Hello all,

I hope some of you can come to my rescue, yet again.

I have two genetic datasets, and I want one of the datasets to have only the columns that are in common with the other dataset.
Here is a toy example (my real datasets have hundreds of columns):

Dataset 1:

Individual    SNP1    SNP2    SNP3    SNP4    SNP5
1    A    G    T    C    A
2    T    C    A    G    T
3    A    C    T    C    A

Dataset 2:

Individual    SNP1    SNP3    SNP5    SNP6    SNP7
4    A    T    T    G    C
5    T    A    A    G    G
6    A    A    T    C    G

I want Dataset1 to have only columns that are also represented in Dataset 2, i.e., I want to generate a new Dataset 3 that looks like this:

Individual    SNP1    SNP3    SNP5
1    A    T    A
2    T    A    T
3    A    T    A

Does anyone know how I could do this? Keep in mind that this is not a simple merge, as in the "merge" function.

Thanks very much for your help everyone.
Josh B.




        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to