[R] Remove part of string in colname and calculate mean for columns groups

Joel Fürstenberg-Hägg Fri, 15 Jan 2010 02:06:37 -0800

Hi all,


I have two question. First, I wonder how to remove a part of the column names 
in a matrix? I would like to remove the "_ACCX" or "_NAX" part below. Is there 
a method where the "_" as well as all characters after i can be removed?

 

> dim(exprdata)
[1]  88 512

>

> colnames(exprdata[,c(1:20)])
 [1] "Akita_ACC1" "Akita_ACC2" "Akita_ACC3" "Akita_ACC4" "Alc.0_ACC1" 
"Alc.0_ACC2" "Alc.0_ACC3"
 [8] "Alc.0_ACC4" "Alc.0_ACC5" "Bl.1_ACC1"  "Bl.1_ACC2"  "Bl.1_ACC3"  
"Bl.1_ACC4"  "Bla.1_ACC1"
[15] "Bla.1_ACC2" "Bla.1_ACC3" "Bla.1_ACC4" "Blh.1_ACC1" "Blh.1_ACC2" 
"Blh.1_ACC3"

 

 

Secondly, I would like to calculate the mean of each column group in the 
matrix, for instance all columns beginning with "Akita", and save all new 
columns as a new matrix. 

 

For instance, use:

 

> head(exprdata[,c(1:4)])
            Akita_ACC1 Akita_ACC2 Akita_ACC3 Akita_ACC4
A100005-101   6.668931         NA         NA         NA
A122001-101  10.562564  11.706395  11.608989   8.289093
A128001-101  14.946749   8.112625   8.176438  10.104254
A133001-101   5.186679   6.089870   4.119589   3.168841
A133003-101         NA         NA  19.825480   2.587695
A134001-101   3.259402   4.835642   4.679607   4.490254

 

To get something like:

 

                         Akita
A100005-101   6.668931    

A122001-101   10.54176  

A128001-101  10.10425
A133001-101   3.168841

A133003-101   2.587695 
A134001-101   4.490254

 

 

However, the column groups are of different sizes (3-10 columns) so I guess 
I'll need a method based on the column names.

 

Anyone who can help me?

 

Best regards,

 

Joel
                                          
_________________________________________________________________
Nya Windows 7 - Hitta en dator som passar dig! Mer information. 
http://windows.microsoft.com/shop
        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Remove part of string in colname and calculate mean for columns groups

Reply via email to