On 03/12/2014 03:02 PM, Wolfgang Huber wrote:
Hi Martin, Mike
a DESeq2 user brought up the observation that when he subsets a ‘DESeqDataSet’
object (the class inherits from ‘SummarizedExperiment’) by samples, he often
ends up with unused factor levels in the colData. (Esp. since the subsetting is
often to select certain subgroups). Would either of the following two make
sense:
- a ‘droplevels’ method for ‘SummarizedExperiment’ that efficiently and
conveniently removes unused levels, i.e.
x = x[, x$tissue %in% c(“guts”, “brains”)]
x = droplevels(x)
vs. x$tissue = droplevels(x$tissue)
- a ‘droplevels’ argument (default: FALSE)
x = x[, x$tissue %in% c(“guts”, “brains”), droplevels=TRUE]
there are a surprising number of places were levels could be dropped -- each
column of colData, each column of (possibly two levels of) 'mcols' on the row
data, and the seqlevels of the row data.
Does this make sense lower in the class hierarchy, e.g., Vector, as well as
GRanges/List?
Martin
Wolfgang
_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel
--
Computational Biology / Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N.
PO Box 19024 Seattle, WA 98109
Location: Arnold Building M1 B861
Phone: (206) 667-2793
_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel