We've found that analysts often need to restrict seqlevels to certain
pre-defined sets of chromsomes. Given the variability across organisms, it
would be nice to have an abstraction.

We often see this in code:

keepSeqlevels(seqinfo, as.character(1:22)
keepSeqlevels(seqinfo, c(1:22, "X", "Y"))

Perhaps instead we could the more abstract and arguably more readable:

keepAutosomes(seqinfo)
keepPrimaryChromosomes(seqinfo)

Not sure of the best term for the latter. It refers to the set of
chromosomes that are not assembly fragments but are generally in the
nucleus (when there is one).

It would also be nice to have a sort,Seqinfo method that sorts by the
natural ordering of the chromosomes, if there is one. Maybe the function
needs its own name, but either way, this is something that really needs to
be in the infrastructure.

I think the existing SeqnameStyle infrastructure should be able to support
this.

Thoughts?

Michael

        [[alternative HTML version deleted]]

_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel

Reply via email to