Hi Alan,

On 4/16/21 4:28 AM, Murphy, Alan E wrote:
Hi all,

I am looking for the 1000genomes Phase3 Reference Genome Sequence (equivalent 
to the Phase 2 version would be useful: BSgenome.Hsapiens.1000genomes.hs37d5 
https://bioconductor.org/packages/release/data/annotation/html/BSgenome.Hsapiens.1000genomes.hs37d5.html).
 The dataset I'm looking for is also found here for download: 
https://ctg.cncr.nl/software/MAGMA/ref_data/g1000_eur.zip

Is this available in Bioconductor?

I don't think we have that:

  library(BSgenome)
  grep("1000", available.genomes(), value=TRUE)
  # [1] "BSgenome.Hsapiens.1000genomes.hs37d5"

Note that BSgenome.Hsapiens.1000genomes.hs37d5 is a contributed package (by Julian Gehring). You're welcome to contribute a BSgenome data package for the 1000genomes Phase3 Reference Genome if you'd like.

Best,
H.

I want to use it in a package I'm developing. I know I could download it 
through the package when needed or store the dataset in as package data but I 
know neither of these solutions are not good practice for Bioconductor 
submission.

Kind regards,
Alan.

Alan Murphy
Bioinformatician
Neurogenomics lab
UK Dementia Research Institute
Imperial College London

        [[alternative HTML version deleted]]

_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel


--
Hervé Pagès

Bioconductor Core Team
hpages.on.git...@gmail.com

_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel

Reply via email to