Recently heard about GenomicsDB from Intel: https://github.com/Intel-HLS/GenomicsDB/wiki
My understanding is that the upcoming GATK 4? will target it directly for storing genotypes. It's based on the array-oriented database TileDB, which I think distinguishes itself from e.g. SciDB by special support for sparse data. Seems TileDB might be a good candidate for a DelayedArray backend. But I guess we would to represent it as an entire SummarizedExperiment/VCF. Thoughts? Michael _______________________________________________ Bioc-devel@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/bioc-devel