In article <[EMAIL PROTECTED]>, Chris Lasher wrote:
> Hello,
> I have a rather large (100+ MB) FASTA file from which I need to
> access records in a random order. The FASTA format is a standard format
> for storing molecular biological sequences. Each record contains a
> header line for describing the sequence that begins with a '>'
> (right-angle bracket) followed by lines that contain the actual
> sequence data. Three example FASTA records are below:

Use Biopython. It has dictionary-style classes that wrap FASTA files using an index, so individual records can be looked up by key.

http://www.biopython.org

Dave
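For anyone curious what "wrap a FASTA file using an index" means in practice, here is a rough standard-library sketch of the idea. It is not Biopython's implementation: the names build_fasta_index and fetch_record are made up for illustration, and it assumes ASCII data with the record ID being the first word after the '>'. In current Biopython releases, Bio.SeqIO.index(filename, "fasta") gives a ready-made dictionary-like view and is probably the better choice for real work.

```python
import io


def build_fasta_index(handle):
    """Map each record ID (first word after '>') to the byte offset of its header.

    `handle` must be a seekable file object opened in binary mode.
    Hypothetical helper for illustration, not a Biopython function.
    """
    index = {}
    while True:
        offset = handle.tell()      # position of the line about to be read
        line = handle.readline()
        if not line:                # end of file
            break
        if line.startswith(b">"):
            fields = line[1:].split()
            record_id = fields[0].decode("ascii") if fields else ""
            index[record_id] = offset
    return index


def fetch_record(handle, index, record_id):
    """Seek straight to one record and return (header, sequence) as text."""
    handle.seek(index[record_id])
    header = handle.readline()[1:].strip().decode("ascii")
    parts = []
    for line in iter(handle.readline, b""):
        if line.startswith(b">"):   # next record starts here
            break
        parts.append(line.strip().decode("ascii"))
    return header, "".join(parts)


# Tiny in-memory stand-in for a large file; with real data you would use
# open("sequences.fasta", "rb") instead (the filename is an assumption).
demo = io.BytesIO(b">seq1 first example\nACGT\nTTGA\n>seq2 second example\nGGCC\n")
idx = build_fasta_index(demo)
print(fetch_record(demo, idx, "seq2"))
print(fetch_record(demo, idx, "seq1"))
```

The index costs one pass over the file, after which each lookup is a seek plus a read of that one record, so records can be pulled in any order. The index dict could also be pickled to disk to avoid rescanning a file that does not change.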