[Bioperl-l] extracting subsequences
jason.stajich at duke.edu
Tue Oct 25 22:57:22 EDT 2005
Bio::DB::Fasta is the best way to do this for big sequences.
On Oct 25, 2005, at 4:48 PM, Amit Indap wrote:
> I have to extract subsequences from fasta files containing entire
> human chromosomes. For example I would like to extract bp
> 167506667..167523040. I know how to do this using the Bio::Seq and
> Bio::SeqIO APIs. The problem is it takes a long time to read in an
> entire fasta file containing a chromosome. Is there a way I can speed
> this up?
> The bp indices are taken from BLAT-ing my sequences to the genome. I
> could use megablast to find which contigs my sequences lie on, and
> then read in those files rather than the whole chromosome.
> Any suggestions would be helpful. Thanks.
> Amit Indap
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
More information about the Bioperl-l