[Bioperl-l] Indexing nr database
David.Messina at sbc.su.se
Tue Sep 7 05:23:42 EDT 2010
What do you need the index for?
If it's random retrieval of sequences using an accession or GI, you'd be better off using NCBI's own database indexing and retrieval tools. They're far faster than BioPerl.
They're distributed with Blast+ and available here:
Specifically, I'm talking about 'makeblastdb' and blastdbcmd'.
I'm not sure what you mean by "4g" nr, but there's an already-indexed version of nr available here:
You can use that directly with the BLAST+ database tools.
Also, you take a look at the cookbook at the end of the Blast+ user manual (available in the same download directory as Blast+ itself). Some nice examples there showing off the flexibility of this latest version of the software.
More information about the Bioperl-l