From j.scholtalbers at gmail.com Mon Feb 1 07:24:29 2010 From: j.scholtalbers at gmail.com (Jelle Scholtalbers) Date: Mon, 1 Feb 2010 13:24:29 +0100 Subject: [Bioperl-l] Bio::DB::SeqFeature::Store::memory -> filter_by_type very slow Message-ID: Hi, I used the Bio::DB::SeqFeature::Store::memory module to load in a GFF3 file which I could then use in my script in a 'queryable' way. To retrieve features I used for example $db->features(-type => 'BAC:FPC', -seq_id=>'chromosome0') However when doing a profile on my script I found out that 60% of the running time went into filter_by_type from Bio::DB::SeqFeature::Store::memory. Replacing this function with my @features = grep{$_->type eq 'BAC:FPC'} $db->features(-seq_id=>'chromosome0') which gave me the same results was just a fraction of the earlier run time. My script went from 60min. to 4min. for the same result and only changing this function (is called often). Can/Should this be fixed or is this just the faster way to do it? Cheers, Jelle From cjfields at illinois.edu Mon Feb 1 08:23:02 2010 From: cjfields at illinois.edu (Chris Fields) Date: Mon, 1 Feb 2010 07:23:02 -0600 Subject: [Bioperl-l] Bio::DB::SeqFeature::Store::memory -> filter_by_type very slow In-Reply-To: References: Message-ID: <3435CD81-A2B1-4419-995B-751BE3810017@illinois.edu> Jelle, Seems reasonable, but Lincoln and Scott know that code better and are better suited to comment on it. Lincoln, Scott? chris On Feb 1, 2010, at 6:24 AM, Jelle Scholtalbers wrote: > Hi, > I used the Bio::DB::SeqFeature::Store::memory module to load in a GFF3 file > which I could then use in my script in a 'queryable' way. To retrieve > features I used for example > $db->features(-type => 'BAC:FPC', -seq_id=>'chromosome0') > However when doing a profile on my script I found out that 60% of the > running time went into filter_by_type from > Bio::DB::SeqFeature::Store::memory. > Replacing this function with > my @features = grep{$_->type eq 'BAC:FPC'} > $db->features(-seq_id=>'chromosome0') > which gave me the same results was just a fraction of the earlier run time. > My script went from 60min. to 4min. for the same result and only changing > this function (is called often). > Can/Should this be fixed or is this just the faster way to do it? > > Cheers, > Jelle > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From maj at fortinbras.us Mon Feb 1 09:39:29 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Mon, 1 Feb 2010 09:39:29 -0500 Subject: [Bioperl-l] Bio::Tools::Run::StandAloneBlastPlus - Illegal seek error running blastn In-Reply-To: References: <1DB83E7D-DFDE-4BE7-9EC0-F78B8875DF55@bbsrc.ac.uk><56CC8EF7EEAD4E28A3B822ADA8AD74B1@NewLife><05171AE0-71F2-4102-8131-70C6019D0172@bbsrc.ac.uk> Message-ID: <4B93C35AE3564EA1ADFC3F9156517ECC@NewLife> This issue appears resolved; will close out #3003-- thanks MAJ ----- Original Message ----- From: "mike stubbington (BI)" To: "Mark A. Jensen" Cc: ; ; "Osborne" Sent: Friday, January 29, 2010 8:25 AM Subject: Re: [Bioperl-l] Bio::Tools::Run::StandAloneBlastPlus - Illegal seek error running blastn Hi Mark, Thanks for your continued help. It now fails with this: ------------- EXCEPTION ------------- MSG: /usr/local/ncbi/blast/bin/blastn call crashed: There was a problem running /usr/local/ncbi/blast/bin/blastn : BLAST Database error: No alias or index file found for nucleotide database [MouseGenome] in search path [/Volumes/stubbing/PerlScripts/5CTest/trunk::] STACK Bio::Tools::Run::WrapperBase::_run /Library/Perl/5.10.0/Bio/Tools/Run/WrapperBase/CommandExts.pm:1004 STACK Bio::Tools::Run::StandAloneBlastPlus::AUTOLOAD /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus.pm:1236 STACK Bio::Tools::Run::StandAloneBlastPlus::run /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus/BlastMethods.pm:267 STACK Bio::Tools::Run::StandAloneBlastPlus::AUTOLOAD /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus.pm:1233 STACK toplevel ./5CTest.pl:63 ------------------------------------- If I change the factory creation to: my $blastFactory = Bio::Tools::Run::StandAloneBlastPlus->new( -db_name => '/Users/stubbing/localBlast/MouseGenome' ); it fails with ------------- EXCEPTION ------------- MSG: DB name not valid STACK Bio::Tools::Run::StandAloneBlastPlus::new /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus.pm:516 STACK toplevel ./5CTest.pl:45 ------------------------------------- However I can run the following successfully from the command line: blastn -db /Users/stubbing/localBlast/MouseGenome -query querySequence.fasta Is there something wrong with how I'm referring to the blast database when I construct my factory? Thanks again, M On 28 Jan 2010, at 18:47, Mark A. Jensen wrote: > Hi Mike, > Believe I found the real bug causing the problem (was not accounting for > the db_dir parameter). Crashes should now also throw much more helpful > errors. Please try the code at r16774, and shout back. > thanks -- > MAJ > ----- Original Message ----- > From: "mike stubbington (BI)" > To: "Mark A. Jensen" > Cc: > Sent: Thursday, January 28, 2010 11:18 AM > Subject: Re: [Bioperl-l] Bio::Tools::Run::StandAloneBlastPlus - Illegal seek > error running blastn > > > Hi, > > Thanks for the suggestion. Unfortunately it still fails - error as follows: > > ------------- EXCEPTION ------------- > MSG: /usr/local/ncbi/blast/bin/blastn call crashed: There was a problem > running > /usr/local/ncbi/blast/bin/blastn : Illegal seek at > /Library/Perl/5.10.0/Bio/Tools/Run/WrapperBase/CommandExts.pm line 1000, > > line 532. > > STACK Bio::Tools::Run::WrapperBase::_run > /Library/Perl/5.10.0/Bio/Tools/Run/WrapperBase/CommandExts.pm:1005 > STACK Bio::Tools::Run::StandAloneBlastPlus::AUTOLOAD > /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus.pm:1236 > STACK Bio::Tools::Run::StandAloneBlastPlus::run > /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus/BlastMethods.pm:267 > STACK Bio::Tools::Run::StandAloneBlastPlus::AUTOLOAD > /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus.pm:1233 > STACK toplevel ./5CTest.pl:63 > ------------------------------------- > > M > > On 28 Jan 2010, at 15:56, Mark A. Jensen wrote: > >> Mike - please try updating your bioperl-live (the core) to the latest code >> (revision 16761 or so). >> CommandExts is a work in progress; from the stack errors it looks like you've >> got an older version. >> Try it then ping us back, if you would-- >> Thanks >> Mark >> ----- Original Message ----- >> From: "mike stubbington (BI)" >> To: >> Sent: Thursday, January 28, 2010 10:41 AM >> Subject: [Bioperl-l] Bio::Tools::Run::StandAloneBlastPlus - Illegal seek >> error >> running blastn >> >> >> Dear all, >> >> I am attempting to blast some primers against the mouse genome. I have >> created >> a >> local mouse genome blast database and I can search against it using 'blastn' >> at >> the command line. >> >> I have perl code that creates an array of bioperl sequence objects called >> @primers >> >> I then create a StandAloneBlastPlus factory using the following code? >> >> my $blastFactory = Bio::Tools::Run::StandAloneBlastPlus->new( >> -db_dir => '/Users/stubbing/localBlast/', >> -db_name => 'MouseGenome' >> ); >> >> and then attempt to blast my primers using this? >> >> my @shortPrimers; >> my $count=1; >> foreach (@primers) { >> my $currentSeq = $_; >> print "Checking primer $count/$primerNumber "; >> if ($_->length < 40) { >> push(@shortPrimers,$_); >> print "Too short!\n"; >> } >> else { >> print "BLASTing..."; >> my $blastResult = $blastFactory->blastn(-query => $currentSeq); >> } >> $count++; >> } >> >> This fails with the following error? >> >> ------------- EXCEPTION ------------- >> MSG: /usr/local/ncbi/blast/bin/blastn call crashed: There was a problem >> running >> /usr/local/ncbi/blast/bin/blastn : Illegal seek at >> /Library/Perl/5.10.0/Bio/Tools/Run/WrapperBase/CommandExts.pm line 989, >> >> line 532. >> >> STACK Bio::Tools::Run::WrapperBase::_run >> /Library/Perl/5.10.0/Bio/Tools/Run/WrapperBase/CommandExts.pm:994 >> STACK Bio::Tools::Run::StandAloneBlastPlus::AUTOLOAD >> /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus.pm:1236 >> STACK Bio::Tools::Run::StandAloneBlastPlus::run >> /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus/BlastMethods.pm:267 >> STACK Bio::Tools::Run::StandAloneBlastPlus::AUTOLOAD >> /Library/Perl/5.10.0/Bio/Tools/Run/StandAloneBlastPlus.pm:1233 >> STACK toplevel ./5CTest.pl:63 >> ------------------------------------- >> >> Line 63 in my code is (as you might expect) the one that calls blastn on my >> factory object. >> >> I'd appreciate any help you might be able to provide to shed light on this. >> >> Thanks in advance, >> >> Mike >> >> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> >> > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > _______________________________________________ Bioperl-l mailing list Bioperl-l at lists.open-bio.org http://lists.open-bio.org/mailman/listinfo/bioperl-l From jason at bioperl.org Mon Feb 1 14:48:17 2010 From: jason at bioperl.org (Jason Stajich) Date: Mon, 1 Feb 2010 11:48:17 -0800 Subject: [Bioperl-l] Fwd: megablast using bioperl References: <624104.84596.qm@web30907.mail.mud.yahoo.com> Message-ID: Xiujing - Your message is best asked on the mailing list. Begin forwarded message: > From: Xiujing Gu > Date: February 1, 2010 7:58:55 AM PST > To: jason at bioperl.org > Subject: megablast using bioperl > > Hello Jason: > > After numerous googling, I found you might be the only person who > knows about this subject. Would you so kindly show me an example of > setting up megablast using bioperl Bio::Tools::Run::StandAloneBlast? > > Thanks a lot! > > Best regards, > Xiujing From Russell.Smithies at agresearch.co.nz Mon Feb 1 15:12:07 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Tue, 2 Feb 2010 09:12:07 +1300 Subject: [Bioperl-l] Fwd: megablast using bioperl In-Reply-To: References: <624104.84596.qm@web30907.mail.mud.yahoo.com> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61B820F2@exchsth.agresearch.co.nz> Have you read the docs? It's very well documented: http://search.cpan.org/~cjfields/BioPerl-1.6.1/Bio/Tools/Run/StandAloneBlast.pm Have you searched the BioPerl wiki? Again, well documented with examples: http://www.bioperl.org/wiki/HOWTO:StandAloneBlast And if that doesn't help, perhaps you need to improve your Google skills: http://www.google.com/support/websearch/?ctx=web --Russell > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > bounces at lists.open-bio.org] On Behalf Of Jason Stajich > Sent: Tuesday, 2 February 2010 8:48 a.m. > To: BioPerl List > Cc: xiujingg at yahoo.com > Subject: [Bioperl-l] Fwd: megablast using bioperl > > Xiujing - > Your message is best asked on the mailing list. > > Begin forwarded message: > > > From: Xiujing Gu > > Date: February 1, 2010 7:58:55 AM PST > > To: jason at bioperl.org > > Subject: megablast using bioperl > > > > Hello Jason: > > > > After numerous googling, I found you might be the only person who > > knows about this subject. Would you so kindly show me an example of > > setting up megablast using bioperl Bio::Tools::Run::StandAloneBlast? > > > > Thanks a lot! > > > > Best regards, > > Xiujing > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l ======================================================================= Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately. ======================================================================= From maj at fortinbras.us Mon Feb 1 15:22:40 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Mon, 1 Feb 2010 15:22:40 -0500 Subject: [Bioperl-l] Fwd: megablast using bioperl In-Reply-To: References: <624104.84596.qm@web30907.mail.mud.yahoo.com> Message-ID: <68636D1D473B459CAF3DE0FF890F7C40@NewLife> Xiujing-- If you choose to use blast+, this is pretty easily done with Bio::Tools::Run::StandAloneBlastPlus : use Bio::Tools::Run::StandAloneBlastPlus; $ENV{BLASTPLUSDIR} = "/path/to/blast+/bin"; $fac = Bio::Tools::Run::StandAloneBlastPlus->new( -db_name => '/path/to/blastdb/mydb' ); $result = $fac->blastn( -query => 'myqueryseqs.fas' ); while ($hit = $result->next_hit) { # process hits } Here, blastn is used, because in blast+, megablast is the default algorithm. --to specify others, you set the -task parameter: e.g., legacy blastn would be $result = $fac->blastn( -query =. 'myqueryseqs.fas', -task => 'blastn' ); For much more information, please look at http://bioperl.org/wiki/HOWTO:SearchIO and http://bioperl.org/wiki/HOWTO:BlastPlus Many questions you might have will be answered in those pages. MAJ ----- Original Message ----- From: "Jason Stajich" To: "BioPerl List" Cc: Sent: Monday, February 01, 2010 2:48 PM Subject: [Bioperl-l] Fwd: megablast using bioperl > Xiujing - > Your message is best asked on the mailing list. > > Begin forwarded message: > >> From: Xiujing Gu >> Date: February 1, 2010 7:58:55 AM PST >> To: jason at bioperl.org >> Subject: megablast using bioperl >> >> Hello Jason: >> >> After numerous googling, I found you might be the only person who >> knows about this subject. Would you so kindly show me an example of >> setting up megablast using bioperl Bio::Tools::Run::StandAloneBlast? >> >> Thanks a lot! >> >> Best regards, >> Xiujing > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From abhishek.vit at gmail.com Mon Feb 1 15:36:18 2010 From: abhishek.vit at gmail.com (Abhishek Pratap) Date: Mon, 1 Feb 2010 15:36:18 -0500 Subject: [Bioperl-l] Pulling down data from NCBI Message-ID: Hi All I looking to batch download some 34K nucleotide sequences, corresponding to a NCBI accession number. I tired the following and getting an error. Has it got anything to do with recent update to code that Chris was discussing. my $factory = Bio::DB::EUtilities->new ( -eutil => 'efecth', -db => 'nucleotide', -retype => 'fasta', -id => $id ); ----------- EXCEPTION: Bio::Root::Exception ------------- MSG: efecth not supported STACK: Error::throw STACK: Bio::Root::Root::throw /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 STACK: Bio::Tools::EUtilities::EUtilParameters::eutil /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilParameters.pm:452 STACK: Bio::Root::RootI::_set_from_args /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 STACK: Bio::Tools::EUtilities::EUtilParameters::new /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilParameters.pm:193 STACK: Bio::DB::EUtilities::new /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 STACK: ./getDatafromNCBI.pl:9 -Abhi From cjfields at illinois.edu Mon Feb 1 15:38:10 2010 From: cjfields at illinois.edu (Chris Fields) Date: Mon, 1 Feb 2010 14:38:10 -0600 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: References: Message-ID: <7CDCC994-B84A-48B8-8A91-346579A74378@illinois.edu> that would be 'efetch', not 'efecth'. chris On Feb 1, 2010, at 2:36 PM, Abhishek Pratap wrote: > Hi All > > I looking to batch download some 34K nucleotide sequences, > corresponding to a NCBI accession number. I tired the following and > getting an error. Has it got anything to do with recent update to code > that Chris was discussing. > > > > my $factory = Bio::DB::EUtilities->new ( > -eutil => 'efecth', > -db => 'nucleotide', > -retype => 'fasta', > -id => $id > ); > > > ----------- EXCEPTION: Bio::Root::Exception ------------- > MSG: efecth not supported > STACK: Error::throw > STACK: Bio::Root::Root::throw > /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 > STACK: Bio::Tools::EUtilities::EUtilParameters::eutil > /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilParameters.pm:452 > STACK: Bio::Root::RootI::_set_from_args > /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 > STACK: Bio::Tools::EUtilities::EUtilParameters::new > /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilParameters.pm:193 > STACK: Bio::DB::EUtilities::new > /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 > STACK: ./getDatafromNCBI.pl:9 > > > -Abhi > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From Kevin.M.Brown at asu.edu Mon Feb 1 15:39:28 2010 From: Kevin.M.Brown at asu.edu (Kevin Brown) Date: Mon, 1 Feb 2010 13:39:28 -0700 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: References: Message-ID: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> Looks like you've misspelled one of the parameters. It should be 'efetch' not 'efecth' Kevin Brown Center for Innovations in Medicine Biodesign Institute Arizona State University > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org > [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of > Abhishek Pratap > Sent: Monday, February 01, 2010 1:36 PM > To: bioperl-l at lists.open-bio.org > Subject: [Bioperl-l] Pulling down data from NCBI > > Hi All > > I looking to batch download some 34K nucleotide sequences, > corresponding to a NCBI accession number. I tired the following and > getting an error. Has it got anything to do with recent update to code > that Chris was discussing. > > > > my $factory = Bio::DB::EUtilities->new ( > -eutil => 'efecth', > -db => 'nucleotide', > -retype => 'fasta', > -id => $id > ); > > > ----------- EXCEPTION: Bio::Root::Exception ------------- > MSG: efecth not supported > STACK: Error::throw > STACK: Bio::Root::Root::throw > /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 > STACK: Bio::Tools::EUtilities::EUtilParameters::eutil > /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar > ameters.pm:452 > STACK: Bio::Root::RootI::_set_from_args > /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 > STACK: Bio::Tools::EUtilities::EUtilParameters::new > /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar > ameters.pm:193 > STACK: Bio::DB::EUtilities::new > /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 > STACK: ./getDatafromNCBI.pl:9 > > > -Abhi > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > From abhishek.vit at gmail.com Mon Feb 1 15:45:30 2010 From: abhishek.vit at gmail.com (Abhishek Pratap) Date: Mon, 1 Feb 2010 15:45:30 -0500 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> Message-ID: Thank you guys for very quick responses. My bad I trusted my fingers. Now that this is working the output that I am getting is not what I want. I am sure I am missing the correct way of doing it. So If I search the Nucleotide db @NCBI for this accession number " AAPP01000000", I see some 34 k records. What I need to do is pull down those sequences as fasta files. I am referring to http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook but dint quite find a similar example. Thanks! -Abhi On Mon, Feb 1, 2010 at 3:39 PM, Kevin Brown wrote: > Looks like you've misspelled one of the parameters. It should be > 'efetch' not 'efecth' > > Kevin Brown > Center for Innovations in Medicine > Biodesign Institute > Arizona State University > >> -----Original Message----- >> From: bioperl-l-bounces at lists.open-bio.org >> [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of >> Abhishek Pratap >> Sent: Monday, February 01, 2010 1:36 PM >> To: bioperl-l at lists.open-bio.org >> Subject: [Bioperl-l] Pulling down data from NCBI >> >> Hi All >> >> I looking to batch download some 34K nucleotide sequences, >> corresponding to a NCBI accession number. I tired the following and >> getting an error. Has it got anything to do with recent update to code >> that Chris was discussing. >> >> >> >> my $factory = Bio::DB::EUtilities->new ?( >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -eutil ?=> 'efecth', >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -db ? ? => ? ? ?'nucleotide', >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -retype => ? ? ?'fasta', >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -id ? ? ? ? ? ? => $id >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ); >> >> >> ----------- EXCEPTION: Bio::Root::Exception ------------- >> MSG: efecth not supported >> STACK: Error::throw >> STACK: Bio::Root::Root::throw >> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 >> STACK: Bio::Tools::EUtilities::EUtilParameters::eutil >> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >> ameters.pm:452 >> STACK: Bio::Root::RootI::_set_from_args >> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 >> STACK: Bio::Tools::EUtilities::EUtilParameters::new >> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >> ameters.pm:193 >> STACK: Bio::DB::EUtilities::new >> /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 >> STACK: ./getDatafromNCBI.pl:9 >> >> >> -Abhi >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > From senthil.debian at gmail.com Mon Feb 1 15:45:46 2010 From: senthil.debian at gmail.com (Senthil Kumar M) Date: Mon, 1 Feb 2010 15:45:46 -0500 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: References: Message-ID: <56b822011002011245w6f97af9cpfe986daa2e593974@mail.gmail.com> On Mon, Feb 1, 2010 at 3:36 PM, Abhishek Pratap wrote: > > my $factory = Bio::DB::EUtilities->new ?( > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?-eutil ?=> 'efecth', > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?-db ? ? => ? ? ?'nucleotide', > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?-retype => ? ? ?'fasta', > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?-id ? ? ? ? ? ? => $id > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?); > > > ----------- EXCEPTION: Bio::Root::Exception ------------- > MSG: efecth not supported Hi, Shouldn't that be efetch? Senthil From Russell.Smithies at agresearch.co.nz Mon Feb 1 16:02:12 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Tue, 2 Feb 2010 10:02:12 +1300 Subject: [Bioperl-l] Fwd: megablast using bioperl In-Reply-To: <718515.61675.qm@web30905.mail.mud.yahoo.com> References: <18DF7D20DFEC044098A1062202F5FFF32C61B820F2@exchsth.agresearch.co.nz> <718515.61675.qm@web30905.mail.mud.yahoo.com> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61B82183@exchsth.agresearch.co.nz> Sorry, I missed the megablast bit. There's a Bio::SearchIO module for reading megablast output but I don't think there's any built-in way of executing it. You may have to use system() to execute the command then read the output back into Bio::SearchIO Have you had a look at Blastplus? http://doc.bioperl.org/releases/bioperl-current/bioperl-run/lib/Bio/Tools/Run/BlastPlus.html --Russell From: Xiujing Gu [mailto:xiujingg at yahoo.com] Sent: Tuesday, 2 February 2010 9:39 a.m. To: 'BioPerl List'; Smithies, Russell Subject: RE: [Bioperl-l] Fwd: megablast using bioperl Thanks for the links! But, they only talk about the regular blastall program, not the megablast. I am not sure how can I switch from blastall to megablast. I tried to just call megablast, but I got a very differenct report. $VAR1 = bless( { 'primary_seq' => bless( { 'display_id' => 'test', '_seq_length' => undef, '_root_verbose' => 0, 'seq' => 'TGAACCTAGAAGTATTTGTTGCTGGGTTTGTCTTCAGGTTCTGTTGCTCGGTTTTCTAGTTCCCCACCTAGTCTGGGTTACTCTGCAGCTACTTTTGCATTACAATGGCCTTGGTGAGACTGGTAGACGGGATTAACTGAGAATTCACAAGGGTGGGTCAGTAGGGGGTGTGCCCGCCAGGAGGGGTGGGTCTAAGGTGATAGAGCCTTCATTATAAATCTAGAGACTCCAGGATTTTAACGTTCTGCTGGACTGAGCTGGTTGCCTCATGTTATTATGCAGGCAACTCACTTTATCCCAATTTCTTGATACTTTTCCTTCTGGAGGTCCTATTTCTCTAACATCTTCCAGAAAAGTCTTAAAGCTGCCTTAACCTTTTTTCCAGTCCACCTCTTAAATTTTTTCCTCCTCTTCCTCTATACTAACATGAGTGTGGATCCAGCTTGTCCCCAAAGCTTGCCTTGCTTTGAAGCATCCGACTGTAAAGAATCTTCACCTATGCCTGTGATTTGTGGGCCTGAAGAAAACTATCCATCCTTGCAAATGTCTTCTGCTGAGATGCCTCACACGGAGACTGGTAAGAAAGAAATTTATCCTTGAAAGGCCAAGTTCCTTAAGGGAAAAGAGAGAAGGAGAGAGGGTTAAGGGATCATTTCCCTCTTGAGCAATGATGGACCATTACTATAAAGAAGTGTTATTATCAACTAATCCTCTGGAAACCCCTTTTTCCATTATAACTTGGTGGCACCTGCCCTTTGAACTATGTCCCAGGTCTCAGGAGTGTGCATTGAGTTGAAGGACACAGAATTCGGCAGTTGAACAGTGTGCAGTAAGTTTGAGAACCTATGGGCTTAGGCATGGTGGAAACAAAAATGTATCGTTATAGTTAAATGAAGGTGATGTGTACATCTTCACATAGTGCTGGACACATGTGAATAAATAGCAGATTTATTGCTAATTAGCCA GAAGACCTAACGTCATAGCTCAGGGATGAGCATGATTTTGTTTTGCCAAAAATGGCATGGCAAATCACGATGAGATTTCTGTAATACATAATTTGGGTAATTCTTTCTATGTCAGTAACGGCTGTCTCTTCTCCATTCTCTGGGTTTGTGGATGTTACTGGGCAGCTCTGAGTTTGGGAGCACCTCCCATGTCTAATTCTCCTAAGTCCTGGGAAGCGTTGACCCAACTTTATGGTAAAGATAATTCCAGAAAGTTTAATCTACTGACAGTCAAACAGAATGTAGCTAGAAGTCCAGTTTGGCTTCAAAACCTGTGCTAGTACTCATGCTTCTGACTGGTAGCTGCAAGGGGTGGGGGATACTCGGGATACTCATAAAGCCGCTACCACTTTTTTGAAAATCAATTTTTCAGTAGTTTTCAAAAACTTGAGAATGAACCAACTTTACCAAGAATG', 'alphabet' => 'dna' }, 'Bio::PrimarySeq' ), '_root_verbose' => 0 }, 'Bio::Seq' ); Can't locate object method "nextSbjct" via package "Bio::Seq" Thanks, Xiujing Gu --- On Mon, 2/1/10, Smithies, Russell wrote: From: Smithies, Russell Subject: RE: [Bioperl-l] Fwd: megablast using bioperl To: "'BioPerl List'" Cc: "'xiujingg at yahoo.com'" Date: Monday, February 1, 2010, 3:12 PM Have you read the docs? It's very well documented: http://search.cpan.org/~cjfields/BioPerl-1.6.1/Bio/Tools/Run/StandAloneBlast.pm Have you searched the BioPerl wiki? Again, well documented with examples: http://www.bioperl.org/wiki/HOWTO:StandAloneBlast And if that doesn't help, perhaps you need to improve your Google skills: http://www.google.com/support/websearch/?ctx=web --Russell > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > bounces at lists.open-bio.org] On Behalf Of Jason Stajich > Sent: Tuesday, 2 February 2010 8:48 a.m. > To: BioPerl List > Cc: xiujingg at yahoo.com > Subject: [Bioperl-l] Fwd: megablast using bioperl > > Xiujing - > Your message is best asked on the mailing list. > > Begin forwarded message: > > > From: Xiujing Gu > > > Date: February 1, 2010 7:58:55 AM PST > > To: jason at bioperl.org > > Subject: megablast using bioperl > > > > Hello Jason: > > > > After numerous googling, I found you might be the only person who > > knows about this subject. Would you so kindly show me an example of > > setting up megablast using bioperl Bio::Tools::Run::StandAloneBlast? > > > > Thanks a lot! > > > > Best regards, > > Xiujing > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l ======================================================================= Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately. ======================================================================= From abhishek.vit at gmail.com Mon Feb 1 16:31:45 2010 From: abhishek.vit at gmail.com (Abhishek Pratap) Date: Mon, 1 Feb 2010 16:31:45 -0500 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> Message-ID: I was able to work this out. Thanks! How do I retrieve a long list of sequences using a query? from Eutilities How To helped. -Abhi On Mon, Feb 1, 2010 at 3:45 PM, Abhishek Pratap wrote: > Thank you guys for very quick responses. ?My bad I trusted my fingers. > > Now that this is working the output that I am getting is not what I > want. I am sure I am missing the correct way of doing it. So If I > search the Nucleotide db @NCBI for this accession number " > AAPP01000000", I see some 34 k records. What I need to do is pull down > those sequences as fasta files. > > I am referring to > http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook ?but dint quite > find a similar example. > > Thanks! > -Abhi > > On Mon, Feb 1, 2010 at 3:39 PM, Kevin Brown wrote: >> Looks like you've misspelled one of the parameters. It should be >> 'efetch' not 'efecth' >> >> Kevin Brown >> Center for Innovations in Medicine >> Biodesign Institute >> Arizona State University >> >>> -----Original Message----- >>> From: bioperl-l-bounces at lists.open-bio.org >>> [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of >>> Abhishek Pratap >>> Sent: Monday, February 01, 2010 1:36 PM >>> To: bioperl-l at lists.open-bio.org >>> Subject: [Bioperl-l] Pulling down data from NCBI >>> >>> Hi All >>> >>> I looking to batch download some 34K nucleotide sequences, >>> corresponding to a NCBI accession number. I tired the following and >>> getting an error. Has it got anything to do with recent update to code >>> that Chris was discussing. >>> >>> >>> >>> my $factory = Bio::DB::EUtilities->new ?( >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -eutil ?=> 'efecth', >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -db ? ? => ? ? ?'nucleotide', >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -retype => ? ? ?'fasta', >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -id ? ? ? ? ? ? => $id >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ); >>> >>> >>> ----------- EXCEPTION: Bio::Root::Exception ------------- >>> MSG: efecth not supported >>> STACK: Error::throw >>> STACK: Bio::Root::Root::throw >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 >>> STACK: Bio::Tools::EUtilities::EUtilParameters::eutil >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >>> ameters.pm:452 >>> STACK: Bio::Root::RootI::_set_from_args >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 >>> STACK: Bio::Tools::EUtilities::EUtilParameters::new >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >>> ameters.pm:193 >>> STACK: Bio::DB::EUtilities::new >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 >>> STACK: ./getDatafromNCBI.pl:9 >>> >>> >>> -Abhi >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >> > From cjfields at illinois.edu Mon Feb 1 16:31:48 2010 From: cjfields at illinois.edu (Chris Fields) Date: Mon, 1 Feb 2010 15:31:48 -0600 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> Message-ID: <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> Abhi, The accession in question is for a record containing a set of sequences, not just one sequence (it's a contig record). The NCBI web interface is performing an esearch on this to get 34K seqs, the equivalent with EUtilities is: ================================ use Bio::DB::EUtilities; my $id = 'AAPP01000000[ACCN]'; my $factory = Bio::DB::EUtilities->new ( -eutil => 'esearch', -db => 'nucleotide', -term => $id, -usehistory => 'y'); say $factory->get_count; # do more here... ================================ The 'do more here' part is covered in the cookbook, and will require you retrieving the seqs in chunks. chris On Feb 1, 2010, at 2:45 PM, Abhishek Pratap wrote: > Thank you guys for very quick responses. My bad I trusted my fingers. > > Now that this is working the output that I am getting is not what I > want. I am sure I am missing the correct way of doing it. So If I > search the Nucleotide db @NCBI for this accession number " > AAPP01000000", I see some 34 k records. What I need to do is pull down > those sequences as fasta files. > > I am referring to > http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook but dint quite > find a similar example. > > Thanks! > -Abhi > > On Mon, Feb 1, 2010 at 3:39 PM, Kevin Brown wrote: >> Looks like you've misspelled one of the parameters. It should be >> 'efetch' not 'efecth' >> >> Kevin Brown >> Center for Innovations in Medicine >> Biodesign Institute >> Arizona State University >> >>> -----Original Message----- >>> From: bioperl-l-bounces at lists.open-bio.org >>> [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of >>> Abhishek Pratap >>> Sent: Monday, February 01, 2010 1:36 PM >>> To: bioperl-l at lists.open-bio.org >>> Subject: [Bioperl-l] Pulling down data from NCBI >>> >>> Hi All >>> >>> I looking to batch download some 34K nucleotide sequences, >>> corresponding to a NCBI accession number. I tired the following and >>> getting an error. Has it got anything to do with recent update to code >>> that Chris was discussing. >>> >>> >>> >>> my $factory = Bio::DB::EUtilities->new ( >>> -eutil => 'efecth', >>> -db => 'nucleotide', >>> -retype => 'fasta', >>> -id => $id >>> ); >>> >>> >>> ----------- EXCEPTION: Bio::Root::Exception ------------- >>> MSG: efecth not supported >>> STACK: Error::throw >>> STACK: Bio::Root::Root::throw >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 >>> STACK: Bio::Tools::EUtilities::EUtilParameters::eutil >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >>> ameters.pm:452 >>> STACK: Bio::Root::RootI::_set_from_args >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 >>> STACK: Bio::Tools::EUtilities::EUtilParameters::new >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >>> ameters.pm:193 >>> STACK: Bio::DB::EUtilities::new >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 >>> STACK: ./getDatafromNCBI.pl:9 >>> >>> >>> -Abhi >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From abhishek.vit at gmail.com Mon Feb 1 16:33:17 2010 From: abhishek.vit at gmail.com (Abhishek Pratap) Date: Mon, 1 Feb 2010 16:33:17 -0500 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> Message-ID: Thanks Chris. I was looking at the same thing in cookbook moments ago. Thanks! -A On Mon, Feb 1, 2010 at 4:31 PM, Chris Fields wrote: > Abhi, > > The accession in question is for a record containing a set of sequences, not just one sequence (it's a contig record). ?The NCBI web interface is performing an esearch on this to get 34K seqs, the equivalent with EUtilities is: > > ================================ > use Bio::DB::EUtilities; > > my $id = 'AAPP01000000[ACCN]'; > > my $factory = Bio::DB::EUtilities->new ?( > ? ? ? -eutil => ?'esearch', > ? ? ? -db ? ?=> ?'nucleotide', > ? ? ? -term ?=> ?$id, > ? ? ? -usehistory => 'y'); > > say $factory->get_count; > > # do more here... > > ================================ > > The 'do more here' part is covered in the cookbook, and will require you retrieving the seqs in chunks. > > chris > > On Feb 1, 2010, at 2:45 PM, Abhishek Pratap wrote: > >> Thank you guys for very quick responses. ?My bad I trusted my fingers. >> >> Now that this is working the output that I am getting is not what I >> want. I am sure I am missing the correct way of doing it. So If I >> search the Nucleotide db @NCBI for this accession number " >> AAPP01000000", I see some 34 k records. What I need to do is pull down >> those sequences as fasta files. >> >> I am referring to >> http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook ?but dint quite >> find a similar example. >> >> Thanks! >> -Abhi >> >> On Mon, Feb 1, 2010 at 3:39 PM, Kevin Brown wrote: >>> Looks like you've misspelled one of the parameters. It should be >>> 'efetch' not 'efecth' >>> >>> Kevin Brown >>> Center for Innovations in Medicine >>> Biodesign Institute >>> Arizona State University >>> >>>> -----Original Message----- >>>> From: bioperl-l-bounces at lists.open-bio.org >>>> [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of >>>> Abhishek Pratap >>>> Sent: Monday, February 01, 2010 1:36 PM >>>> To: bioperl-l at lists.open-bio.org >>>> Subject: [Bioperl-l] Pulling down data from NCBI >>>> >>>> Hi All >>>> >>>> I looking to batch download some 34K nucleotide sequences, >>>> corresponding to a NCBI accession number. I tired the following and >>>> getting an error. Has it got anything to do with recent update to code >>>> that Chris was discussing. >>>> >>>> >>>> >>>> my $factory = Bio::DB::EUtilities->new ?( >>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -eutil ?=> 'efecth', >>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -db ? ? => ? ? ?'nucleotide', >>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -retype => ? ? ?'fasta', >>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -id ? ? ? ? ? ? => $id >>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ); >>>> >>>> >>>> ----------- EXCEPTION: Bio::Root::Exception ------------- >>>> MSG: efecth not supported >>>> STACK: Error::throw >>>> STACK: Bio::Root::Root::throw >>>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 >>>> STACK: Bio::Tools::EUtilities::EUtilParameters::eutil >>>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >>>> ameters.pm:452 >>>> STACK: Bio::Root::RootI::_set_from_args >>>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 >>>> STACK: Bio::Tools::EUtilities::EUtilParameters::new >>>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >>>> ameters.pm:193 >>>> STACK: Bio::DB::EUtilities::new >>>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 >>>> STACK: ./getDatafromNCBI.pl:9 >>>> >>>> >>>> -Abhi >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>> >>> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From xiujingg at yahoo.com Mon Feb 1 15:38:39 2010 From: xiujingg at yahoo.com (Xiujing Gu) Date: Mon, 1 Feb 2010 12:38:39 -0800 (PST) Subject: [Bioperl-l] Fwd: megablast using bioperl In-Reply-To: <18DF7D20DFEC044098A1062202F5FFF32C61B820F2@exchsth.agresearch.co.nz> Message-ID: <718515.61675.qm@web30905.mail.mud.yahoo.com> Thanks for the links! But, they only talk about the regular blastall program, not the megablast. I am not sure how can I switch from blastall to megablast. I tried to just call megablast, but I got a very differenct report. $VAR1 = bless( { ???????????????? 'primary_seq' => bless( { ?????????????????????????????????????????? 'display_id' => 'test', ?????????????????????????????????????????? '_seq_length' => undef, ?????????????????????????????????????????? '_root_verbose' => 0, ?????????????????????????????????????????? 'seq' => 'TGAACCTAGAAGTATTTGTTGCTGGGTTTGTCTTCAGGTTCTGTTGCTCGGTTTTCTAGTTCCCCACCTAGTCTGGGTTACTCTGCAGCTACTTTTGCATTACAATGGCCTTGGTGAGACTGGTAGACGGGATTAACTGAGAATTCACAAGGGTGGGTCAGTAGGGGGTGTGCCCGCCAGGAGGGGTGGGTCTAAGGTGATAGAGCCTTCATTATAAATCTAGAGACTCCAGGATTTTAACGTTCTGCTGGACTGAGCTGGTTGCCTCATGTTATTATGCAGGCAACTCACTTTATCCCAATTTCTTGATACTTTTCCTTCTGGAGGTCCTATTTCTCTAACATCTTCCAGAAAAGTCTTAAAGCTGCCTTAACCTTTTTTCCAGTCCACCTCTTAAATTTTTTCCTCCTCTTCCTCTATACTAACATGAGTGTGGATCCAGCTTGTCCCCAAAGCTTGCCTTGCTTTGAAGCATCCGACTGTAAAGAATCTTCACCTATGCCTGTGATTTGTGGGCCTGAAGAAAACTATCCATCCTTGCAAATGTCTTCTGCTGAGATGCCTCACACGGAGACTGGTAAGAAAGAAATTTATCCTTGAAAGGCCAAGTTCCTTAAGGGAAAAGAGAGAAGGAGAGAGGGTTAAGGGATCATTTCCCTCTTGAGCAATGATGGACCATTACTATAAAGAAGTGTTATTATCAACTAATCCTCTGGAAACCCCTTTTTCCATTATAACTTGGTGGCACCTGCCCTTTGAACTATGTCCCAGGTCTCAGGAGTGTGCATTGAGTTGAAGGACACAGAATTCGGCAGTTGAACAGTGTGCAGTAAGTTTGAGAACCTATGGGCTTAGGCATGGTGGAAACAAAAATGTATCGTTATAGTTAAATGAAGGTGATGTGTACATCTTCACATAGTGCTGGACACATGTGAATAAATAGCAGATTTATTGCTAATTAGCCA GAAGACCTAACGTCATAGCTCAGGGATGAGCATGATTTTGTTTTGCCAAAAATGGCATGGCAAATCACGATGAGATTTCTGTAATACATAATTTGGGTAATTCTTTCTATGTCAGTAACGGCTGTCTCTTCTCCATTCTCTGGGTTTGTGGATGTTACTGGGCAGCTCTGAGTTTGGGAGCACCTCCCATGTCTAATTCTCCTAAGTCCTGGGAAGCGTTGACCCAACTTTATGGTAAAGATAATTCCAGAAAGTTTAATCTACTGACAGTCAAACAGAATGTAGCTAGAAGTCCAGTTTGGCTTCAAAACCTGTGCTAGTACTCATGCTTCTGACTGGTAGCTGCAAGGGGTGGGGGATACTCGGGATACTCATAAAGCCGCTACCACTTTTTTGAAAATCAATTTTTCAGTAGTTTTCAAAAACTTGAGAATGAACCAACTTTACCAAGAATG', ?????????????????????????????????????????? 'alphabet' => 'dna' ???????????????????????????????????????? }, 'Bio::PrimarySeq' ), ???????????????? '_root_verbose' => 0 ?????????????? }, 'Bio::Seq' ); Can't locate object method "nextSbjct" via package "Bio::Seq" Thanks, Xiujing Gu --- On Mon, 2/1/10, Smithies, Russell wrote: From: Smithies, Russell Subject: RE: [Bioperl-l] Fwd: megablast using bioperl To: "'BioPerl List'" Cc: "'xiujingg at yahoo.com'" Date: Monday, February 1, 2010, 3:12 PM Have you read the docs? It's very well documented: http://search.cpan.org/~cjfields/BioPerl-1.6.1/Bio/Tools/Run/StandAloneBlast.pm Have you searched the BioPerl wiki? Again, well documented with examples: http://www.bioperl.org/wiki/HOWTO:StandAloneBlast And if that doesn't help, perhaps you need to improve your Google skills: http://www.google.com/support/websearch/?ctx=web --Russell > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > bounces at lists.open-bio.org] On Behalf Of Jason Stajich > Sent: Tuesday, 2 February 2010 8:48 a.m. > To: BioPerl List > Cc: xiujingg at yahoo.com > Subject: [Bioperl-l] Fwd: megablast using bioperl > > Xiujing - > Your message is best asked on the mailing list. > > Begin forwarded message: > > > From: Xiujing Gu > > Date: February 1, 2010 7:58:55 AM PST > > To: jason at bioperl.org > > Subject: megablast using bioperl > > > > Hello Jason: > > > > After numerous googling, I found you might be the only person who > > knows about this subject. Would you so kindly show me an example of > > setting up megablast using bioperl Bio::Tools::Run::StandAloneBlast? > > > > Thanks a lot! > > > > Best regards, > > Xiujing > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l ======================================================================= Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately. ======================================================================= From cjfields at illinois.edu Mon Feb 1 17:02:43 2010 From: cjfields at illinois.edu (Chris Fields) Date: Mon, 1 Feb 2010 16:02:43 -0600 Subject: [Bioperl-l] Fwd: megablast using bioperl In-Reply-To: <718515.61675.qm@web30905.mail.mud.yahoo.com> References: <718515.61675.qm@web30905.mail.mud.yahoo.com> Message-ID: <786A566A-E72E-441E-A9FE-E4B124332EE8@illinois.edu> Acc. to the blastall docs MEGABLAST is just setting -n to T for blastall. Is that what you mean? pyrimidine1:~ cjfields$ blast-2.2.22/bin/blastall - blastall 2.2.22 arguments: -p Program Name [String] -d Database [String] default = nr ... -n MegaBlast search [T/F] default = F chris On Feb 1, 2010, at 2:38 PM, Xiujing Gu wrote: > Thanks for the links! > But, they only talk about the regular blastall program, not the megablast. > > I am not sure how can I switch from blastall to megablast. I tried to just call megablast, but I got a very differenct report. > > $VAR1 = bless( { > 'primary_seq' => bless( { > 'display_id' => 'test', > '_seq_length' => undef, > '_root_verbose' => 0, > 'seq' => > 'TGAACCTAGAAGTATTTGTTGCTGGGTTTGTCTTCAGGTTCTGTTGCTCGGTTTTCTAGTTCCCCACCTAGTCTGGGTTACTCTGCAGCTACTTTTGCATTACAATGGCCTTGGTGAGACTGGTAGACGGGATTAACTGAGAATTCACAAGGGTGGGTCAGTAGGGGGTGTGCCCGCCAGGAGGGGTGGGTCTAAGGTGATAGAGCCTTCATTATAAATCTAGAGACTCCAGGATTTTAACGTTCTGCTGGACTGAGCTGGTTGCCTCATGTTATTATGCAGGCAACTCACTTTATCCCAATTTCTTGATACTTTTCCTTCTGGAGGTCCTATTTCTCTAACATCTTCCAGAAAAGTCTTAAAGCTGCCTTAACCTTTTTTCCAGTCCACCTCTTAAATTTTTTCCTCCTCTTCCTCTATACTAACATGAGTGTGGATCCAGCTTGTCCCCAAAGCTTGCCTTGCTTTGAAGCATCCGACTGTAAAGAATCTTCACCTATGCCTGTGATTTGTGGGCCTGAAGAAAACTATCCATCCTTGCAAATGTCTTCTGCTGAGATGCCTCACACGGAGACTGGTAAGAAAGAAATTTATCCTTGAAAGGCCAAGTTCCTTAAGGGAAAAGAGAGAAGGAGAGAGGGTTAAGGGATCATTTCCCTCTTGAGCAATGATGGACCATTACTATAAAGAAGTGTTATTATCAACTAATCCTCTGGAAACCCCTTTTTCCATTATAACTTGGTGGCACCTGCCCTTTGAACTATGTCCCAGGTCTCAGGAGTGTGCATTGAGTTGAAGGACACAGAATTCGGCAGTTGAACAGTGTGCAGTAAGTTTGAGAACCTATGGGCTTAGGCATGGTGGAAACAAAAATGTATCGTTATAGTTAAATGAAGGTGATGTGTACATCTTCACATAGTGCTGGACACATGTGAATAAATAGCAGATTTATTGCTAATTAGCCA > GAAGACCTAACGTCATAGCTCAGGGATGAGCATGATTTTGTTTTGCCAAAAATGGCATGGCAAATCACGATGAGATTTCTGTAATACATAATTTGGGTAATTCTTTCTATGTCAGTAACGGCTGTCTCTTCTCCATTCTCTGGGTTTGTGGATGTTACTGGGCAGCTCTGAGTTTGGGAGCACCTCCCATGTCTAATTCTCCTAAGTCCTGGGAAGCGTTGACCCAACTTTATGGTAAAGATAATTCCAGAAAGTTTAATCTACTGACAGTCAAACAGAATGTAGCTAGAAGTCCAGTTTGGCTTCAAAACCTGTGCTAGTACTCATGCTTCTGACTGGTAGCTGCAAGGGGTGGGGGATACTCGGGATACTCATAAAGCCGCTACCACTTTTTTGAAAATCAATTTTTCAGTAGTTTTCAAAAACTTGAGAATGAACCAACTTTACCAAGAATG', > 'alphabet' => 'dna' > }, 'Bio::PrimarySeq' ), > '_root_verbose' => 0 > }, 'Bio::Seq' ); > Can't locate object method "nextSbjct" via package "Bio::Seq" > > > > Thanks, > Xiujing Gu > > > --- On Mon, 2/1/10, Smithies, Russell wrote: > > From: Smithies, Russell > Subject: RE: [Bioperl-l] Fwd: megablast using bioperl > To: "'BioPerl List'" > Cc: "'xiujingg at yahoo.com'" > Date: Monday, February 1, 2010, 3:12 PM > > Have you read the docs? It's very well documented: > http://search.cpan.org/~cjfields/BioPerl-1.6.1/Bio/Tools/Run/StandAloneBlast.pm > > Have you searched the BioPerl wiki? Again, well documented with examples: > http://www.bioperl.org/wiki/HOWTO:StandAloneBlast > > And if that doesn't help, perhaps you need to improve your Google skills: > http://www.google.com/support/websearch/?ctx=web > > --Russell > > > >> -----Original Message----- >> From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- >> bounces at lists.open-bio.org] On Behalf Of Jason Stajich >> Sent: Tuesday, 2 February 2010 8:48 a.m. >> To: BioPerl List >> Cc: xiujingg at yahoo.com >> Subject: [Bioperl-l] Fwd: megablast using bioperl >> >> Xiujing - >> Your message is best asked on the mailing list. >> >> Begin forwarded message: >> >>> From: Xiujing Gu >>> Date: February 1, 2010 7:58:55 AM PST >>> To: jason at bioperl.org >>> Subject: megablast using bioperl >>> >>> Hello Jason: >>> >>> After numerous googling, I found you might be the only person who >>> knows about this subject. Would you so kindly show me an example of >>> setting up megablast using bioperl Bio::Tools::Run::StandAloneBlast? >>> >>> Thanks a lot! >>> >>> Best regards, >>> Xiujing >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > ======================================================================= > Attention: The information contained in this message and/or attachments > from AgResearch Limited is intended only for the persons or entities > to which it is addressed and may contain confidential and/or privileged > material. Any review, retransmission, dissemination or other use of, or > taking of any action in reliance upon, this information by persons or > entities other than the intended recipients is prohibited by AgResearch > Limited. If you have received this message in error, please notify the > sender immediately. > ======================================================================= > > > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From Russell.Smithies at agresearch.co.nz Mon Feb 1 17:42:31 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Tue, 2 Feb 2010 11:42:31 +1300 Subject: [Bioperl-l] Fwd: megablast using bioperl In-Reply-To: <786A566A-E72E-441E-A9FE-E4B124332EE8@illinois.edu> References: <718515.61675.qm@web30905.mail.mud.yahoo.com> <786A566A-E72E-441E-A9FE-E4B124332EE8@illinois.edu> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61B8223C@exchsth.agresearch.co.nz> I don't think it allows a discontiguous template when run via blastall though. "blastall -p blastn -n T" will give you a different result than straight megablast. --Russell > -----Original Message----- > From: Chris Fields [mailto:cjfields at illinois.edu] > Sent: Tuesday, 2 February 2010 11:03 a.m. > To: Xiujing Gu > Cc: 'BioPerl List'; Smithies, Russell > Subject: Re: [Bioperl-l] Fwd: megablast using bioperl > > Acc. to the blastall docs MEGABLAST is just setting -n to T for blastall. > Is that what you mean? > > pyrimidine1:~ cjfields$ blast-2.2.22/bin/blastall - > > blastall 2.2.22 arguments: > > -p Program Name [String] > -d Database [String] > default = nr > ... > -n MegaBlast search [T/F] > default = F > > chris > > On Feb 1, 2010, at 2:38 PM, Xiujing Gu wrote: > > > Thanks for the links! > > But, they only talk about the regular blastall program, not the > megablast. > > > > I am not sure how can I switch from blastall to megablast. I tried to > just call megablast, but I got a very differenct report. > > > > $VAR1 = bless( { > > 'primary_seq' => bless( { > > 'display_id' => 'test', > > '_seq_length' => undef, > > '_root_verbose' => 0, > > 'seq' => > > > 'TGAACCTAGAAGTATTTGTTGCTGGGTTTGTCTTCAGGTTCTGTTGCTCGGTTTTCTAGTTCCCCACCTAGTC > TGGGTTACTCTGCAGCTACTTTTGCATTACAATGGCCTTGGTGAGACTGGTAGACGGGATTAACTGAGAATTCA > CAAGGGTGGGTCAGTAGGGGGTGTGCCCGCCAGGAGGGGTGGGTCTAAGGTGATAGAGCCTTCATTATAAATCT > AGAGACTCCAGGATTTTAACGTTCTGCTGGACTGAGCTGGTTGCCTCATGTTATTATGCAGGCAACTCACTTTA > TCCCAATTTCTTGATACTTTTCCTTCTGGAGGTCCTATTTCTCTAACATCTTCCAGAAAAGTCTTAAAGCTGCC > TTAACCTTTTTTCCAGTCCACCTCTTAAATTTTTTCCTCCTCTTCCTCTATACTAACATGAGTGTGGATCCAGC > TTGTCCCCAAAGCTTGCCTTGCTTTGAAGCATCCGACTGTAAAGAATCTTCACCTATGCCTGTGATTTGTGGGC > CTGAAGAAAACTATCCATCCTTGCAAATGTCTTCTGCTGAGATGCCTCACACGGAGACTGGTAAGAAAGAAATT > TATCCTTGAAAGGCCAAGTTCCTTAAGGGAAAAGAGAGAAGGAGAGAGGGTTAAGGGATCATTTCCCTCTTGAG > CAATGATGGACCATTACTATAAAGAAGTGTTATTATCAACTAATCCTCTGGAAACCCCTTTTTCCATTATAACT > TGGTGGCACCTGCCCTTTGAACTATGTCCCAGGTCTCAGGAGTGTGCATTGAGTTGAAGGACACAGAATTCGGC > AGTTGAACAGTGTGCAGTAAGTTTGAGAACCTATGGGCTTAGGCATGGTGGAAACAAAAATGTATCGTTATAGT > TAAATGAAGGTGATGTGTACATCTTCACATAGTGCTGGACACATGTGAATAAATAGCAGATTTATTGCTAATTA > GCCA > > > GAAGACCTAACGTCATAGCTCAGGGATGAGCATGATTTTGTTTTGCCAAAAATGGCATGGCAAATCACGATGAG > ATTTCTGTAATACATAATTTGGGTAATTCTTTCTATGTCAGTAACGGCTGTCTCTTCTCCATTCTCTGGGTTTG > TGGATGTTACTGGGCAGCTCTGAGTTTGGGAGCACCTCCCATGTCTAATTCTCCTAAGTCCTGGGAAGCGTTGA > CCCAACTTTATGGTAAAGATAATTCCAGAAAGTTTAATCTACTGACAGTCAAACAGAATGTAGCTAGAAGTCCA > GTTTGGCTTCAAAACCTGTGCTAGTACTCATGCTTCTGACTGGTAGCTGCAAGGGGTGGGGGATACTCGGGATA > CTCATAAAGCCGCTACCACTTTTTTGAAAATCAATTTTTCAGTAGTTTTCAAAAACTTGAGAATGAACCAACTT > TACCAAGAATG', > > 'alphabet' => 'dna' > > }, 'Bio::PrimarySeq' ), > > '_root_verbose' => 0 > > }, 'Bio::Seq' ); > > Can't locate object method "nextSbjct" via package "Bio::Seq" > > > > > > > > Thanks, > > Xiujing Gu > > > > > > --- On Mon, 2/1/10, Smithies, Russell > wrote: > > > > From: Smithies, Russell > > Subject: RE: [Bioperl-l] Fwd: megablast using bioperl > > To: "'BioPerl List'" > > Cc: "'xiujingg at yahoo.com'" > > Date: Monday, February 1, 2010, 3:12 PM > > > > Have you read the docs? It's very well documented: > > http://search.cpan.org/~cjfields/BioPerl- > 1.6.1/Bio/Tools/Run/StandAloneBlast.pm > > > > Have you searched the BioPerl wiki? Again, well documented with > examples: > > http://www.bioperl.org/wiki/HOWTO:StandAloneBlast > > > > And if that doesn't help, perhaps you need to improve your Google > skills: > > http://www.google.com/support/websearch/?ctx=web > > > > --Russell > > > > > > > >> -----Original Message----- > >> From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > >> bounces at lists.open-bio.org] On Behalf Of Jason Stajich > >> Sent: Tuesday, 2 February 2010 8:48 a.m. > >> To: BioPerl List > >> Cc: xiujingg at yahoo.com > >> Subject: [Bioperl-l] Fwd: megablast using bioperl > >> > >> Xiujing - > >> Your message is best asked on the mailing list. > >> > >> Begin forwarded message: > >> > >>> From: Xiujing Gu > >>> Date: February 1, 2010 7:58:55 AM PST > >>> To: jason at bioperl.org > >>> Subject: megablast using bioperl > >>> > >>> Hello Jason: > >>> > >>> After numerous googling, I found you might be the only person who > >>> knows about this subject. Would you so kindly show me an example of > >>> setting up megablast using bioperl Bio::Tools::Run::StandAloneBlast? > >>> > >>> Thanks a lot! > >>> > >>> Best regards, > >>> Xiujing > >> _______________________________________________ > >> Bioperl-l mailing list > >> Bioperl-l at lists.open-bio.org > >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > ======================================================================= > > Attention: The information contained in this message and/or attachments > > from AgResearch Limited is intended only for the persons or entities > > to which it is addressed and may contain confidential and/or privileged > > material. Any review, retransmission, dissemination or other use of, or > > taking of any action in reliance upon, this information by persons or > > entities other than the intended recipients is prohibited by AgResearch > > Limited. If you have received this message in error, please notify the > > sender immediately. > > ======================================================================= > > > > > > > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l From Russell.Smithies at agresearch.co.nz Mon Feb 1 19:27:16 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Tue, 2 Feb 2010 13:27:16 +1300 Subject: [Bioperl-l] Fwd: megablast using bioperl In-Reply-To: <18DF7D20DFEC044098A1062202F5FFF32C618DD435@exchsth.agresearch.co.nz> References: <718515.61675.qm@web30905.mail.mud.yahoo.com> <786A566A-E72E-441E-A9FE-E4B124332EE8@illinois.edu> <18DF7D20DFEC044098A1062202F5FFF32C618DD435@exchsth.agresearch.co.nz> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61B822D1@exchsth.agresearch.co.nz> As I suspected, default gap penalties are different when running "megablast" and "blastall -p blastn -n T". Here are hits to the same sequence obtained both ways: ========================================= Megablast >gb|FJ882402.1| Sus scrofa homeodomain transcription factor Nanog-like protein mRNA, complete cds Length = 925 Score = 93.7 bits (47), Expect = 2e-15 Identities = 129/155 (83%), Gaps = 8/155 (5%) Strand = Plus / Plus Query: 427 atgagtgtggatccagcttgtccccaaagcttgccttgctttgaagcatccgactgtaaa 486 |||||||||||||||||||||||||||||| ||| |||| ||||||||| | | | | Sbjct: 1 atgagtgtggatccagcttgtccccaaagcctgctttgccccgaagcatcc-att-tcca 58 Query: 487 g--aatcttcacctatgcctgtgatttgtgggcctgaagaaaactat-ccatccttgcaa 543 | |||||||||| ||||||| | ||| ||||||||||||||| ||| || |||||||| Sbjct: 59 gcgaatcttcaccaatgcctgaggtttatgggcctgaagaaaattatgcc-tccttgcag 117 Query: 544 atgtcttctgctgagatgcctc-acacggagactg 577 ||||| ||||||||| |||| |||| ||||||| Sbjct: 118 atgtcagctgctgagac-cctcgacaccgagactg 151 Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS,environmental samples or phase 0, 1 or 2 HTGS sequences) Posted date: Jan 26, 2010 5:41 PM Number of letters in database: 29,627,559,461 Number of sequences in database: 10,420,530 Lambda K H 1.37 0.711 1.31 Gapped Lambda K H 1.37 0.711 1.31 Matrix: blastn matrix:1 -3 Gap Penalties: Existence: 0, Extension: 0 ========================================================== blastall -p blastn -n T >gb|FJ882402.1| Sus scrofa homeodomain transcription factor Nanog-like protein mRNA, complete cds Length = 925 Score = 89.7 bits (45), Expect = 3e-14 Identities = 111/133 (83%) Strand = Plus / Plus Query: 427 atgagtgtggatccagcttgtccccaaagcttgccttgctttgaagcatccgactgtaaa 486 |||||||||||||||||||||||||||||| ||| |||| ||||||||| | | Sbjct: 1 atgagtgtggatccagcttgtccccaaagcctgctttgccccgaagcatccatttccagc 60 Query: 487 gaatcttcacctatgcctgtgatttgtgggcctgaagaaaactatccatccttgcaaatg 546 ||||||||||| ||||||| | ||| ||||||||||||||| ||| | |||||||| ||| Sbjct: 61 gaatcttcaccaatgcctgaggtttatgggcctgaagaaaattatgcctccttgcagatg 120 Query: 547 tcttctgctgaga 559 || ||||||||| Sbjct: 121 tcagctgctgaga 133 Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS,environmental samples or phase 0, 1 or 2 HTGS sequences) Posted date: Jan 26, 2010 5:41 PM Number of letters in database: 29,627,559,461 Number of sequences in database: 10,420,530 Lambda K H 1.37 0.711 1.31 Gapped Lambda K H 1.37 0.711 1.31 Matrix: blastn matrix:1 -3 Gap Penalties: Existence: 5, Extension: 2 --Russell > -----Original Message----- > From: Smithies, Russell > Sent: Tuesday, 2 February 2010 11:43 a.m. > To: 'Chris Fields'; 'Xiujing Gu' > Cc: 'BioPerl List' > Subject: RE: [Bioperl-l] Fwd: megablast using bioperl > > I don't think it allows a discontiguous template when run via blastall > though. > "blastall -p blastn -n T" will give you a different result than straight > megablast. > > --Russell > > > -----Original Message----- > > From: Chris Fields [mailto:cjfields at illinois.edu] > > Sent: Tuesday, 2 February 2010 11:03 a.m. > > To: Xiujing Gu > > Cc: 'BioPerl List'; Smithies, Russell > > Subject: Re: [Bioperl-l] Fwd: megablast using bioperl > > > > Acc. to the blastall docs MEGABLAST is just setting -n to T for > blastall. > > Is that what you mean? > > > > pyrimidine1:~ cjfields$ blast-2.2.22/bin/blastall - > > > > blastall 2.2.22 arguments: > > > > -p Program Name [String] > > -d Database [String] > > default = nr > > ... > > -n MegaBlast search [T/F] > > default = F > > > > chris > > > > On Feb 1, 2010, at 2:38 PM, Xiujing Gu wrote: > > > > > Thanks for the links! > > > But, they only talk about the regular blastall program, not the > > megablast. > > > > > > I am not sure how can I switch from blastall to megablast. I tried to > > just call megablast, but I got a very differenct report. > > > > > > $VAR1 = bless( { > > > 'primary_seq' => bless( { > > > 'display_id' => 'test', > > > '_seq_length' => undef, > > > '_root_verbose' => 0, > > > 'seq' => > > > > > > 'TGAACCTAGAAGTATTTGTTGCTGGGTTTGTCTTCAGGTTCTGTTGCTCGGTTTTCTAGTTCCCCACCTAGTC > > > TGGGTTACTCTGCAGCTACTTTTGCATTACAATGGCCTTGGTGAGACTGGTAGACGGGATTAACTGAGAATTCA > > > CAAGGGTGGGTCAGTAGGGGGTGTGCCCGCCAGGAGGGGTGGGTCTAAGGTGATAGAGCCTTCATTATAAATCT > > > AGAGACTCCAGGATTTTAACGTTCTGCTGGACTGAGCTGGTTGCCTCATGTTATTATGCAGGCAACTCACTTTA > > > TCCCAATTTCTTGATACTTTTCCTTCTGGAGGTCCTATTTCTCTAACATCTTCCAGAAAAGTCTTAAAGCTGCC > > > TTAACCTTTTTTCCAGTCCACCTCTTAAATTTTTTCCTCCTCTTCCTCTATACTAACATGAGTGTGGATCCAGC > > > TTGTCCCCAAAGCTTGCCTTGCTTTGAAGCATCCGACTGTAAAGAATCTTCACCTATGCCTGTGATTTGTGGGC > > > CTGAAGAAAACTATCCATCCTTGCAAATGTCTTCTGCTGAGATGCCTCACACGGAGACTGGTAAGAAAGAAATT > > > TATCCTTGAAAGGCCAAGTTCCTTAAGGGAAAAGAGAGAAGGAGAGAGGGTTAAGGGATCATTTCCCTCTTGAG > > > CAATGATGGACCATTACTATAAAGAAGTGTTATTATCAACTAATCCTCTGGAAACCCCTTTTTCCATTATAACT > > > TGGTGGCACCTGCCCTTTGAACTATGTCCCAGGTCTCAGGAGTGTGCATTGAGTTGAAGGACACAGAATTCGGC > > > AGTTGAACAGTGTGCAGTAAGTTTGAGAACCTATGGGCTTAGGCATGGTGGAAACAAAAATGTATCGTTATAGT > > > TAAATGAAGGTGATGTGTACATCTTCACATAGTGCTGGACACATGTGAATAAATAGCAGATTTATTGCTAATTA > > GCCA > > > > > > GAAGACCTAACGTCATAGCTCAGGGATGAGCATGATTTTGTTTTGCCAAAAATGGCATGGCAAATCACGATGAG > > > ATTTCTGTAATACATAATTTGGGTAATTCTTTCTATGTCAGTAACGGCTGTCTCTTCTCCATTCTCTGGGTTTG > > > TGGATGTTACTGGGCAGCTCTGAGTTTGGGAGCACCTCCCATGTCTAATTCTCCTAAGTCCTGGGAAGCGTTGA > > > CCCAACTTTATGGTAAAGATAATTCCAGAAAGTTTAATCTACTGACAGTCAAACAGAATGTAGCTAGAAGTCCA > > > GTTTGGCTTCAAAACCTGTGCTAGTACTCATGCTTCTGACTGGTAGCTGCAAGGGGTGGGGGATACTCGGGATA > > > CTCATAAAGCCGCTACCACTTTTTTGAAAATCAATTTTTCAGTAGTTTTCAAAAACTTGAGAATGAACCAACTT > > TACCAAGAATG', > > > 'alphabet' => 'dna' > > > }, 'Bio::PrimarySeq' ), > > > '_root_verbose' => 0 > > > }, 'Bio::Seq' ); > > > Can't locate object method "nextSbjct" via package "Bio::Seq" > > > > > > > > > > > > Thanks, > > > Xiujing Gu > > > > > > > > > --- On Mon, 2/1/10, Smithies, Russell > > wrote: > > > > > > From: Smithies, Russell > > > Subject: RE: [Bioperl-l] Fwd: megablast using bioperl > > > To: "'BioPerl List'" > > > Cc: "'xiujingg at yahoo.com'" > > > Date: Monday, February 1, 2010, 3:12 PM > > > > > > Have you read the docs? It's very well documented: > > > http://search.cpan.org/~cjfields/BioPerl- > > 1.6.1/Bio/Tools/Run/StandAloneBlast.pm > > > > > > Have you searched the BioPerl wiki? Again, well documented with > > examples: > > > http://www.bioperl.org/wiki/HOWTO:StandAloneBlast > > > > > > And if that doesn't help, perhaps you need to improve your Google > > skills: > > > http://www.google.com/support/websearch/?ctx=web > > > > > > --Russell > > > > > > > > > > > >> -----Original Message----- > > >> From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > > >> bounces at lists.open-bio.org] On Behalf Of Jason Stajich > > >> Sent: Tuesday, 2 February 2010 8:48 a.m. > > >> To: BioPerl List > > >> Cc: xiujingg at yahoo.com > > >> Subject: [Bioperl-l] Fwd: megablast using bioperl > > >> > > >> Xiujing - > > >> Your message is best asked on the mailing list. > > >> > > >> Begin forwarded message: > > >> > > >>> From: Xiujing Gu > > >>> Date: February 1, 2010 7:58:55 AM PST > > >>> To: jason at bioperl.org > > >>> Subject: megablast using bioperl > > >>> > > >>> Hello Jason: > > >>> > > >>> After numerous googling, I found you might be the only person who > > >>> knows about this subject. Would you so kindly show me an example of > > >>> setting up megablast using bioperl Bio::Tools::Run::StandAloneBlast? > > >>> > > >>> Thanks a lot! > > >>> > > >>> Best regards, > > >>> Xiujing > > >> _______________________________________________ > > >> Bioperl-l mailing list > > >> Bioperl-l at lists.open-bio.org > > >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > > ======================================================================= > > > Attention: The information contained in this message and/or > attachments > > > from AgResearch Limited is intended only for the persons or entities > > > to which it is addressed and may contain confidential and/or > privileged > > > material. Any review, retransmission, dissemination or other use of, > or > > > taking of any action in reliance upon, this information by persons or > > > entities other than the intended recipients is prohibited by > AgResearch > > > Limited. If you have received this message in error, please notify the > > > sender immediately. > > > > ======================================================================= > > > > > > > > > > > > > > > _______________________________________________ > > > Bioperl-l mailing list > > > Bioperl-l at lists.open-bio.org > > > http://lists.open-bio.org/mailman/listinfo/bioperl-l From mmuratet at hudsonalpha.org Tue Feb 2 07:14:17 2010 From: mmuratet at hudsonalpha.org (Michael Muratet) Date: Tue, 2 Feb 2010 06:14:17 -0600 Subject: [Bioperl-l] Apparent disagreement between EUtilities and web page In-Reply-To: References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> Message-ID: <3D2AF4CC-D10D-479C-868F-E78B9EC3B420@hudsonalpha.org> Greetings I am using EUtilities for the first time to retrieve data from NCBI and I could use some help trying to isolate a problem I have. I was using a script I put together from examples in the How-To and email archives $factory->set_parameters(-id => \@sub ); while ( my $docsum =$factory->next_DocSum ) { my ($item) = $docsum- >get_Items_by_name('GenomicInfoType'); my ( $chrloc, $acc, $start, $end) = ( $item- >get_contents_by_name('ChrLoc'), $item- >get_contents_by_name('ChrAccVer'), $item- >get_contents_by_name('ChrStart'), $item- >get_contents_by_name('ChrStop')); and I found a record that has a suspect ChrStart field and no ChrStop field (using another example from the HowTo) ID: 100039753 Name-Description-Orgname-Status-CurrentID-Chromosome-GeneticSource- MapLocation-OtherAliases-OtherDesignations-NomenclatureSymbol- NomenclatureName-NomenclatureStatus-TaxID-Mim-GenomicInfo-GeneWeight- Summary-ChrSort-ChrStart Name :LOC100039753 Description :similar to putative Orgname :Mus musculus Chromosome :Y GeneticSource :genomic MapLocation :Y B OtherDesignations :hypothetical protein LOC100039753 TaxID :10090 GeneWeight :401 ChrSort :~~last ChrStart :999999999 yet on the NCBI web page it shows a defined start and a stop for the locus. I can fix the perl code so that it won't barf when the field isn't defined, but I'd like to get all the data and I can't figure out if the problem is on the client side or the server side. All the data comes from the same source, doesn't it? Thanks Mike From robert.bradbury at gmail.com Tue Feb 2 08:57:53 2010 From: robert.bradbury at gmail.com (Robert Bradbury) Date: Tue, 2 Feb 2010 08:57:53 -0500 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> Message-ID: What species/chromosome is this? > > my $id = 'AAPP01000000[ACCN]'; > > One can usually download the genome sequence files, chromosome files, or fasta files from the various FTP sites (almost all of the major genomes have them) or the that would generally be the fastest way to do it. Or simply look up the Genome sequence at NCBI and download it using a web browser. There is standard documentation on how to convert genome sequences into fasta files. If you are looking for the "big" genomes which may not be in NCBI yet, go to the Broad Institute. Some bacterial sequences may still only be at TIGR or JGI until they migrate upward. But using the standard "system" utilities to do this is usually a far better way to do this then wrestling in BioPerl. FTP and Web Servers are written in C or C++ which is compiled and much more efficient than an interpreted language like Perl or Java. Robert From cjfields at illinois.edu Tue Feb 2 09:04:49 2010 From: cjfields at illinois.edu (Chris Fields) Date: Tue, 2 Feb 2010 08:04:49 -0600 Subject: [Bioperl-l] FeatureIO refactoring Message-ID: <85A8D301-1E20-4981-9741-C5EC437A639E@illinois.edu> All, Based on a few discussions from the BioPerl meeting, I've decided to go ahead, bite the bullet, and (on a branch) pull out the Bio::FeatureIO code from core for refactoring. FeatureIO-based code now lies in the Bio-FeatureIO repository in bioperl's SVN. The resulting code (FeatureIO-less) core probably won't be merged back into main trunk until after a 1.6.2 release, at which point we may just proceed towards the long-awaited restructuring scheme for BioPerl. I'm pretty open for discussion on how to proceed. My current somewhat loose plans: 1) Rewrite FeatureIO code to remove dependencies specifically on Bio::SeqFeature::Annotated (BSFA). There have been several arguments on and off list as to BSFA's issues. I think, due to BSFA's idiosyncracies, it's best to remove use of it from BioPerl and replace it with something more lightweight. 2) Consolidate FeatureIO around Bio::DB::SeqFeature::Store's Loader classes and built-in SeqFeature methods for generating GFF data. No need to rehash code if it's already working well (and an opportunity to fix code that needs it). In particular, the GFF3 loader appears to be working very well. However, if there are any caveats, this would be the time to air them! Also, with that mind... 3) Optional type-checking against SO/SOFA using tools already in place (Bio::Ontology::*) This is a bit trickier, as I don't recall Bio::DB::SeqFeature having any typing mechanism, I'm guessing in keeping with it's lightweight nature. Should we use the attribute Ontology_term for this? 4) Of course, addressing any FeatureIO-related bugs present in Bugzilla. Feedback and suggestions welcome! chris From cjfields at illinois.edu Tue Feb 2 09:06:09 2010 From: cjfields at illinois.edu (Chris Fields) Date: Tue, 2 Feb 2010 08:06:09 -0600 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> Message-ID: <8CE3E21B-CBBA-4F43-8D68-0A37030CC5D0@illinois.edu> On Feb 2, 2010, at 7:57 AM, Robert Bradbury wrote: > > What species/chromosome is this? > > my $id = 'AAPP01000000[ACCN]'; > > > One can usually download the genome sequence files, chromosome files, or fasta files from the various FTP sites (almost all of the major genomes have them) or the that would generally be the fastest way to do it. Or simply look up the Genome sequence at NCBI and download it using a web browser. There is standard documentation on how to convert genome sequences into fasta files. Yes, Robert, that is true. This is just an example sequence file demonstrating the problem; retrieving the sequences via FTP would be more efficient. However, if one had hundreds of these, and the IDs weren't known ahead of time (they cam from a prior analysis), then this wouldn't be the case. And this is a common issue.` > If you are looking for the "big" genomes which may not be in NCBI yet, go to the Broad Institute. Some bacterial sequences may still only be at TIGR or JGI until they migrate upward. > > But using the standard "system" utilities to do this is usually a far better way to do this then wrestling in BioPerl. FTP and Web Servers are written in C or C++ which is compiled and much more efficient than an interpreted language like Perl or Java. > > Robert Actually, most if not all the web services stuff is in Java, and (for many genome sites) they use other languages besides C/C++ (Perl, Python, Ruby, etc). Ensembl, FlyBase, etc come to mind. chris From cjfields at illinois.edu Tue Feb 2 09:34:46 2010 From: cjfields at illinois.edu (Chris Fields) Date: Tue, 2 Feb 2010 08:34:46 -0600 Subject: [Bioperl-l] Apparent disagreement between EUtilities and web page In-Reply-To: <3D2AF4CC-D10D-479C-868F-E78B9EC3B420@hudsonalpha.org> References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <3D2AF4CC-D10D-479C-868F-E78B9EC3B420@hudsonalpha.org> Message-ID: <534C7B91-C42B-4032-A78E-C9202EE44E7B@illinois.edu> Michael, This is an unusual one and appears to be server-side. I've seen this before with older obsoleted gene IDs (see: 435023, which is the older version of 100039753), so I'm wondering if an update going on that hasn't made it's way to the web interface yet. I'm attempting to contact NCBI about the issue, so hopefully we'll hear back. chris On Feb 2, 2010, at 6:14 AM, Michael Muratet wrote: > Greetings > > I am using EUtilities for the first time to retrieve data from NCBI and I could use some help trying to isolate a problem I have. I was using a script I put together from examples in the How-To and email archives > > $factory->set_parameters(-id => \@sub ); > > while ( my $docsum =$factory->next_DocSum ) { > my ($item) = $docsum->get_Items_by_name('GenomicInfoType'); > > my ( $chrloc, $acc, $start, $end) = ( > $item->get_contents_by_name('ChrLoc'), > $item->get_contents_by_name('ChrAccVer'), > $item->get_contents_by_name('ChrStart'), > $item->get_contents_by_name('ChrStop')); > > and I found a record that has a suspect ChrStart field and no ChrStop field (using another example from the HowTo) > > ID: 100039753 > Name-Description-Orgname-Status-CurrentID-Chromosome-GeneticSource-MapLocation-OtherAliases-OtherDesignations-NomenclatureSymbol-NomenclatureName-NomenclatureStatus-TaxID-Mim-GenomicInfo-GeneWeight-Summary-ChrSort-ChrStart > Name :LOC100039753 > Description :similar to putative > Orgname :Mus musculus > Chromosome :Y > GeneticSource :genomic > MapLocation :Y B > OtherDesignations :hypothetical protein LOC100039753 > TaxID :10090 > GeneWeight :401 > ChrSort :~~last > ChrStart :999999999 > > yet on the NCBI web page it shows a defined start and a stop for the locus. > > I can fix the perl code so that it won't barf when the field isn't defined, but I'd like to get all the data and I can't figure out if the problem is on the client side or the server side. All the data comes from the same source, doesn't it? > > Thanks > > Mike > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From shalabh.sharma7 at gmail.com Tue Feb 2 10:16:26 2010 From: shalabh.sharma7 at gmail.com (shalabh sharma) Date: Tue, 2 Feb 2010 10:16:26 -0500 Subject: [Bioperl-l] Mapping Blast results Message-ID: <9fcc48c71002020716u2981596ekf3ca36cc693200f0@mail.gmail.com> Hi, I am trying to map blast results on a gene. I am using Bio::Graphics http://bioperl.org/wiki/HOWTO:Graphics This is working fine for small blast files but not for large blast file with lot of hits. Is there any way i can use this for large blast files? Thanks Shalabh From lsbrath at gmail.com Tue Feb 2 10:40:45 2010 From: lsbrath at gmail.com (Mgavi Brathwaite) Date: Tue, 2 Feb 2010 10:40:45 -0500 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> Message-ID: <69367b8f1002020740x23f05aecn633c27b242deb56a@mail.gmail.com> What "cookbook"? Mgavi On Mon, Feb 1, 2010 at 4:31 PM, Chris Fields wrote: > Abhi, > > The accession in question is for a record containing a set of sequences, > not just one sequence (it's a contig record). The NCBI web interface is > performing an esearch on this to get 34K seqs, the equivalent with > EUtilities is: > > ================================ > use Bio::DB::EUtilities; > > my $id = 'AAPP01000000[ACCN]'; > > my $factory = Bio::DB::EUtilities->new ( > -eutil => 'esearch', > -db => 'nucleotide', > -term => $id, > -usehistory => 'y'); > > say $factory->get_count; > > # do more here... > > ================================ > > The 'do more here' part is covered in the cookbook, and will require you > retrieving the seqs in chunks. > > chris > > On Feb 1, 2010, at 2:45 PM, Abhishek Pratap wrote: > > > Thank you guys for very quick responses. My bad I trusted my fingers. > > > > Now that this is working the output that I am getting is not what I > > want. I am sure I am missing the correct way of doing it. So If I > > search the Nucleotide db @NCBI for this accession number " > > AAPP01000000", I see some 34 k records. What I need to do is pull down > > those sequences as fasta files. > > > > I am referring to > > http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook but dint quite > > find a similar example. > > > > Thanks! > > -Abhi > > > > On Mon, Feb 1, 2010 at 3:39 PM, Kevin Brown > wrote: > >> Looks like you've misspelled one of the parameters. It should be > >> 'efetch' not 'efecth' > >> > >> Kevin Brown > >> Center for Innovations in Medicine > >> Biodesign Institute > >> Arizona State University > >> > >>> -----Original Message----- > >>> From: bioperl-l-bounces at lists.open-bio.org > >>> [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of > >>> Abhishek Pratap > >>> Sent: Monday, February 01, 2010 1:36 PM > >>> To: bioperl-l at lists.open-bio.org > >>> Subject: [Bioperl-l] Pulling down data from NCBI > >>> > >>> Hi All > >>> > >>> I looking to batch download some 34K nucleotide sequences, > >>> corresponding to a NCBI accession number. I tired the following and > >>> getting an error. Has it got anything to do with recent update to code > >>> that Chris was discussing. > >>> > >>> > >>> > >>> my $factory = Bio::DB::EUtilities->new ( > >>> -eutil => 'efecth', > >>> -db => 'nucleotide', > >>> -retype => 'fasta', > >>> -id => $id > >>> ); > >>> > >>> > >>> ----------- EXCEPTION: Bio::Root::Exception ------------- > >>> MSG: efecth not supported > >>> STACK: Error::throw > >>> STACK: Bio::Root::Root::throw > >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 > >>> STACK: Bio::Tools::EUtilities::EUtilParameters::eutil > >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar > >>> ameters.pm:452 > >>> STACK: Bio::Root::RootI::_set_from_args > >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 > >>> STACK: Bio::Tools::EUtilities::EUtilParameters::new > >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar > >>> ameters.pm:193 > >>> STACK: Bio::DB::EUtilities::new > >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 > >>> STACK: ./getDatafromNCBI.pl:9 > >>> > >>> > >>> -Abhi > >>> _______________________________________________ > >>> Bioperl-l mailing list > >>> Bioperl-l at lists.open-bio.org > >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l > >>> > >> > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > From cjfields at illinois.edu Tue Feb 2 11:14:01 2010 From: cjfields at illinois.edu (Chris Fields) Date: Tue, 2 Feb 2010 10:14:01 -0600 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: <69367b8f1002020740x23f05aecn633c27b242deb56a@mail.gmail.com> References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> <69367b8f1002020740x23f05aecn633c27b242deb56a@mail.gmail.com> Message-ID: <0B50E488-6821-47DF-AD11-C6C297670C40@illinois.edu> On Feb 2, 2010, at 9:40 AM, Mgavi Brathwaite wrote: > What "cookbook"? > > Mgavi http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook chris From folmerf at gmail.com Tue Feb 2 11:16:48 2010 From: folmerf at gmail.com (Folmer Fredslund) Date: Tue, 2 Feb 2010 17:16:48 +0100 Subject: [Bioperl-l] Pulling down data from NCBI In-Reply-To: <69367b8f1002020740x23f05aecn633c27b242deb56a@mail.gmail.com> References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <2F1F855C-16BF-42ED-BE91-63CD1B331F90@illinois.edu> <69367b8f1002020740x23f05aecn633c27b242deb56a@mail.gmail.com> Message-ID: <991070401002020816r1fc36982j84a8115efe48600d@mail.gmail.com> Hi Mgavi, On 2 February 2010 16:40, Mgavi Brathwaite wrote: > What "cookbook"? > > Mgavi > I don't know much about BioPerl yet, but merely searching for "ncbi eutilities cookbook" on google gave me this site http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook I would guess this is the page referred to. This is just me saying: Please use 2min to get your own answers before you ask everybody on this list. Best regards, Folmer Fredslund > On Mon, Feb 1, 2010 at 4:31 PM, Chris Fields wrote: > >> Abhi, >> >> The accession in question is for a record containing a set of sequences, >> not just one sequence (it's a contig record). ?The NCBI web interface is >> performing an esearch on this to get 34K seqs, the equivalent with >> EUtilities is: >> >> ================================ >> use Bio::DB::EUtilities; >> >> my $id = 'AAPP01000000[ACCN]'; >> >> my $factory = Bio::DB::EUtilities->new ?( >> ? ? ? ?-eutil => ?'esearch', >> ? ? ? -db ? ?=> ?'nucleotide', >> ? ? ? -term ?=> ?$id, >> ? ? ? -usehistory => 'y'); >> >> say $factory->get_count; >> >> # do more here... >> >> ================================ >> >> The 'do more here' part is covered in the cookbook, and will require you >> retrieving the seqs in chunks. >> >> chris >> >> On Feb 1, 2010, at 2:45 PM, Abhishek Pratap wrote: >> >> > Thank you guys for very quick responses. ?My bad I trusted my fingers. >> > >> > Now that this is working the output that I am getting is not what I >> > want. I am sure I am missing the correct way of doing it. So If I >> > search the Nucleotide db @NCBI for this accession number " >> > AAPP01000000", I see some 34 k records. What I need to do is pull down >> > those sequences as fasta files. >> > >> > I am referring to >> > http://www.bioperl.org/wiki/HOWTO:EUtilities_Cookbook ?but dint quite >> > find a similar example. >> > >> > Thanks! >> > -Abhi >> > >> > On Mon, Feb 1, 2010 at 3:39 PM, Kevin Brown >> wrote: >> >> Looks like you've misspelled one of the parameters. It should be >> >> 'efetch' not 'efecth' >> >> >> >> Kevin Brown >> >> Center for Innovations in Medicine >> >> Biodesign Institute >> >> Arizona State University >> >> >> >>> -----Original Message----- >> >>> From: bioperl-l-bounces at lists.open-bio.org >> >>> [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of >> >>> Abhishek Pratap >> >>> Sent: Monday, February 01, 2010 1:36 PM >> >>> To: bioperl-l at lists.open-bio.org >> >>> Subject: [Bioperl-l] Pulling down data from NCBI >> >>> >> >>> Hi All >> >>> >> >>> I looking to batch download some 34K nucleotide sequences, >> >>> corresponding to a NCBI accession number. I tired the following and >> >>> getting an error. Has it got anything to do with recent update to code >> >>> that Chris was discussing. >> >>> >> >>> >> >>> >> >>> my $factory = Bio::DB::EUtilities->new ?( >> >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -eutil ?=> 'efecth', >> >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -db ? ? => ? ? ?'nucleotide', >> >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -retype => ? ? ?'fasta', >> >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? -id ? ? ? ? ? ? => $id >> >>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ); >> >>> >> >>> >> >>> ----------- EXCEPTION: Bio::Root::Exception ------------- >> >>> MSG: efecth not supported >> >>> STACK: Error::throw >> >>> STACK: Bio::Root::Root::throw >> >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/Root.pm:357 >> >>> STACK: Bio::Tools::EUtilities::EUtilParameters::eutil >> >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >> >>> ameters.pm:452 >> >>> STACK: Bio::Root::RootI::_set_from_args >> >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Root/RootI.pm:546 >> >>> STACK: Bio::Tools::EUtilities::EUtilParameters::new >> >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/Tools/EUtilities/EUtilPar >> >>> ameters.pm:193 >> >>> STACK: Bio::DB::EUtilities::new >> >>> /usr/lib/perl5/vendor_perl/5.8.8/Bio/DB/EUtilities.pm:74 >> >>> STACK: ./getDatafromNCBI.pl:9 >> >>> >> >>> >> >>> -Abhi >> >>> _______________________________________________ >> >>> Bioperl-l mailing list >> >>> Bioperl-l at lists.open-bio.org >> >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> >>> >> >> >> > >> > _______________________________________________ >> > Bioperl-l mailing list >> > Bioperl-l at lists.open-bio.org >> > http://lists.open-bio.org/mailman/listinfo/bioperl-l >> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > From scott at scottcain.net Tue Feb 2 12:06:07 2010 From: scott at scottcain.net (Scott Cain) Date: Tue, 2 Feb 2010 12:06:07 -0500 Subject: [Bioperl-l] [Gmod-gbrowse] FeatureIO refactoring In-Reply-To: <85A8D301-1E20-4981-9741-C5EC437A639E@illinois.edu> References: <85A8D301-1E20-4981-9741-C5EC437A639E@illinois.edu> Message-ID: <4536f7701002020906r57979ca4n1833035628fae4b8@mail.gmail.com> Hi Chris, This sounds like a really good approach; I particularly like (2) and tying FeatureIO to SeqFeature::Store. About ontology checking: I think the only thing that ought to be (optionally) checked is the type (column 3) against SO (not SOFA). It is true that SeqFeature::Store does no type checking, and I don't know that it needs to, but having the option of doing that through FeatureIO would be a good thing. Thanks for getting this started, Scott On Tue, Feb 2, 2010 at 9:04 AM, Chris Fields wrote: > All, > > Based on a few discussions from the BioPerl meeting, I've decided to go ahead, bite the bullet, and (on a branch) pull out the Bio::FeatureIO code from core for refactoring. ?FeatureIO-based code now lies in the Bio-FeatureIO repository in bioperl's SVN. ?The resulting code (FeatureIO-less) core probably won't be merged back into main trunk until after a 1.6.2 release, at which point we may just proceed towards the long-awaited restructuring scheme for BioPerl. > > I'm pretty open for discussion on how to proceed. ?My current somewhat loose plans: > > 1) Rewrite FeatureIO code to remove dependencies specifically on Bio::SeqFeature::Annotated (BSFA). > > There have been several arguments on and off list as to BSFA's issues. ?I think, due to BSFA's idiosyncracies, it's best to remove use of it from BioPerl and replace it with something more lightweight. > > 2) Consolidate FeatureIO around Bio::DB::SeqFeature::Store's Loader classes and built-in SeqFeature methods for generating GFF data. > > No need to rehash code if it's already working well (and an opportunity to fix code that needs it). ?In particular, the GFF3 loader appears to be working very well. ?However, if there are any caveats, this would be the time to air them! ?Also, with that mind... > > 3) Optional type-checking against SO/SOFA using tools already in place (Bio::Ontology::*) > > This is a bit trickier, as I don't recall Bio::DB::SeqFeature having any typing mechanism, I'm guessing in keeping with it's lightweight nature. ?Should we use the attribute Ontology_term for this? > > 4) Of course, addressing any FeatureIO-related bugs present in Bugzilla. > > > > Feedback and suggestions welcome! > > chris > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Gmod-gbrowse mailing list > Gmod-gbrowse at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/gmod-gbrowse > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From cjfields at illinois.edu Tue Feb 2 12:54:55 2010 From: cjfields at illinois.edu (Chris Fields) Date: Tue, 2 Feb 2010 11:54:55 -0600 Subject: [Bioperl-l] Apparent disagreement between EUtilities and web page In-Reply-To: <534C7B91-C42B-4032-A78E-C9202EE44E7B@illinois.edu> References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <3D2AF4CC-D10D-479C-868F-E78B9EC3B420@hudsonalpha.org> <534C7B91-C42B-4032-A78E-C9202EE44E7B@illinois.edu> Message-ID: <600261B2-E43F-465C-B7FA-6033A6795BFB@illinois.edu> Just to follow up, it appears to be a disparity between the efetch gene data and the summary information (via esummary). The folks at NCBI are checking up on it. chris On Feb 2, 2010, at 8:34 AM, Chris Fields wrote: > Michael, > > This is an unusual one and appears to be server-side. I've seen this before with older obsoleted gene IDs (see: 435023, which is the older version of 100039753), so I'm wondering if an update going on that hasn't made it's way to the web interface yet. > > I'm attempting to contact NCBI about the issue, so hopefully we'll hear back. > > chris > > On Feb 2, 2010, at 6:14 AM, Michael Muratet wrote: > >> Greetings >> >> I am using EUtilities for the first time to retrieve data from NCBI and I could use some help trying to isolate a problem I have. I was using a script I put together from examples in the How-To and email archives >> >> $factory->set_parameters(-id => \@sub ); >> >> while ( my $docsum =$factory->next_DocSum ) { >> my ($item) = $docsum->get_Items_by_name('GenomicInfoType'); >> >> my ( $chrloc, $acc, $start, $end) = ( >> $item->get_contents_by_name('ChrLoc'), >> $item->get_contents_by_name('ChrAccVer'), >> $item->get_contents_by_name('ChrStart'), >> $item->get_contents_by_name('ChrStop')); >> >> and I found a record that has a suspect ChrStart field and no ChrStop field (using another example from the HowTo) >> >> ID: 100039753 >> Name-Description-Orgname-Status-CurrentID-Chromosome-GeneticSource-MapLocation-OtherAliases-OtherDesignations-NomenclatureSymbol-NomenclatureName-NomenclatureStatus-TaxID-Mim-GenomicInfo-GeneWeight-Summary-ChrSort-ChrStart >> Name :LOC100039753 >> Description :similar to putative >> Orgname :Mus musculus >> Chromosome :Y >> GeneticSource :genomic >> MapLocation :Y B >> OtherDesignations :hypothetical protein LOC100039753 >> TaxID :10090 >> GeneWeight :401 >> ChrSort :~~last >> ChrStart :999999999 >> >> yet on the NCBI web page it shows a defined start and a stop for the locus. >> >> I can fix the perl code so that it won't barf when the field isn't defined, but I'd like to get all the data and I can't figure out if the problem is on the client side or the server side. All the data comes from the same source, doesn't it? >> >> Thanks >> >> Mike >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From mmuratet at hudsonalpha.org Tue Feb 2 13:22:30 2010 From: mmuratet at hudsonalpha.org (Michael Muratet) Date: Tue, 2 Feb 2010 12:22:30 -0600 Subject: [Bioperl-l] Apparent disagreement between EUtilities and web page In-Reply-To: <600261B2-E43F-465C-B7FA-6033A6795BFB@illinois.edu> References: <1A4207F8295607498283FE9E93B775B406849E36@EX02.asurite.ad.asu.edu> <3D2AF4CC-D10D-479C-868F-E78B9EC3B420@hudsonalpha.org> <534C7B91-C42B-4032-A78E-C9202EE44E7B@illinois.edu> <600261B2-E43F-465C-B7FA-6033A6795BFB@illinois.edu> Message-ID: <746E25CE-BA37-4B06-BB21-ED497127C703@hudsonalpha.org> On Feb 2, 2010, at 11:54 AM, Chris Fields wrote: > Just to follow up, it appears to be a disparity between the efetch > gene data and the summary information (via esummary). The folks at > NCBI are checking up on it. > Chris Thanks for the help! I put a test for defined into the script and managed to pull down the other 9K genes. It was just the one that had issues. Mike > chris > > On Feb 2, 2010, at 8:34 AM, Chris Fields wrote: > >> Michael, >> >> This is an unusual one and appears to be server-side. I've seen >> this before with older obsoleted gene IDs (see: 435023, which is >> the older version of 100039753), so I'm wondering if an update >> going on that hasn't made it's way to the web interface yet. >> >> I'm attempting to contact NCBI about the issue, so hopefully we'll >> hear back. >> >> chris >> >> On Feb 2, 2010, at 6:14 AM, Michael Muratet wrote: >> >>> Greetings >>> >>> I am using EUtilities for the first time to retrieve data from >>> NCBI and I could use some help trying to isolate a problem I have. >>> I was using a script I put together from examples in the How-To >>> and email archives >>> >>> $factory->set_parameters(-id => \@sub ); >>> >>> while ( my $docsum =$factory->next_DocSum ) { >>> my ($item) = $docsum- >>> >get_Items_by_name('GenomicInfoType'); >>> >>> my ( $chrloc, $acc, $start, $end) = ( >>> $item- >>> >get_contents_by_name('ChrLoc'), >>> $item- >>> >get_contents_by_name('ChrAccVer'), >>> $item- >>> >get_contents_by_name('ChrStart'), >>> $item- >>> >get_contents_by_name('ChrStop')); >>> >>> and I found a record that has a suspect ChrStart field and no >>> ChrStop field (using another example from the HowTo) >>> >>> ID: 100039753 >>> Name-Description-Orgname-Status-CurrentID-Chromosome-GeneticSource- >>> MapLocation-OtherAliases-OtherDesignations-NomenclatureSymbol- >>> NomenclatureName-NomenclatureStatus-TaxID-Mim-GenomicInfo- >>> GeneWeight-Summary-ChrSort-ChrStart >>> Name :LOC100039753 >>> Description :similar to putative >>> Orgname :Mus musculus >>> Chromosome :Y >>> GeneticSource :genomic >>> MapLocation :Y B >>> OtherDesignations :hypothetical protein LOC100039753 >>> TaxID :10090 >>> GeneWeight :401 >>> ChrSort :~~last >>> ChrStart :999999999 >>> >>> yet on the NCBI web page it shows a defined start and a stop for >>> the locus. >>> >>> I can fix the perl code so that it won't barf when the field isn't >>> defined, but I'd like to get all the data and I can't figure out >>> if the problem is on the client side or the server side. All the >>> data comes from the same source, doesn't it? >>> >>> Thanks >>> >>> Mike >>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > Michael Muratet, Ph.D. Senior Scientist HudsonAlpha Institute for Biotechnology mmuratet at hudsonalpha.org (256) 327-0473 (p) (256) 327-0966 (f) Room 4005 601 Genome Way Huntsville, Alabama 35806 From scott at scottcain.net Tue Feb 2 13:34:29 2010 From: scott at scottcain.net (Scott Cain) Date: Tue, 2 Feb 2010 13:34:29 -0500 Subject: [Bioperl-l] Mapping Blast results In-Reply-To: <9fcc48c71002020716u2981596ekf3ca36cc693200f0@mail.gmail.com> References: <9fcc48c71002020716u2981596ekf3ca36cc693200f0@mail.gmail.com> Message-ID: <4536f7701002021034p253c78e4u5c9b9c71cc0cb62@mail.gmail.com> Hi Shalabh, When you say it's not working, what do you mean? Presumably, you could write a script that uses Bio::DB::SeqFeature::Store if speed is the problem, but if you wanted to go that far, your time would probably be better spent installing GBrowse. Scott On Tue, Feb 2, 2010 at 10:16 AM, shalabh sharma wrote: > Hi, > ? ?I am trying to map blast results on a gene. I am using Bio::Graphics > ? ?http://bioperl.org/wiki/HOWTO:Graphics > > This is working fine for small blast files but not for large blast file with > lot of hits. Is there any way i can use this for large blast files? > > Thanks > Shalabh > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From scott at scottcain.net Tue Feb 2 13:55:02 2010 From: scott at scottcain.net (Scott Cain) Date: Tue, 2 Feb 2010 13:55:02 -0500 Subject: [Bioperl-l] Mapping Blast results In-Reply-To: <9fcc48c71002021047q4058efaes735f9ecbdde8b66a@mail.gmail.com> References: <9fcc48c71002020716u2981596ekf3ca36cc693200f0@mail.gmail.com> <4536f7701002021034p253c78e4u5c9b9c71cc0cb62@mail.gmail.com> <9fcc48c71002021047q4058efaes735f9ecbdde8b66a@mail.gmail.com> Message-ID: <4536f7701002021055l2e6ac98cwa95b5a5d13414f77@mail.gmail.com> Hi Shalabh, Please keep your responses on the list. A few things: * While it interesting that the resulting png isn't readable (and I don't know why--perhaps the script is silently crashing?), I have to wonder if a graphic with 1500 hits for a particular gene will be of that much use anyway? Perhaps, but it seems to me that 500 will do just as well as 1500 :-) * When you say you don't know how to map hits to a particular gene, what is the hang up? Coordinate mapping? Converting blast results to GFF? Scott On Tue, Feb 2, 2010 at 1:47 PM, shalabh sharma wrote: > Hi, > ?? ? thanks for your quick reply. > Actually i am getting a png file but its not readable (there are about 1500 > hits). > I have a working Gbrowse on my system but i don't know how i can map my > blast reads to a particular gene. > Thanks > Shalabh > > On Tue, Feb 2, 2010 at 1:34 PM, Scott Cain wrote: >> >> Hi Shalabh, >> >> When you say it's not working, what do you mean? ?Presumably, you >> could write a script that uses Bio::DB::SeqFeature::Store if speed is >> the problem, but if you wanted to go that far, your time would >> probably be better spent installing GBrowse. >> >> Scott >> >> >> On Tue, Feb 2, 2010 at 10:16 AM, shalabh sharma >> wrote: >> > Hi, >> > ? ?I am trying to map blast results on a gene. I am using Bio::Graphics >> > ? ?http://bioperl.org/wiki/HOWTO:Graphics >> > >> > This is working fine for small blast files but not for large blast file >> > with >> > lot of hits. Is there any way i can use this for large blast files? >> > >> > Thanks >> > Shalabh >> > _______________________________________________ >> > Bioperl-l mailing list >> > Bioperl-l at lists.open-bio.org >> > http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > >> >> >> >> -- >> ------------------------------------------------------------------------ >> Scott Cain, Ph. D. ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? scott at scottcain >> dot net >> GMOD Coordinator (http://gmod.org/) ? ? ? ? ? ? ? ? ? ? 216-392-3087 >> Ontario Institute for Cancer Research > > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From shalabh.sharma7 at gmail.com Tue Feb 2 14:05:17 2010 From: shalabh.sharma7 at gmail.com (shalabh sharma) Date: Tue, 2 Feb 2010 14:05:17 -0500 Subject: [Bioperl-l] Mapping Blast results In-Reply-To: <4536f7701002021055l2e6ac98cwa95b5a5d13414f77@mail.gmail.com> References: <9fcc48c71002020716u2981596ekf3ca36cc693200f0@mail.gmail.com> <4536f7701002021034p253c78e4u5c9b9c71cc0cb62@mail.gmail.com> <9fcc48c71002021047q4058efaes735f9ecbdde8b66a@mail.gmail.com> <4536f7701002021055l2e6ac98cwa95b5a5d13414f77@mail.gmail.com> Message-ID: <9fcc48c71002021105h60625687q13b013c4fedd0abe@mail.gmail.com> Hi, Its not readable as the reads are so closed that i can just see one vertical line. Do i need to increase the panel size? I am not sure that how to convert blast results to gff file. Thanks Shalabh On Tue, Feb 2, 2010 at 1:55 PM, Scott Cain wrote: > Hi Shalabh, > > Please keep your responses on the list. > > A few things: > > * While it interesting that the resulting png isn't readable (and I > don't know why--perhaps the script is silently crashing?), I have to > wonder if a graphic with 1500 hits for a particular gene will be of > that much use anyway? Perhaps, but it seems to me that 500 will do > just as well as 1500 :-) > > * When you say you don't know how to map hits to a particular gene, > what is the hang up? Coordinate mapping? Converting blast results to > GFF? > > Scott > > > On Tue, Feb 2, 2010 at 1:47 PM, shalabh sharma > wrote: > > Hi, > > thanks for your quick reply. > > Actually i am getting a png file but its not readable (there are about > 1500 > > hits). > > I have a working Gbrowse on my system but i don't know how i can map my > > blast reads to a particular gene. > > Thanks > > Shalabh > > > > On Tue, Feb 2, 2010 at 1:34 PM, Scott Cain wrote: > >> > >> Hi Shalabh, > >> > >> When you say it's not working, what do you mean? Presumably, you > >> could write a script that uses Bio::DB::SeqFeature::Store if speed is > >> the problem, but if you wanted to go that far, your time would > >> probably be better spent installing GBrowse. > >> > >> Scott > >> > >> > >> On Tue, Feb 2, 2010 at 10:16 AM, shalabh sharma > >> wrote: > >> > Hi, > >> > I am trying to map blast results on a gene. I am using > Bio::Graphics > >> > http://bioperl.org/wiki/HOWTO:Graphics > >> > > >> > This is working fine for small blast files but not for large blast > file > >> > with > >> > lot of hits. Is there any way i can use this for large blast files? > >> > > >> > Thanks > >> > Shalabh > >> > _______________________________________________ > >> > Bioperl-l mailing list > >> > Bioperl-l at lists.open-bio.org > >> > http://lists.open-bio.org/mailman/listinfo/bioperl-l > >> > > >> > >> > >> > >> -- > >> ------------------------------------------------------------------------ > >> Scott Cain, Ph. D. scott at scottcain > >> dot net > >> GMOD Coordinator (http://gmod.org/) 216-392-3087 > >> Ontario Institute for Cancer Research > > > > > > > > -- > ------------------------------------------------------------------------ > Scott Cain, Ph. D. scott at scottcain dot > net > GMOD Coordinator (http://gmod.org/) 216-392-3087 > Ontario Institute for Cancer Research > From scott at scottcain.net Tue Feb 2 14:28:08 2010 From: scott at scottcain.net (Scott Cain) Date: Tue, 2 Feb 2010 14:28:08 -0500 Subject: [Bioperl-l] Mapping Blast results In-Reply-To: <9fcc48c71002021105h60625687q13b013c4fedd0abe@mail.gmail.com> References: <9fcc48c71002020716u2981596ekf3ca36cc693200f0@mail.gmail.com> <4536f7701002021034p253c78e4u5c9b9c71cc0cb62@mail.gmail.com> <9fcc48c71002021047q4058efaes735f9ecbdde8b66a@mail.gmail.com> <4536f7701002021055l2e6ac98cwa95b5a5d13414f77@mail.gmail.com> <9fcc48c71002021105h60625687q13b013c4fedd0abe@mail.gmail.com> Message-ID: <4536f7701002021128kcea35cfm50f651b6c9c8ca43@mail.gmail.com> Ah, not readable as in "my eyes can't make anything out" not as in "the file is corrupted and my image viewing software won't open it." That's two different things. Yes, I imagine making the panel taller would help, but in Bio::Graphics::Panel you don't have direct control of the panel height. Instead, you can try changing the value of the bump parameter. Try +1 or +2 to see if that helps. See the perldoc for Bio::Graphics::Panel for more info. For converting blast results to GFF, see the bioperl script bp_search2gff.pl. Scott On Tue, Feb 2, 2010 at 2:05 PM, shalabh sharma wrote: > Hi, > ?? ?Its not readable as the reads are so closed that i can just see one > vertical line. Do i need to increase the panel size? > I am not sure that how to convert blast results to gff file. > Thanks > Shalabh > On Tue, Feb 2, 2010 at 1:55 PM, Scott Cain wrote: >> >> Hi Shalabh, >> >> Please keep your responses on the list. >> >> A few things: >> >> * While it interesting that the resulting png isn't readable (and I >> don't know why--perhaps the script is silently crashing?), I have to >> wonder if a graphic with 1500 hits for a particular gene will be of >> that much use anyway? ?Perhaps, but it seems to me that 500 will do >> just as well as 1500 :-) >> >> * When you say you don't know how to map hits to a particular gene, >> what is the hang up? ?Coordinate mapping? ?Converting blast results to >> GFF? >> >> Scott >> >> >> On Tue, Feb 2, 2010 at 1:47 PM, shalabh sharma >> wrote: >> > Hi, >> > ?? ? thanks for your quick reply. >> > Actually i am getting a png file but its not readable (there are about >> > 1500 >> > hits). >> > I have a working Gbrowse on my system but i don't know how i can map my >> > blast reads to a particular gene. >> > Thanks >> > Shalabh >> > >> > On Tue, Feb 2, 2010 at 1:34 PM, Scott Cain wrote: >> >> >> >> Hi Shalabh, >> >> >> >> When you say it's not working, what do you mean? ?Presumably, you >> >> could write a script that uses Bio::DB::SeqFeature::Store if speed is >> >> the problem, but if you wanted to go that far, your time would >> >> probably be better spent installing GBrowse. >> >> >> >> Scott >> >> >> >> >> >> On Tue, Feb 2, 2010 at 10:16 AM, shalabh sharma >> >> wrote: >> >> > Hi, >> >> > ? ?I am trying to map blast results on a gene. I am using >> >> > Bio::Graphics >> >> > ? ?http://bioperl.org/wiki/HOWTO:Graphics >> >> > >> >> > This is working fine for small blast files but not for large blast >> >> > file >> >> > with >> >> > lot of hits. Is there any way i can use this for large blast files? >> >> > >> >> > Thanks >> >> > Shalabh >> >> > _______________________________________________ >> >> > Bioperl-l mailing list >> >> > Bioperl-l at lists.open-bio.org >> >> > http://lists.open-bio.org/mailman/listinfo/bioperl-l >> >> > >> >> >> >> >> >> >> >> -- >> >> >> >> ------------------------------------------------------------------------ >> >> Scott Cain, Ph. D. ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? scott at scottcain >> >> dot net >> >> GMOD Coordinator (http://gmod.org/) ? ? ? ? ? ? ? ? ? ? 216-392-3087 >> >> Ontario Institute for Cancer Research >> > >> > >> >> >> >> -- >> ------------------------------------------------------------------------ >> Scott Cain, Ph. D. ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? scott at scottcain >> dot net >> GMOD Coordinator (http://gmod.org/) ? ? ? ? ? ? ? ? ? ? 216-392-3087 >> Ontario Institute for Cancer Research > > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From robert.bradbury at gmail.com Wed Feb 3 04:56:49 2010 From: robert.bradbury at gmail.com (Robert Bradbury) Date: Wed, 3 Feb 2010 04:56:49 -0500 Subject: [Bioperl-l] Problems at NCBI with Javascript Message-ID: Hello, I know there will be people who think that this may be off-topic for the list, but I don't know of another list that might be better. The basic problem is the recent trend by NCBI to insist that browsers must have Javascript turned on to work. Now I have a real problem with this. All the databases at NCBI have been writen over the last 15 or so years to produce basic HTML.. Switching this to a requirement for HTML + Javascript in my mind is simply wrong-headed. I will save the list my list of problems I have with Javascrip (it isn't short)t. I'll simply provide the first one -- It isn't Green. So I would like to ask if anyone knows why NCBI is heading in this direction? (Tonight they seem to have broken non-Javascript enabled use of PubMed and that has me a little upset. Is there a better source to taking with anyone directly at NCBI about this than the Help Desk (I usually find them unresponsive). Do they have something like a Bugzilla bug database or an internal developers mailing list? I'm sorry in advance for troubling list members with this but I didn't know of anwhere else to turn [1]. Thank you, Robert Bradbury 1. I'll also note that they could at least leave the old interface active indefinitely. Google/gmail show that its perfectly possible to write "enhanced" user interfaces that work with HTML+Javascript and simple HTML. From rtbio.2009 at gmail.com Wed Feb 3 08:50:07 2010 From: rtbio.2009 at gmail.com (Roopa Raghuveer) Date: Wed, 3 Feb 2010 14:50:07 +0100 Subject: [Bioperl-l] string comparision mismatches and matches Message-ID: Hello all, I have a problem. I would like to compare two strings of equal length using Perl and trying to count the number of matches. I have used an algorithm to count the matches,but I found them to be time consuming in counting the mismatches. Could any one suggest a better algorithm for this? Ex:- Let the two sequences be input:- ACCTCCTCCTCGAGTATGTG target:- TATCTTGCGCCGGAGATAAT The no of matches are 5. In my program, I have used two indices which run along the string in the forward direction and reverse direction and extracting two characters at a time and comparing them with the respective characters in the output sequence. i.e., i/p:- ACCTCCTCCTCGAGTATGTG I have extracted 'A' in the forward direction and 'G' in the reverse direction from the i/p. target:- TATCTTGCGCCGGAGATAAT I have extracted 'T' in the forward direction and 'T' in the reverse direction from the target and compared 'A' with 'T' and 'G' with 'T'. In this way, I have proceeded along the complete length of the string and counting the matches . But it seems to be time consuming though its working. Can anybody suggest a better way? Regards, Roopa. From maj at fortinbras.us Wed Feb 3 09:50:05 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Wed, 3 Feb 2010 09:50:05 -0500 Subject: [Bioperl-l] string comparision mismatches and matches In-Reply-To: References: Message-ID: try this: $in = 'ACCTCCTCCTCGAGTATGTG'; $tgt = 'TATCTTGCGCCGGAGATAAT'; $mask = pack("A*",$in)^pack("A*",$tgt); $matches = $mask =~ tr/"\x0"/"\x0"/; ----- Original Message ----- From: "Roopa Raghuveer" To: Cc: "Chris Fields" ; "Mark A. Jensen" Sent: Wednesday, February 03, 2010 8:50 AM Subject: [Bioperl-l] string comparision mismatches and matches > Hello all, > > I have a problem. I would like to compare two strings of equal length using > Perl and trying to count the number of matches. I have used an algorithm to > count the matches,but I found them to be time consuming in counting the > mismatches. Could any one suggest a better algorithm for this? > > Ex:- Let the two sequences be > > input:- ACCTCCTCCTCGAGTATGTG > > target:- TATCTTGCGCCGGAGATAAT > > The no of matches are 5. > > In my program, I have used two indices which run along the string in the > forward direction and reverse direction and extracting two characters at a > time and comparing them with the respective characters in the output > sequence. i.e., > > i/p:- ACCTCCTCCTCGAGTATGTG > > I have extracted 'A' in the forward direction and 'G' in the reverse > direction from the i/p. > > target:- TATCTTGCGCCGGAGATAAT > > I have extracted 'T' in the forward direction and 'T' in the reverse > direction from the target and compared 'A' with 'T' and 'G' with 'T'. In > this way, I have proceeded along the complete length of the string and > counting the matches . But it seems to be time consuming though its working. > Can anybody suggest a better way? > > Regards, > Roopa. > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From laceyanne_sanderson at shaw.ca Wed Feb 3 10:32:56 2010 From: laceyanne_sanderson at shaw.ca (Lacey Sanderson) Date: Wed, 03 Feb 2010 09:32:56 -0600 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem to change Message-ID: Hi, I just checked out bioperl-live from svn as follows $ sudo svn co svn://code.open-bio.org/bioperl/bioperl-live/trunk bioperl-live $ echo 'export PERL5LIB="/usr/local/share/applications/Bioperl/bioperl- live:$PERL5LIB" #adds path of bioperl-live to PERL5LIB' >> ~/.profile I then restarted by shell and checked the version using $ perl -MBio::Perl -le 'print Bio::Perl->VERSION;' Which returned 1.006... I am sure this is the version of bioperl I had previously installed and not the new one I just checked out. Is there something else I need to do to tell my system to disregard the old bioperl and instead use bioperl-live? I had installed BioPerl earlier using the GMOD GBrowse netinstaller although I can't seem to locate where it was installed. Will this effect anything? If so, how would I go about uninstalling it (I'm a little new to Linux)... By the way I am running Ubuntu Intrepid Thanks! Lacey From scott at scottcain.net Wed Feb 3 12:08:26 2010 From: scott at scottcain.net (Scott Cain) Date: Wed, 3 Feb 2010 12:08:26 -0500 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem to change In-Reply-To: References: Message-ID: <4536f7701002030908p558e697ex79087c2ad0d9970d@mail.gmail.com> Hi Lacey, Did it return "1.006" or 1.0069 exactly, which is what it should return if you are using bioperl-live? As for older versions of BioPerl, if you go down the install route (which is what I generally do), you can do the standard build procedure: perl Build.PL ./Build ./Build test sudo ./Build install --uninst 1 and that last part (--uninst 1) will remove older versions of the modules installed elsewhere. If you want to hunt down where GBrowse installed bioperl, first you could look in the typical linuxy places, like /usr/share/perl or /usr/lib/perl for the Bio directory, but deleting that directory will also delete Bio::Graphics, which isn't part of bioperl anymore. Using PERL5LIB will put your specified bioperl at the front of the list for when things get used, so even if you have bioperl installed elsewhere, that one should get used. Scott On Wed, Feb 3, 2010 at 10:32 AM, Lacey Sanderson wrote: > Hi, > > I just checked out bioperl-live from svn as follows > $ sudo svn co svn://code.open-bio.org/bioperl/bioperl-live/trunk > bioperl-live > $ echo 'export > PERL5LIB="/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB" > ?#adds path of bioperl-live to PERL5LIB' >> ~/.profile > > I then restarted by shell and checked the version using > $ perl -MBio::Perl -le 'print Bio::Perl->VERSION;' > > Which returned 1.006... I am sure this is the version of bioperl I had > previously installed and not the new one I just checked out. Is there > something else I need to do to tell my system to disregard the old bioperl > and instead use bioperl-live? > > I had installed BioPerl earlier using the GMOD GBrowse netinstaller although > I can't seem to locate where it was installed. Will this effect anything? If > so, how would I go about uninstalling it (I'm a little new to Linux)... > > By the way I am running Ubuntu Intrepid > > Thanks! > Lacey > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From laceyanne_sanderson at shaw.ca Wed Feb 3 12:32:47 2010 From: laceyanne_sanderson at shaw.ca (Lacey Sanderson) Date: Wed, 03 Feb 2010 11:32:47 -0600 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem to change In-Reply-To: <4536f7701002030908p558e697ex79087c2ad0d9970d@mail.gmail.com> References: <4536f7701002030908p558e697ex79087c2ad0d9970d@mail.gmail.com> Message-ID: It returned "1.006"... unfortunately no 9 at the end... Any idea why it doesn't seem to be picking up the bioperl-live checkout? PS. Why do the distributions have version numbers like 1.5.2 and 1.6 and yet the bioperl-live (which I take to be the most current version of bioperl) returns 1.069? Lacey On 3-Feb-10, at 11:08 AM, Scott Cain wrote: > Hi Lacey, > > Did it return "1.006" or 1.0069 exactly, which is what it should > return if you are using bioperl-live? As for older versions of > BioPerl, if you go down the install route (which is what I generally > do), you can do the standard build procedure: > > perl Build.PL > ./Build > ./Build test > sudo ./Build install --uninst 1 > > and that last part (--uninst 1) will remove older versions of the > modules installed elsewhere. If you want to hunt down where GBrowse > installed bioperl, first you could look in the typical linuxy places, > like /usr/share/perl or /usr/lib/perl for the Bio directory, but > deleting that directory will also delete Bio::Graphics, which isn't > part of bioperl anymore. > > Using PERL5LIB will put your specified bioperl at the front of the > list for when things get used, so even if you have bioperl installed > elsewhere, that one should get used. > > Scott > > > On Wed, Feb 3, 2010 at 10:32 AM, Lacey Sanderson > wrote: >> Hi, >> >> I just checked out bioperl-live from svn as follows >> $ sudo svn co svn://code.open-bio.org/bioperl/bioperl-live/trunk >> bioperl-live >> $ echo 'export >> PERL5LIB="/usr/local/share/applications/Bioperl/bioperl-live: >> $PERL5LIB" >> #adds path of bioperl-live to PERL5LIB' >> ~/.profile >> >> I then restarted by shell and checked the version using >> $ perl -MBio::Perl -le 'print Bio::Perl->VERSION;' >> >> Which returned 1.006... I am sure this is the version of bioperl I >> had >> previously installed and not the new one I just checked out. Is there >> something else I need to do to tell my system to disregard the old >> bioperl >> and instead use bioperl-live? >> >> I had installed BioPerl earlier using the GMOD GBrowse netinstaller >> although >> I can't seem to locate where it was installed. Will this effect >> anything? If >> so, how would I go about uninstalling it (I'm a little new to >> Linux)... >> >> By the way I am running Ubuntu Intrepid >> >> Thanks! >> Lacey >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > > > > -- > ------------------------------------------------------------------------ > Scott Cain, Ph. D. scott at > scottcain dot net > GMOD Coordinator (http://gmod.org/) 216-392-3087 > Ontario Institute for Cancer Research --------------------------------- Lacey Sanderson Bioinformaticist Pulse Crop Breeding and Genetics Department of Plant Sciences University of Saskatchewan. From cjfields at illinois.edu Wed Feb 3 12:58:26 2010 From: cjfields at illinois.edu (Chris Fields) Date: Wed, 3 Feb 2010 11:58:26 -0600 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem to change In-Reply-To: References: <4536f7701002030908p558e697ex79087c2ad0d9970d@mail.gmail.com> Message-ID: If I were to choose a versioning scheme I would probably go with simple bipartite ones, but (by legacy) BioPerl is based on the perl tripartite versioning scheme: http://www.perl.com/pub/a/2000/04/whatsnew.html#Version_Tuples For instance, with perl 5.10.1 the vstring v5.10.1 would be 5.010001. BioPerl 1.6.0 is 1.006000, so you either have an older lingering CPAN installation of 1.6.0 or you are installing that one (I'm guessing the former, as the latest from CPAN is now 1.6.1, or 1.006001). Effectively, svn trunk is v1.6.900 (roughly the 900th point release). The reason trunk is set at 1.6.9 is simple, it supercedes any current CPAN release version, and it's very unlikely we'll release another 898 point releases of BioPerl. It also tags the main trunk as pre-1.7. I can cherry-pick off trunk into the properly designated 1.6 branch (now at 1.6.1), bump the release there, and push to CPAN. Once we actively start on 1.7 we'll probably bump trunk up to something higher. It also enables one to distinguish between a CPAN release of BioPerl vs. the SVN version, as installing from svn has unfortunately become very common practice. Comes in useful in cases like yours, where we're trying to distinguish what version is being called. chris On Feb 3, 2010, at 11:32 AM, Lacey Sanderson wrote: > It returned "1.006"... unfortunately no 9 at the end... Any idea why it doesn't seem to be picking up the bioperl-live checkout? > > PS. Why do the distributions have version numbers like 1.5.2 and 1.6 and yet the bioperl-live (which I take to be the most current version of bioperl) returns 1.069? > > Lacey > > On 3-Feb-10, at 11:08 AM, Scott Cain wrote: > >> Hi Lacey, >> >> Did it return "1.006" or 1.0069 exactly, which is what it should >> return if you are using bioperl-live? As for older versions of >> BioPerl, if you go down the install route (which is what I generally >> do), you can do the standard build procedure: >> >> perl Build.PL >> ./Build >> ./Build test >> sudo ./Build install --uninst 1 >> >> and that last part (--uninst 1) will remove older versions of the >> modules installed elsewhere. If you want to hunt down where GBrowse >> installed bioperl, first you could look in the typical linuxy places, >> like /usr/share/perl or /usr/lib/perl for the Bio directory, but >> deleting that directory will also delete Bio::Graphics, which isn't >> part of bioperl anymore. >> >> Using PERL5LIB will put your specified bioperl at the front of the >> list for when things get used, so even if you have bioperl installed >> elsewhere, that one should get used. >> >> Scott >> >> >> On Wed, Feb 3, 2010 at 10:32 AM, Lacey Sanderson >> wrote: >>> Hi, >>> >>> I just checked out bioperl-live from svn as follows >>> $ sudo svn co svn://code.open-bio.org/bioperl/bioperl-live/trunk >>> bioperl-live >>> $ echo 'export >>> PERL5LIB="/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB" >>> #adds path of bioperl-live to PERL5LIB' >> ~/.profile >>> >>> I then restarted by shell and checked the version using >>> $ perl -MBio::Perl -le 'print Bio::Perl->VERSION;' >>> >>> Which returned 1.006... I am sure this is the version of bioperl I had >>> previously installed and not the new one I just checked out. Is there >>> something else I need to do to tell my system to disregard the old bioperl >>> and instead use bioperl-live? >>> >>> I had installed BioPerl earlier using the GMOD GBrowse netinstaller although >>> I can't seem to locate where it was installed. Will this effect anything? If >>> so, how would I go about uninstalling it (I'm a little new to Linux)... >>> >>> By the way I am running Ubuntu Intrepid >>> >>> Thanks! >>> Lacey >>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >> >> >> >> -- >> ------------------------------------------------------------------------ >> Scott Cain, Ph. D. scott at scottcain dot net >> GMOD Coordinator (http://gmod.org/) 216-392-3087 >> Ontario Institute for Cancer Research > > --------------------------------- > Lacey Sanderson > Bioinformaticist > Pulse Crop Breeding and Genetics > Department of Plant Sciences > University of Saskatchewan. > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From maj at fortinbras.us Wed Feb 3 12:50:20 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Wed, 3 Feb 2010 12:50:20 -0500 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: References: Message-ID: Presumably your pwd was /usr/local/share/applications/Bioperl when you did the checkout? ----- Original Message ----- From: "Lacey Sanderson" To: Sent: Wednesday, February 03, 2010 10:32 AM Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange > Hi, > > I just checked out bioperl-live from svn as follows > $ sudo svn co svn://code.open-bio.org/bioperl/bioperl-live/trunk bioperl-live > $ echo 'export PERL5LIB="/usr/local/share/applications/Bioperl/bioperl- > live:$PERL5LIB" #adds path of bioperl-live to PERL5LIB' >> ~/.profile > > I then restarted by shell and checked the version using > $ perl -MBio::Perl -le 'print Bio::Perl->VERSION;' > > Which returned 1.006... I am sure this is the version of bioperl I had > previously installed and not the new one I just checked out. Is there > something else I need to do to tell my system to disregard the old bioperl > and instead use bioperl-live? > > I had installed BioPerl earlier using the GMOD GBrowse netinstaller although > I can't seem to locate where it was installed. Will this effect anything? If > so, how would I go about uninstalling it (I'm a little new to Linux)... > > By the way I am running Ubuntu Intrepid > > Thanks! > Lacey > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From laceyanne_sanderson at shaw.ca Wed Feb 3 12:58:09 2010 From: laceyanne_sanderson at shaw.ca (Lacey Sanderson) Date: Wed, 03 Feb 2010 11:58:09 -0600 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: References: Message-ID: Yes, Sorry I forgot to include that. I checked and the bioperl-live checkout is where I told export it would be (ie: in /usr/local/share/ applications/Bioperl/) Lacey On 3-Feb-10, at 11:50 AM, Mark A. Jensen wrote: > Presumably your pwd was > /usr/local/share/applications/Bioperl > when you did the checkout? > ----- Original Message ----- From: "Lacey Sanderson" > > To: > Sent: Wednesday, February 03, 2010 10:32 AM > Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't > seem tochange > > >> Hi, >> >> I just checked out bioperl-live from svn as follows >> $ sudo svn co svn://code.open-bio.org/bioperl/bioperl-live/trunk >> bioperl-live >> $ echo 'export PERL5LIB="/usr/local/share/applications/Bioperl/ >> bioperl- live:$PERL5LIB" #adds path of bioperl-live to PERL5LIB' >> >> ~/.profile >> >> I then restarted by shell and checked the version using >> $ perl -MBio::Perl -le 'print Bio::Perl->VERSION;' >> >> Which returned 1.006... I am sure this is the version of bioperl I >> had previously installed and not the new one I just checked out. Is >> there something else I need to do to tell my system to disregard >> the old bioperl and instead use bioperl-live? >> >> I had installed BioPerl earlier using the GMOD GBrowse >> netinstaller although I can't seem to locate where it was >> installed. Will this effect anything? If so, how would I go about >> uninstalling it (I'm a little new to Linux)... >> >> By the way I am running Ubuntu Intrepid >> >> Thanks! >> Lacey >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > --------------------------------- Lacey Sanderson Bioinformaticist Pulse Crop Breeding and Genetics Department of Plant Sciences University of Saskatchewan. From maj at fortinbras.us Wed Feb 3 13:10:09 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Wed, 3 Feb 2010 13:10:09 -0500 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn'tseem to change In-Reply-To: References: <4536f7701002030908p558e697ex79087c2ad0d9970d@mail.gmail.com> Message-ID: The practicality of it is that you can do a simple float comparision to check for prerequisites and such: if ($Bio::Perl::VERSION < 1.006900) { die "You ain't got the freshest." } Harder if you're comparing strings "1.6" to "1.6.9". MAJ ----- Original Message ----- From: "Chris Fields" To: "Lacey Sanderson" Cc: ; "Scott Cain" Sent: Wednesday, February 03, 2010 12:58 PM Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn'tseem to change > If I were to choose a versioning scheme I would probably go with simple > bipartite ones, but (by legacy) BioPerl is based on the perl tripartite > versioning scheme: > > http://www.perl.com/pub/a/2000/04/whatsnew.html#Version_Tuples > > For instance, with perl 5.10.1 the vstring v5.10.1 would be 5.010001. BioPerl > 1.6.0 is 1.006000, so you either have an older lingering CPAN installation of > 1.6.0 or you are installing that one (I'm guessing the former, as the latest > from CPAN is now 1.6.1, or 1.006001). Effectively, svn trunk is v1.6.900 > (roughly the 900th point release). > > The reason trunk is set at 1.6.9 is simple, it supercedes any current CPAN > release version, and it's very unlikely we'll release another 898 point > releases of BioPerl. It also tags the main trunk as pre-1.7. I can > cherry-pick off trunk into the properly designated 1.6 branch (now at 1.6.1), > bump the release there, and push to CPAN. Once we actively start on 1.7 we'll > probably bump trunk up to something higher. It also enables one to > distinguish between a CPAN release of BioPerl vs. the SVN version, as > installing from svn has unfortunately become very common practice. Comes in > useful in cases like yours, where we're trying to distinguish what version is > being called. > > chris > > On Feb 3, 2010, at 11:32 AM, Lacey Sanderson wrote: > >> It returned "1.006"... unfortunately no 9 at the end... Any idea why it >> doesn't seem to be picking up the bioperl-live checkout? >> >> PS. Why do the distributions have version numbers like 1.5.2 and 1.6 and yet >> the bioperl-live (which I take to be the most current version of bioperl) >> returns 1.069? >> >> Lacey >> >> On 3-Feb-10, at 11:08 AM, Scott Cain wrote: >> >>> Hi Lacey, >>> >>> Did it return "1.006" or 1.0069 exactly, which is what it should >>> return if you are using bioperl-live? As for older versions of >>> BioPerl, if you go down the install route (which is what I generally >>> do), you can do the standard build procedure: >>> >>> perl Build.PL >>> ./Build >>> ./Build test >>> sudo ./Build install --uninst 1 >>> >>> and that last part (--uninst 1) will remove older versions of the >>> modules installed elsewhere. If you want to hunt down where GBrowse >>> installed bioperl, first you could look in the typical linuxy places, >>> like /usr/share/perl or /usr/lib/perl for the Bio directory, but >>> deleting that directory will also delete Bio::Graphics, which isn't >>> part of bioperl anymore. >>> >>> Using PERL5LIB will put your specified bioperl at the front of the >>> list for when things get used, so even if you have bioperl installed >>> elsewhere, that one should get used. >>> >>> Scott >>> >>> >>> On Wed, Feb 3, 2010 at 10:32 AM, Lacey Sanderson >>> wrote: >>>> Hi, >>>> >>>> I just checked out bioperl-live from svn as follows >>>> $ sudo svn co svn://code.open-bio.org/bioperl/bioperl-live/trunk >>>> bioperl-live >>>> $ echo 'export >>>> PERL5LIB="/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB" >>>> #adds path of bioperl-live to PERL5LIB' >> ~/.profile >>>> >>>> I then restarted by shell and checked the version using >>>> $ perl -MBio::Perl -le 'print Bio::Perl->VERSION;' >>>> >>>> Which returned 1.006... I am sure this is the version of bioperl I had >>>> previously installed and not the new one I just checked out. Is there >>>> something else I need to do to tell my system to disregard the old bioperl >>>> and instead use bioperl-live? >>>> >>>> I had installed BioPerl earlier using the GMOD GBrowse netinstaller >>>> although >>>> I can't seem to locate where it was installed. Will this effect anything? >>>> If >>>> so, how would I go about uninstalling it (I'm a little new to Linux)... >>>> >>>> By the way I am running Ubuntu Intrepid >>>> >>>> Thanks! >>>> Lacey >>>> >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>> >>> >>> >>> >>> -- >>> ------------------------------------------------------------------------ >>> Scott Cain, Ph. D. scott at scottcain dot >>> net >>> GMOD Coordinator (http://gmod.org/) 216-392-3087 >>> Ontario Institute for Cancer Research >> >> --------------------------------- >> Lacey Sanderson >> Bioinformaticist >> Pulse Crop Breeding and Genetics >> Department of Plant Sciences >> University of Saskatchewan. >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From laceyanne_sanderson at shaw.ca Wed Feb 3 13:21:03 2010 From: laceyanne_sanderson at shaw.ca (Lacey Sanderson) Date: Wed, 03 Feb 2010 12:21:03 -0600 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <0A23A254827D40A997BF65EE356183E6@NewLife> References: <0A23A254827D40A997BF65EE356183E6@NewLife> Message-ID: That gives me the 1.069 that I was wanting so the problem seems to be that perl is finding the wrong installation by default... Lacey On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: > perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- > live -le 'print Bio::Perl->VERSION;' From Jonathan_Epstein at nih.gov Wed Feb 3 13:40:56 2010 From: Jonathan_Epstein at nih.gov (Jonathan Epstein) Date: Wed, 03 Feb 2010 13:40:56 -0500 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem to change In-Reply-To: References: <4536f7701002030908p558e697ex79087c2ad0d9970d@mail.gmail.com> Message-ID: <4B69C338.1080501@nih.gov> Here's a low-tech answer: if I'm having a problem like this with my PERL5LIB environment variable [Perl is not doing what I /think/ I've told it to do], I try to figure out where Perl is actually looking. So in this case I'd write a two-line BioPerl program foo.pl like: use Bio::Perl; $seq_object = get_sequence('swiss',"ROA1_HUMAN"); Start the debugger with "perl -d:ptkdb foo.pl", set a breakpoint before line 2, run to that breakpoint, and then step-in with the debugger and see in which directory the corresponding source file resides. My guess is that there are additional subordinates paths within /usr/local/share/applications/Bioperl/bioperl-live which must be added to your path. Cheers, Jonathan On 2/3/2010 12:58 PM, Chris Fields wrote: > If I were to choose a versioning scheme I would probably go with simple bipartite ones, but (by legacy) BioPerl is based on the perl tripartite versioning scheme: > > http://www.perl.com/pub/a/2000/04/whatsnew.html#Version_Tuples > > For instance, with perl 5.10.1 the vstring v5.10.1 would be 5.010001. BioPerl 1.6.0 is 1.006000, so you either have an older lingering CPAN installation of 1.6.0 or you are installing that one (I'm guessing the former, as the latest from CPAN is now 1.6.1, or 1.006001). Effectively, svn trunk is v1.6.900 (roughly the 900th point release). > > The reason trunk is set at 1.6.9 is simple, it supercedes any current CPAN release version, and it's very unlikely we'll release another 898 point releases of BioPerl. It also tags the main trunk as pre-1.7. I can cherry-pick off trunk into the properly designated 1.6 branch (now at 1.6.1), bump the release there, and push to CPAN. Once we actively start on 1.7 we'll probably bump trunk up to something higher. It also enables one to distinguish between a CPAN release of BioPerl vs. the SVN version, as installing from svn has unfortunately become very common practice. Comes in useful in cases like yours, where we're trying to distinguish what version is being called. > > chris > > On Feb 3, 2010, at 11:32 AM, Lacey Sanderson wrote: > > >> It returned "1.006"... unfortunately no 9 at the end... Any idea why it doesn't seem to be picking up the bioperl-live checkout? >> >> PS. Why do the distributions have version numbers like 1.5.2 and 1.6 and yet the bioperl-live (which I take to be the most current version of bioperl) returns 1.069? >> >> Lacey >> >> On 3-Feb-10, at 11:08 AM, Scott Cain wrote: >> >> >>> Hi Lacey, >>> >>> Did it return "1.006" or 1.0069 exactly, which is what it should >>> return if you are using bioperl-live? As for older versions of >>> BioPerl, if you go down the install route (which is what I generally >>> do), you can do the standard build procedure: >>> >>> perl Build.PL >>> ./Build >>> ./Build test >>> sudo ./Build install --uninst 1 >>> >>> and that last part (--uninst 1) will remove older versions of the >>> modules installed elsewhere. If you want to hunt down where GBrowse >>> installed bioperl, first you could look in the typical linuxy places, >>> like /usr/share/perl or /usr/lib/perl for the Bio directory, but >>> deleting that directory will also delete Bio::Graphics, which isn't >>> part of bioperl anymore. >>> >>> Using PERL5LIB will put your specified bioperl at the front of the >>> list for when things get used, so even if you have bioperl installed >>> elsewhere, that one should get used. >>> >>> Scott >>> >>> >>> On Wed, Feb 3, 2010 at 10:32 AM, Lacey Sanderson >>> wrote: >>> >>>> Hi, >>>> >>>> I just checked out bioperl-live from svn as follows >>>> $ sudo svn co svn://code.open-bio.org/bioperl/bioperl-live/trunk >>>> bioperl-live >>>> $ echo 'export >>>> PERL5LIB="/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB" >>>> #adds path of bioperl-live to PERL5LIB'>> ~/.profile >>>> >>>> I then restarted by shell and checked the version using >>>> $ perl -MBio::Perl -le 'print Bio::Perl->VERSION;' >>>> >>>> Which returned 1.006... I am sure this is the version of bioperl I had >>>> previously installed and not the new one I just checked out. Is there >>>> something else I need to do to tell my system to disregard the old bioperl >>>> and instead use bioperl-live? >>>> >>>> I had installed BioPerl earlier using the GMOD GBrowse netinstaller although >>>> I can't seem to locate where it was installed. Will this effect anything? If >>>> so, how would I go about uninstalling it (I'm a little new to Linux)... >>>> >>>> By the way I am running Ubuntu Intrepid >>>> >>>> Thanks! >>>> Lacey >>>> >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>> >>>> >>> >>> >>> -- >>> ------------------------------------------------------------------------ >>> Scott Cain, Ph. D. scott at scottcain dot net >>> GMOD Coordinator (http://gmod.org/) 216-392-3087 >>> Ontario Institute for Cancer Research >>> >> --------------------------------- >> Lacey Sanderson >> Bioinformaticist >> Pulse Crop Breeding and Genetics >> Department of Plant Sciences >> University of Saskatchewan. >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > . > > From maj at fortinbras.us Wed Feb 3 13:56:26 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Wed, 3 Feb 2010 13:56:26 -0500 Subject: [Bioperl-l] Fw: After Checkout of Bioperl-live version doesn't seem tochange Message-ID: <9EAAAFBA93614EFCB4BCB7FF5F2FC732@NewLife> up to list-- MAJ ----- Original Message ----- From: "Mark A. Jensen" To: "Lacey Sanderson" Sent: Wednesday, February 03, 2010 1:34 PM Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange > Well, that's progress. It hard to debug these things remotely, but now > you can at least work around the problem by using the -I flag. The next > weak link I would try to eliminate is the append to the .profile file. Just > do directly from the command line (don't relogin) > > $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB > $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' > > If this gives 1.0069, then the problem is in that append (and, you can also do > your work!). > (If you're a sudoer, I'd just edit that file directly.) > > cheers MAJ > ----- Original Message ----- > From: "Lacey Sanderson" > To: "Mark A. Jensen" > Cc: > Sent: Wednesday, February 03, 2010 1:21 PM > Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem > tochange > > >> That gives me the 1.069 that I was wanting so the problem seems to be that >> perl is finding the wrong installation by default... >> >> Lacey >> >> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >> >>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- live -le >>> 'print Bio::Perl->VERSION;' >> >> >> > From zhabizg at yahoo.com Wed Feb 3 14:05:50 2010 From: zhabizg at yahoo.com (zhabiz golkar) Date: Wed, 3 Feb 2010 11:05:50 -0800 (PST) Subject: [Bioperl-l] query Message-ID: <634371.13033.qm@web34205.mail.mud.yahoo.com> Dear Sir/Madam We have done the sequencing of about 50 HIV +ve samples. We have used HIV BLAST database. And identified the subtypes,.. we have used 2 pair primers( reverse and forward) means we have 4 seq. for each sample. NOW 1-????? We are looking for gag, reverse transcriptase and protease(POL) regions of each sample and 2-????? We want to do comparison of these regions which each other. Please note , when we use database, they do the comparison with the reference genome (HIV-1). But we need to compare theses regions among the isolates. And clustalW and X also is not highlighting the specific region . I would be great full if you can give suggestion or send the link or software name that we can do application . ? Looking forward to hear you . and thanking you in anticipation Regards Dr.Zhabiz From cjfields at illinois.edu Wed Feb 3 14:21:02 2010 From: cjfields at illinois.edu (Chris Fields) Date: Wed, 3 Feb 2010 13:21:02 -0600 Subject: [Bioperl-l] Fw: After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <9EAAAFBA93614EFCB4BCB7FF5F2FC732@NewLife> References: <9EAAAFBA93614EFCB4BCB7FF5F2FC732@NewLife> Message-ID: <30452F69-00B3-4E72-9E70-D4B12948E65E@illinois.edu> Yes, though I'm assuming by .profile you mean the user-specific one ($HOME/.profile) and not the system/global one. The local one should suffice. chris On Feb 3, 2010, at 12:56 PM, Mark A. Jensen wrote: > up to list-- MAJ > ----- Original Message ----- From: "Mark A. Jensen" > To: "Lacey Sanderson" > Sent: Wednesday, February 03, 2010 1:34 PM > Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange > > >> Well, that's progress. It hard to debug these things remotely, but now >> you can at least work around the problem by using the -I flag. The next >> weak link I would try to eliminate is the append to the .profile file. Just >> do directly from the command line (don't relogin) >> >> $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB >> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >> >> If this gives 1.0069, then the problem is in that append (and, you can also do your work!). >> (If you're a sudoer, I'd just edit that file directly.) >> >> cheers MAJ >> ----- Original Message ----- From: "Lacey Sanderson" >> To: "Mark A. Jensen" >> Cc: >> Sent: Wednesday, February 03, 2010 1:21 PM >> Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange >> >> >>> That gives me the 1.069 that I was wanting so the problem seems to be that perl is finding the wrong installation by default... >>> >>> Lacey >>> >>> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >>> >>>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- live -le 'print Bio::Perl->VERSION;' >>> >>> >>> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From cjfields at illinois.edu Wed Feb 3 14:22:31 2010 From: cjfields at illinois.edu (Chris Fields) Date: Wed, 3 Feb 2010 13:22:31 -0600 Subject: [Bioperl-l] Fw: After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <9EAAAFBA93614EFCB4BCB7FF5F2FC732@NewLife> References: <9EAAAFBA93614EFCB4BCB7FF5F2FC732@NewLife> Message-ID: <924CFC00-B1CF-401E-B5A8-815E2E8EB1A1@illinois.edu> Yes, though I'm assuming by .profile you mean the user-specific one ($HOME/.profile) and not the system/global one. The local one should suffice. chris On Feb 3, 2010, at 12:56 PM, Mark A. Jensen wrote: > up to list-- MAJ > ----- Original Message ----- From: "Mark A. Jensen" > To: "Lacey Sanderson" > Sent: Wednesday, February 03, 2010 1:34 PM > Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange > > >> Well, that's progress. It hard to debug these things remotely, but now >> you can at least work around the problem by using the -I flag. The next >> weak link I would try to eliminate is the append to the .profile file. Just >> do directly from the command line (don't relogin) >> >> $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB >> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >> >> If this gives 1.0069, then the problem is in that append (and, you can also do your work!). >> (If you're a sudoer, I'd just edit that file directly.) >> >> cheers MAJ >> ----- Original Message ----- From: "Lacey Sanderson" >> To: "Mark A. Jensen" >> Cc: >> Sent: Wednesday, February 03, 2010 1:21 PM >> Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange >> >> >>> That gives me the 1.069 that I was wanting so the problem seems to be that perl is finding the wrong installation by default... >>> >>> Lacey >>> >>> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >>> >>>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- live -le 'print Bio::Perl->VERSION;' >>> >>> >>> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From laceyanne_sanderson at shaw.ca Wed Feb 3 14:38:09 2010 From: laceyanne_sanderson at shaw.ca (Lacey Sanderson) Date: Wed, 03 Feb 2010 13:38:09 -0600 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <0A73EA011D85433787444922A42E979C@NewLife> References: <0A23A254827D40A997BF65EE356183E6@NewLife> <0A73EA011D85433787444922A42E979C@NewLife> Message-ID: <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> Okay, > directly from the command line (don't relogin) > > $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live: > $PERL5LIB > $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' That gives me the correct version... For that user until I log out... I sudoed in to .profile and added the line directly but that doesn't seem to have helped. How would I tell perl to look in the same place for Bioperl for ALL users? Lacey On 3-Feb-10, at 12:34 PM, Mark A. Jensen wrote: > Well, that's progress. It hard to debug these things remotely, but now > you can at least work around the problem by using the -I flag. The > next > weak link I would try to eliminate is the append to the .profile > file. Just > do directly from the command line (don't relogin) > > $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live: > $PERL5LIB > $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' > > If this gives 1.0069, then the problem is in that append (and, you > can also do your work!). > (If you're a sudoer, I'd just edit that file directly.) > > cheers MAJ > ----- Original Message ----- From: "Lacey Sanderson" > > To: "Mark A. Jensen" > Cc: > Sent: Wednesday, February 03, 2010 1:21 PM > Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version > doesn't seem tochange > > >> That gives me the 1.069 that I was wanting so the problem seems to >> be that perl is finding the wrong installation by default... >> >> Lacey >> >> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >> >>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- >>> live -le 'print Bio::Perl->VERSION;' >> >> > --------------------------------- Lacey Sanderson Bioinformaticist Pulse Crop Breeding and Genetics Department of Plant Sciences University of Saskatchewan. From scott at scottcain.net Wed Feb 3 14:49:21 2010 From: scott at scottcain.net (Scott Cain) Date: Wed, 3 Feb 2010 14:49:21 -0500 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> References: <0A23A254827D40A997BF65EE356183E6@NewLife> <0A73EA011D85433787444922A42E979C@NewLife> <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> Message-ID: <4536f7701002031149j3abd0beavfdd8d67e9219b7e2@mail.gmail.com> Hi Lacey, In order for it to be available to all users, I would suggest you go through the "normal" installation procedure: perl Build.PL ./Build ./Build test sudo ./Build install --uninst 1 Scott On Wed, Feb 3, 2010 at 2:38 PM, Lacey Sanderson wrote: > Okay, >> >> directly from the command line (don't relogin) >> >> $ export >> PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB >> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' > > That gives me the correct version... For that user until I log out... > > I sudoed in to .profile and added the line directly but that doesn't seem to > have helped. > > How would I tell perl to look in the same place for Bioperl for ALL users? > > Lacey > > On 3-Feb-10, at 12:34 PM, Mark A. Jensen wrote: > >> Well, that's progress. It hard to debug these things remotely, but now >> you can at least work around the problem by using the -I flag. The next >> weak link I would try to eliminate is the append to the .profile file. >> Just >> do directly from the command line (don't relogin) >> >> $ export >> PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB >> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >> >> If this gives 1.0069, then the problem is in that append (and, you can >> also do your work!). >> (If you're a sudoer, I'd just edit that file directly.) >> >> cheers MAJ >> ----- Original Message ----- From: "Lacey Sanderson" >> >> To: "Mark A. Jensen" >> Cc: >> Sent: Wednesday, February 03, 2010 1:21 PM >> Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't >> seem tochange >> >> >>> That gives me the 1.069 that I was wanting so the problem seems to be >>> ?that perl is finding the wrong installation by default... >>> >>> Lacey >>> >>> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >>> >>>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- live >>>> -le 'print Bio::Perl->VERSION;' >>> >>> >> > > --------------------------------- > Lacey Sanderson > Bioinformaticist > Pulse Crop Breeding and Genetics > Department of Plant Sciences > University of Saskatchewan. > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From maj at fortinbras.us Wed Feb 3 14:55:41 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Wed, 3 Feb 2010 14:55:41 -0500 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <4536f7701002031149j3abd0beavfdd8d67e9219b7e2@mail.gmail.com> References: <0A23A254827D40A997BF65EE356183E6@NewLife> <0A73EA011D85433787444922A42E979C@NewLife> <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> <4536f7701002031149j3abd0beavfdd8d67e9219b7e2@mail.gmail.com> Message-ID: <5063EF67524746ECA04C060F66E50486@NewLife> I'll jump in to +1 on this; the Build will make sure you have all the necessary prequisites, and install those automatically (I *think* it still does this....). It will also do comprehensive tests. These will save MANY headaches down the road. [I speak as someone who didn't build in the beginning, until one day Dave Messina blithely said to me "why don't you just run ./Build test ?". Three days later, I was back up...] ----- Original Message ----- From: "Scott Cain" To: "Lacey Sanderson" Cc: "Mark A. Jensen" ; Sent: Wednesday, February 03, 2010 2:49 PM Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange Hi Lacey, In order for it to be available to all users, I would suggest you go through the "normal" installation procedure: perl Build.PL ./Build ./Build test sudo ./Build install --uninst 1 Scott On Wed, Feb 3, 2010 at 2:38 PM, Lacey Sanderson wrote: > Okay, >> >> directly from the command line (don't relogin) >> >> $ export >> PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB >> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' > > That gives me the correct version... For that user until I log out... > > I sudoed in to .profile and added the line directly but that doesn't seem to > have helped. > > How would I tell perl to look in the same place for Bioperl for ALL users? > > Lacey > > On 3-Feb-10, at 12:34 PM, Mark A. Jensen wrote: > >> Well, that's progress. It hard to debug these things remotely, but now >> you can at least work around the problem by using the -I flag. The next >> weak link I would try to eliminate is the append to the .profile file. >> Just >> do directly from the command line (don't relogin) >> >> $ export >> PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB >> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >> >> If this gives 1.0069, then the problem is in that append (and, you can >> also do your work!). >> (If you're a sudoer, I'd just edit that file directly.) >> >> cheers MAJ >> ----- Original Message ----- From: "Lacey Sanderson" >> >> To: "Mark A. Jensen" >> Cc: >> Sent: Wednesday, February 03, 2010 1:21 PM >> Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't >> seem tochange >> >> >>> That gives me the 1.069 that I was wanting so the problem seems to be >>> that perl is finding the wrong installation by default... >>> >>> Lacey >>> >>> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >>> >>>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- live >>>> -le 'print Bio::Perl->VERSION;' >>> >>> >> > > --------------------------------- > Lacey Sanderson > Bioinformaticist > Pulse Crop Breeding and Genetics > Department of Plant Sciences > University of Saskatchewan. > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From cjfields at illinois.edu Wed Feb 3 15:05:16 2010 From: cjfields at illinois.edu (Chris Fields) Date: Wed, 3 Feb 2010 14:05:16 -0600 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <5063EF67524746ECA04C060F66E50486@NewLife> References: <0A23A254827D40A997BF65EE356183E6@NewLife> <0A73EA011D85433787444922A42E979C@NewLife> <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> <4536f7701002031149j3abd0beavfdd8d67e9219b7e2@mail.gmail.com> <5063EF67524746ECA04C060F66E50486@NewLife> Message-ID: <85545C48-7DEB-4D92-B5F8-3781BEE5D8AA@illinois.edu> Also, remember if you are installing from svn you are installing from the bleeding edge code. IIRC the last svn co I made had several tests failing for various reasons; nothing that should affect your installation, but something to be aware of. chris On Feb 3, 2010, at 1:55 PM, Mark A. Jensen wrote: > I'll jump in to +1 on this; the Build will make sure you have all the necessary prequisites, and install those automatically (I *think* it still does this....). It will also do comprehensive tests. These will save MANY headaches down the road. > > [I speak as someone who didn't build in the beginning, until one day Dave Messina > blithely said to me "why don't you just run ./Build test ?". Three days later, I was back up...] > > ----- Original Message ----- From: "Scott Cain" > To: "Lacey Sanderson" > Cc: "Mark A. Jensen" ; > Sent: Wednesday, February 03, 2010 2:49 PM > Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange > > > Hi Lacey, > > In order for it to be available to all users, I would suggest you go > through the "normal" installation procedure: > > perl Build.PL > ./Build > ./Build test > sudo ./Build install --uninst 1 > > Scott > > > On Wed, Feb 3, 2010 at 2:38 PM, Lacey Sanderson > wrote: >> Okay, >>> >>> directly from the command line (don't relogin) >>> >>> $ export >>> PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB >>> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >> >> That gives me the correct version... For that user until I log out... >> >> I sudoed in to .profile and added the line directly but that doesn't seem to >> have helped. >> >> How would I tell perl to look in the same place for Bioperl for ALL users? >> >> Lacey >> >> On 3-Feb-10, at 12:34 PM, Mark A. Jensen wrote: >> >>> Well, that's progress. It hard to debug these things remotely, but now >>> you can at least work around the problem by using the -I flag. The next >>> weak link I would try to eliminate is the append to the .profile file. >>> Just >>> do directly from the command line (don't relogin) >>> >>> $ export >>> PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live:$PERL5LIB >>> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >>> >>> If this gives 1.0069, then the problem is in that append (and, you can >>> also do your work!). >>> (If you're a sudoer, I'd just edit that file directly.) >>> >>> cheers MAJ >>> ----- Original Message ----- From: "Lacey Sanderson" >>> >>> To: "Mark A. Jensen" >>> Cc: >>> Sent: Wednesday, February 03, 2010 1:21 PM >>> Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't >>> seem tochange >>> >>> >>>> That gives me the 1.069 that I was wanting so the problem seems to be >>>> that perl is finding the wrong installation by default... >>>> >>>> Lacey >>>> >>>> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >>>> >>>>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- live >>>>> -le 'print Bio::Perl->VERSION;' >>>> >>>> >>> >> >> --------------------------------- >> Lacey Sanderson >> Bioinformaticist >> Pulse Crop Breeding and Genetics >> Department of Plant Sciences >> University of Saskatchewan. >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > > > > -- > ------------------------------------------------------------------------ > Scott Cain, Ph. D. scott at scottcain dot net > GMOD Coordinator (http://gmod.org/) 216-392-3087 > Ontario Institute for Cancer Research > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From maj at fortinbras.us Wed Feb 3 14:47:24 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Wed, 3 Feb 2010 14:47:24 -0500 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> References: <0A23A254827D40A997BF65EE356183E6@NewLife><0A73EA011D85433787444922A42E979C@NewLife> <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> Message-ID: <3934A7E8419D47178E80C9844EDC4B7E@NewLife> You probably want to change the file /etc/profile (no .) --MAJ ----- Original Message ----- From: "Lacey Sanderson" To: "Mark A. Jensen" Cc: Sent: Wednesday, February 03, 2010 2:38 PM Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange > Okay, >> directly from the command line (don't relogin) >> >> $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live: >> $PERL5LIB >> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' > That gives me the correct version... For that user until I log out... > > I sudoed in to .profile and added the line directly but that doesn't seem to > have helped. > > How would I tell perl to look in the same place for Bioperl for ALL users? > > Lacey > > On 3-Feb-10, at 12:34 PM, Mark A. Jensen wrote: > >> Well, that's progress. It hard to debug these things remotely, but now >> you can at least work around the problem by using the -I flag. The next >> weak link I would try to eliminate is the append to the .profile file. Just >> do directly from the command line (don't relogin) >> >> $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live: >> $PERL5LIB >> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >> >> If this gives 1.0069, then the problem is in that append (and, you can also >> do your work!). >> (If you're a sudoer, I'd just edit that file directly.) >> >> cheers MAJ >> ----- Original Message ----- From: "Lacey Sanderson" >> > > >> To: "Mark A. Jensen" >> Cc: >> Sent: Wednesday, February 03, 2010 1:21 PM >> Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem >> tochange >> >> >>> That gives me the 1.069 that I was wanting so the problem seems to be that >>> perl is finding the wrong installation by default... >>> >>> Lacey >>> >>> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >>> >>>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/bioperl- live -le >>>> 'print Bio::Perl->VERSION;' >>> >>> >> > > --------------------------------- > Lacey Sanderson > Bioinformaticist > Pulse Crop Breeding and Genetics > Department of Plant Sciences > University of Saskatchewan. > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From laceyanne_sanderson at shaw.ca Wed Feb 3 15:49:22 2010 From: laceyanne_sanderson at shaw.ca (Lacey Sanderson) Date: Wed, 03 Feb 2010 14:49:22 -0600 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <4536f7701002031149j3abd0beavfdd8d67e9219b7e2@mail.gmail.com> References: <0A23A254827D40A997BF65EE356183E6@NewLife> <0A73EA011D85433787444922A42E979C@NewLife> <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> <4536f7701002031149j3abd0beavfdd8d67e9219b7e2@mail.gmail.com> Message-ID: <6278A3EC-95AC-4CE2-AE77-593D97BEA871@shaw.ca> Okay, running Build from the base of the bioperl-live directory Worked. Version is now 1.0069 !! For all users, even after re-logging in... Thanks! Lacey On 3-Feb-10, at 1:49 PM, Scott Cain wrote: > Hi Lacey, > > In order for it to be available to all users, I would suggest you go > through the "normal" installation procedure: > > perl Build.PL > ./Build > ./Build test > sudo ./Build install --uninst 1 > > Scott > > > On Wed, Feb 3, 2010 at 2:38 PM, Lacey Sanderson > wrote: >> Okay, >>> >>> directly from the command line (don't relogin) >>> >>> $ export >>> PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live: >>> $PERL5LIB >>> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >> >> That gives me the correct version... For that user until I log out... >> >> I sudoed in to .profile and added the line directly but that >> doesn't seem to >> have helped. >> >> How would I tell perl to look in the same place for Bioperl for ALL >> users? >> >> Lacey >> >> On 3-Feb-10, at 12:34 PM, Mark A. Jensen wrote: >> >>> Well, that's progress. It hard to debug these things remotely, but >>> now >>> you can at least work around the problem by using the -I flag. The >>> next >>> weak link I would try to eliminate is the append to the .profile >>> file. >>> Just >>> do directly from the command line (don't relogin) >>> >>> $ export >>> PERL5LIB=/usr/local/share/applications/Bioperl/bioperl-live: >>> $PERL5LIB >>> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >>> >>> If this gives 1.0069, then the problem is in that append (and, you >>> can >>> also do your work!). >>> (If you're a sudoer, I'd just edit that file directly.) >>> >>> cheers MAJ >>> ----- Original Message ----- From: "Lacey Sanderson" >>> >>> To: "Mark A. Jensen" >>> Cc: >>> Sent: Wednesday, February 03, 2010 1:21 PM >>> Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version >>> doesn't >>> seem tochange >>> >>> >>>> That gives me the 1.069 that I was wanting so the problem seems >>>> to be >>>> that perl is finding the wrong installation by default... >>>> >>>> Lacey >>>> >>>> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >>>> >>>>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/ >>>>> bioperl- live >>>>> -le 'print Bio::Perl->VERSION;' >>>> >>>> >>> >> >> --------------------------------- >> Lacey Sanderson >> Bioinformaticist >> Pulse Crop Breeding and Genetics >> Department of Plant Sciences >> University of Saskatchewan. >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > > > > -- > ------------------------------------------------------------------------ > Scott Cain, Ph. D. scott at > scottcain dot net > GMOD Coordinator (http://gmod.org/) 216-392-3087 > Ontario Institute for Cancer Research --------------------------------- Lacey Sanderson Bioinformaticist Pulse Crop Breeding and Genetics Department of Plant Sciences University of Saskatchewan. From jason at bioperl.org Wed Feb 3 15:52:04 2010 From: jason at bioperl.org (Jason Stajich) Date: Wed, 3 Feb 2010 12:52:04 -0800 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem tochange In-Reply-To: <3934A7E8419D47178E80C9844EDC4B7E@NewLife> References: <0A23A254827D40A997BF65EE356183E6@NewLife><0A73EA011D85433787444922A42E979C@NewLife> <2163AB4F-0CD3-44C8-AA04-6DAD0E67F549@shaw.ca> <3934A7E8419D47178E80C9844EDC4B7E@NewLife> Message-ID: <34A48A92-6ECF-4401-8D92-F15EC91FA040@bioperl.org> Or add a file on centos/rh to set global variables in the more modular way than editing /etc/profile $ cat /etc/profile.d/perl.sh LIBS=/usr/local/pkg/bioperl/core if [ -z "${PERL5LIB}" ] then PERL5LIB=$LIBS; export PERL5LIB else PERL5LIB=$LIBS:$PERL5LIB; export PERL5LIB fi -jason On Feb 3, 2010, at 11:47 AM, Mark A. Jensen wrote: > You probably want to change the file > /etc/profile > (no .) --MAJ > ----- Original Message ----- From: "Lacey Sanderson" > > To: "Mark A. Jensen" > Cc: > Sent: Wednesday, February 03, 2010 2:38 PM > Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version > doesn't seem tochange > > >> Okay, >>> directly from the command line (don't relogin) >>> >>> $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl- >>> live: $PERL5LIB >>> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >> That gives me the correct version... For that user until I log out... >> >> I sudoed in to .profile and added the line directly but that >> doesn't seem to have helped. >> >> How would I tell perl to look in the same place for Bioperl for >> ALL users? >> >> Lacey >> >> On 3-Feb-10, at 12:34 PM, Mark A. Jensen wrote: >> >>> Well, that's progress. It hard to debug these things remotely, but >>> now >>> you can at least work around the problem by using the -I flag. >>> The next >>> weak link I would try to eliminate is the append to the .profile >>> file. Just >>> do directly from the command line (don't relogin) >>> >>> $ export PERL5LIB=/usr/local/share/applications/Bioperl/bioperl- >>> live: $PERL5LIB >>> $ perl -MBio::Perl -e 'print Bio::Perl->VERSION' >>> >>> If this gives 1.0069, then the problem is in that append (and, >>> you can also do your work!). >>> (If you're a sudoer, I'd just edit that file directly.) >>> >>> cheers MAJ >>> ----- Original Message ----- From: "Lacey Sanderson" >> > >>> To: "Mark A. Jensen" >>> Cc: >>> Sent: Wednesday, February 03, 2010 1:21 PM >>> Subject: Re: [Bioperl-l] After Checkout of Bioperl-live version >>> doesn't seem tochange >>> >>> >>>> That gives me the 1.069 that I was wanting so the problem seems >>>> to be that perl is finding the wrong installation by default... >>>> >>>> Lacey >>>> >>>> On 3-Feb-10, at 12:14 PM, Mark A. Jensen wrote: >>>> >>>>> perl -MBio::Perl -I/usr/local/share/applications/Bioperl/ >>>>> bioperl- live -le 'print Bio::Perl->VERSION;' >>>> >>>> >>> >> >> --------------------------------- >> Lacey Sanderson >> Bioinformaticist >> Pulse Crop Breeding and Genetics >> Department of Plant Sciences >> University of Saskatchewan. >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l -- Jason Stajich jason.stajich at gmail.com jason at bioperl.org http://fungalgenomes.org/ http://twitter.com/hyphaltip From dan.kortschak at adelaide.edu.au Wed Feb 3 17:30:50 2010 From: dan.kortschak at adelaide.edu.au (Dan Kortschak) Date: Thu, 04 Feb 2010 09:00:50 +1030 Subject: [Bioperl-l] After Checkout of Bioperl-live version doesn't seem to change In-Reply-To: References: Message-ID: <1265236250.5663.30.camel@zoidberg.mbs.adelaide.edu.au> Hi Chris, I too have had strangeness with version reporting - not with the svn checkouts, but with the CPAN package which I use on my production machine (v 1.6.1 installed). cpanp 'o' reports the version of bioperl (actually 'Bio::Root::RootI') installed as 'Unparsable', with the available upgrade as 1.006001 as you would expect, 'l' gives the following: CPAN Terminal> l Bio::Root::RootI Details for 'Bio::Root::RootI' Author Christopher Fields (cjfields at bioperl.org) Description None given Development Stage Unknown Installed File /usr/local/share/perl/5.10.0/Bio/Root/RootI.pm Interface Style Unknown Language Used Unknown Package BioPerl-1.6.1.tar.gz Public License Unknown Support Level Unknown Version Installed 0.0 Version on CPAN 1.006001 etc The head line of B:R:RI: # $Id: RootI.pm 16090 2009-09-15 21:57:56Z cjfields $ and the cpanp install logs show 1.6.1 is actually installed. This has never really worried me, but since version reporting was raised I thought I'd bring it up. Also curious about why BioPerl.pm isn't installed/manified on bioperl install as I wonder around the distro. cheers Dan From irisbuiton at gmail.com Thu Feb 4 11:06:25 2010 From: irisbuiton at gmail.com (Iris Buiton) Date: Thu, 4 Feb 2010 08:06:25 -0800 (PST) Subject: [Bioperl-l] clone ID to Accession ID genbank Message-ID: <62377232-beb8-4aa3-aa4a-10f6d926a8ee@m35g2000prh.googlegroups.com> Hi members of Bioperl, I am trying to run some code that given a clone ID (for instance, CH255-149A8) get the accesion ID (AC238489.2). The problem is that using this ID, it works perfectly, but using this other ID (CH251-558O8) I do not get anything (not $seq object is obtained). Checking to Genbank, this clone exist, there is seqeunce, and the accession ID should be AC147515.1 This is the code: my $name_file="CH251-558O8"; my $query = Bio::DB::Query::GenBank->new (-query => $name_file, -reldate => '30', -db => 'nucleotide'); #eval 'my $seqio = $gb->get_Stream_by_query($query)'; print $query->count,"\n"; my $gb = Bio::DB::GenBank->new(); my $seqio = $gb->get_Stream_by_query($query); print "Running $name_file.....\n"; while( my $seq = $seqio->next_seq ) { if ($seq->accession_number) { print "$name_file = ", $seq->accession_number," = ", $seq- >length,"\n" ; }else { print "$name_file = #N/A \n" ; } } Any ideas what am I doing wrong? Thanks! Iris From MEC at stowers.org Thu Feb 4 12:15:12 2010 From: MEC at stowers.org (Cook, Malcolm) Date: Thu, 4 Feb 2010 11:15:12 -0600 Subject: [Bioperl-l] clone ID to Accession ID genbank In-Reply-To: <62377232-beb8-4aa3-aa4a-10f6d926a8ee@m35g2000prh.googlegroups.com> References: <62377232-beb8-4aa3-aa4a-10f6d926a8ee@m35g2000prh.googlegroups.com> Message-ID: remove the -reldate Malcolm Cook Stowers Institute for Medical Research - Bioinformatics Kansas City, Missouri USA > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org > [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of Iris Buiton > Sent: Thursday, February 04, 2010 10:06 AM > To: bioperl-l at bioperl.org > Subject: [Bioperl-l] clone ID to Accession ID genbank > > Hi members of Bioperl, > > I am trying to run some code that given a clone ID (for instance, > CH255-149A8) get the accesion ID (AC238489.2). > > The problem is that using this ID, it works perfectly, but > using this other ID (CH251-558O8) I do not get anything (not > $seq object is obtained). Checking to Genbank, this clone > exist, there is seqeunce, and the accession ID should be AC147515.1 > > This is the code: > > my $name_file="CH251-558O8"; > > my $query = Bio::DB::Query::GenBank->new > (-query => $name_file, > -reldate => '30', > -db => 'nucleotide'); > #eval 'my $seqio = $gb->get_Stream_by_query($query)'; > print $query->count,"\n"; > > > my $gb = Bio::DB::GenBank->new(); > > my $seqio = $gb->get_Stream_by_query($query); > > print "Running $name_file.....\n"; > > > while( my $seq = $seqio->next_seq ) { > if ($seq->accession_number) > { > print "$name_file = ", $seq->accession_number," = ", $seq- > >length,"\n" ; > }else > { > print "$name_file = #N/A \n" ; > } > } > > Any ideas what am I doing wrong? > > Thanks! > > Iris > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > From Irisbuiton at gmail.com Thu Feb 4 12:26:57 2010 From: Irisbuiton at gmail.com (Irisbuiton at gmail.com) Date: Thu, 04 Feb 2010 17:26:57 +0000 Subject: [Bioperl-l] clone ID to Accession ID genbank In-Reply-To: Message-ID: <00504502e11453f33b047ec9a4b5@google.com> Many thanks! Such a stupid mistake... IRis. On Feb 4, 2010 9:15am, "Cook, Malcolm" wrote: > remove the -reldate > Malcolm Cook > Stowers Institute for Medical Research - Bioinformatics > Kansas City, Missouri USA > > -----Original Message----- > > From: bioperl-l-bounces at lists.open-bio.org > > [mailto:bioperl-l-bounces at lists.open-bio.org] On Behalf Of Iris Buiton > > Sent: Thursday, February 04, 2010 10:06 AM > > To: bioperl-l at bioperl.org > > Subject: [Bioperl-l] clone ID to Accession ID genbank > > > > Hi members of Bioperl, > > > > I am trying to run some code that given a clone ID (for instance, > > CH255-149A8) get the accesion ID (AC238489.2). > > > > The problem is that using this ID, it works perfectly, but > > using this other ID (CH251-558O8) I do not get anything (not > > $seq object is obtained). Checking to Genbank, this clone > > exist, there is seqeunce, and the accession ID should be AC147515.1 > > > > This is the code: > > > > my $name_file="CH251-558O8"; > > > > my $query = Bio::DB::Query::GenBank->new > > (-query => $name_file, > > -reldate => '30', > > -db => 'nucleotide'); > > #eval 'my $seqio = $gb->get_Stream_by_query($query)'; > > print $query->count,"\n"; > > > > > > my $gb = Bio::DB::GenBank->new(); > > > > my $seqio = $gb->get_Stream_by_query($query); > > > > print "Running $name_file.....\n"; > > > > > > while( my $seq = $seqio->next_seq ) { > > if ($seq->accession_number) > > { > > print "$name_file = ", $seq->accession_number," = ", $seq- > > >length,"\n" ; > > }else > > { > > print "$name_file = #N/A \n" ; > > } > > } > > > > Any ideas what am I doing wrong? > > > > Thanks! > > > > Iris > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From sharmashalu.bio at gmail.com Thu Feb 4 17:01:35 2010 From: sharmashalu.bio at gmail.com (shalu sharma) Date: Thu, 4 Feb 2010 17:01:35 -0500 Subject: [Bioperl-l] Refseq Version Message-ID: <465b5a661002041401u3b8ac5fdp483f7032ce35139b@mail.gmail.com> Hi All, This is not a bioperl query. Is there any way to check refseq version (release). Actually i am using some server to blast my sequences (blastall) against refseq. Is there any way i can get the version information on the refseq database (from the blast file or directly from the database)? Thanks Shalu From Russell.Smithies at agresearch.co.nz Thu Feb 4 18:39:55 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Fri, 5 Feb 2010 12:39:55 +1300 Subject: [Bioperl-l] Refseq Version In-Reply-To: <465b5a661002041401u3b8ac5fdp483f7032ce35139b@mail.gmail.com> References: <465b5a661002041401u3b8ac5fdp483f7032ce35139b@mail.gmail.com> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61BED520@exchsth.agresearch.co.nz> If you have access to the blast database, use fastacmd -I -d databasename Otherwise, it's usually at the bottom of your blast result. --Russell > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > bounces at lists.open-bio.org] On Behalf Of shalu sharma > Sent: Friday, 5 February 2010 11:02 a.m. > To: bioperl-l at lists.open-bio.org > Subject: [Bioperl-l] Refseq Version > > Hi All, > This is not a bioperl query. > Is there any way to check refseq version (release). Actually i am using > some > server to blast my sequences (blastall) against refseq. Is there any way i > can get the version information on the refseq database (from the blast > file > or directly from the database)? > > Thanks > Shalu > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l ======================================================================= Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately. ======================================================================= From pisceanpranav at gmail.com Thu Feb 4 21:17:05 2010 From: pisceanpranav at gmail.com (pranav karkhanis) Date: Fri, 5 Feb 2010 07:47:05 +0530 Subject: [Bioperl-l] (no subject) Message-ID: <3122b9be1002041817v5f8704afwca44f282d35f4140@mail.gmail.com> Where can I find the examples or source code for running BLAST with a input SEQUENCE using RemoteBlast.pm? From maj at fortinbras.us Thu Feb 4 21:20:10 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Thu, 4 Feb 2010 21:20:10 -0500 Subject: [Bioperl-l] (no subject) In-Reply-To: <3122b9be1002041817v5f8704afwca44f282d35f4140@mail.gmail.com> References: <3122b9be1002041817v5f8704afwca44f282d35f4140@mail.gmail.com> Message-ID: <911C931932FA4836936C2E0AD23CE284@NewLife> http://bioperl.org/wiki/HOWTOs ----- Original Message ----- From: "pranav karkhanis" To: Cc: "Rahul parakkat" Sent: Thursday, February 04, 2010 9:17 PM Subject: [Bioperl-l] (no subject) > Where can I find the examples or source code for running BLAST with a > input SEQUENCE > using RemoteBlast.pm? > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From fungazid at yahoo.com Thu Feb 4 21:52:53 2010 From: fungazid at yahoo.com (Fungazid) Date: Thu, 4 Feb 2010 18:52:53 -0800 (PST) Subject: [Bioperl-l] Too many links at LoadHelper.pm Message-ID: <34328.58710.qm@web65506.mail.ac4.yahoo.com> Bioperl hi, I have multiple gff3 files that look like: -------------------------------------------------------- ##gff-version 3 # file: temp1.gff3 contig453278 hs contig 1 699 . + . Name=contig453278 contig453278 hs alignment 1 194 . + . Name=hs.ENST00000258455 contig453278 hs alignment 252 699 . + . Name=hs.ENST00000258455 contig453278 hs mi2 194 252 . + . Name=mi2 contig453278 hs transcript 104 183 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 contig453278 hs transcript 1 9 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 contig453278 hs transcript 73 103 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 contig453278 hs transcript 10 72 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 --------------------------------------------------------- I uploaded gff3 files to mysql and view the tracks in gBrowse like this: -------------------------------------------- Bio::DB::SeqFeature::Store->new(...) $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) for($i=0; $i<$#files+1; $i++){ $loader->load($file[$i]); } -------------------------------------------- Eventually after uploading ~15000 small gff3 files I got the error: Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory /tmp/9bbpdE5ibf: Too many links at /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 I am trying to see what went wrong, maybe you can rescue me, Avi From jason at bioperl.org Thu Feb 4 22:31:06 2010 From: jason at bioperl.org (Jason Stajich) Date: Thu, 4 Feb 2010 19:31:06 -0800 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <34328.58710.qm@web65506.mail.ac4.yahoo.com> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> Message-ID: why are you not using the bp_seqfeature_load script? On Feb 4, 2010, at 6:52 PM, Fungazid wrote: > Bioperl hi, > > I have multiple gff3 files that look like: > > -------------------------------------------------------- > ##gff-version 3 > # file: temp1.gff3 > contig453278 hs contig 1 699 . + . > Name=contig453278 > contig453278 hs alignment 1 194 . + . > Name=hs.ENST00000258455 > contig453278 hs alignment 252 699 . + . > Name=hs.ENST00000258455 > contig453278 hs mi2 194 252 . + . Name=mi2 > contig453278 hs transcript 104 183 . + . > Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 > contig453278 hs transcript 1 9 . + . > Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 > contig453278 hs transcript 73 103 . + . > Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 > contig453278 hs transcript 10 72 . + . > Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 > --------------------------------------------------------- > > > > I uploaded gff3 files to mysql and view the tracks in gBrowse like > this: > > -------------------------------------------- > Bio::DB::SeqFeature::Store->new(...) > $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) > for($i=0; $i<$#files+1; $i++){ > $loader->load($file[$i]); > } > -------------------------------------------- > > Eventually after uploading ~15000 small gff3 files I got the error: > > Error in tempdir() using /tmp/XXXXXXXXXX: Could not create > directory /tmp/9bbpdE5ibf: Too many links at /usr/local/share/perl/ > 5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 > > > > I am trying to see what went wrong, maybe you can rescue me, > Avi > > > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l -- Jason Stajich jason.stajich at gmail.com jason at bioperl.org http://fungalgenomes.org/ http://twitter.com/hyphaltip From scott at scottcain.net Thu Feb 4 22:38:52 2010 From: scott at scottcain.net (Scott Cain) Date: Thu, 4 Feb 2010 22:38:52 -0500 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <34328.58710.qm@web65506.mail.ac4.yahoo.com> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> Message-ID: <4536f7701002041938m3ad2d6ddgf8fa56312d62c28a@mail.gmail.com> Hi Avi, Off hand, I have no idea what is happening. Is there a reason you aren't using bp_seqfeature_load.pl? Scott On Thu, Feb 4, 2010 at 9:52 PM, Fungazid wrote: > Bioperl hi, > > I have multiple gff3 files that look like: > > -------------------------------------------------------- > ##gff-version 3 > # file: temp1.gff3 > contig453278 ? ?hs ? ?contig ? ?1 ? ?699 ? ?. ? ?+ ? ?. ? ?Name=contig453278 > contig453278 ? ?hs ? ?alignment ? ?1 ? ?194 ? ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455 > contig453278 ? ?hs ? ?alignment ? ?252 ? ?699 ? ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455 > contig453278 ? ?hs ? ?mi2 ? ?194 ? ?252 ? ?. ? ?+ ? ?. ? ?Name=mi2 > contig453278 ? ?hs ? ?transcript ? ?104 ? ?183 ? ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 > contig453278 ? ?hs ? ?transcript ? ?1 ? ?9 ? ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 > contig453278 ? ?hs ? ?transcript ? ?73 ? ?103 ? ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 > contig453278 ? ?hs ? ?transcript ? ?10 ? ?72 ? ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 > --------------------------------------------------------- > > > > I uploaded gff3 files to mysql and view the tracks in gBrowse like this: > > -------------------------------------------- > Bio::DB::SeqFeature::Store->new(...) > $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) > for($i=0; $i<$#files+1; $i++){ > $loader->load($file[$i]); > } > -------------------------------------------- > > Eventually after uploading ~15000 small gff3 files I got the error: > > Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory /tmp/9bbpdE5ibf: Too many links at /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 > > > > I am trying to see what went wrong, maybe you can rescue me, > Avi > > > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From maj at fortinbras.us Fri Feb 5 07:27:46 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Fri, 5 Feb 2010 07:27:46 -0500 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <34328.58710.qm@web65506.mail.ac4.yahoo.com> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> Message-ID: The tempfiles are not being unlinked before there are too many for the OS to handle. Sounds like this may be your job, fungazid, if you're not employing the script (but not sure about that) ----- Original Message ----- From: "Fungazid" To: Sent: Thursday, February 04, 2010 9:52 PM Subject: [Bioperl-l] Too many links at LoadHelper.pm > Bioperl hi, > > I have multiple gff3 files that look like: > > -------------------------------------------------------- > ##gff-version 3 > # file: temp1.gff3 > contig453278 hs contig 1 699 . + . Name=contig453278 > contig453278 hs alignment 1 194 . + . > Name=hs.ENST00000258455 > contig453278 hs alignment 252 699 . + . > Name=hs.ENST00000258455 > contig453278 hs mi2 194 252 . + . Name=mi2 > contig453278 hs transcript 104 183 . + . > Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 > contig453278 hs transcript 1 9 . + . > Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 > contig453278 hs transcript 73 103 . + . > Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 > contig453278 hs transcript 10 72 . + . > Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 > --------------------------------------------------------- > > > > I uploaded gff3 files to mysql and view the tracks in gBrowse like this: > > -------------------------------------------- > Bio::DB::SeqFeature::Store->new(...) > $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) > for($i=0; $i<$#files+1; $i++){ > $loader->load($file[$i]); > } > -------------------------------------------- > > Eventually after uploading ~15000 small gff3 files I got the error: > > Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory > /tmp/9bbpdE5ibf: Too many links at > /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 > > > > I am trying to see what went wrong, maybe you can rescue me, > Avi > > > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From sharmashalu.bio at gmail.com Fri Feb 5 09:55:55 2010 From: sharmashalu.bio at gmail.com (shalu sharma) Date: Fri, 5 Feb 2010 09:55:55 -0500 Subject: [Bioperl-l] Refseq Version In-Reply-To: <18DF7D20DFEC044098A1062202F5FFF32C61BED520@exchsth.agresearch.co.nz> References: <465b5a661002041401u3b8ac5fdp483f7032ce35139b@mail.gmail.com> <18DF7D20DFEC044098A1062202F5FFF32C61BED520@exchsth.agresearch.co.nz> Message-ID: <465b5a661002050655u34ff94acwdf8fe72512dec4b3@mail.gmail.com> Hi Russell, Thanks for your response. I am getting the number of sequence in the database but not the release number (like 38, 39). This is what i did: $ fastacmd -I -d /db/ncbiblast/refseq/refseq_protein Database: NCBI Protein Reference Sequences 7,585,993 sequences; 2,644,770,521 total letters File names: /db/ncbiblast/refseq/refseq_protein.00 Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 36,805 res /db/ncbiblast/refseq/refseq_protein.01 Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 33,403 res /db/ncbiblast/refseq/refseq_protein.02 Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 15,830 res I am still confuse that how i can get the release number. I know refseq 39 was released on Jan 30, 2010 but i don't know how to confirm this. I also tried look refseq release file but was not able to get any thing. I would really appreciate if anyone can help me out with this. Thanks Shalu On Thu, Feb 4, 2010 at 6:39 PM, Smithies, Russell < Russell.Smithies at agresearch.co.nz> wrote: > If you have access to the blast database, use fastacmd -I -d databasename > Otherwise, it's usually at the bottom of your blast result. > > --Russell > > > -----Original Message----- > > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > > bounces at lists.open-bio.org] On Behalf Of shalu sharma > > Sent: Friday, 5 February 2010 11:02 a.m. > > To: bioperl-l at lists.open-bio.org > > Subject: [Bioperl-l] Refseq Version > > > > Hi All, > > This is not a bioperl query. > > Is there any way to check refseq version (release). Actually i am using > > some > > server to blast my sequences (blastall) against refseq. Is there any way > i > > can get the version information on the refseq database (from the blast > > file > > or directly from the database)? > > > > Thanks > > Shalu > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > ======================================================================= > Attention: The information contained in this message and/or attachments > from AgResearch Limited is intended only for the persons or entities > to which it is addressed and may contain confidential and/or privileged > material. Any review, retransmission, dissemination or other use of, or > taking of any action in reliance upon, this information by persons or > entities other than the intended recipients is prohibited by AgResearch > Limited. If you have received this message in error, please notify the > sender immediately. > ======================================================================= > From maj at fortinbras.us Fri Feb 5 10:03:49 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Fri, 5 Feb 2010 10:03:49 -0500 Subject: [Bioperl-l] Refseq Version In-Reply-To: <465b5a661002050655u34ff94acwdf8fe72512dec4b3@mail.gmail.com> References: <465b5a661002041401u3b8ac5fdp483f7032ce35139b@mail.gmail.com><18DF7D20DFEC044098A1062202F5FFF32C61BED520@exchsth.agresearch.co.nz> <465b5a661002050655u34ff94acwdf8fe72512dec4b3@mail.gmail.com> Message-ID: <296EA2B4017347E9B8700FBC37836445@NewLife> Hey guys-- thanks for this conversation, pls resist the temptation to go offlist with this. When Russell provides the info, we can possibly add a new accesor for the version info. thanks MAJ ----- Original Message ----- From: "shalu sharma" To: "Smithies, Russell" Cc: Sent: Friday, February 05, 2010 9:55 AM Subject: Re: [Bioperl-l] Refseq Version > Hi Russell, > Thanks for your response. > I am getting the number of sequence in the database but not the release > number (like 38, 39). > This is what i did: > > $ fastacmd -I -d /db/ncbiblast/refseq/refseq_protein > Database: NCBI Protein Reference Sequences > 7,585,993 sequences; 2,644,770,521 total letters > > File names: > /db/ncbiblast/refseq/refseq_protein.00 > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 36,805 res > /db/ncbiblast/refseq/refseq_protein.01 > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 33,403 res > /db/ncbiblast/refseq/refseq_protein.02 > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 15,830 res > > I am still confuse that how i can get the release number. I know refseq 39 > was released on Jan 30, 2010 but i don't know how to confirm this. I also > tried look refseq release file but was not able to get any thing. > > I would really appreciate if anyone can help me out with this. > > Thanks > Shalu > > > On Thu, Feb 4, 2010 at 6:39 PM, Smithies, Russell < > Russell.Smithies at agresearch.co.nz> wrote: > >> If you have access to the blast database, use fastacmd -I -d databasename >> Otherwise, it's usually at the bottom of your blast result. >> >> --Russell >> >> > -----Original Message----- >> > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- >> > bounces at lists.open-bio.org] On Behalf Of shalu sharma >> > Sent: Friday, 5 February 2010 11:02 a.m. >> > To: bioperl-l at lists.open-bio.org >> > Subject: [Bioperl-l] Refseq Version >> > >> > Hi All, >> > This is not a bioperl query. >> > Is there any way to check refseq version (release). Actually i am using >> > some >> > server to blast my sequences (blastall) against refseq. Is there any way >> i >> > can get the version information on the refseq database (from the blast >> > file >> > or directly from the database)? >> > >> > Thanks >> > Shalu >> > _______________________________________________ >> > Bioperl-l mailing list >> > Bioperl-l at lists.open-bio.org >> > http://lists.open-bio.org/mailman/listinfo/bioperl-l >> ======================================================================= >> Attention: The information contained in this message and/or attachments >> from AgResearch Limited is intended only for the persons or entities >> to which it is addressed and may contain confidential and/or privileged >> material. Any review, retransmission, dissemination or other use of, or >> taking of any action in reliance upon, this information by persons or >> entities other than the intended recipients is prohibited by AgResearch >> Limited. If you have received this message in error, please notify the >> sender immediately. >> ======================================================================= >> > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From j.scholtalbers at gmail.com Fri Feb 5 10:36:50 2010 From: j.scholtalbers at gmail.com (Jelle Scholtalbers) Date: Fri, 5 Feb 2010 16:36:50 +0100 Subject: [Bioperl-l] Bio::DB::SeqFeature::Store::memory -> filter_by_type very slow In-Reply-To: <3435CD81-A2B1-4419-995B-751BE3810017@illinois.edu> References: <3435CD81-A2B1-4419-995B-751BE3810017@illinois.edu> Message-ID: Hi, a different issue I encountered today with the Bio::DB::SeqFeature::Store::memory is with the BINSIZE that this module sets: use constant BINSIZE => 10_000; It would be nice to be able to set this dynamically since different GFF files ask for different indexing rules. I ran into this as a problem when I used a file that had its position multiplied by 1000 and at that point the program ran fairly quick, 3-4min. After dividing the positions by 1000 (which is desired) the program took ~30min. to finish. The slowdown was traceable to Bio::DB::SeqFeature::Store::memory::filter_by_location. By setting the BINSIZE to 1 the issue was solved. However for another GFF file this size is way too low. Is this already possible and did I not see it or would this be an option to add? Cheers, Jelle 2010/2/1 Chris Fields > Jelle, > > Seems reasonable, but Lincoln and Scott know that code better and are > better suited to comment on it. Lincoln, Scott? > > chris > > On Feb 1, 2010, at 6:24 AM, Jelle Scholtalbers wrote: > > > Hi, > > I used the Bio::DB::SeqFeature::Store::memory module to load in a GFF3 > file > > which I could then use in my script in a 'queryable' way. To retrieve > > features I used for example > > $db->features(-type => 'BAC:FPC', -seq_id=>'chromosome0') > > However when doing a profile on my script I found out that 60% of the > > running time went into filter_by_type from > > Bio::DB::SeqFeature::Store::memory. > > Replacing this function with > > my @features = grep{$_->type eq 'BAC:FPC'} > > $db->features(-seq_id=>'chromosome0') > > which gave me the same results was just a fraction of the earlier run > time. > > My script went from 60min. to 4min. for the same result and only changing > > this function (is called often). > > Can/Should this be fixed or is this just the faster way to do it? > > > > Cheers, > > Jelle > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From lincoln.stein at gmail.com Fri Feb 5 10:46:02 2010 From: lincoln.stein at gmail.com (Lincoln Stein) Date: Fri, 5 Feb 2010 10:46:02 -0500 Subject: [Bioperl-l] Bio::DB::SeqFeature::Store::memory -> filter_by_type very slow In-Reply-To: References: Message-ID: <6dce9a0b1002050746o1271259dxf52972ef41d591f0@mail.gmail.com> I think the problem with the filter function is that the type you request may be "BAC" but the feature's type is "BAC:FPC", and you want to be able to filter by the more generic type terms. Nevertheless I'm sure we can do better than 60 min running time and so I'll have to look at how this function works more carefully. I can't do this right now, unfortunately, so perhaps someone on the mailing list would be willing to take a look? Lincoln On Mon, Feb 1, 2010 at 7:24 AM, Jelle Scholtalbers wrote: > Hi, > I used the Bio::DB::SeqFeature::Store::memory module to load in a GFF3 file > which I could then use in my script in a 'queryable' way. To retrieve > features I used for example > $db->features(-type => 'BAC:FPC', -seq_id=>'chromosome0') > However when doing a profile on my script I found out that 60% of the > running time went into filter_by_type from > Bio::DB::SeqFeature::Store::memory. > Replacing this function with > my @features = grep{$_->type eq 'BAC:FPC'} > $db->features(-seq_id=>'chromosome0') > which gave me the same results was just a fraction of the earlier run time. > My script went from 60min. to 4min. for the same result and only changing > this function (is called often). > Can/Should this be fixed or is this just the faster way to do it? > > Cheers, > Jelle > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > -- Lincoln D. Stein Director, Informatics and Biocomputing Platform Ontario Institute for Cancer Research 101 College St., Suite 800 Toronto, ON, Canada M5G0A3 416 673-8514 Assistant: Renata Musa From mauricio at open-bio.org Fri Feb 5 10:48:30 2010 From: mauricio at open-bio.org (Mauricio Herrera Cuadra) Date: Fri, 05 Feb 2010 09:48:30 -0600 Subject: [Bioperl-l] Fwd: Changes to NCBI BLAST and E-utilities. Message-ID: <4B6C3DCE.2070808@open-bio.org> Forwarding to the proper lists... -------- Original Message -------- Subject: [O|B|F Helpdesk #889] Changes to NCBI BLAST and E-utilities. Date: Fri, 5 Feb 2010 10:08:51 -0500 From: mcginnis via RT Reply-To: support at helpdesk.open-bio.org To: chris at bioteam.net, heikki at sanbi.ac.za, hlapp at gmx.net, jason at bioperl.org, mauricio at open-bio.org Fri Feb 05 10:08:51 2010: Request 889 was acted upon. Transaction: Ticket created by mcginnis at ncbi.nlm.nih.gov Queue: support at open-bio.org Subject: Changes to NCBI BLAST and E-utilities. Owner: Nobody Requestors: mcginnis at ncbi.nlm.nih.gov Status: new Ticket Dear Colleague: There are two changes I'd like to make you aware of. As you may or may not have noticed, we have been working on a new C++ version of the BLAST binaries. In the coming months we will be moving the C++ binaries into prominence and (slowly) phasing out the C toolkit binaries. There are many changes not least of which is a move to individual binaries for each program (blastn, blastp, etc). We are not sure how many of your users use BioPerl with the BLAST binaries, my understanding is that many use BioPerl to to remote BLAST. However, there isa change to the BLAST results in Text and presumably HTML. This could have an effect on any parsers which scrape these formats and do not use XML. For obvious reason, we want to support only the XML format for parsing, but we thought we should give you heads up on this. blast 2.2.22 Query: 3307 ------------------------------------------------------------ 3307 Sbjct: 390 GSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGP 449 blast 2.2.22+ Query ------------------------------------------------------------ Sbjct 390 GSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGP 449 A single line of gaps lacks the Query numbering in the blast+ output. The C version of blast has numbering in this case. Sample alignment shown below. According to users the blast+ output without the numbering breaks bioperl parsers. Wehave heard forma few but I think they may be older parsers? The second issue is a policy concerning E-utilities. This was announced on the utilities-announce at ncbi.nlm.nih.gov mail-list but you may not have seen it. As part of an ongoing effort to ensure efficient access to the Entrez Utilities (E-utilities) by all users, NCBI has decided to change the usage policy for the E-utilities effective June 1, 2010. Effective on June 1, 2010, all E-utility requests, either using standard URLs or SOAP, must contain non-null values for both the &tool and &email parameters. Any E-utility request made after June 1, 2010 that does not contain values for both parameters will return an error explaining that these parameters must be included in E-utility requests. The value of the &tool parameter should be a URI-safe string that is the name of the software package, script or web page producing the E-utility request. The value of the &email parameter should be a valid e-mail address for the appropriate contact person or group responsible for maintaining the tool producing the E-utility request. NCBI uses these parameters to contact users whose use of the E-utilities violates the standard usage policies described athttp://eutils.ncbi.nlm.nih.gov/entrez/query/static/eutils_help.html#UserSystemRequirements. These usage policies are designed to prevent excessive requests from a small group of users from reducing or eliminating the wider community's access to the E-utilities. NCBI will attempt to contact a user at the e-mail address provided in the &email parameter prior to blocking access to the E-utilities. NCBI realizes that this policy change will require many of our users to change their code. Based on past experience, we anticipate that most of our users should be able to make the necessary changes before the June 1, 2010 deadline. If you have any concerns about making these changes by that date, or if you have any questions about these policies, please contact eutilities at ncbi.nlm.nih.gov. Thank you for your understanding and cooperation in helping us continue to deliver a reliable and efficient web service. I think you already adhere to this policy but should a user's script not meet these requirements, than the script will fail and requests will be turned away with an error message. Scott D. McGinnis M.A. NCBI/NLM/NIH 45 Center Drive, MSC 6511 Bldg 45, Room 4AN.44C Bethesda, MD 20892 mcginnis at ncbi.nlm.nih.gov From maj at fortinbras.us Fri Feb 5 10:57:29 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Fri, 5 Feb 2010 10:57:29 -0500 Subject: [Bioperl-l] Bio::DB::SeqFeature::Store::memory ->filter_by_type very slow In-Reply-To: <6dce9a0b1002050746o1271259dxf52972ef41d591f0@mail.gmail.com> References: <6dce9a0b1002050746o1271259dxf52972ef41d591f0@mail.gmail.com> Message-ID: my guess is ..memory::find_types is costly, as it descends into Bio::DB::GFF::Typename objects...could implement a cache or memo? ----- Original Message ----- From: "Lincoln Stein" To: "Jelle Scholtalbers" Cc: Sent: Friday, February 05, 2010 10:46 AM Subject: Re: [Bioperl-l] Bio::DB::SeqFeature::Store::memory ->filter_by_type very slow >I think the problem with the filter function is that the type you request > may be "BAC" but the feature's type is "BAC:FPC", and you want to be able to > filter by the more generic type terms. Nevertheless I'm sure we can do > better than 60 min running time and so I'll have to look at how this > function works more carefully. I can't do this right now, unfortunately, so > perhaps someone on the mailing list would be willing to take a look? > > Lincoln > > On Mon, Feb 1, 2010 at 7:24 AM, Jelle Scholtalbers > wrote: > >> Hi, >> I used the Bio::DB::SeqFeature::Store::memory module to load in a GFF3 file >> which I could then use in my script in a 'queryable' way. To retrieve >> features I used for example >> $db->features(-type => 'BAC:FPC', -seq_id=>'chromosome0') >> However when doing a profile on my script I found out that 60% of the >> running time went into filter_by_type from >> Bio::DB::SeqFeature::Store::memory. >> Replacing this function with >> my @features = grep{$_->type eq 'BAC:FPC'} >> $db->features(-seq_id=>'chromosome0') >> which gave me the same results was just a fraction of the earlier run time. >> My script went from 60min. to 4min. for the same result and only changing >> this function (is called often). >> Can/Should this be fixed or is this just the faster way to do it? >> >> Cheers, >> Jelle >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > > > > -- > Lincoln D. Stein > Director, Informatics and Biocomputing Platform > Ontario Institute for Cancer Research > 101 College St., Suite 800 > Toronto, ON, Canada M5G0A3 > 416 673-8514 > Assistant: Renata Musa > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From cjfields at illinois.edu Fri Feb 5 11:16:29 2010 From: cjfields at illinois.edu (Chris Fields) Date: Fri, 5 Feb 2010 10:16:29 -0600 Subject: [Bioperl-l] Fwd: Changes to NCBI BLAST and E-utilities. In-Reply-To: <4B6C3DCE.2070808@open-bio.org> References: <4B6C3DCE.2070808@open-bio.org> Message-ID: Thanks Mauricio. I think the only thing of impact will be the BLAST+ stuff (and Mark has developed a wrapper for it). We'll need to update the SearchIO parser, so any BLAST+ output examples would be tremendously helpful. chris On Feb 5, 2010, at 9:48 AM, Mauricio Herrera Cuadra wrote: > Forwarding to the proper lists... > > -------- Original Message -------- > Subject: [O|B|F Helpdesk #889] Changes to NCBI BLAST and E-utilities. > Date: Fri, 5 Feb 2010 10:08:51 -0500 > From: mcginnis via RT > Reply-To: support at helpdesk.open-bio.org > To: chris at bioteam.net, heikki at sanbi.ac.za, hlapp at gmx.net, jason at bioperl.org, mauricio at open-bio.org > > > Fri Feb 05 10:08:51 2010: Request 889 was acted upon. > Transaction: Ticket created by mcginnis at ncbi.nlm.nih.gov > Queue: support at open-bio.org > Subject: Changes to NCBI BLAST and E-utilities. > Owner: Nobody > Requestors: mcginnis at ncbi.nlm.nih.gov > Status: new > Ticket > > > Dear Colleague: > > There are two changes I'd like to make you aware of. > > As you may or may not have noticed, we have been working on a new C++ version of the BLAST binaries. In the coming months we will be moving the C++ binaries into prominence and (slowly) phasing out the C toolkit binaries. > > There are many changes not least of which is a move to individual binaries for each program (blastn, blastp, etc). We are not sure how many of your users use BioPerl with the BLAST binaries, my understanding is that many use BioPerl to to remote BLAST. However, there isa change to the BLAST results in Text and presumably HTML. This could have an effect on any parsers which scrape these formats and do not use XML. For obvious reason, we want to support only the XML format for parsing, but we thought we should give you heads up on this. > blast 2.2.22 > > Query: 3307 ------------------------------------------------------------ 3307 > > Sbjct: 390 GSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGP 449 > > blast 2.2.22+ > > Query ------------------------------------------------------------ > > Sbjct 390 GSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGP 449 > > A single line of gaps lacks the Query numbering in the blast+ output. The C version of blast has numbering in this case. Sample alignment shown below. According to users the blast+ output without the numbering breaks bioperl parsers. Wehave heard forma few but I think they may be older parsers? > The second issue is a policy concerning E-utilities. This was announced on the utilities-announce at ncbi.nlm.nih.gov mail-list but you may not have seen it. > > As part of an ongoing effort to ensure efficient access to the Entrez Utilities (E-utilities) by all users, NCBI has decided to change the usage policy for the E-utilities effective June 1, 2010. Effective on June 1, 2010, all E-utility requests, either using standard URLs or SOAP, must contain non-null values for both the &tool and &email parameters. Any E-utility request made after June 1, 2010 that does not contain values for both parameters will return an error explaining that these parameters must be included in E-utility requests. > > The value of the &tool parameter should be a URI-safe string that is the name of the software package, script or web page producing the E-utility request. > > The value of the &email parameter should be a valid e-mail address for the appropriate contact person or group responsible for maintaining the tool producing the E-utility request. > > NCBI uses these parameters to contact users whose use of the E-utilities violates the standard usage policies described athttp://eutils.ncbi.nlm.nih.gov/entrez/query/static/eutils_help.html#UserSystemRequirements. These usage policies are designed to prevent excessive requests from a small group of users from reducing or eliminating the wider community's access to the E-utilities. NCBI will attempt to contact a user at the e-mail address provided in the &email parameter prior to blocking access to the E-utilities. > > NCBI realizes that this policy change will require many of our users to change their code. Based on past experience, we anticipate that most of our users should be able to make the necessary changes before the June 1, 2010 deadline. If you have any concerns about making these changes by that date, or if you have any questions about these policies, please contact eutilities at ncbi.nlm.nih.gov. > > Thank you for your understanding and cooperation in helping us continue to deliver a reliable and efficient web service. > > I think you already adhere to this policy but should a user's script not meet these requirements, than the script will fail and requests will be turned away with an error message. > > Scott D. McGinnis M.A. > NCBI/NLM/NIH > 45 Center Drive, MSC 6511 > Bldg 45, Room 4AN.44C > Bethesda, MD 20892 > mcginnis at ncbi.nlm.nih.gov > > > > > > > > > > > > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From fungazid at yahoo.com Fri Feb 5 11:25:00 2010 From: fungazid at yahoo.com (Fungazid) Date: Fri, 5 Feb 2010 08:25:00 -0800 (PST) Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <4536f7701002041938m3ad2d6ddgf8fa56312d62c28a@mail.gmail.com> Message-ID: <654818.36134.qm@web65508.mail.ac4.yahoo.com> Scott, Jason, hi, I replaced my code with: for(...) {system("bp_seqfeature_load.pl -d contigs -u root -p xxxx ./$file[$i]")==0 or die;} run it, and it went smoothly (or seem to be like that ). The reason I used an alternative script is that it is almost identical to bp_seqfeature_load.pl but it calls the following functions only once: Bio::DB::SeqFeature::Store->new(..) Bio::DB::SeqFeature::Store::GFF3Loader->new(...) Thanks, Avi --- On Fri, 2/5/10, Scott Cain wrote: > From: Scott Cain > Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm > To: "Fungazid" > Cc: Bioperl-l at lists.open-bio.org > Date: Friday, February 5, 2010, 5:38 AM > Hi Avi, > > Off hand, I have no idea what is happening.? Is there > a reason you > aren't using bp_seqfeature_load.pl? > > Scott > > > On Thu, Feb 4, 2010 at 9:52 PM, Fungazid > wrote: > > Bioperl hi, > > > > I have multiple gff3 files that look like: > > > > > -------------------------------------------------------- > > ##gff-version 3 > > # file: temp1.gff3 > > contig453278 ? ?hs ? ?contig ? ?1 ? ?699 ? > ?. ? ?+ ? ?. ? ?Name=contig453278 > > contig453278 ? ?hs ? ?alignment ? ?1 ? ?194 ? > ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455 > > contig453278 ? ?hs ? ?alignment ? ?252 ? ?699 > ? ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455 > > contig453278 ? ?hs ? ?mi2 ? ?194 ? ?252 ? ?. > ? ?+ ? ?. ? ?Name=mi2 > > contig453278 ? ?hs ? ?transcript ? ?104 ? ?183 > ? ?. ? ?+ ? ?. ? > ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 > > contig453278 ? ?hs ? ?transcript ? ?1 ? ?9 ? > ?. ? ?+ ? ?. ? > ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 > > contig453278 ? ?hs ? ?transcript ? ?73 ? ?103 > ? ?. ? ?+ ? ?. ? > ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 > > contig453278 ? ?hs ? ?transcript ? ?10 ? ?72 > ? ?. ? ?+ ? ?. ? > ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 > > > --------------------------------------------------------- > > > > > > > > I uploaded gff3 files to mysql and view the tracks in > gBrowse like this: > > > > -------------------------------------------- > > Bio::DB::SeqFeature::Store->new(...) > > > $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) > > for($i=0; $i<$#files+1; $i++){ > > $loader->load($file[$i]); > > } > > -------------------------------------------- > > > > Eventually after uploading ~15000 small gff3 files I > got the error: > > > > Error in tempdir() using /tmp/XXXXXXXXXX: Could not > create directory /tmp/9bbpdE5ibf: Too many links at > /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm > line 49 > > > > > > > > I am trying to see what went wrong, maybe you can > rescue me, > > Avi > > > > > > > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > > > > -- > ------------------------------------------------------------------------ > Scott Cain, Ph. D.? ? ? ? ? ? > ? ? ? ? ? ? ? ? > ? ? ???scott at scottcain dot net > GMOD Coordinator (http://gmod.org/)? ? ? ? ? > ? ? ? ? ???216-392-3087 > Ontario Institute for Cancer Research > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > From scott at scottcain.net Fri Feb 5 11:50:59 2010 From: scott at scottcain.net (Scott Cain) Date: Fri, 5 Feb 2010 11:50:59 -0500 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <654818.36134.qm@web65508.mail.ac4.yahoo.com> References: <4536f7701002041938m3ad2d6ddgf8fa56312d62c28a@mail.gmail.com> <654818.36134.qm@web65508.mail.ac4.yahoo.com> Message-ID: <4536f7701002050850h65a242f7o84b5df6cec0a51f2@mail.gmail.com> I believe you can call the loading script with more than one GFF file as the argument, eliminating the need for the for loop. On Fri, Feb 5, 2010 at 11:25 AM, Fungazid wrote: > Scott, Jason, hi, > > I replaced my code with: > > for(...) {system("bp_seqfeature_load.pl -d contigs -u root -p xxxx ./$file[$i]")==0 or die;} > > run it, and it went smoothly (or seem to be like that ). > > The reason I used an alternative script is that it is almost identical to bp_seqfeature_load.pl but it calls the following functions only once: > > Bio::DB::SeqFeature::Store->new(..) > Bio::DB::SeqFeature::Store::GFF3Loader->new(...) > > > Thanks, > Avi > > > --- On Fri, 2/5/10, Scott Cain wrote: > >> From: Scott Cain >> Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm >> To: "Fungazid" >> Cc: Bioperl-l at lists.open-bio.org >> Date: Friday, February 5, 2010, 5:38 AM >> Hi Avi, >> >> Off hand, I have no idea what is happening.? Is there >> a reason you >> aren't using bp_seqfeature_load.pl? >> >> Scott >> >> >> On Thu, Feb 4, 2010 at 9:52 PM, Fungazid >> wrote: >> > Bioperl hi, >> > >> > I have multiple gff3 files that look like: >> > >> > >> -------------------------------------------------------- >> > ##gff-version 3 >> > # file: temp1.gff3 >> > contig453278 ? ?hs ? ?contig ? ?1 ? ?699 >> ?. ? ?+ ? ?. ? ?Name=contig453278 >> > contig453278 ? ?hs ? ?alignment ? ?1 ? ?194 >> ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455 >> > contig453278 ? ?hs ? ?alignment ? ?252 ? ?699 >> ? ?. ? ?+ ? ?. ? ?Name=hs.ENST00000258455 >> > contig453278 ? ?hs ? ?mi2 ? ?194 ? ?252 ? ?. >> ? ?+ ? ?. ? ?Name=mi2 >> > contig453278 ? ?hs ? ?transcript ? ?104 ? ?183 >> ? ?. ? ?+ ? ?. >> ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >> > contig453278 ? ?hs ? ?transcript ? ?1 ? ?9 >> ?. ? ?+ ? ?. >> ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >> > contig453278 ? ?hs ? ?transcript ? ?73 ? ?103 >> ? ?. ? ?+ ? ?. >> ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >> > contig453278 ? ?hs ? ?transcript ? ?10 ? ?72 >> ? ?. ? ?+ ? ?. >> ?Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >> > >> --------------------------------------------------------- >> > >> > >> > >> > I uploaded gff3 files to mysql and view the tracks in >> gBrowse like this: >> > >> > -------------------------------------------- >> > Bio::DB::SeqFeature::Store->new(...) >> > >> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >> > for($i=0; $i<$#files+1; $i++){ >> > $loader->load($file[$i]); >> > } >> > -------------------------------------------- >> > >> > Eventually after uploading ~15000 small gff3 files I >> got the error: >> > >> > Error in tempdir() using /tmp/XXXXXXXXXX: Could not >> create directory /tmp/9bbpdE5ibf: Too many links at >> /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm >> line 49 >> > >> > >> > >> > I am trying to see what went wrong, maybe you can >> rescue me, >> > Avi >> > >> > >> > >> > >> > _______________________________________________ >> > Bioperl-l mailing list >> > Bioperl-l at lists.open-bio.org >> > http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > >> >> >> >> -- >> ------------------------------------------------------------------------ >> Scott Cain, Ph. D. >> >> ? ? ???scott at scottcain dot net >> GMOD Coordinator (http://gmod.org/) >> ? ? ? ? ???216-392-3087 >> Ontario Institute for Cancer Research >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > > > > > -- ------------------------------------------------------------------------ Scott Cain, Ph. D. scott at scottcain dot net GMOD Coordinator (http://gmod.org/) 216-392-3087 Ontario Institute for Cancer Research From maj at fortinbras.us Fri Feb 5 11:32:12 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Fri, 5 Feb 2010 11:32:12 -0500 Subject: [Bioperl-l] Fwd: Changes to NCBI BLAST and E-utilities. In-Reply-To: References: <4B6C3DCE.2070808@open-bio.org> Message-ID: <8DEC18FA40C04635B6E7598DF27C3906@NewLife> all -- note too that blast+ has remote searches built in, and you can access this functionality in StandAloneBlastPlus by specifying -remote => 1 to the factory-- cheers MAJ ----- Original Message ----- From: "Chris Fields" To: "Mauricio Herrera Cuadra" Cc: "BioPerl List" Sent: Friday, February 05, 2010 11:16 AM Subject: Re: [Bioperl-l] Fwd: Changes to NCBI BLAST and E-utilities. > Thanks Mauricio. I think the only thing of impact will be the BLAST+ stuff > (and Mark has developed a wrapper for it). We'll need to update the SearchIO > parser, so any BLAST+ output examples would be tremendously helpful. > > chris > > On Feb 5, 2010, at 9:48 AM, Mauricio Herrera Cuadra wrote: > >> Forwarding to the proper lists... >> >> -------- Original Message -------- >> Subject: [O|B|F Helpdesk #889] Changes to NCBI BLAST and E-utilities. >> Date: Fri, 5 Feb 2010 10:08:51 -0500 >> From: mcginnis via RT >> Reply-To: support at helpdesk.open-bio.org >> To: chris at bioteam.net, heikki at sanbi.ac.za, hlapp at gmx.net, jason at bioperl.org, >> mauricio at open-bio.org >> >> >> Fri Feb 05 10:08:51 2010: Request 889 was acted upon. >> Transaction: Ticket created by mcginnis at ncbi.nlm.nih.gov >> Queue: support at open-bio.org >> Subject: Changes to NCBI BLAST and E-utilities. >> Owner: Nobody >> Requestors: mcginnis at ncbi.nlm.nih.gov >> Status: new >> Ticket >> >> >> Dear Colleague: >> >> There are two changes I'd like to make you aware of. >> >> As you may or may not have noticed, we have been working on a new C++ version >> of the BLAST binaries. In the coming months we will be moving the C++ >> binaries into prominence and (slowly) phasing out the C toolkit binaries. >> >> There are many changes not least of which is a move to individual binaries >> for each program (blastn, blastp, etc). We are not sure how many of your >> users use BioPerl with the BLAST binaries, my understanding is that many use >> BioPerl to to remote BLAST. However, there isa change to the BLAST results in >> Text and presumably HTML. This could have an effect on any parsers which >> scrape these formats and do not use XML. For obvious reason, we want to >> support only the XML format for parsing, but we thought we should give you >> heads up on this. >> blast 2.2.22 >> >> Query: 3307 ------------------------------------------------------------ 3307 >> >> Sbjct: 390 GSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGP 449 >> >> blast 2.2.22+ >> >> Query ------------------------------------------------------------ >> >> Sbjct 390 GSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGPEAFRGSGP 449 >> >> A single line of gaps lacks the Query numbering in the blast+ output. The C >> version of blast has numbering in this case. Sample alignment shown below. >> According to users the blast+ output without the numbering breaks bioperl >> parsers. Wehave heard forma few but I think they may be older parsers? >> The second issue is a policy concerning E-utilities. This was announced on >> the utilities-announce at ncbi.nlm.nih.gov mail-list but you may not have seen >> it. >> >> As part of an ongoing effort to ensure efficient access to the Entrez >> Utilities (E-utilities) by all users, NCBI has decided to change the usage >> policy for the E-utilities effective June 1, 2010. Effective on June 1, 2010, >> all E-utility requests, either using standard URLs or SOAP, must contain >> non-null values for both the &tool and &email parameters. Any E-utility >> request made after June 1, 2010 that does not contain values for both >> parameters will return an error explaining that these parameters must be >> included in E-utility requests. >> >> The value of the &tool parameter should be a URI-safe string that is the name >> of the software package, script or web page producing the E-utility request. >> >> The value of the &email parameter should be a valid e-mail address for the >> appropriate contact person or group responsible for maintaining the tool >> producing the E-utility request. >> >> NCBI uses these parameters to contact users whose use of the E-utilities >> violates the standard usage policies described >> athttp://eutils.ncbi.nlm.nih.gov/entrez/query/static/eutils_help.html#UserSystemRequirements. >> These usage policies are designed to prevent excessive requests from a small >> group of users from reducing or eliminating the wider community's access to >> the E-utilities. NCBI will attempt to contact a user at the e-mail address >> provided in the &email parameter prior to blocking access to the E-utilities. >> >> NCBI realizes that this policy change will require many of our users to >> change their code. Based on past experience, we anticipate that most of our >> users should be able to make the necessary changes before the June 1, 2010 >> deadline. If you have any concerns about making these changes by that date, >> or if you have any questions about these policies, please contact >> eutilities at ncbi.nlm.nih.gov. >> >> Thank you for your understanding and cooperation in helping us continue to >> deliver a reliable and efficient web service. >> >> I think you already adhere to this policy but should a user's script not meet >> these requirements, than the script will fail and requests will be turned >> away with an error message. >> >> Scott D. McGinnis M.A. >> NCBI/NLM/NIH >> 45 Center Drive, MSC 6511 >> Bldg 45, Room 4AN.44C >> Bethesda, MD 20892 >> mcginnis at ncbi.nlm.nih.gov >> >> >> >> >> >> >> >> >> >> >> >> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From miquel.ramia at uab.cat Fri Feb 5 12:43:17 2010 From: miquel.ramia at uab.cat (Miquel Ramia) Date: Fri, 05 Feb 2010 18:43:17 +0100 Subject: [Bioperl-l] Bio::Graphics inside PHP Message-ID: <4B6C58B5.3080304@uab.cat> Hi, I'm working in a database with a web interface developed in php. I'm displaying Bio::Graphic images inside an IFrame, but doesn't look very well. Anybody knows if it's possible to include a Bio::Graphics image directly in a php page? Thank you Miquel R?mia From sharmashalu.bio at gmail.com Fri Feb 5 14:21:36 2010 From: sharmashalu.bio at gmail.com (shalu sharma) Date: Fri, 5 Feb 2010 14:21:36 -0500 Subject: [Bioperl-l] Aligning Reads Message-ID: <465b5a661002051121l50402168s7a715e62da6812ee@mail.gmail.com> Hi All, Is there any program available which can align reads (200-400bp in fasta format) against a reference sequence and the whole alignment can be viewed graphically. I tried bowtie but i think it only takes fastaq format. Thanks shalu From j.scholtalbers at gmail.com Fri Feb 5 14:22:43 2010 From: j.scholtalbers at gmail.com (Jelle Scholtalbers) Date: Fri, 5 Feb 2010 20:22:43 +0100 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <4536f7701002050850h65a242f7o84b5df6cec0a51f2@mail.gmail.com> References: <4536f7701002041938m3ad2d6ddgf8fa56312d62c28a@mail.gmail.com> <654818.36134.qm@web65508.mail.ac4.yahoo.com> <4536f7701002050850h65a242f7o84b5df6cec0a51f2@mail.gmail.com> Message-ID: Also specifying the -f(ast) option could be beneficial for you. Probably you have a limit in how many files you can specify on the commandline, so could it also be an idea for you to concatenate the files before loading them? Or take a look at xargs when on linux: "find -maxdepth 1 -name '*.gff' | xargs bp_seqfeature_load.pl -d contigs -u root -p xxxx -f" 2010/2/5 Scott Cain > I believe you can call the loading script with more than one GFF file > as the argument, eliminating the need for the for loop. > > > > On Fri, Feb 5, 2010 at 11:25 AM, Fungazid wrote: > > Scott, Jason, hi, > > > > I replaced my code with: > > > > for(...) {system("bp_seqfeature_load.pl -d contigs -u root -p xxxx > ./$file[$i]")==0 or die;} > > > > run it, and it went smoothly (or seem to be like that ). > > > > The reason I used an alternative script is that it is almost identical to > bp_seqfeature_load.pl but it calls the following functions only once: > > > > Bio::DB::SeqFeature::Store->new(..) > > Bio::DB::SeqFeature::Store::GFF3Loader->new(...) > > > > > > Thanks, > > Avi > > > > > > --- On Fri, 2/5/10, Scott Cain wrote: > > > >> From: Scott Cain > >> Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm > >> To: "Fungazid" > >> Cc: Bioperl-l at lists.open-bio.org > >> Date: Friday, February 5, 2010, 5:38 AM > >> Hi Avi, > >> > >> Off hand, I have no idea what is happening. Is there > >> a reason you > >> aren't using bp_seqfeature_load.pl? > >> > >> Scott > >> > >> > >> On Thu, Feb 4, 2010 at 9:52 PM, Fungazid > >> wrote: > >> > Bioperl hi, > >> > > >> > I have multiple gff3 files that look like: > >> > > >> > > >> -------------------------------------------------------- > >> > ##gff-version 3 > >> > # file: temp1.gff3 > >> > contig453278 hs contig 1 699 > >> . + . Name=contig453278 > >> > contig453278 hs alignment 1 194 > >> . + . Name=hs.ENST00000258455 > >> > contig453278 hs alignment 252 699 > >> . + . Name=hs.ENST00000258455 > >> > contig453278 hs mi2 194 252 . > >> + . Name=mi2 > >> > contig453278 hs transcript 104 183 > >> . + . > >> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 > >> > contig453278 hs transcript 1 9 > >> . + . > >> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 > >> > contig453278 hs transcript 73 103 > >> . + . > >> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 > >> > contig453278 hs transcript 10 72 > >> . + . > >> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 > >> > > >> --------------------------------------------------------- > >> > > >> > > >> > > >> > I uploaded gff3 files to mysql and view the tracks in > >> gBrowse like this: > >> > > >> > -------------------------------------------- > >> > Bio::DB::SeqFeature::Store->new(...) > >> > > >> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) > >> > for($i=0; $i<$#files+1; $i++){ > >> > $loader->load($file[$i]); > >> > } > >> > -------------------------------------------- > >> > > >> > Eventually after uploading ~15000 small gff3 files I > >> got the error: > >> > > >> > Error in tempdir() using /tmp/XXXXXXXXXX: Could not > >> create directory /tmp/9bbpdE5ibf: Too many links at > >> /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm > >> line 49 > >> > > >> > > >> > > >> > I am trying to see what went wrong, maybe you can > >> rescue me, > >> > Avi > >> > > >> > > >> > > >> > > >> > _______________________________________________ > >> > Bioperl-l mailing list > >> > Bioperl-l at lists.open-bio.org > >> > http://lists.open-bio.org/mailman/listinfo/bioperl-l > >> > > >> > >> > >> > >> -- > >> ------------------------------------------------------------------------ > >> Scott Cain, Ph. D. > >> > >> scott at scottcain dot net > >> GMOD Coordinator (http://gmod.org/) > >> 216-392-3087 > >> Ontario Institute for Cancer Research > >> > >> _______________________________________________ > >> Bioperl-l mailing list > >> Bioperl-l at lists.open-bio.org > >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > >> > > > > > > > > > > > > > > -- > ------------------------------------------------------------------------ > Scott Cain, Ph. D. scott at scottcain dot > net > GMOD Coordinator (http://gmod.org/) 216-392-3087 > Ontario Institute for Cancer Research > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > From awitney at sgul.ac.uk Fri Feb 5 14:14:30 2010 From: awitney at sgul.ac.uk (Adam Witney) Date: Fri, 5 Feb 2010 19:14:30 +0000 Subject: [Bioperl-l] Bio::Graphics inside PHP In-Reply-To: <4B6C58B5.3080304@uab.cat> References: <4B6C58B5.3080304@uab.cat> Message-ID: <7E0F3ED7-682E-492B-99D7-45C77041F6F5@sgul.ac.uk> you could output an tag where the source is a perl script that generates the image. eg that perl script would then use Bio::Graphics to generate the image HTH adam On 5 Feb 2010, at 17:43, Miquel Ramia wrote: > Hi, > > I'm working in a database with a web interface developed in php. I'm displaying Bio::Graphic images inside an IFrame, but doesn't look very well. > > Anybody knows if it's possible to include a Bio::Graphics image directly in a php page? > > Thank you > > Miquel R?mia > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From sdavis2 at mail.nih.gov Fri Feb 5 14:40:05 2010 From: sdavis2 at mail.nih.gov (Sean Davis) Date: Fri, 5 Feb 2010 14:40:05 -0500 Subject: [Bioperl-l] Aligning Reads In-Reply-To: <465b5a661002051121l50402168s7a715e62da6812ee@mail.gmail.com> References: <465b5a661002051121l50402168s7a715e62da6812ee@mail.gmail.com> Message-ID: <264855a01002051140sa9d309cw302c3266f628e9a3@mail.gmail.com> On Fri, Feb 5, 2010 at 2:21 PM, shalu sharma wrote: > Hi All, > ? ? ? ? ? ?Is there any program available which can align reads (200-400bp > in fasta format) against a reference sequence and the whole alignment can be > viewed graphically. > I tried bowtie but i think it only takes fastaq format. Take a look at the -f option for bowtie. Sean From abhishek.vit at gmail.com Fri Feb 5 15:23:26 2010 From: abhishek.vit at gmail.com (Abhishek Pratap) Date: Fri, 5 Feb 2010 15:23:26 -0500 Subject: [Bioperl-l] Aligning Reads In-Reply-To: <264855a01002051140sa9d309cw302c3266f628e9a3@mail.gmail.com> References: <465b5a661002051121l50402168s7a715e62da6812ee@mail.gmail.com> <264855a01002051140sa9d309cw302c3266f628e9a3@mail.gmail.com> Message-ID: I would also try Mosaik. It can take fasta as input and align both short and long reads. -Abhi On Fri, Feb 5, 2010 at 2:40 PM, Sean Davis wrote: > On Fri, Feb 5, 2010 at 2:21 PM, shalu sharma wrote: >> Hi All, >> ? ? ? ? ? ?Is there any program available which can align reads (200-400bp >> in fasta format) against a reference sequence and the whole alignment can be >> viewed graphically. >> I tried bowtie but i think it only takes fastaq format. > > Take a look at the -f option for bowtie. > > Sean > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From thomas.kallman at ebc.uu.se Fri Feb 5 16:23:50 2010 From: thomas.kallman at ebc.uu.se (=?ISO-8859-1?Q?Thomas_K=E4llman?=) Date: Fri, 5 Feb 2010 22:23:50 +0100 Subject: [Bioperl-l] Aligning Reads In-Reply-To: <465b5a661002051121l50402168s7a715e62da6812ee@mail.gmail.com> References: <465b5a661002051121l50402168s7a715e62da6812ee@mail.gmail.com> Message-ID: Hi Shalu, Not really related to bioperl, but you can find a nice list of software aligning short read data at http://seqanswers.com/forums/showthread.php?t=43 /thomas On Feb 5, 2010, at 8:21 PM, shalu sharma wrote: > Hi All, > Is there any program available which can align reads > (200-400bp > in fasta format) against a reference sequence and the whole > alignment can be > viewed graphically. > I tried bowtie but i think it only takes fastaq format. > > Thanks > shalu > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From sharmashalu.bio at gmail.com Fri Feb 5 16:43:26 2010 From: sharmashalu.bio at gmail.com (shalu sharma) Date: Fri, 5 Feb 2010 16:43:26 -0500 Subject: [Bioperl-l] Aligning Reads In-Reply-To: References: <465b5a661002051121l50402168s7a715e62da6812ee@mail.gmail.com> Message-ID: <465b5a661002051343o5791a021yf10972239a389718@mail.gmail.com> Thanks a lot for your valuable suggestions. Thanks Shalu 2010/2/5 Thomas K?llman > Hi Shalu, > > Not really related to bioperl, but you can find a nice list of software > aligning short read data at > http://seqanswers.com/forums/showthread.php?t=43 > > /thomas > > On Feb 5, 2010, at 8:21 PM, shalu sharma wrote: > > Hi All, >> Is there any program available which can align reads (200-400bp >> in fasta format) against a reference sequence and the whole alignment can >> be >> viewed graphically. >> I tried bowtie but i think it only takes fastaq format. >> >> Thanks >> shalu >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > > From florent.angly at gmail.com Fri Feb 5 21:56:56 2010 From: florent.angly at gmail.com (Florent Angly) Date: Sat, 06 Feb 2010 12:56:56 +1000 Subject: [Bioperl-l] How to draw Phylogeny Tree using Bioperl ? In-Reply-To: References: <53150.192.168.1.1.1191257835.squirrel@mail.ncbs.res.in> <1191261780.47013654cb81b@webmail.sas.upenn.edu> <48581.192.168.1.1.1191262243.squirrel@mail.ncbs.res.in> <1191263834.47013e5a6af93@webmail.sas.upenn.edu> <58283.192.168.1.1.1191263945.squirrel@mail.ncbs.res.in> <1191264531.47014113f2b54@webmail.sas.upenn.edu> Message-ID: <4B6CDA78.9060906@gmail.com> Hi, I've used Bio::Phylo for drawing SVG trees non-interactively. It's not part of Bioperl but draws nice trees: http://search.cpan.org/~rvosa/Bio-Phylo/ Florent On 02/10/07 05:32, Jason Stajich wrote: > I'd definitely recommend Bio::Tree::Draw::Cladogram over svggraph > for prettier trees - you get postscript out but you can render this > to png or jpg with unix tools. If there is a better stand alone tree > drawing engine we're happy to try and integrate it into bioperl - the > modules here are native Perl only and you can use the bioperl-run > modules that wrap DrawTree and DrawGram from EMBOSS to get other PS > rendering output. > > Mesquite, TreeView or other tools are usually much better but not > always an option if you want to auto-render these images for a > website, etc. > > -jason > > > On Oct 1, 2007, at 11:48 AM, Lucia Peixoto wrote: > > >> Yes, >> that's the issue about those commands, trees are not pretty at all >> that's why for a one tree only kind of thing I rather use ITOL >> other thing to try is the tree drawer of the Mesquite package >> glad I could help >> >> Lucia >> >> Quoting Shameer Khadar: >> >> >>> Dear Lucia, >>> >>> Thanks for the mail. Now I got it. I didnt used this TreeIO / >>> Tree::Draw >>> methods. Some how missed this excellent HOWTO : >>> http://www.bioperl.org/wiki/HOWTO:Trees. Thanks for that code as >>> well. I >>> tried that and it worked very nicely. I have to work around to >>> beautify >>> the tree and I am just going to do that. >>> >>> Thanks& Cheers, >>> Shameer >>> >>> >>>> OK >>>> >>>> you can use the implementations in Bio::TreeIO >>>> >>>> you can basically read the tree in newick format and out as an >>>> svg graph >>>> something like this: >>>> >>>> my $in = new Bio::TreeIO(-file => 'input', >>>> -format => 'newick'); >>>> my $out = new Bio::TreeIO(-file => '>mytree.svg', >>>> -format => 'svggraph'); >>>> while( my $tree = $in->next_tree ) { >>>> $out->write_tree($tree); >>>> } >>>> >>>> you can also use Bio::Tree::Draw >>>> >>>> hope that helps >>>> >>>> Lucia >>>> >>>> >>>> Quoting Shameer Khadar: >>>> >>>> >>>>> Hi, >>>>> >>>>> Thanks for your mail. I have to create these trees as a part of a >>>>> webserver. i need to generate them dynamically using users input >>>>> sequence. >>>>> I think ITOL is not the stuff best suited for my purpose. >>>>> >>>>> >>>>>> I think you'll have better luck using some of already available >>>>>> >>>>> programs >>>>> >>>>>> to do >>>>>> that, you'll get better looking trees. If you just have one >>>>>> tree to >>>>>> >>>>> draw I >>>>> >>>>>> recommend you use: >>>>>> http://itol.embl.de/ >>>>>> >>>>>> Lucia >>>>>> >>>>>> >>>>>> Quoting Shameer Khadar: >>>>>> >>>>>> >>>>>>> Dear All, >>>>>>> >>>>>>> Is it possible to draw a phylogeny tree file in PNG format using >>>>>>> >>>>> Bioperl >>>>> >>>>>>> ? >>>>>>> My input file are in phylip treefile format. Any Modules / >>>>>>> codes in >>>>>>> Bio::Graphics / Phylogeny sections ? >>>>>>> >>>>>>> Input file : >>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> ((((((_L_537_539:3.70000,_H_535_536:3.70000):4.97461, >> (((((((_E_499_500:2.75000,_E_250_251:2.75000): >> 0.55805,_L_252_254:3.30805):1.38576,_H_494_497:4.69381): >> 1.51514,_H_255_263:6.20895):0.83877, >> (_L_246_249:4.30000,_H_244_245:4.30000):2.74772): >> 0.92645,_H_520_534:7.97418):0.15279, >> (_H_502_512:6.95000,_H_273_282:6.95000):1.17697):0.54765): >> 1.10967,_L_264_272:9.78428):0.53441,_L_283_291:10.31869):3.59264, >> ((((_H_479_493:8.25300,((_H_448_462:7.25000,_L_409_411:7.25000): >> 0.50000,_L_445_447:7.75000):0.50300):1.08808, >> (((((_E_381_382:2.65000,_E_377_378:2.65000): >> 0.26434,_L_379_380:2.91434):1.77630,_L_373_376:4.69063): >> 1.70029,_L_436_444:6.39093):0.93320,_L_383_391:7.32413):2.01696): >> 0.94916,(((((_L_465_469:3.90000,_L_366_368:3.90000): >> 1.52226,_H_463_464:5.42226):0.94093, >> (_E_427_435:5.15000,_E_369_372:5.15000):1.21319): >> 1.64489,_L_336_343:8.00808):0.88402, >> (((_H_355_365:6.20000,_L_349_354:6.20000):0.91541, >> (_L_327_328:3.40000,_L_318_326:3.40000):3.71541):0.59082, >> (((((_E_470_474:3.85000,_E_344_348: >> >>>>> 3. >>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> 85000):0.89054,_L_475_478:4.74054):1.20107, >> (_E_329_335:3.85000,_E_315_317:3.85000):2.09161): >> 0.71112,_L_513_519:6.65273):0.67204, >> ((_L_296_304:5.00000,_H_292_295:5.00000): >> 0.71200,_L_305_314:5.71200):1.61276):0.38147):1.18587):1.39814): >> 0.37326,((((_L_397_398:3.55000,_E_394_396:3.55000): >> 1.36938,_L_392_393:4.91938):1.43993,_L_422_426:6.35931):1.00790, >> (_E_412_421:5.95000,_E_399_408:5.95000):1.41721):3.29629):3.24784): >> 4.31679,((((((_L_206_210:3.80000,_H_203_205:3.80000): >> 1.75687,_L__49__50:5.55687):1.29579,_H_188_202:6.85266): >> 1.08193,_H_229_243:7.93459):0.18730, >> (_L_222_228:5.55000,_H_211_221:5.55000):2.57189):3.16298, >> ((((_H_159_171:7.00000,_L_156_158:7.00000): >> 0.07448,_L_120_122:7.07448):1.59389, >> ((((_L__90__91:2.65000,_E__88__89:2.65000): >> 0.18425,_E__92__93:2.83425):1.94636,_L__84__87:4.78061): >> 1.74719,_L_147_155:6.52780):2.14057):2.44189, >> ((((((((_E_179_182:3.50000,_E__80__83:3.50000):2.15544, >> (_L_172_178:3.95000,_L__77__79:3.95000):1.70544): >> 0.42200,_E_138_146:6.07744):0.46209,_E__51__ >> >>>>> 5 >>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> 8:6.53954):0.74619,_L_183_187:7.28573):0.73805, >> (_E_123_131:5.70000,_E_110_119:5.70000):2.32378):0.58197, >> ((((_L_108_109:4.30000,_E_104_107:4.30000): >> 1.34305,_H__66__76:5.64305):0.81535,_L_132_137:6.45840):1.22044, >> (_L__94_103:6.55000,_L__59__65:6.55000):1.12884):0.92691):1.25371, >> ((_L__38__39:3.40000,_L__29__37:3.40000):3.64775, >> (((((_H___3___6:3.30000,_L___1___2:3.30000): >> 0.79885,_L__21__25:4.09885):0.80972,_E__26__28:4.90856): >> 0.88488,_E__40__48:5.79344):0.60814, >> (_L__12__20:5.25000,_H___8__11:5.25000):1.15158):0.64616):2.81172): >> 1.25080):0.17461):6.94325); >> >>>>>>> >>>>> -- >>>>> Shameer Khadar >>>>> Prof. R. Sowdhamini's Lab (# 25) The Computational Biology Group >>>>> National Centre for Biological Sciences (TIFR) >>>>> GKVK Campus, Bellary Road, Bangalore - 65, Karnataka - India >>>>> T - 91-080-23666001 EXT - 6251 >>>>> W - http://www.ncbs.res.in >>>>> >>>>> >>>> >>>> Lucia Peixoto >>>> Department of Biology,SAS >>>> University of Pennsylvania >>>> >>>> >>> >>> -- >>> Shameer Khadar >>> Prof. R. Sowdhamini's Lab (# 25) The Computational Biology Group >>> National Centre for Biological Sciences (TIFR) >>> GKVK Campus, Bellary Road, Bangalore - 65, Karnataka - India >>> T - 91-080-23666001 EXT - 6251 >>> W - http://www.ncbs.res.in >>> >>> >> >> Lucia Peixoto >> Department of Biology,SAS >> University of Pennsylvania >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > -- > Jason Stajich > jason at bioperl.org > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > From florent.angly at gmail.com Fri Feb 5 22:06:08 2010 From: florent.angly at gmail.com (Florent Angly) Date: Sat, 06 Feb 2010 13:06:08 +1000 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> Message-ID: <4B6CDCA0.6060700@gmail.com> I've run into this issue very recently, albeit working on something unrelated to loading GFF files: generating many small independent assemblies. Maybe there's a bug in the tempdir() code, or its documentation is not clear. I will take a look at it when I have some time. Florent On 05/02/10 22:27, Mark A. Jensen wrote: > The tempfiles are not being unlinked before there are too many for the > OS to handle. > Sounds like this may be your job, fungazid, if you're not employing > the script (but > not sure about that) > ----- Original Message ----- From: "Fungazid" > To: > Sent: Thursday, February 04, 2010 9:52 PM > Subject: [Bioperl-l] Too many links at LoadHelper.pm > > >> Bioperl hi, >> >> I have multiple gff3 files that look like: >> >> -------------------------------------------------------- >> ##gff-version 3 >> # file: temp1.gff3 >> contig453278 hs contig 1 699 . + . >> Name=contig453278 >> contig453278 hs alignment 1 194 . + . >> Name=hs.ENST00000258455 >> contig453278 hs alignment 252 699 . + . >> Name=hs.ENST00000258455 >> contig453278 hs mi2 194 252 . + . Name=mi2 >> contig453278 hs transcript 104 183 . + . >> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >> contig453278 hs transcript 1 9 . + . >> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >> contig453278 hs transcript 73 103 . + . >> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >> contig453278 hs transcript 10 72 . + . >> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >> --------------------------------------------------------- >> >> >> >> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: >> >> -------------------------------------------- >> Bio::DB::SeqFeature::Store->new(...) >> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >> for($i=0; $i<$#files+1; $i++){ >> $loader->load($file[$i]); >> } >> -------------------------------------------- >> >> Eventually after uploading ~15000 small gff3 files I got the error: >> >> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory >> /tmp/9bbpdE5ibf: Too many links at >> /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm >> line 49 >> >> >> >> I am trying to see what went wrong, maybe you can rescue me, >> Avi >> >> >> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From cjfields at illinois.edu Fri Feb 5 22:13:48 2010 From: cjfields at illinois.edu (Chris Fields) Date: Fri, 5 Feb 2010 21:13:48 -0600 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <4B6CDCA0.6060700@gmail.com> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> <4B6CDCA0.6060700@gmail.com> Message-ID: Florent, The Bio::Assembly error is documented in Bugzilla: http://bugzilla.open-bio.org/show_bug.cgi?id=2577 Switching to a Bio::SeqFeature::CollectionI capable of distinguishing features by seq_id (such as Bio::DB::SeqFeature::Store) may be the best way to go. chris On Feb 5, 2010, at 9:06 PM, Florent Angly wrote: > I've run into this issue very recently, albeit working on something unrelated to loading GFF files: generating many small independent assemblies. Maybe there's a bug in the tempdir() code, or its documentation is not clear. I will take a look at it when I have some time. > Florent > > > On 05/02/10 22:27, Mark A. Jensen wrote: >> The tempfiles are not being unlinked before there are too many for the OS to handle. >> Sounds like this may be your job, fungazid, if you're not employing the script (but >> not sure about that) >> ----- Original Message ----- From: "Fungazid" >> To: >> Sent: Thursday, February 04, 2010 9:52 PM >> Subject: [Bioperl-l] Too many links at LoadHelper.pm >> >> >>> Bioperl hi, >>> >>> I have multiple gff3 files that look like: >>> >>> -------------------------------------------------------- >>> ##gff-version 3 >>> # file: temp1.gff3 >>> contig453278 hs contig 1 699 . + . Name=contig453278 >>> contig453278 hs alignment 1 194 . + . Name=hs.ENST00000258455 >>> contig453278 hs alignment 252 699 . + . Name=hs.ENST00000258455 >>> contig453278 hs mi2 194 252 . + . Name=mi2 >>> contig453278 hs transcript 104 183 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >>> contig453278 hs transcript 1 9 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >>> contig453278 hs transcript 73 103 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >>> contig453278 hs transcript 10 72 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >>> --------------------------------------------------------- >>> >>> >>> >>> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: >>> >>> -------------------------------------------- >>> Bio::DB::SeqFeature::Store->new(...) >>> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >>> for($i=0; $i<$#files+1; $i++){ >>> $loader->load($file[$i]); >>> } >>> -------------------------------------------- >>> >>> Eventually after uploading ~15000 small gff3 files I got the error: >>> >>> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory /tmp/9bbpdE5ibf: Too many links at /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 >>> >>> >>> >>> I am trying to see what went wrong, maybe you can rescue me, >>> Avi >>> >>> >>> >>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >>> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From maj at fortinbras.us Fri Feb 5 22:16:54 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Fri, 5 Feb 2010 22:16:54 -0500 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <4B6CDCA0.6060700@gmail.com> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> <4B6CDCA0.6060700@gmail.com> Message-ID: I do tend to use File::Temp directly, rather than rely on Bio::Root::IO, because I more sure of what will happen. Would be excellent if you would review the code! MAJ ----- Original Message ----- From: "Florent Angly" To: Sent: Friday, February 05, 2010 10:06 PM Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm > I've run into this issue very recently, albeit working on something > unrelated to loading GFF files: generating many small independent > assemblies. Maybe there's a bug in the tempdir() code, or its > documentation is not clear. I will take a look at it when I have some time. > Florent > > > On 05/02/10 22:27, Mark A. Jensen wrote: >> The tempfiles are not being unlinked before there are too many for the >> OS to handle. >> Sounds like this may be your job, fungazid, if you're not employing >> the script (but >> not sure about that) >> ----- Original Message ----- From: "Fungazid" >> To: >> Sent: Thursday, February 04, 2010 9:52 PM >> Subject: [Bioperl-l] Too many links at LoadHelper.pm >> >> >>> Bioperl hi, >>> >>> I have multiple gff3 files that look like: >>> >>> -------------------------------------------------------- >>> ##gff-version 3 >>> # file: temp1.gff3 >>> contig453278 hs contig 1 699 . + . >>> Name=contig453278 >>> contig453278 hs alignment 1 194 . + . >>> Name=hs.ENST00000258455 >>> contig453278 hs alignment 252 699 . + . >>> Name=hs.ENST00000258455 >>> contig453278 hs mi2 194 252 . + . Name=mi2 >>> contig453278 hs transcript 104 183 . + . >>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >>> contig453278 hs transcript 1 9 . + . >>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >>> contig453278 hs transcript 73 103 . + . >>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >>> contig453278 hs transcript 10 72 . + . >>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >>> --------------------------------------------------------- >>> >>> >>> >>> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: >>> >>> -------------------------------------------- >>> Bio::DB::SeqFeature::Store->new(...) >>> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >>> for($i=0; $i<$#files+1; $i++){ >>> $loader->load($file[$i]); >>> } >>> -------------------------------------------- >>> >>> Eventually after uploading ~15000 small gff3 files I got the error: >>> >>> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory >>> /tmp/9bbpdE5ibf: Too many links at >>> /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm >>> line 49 >>> >>> >>> >>> I am trying to see what went wrong, maybe you can rescue me, >>> Avi >>> >>> >>> >>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >>> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From florent.angly at gmail.com Fri Feb 5 22:24:40 2010 From: florent.angly at gmail.com (Florent Angly) Date: Sat, 06 Feb 2010 13:24:40 +1000 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> <4B6CDCA0.6060700@gmail.com> Message-ID: <4B6CE0F8.2090707@gmail.com> It can't be, Chris, because I use the workaround mentioned here: http://bugzilla.open-bio.org/show_bug.cgi?id=2577#c5 It seems like temporary files keep accumulating assembly after assembly. Looking at the lsof command, I get a lot of files described as "(deleted)". I am thinking that maybe some filehandle is not closed or something. I'll investigate... Florent On 06/02/10 13:13, Chris Fields wrote: > Florent, > > The Bio::Assembly error is documented in Bugzilla: > > http://bugzilla.open-bio.org/show_bug.cgi?id=2577 > > Switching to a Bio::SeqFeature::CollectionI capable of distinguishing features by seq_id (such as Bio::DB::SeqFeature::Store) may be the best way to go. > > chris > > On Feb 5, 2010, at 9:06 PM, Florent Angly wrote: > > >> I've run into this issue very recently, albeit working on something unrelated to loading GFF files: generating many small independent assemblies. Maybe there's a bug in the tempdir() code, or its documentation is not clear. I will take a look at it when I have some time. >> Florent >> >> >> On 05/02/10 22:27, Mark A. Jensen wrote: >> >>> The tempfiles are not being unlinked before there are too many for the OS to handle. >>> Sounds like this may be your job, fungazid, if you're not employing the script (but >>> not sure about that) >>> ----- Original Message ----- From: "Fungazid" >>> To: >>> Sent: Thursday, February 04, 2010 9:52 PM >>> Subject: [Bioperl-l] Too many links at LoadHelper.pm >>> >>> >>> >>>> Bioperl hi, >>>> >>>> I have multiple gff3 files that look like: >>>> >>>> -------------------------------------------------------- >>>> ##gff-version 3 >>>> # file: temp1.gff3 >>>> contig453278 hs contig 1 699 . + . Name=contig453278 >>>> contig453278 hs alignment 1 194 . + . Name=hs.ENST00000258455 >>>> contig453278 hs alignment 252 699 . + . Name=hs.ENST00000258455 >>>> contig453278 hs mi2 194 252 . + . Name=mi2 >>>> contig453278 hs transcript 104 183 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >>>> contig453278 hs transcript 1 9 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >>>> contig453278 hs transcript 73 103 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >>>> contig453278 hs transcript 10 72 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >>>> --------------------------------------------------------- >>>> >>>> >>>> >>>> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: >>>> >>>> -------------------------------------------- >>>> Bio::DB::SeqFeature::Store->new(...) >>>> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >>>> for($i=0; $i<$#files+1; $i++){ >>>> $loader->load($file[$i]); >>>> } >>>> -------------------------------------------- >>>> >>>> Eventually after uploading ~15000 small gff3 files I got the error: >>>> >>>> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory /tmp/9bbpdE5ibf: Too many links at /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 >>>> >>>> >>>> >>>> I am trying to see what went wrong, maybe you can rescue me, >>>> Avi >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>> >>>> >>>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l >> > From maj at fortinbras.us Fri Feb 5 23:06:16 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Fri, 5 Feb 2010 23:06:16 -0500 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <4B6CE0F8.2090707@gmail.com> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com><4B6CDCA0.6060700@gmail.com> <4B6CE0F8.2090707@gmail.com> Message-ID: That's a good thought; when writing a Bioperl tempfile, I find I must ($fh, $fn) = tempfile(); $fh->close; $seqio = Bio::SeqIO( -file => ">$fn", -format=>'fasta'); since the better way (always using the filehandle) ($fh, $fn) = tempfile; $seqio = Bio::SeqIO( -fh => $fh, -format=>'fasta'); doesn't seem to work, though $fh should be open for both writing and reading. If people do the former kludge without closing the handle, File::Temp can't auto-unlink, but it won't cluck about it either. If I missing something about writing to temp filehandles here, please set me straight- cheers MAJ ----- Original Message ----- From: "Florent Angly" To: "Chris Fields" Cc: Sent: Friday, February 05, 2010 10:24 PM Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm > It can't be, Chris, because I use the workaround mentioned here: > http://bugzilla.open-bio.org/show_bug.cgi?id=2577#c5 > > It seems like temporary files keep accumulating assembly after assembly. > Looking at the lsof command, I get a lot of files described as "(deleted)". I > am thinking that maybe some filehandle is not closed or something. > > I'll investigate... > > Florent > > On 06/02/10 13:13, Chris Fields wrote: >> Florent, >> >> The Bio::Assembly error is documented in Bugzilla: >> >> http://bugzilla.open-bio.org/show_bug.cgi?id=2577 >> >> Switching to a Bio::SeqFeature::CollectionI capable of distinguishing >> features by seq_id (such as Bio::DB::SeqFeature::Store) may be the best way >> to go. >> >> chris >> >> On Feb 5, 2010, at 9:06 PM, Florent Angly wrote: >> >> >>> I've run into this issue very recently, albeit working on something >>> unrelated to loading GFF files: generating many small independent >>> assemblies. Maybe there's a bug in the tempdir() code, or its documentation >>> is not clear. I will take a look at it when I have some time. >>> Florent >>> >>> >>> On 05/02/10 22:27, Mark A. Jensen wrote: >>> >>>> The tempfiles are not being unlinked before there are too many for the OS >>>> to handle. >>>> Sounds like this may be your job, fungazid, if you're not employing the >>>> script (but >>>> not sure about that) >>>> ----- Original Message ----- From: "Fungazid" >>>> To: >>>> Sent: Thursday, February 04, 2010 9:52 PM >>>> Subject: [Bioperl-l] Too many links at LoadHelper.pm >>>> >>>> >>>> >>>>> Bioperl hi, >>>>> >>>>> I have multiple gff3 files that look like: >>>>> >>>>> -------------------------------------------------------- >>>>> ##gff-version 3 >>>>> # file: temp1.gff3 >>>>> contig453278 hs contig 1 699 . + . >>>>> Name=contig453278 >>>>> contig453278 hs alignment 1 194 . + . >>>>> Name=hs.ENST00000258455 >>>>> contig453278 hs alignment 252 699 . + . >>>>> Name=hs.ENST00000258455 >>>>> contig453278 hs mi2 194 252 . + . Name=mi2 >>>>> contig453278 hs transcript 104 183 . + . >>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >>>>> contig453278 hs transcript 1 9 . + . >>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >>>>> contig453278 hs transcript 73 103 . + . >>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >>>>> contig453278 hs transcript 10 72 . + . >>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >>>>> --------------------------------------------------------- >>>>> >>>>> >>>>> >>>>> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: >>>>> >>>>> -------------------------------------------- >>>>> Bio::DB::SeqFeature::Store->new(...) >>>>> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >>>>> for($i=0; $i<$#files+1; $i++){ >>>>> $loader->load($file[$i]); >>>>> } >>>>> -------------------------------------------- >>>>> >>>>> Eventually after uploading ~15000 small gff3 files I got the error: >>>>> >>>>> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory >>>>> /tmp/9bbpdE5ibf: Too many links at >>>>> /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 >>>>> >>>>> >>>>> >>>>> I am trying to see what went wrong, maybe you can rescue me, >>>>> Avi >>>>> >>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> Bioperl-l mailing list >>>>> Bioperl-l at lists.open-bio.org >>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>>> >>>>> >>>>> >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From cjfields at illinois.edu Fri Feb 5 23:33:20 2010 From: cjfields at illinois.edu (Chris Fields) Date: Fri, 5 Feb 2010 22:33:20 -0600 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: References: <34328.58710.qm@web65506.mail.ac4.yahoo.com><4B6CDCA0.6060700@gmail.com> <4B6CE0F8.2090707@gmail.com> Message-ID: <6A63C057-9319-47AA-A972-0720AED70270@illinois.edu> On Feb 5, 2010, at 10:06 PM, Mark A. Jensen wrote: > That's a good thought; when writing a Bioperl > tempfile, I find I must > > ($fh, $fn) = tempfile(); > $fh->close; > $seqio = Bio::SeqIO( -file => ">$fn", -format=>'fasta'); > > since the better way (always using the filehandle) > > ($fh, $fn) = tempfile; > $seqio = Bio::SeqIO( -fh => $fh, -format=>'fasta'); > > doesn't seem to work, though $fh should be open > for both writing and reading. If people do the former > kludge without closing the handle, File::Temp can't > auto-unlink, but it won't cluck about it either. > > If I missing something about writing to temp filehandles > here, please set me straight- > cheers MAJ No, I think you're on the right track. There isn't any reason why using a temp filehandle shouldn't work with the various *IO that I can think of, so I would consider that a bug. Does the same thing occur when using Bio::Root::IO::tempfile() (which just wraps File::Temp)? chris From maj at fortinbras.us Fri Feb 5 23:40:25 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Fri, 5 Feb 2010 23:40:25 -0500 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <6A63C057-9319-47AA-A972-0720AED70270@illinois.edu> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com><4B6CDCA0.6060700@gmail.com> <4B6CE0F8.2090707@gmail.com> <6A63C057-9319-47AA-A972-0720AED70270@illinois.edu> Message-ID: <22F7FF69165C4175AAB639DE9331B668@NewLife> sth to check-- ----- Original Message ----- From: "Chris Fields" To: "Mark A. Jensen" Cc: "Florent Angly" ; Sent: Friday, February 05, 2010 11:33 PM Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm On Feb 5, 2010, at 10:06 PM, Mark A. Jensen wrote: > That's a good thought; when writing a Bioperl ... Does the same thing occur when using Bio::Root::IO::tempfile() (which just wraps File::Temp)? chris From cjfields at illinois.edu Fri Feb 5 23:41:57 2010 From: cjfields at illinois.edu (Chris Fields) Date: Fri, 5 Feb 2010 22:41:57 -0600 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <4B6CE0F8.2090707@gmail.com> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> <4B6CDCA0.6060700@gmail.com> <4B6CE0F8.2090707@gmail.com> Message-ID: <8D50C6AA-2BD5-43AD-8798-929A0B656CC9@illinois.edu> The only place I can think of tempfiles being generated offhand are Bio::SeqFeature::Collection (actually, tied handles), but as you mention this may not be related. There are calls to tempfile from Bio::Assembly::IO::sam/bowtie (one using Root::IO, one using File::Temp directly). chris On Feb 5, 2010, at 9:24 PM, Florent Angly wrote: > It can't be, Chris, because I use the workaround mentioned here: http://bugzilla.open-bio.org/show_bug.cgi?id=2577#c5 > > It seems like temporary files keep accumulating assembly after assembly. Looking at the lsof command, I get a lot of files described as "(deleted)". I am thinking that maybe some filehandle is not closed or something. > > I'll investigate... > > Florent > > On 06/02/10 13:13, Chris Fields wrote: >> Florent, >> >> The Bio::Assembly error is documented in Bugzilla: >> >> http://bugzilla.open-bio.org/show_bug.cgi?id=2577 >> >> Switching to a Bio::SeqFeature::CollectionI capable of distinguishing features by seq_id (such as Bio::DB::SeqFeature::Store) may be the best way to go. >> >> chris >> >> On Feb 5, 2010, at 9:06 PM, Florent Angly wrote: >> >> >>> I've run into this issue very recently, albeit working on something unrelated to loading GFF files: generating many small independent assemblies. Maybe there's a bug in the tempdir() code, or its documentation is not clear. I will take a look at it when I have some time. >>> Florent >>> >>> >>> On 05/02/10 22:27, Mark A. Jensen wrote: >>> >>>> The tempfiles are not being unlinked before there are too many for the OS to handle. >>>> Sounds like this may be your job, fungazid, if you're not employing the script (but >>>> not sure about that) >>>> ----- Original Message ----- From: "Fungazid" >>>> To: >>>> Sent: Thursday, February 04, 2010 9:52 PM >>>> Subject: [Bioperl-l] Too many links at LoadHelper.pm >>>> >>>> >>>> >>>>> Bioperl hi, >>>>> >>>>> I have multiple gff3 files that look like: >>>>> >>>>> -------------------------------------------------------- >>>>> ##gff-version 3 >>>>> # file: temp1.gff3 >>>>> contig453278 hs contig 1 699 . + . Name=contig453278 >>>>> contig453278 hs alignment 1 194 . + . Name=hs.ENST00000258455 >>>>> contig453278 hs alignment 252 699 . + . Name=hs.ENST00000258455 >>>>> contig453278 hs mi2 194 252 . + . Name=mi2 >>>>> contig453278 hs transcript 104 183 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >>>>> contig453278 hs transcript 1 9 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >>>>> contig453278 hs transcript 73 103 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >>>>> contig453278 hs transcript 10 72 . + . Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >>>>> --------------------------------------------------------- >>>>> >>>>> >>>>> >>>>> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: >>>>> >>>>> -------------------------------------------- >>>>> Bio::DB::SeqFeature::Store->new(...) >>>>> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >>>>> for($i=0; $i<$#files+1; $i++){ >>>>> $loader->load($file[$i]); >>>>> } >>>>> -------------------------------------------- >>>>> >>>>> Eventually after uploading ~15000 small gff3 files I got the error: >>>>> >>>>> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory /tmp/9bbpdE5ibf: Too many links at /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 >>>>> >>>>> >>>>> >>>>> I am trying to see what went wrong, maybe you can rescue me, >>>>> Avi >>>>> >>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> Bioperl-l mailing list >>>>> Bioperl-l at lists.open-bio.org >>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>>> >>>>> >>>>> >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >> > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From maj at fortinbras.us Fri Feb 5 23:49:34 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Fri, 5 Feb 2010 23:49:34 -0500 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <8D50C6AA-2BD5-43AD-8798-929A0B656CC9@illinois.edu> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com><4B6CDCA0.6060700@gmail.com><4B6CE0F8.2090707@gmail.com> <8D50C6AA-2BD5-43AD-8798-929A0B656CC9@illinois.edu> Message-ID: <68FE1652EDB34C3097DDAB00C2F6D6CA@NewLife> Dan has been partial to temp in Root::IO (bowtie); I scurried off to File::Temp (sam)-- any issues with tempfile handles, Dan? ----- Original Message ----- From: "Chris Fields" To: "Florent Angly" Cc: Sent: Friday, February 05, 2010 11:41 PM Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm > The only place I can think of tempfiles being generated offhand are > Bio::SeqFeature::Collection (actually, tied handles), but as you mention this > may not be related. There are calls to tempfile from > Bio::Assembly::IO::sam/bowtie (one using Root::IO, one using File::Temp > directly). > > chris > > On Feb 5, 2010, at 9:24 PM, Florent Angly wrote: > >> It can't be, Chris, because I use the workaround mentioned here: >> http://bugzilla.open-bio.org/show_bug.cgi?id=2577#c5 >> >> It seems like temporary files keep accumulating assembly after assembly. >> Looking at the lsof command, I get a lot of files described as "(deleted)". I >> am thinking that maybe some filehandle is not closed or something. >> >> I'll investigate... >> >> Florent >> >> On 06/02/10 13:13, Chris Fields wrote: >>> Florent, >>> >>> The Bio::Assembly error is documented in Bugzilla: >>> >>> http://bugzilla.open-bio.org/show_bug.cgi?id=2577 >>> >>> Switching to a Bio::SeqFeature::CollectionI capable of distinguishing >>> features by seq_id (such as Bio::DB::SeqFeature::Store) may be the best way >>> to go. >>> >>> chris >>> >>> On Feb 5, 2010, at 9:06 PM, Florent Angly wrote: >>> >>> >>>> I've run into this issue very recently, albeit working on something >>>> unrelated to loading GFF files: generating many small independent >>>> assemblies. Maybe there's a bug in the tempdir() code, or its documentation >>>> is not clear. I will take a look at it when I have some time. >>>> Florent >>>> >>>> >>>> On 05/02/10 22:27, Mark A. Jensen wrote: >>>> >>>>> The tempfiles are not being unlinked before there are too many for the OS >>>>> to handle. >>>>> Sounds like this may be your job, fungazid, if you're not employing the >>>>> script (but >>>>> not sure about that) >>>>> ----- Original Message ----- From: "Fungazid" >>>>> To: >>>>> Sent: Thursday, February 04, 2010 9:52 PM >>>>> Subject: [Bioperl-l] Too many links at LoadHelper.pm >>>>> >>>>> >>>>> >>>>>> Bioperl hi, >>>>>> >>>>>> I have multiple gff3 files that look like: >>>>>> >>>>>> -------------------------------------------------------- >>>>>> ##gff-version 3 >>>>>> # file: temp1.gff3 >>>>>> contig453278 hs contig 1 699 . + . >>>>>> Name=contig453278 >>>>>> contig453278 hs alignment 1 194 . + . >>>>>> Name=hs.ENST00000258455 >>>>>> contig453278 hs alignment 252 699 . + . >>>>>> Name=hs.ENST00000258455 >>>>>> contig453278 hs mi2 194 252 . + . Name=mi2 >>>>>> contig453278 hs transcript 104 183 . + . >>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >>>>>> contig453278 hs transcript 1 9 . + . >>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >>>>>> contig453278 hs transcript 73 103 . + . >>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >>>>>> contig453278 hs transcript 10 72 . + . >>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >>>>>> --------------------------------------------------------- >>>>>> >>>>>> >>>>>> >>>>>> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: >>>>>> >>>>>> -------------------------------------------- >>>>>> Bio::DB::SeqFeature::Store->new(...) >>>>>> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >>>>>> for($i=0; $i<$#files+1; $i++){ >>>>>> $loader->load($file[$i]); >>>>>> } >>>>>> -------------------------------------------- >>>>>> >>>>>> Eventually after uploading ~15000 small gff3 files I got the error: >>>>>> >>>>>> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory >>>>>> /tmp/9bbpdE5ibf: Too many links at >>>>>> /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 >>>>>> >>>>>> >>>>>> >>>>>> I am trying to see what went wrong, maybe you can rescue me, >>>>>> Avi >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> _______________________________________________ >>>>>> Bioperl-l mailing list >>>>>> Bioperl-l at lists.open-bio.org >>>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>>>> >>>>>> >>>>>> >>>>> _______________________________________________ >>>>> Bioperl-l mailing list >>>>> Bioperl-l at lists.open-bio.org >>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>>> >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>> >>> >> >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From dan.kortschak at adelaide.edu.au Sat Feb 6 00:11:23 2010 From: dan.kortschak at adelaide.edu.au (Dan Kortschak) Date: Sat, 06 Feb 2010 15:41:23 +1030 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <68FE1652EDB34C3097DDAB00C2F6D6CA@NewLife> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> <4B6CDCA0.6060700@gmail.com> <4B6CE0F8.2090707@gmail.com> <8D50C6AA-2BD5-43AD-8798-929A0B656CC9@illinois.edu> <68FE1652EDB34C3097DDAB00C2F6D6CA@NewLife> Message-ID: <1265433083.4676.7.camel@epistle> When I was working through those issues (I was having problems with Bio::DB::Sam causing tempfiles to be released before they were finished with) I paid quite close attention to having a clean /tmp after running tests, so I pretty confident that my modules are working with Bio::Root::IO properly and that Bio::Root::IO is working properly with them. Sorry I can't be more helpful. Dan On Fri, 2010-02-05 at 23:49 -0500, Mark A. Jensen wrote: > Dan has been partial to temp in Root::IO (bowtie); I scurried off to File::Temp > (sam)-- > any issues with tempfile handles, Dan? > ----- Original Message ----- > From: "Chris Fields" > To: "Florent Angly" > Cc: > Sent: Friday, February 05, 2010 11:41 PM > Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm > > > > The only place I can think of tempfiles being generated offhand are > > Bio::SeqFeature::Collection (actually, tied handles), but as you mention this > > may not be related. There are calls to tempfile from > > Bio::Assembly::IO::sam/bowtie (one using Root::IO, one using File::Temp > > directly). > > > > chris > > > > On Feb 5, 2010, at 9:24 PM, Florent Angly wrote: > > > >> It can't be, Chris, because I use the workaround mentioned here: > >> http://bugzilla.open-bio.org/show_bug.cgi?id=2577#c5 > >> > >> It seems like temporary files keep accumulating assembly after assembly. > >> Looking at the lsof command, I get a lot of files described as "(deleted)". I > >> am thinking that maybe some filehandle is not closed or something. > >> > >> I'll investigate... > >> > >> Florent > >> > >> On 06/02/10 13:13, Chris Fields wrote: > >>> Florent, > >>> > >>> The Bio::Assembly error is documented in Bugzilla: > >>> > >>> http://bugzilla.open-bio.org/show_bug.cgi?id=2577 > >>> > >>> Switching to a Bio::SeqFeature::CollectionI capable of distinguishing > >>> features by seq_id (such as Bio::DB::SeqFeature::Store) may be the best way > >>> to go. > >>> > >>> chris > >>> > >>> On Feb 5, 2010, at 9:06 PM, Florent Angly wrote: > >>> > >>> > >>>> I've run into this issue very recently, albeit working on something > >>>> unrelated to loading GFF files: generating many small independent > >>>> assemblies. Maybe there's a bug in the tempdir() code, or its documentation > >>>> is not clear. I will take a look at it when I have some time. > >>>> Florent > >>>> > >>>> > >>>> On 05/02/10 22:27, Mark A. Jensen wrote: > >>>> > >>>>> The tempfiles are not being unlinked before there are too many for the OS > >>>>> to handle. > >>>>> Sounds like this may be your job, fungazid, if you're not employing the > >>>>> script (but > >>>>> not sure about that) > >>>>> ----- Original Message ----- From: "Fungazid" > >>>>> To: > >>>>> Sent: Thursday, February 04, 2010 9:52 PM > >>>>> Subject: [Bioperl-l] Too many links at LoadHelper.pm > >>>>> > >>>>> > >>>>> > >>>>>> Bioperl hi, > >>>>>> > >>>>>> I have multiple gff3 files that look like: > >>>>>> > >>>>>> -------------------------------------------------------- > >>>>>> ##gff-version 3 > >>>>>> # file: temp1.gff3 > >>>>>> contig453278 hs contig 1 699 . + . > >>>>>> Name=contig453278 > >>>>>> contig453278 hs alignment 1 194 . + . > >>>>>> Name=hs.ENST00000258455 > >>>>>> contig453278 hs alignment 252 699 . + . > >>>>>> Name=hs.ENST00000258455 > >>>>>> contig453278 hs mi2 194 252 . + . Name=mi2 > >>>>>> contig453278 hs transcript 104 183 . + . > >>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 > >>>>>> contig453278 hs transcript 1 9 . + . > >>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 > >>>>>> contig453278 hs transcript 73 103 . + . > >>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 > >>>>>> contig453278 hs transcript 10 72 . + . > >>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 > >>>>>> --------------------------------------------------------- > >>>>>> > >>>>>> > >>>>>> > >>>>>> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: > >>>>>> > >>>>>> -------------------------------------------- > >>>>>> Bio::DB::SeqFeature::Store->new(...) > >>>>>> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) > >>>>>> for($i=0; $i<$#files+1; $i++){ > >>>>>> $loader->load($file[$i]); > >>>>>> } > >>>>>> -------------------------------------------- > >>>>>> > >>>>>> Eventually after uploading ~15000 small gff3 files I got the error: > >>>>>> > >>>>>> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory > >>>>>> /tmp/9bbpdE5ibf: Too many links at > >>>>>> /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 > >>>>>> > >>>>>> > >>>>>> > >>>>>> I am trying to see what went wrong, maybe you can rescue me, > >>>>>> Avi > >>>>>> > >>>>>> > >>>>>> > >>>>>> > >>>>>> _______________________________________________ > >>>>>> Bioperl-l mailing list > >>>>>> Bioperl-l at lists.open-bio.org > >>>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l > >>>>>> > >>>>>> > >>>>>> > >>>>> _______________________________________________ > >>>>> Bioperl-l mailing list > >>>>> Bioperl-l at lists.open-bio.org > >>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l > >>>>> > >>>> _______________________________________________ > >>>> Bioperl-l mailing list > >>>> Bioperl-l at lists.open-bio.org > >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l > >>>> > >>> > >> > >> _______________________________________________ > >> Bioperl-l mailing list > >> Bioperl-l at lists.open-bio.org > >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > > > From pisceanpranav at gmail.com Sat Feb 6 01:17:49 2010 From: pisceanpranav at gmail.com (pranav karkhanis) Date: Sat, 6 Feb 2010 11:47:49 +0530 Subject: [Bioperl-l] Remote Blast Message-ID: <3122b9be1002052217t6749866cw42cb22935be08a44@mail.gmail.com> Where can I find the examples or source code for running BLAST with a input SEQUENCE using RemoteBlast.pm? From ross at cuhk.edu.hk Sat Feb 6 06:09:27 2010 From: ross at cuhk.edu.hk (Ross KK Leung) Date: Sat, 6 Feb 2010 19:09:27 +0800 Subject: [Bioperl-l] get a full taxonomy path from Bio::Taxon Message-ID: <000001caa71c$d9308900$8b919b00$@edu.hk> In 2003, there's an old post asking about a way to get a full taxonomy. http://osdir.com/ml/lang.perl.bio.general/2003-08/msg00206.html Then Bio::Taxonomy is deprecated and now combination with Bio::Tree has to use in order to work out the story. In http://search.cpan.org/dist/bioperl/Bio/Taxon.pm it also gives an example by explicitly stating what to retrieve. However, what I need is to just provide a GI or accession number and to get a full taxonomy and I find there is no simple way (at least I don't know .) to do so. Would anybody advise? Thanks a lot, Ross From maj at fortinbras.us Sat Feb 6 08:52:31 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Sat, 6 Feb 2010 08:52:31 -0500 Subject: [Bioperl-l] get a full taxonomy path from Bio::Taxon In-Reply-To: <000001caa71c$d9308900$8b919b00$@edu.hk> References: <000001caa71c$d9308900$8b919b00$@edu.hk> Message-ID: Try this, Ross: use Bio::DB::SoapEUtilities; my $fac = Bio::DB::SoapEUtilities->new(); my $links = $fac->elink( -db => 'taxonomy', -db_from => protein, -id => $accn ) ->run( -auto_adapt => 1 ); my $spcio = $fac->efetch( -db => 'taxonomy', -id => [ $links->id_map( $accn ) ]) ->run( -auto_adapt => 1); my $species = $spcio->next_species; my @fullpath = $species->classification; Bio::DB::SoapEUtilities is in the bioperl-run trunk. MAJ ----- Original Message ----- From: "Ross KK Leung" To: Sent: Saturday, February 06, 2010 6:09 AM Subject: [Bioperl-l] get a full taxonomy path from Bio::Taxon > In 2003, there's an old post asking about a way to get a full taxonomy. > > > > http://osdir.com/ml/lang.perl.bio.general/2003-08/msg00206.html > > > > Then Bio::Taxonomy is deprecated and now combination with Bio::Tree has to > use in order to work out the story. > > > > In > > > > http://search.cpan.org/dist/bioperl/Bio/Taxon.pm > > > > it also gives an example by explicitly stating what to retrieve. However, > what I need is to just provide a GI or accession number and to get a full > taxonomy and I find there is no simple way (at least I don't know .) to do > so. Would anybody advise? > > > > Thanks a lot, > > Ross > > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > From florent.angly at gmail.com Sat Feb 6 09:27:53 2010 From: florent.angly at gmail.com (Florent Angly) Date: Sun, 07 Feb 2010 00:27:53 +1000 Subject: [Bioperl-l] Too many links at LoadHelper.pm In-Reply-To: <1265433083.4676.7.camel@epistle> References: <34328.58710.qm@web65506.mail.ac4.yahoo.com> <4B6CDCA0.6060700@gmail.com> <4B6CE0F8.2090707@gmail.com> <8D50C6AA-2BD5-43AD-8798-929A0B656CC9@illinois.edu> <68FE1652EDB34C3097DDAB00C2F6D6CA@NewLife> <1265433083.4676.7.camel@epistle> Message-ID: <4B6D7C69.2050704@gmail.com> Ok, so I looked at my code, and it's a totally unrelated issue that caused files/filehandles to not be removed/closed properly. The BioPerl tempdir() system seems to be doing fine :) Florent On 06/02/10 15:11, Dan Kortschak wrote: > When I was working through those issues (I was having problems with > Bio::DB::Sam causing tempfiles to be released before they were finished > with) I paid quite close attention to having a clean /tmp after running > tests, so I pretty confident that my modules are working with > Bio::Root::IO properly and that Bio::Root::IO is working properly with > them. > > Sorry I can't be more helpful. > Dan > > On Fri, 2010-02-05 at 23:49 -0500, Mark A. Jensen wrote: > >> Dan has been partial to temp in Root::IO (bowtie); I scurried off to File::Temp >> (sam)-- >> any issues with tempfile handles, Dan? >> ----- Original Message ----- >> From: "Chris Fields" >> To: "Florent Angly" >> Cc: >> Sent: Friday, February 05, 2010 11:41 PM >> Subject: Re: [Bioperl-l] Too many links at LoadHelper.pm >> >> >> >>> The only place I can think of tempfiles being generated offhand are >>> Bio::SeqFeature::Collection (actually, tied handles), but as you mention this >>> may not be related. There are calls to tempfile from >>> Bio::Assembly::IO::sam/bowtie (one using Root::IO, one using File::Temp >>> directly). >>> >>> chris >>> >>> On Feb 5, 2010, at 9:24 PM, Florent Angly wrote: >>> >>> >>>> It can't be, Chris, because I use the workaround mentioned here: >>>> http://bugzilla.open-bio.org/show_bug.cgi?id=2577#c5 >>>> >>>> It seems like temporary files keep accumulating assembly after assembly. >>>> Looking at the lsof command, I get a lot of files described as "(deleted)". I >>>> am thinking that maybe some filehandle is not closed or something. >>>> >>>> I'll investigate... >>>> >>>> Florent >>>> >>>> On 06/02/10 13:13, Chris Fields wrote: >>>> >>>>> Florent, >>>>> >>>>> The Bio::Assembly error is documented in Bugzilla: >>>>> >>>>> http://bugzilla.open-bio.org/show_bug.cgi?id=2577 >>>>> >>>>> Switching to a Bio::SeqFeature::CollectionI capable of distinguishing >>>>> features by seq_id (such as Bio::DB::SeqFeature::Store) may be the best way >>>>> to go. >>>>> >>>>> chris >>>>> >>>>> On Feb 5, 2010, at 9:06 PM, Florent Angly wrote: >>>>> >>>>> >>>>> >>>>>> I've run into this issue very recently, albeit working on something >>>>>> unrelated to loading GFF files: generating many small independent >>>>>> assemblies. Maybe there's a bug in the tempdir() code, or its documentation >>>>>> is not clear. I will take a look at it when I have some time. >>>>>> Florent >>>>>> >>>>>> >>>>>> On 05/02/10 22:27, Mark A. Jensen wrote: >>>>>> >>>>>> >>>>>>> The tempfiles are not being unlinked before there are too many for the OS >>>>>>> to handle. >>>>>>> Sounds like this may be your job, fungazid, if you're not employing the >>>>>>> script (but >>>>>>> not sure about that) >>>>>>> ----- Original Message ----- From: "Fungazid" >>>>>>> To: >>>>>>> Sent: Thursday, February 04, 2010 9:52 PM >>>>>>> Subject: [Bioperl-l] Too many links at LoadHelper.pm >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>>> Bioperl hi, >>>>>>>> >>>>>>>> I have multiple gff3 files that look like: >>>>>>>> >>>>>>>> -------------------------------------------------------- >>>>>>>> ##gff-version 3 >>>>>>>> # file: temp1.gff3 >>>>>>>> contig453278 hs contig 1 699 . + . >>>>>>>> Name=contig453278 >>>>>>>> contig453278 hs alignment 1 194 . + . >>>>>>>> Name=hs.ENST00000258455 >>>>>>>> contig453278 hs alignment 252 699 . + . >>>>>>>> Name=hs.ENST00000258455 >>>>>>>> contig453278 hs mi2 194 252 . + . Name=mi2 >>>>>>>> contig453278 hs transcript 104 183 . + . >>>>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,4 >>>>>>>> contig453278 hs transcript 1 9 . + . >>>>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,1 >>>>>>>> contig453278 hs transcript 73 103 . + . >>>>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,3 >>>>>>>> contig453278 hs transcript 10 72 . + . >>>>>>>> Name=hs.ENST00000258455;ID=transcript-hs.ENST00000258455;Note=CD,2 >>>>>>>> --------------------------------------------------------- >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> I uploaded gff3 files to mysql and view the tracks in gBrowse like this: >>>>>>>> >>>>>>>> -------------------------------------------- >>>>>>>> Bio::DB::SeqFeature::Store->new(...) >>>>>>>> $loader=Bio::DB::SeqFeature::Store::GFF3Loader->new(...) >>>>>>>> for($i=0; $i<$#files+1; $i++){ >>>>>>>> $loader->load($file[$i]); >>>>>>>> } >>>>>>>> -------------------------------------------- >>>>>>>> >>>>>>>> Eventually after uploading ~15000 small gff3 files I got the error: >>>>>>>> >>>>>>>> Error in tempdir() using /tmp/XXXXXXXXXX: Could not create directory >>>>>>>> /tmp/9bbpdE5ibf: Too many links at >>>>>>>> /usr/local/share/perl/5.8.8/Bio/DB/SeqFeature/Store/LoadHelper.pm line 49 >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> I am trying to see what went wrong, maybe you can rescue me, >>>>>>>> Avi >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> Bioperl-l mailing list >>>>>>>> Bioperl-l at lists.open-bio.org >>>>>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>> _______________________________________________ >>>>>>> Bioperl-l mailing list >>>>>>> Bioperl-l at lists.open-bio.org >>>>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>>>>> >>>>>>> >>>>>> _______________________________________________ >>>>>> Bioperl-l mailing list >>>>>> Bioperl-l at lists.open-bio.org >>>>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>>>> >>>>>> >>>>> >>>> _______________________________________________ >>>> Bioperl-l mailing list >>>> Bioperl-l at lists.open-bio.org >>>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>>> >>> >>> _______________________________________________ >>> Bioperl-l mailing list >>> Bioperl-l at lists.open-bio.org >>> http://lists.open-bio.org/mailman/listinfo/bioperl-l >>> >>> >>> > > From fungazid at yahoo.com Sat Feb 6 18:36:57 2010 From: fungazid at yahoo.com (Fungazid) Date: Sat, 6 Feb 2010 15:36:57 -0800 (PST) Subject: [Bioperl-l] Bio::seqFeature functions coresponding to specific gBrowse variables Message-ID: <963778.13913.qm@web65505.mail.ac4.yahoo.com> Hello everybody, thanks for you help, here is another small question, and I would be grateful for your help: In the gBrowse .conf file of gBrowse I have the following lines ------------------------------------------------------------------- balloon hover = =

Gene

balloon click = gene regions
--------------------------------------------------------------------- As you can see above, gBrowse recognizes the properties $name,$ref, $start, $end, $id Now, lets say I wrote this: --------------------------- balloon click = sub { my $feature = shift; $name = $feature->?????(); $ref = $feature->?????(); ... } --------------------------- what are the seqFeature parallel functions/variables ? Avi From Russell.Smithies at agresearch.co.nz Sun Feb 7 15:47:00 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Mon, 8 Feb 2010 09:47:00 +1300 Subject: [Bioperl-l] Refseq Version In-Reply-To: <465b5a661002050655u34ff94acwdf8fe72512dec4b3@mail.gmail.com> References: <465b5a661002041401u3b8ac5fdp483f7032ce35139b@mail.gmail.com> <18DF7D20DFEC044098A1062202F5FFF32C61BED520@exchsth.agresearch.co.nz> <465b5a661002050655u34ff94acwdf8fe72512dec4b3@mail.gmail.com> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61BED76C@exchsth.agresearch.co.nz> Release 39 was Jan 30 and according to the README releases only come out in odd months (January, March, May, July, September, November) The stats file is here: ftp://ftp.ncbi.nih.gov/refseq/release/release-statistics/RefSeq-release39.01232010.stats.txt The numbers of sequences between the fasta release and the pre-build blast databases seem to differ but I guess only NCBI can explain that. I can't see any way of extracting the release number from the pre-build blast databases (apart from the build date) but it might be worth asking NCBI if they'd include the information in future releases. FYI, here's the old release stats. (I wget'ed and grep'ed all the stats files) Release Date Year Organisms Nucleotide Bases Amino Acids Records 1 Jun-30 2003 4,672,871,949 263,588,685 1,061,675 2 Oct-21 2003 2,124 7,745,398,573 286,957,682 1,097,404 3 Jan-13 2004 2,218 7,992,741,222 294,647,847 1,101,244 4 Mar-24 2004 2,358 8,175,128,887 318,253,841 1,193,457 5 May-02 2004 2,395 8,325,515,623 337,229,387 1,255,613 6 Jul-05 2004 2,467 8,696,371,716 365,446,682 1,367,206 7 Sep-12 2004 2,558 21,072,808,460 405,233,619 1,579,579 8 Oct-31 2004 2,645 26,814,386,658 430,300,369 1,709,723 9 Jan-09 2005 2,780 36,786,975,473 470,534,907 1,843,944 10 Mar-06 2005 2,827 36,893,741,150 482,862,858 1,893,478 11 May-08 2005 2,928 39,731,702,362 507,980,644 2,477,893 12 Jul-10 2005 2,969 43,043,256,058 608,493,108 2,869,675 13 Sep-11 2005 3,060 44,727,484,853 686,768,902 3,400,773 14 Nov-20 2005 3,198 47,364,955,367 763,761,075 3,272,776 15 Jan-01 2006 3,244 52,645,441,913 810,009,733 3,436,263 16 Mar-11 2006 3,397 56,175,443,059 887,509,001 3,715,260 17 May-01 2006 3,497 62,130,037,371 927,587,669 3,999,859 18 Jul-11 2006 3,695 70,474,041,999 974,374,765 4,186,692 19 Sep-10 2006 3,774 70,694,879,544 1,012,985,077 4,311,543 20 Nov-05 2006 3,919 72,679,681,505 1,061,797,276 4,567,569 21 Jan-06 2007 4,079 73,864,990,566 1,144,795,927 4,742,335 22 Mar-05 2007 4,187 82,441,128,546 1,215,085,694 5,207,865 23 May-08 2007 4,300 83,148,327,110 1,291,050,995 5,503,385 24 Jul-10 2007 4,511 89,856,995,521 1,365,916,222 6,073,814 25 Sep-11 2007 4,646 91,265,840,843 1,470,475,398 6,515,132 26 Nov-04 2007 4,737 99,105,705,485 1,495,032,507 6,698,250 27 Jan-06 2008 4,926 101,059,552,113 1,556,356,987 7,025,715 28 Mar-09 2008 5,059 102,051,350,525 1,770,627,427 7,914,560 29 May-04 2008 5,168 104,671,101,150 1,870,214,220 8,376,141 30 Jul-07 2008 5,395 105,074,486,709 1,913,447,691 8,572,852 31 Aug-30 2008 5,513 109,214,348,591 2,026,768,719 9,145,702 32 Nov-10 2008 5,726 111,122,203,221 2,089,596,746 9,501,764 33 Jan-16 2009 7,773 116,001,583,818 2,204,073,443 10,325,282 34 Mar-06 2009 8,054 111,792,574,830 2,299,682,138 10,021,870 35 May-04 2009 8,393 113,210,655,336 2,565,199,170 10,993,891 36 Jul-02 2009 8,665 117,013,741,530 2,756,884,219 12,141,825 37 Sep-03 2009 9,005 119,151,229,820 2,965,450,333 12,941,750 38 Nov-07 2009 9,166 119,196,622,435 3,115,246,540 13,436,447 --Russell From: shalu sharma [mailto:sharmashalu.bio at gmail.com] Sent: Saturday, 6 February 2010 3:56 a.m. To: Smithies, Russell Cc: bioperl-l at lists.open-bio.org Subject: Re: [Bioperl-l] Refseq Version Hi Russell, Thanks for your response. I am getting the number of sequence in the database but not the release number (like 38, 39). This is what i did: $ fastacmd -I -d /db/ncbiblast/refseq/refseq_protein Database: NCBI Protein Reference Sequences 7,585,993 sequences; 2,644,770,521 total letters File names: /db/ncbiblast/refseq/refseq_protein.00 Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 36,805 res /db/ncbiblast/refseq/refseq_protein.01 Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 33,403 res /db/ncbiblast/refseq/refseq_protein.02 Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 15,830 res I am still confuse that how i can get the release number. I know refseq 39 was released on Jan 30, 2010 but i don't know how to confirm this. I also tried look refseq release file but was not able to get any thing. I would really appreciate if anyone can help me out with this. Thanks Shalu On Thu, Feb 4, 2010 at 6:39 PM, Smithies, Russell > wrote: If you have access to the blast database, use fastacmd -I -d databasename Otherwise, it's usually at the bottom of your blast result. --Russell > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > bounces at lists.open-bio.org] On Behalf Of shalu sharma > Sent: Friday, 5 February 2010 11:02 a.m. > To: bioperl-l at lists.open-bio.org > Subject: [Bioperl-l] Refseq Version > > Hi All, > This is not a bioperl query. > Is there any way to check refseq version (release). Actually i am using > some > server to blast my sequences (blastall) against refseq. Is there any way i > can get the version information on the refseq database (from the blast > file > or directly from the database)? > > Thanks > Shalu > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l ======================================================================= Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately. ======================================================================= From Russell.Smithies at agresearch.co.nz Sun Feb 7 15:59:28 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Mon, 8 Feb 2010 09:59:28 +1300 Subject: [Bioperl-l] Refseq Version In-Reply-To: <18DF7D20DFEC044098A1062202F5FFF32C61BED76C@exchsth.agresearch.co.nz> References: <465b5a661002041401u3b8ac5fdp483f7032ce35139b@mail.gmail.com> <18DF7D20DFEC044098A1062202F5FFF32C61BED520@exchsth.agresearch.co.nz> <465b5a661002050655u34ff94acwdf8fe72512dec4b3@mail.gmail.com> <18DF7D20DFEC044098A1062202F5FFF32C61BED76C@exchsth.agresearch.co.nz> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61BED78C@exchsth.agresearch.co.nz> I should have known it would break the formatting :-( Try this: Release 1:June 30, 2003;Release Size: 4672871949 bases, 263588685 amino acids, 1061675 records Release 2:October 21, 2003;Release Size: 2124 organisms, 7745398573 nucleotide bases, 286957682 amino acids, 1097404 records Release 3:January 13, 2004;Release Size: 2218 organisms, 7992741222 nucleotide bases, 294647847 amino acids, 1101244 records Release 4:March 24, 2004;Release Size: 2358 organisms, 8175128887 nucleotide bases, 318253841 amino acids, 1193457 records Release 5:May 2 , 2004;Release Size: 2395 organisms, 8325515623 nucleotide bases, 337229387 amino acids, 1255613 records Release 6:July 5, 2004;Release Size: 2467 organisms, 8696371716 nucleotide bases, 365446682 amino acids, 1367206 records Release 7:September 12, 2004;Release Size: 2558 organisms, 21072808460 nucleotide bases, 405233619 amino acids, 1579579 records Release 8:October 31, 2004;Release Size: 2645 organisms, 26814386658 nucleotide bases, 430300369 amino acids, 1709723 records Release 9:January 9, 2005;Release Size: 2780 organisms, 36786975473 nucleotide bases, 470534907 amino acids, 1843944 records Release 10:March 6, 2005;Release Size:2827 organisms, 36893741150 nucleotide bases, 482862858 amino acids, 1893478 records Release 11:May 8, 2005;Release Size:2928 organisms, 39731702362 nucleotide bases, 507980644 amino acids, 2477893 records Release 12:July 10, 2005;Release Size:2969 organisms, 43043256058 nucleotide bases, 608493108 amino acids, 2869675 records Release 13:September 11, 2005;Release Size:3060 organisms, 44727484853 nucleotide bases, 686768902 amino acids, 3400773 records Release 14:November 20, 2005;Release Size:3198 organisms, 47364955367 nucleotide bases, 763761075 amino acids, 3272776 records Release 15:January 1, 2006;Release Size:3244 organisms, 52645441913 nucleotide bases, 810009733 amino acids, 3436263 records Release 16:March 11, 2006;Release Size:3397 organisms, 56175443059 nucleotide bases, 887509001 amino acids, 3715260 records Release 17:May 1, 2006;Release Size:3497 organisms, 62130037371 nucleotide bases, 927587669 amino acids, 3999859 records Release 18:July 11, 2006;Release Size:3695 organisms, 70474041999 nucleotide bases, 974374765 amino acids, 4186692 records Release 19:September 10, 2006;Release Size: 3774 organisms, 70694879544 nucleotide bases, 1012985077 amino acids, 4311543 records Release 20:November 5, 2006;Release Size:3919 organisms, 72679681505 nucleotide bases, 1061797276 amino acids, 4567569 records Release 21:January 6, 2007;Release Size:4079 organisms, 73864990566 nucleotide bases, 1144795927 amino acids, 4742335 records Release 22:March 5, 2007;Release Size:4187 organisms, 82441128546 nucleotide bases, 1215085694 amino acids, 5207865 records Release 23:May 8, 2007;Release Size:4300 organisms, 83148327110 nucleotide bases, 1291050995 amino acids, 5503385 records Release 24:July 10, 2007;Release Size:4511 organisms, 89856995521 nucleotide bases, 1365916222 amino acids, 6073814 records Release 25:September 11, 2007;Release Size:4646 organisms, 91265840843 nucleotide bases, 1470475398 amino acids, 6515132 records Release 26:November 4, 2007;Release Size:4737 organisms, 99105705485 nucleotide bases, 1495032507 amino acids, 6698250 records Release 27:January 6, 2008;Release Size:4926 organisms, 101059552113 nucleotide bases, 1556356987 amino acids, 7025715 records Release 28:March 9, 2008;Release Size: 5059 organisms, 102051350525 nucleotide bases, 1770627427 amino acids, 7914560 records Release 29:May 4, 2008;Release Size:5168 organisms, 104671101150 nucleotide bases, 1870214220 amino acids, 8376141 records Release 30:July 7, 2008;Release Size:5395 organisms, 105074486709 nucleotide bases, 1913447691 amino acids, 8572852 records Release 31:August 30, 2008;Release Size: 5513 organisms, 109214348591 nucleotide bases, 2026768719 amino acids, 9145702 records Release 32:November 10, 2008;Release Size: 5726 organisms, 111122203221 nucleotide bases, 2089596746 amino acids, 9501764 records Release 33:January 16, 2009;Release Size:7773 organisms, 116001583818 nucleotide bases, 2204073443 amino acids, 10325282 records Release 34:March 6, 2009;Release Size: 8054 organisms, 111792574830 nucleotide bases, 2299682138 amino acids, 10021870 records Release 35:May 4, 2009;Release Size: 8393 organisms, 113210655336 nucleotide bases, 2565199170 amino acids, 10993891 records Release 36:July 2, 2009;Release Size: 8665 organisms, 117013741530 nucleotide bases, 2756884219 amino acids, 12141825 records Release 37:September 3, 2009;Release Size: 9005 organisms, 119151229820 nucleotide bases, 2965450333 amino acids, 12941750 records Release 38:November 7, 2009;Release Size: 9166 organisms, 119196622435 nucleotide bases, 3115246540 amino acids, 13436447 records > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > bounces at lists.open-bio.org] On Behalf Of Smithies, Russell > Sent: Monday, 8 February 2010 9:47 a.m. > To: 'shalu sharma' > Cc: 'bioperl-l at lists.open-bio.org' > Subject: Re: [Bioperl-l] Refseq Version > > Release 39 was Jan 30 and according to the README releases only come out > in odd months (January, March, May, July, September, November) > The stats file is here: ftp://ftp.ncbi.nih.gov/refseq/release/release- > statistics/RefSeq-release39.01232010.stats.txt > > The numbers of sequences between the fasta release and the pre-build blast > databases seem to differ but I guess only NCBI can explain that. > I can't see any way of extracting the release number from the pre-build > blast databases (apart from the build date) but it might be worth asking > NCBI if they'd include the information in future releases. > > > FYI, here's the old release stats. > (I wget'ed and grep'ed all the stats files) > > Release > > Date > > Year > > Organisms > > Nucleotide Bases > > Amino Acids > > Records > > 1 > > Jun-30 > > 2003 > > 4,672,871,949 > > 263,588,685 > > 1,061,675 > > 2 > > Oct-21 > > 2003 > > 2,124 > > 7,745,398,573 > > 286,957,682 > > 1,097,404 > > 3 > > Jan-13 > > 2004 > > 2,218 > > 7,992,741,222 > > 294,647,847 > > 1,101,244 > > 4 > > Mar-24 > > 2004 > > 2,358 > > 8,175,128,887 > > 318,253,841 > > 1,193,457 > > 5 > > May-02 > > 2004 > > 2,395 > > 8,325,515,623 > > 337,229,387 > > 1,255,613 > > 6 > > Jul-05 > > 2004 > > 2,467 > > 8,696,371,716 > > 365,446,682 > > 1,367,206 > > 7 > > Sep-12 > > 2004 > > 2,558 > > 21,072,808,460 > > 405,233,619 > > 1,579,579 > > 8 > > Oct-31 > > 2004 > > 2,645 > > 26,814,386,658 > > 430,300,369 > > 1,709,723 > > 9 > > Jan-09 > > 2005 > > 2,780 > > 36,786,975,473 > > 470,534,907 > > 1,843,944 > > 10 > > Mar-06 > > 2005 > > 2,827 > > 36,893,741,150 > > 482,862,858 > > 1,893,478 > > 11 > > May-08 > > 2005 > > 2,928 > > 39,731,702,362 > > 507,980,644 > > 2,477,893 > > 12 > > Jul-10 > > 2005 > > 2,969 > > 43,043,256,058 > > 608,493,108 > > 2,869,675 > > 13 > > Sep-11 > > 2005 > > 3,060 > > 44,727,484,853 > > 686,768,902 > > 3,400,773 > > 14 > > Nov-20 > > 2005 > > 3,198 > > 47,364,955,367 > > 763,761,075 > > 3,272,776 > > 15 > > Jan-01 > > 2006 > > 3,244 > > 52,645,441,913 > > 810,009,733 > > 3,436,263 > > 16 > > Mar-11 > > 2006 > > 3,397 > > 56,175,443,059 > > 887,509,001 > > 3,715,260 > > 17 > > May-01 > > 2006 > > 3,497 > > 62,130,037,371 > > 927,587,669 > > 3,999,859 > > 18 > > Jul-11 > > 2006 > > 3,695 > > 70,474,041,999 > > 974,374,765 > > 4,186,692 > > 19 > > Sep-10 > > 2006 > > 3,774 > > 70,694,879,544 > > 1,012,985,077 > > 4,311,543 > > 20 > > Nov-05 > > 2006 > > 3,919 > > 72,679,681,505 > > 1,061,797,276 > > 4,567,569 > > 21 > > Jan-06 > > 2007 > > 4,079 > > 73,864,990,566 > > 1,144,795,927 > > 4,742,335 > > 22 > > Mar-05 > > 2007 > > 4,187 > > 82,441,128,546 > > 1,215,085,694 > > 5,207,865 > > 23 > > May-08 > > 2007 > > 4,300 > > 83,148,327,110 > > 1,291,050,995 > > 5,503,385 > > 24 > > Jul-10 > > 2007 > > 4,511 > > 89,856,995,521 > > 1,365,916,222 > > 6,073,814 > > 25 > > Sep-11 > > 2007 > > 4,646 > > 91,265,840,843 > > 1,470,475,398 > > 6,515,132 > > 26 > > Nov-04 > > 2007 > > 4,737 > > 99,105,705,485 > > 1,495,032,507 > > 6,698,250 > > 27 > > Jan-06 > > 2008 > > 4,926 > > 101,059,552,113 > > 1,556,356,987 > > 7,025,715 > > 28 > > Mar-09 > > 2008 > > 5,059 > > 102,051,350,525 > > 1,770,627,427 > > 7,914,560 > > 29 > > May-04 > > 2008 > > 5,168 > > 104,671,101,150 > > 1,870,214,220 > > 8,376,141 > > 30 > > Jul-07 > > 2008 > > 5,395 > > 105,074,486,709 > > 1,913,447,691 > > 8,572,852 > > 31 > > Aug-30 > > 2008 > > 5,513 > > 109,214,348,591 > > 2,026,768,719 > > 9,145,702 > > 32 > > Nov-10 > > 2008 > > 5,726 > > 111,122,203,221 > > 2,089,596,746 > > 9,501,764 > > 33 > > Jan-16 > > 2009 > > 7,773 > > 116,001,583,818 > > 2,204,073,443 > > 10,325,282 > > 34 > > Mar-06 > > 2009 > > 8,054 > > 111,792,574,830 > > 2,299,682,138 > > 10,021,870 > > 35 > > May-04 > > 2009 > > 8,393 > > 113,210,655,336 > > 2,565,199,170 > > 10,993,891 > > 36 > > Jul-02 > > 2009 > > 8,665 > > 117,013,741,530 > > 2,756,884,219 > > 12,141,825 > > 37 > > Sep-03 > > 2009 > > 9,005 > > 119,151,229,820 > > 2,965,450,333 > > 12,941,750 > > 38 > > Nov-07 > > 2009 > > 9,166 > > 119,196,622,435 > > 3,115,246,540 > > 13,436,447 > > > > --Russell > > > From: shalu sharma [mailto:sharmashalu.bio at gmail.com] > Sent: Saturday, 6 February 2010 3:56 a.m. > To: Smithies, Russell > Cc: bioperl-l at lists.open-bio.org > Subject: Re: [Bioperl-l] Refseq Version > > Hi Russell, > Thanks for your response. > I am getting the number of sequence in the database but not the release > number (like 38, 39). > This is what i did: > > $ fastacmd -I -d /db/ncbiblast/refseq/refseq_protein > Database: NCBI Protein Reference Sequences > 7,585,993 sequences; 2,644,770,521 total letters > > File names: > /db/ncbiblast/refseq/refseq_protein.00 > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 36,805 > res > /db/ncbiblast/refseq/refseq_protein.01 > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 33,403 > res > /db/ncbiblast/refseq/refseq_protein.02 > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 15,830 > res > > I am still confuse that how i can get the release number. I know refseq 39 > was released on Jan 30, 2010 but i don't know how to confirm this. I also > tried look refseq release file but was not able to get any thing. > > I would really appreciate if anyone can help me out with this. > > Thanks > Shalu > > On Thu, Feb 4, 2010 at 6:39 PM, Smithies, Russell > z>> wrote: > If you have access to the blast database, use fastacmd -I -d databasename > Otherwise, it's usually at the bottom of your blast result. > > --Russell > > > -----Original Message----- > > From: bioperl-l-bounces at lists.open-bio.org bounces at lists.open-bio.org> [mailto:bioperl-l- > > bounces at lists.open-bio.org] On Behalf > Of shalu sharma > > Sent: Friday, 5 February 2010 11:02 a.m. > > To: bioperl-l at lists.open-bio.org > > Subject: [Bioperl-l] Refseq Version > > > > Hi All, > > This is not a bioperl query. > > Is there any way to check refseq version (release). Actually i am using > > some > > server to blast my sequences (blastall) against refseq. Is there any way > i > > can get the version information on the refseq database (from the blast > > file > > or directly from the database)? > > > > Thanks > > Shalu > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > ======================================================================= > Attention: The information contained in this message and/or attachments > from AgResearch Limited is intended only for the persons or entities > to which it is addressed and may contain confidential and/or privileged > material. Any review, retransmission, dissemination or other use of, or > taking of any action in reliance upon, this information by persons or > entities other than the intended recipients is prohibited by AgResearch > Limited. If you have received this message in error, please notify the > sender immediately. > ======================================================================= > > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From Russell.Smithies at agresearch.co.nz Sun Feb 7 16:05:17 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Mon, 8 Feb 2010 10:05:17 +1300 Subject: [Bioperl-l] Refseq Version In-Reply-To: <18DF7D20DFEC044098A1062202F5FFF32C61BED78C@exchsth.agresearch.co.nz> References: <465b5a661002041401u3b8ac5fdp483f7032ce35139b@mail.gmail.com> <18DF7D20DFEC044098A1062202F5FFF32C61BED520@exchsth.agresearch.co.nz> <465b5a661002050655u34ff94acwdf8fe72512dec4b3@mail.gmail.com> <18DF7D20DFEC044098A1062202F5FFF32C61BED76C@exchsth.agresearch.co.nz> <18DF7D20DFEC044098A1062202F5FFF32C61BED78C@exchsth.agresearch.co.nz> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61BED795@exchsth.agresearch.co.nz> AAArrrgg, what is it with Outlook this morning!!! Formatting kaput again but I'm sure you can work it out from there! --Russell > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > bounces at lists.open-bio.org] On Behalf Of Smithies, Russell > Sent: Monday, 8 February 2010 9:59 a.m. > To: 'shalu sharma' > Cc: 'bioperl-l at lists.open-bio.org' > Subject: Re: [Bioperl-l] Refseq Version > > I should have known it would break the formatting :-( > > Try this: > > Release 1:June 30, 2003;Release Size: 4672871949 bases, 263588685 amino > acids, 1061675 records > Release 2:October 21, 2003;Release Size: 2124 organisms, 7745398573 > nucleotide bases, 286957682 amino acids, 1097404 records > Release 3:January 13, 2004;Release Size: 2218 organisms, 7992741222 > nucleotide bases, 294647847 amino acids, 1101244 records > Release 4:March 24, 2004;Release Size: 2358 organisms, 8175128887 > nucleotide bases, 318253841 amino acids, 1193457 records > Release 5:May 2 , 2004;Release Size: 2395 organisms, 8325515623 nucleotide > bases, 337229387 amino acids, 1255613 records > Release 6:July 5, 2004;Release Size: 2467 organisms, 8696371716 nucleotide > bases, 365446682 amino acids, 1367206 records > Release 7:September 12, 2004;Release Size: 2558 organisms, 21072808460 > nucleotide bases, 405233619 amino acids, 1579579 records > Release 8:October 31, 2004;Release Size: 2645 organisms, 26814386658 > nucleotide bases, 430300369 amino acids, 1709723 records > Release 9:January 9, 2005;Release Size: 2780 organisms, 36786975473 > nucleotide bases, 470534907 amino acids, 1843944 records > Release 10:March 6, 2005;Release Size:2827 organisms, 36893741150 > nucleotide bases, 482862858 amino acids, 1893478 records > Release 11:May 8, 2005;Release Size:2928 organisms, 39731702362 nucleotide > bases, 507980644 amino acids, 2477893 records > Release 12:July 10, 2005;Release Size:2969 organisms, 43043256058 > nucleotide bases, 608493108 amino acids, 2869675 records > Release 13:September 11, 2005;Release Size:3060 organisms, 44727484853 > nucleotide bases, 686768902 amino acids, 3400773 records > Release 14:November 20, 2005;Release Size:3198 organisms, 47364955367 > nucleotide bases, 763761075 amino acids, 3272776 records > Release 15:January 1, 2006;Release Size:3244 organisms, 52645441913 > nucleotide bases, 810009733 amino acids, 3436263 records > Release 16:March 11, 2006;Release Size:3397 organisms, 56175443059 > nucleotide bases, 887509001 amino acids, 3715260 records > Release 17:May 1, 2006;Release Size:3497 organisms, 62130037371 nucleotide > bases, 927587669 amino acids, 3999859 records > Release 18:July 11, 2006;Release Size:3695 organisms, 70474041999 > nucleotide bases, 974374765 amino acids, 4186692 records > Release 19:September 10, 2006;Release Size: 3774 organisms, 70694879544 > nucleotide bases, 1012985077 amino acids, 4311543 records > Release 20:November 5, 2006;Release Size:3919 organisms, 72679681505 > nucleotide bases, 1061797276 amino acids, 4567569 records > Release 21:January 6, 2007;Release Size:4079 organisms, 73864990566 > nucleotide bases, 1144795927 amino acids, 4742335 records > Release 22:March 5, 2007;Release Size:4187 organisms, 82441128546 > nucleotide bases, 1215085694 amino acids, 5207865 records > Release 23:May 8, 2007;Release Size:4300 organisms, 83148327110 nucleotide > bases, 1291050995 amino acids, 5503385 records > Release 24:July 10, 2007;Release Size:4511 organisms, 89856995521 > nucleotide bases, 1365916222 amino acids, 6073814 records > Release 25:September 11, 2007;Release Size:4646 organisms, 91265840843 > nucleotide bases, 1470475398 amino acids, 6515132 records > Release 26:November 4, 2007;Release Size:4737 organisms, 99105705485 > nucleotide bases, 1495032507 amino acids, 6698250 records > Release 27:January 6, 2008;Release Size:4926 organisms, 101059552113 > nucleotide bases, 1556356987 amino acids, 7025715 records > Release 28:March 9, 2008;Release Size: 5059 organisms, 102051350525 > nucleotide bases, 1770627427 amino acids, 7914560 records > Release 29:May 4, 2008;Release Size:5168 organisms, 104671101150 > nucleotide bases, 1870214220 amino acids, 8376141 records > Release 30:July 7, 2008;Release Size:5395 organisms, 105074486709 > nucleotide bases, 1913447691 amino acids, 8572852 records > Release 31:August 30, 2008;Release Size: 5513 organisms, 109214348591 > nucleotide bases, 2026768719 amino acids, 9145702 records > Release 32:November 10, 2008;Release Size: 5726 organisms, 111122203221 > nucleotide bases, 2089596746 amino acids, 9501764 records > Release 33:January 16, 2009;Release Size:7773 organisms, 116001583818 > nucleotide bases, 2204073443 amino acids, 10325282 records > Release 34:March 6, 2009;Release Size: 8054 organisms, 111792574830 > nucleotide bases, 2299682138 amino acids, 10021870 records > Release 35:May 4, 2009;Release Size: 8393 organisms, 113210655336 > nucleotide bases, 2565199170 amino acids, 10993891 records > Release 36:July 2, 2009;Release Size: 8665 organisms, 117013741530 > nucleotide bases, 2756884219 amino acids, 12141825 records > Release 37:September 3, 2009;Release Size: 9005 organisms, 119151229820 > nucleotide bases, 2965450333 amino acids, 12941750 records > Release 38:November 7, 2009;Release Size: 9166 organisms, 119196622435 > nucleotide bases, 3115246540 amino acids, 13436447 records > > > > > -----Original Message----- > > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > > bounces at lists.open-bio.org] On Behalf Of Smithies, Russell > > Sent: Monday, 8 February 2010 9:47 a.m. > > To: 'shalu sharma' > > Cc: 'bioperl-l at lists.open-bio.org' > > Subject: Re: [Bioperl-l] Refseq Version > > > > Release 39 was Jan 30 and according to the README releases only come out > > in odd months (January, March, May, July, September, November) > > The stats file is here: ftp://ftp.ncbi.nih.gov/refseq/release/release- > > statistics/RefSeq-release39.01232010.stats.txt > > > > The numbers of sequences between the fasta release and the pre-build > blast > > databases seem to differ but I guess only NCBI can explain that. > > I can't see any way of extracting the release number from the pre-build > > blast databases (apart from the build date) but it might be worth asking > > NCBI if they'd include the information in future releases. > > > > > > FYI, here's the old release stats. > > (I wget'ed and grep'ed all the stats files) > > > > Release > > > > Date > > > > Year > > > > Organisms > > > > Nucleotide Bases > > > > Amino Acids > > > > Records > > > > 1 > > > > Jun-30 > > > > 2003 > > > > 4,672,871,949 > > > > 263,588,685 > > > > 1,061,675 > > > > 2 > > > > Oct-21 > > > > 2003 > > > > 2,124 > > > > 7,745,398,573 > > > > 286,957,682 > > > > 1,097,404 > > > > 3 > > > > Jan-13 > > > > 2004 > > > > 2,218 > > > > 7,992,741,222 > > > > 294,647,847 > > > > 1,101,244 > > > > 4 > > > > Mar-24 > > > > 2004 > > > > 2,358 > > > > 8,175,128,887 > > > > 318,253,841 > > > > 1,193,457 > > > > 5 > > > > May-02 > > > > 2004 > > > > 2,395 > > > > 8,325,515,623 > > > > 337,229,387 > > > > 1,255,613 > > > > 6 > > > > Jul-05 > > > > 2004 > > > > 2,467 > > > > 8,696,371,716 > > > > 365,446,682 > > > > 1,367,206 > > > > 7 > > > > Sep-12 > > > > 2004 > > > > 2,558 > > > > 21,072,808,460 > > > > 405,233,619 > > > > 1,579,579 > > > > 8 > > > > Oct-31 > > > > 2004 > > > > 2,645 > > > > 26,814,386,658 > > > > 430,300,369 > > > > 1,709,723 > > > > 9 > > > > Jan-09 > > > > 2005 > > > > 2,780 > > > > 36,786,975,473 > > > > 470,534,907 > > > > 1,843,944 > > > > 10 > > > > Mar-06 > > > > 2005 > > > > 2,827 > > > > 36,893,741,150 > > > > 482,862,858 > > > > 1,893,478 > > > > 11 > > > > May-08 > > > > 2005 > > > > 2,928 > > > > 39,731,702,362 > > > > 507,980,644 > > > > 2,477,893 > > > > 12 > > > > Jul-10 > > > > 2005 > > > > 2,969 > > > > 43,043,256,058 > > > > 608,493,108 > > > > 2,869,675 > > > > 13 > > > > Sep-11 > > > > 2005 > > > > 3,060 > > > > 44,727,484,853 > > > > 686,768,902 > > > > 3,400,773 > > > > 14 > > > > Nov-20 > > > > 2005 > > > > 3,198 > > > > 47,364,955,367 > > > > 763,761,075 > > > > 3,272,776 > > > > 15 > > > > Jan-01 > > > > 2006 > > > > 3,244 > > > > 52,645,441,913 > > > > 810,009,733 > > > > 3,436,263 > > > > 16 > > > > Mar-11 > > > > 2006 > > > > 3,397 > > > > 56,175,443,059 > > > > 887,509,001 > > > > 3,715,260 > > > > 17 > > > > May-01 > > > > 2006 > > > > 3,497 > > > > 62,130,037,371 > > > > 927,587,669 > > > > 3,999,859 > > > > 18 > > > > Jul-11 > > > > 2006 > > > > 3,695 > > > > 70,474,041,999 > > > > 974,374,765 > > > > 4,186,692 > > > > 19 > > > > Sep-10 > > > > 2006 > > > > 3,774 > > > > 70,694,879,544 > > > > 1,012,985,077 > > > > 4,311,543 > > > > 20 > > > > Nov-05 > > > > 2006 > > > > 3,919 > > > > 72,679,681,505 > > > > 1,061,797,276 > > > > 4,567,569 > > > > 21 > > > > Jan-06 > > > > 2007 > > > > 4,079 > > > > 73,864,990,566 > > > > 1,144,795,927 > > > > 4,742,335 > > > > 22 > > > > Mar-05 > > > > 2007 > > > > 4,187 > > > > 82,441,128,546 > > > > 1,215,085,694 > > > > 5,207,865 > > > > 23 > > > > May-08 > > > > 2007 > > > > 4,300 > > > > 83,148,327,110 > > > > 1,291,050,995 > > > > 5,503,385 > > > > 24 > > > > Jul-10 > > > > 2007 > > > > 4,511 > > > > 89,856,995,521 > > > > 1,365,916,222 > > > > 6,073,814 > > > > 25 > > > > Sep-11 > > > > 2007 > > > > 4,646 > > > > 91,265,840,843 > > > > 1,470,475,398 > > > > 6,515,132 > > > > 26 > > > > Nov-04 > > > > 2007 > > > > 4,737 > > > > 99,105,705,485 > > > > 1,495,032,507 > > > > 6,698,250 > > > > 27 > > > > Jan-06 > > > > 2008 > > > > 4,926 > > > > 101,059,552,113 > > > > 1,556,356,987 > > > > 7,025,715 > > > > 28 > > > > Mar-09 > > > > 2008 > > > > 5,059 > > > > 102,051,350,525 > > > > 1,770,627,427 > > > > 7,914,560 > > > > 29 > > > > May-04 > > > > 2008 > > > > 5,168 > > > > 104,671,101,150 > > > > 1,870,214,220 > > > > 8,376,141 > > > > 30 > > > > Jul-07 > > > > 2008 > > > > 5,395 > > > > 105,074,486,709 > > > > 1,913,447,691 > > > > 8,572,852 > > > > 31 > > > > Aug-30 > > > > 2008 > > > > 5,513 > > > > 109,214,348,591 > > > > 2,026,768,719 > > > > 9,145,702 > > > > 32 > > > > Nov-10 > > > > 2008 > > > > 5,726 > > > > 111,122,203,221 > > > > 2,089,596,746 > > > > 9,501,764 > > > > 33 > > > > Jan-16 > > > > 2009 > > > > 7,773 > > > > 116,001,583,818 > > > > 2,204,073,443 > > > > 10,325,282 > > > > 34 > > > > Mar-06 > > > > 2009 > > > > 8,054 > > > > 111,792,574,830 > > > > 2,299,682,138 > > > > 10,021,870 > > > > 35 > > > > May-04 > > > > 2009 > > > > 8,393 > > > > 113,210,655,336 > > > > 2,565,199,170 > > > > 10,993,891 > > > > 36 > > > > Jul-02 > > > > 2009 > > > > 8,665 > > > > 117,013,741,530 > > > > 2,756,884,219 > > > > 12,141,825 > > > > 37 > > > > Sep-03 > > > > 2009 > > > > 9,005 > > > > 119,151,229,820 > > > > 2,965,450,333 > > > > 12,941,750 > > > > 38 > > > > Nov-07 > > > > 2009 > > > > 9,166 > > > > 119,196,622,435 > > > > 3,115,246,540 > > > > 13,436,447 > > > > > > > > --Russell > > > > > > From: shalu sharma [mailto:sharmashalu.bio at gmail.com] > > Sent: Saturday, 6 February 2010 3:56 a.m. > > To: Smithies, Russell > > Cc: bioperl-l at lists.open-bio.org > > Subject: Re: [Bioperl-l] Refseq Version > > > > Hi Russell, > > Thanks for your response. > > I am getting the number of sequence in the database but not the release > > number (like 38, 39). > > This is what i did: > > > > $ fastacmd -I -d /db/ncbiblast/refseq/refseq_protein > > Database: NCBI Protein Reference Sequences > > 7,585,993 sequences; 2,644,770,521 total letters > > > > File names: > > /db/ncbiblast/refseq/refseq_protein.00 > > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 36,805 > > res > > /db/ncbiblast/refseq/refseq_protein.01 > > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 33,403 > > res > > /db/ncbiblast/refseq/refseq_protein.02 > > Date: Jan 30, 2010 8:34 PM Version: 4 Longest sequence: 15,830 > > res > > > > I am still confuse that how i can get the release number. I know refseq > 39 > > was released on Jan 30, 2010 but i don't know how to confirm this. I > also > > tried look refseq release file but was not able to get any thing. > > > > I would really appreciate if anyone can help me out with this. > > > > Thanks > > Shalu > > > > On Thu, Feb 4, 2010 at 6:39 PM, Smithies, Russell > > > > z>> wrote: > > If you have access to the blast database, use fastacmd -I -d > databasename > > Otherwise, it's usually at the bottom of your blast result. > > > > --Russell > > > > > -----Original Message----- > > > From: bioperl-l-bounces at lists.open-bio.org > bounces at lists.open-bio.org> [mailto:bioperl-l- > > > bounces at lists.open-bio.org] On > Behalf > > Of shalu sharma > > > Sent: Friday, 5 February 2010 11:02 a.m. > > > To: bioperl-l at lists.open-bio.org > > > Subject: [Bioperl-l] Refseq Version > > > > > > Hi All, > > > This is not a bioperl query. > > > Is there any way to check refseq version (release). Actually i am > using > > > some > > > server to blast my sequences (blastall) against refseq. Is there any > way > > i > > > can get the version information on the refseq database (from the blast > > > file > > > or directly from the database)? > > > > > > Thanks > > > Shalu > > > _______________________________________________ > > > Bioperl-l mailing list > > > Bioperl-l at lists.open-bio.org > > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > ======================================================================= > > Attention: The information contained in this message and/or attachments > > from AgResearch Limited is intended only for the persons or entities > > to which it is addressed and may contain confidential and/or privileged > > material. Any review, retransmission, dissemination or other use of, or > > taking of any action in reliance upon, this information by persons or > > entities other than the intended recipients is prohibited by AgResearch > > Limited. If you have received this message in error, please notify the > > sender immediately. > > ======================================================================= > > > > > > _______________________________________________ > > Bioperl-l mailing list > > Bioperl-l at lists.open-bio.org > > http://lists.open-bio.org/mailman/listinfo/bioperl-l > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l From Russell.Smithies at agresearch.co.nz Sun Feb 7 16:09:59 2010 From: Russell.Smithies at agresearch.co.nz (Smithies, Russell) Date: Mon, 8 Feb 2010 10:09:59 +1300 Subject: [Bioperl-l] Bio::Graphics inside PHP In-Reply-To: <4B6C58B5.3080304@uab.cat> References: <4B6C58B5.3080304@uab.cat> Message-ID: <18DF7D20DFEC044098A1062202F5FFF32C61BED7A0@exchsth.agresearch.co.nz> We looked at this a year or so, you can base64 encode an image and use that in the img src tag but only a few browsers support it. Easiest way is write the image as a png (with a unique name) to a temp dir. --Russell > -----Original Message----- > From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- > bounces at lists.open-bio.org] On Behalf Of Miquel Ramia > Sent: Saturday, 6 February 2010 6:43 a.m. > To: BioPerl List > Subject: [Bioperl-l] Bio::Graphics inside PHP > > Hi, > > I'm working in a database with a web interface developed in php. I'm > displaying Bio::Graphic images inside an IFrame, but doesn't look very > well. > > Anybody knows if it's possible to include a Bio::Graphics image directly > in a php page? > > Thank you > > Miquel R?mia > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l ======================================================================= Attention: The information contained in this message and/or attachments from AgResearch Limited is intended only for the persons or entities to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipients is prohibited by AgResearch Limited. If you have received this message in error, please notify the sender immediately. ======================================================================= From sdavis2 at mail.nih.gov Sun Feb 7 18:00:34 2010 From: sdavis2 at mail.nih.gov (Sean Davis) Date: Sun, 7 Feb 2010 18:00:34 -0500 Subject: [Bioperl-l] Bio::Graphics inside PHP In-Reply-To: <18DF7D20DFEC044098A1062202F5FFF32C61BED7A0@exchsth.agresearch.co.nz> References: <4B6C58B5.3080304@uab.cat> <18DF7D20DFEC044098A1062202F5FFF32C61BED7A0@exchsth.agresearch.co.nz> Message-ID: <264855a01002071500r6c4662cbq7814a78b22b416b2@mail.gmail.com> On Sun, Feb 7, 2010 at 4:09 PM, Smithies, Russell wrote: > We looked at this a year or so, you can base64 encode an image and use that in the img src tag but only a few browsers support it. Easiest way is write the image as a png (with a unique name) to a temp dir. > You could also have a separate URL that returns the image and then embed that URL in the generated page, thus avoiding the temp file. Sean >> -----Original Message----- >> From: bioperl-l-bounces at lists.open-bio.org [mailto:bioperl-l- >> bounces at lists.open-bio.org] On Behalf Of Miquel Ramia >> Sent: Saturday, 6 February 2010 6:43 a.m. >> To: BioPerl List >> Subject: [Bioperl-l] Bio::Graphics inside PHP >> >> Hi, >> >> I'm working in a database with a web interface developed in php. I'm >> displaying Bio::Graphic images inside an IFrame, but doesn't look very >> well. >> >> Anybody knows if it's possible to include a Bio::Graphics image directly >> in a php page? >> >> Thank you >> >> Miquel R?mia >> _______________________________________________ >> Bioperl-l mailing list >> Bioperl-l at lists.open-bio.org >> http://lists.open-bio.org/mailman/listinfo/bioperl-l > ======================================================================= > Attention: The information contained in this message and/or attachments > from AgResearch Limited is intended only for the persons or entities > to which it is addressed and may contain confidential and/or privileged > material. Any review, retransmission, dissemination or other use of, or > taking of any action in reliance upon, this information by persons or > entities other than the intended recipients is prohibited by AgResearch > Limited. If you have received this message in error, please notify the > sender immediately. > ======================================================================= > > _______________________________________________ > Bioperl-l mailing list > Bioperl-l at lists.open-bio.org > http://lists.open-bio.org/mailman/listinfo/bioperl-l > From jessica.sun at gmail.com Sun Feb 7 21:53:56 2010 From: jessica.sun at gmail.com (Jessica Sun) Date: Sun, 7 Feb 2010 21:53:56 -0500 Subject: [Bioperl-l] questions Message-ID: <9adc0e9b1002071853v2fd4bbe8gb34a6228d346a948@mail.gmail.com> Does any one knows how to add locations(CDS, Joins(600,140;...) to the new sequence object as feature,' or if anyone knows how to keep the feature of truncated sequences object. thx -- Jessica Jingping Sun From roy.chaudhuri at gmail.com Mon Feb 8 06:09:29 2010 From: roy.chaudhuri at gmail.com (Roy Chaudhuri) Date: Mon, 08 Feb 2010 11:09:29 +0000 Subject: [Bioperl-l] questions In-Reply-To: <9adc0e9b1002071853v2fd4bbe8gb34a6228d346a948@mail.gmail.com> References: <9adc0e9b1002071853v2fd4bbe8gb34a6228d346a948@mail.gmail.com> Message-ID: <4B6FF0E9.3040704@gmail.com> Hi Jessica, > Does any one knows how to add locations(CDS, Joins(600,140;...) to the new > sequence object as feature,' use Bio::SeqIO; use Bio::Seq; use Bio::Location::Split; use Bio::Location::Simple; use Bio::SeqFeature::Generic; my $seq=Bio::Seq->new(-seq=>'ACTGATCGATGCATGATGTAGATCGATAGCTGCTA'); my $loc=Bio::Location::Split->new; $loc->add_sub_Location(Bio::Location::Simple->new(-start=>4, -end=>6, -strand=>1); $loc->add_sub_Location(Bio::Location::Simple->new(-start=>10, -end=>12, -strand=>1); my $feat=Bio::SeqFeature::Generic->new(-primary_tag=>'misc_feat', -location=>$loc); $seq->add_SeqFeature($feat); my $out=Bio::SeqIO->new(-format=>'genbank'; $out->write_seq($seq); > or if anyone knows how to keep the feature of truncated sequences object. Look at the trunc_with_features method in Bio::SeqUtils: use Bio::SeqUtils; my $trunc=Bio::SeqUtils->trunc_with_features($seq, 2, 12); $out->write_seq($trunc); Hope this helps. Roy. From maj at fortinbras.us Mon Feb 8 09:38:25 2010 From: maj at fortinbras.us (Mark A. Jensen) Date: Mon, 8 Feb 2010 09:38:25 -0500 Subject: [Bioperl-l] server maintenace occurring Message-ID: <8F69A20965E64983887DE6028356911A@NewLife> Hi All, Chris Dag informs us that necessary server changes are taking place over the next couple of days. There could be temporary outages at the wiki and codebase. Thanks for your patience. MAJ From bluecurio at gmail.com Mon Feb 8 11:20:43 2010 From: bluecurio at gmail.com (Daniel Renfro) Date: Mon, 8 Feb 2010 10:20:43 -0600 Subject: [Bioperl-l] Bio::Graphics inside PHP Message-ID: <25c0f63d1002080820l60e9f9d2p837b9afd8bb1f90d@mail.gmail.com> I've been trying different ways to do this same thing. At EcoliWiki.net we use GBrowse to show all the strains of Ecoli and assoc phage, plasmids, etc. so this is definitely something I think many other model-organism databases could use. The final solution was to construct a link to gbrowse_img with all the necessary variables coming in from $_REQUEST and throw it in the src="" attribute of the tag. This works pretty good for what we're doing. Good luck. -Daniel Renfro Hu Lab Research Associate Dept. of Biochemistry and Biophysics 2128 TAMU Texas A&M Univ. College Station, TX 77843-2128 979-862-4055 From lincoln.stein at gmail.com Mon Feb 8 11:33:18 2010 From: lincoln.stein at gmail.com (Lincoln Stein) Date: Mon, 8 Feb 2010 11:33:18 -0500 Subject: [Bioperl-l] Bio::Graphics inside PHP In-Reply-To: <25c0f63d1002080820l60e9f9d2p837b9afd8bb1f90d@mail.gmail.com> References: <25c0f63d1002080820l60e9f9d2p837b9afd8bb1f90d@mail.gmail.com> Message-ID: <6dce9a0b1002080833o3ad29300kfcee727ed499e74e@mail.gmail.com> There's an embeddable version of gbrowse_img that generates the imagemap so it is clickable. If you use an