[Bioperl-l] problem parsing FASTA output - bug or my fault?

Aidan Budd budd at embl-heidelberg.de
Thu Apr 26 06:18:11 EDT 2007


Hi Bioperlers,

I'm trying to parse a FASTA search output file (see attached .out file) 
using Bioperl 1.4. My Bioperl installation has otherwise been working 
fine, however I currently get the following error when running a simple 
script that attempts to access result from this outfile via bioperl.

Is this a problem with the parser?
Or have I executed FASTA wrongly creating output that isn't covered by the 
parser?

Any suggestions on how to deal with this much appreciated.

Best wishes,

Aidan

Script:

#!/usr/bin/perl -w
$^W=1;
use strict;
use Bio::SearchIO;

my $fasta_report = new Bio::SearchIO ('-format' => 'fasta',
                                      '-file'   => $ARGV[0]);
                                      
my $result = $fasta_report->next_result();            

Errors:

Use of uninitialized value in concatenation (.) or string at 
/Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/Search/HSP/GenericHSP.pm 
line 231, <GEN3> line 47.

------------- EXCEPTION  -------------
MSG: Did not specify a Query End or Query Begin -verbose 0 -algorithm 
FASTP -score 62.4 -hit_frame 0 -hsp_length 180 -hit_seq  -hit_length 0 
-query_length 128 -query_frame 0 -swscore 122 -rank 1 -query_seq 
GTTILQYAQTTDGQQILVPSNQVVVQAASGDVQTYQIRTAPTSTIAPGVVMASS--PALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVLENQ-NKTLIEELKALKD-LYCHKSD 
-homology_seq                              
MEMTDFELTSNSQ.NL.IPTNFK.TLP.RKRAKTK..KEQR.IE.ILR..R..HQS.E..RLHLQY..RKCSL...LL.SVNL.K.ADHE.A.T.SHDAFVASLDEYRDFQSTRGASLDTRASSHSSSDTFTPSPLNCTMEPATLSPKSMR 
-hit_name YFL031W -bits 19.4 -query_name CREB1_MONKEY -evalue 1.1 (qs='
STACK Bio::Search::HSP::GenericHSP::new 
/Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/Search/HSP/GenericHSP.pm:231
STACK Bio::Search::HSP::FastaHSP::new 
/Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/Search/HSP/FastaHSP.pm:97
STACK Bio::Factory::ObjectFactory::create_object 
/Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/Factory/ObjectFactory.pm:150
STACK Bio::SearchIO::SearchResultEventBuilder::end_hsp 
/Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/SearchIO/SearchResultEventBuilder.pm:275
STACK Bio::SearchIO::fasta::end_element 
/Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/SearchIO/fasta.pm:872
STACK Bio::SearchIO::fasta::next_result 
/Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/SearchIO/fasta.pm:403
STACK toplevel 
/Users/budd/scripts/test_scripts/test_parsing_fasta_output.pl:22

--------------------------------------

-- 
----------------------------------------------------------------------
Aidan Budd, PhD                               tel:+49 (0)6221 387 8530
EMBL - European Molecular Biology Laboratory  fax:+49 (0)6221 387 8517
Meyerhofstr. 1, 69117 Heidelberg, Germany

URL: http://www-db.embl.de/jss/EmblGroupsHD/per_1807.html
-------------- next part --------------
# fasta34 -m 2 creb1_human.fasta yeast_bzips_from_ensembl.fasta
FASTA searches a protein or DNA sequence data bank
 version 34.26 January 12, 2007
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query library creb1_human.fasta vs yeast_bzips_from_ensembl.fasta library
searching yeast_bzips_from_ensembl.fasta library

  1>>>CREB1_MONKEY 341 aa - 341 aa
 vs  yeast_bzips_from_ensembl.fasta library

   3683 residues in    10 sequences
 MLE_cen statistics: Lambda= 0.0338;  K=8.757e-05 (cen=0)

FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 37, opt: 25, open/ext: -10/-2, width:  16
 Scan time:  0.000
The best scores are:                                      opt bits E(10)
YFL031W                                            ( 238)  122 19.4     1.1
YEL009C                                            ( 281)  121 19.4     1.3
YIL036W                                            ( 587)  129 19.8       2
YIR017C                                            ( 187)   83 17.5     2.9
YVNL167C                                           ( 647)  119 19.3     2.9
YIR018W                                            ( 245)   67 16.7     5.3
YER045C                                            ( 489)   73 17.0     7.1
YDR259C                                            ( 383)   62 16.5     7.5
YOR028C                                            ( 296)   41 15.5     8.9
YHL009C                                            ( 330)   33 15.1     9.6

>>YFL031W                                                 (238 aa)
 initn: 107 init1: 107 opt: 122  Z-score: 62.4  bits: 19.4 E():  1.1
Smith-Waterman score: 122;  27.660% identity (63.830% similar) in 94 aa overlap (248-337:2-95)

       220       230       240       250       260       270       
CREB1_ GTTILQYAQTTDGQQILVPSNQVVVQAASGDVQTYQIRTAPTSTIAPGVVMASS--PALP
YFL031                              MEMTDFELTSNSQ.NL.IPTNFK.TLP.RKR

         280       290       300       310       320        330    
CREB1_ TQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVLENQ-NKTLIEELKALKD
YFL031 AKTK..KEQR.IE.ILR..R..HQS.E..RLHLQY..RKCSL...LL.SVNL.K.ADHE.

           340                                                     
CREB1_ -LYCHKSD                                                    
YFL031 A.T.SHDAFVASLDEYRDFQSTRGASLDTRASSHSSSDTFTPSPLNCTMEPATLSPKSMR

>>YEL009C                                                 (281 aa)
 initn: 138 init1:  83 opt: 121  Z-score: 60.8  bits: 19.4 E():  1.3
Smith-Waterman score: 121;  29.412% identity (55.462% similar) in 119 aa overlap (219-335:165-277)

      190       200       210       220       230       240        
CREB1_ GAIQLANNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQVVVQAASGD
YEL009 VSLADKAIESTEEVSLVPSNLEVSTTSFLP.PV.ED.KL.QTRKVKK.NS--..KKSHHV

      250       260       270         280       290       300      
CREB1_ VQTYQIRTAPTSTIAPGVVMASSPALPTQP--AEEAARKREVRLMKNREAARECRRKKKE
YEL009 GKDDES.LDHLGVV.YNRKQR.I.LS.IV.ESSDP..L..----AR.T....RS.AR.LQ

        310       320       330       340 
CREB1_ YVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD
YEL009 RM.Q..DK.EE.LSK.YH.EN.VAR..K.VGER  

>>YIL036W                                                 (587 aa)
 initn: 132 init1:  70 opt: 129  Z-score: 57.2  bits: 19.8 E():    2
Smith-Waterman score: 129;  18.750% identity (55.682% similar) in 352 aa overlap (2-335:137-477)

                                            10        20           
CREB1_                              MTMESGAENQQSGDAAVTEAENQQM--TVQA
YIL036 RVVKPSANSNYQQAAYLRQQQQQDQRQQSPS.KTEE.S.LY..ILMNSGVV.D.HQNLAT

      30        40        50        60        70        80         
CREB1_ QPQIATLAQVSMPAAHATSSAPTVTLVQLPNGQTVQVHGVIQAAQPSVIQSPQVQTVQSS
YIL036 HTNLSQ.SSTRKS.PNDSTT...-NASNIA.--.AS.NKQMYFMNMNMNNN.HALNDP.I

      90         100       110         120       130       140     
CREB1_ CKDLKRLFS--GTQISTIAESEDS--QESVDSVTDSQKRREILSRRPSYRKILNDL----
YIL036 LET.SPF.QPF.VDVAHLPMTNPPIF.S.LPGCDEPIR..R.SISNGQISQLGE.IETLE

                150       160          170        180       190    
CREB1_ ---SSDAPGVPRIEEEKSEEET---SAPAITTVTVP-TPIYQTSSGQYIAITQGGAIQLA
YIL036 NLHNTQP.PM.NFHNYNGLSQ.RNV.NKPVFNQA..VSS.P.YNAKKV.NP.KDS.--.G

          200       210       220       230       240       250    
CREB1_ NNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQVVVQAASGDVQTYQI
YIL036 DQSVIYSKSQ.RNFVNAPSKNT.AES.----SDLE.MTTFA.TTGGENRGK.ALRESHSN

           260       270       280       290       300       310   
CREB1_ RT-APTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLEN
YIL036 PSFT.K.QGSHLNLA.NTQGN.I-.GT-T.W..ARL.ER..I..SK..QR..VAQLQ.QK

           320       330       340                                 
CREB1_ RVAVLENQNKTLIEELKALKDLYCHKSD                                
YIL036 EFNEIKDE.RI.LKK.NYYEK.ISKFKKFSKIHLREHEKLNKDSDNNVNGTNSSNKNESM

>>YIR017C                                                 (187 aa)
 initn:  43 init1:  43 opt:  83  Z-score: 54.0  bits: 17.5 E():  2.9
Smith-Waterman score: 84;  22.785% identity (56.962% similar) in 158 aa overlap (176-330:9-148)

         150       160       170       180       190       200     
CREB1_ PGVPRIEEEKSEEETSAPAITTVTVPTPIYQTSSGQYIAITQGGAIQLANNGTDGVQGLQ
YIR017                       MSAKQGWEKK.TNID..SRK.MNV---..LSEHL.N.I

         210       220       230       240        250       260    
CREB1_ TLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQVVVQAASG-DVQTYQIRTAPTS--TI
YIR017 S------SDSEL.SRL.SLLLVSS.N-----AEELISMINN.Q..SQFKKLRE.RKGKVA

            270       280       290       300       310       320  
CREB1_ APGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVLENQN
YIR017 .TTA.VVKEEEA.VSTSN.LDKIKQE.RR..T..SQRF.IR..Q--.NF..-MNK.Q.L.

            330       340                             
CREB1_ KTLIEELKALKDLYCHKSD                            
YIR017 -.Q.NK.RDRIEQLNKENEFWKAKLNDINEIKSLKLLNDIKRRNMGR

>>YVNL167C                                                (647 aa)
 initn: 142 init1: 119 opt: 119  Z-score: 53.8  bits: 19.3 E():  2.9
Smith-Waterman score: 119;  39.623% identity (62.264% similar) in 53 aa overlap (280-332:426-478)

     250       260       270       280       290       300         
CREB1_ QTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVK
YVNL16 RKNSAVTTAPAQKDDVENNKISNNVTLDEN..QE...KEF.ER..V..SKF.KR....I.

     310       320       330       340                             
CREB1_ CLENRVAVLENQNKTLIEELKALKDLYCHKSD                            
YVNL16 KI..DLQFY.SEYDD.TQVIGK.CGIIPSSSSNSQFNVNVSTPSSSSPPSTSLIALLESS

>>YIR018W                                                 (245 aa)
 initn:  61 init1:  61 opt:  67  Z-score: 47.6  bits: 16.7 E():  5.3
Smith-Waterman score: 67;  25.455% identity (61.818% similar) in 55 aa overlap (280-334:55-109)

     250       260       270       280       290       300         
CREB1_ QTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVK
YIR018 SKNWKLPPRLPHRAAQRRKRVHRLHEDYET..NDEELQKKKRQ..D.Q.AY.ER.NNKLQ

     310       320       330       340                             
CREB1_ CLENRVAVLENQNKTLIEELKALKDLYCHKSD                            
YIR018 V..ETIES.SKVV.NYETK.NR.QNELQAKESENHALKQKLETLTLKQASVPAQDPILQN

>>YER045C                                                 (489 aa)
 initn: 111 init1:  70 opt:  73  Z-score: 43.8  bits: 17.0 E():  7.1
Smith-Waterman score: 97;  22.826% identity (67.391% similar) in 92 aa overlap (3-92:210-300)

                                           10        20         30 
CREB1_                             MTMESGAENQQSGDAAVTEAE-NQQMTVQAQP
YER045 QTGSKNIYAAMTPYDSNIKLNIPAVAATCDIP.ATPSIP...STMNQ.YI.M.LRL...M

              40        50        60         70        80        90
CREB1_ QIATLAQVSMPAAHATSSAPTVTLVQLPNGQTVQVHGV-IQAAQPSVIQSPQVQTVQSSC
YER045 .TKAWKNAQL-NV.PCTP.SNSSVSSSSSC.NIND.NIEN.SVHS.ISHGVNHH..NN..

              100       110       120       130       140       150
CREB1_ KDLKRLFSGTQISTIAESEDSQESVDSVTDSQKRREILSRRPSYRKILNDLSSDAPGVPR
YER045 QNAELNISSSLPYESKCPDVNLTHANSKPQYKDATSALKNNINSEKDVHTAPFSSMHTTA

>>YDR259C                                                 (383 aa)
 initn:  84 init1:  52 opt:  62  Z-score: 42.8  bits: 16.5 E():  7.5
Smith-Waterman score: 81;  33.333% identity (64.583% similar) in 48 aa overlap (289-330:227-274)

      260       270       280       290       300       310        
CREB1_ TSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVL
YDR259 NDNNDNVTKPVPDKDTQLISSSGKTLRNTR.AAQ..T.QKAF.QR.EK.I.N..QKSKIF

           320        330       340                                
CREB1_ -----ENQN-KTLIEELKALKDLYCHKSD                               
YDR259 DDLLA..N.F.S.NDS.RNDNNILIAQHEAIRNAITMLRSEYDVLCNENNMLKNENSIIK

>>YOR028C                                                 (296 aa)
 initn:  35 init1:  35 opt:  41  Z-score: 39.3  bits: 15.5 E():  8.9
Smith-Waterman score: 80;  33.962% identity (66.038% similar) in 53 aa overlap (289-334:243-295)

      260       270       280       290       300       310        
CREB1_ TSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVL
YOR028 LSEQVFNEGERYNNDGQLIGKTGKPLRNTK.AAQ..S.QKAF.QRREK.I.N..EKSKLF

           320        330        340 
CREB1_ -----ENQN-KTLIEELKA-LKDLYCHKSD
YOR028 DGLMK..SEL.KM..S..SK..E*      

>>YHL009C                                                 (330 aa)
 initn:  33 init1:  33 opt:  33  Z-score: 36.4  bits: 15.1 E():  9.6
Smith-Waterman score: 91;  21.667% identity (57.500% similar) in 120 aa overlap (222-333:79-194)

             200       210       220       230             240     
CREB1_ QLANNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQI-LVP-----SNQVVVQAA
YHL009 EQTAPFPILEDQCPALNLDRSNNDLLLQNNISFPKGS.L.A.Q.T.ISGDY.TY.MADNN

         250         260       270       280       290       300   
CREB1_ SGDVQTYQIRT--APTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRK
YHL009 NN.NDS.SNTNYFSKNNG.S.SSRSP.VAHNENV.DDSK.K.KA----Q..A.QKAF.ER

           310       320       330       340                       
CREB1_ KKEYVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD                      
YHL009 .EARM.E.QDKLLES.RNRQS.LK.IEE.RKANTEINAENRLLLRSGNENFSKDIEDDTN



341 residues in 1 query   sequences
3683 residues in 10 library sequences
 Scomplib [34.26]
 start: Thu Apr 26 11:52:16 2007 done: Thu Apr 26 11:52:16 2007
 Total Scan time:  0.000 Total Display time:  0.010

Function used was FASTA [version 34.26 January 12, 2007]
-------------- next part --------------
>CREB1_MONKEY
MTMESGAENQQSGDAAVTEAENQQMTVQAQPQIATLAQVSMPAAHATSSAPTVTLVQLPN
GQTVQVHGVIQAAQPSVIQSPQVQTVQSSCKDLKRLFSGTQISTIAESEDSQESVDSVTD
SQKRREILSRRPSYRKILNDLSSDAPGVPRIEEEKSEEETSAPAITTVTVPTPIYQTSSG
QYIAITQGGAIQLANNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQV
VVQAASGDVQTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAAREC
RRKKKEYVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD
-------------- next part --------------
>YIL036W
MFTGQEYHSVDSNSNKQKDNNKRGIDDTSKILNNKIPHSVSDTSAAATTTSTMNNSALSR
SLDPTDINYSTNMAGVVDQIHDYTTSNRNSLTPQYSIAAGNVNSHDRVVKPSANSNYQQA
AYLRQQQQQDQRQQSPSMKTEEESQLYGDILMNSGVVQDMHQNLATHTNLSQLSSTRKSA
PNDSTTAPTNASNIANTASVNKQMYFMNMNMNNNPHALNDPSILETLSPFFQPFGVDVAH
LPMTNPPIFQSSLPGCDEPIRRRRISISNGQISQLGEDIETLENLHNTQPPPMPNFHNYN
GLSQTRNVSNKPVFNQAVPVSSIPQYNAKKVINPTKDSALGDQSVIYSKSQQRNFVNAPS
KNTPAESISDLEGMTTFAPTTGGENRGKSALRESHSNPSFTPKSQGSHLNLAANTQGNPI
PGTTAWKRARLLERNRIAASKCRQRKKVAQLQLQKEFNEIKDENRILLKKLNYYEKLISK
FKKFSKIHLREHEKLNKDSDNNVNGTNSSNKNESMTVDSLKIIEELLMIDSDVTEVDKDT
GKIIAIKHEPYSQRFGSDTDDDDIDLKPVEGGKDPDNQSLPNSEKIK
>YIR017C
MSAKQGWEKKSTNIDIASRKGMNVNNLSEHLQNLISSDSELGSRLLSLLLVSSGNAEELI
SMINNGQDVSQFKKLREPRKGKVAATTAVVVKEEEAPVSTSNELDKIKQERRRKNTEASQ
RFRIRKKQKNFENMNKLQNLNTQINKLRDRIEQLNKENEFWKAKLNDINEIKSLKLLNDI
KRRNMGR
>YVNL167C
MSSEERSRQPSTVSTFDLEPNPFEQSFASSKKALSLPGTISHPSLPKELSRNNSTSTITQ
HSQRSTHSLNSIPEENGNSTVTDNSNHNDVKKDSPSFLPGQQRPTIISPPILTPGGSKRL
PPLLLSPSILYQANSTTNPSQNSHSVSVSNSNPSAIGVSSTSGSLYPNSSSPSGTSLIRQ
PRNSNVTTSNSGNGFPTNDSQMPGFLLNLSKSGLTPNESNIRTGLTPGILTQSYNYPVLP
SINKNTITGSKNVNKSVTVNGSIENHPHVNIMHPTVNGTPLTPGLSSLLNLPSTGVLANP
VFKSTPTTNTTDGTVNNSISNSNFSPNTSTKAAVKMDNPAEFNAIEHSAHNHKENENLTT
QIENNDQFNNKTRKRKRRMSSTSSTSKASRKNSISRKNSAVTTAPAQKDDVENNKISNNV
TLDENEEQERKRKEFLERNRVAASKFRKRKKEYIKKIENDLQFYESEYDDLTQVIGKLCG
IIPSSSSNSQFNVNVSTPSSSSPPSTSLIALLESSISRSDYSSAMSVLSNMKQLICETNF
YRRGGKNPRDDMDGQEDSFNKDTNVVKSENAGYPSVNSRPIILDKKYSLNSGANISKSNT
TTNNVGNSAQNIINSCYSVTNPLVINANSDTHDTNKHDVLSTLPHNN
>YER045C
MDYKHNFATSPDSFLDGRQNPLLYTDFLSSNKELIYKQPSGPGLVDSAYNFHHQNSLHDR
SVQENLGPMFQPFGVDISHLPITNPPIFQSSLPAFDQPVYKRRISISNGQISQLGEDLET
VENLYNCQPPILSSKAQQNPNPQQVANPSAAIYPSFSSNELQNVPQPHEQATVIPEAAPQ
TGSKNIYAAMTPYDSNIKLNIPAVAATCDIPSATPSIPSGDSTMNQAYINMQLRLQAQMQ
TKAWKNAQLNVHPCTPASNSSVSSSSSCQNINDHNIENQSVHSSISHGVNHHTVNNSCQN
AELNISSSLPYESKCPDVNLTHANSKPQYKDATSALKNNINSEKDVHTAPFSSMHTTATF
QIKQEARPQKIENNTAGLKDGAKAWKRARLLERNRIAASKCRQRKKMSQLQLQREFDQIS
KENTMMKKKIENYEKLVQKMKKISRLHMQECTINGGNNSYQSLQNKDSDVNGFLKMIEEM
IRSSSLYDE
>YIR018W
MALPLIKPKESEESHLALLSKIHVSKNWKLPPRLPHRAAQRRKRVHRLHEDYETEENDEE
LQKKKRQNRDAQRAYRERKNNKLQVLEETIESLSKVVKNYETKLNRLQNELQAKESENHA
LKQKLETLTLKQASVPAQDPILQNLIENFKPMKAIPIKYNTAIKRHQHSTELPSSVKCGF
CNDNTTCVCKELETDHRKSDDGVATEQKDMSMPHAECNNKDNPNGLCSNCTNIDKSCIDI
RSIIH
>YHL009C
MTPSNMDDNTSGFMKFINPQCQEEDCCIRNSLFQEDSKCIKQQPDLLSEQTAPFPILEDQ
CPALNLDRSNNDLLLQNNISFPKGSDLQAIQLTPISGDYSTYVMADNNNNDNDSYSNTNY
FSKNNGISPSSRSPSVAHNENVPDDSKAKKKAQNRAAQKAFRERKEARMKELQDKLLESE
RNRQSLLKEIEELRKANTEINAENRLLLRSGNENFSKDIEDDTNYKYSFPTKDEFFTSMV
LESKLNHKGKYSLKDNEIMKRNTQYTDEAGRHVLTVPATWEYLYKLSEERDFDVTYVMSK
LQGQECCHTHGPAYPRSLIDFLVEEATLNE
>YOR028C
MLMQIKMDNHPFNFQPILASHSMTRDSTKPKKMTDTAFVPSPPVGFIKEENKADLHTISV
VASNVTLPQIQLPKIATLEEPGYESRTGSLTDLSGRRNSVNIGALCEDVPNTAGPHIARP
VTINNLIPPSLPRLNTYQLRPQLSDTHLNCHFNSNPYTTASHAPFESSYTTASTFTSQPA
ASYFPSNSTPATRKNSATTNLPSEERRRVSVSLSEQVFNEGERYNNDGQLIGKTGKPLRN
TKRAAQNRSAQKAFRQRREKYIKNLEEKSKLFDGLMKENSELKKMIESLKSKLKE*
>YEL009C
MSEYQPSLFALNPMGFSPLDGSKSTNENVSASTSTAKPMVGQLIFDKFIKTEEDPIIKQD
TPSNLDFDFALPQTATAPDAKTVLPIPELDDAVVESFFSSSTDSTPMFEYENLEDNSKEW
TSLFDNDIPVTTDDVSLADKAIESTEEVSLVPSNLEVSTTSFLPTPVLEDAKLTQTRKVK
KPNSVVKKSHHVGKDDESRLDHLGVVAYNRKQRSIPLSPIVPESSDPAALKRARNTEAAR
RSRARKLQRMKQLEDKVEELLSKNYHLENEVARLKKLVGER
>YDR259C
MQNPPLIRPDMYNQGSSSMATYNASEKNLNEHPSPQIAQPSTSQKLPYRINPTTTNGDTD
ISVNSNPIQPPLPNLMHLSGPSDYRSMHQSPIHPSYIIPPHSNERKQSASYNRPQNAHVS
IQPSVVFPPKSYSISYAPYQINPPLPNGLPNQSISLNKEYIAEEQLSTLPSRNTSVTTAP
PSFQNSADTAKNSADNNDNNDNVTKPVPDKDTQLISSSGKTLRNTRRAAQNRTAQKAFRQ
RKEKYIKNLEQKSKIFDDLLAENNNFKSLNDSLRNDNNILIAQHEAIRNAITMLRSEYDV
LCNENNMLKNENSIIKNEHNMSRNENENLKLENKRFHAEYIRMIEDIENTKRKEQEQRDE
IEQLKKKIRSLEEIVGRHSDSAT
>YFL031W
MEMTDFELTSNSQSNLAIPTNFKSTLPPRKRAKTKEEKEQRRIERILRNRRAAHQSREKK
RLHLQYLERKCSLLENLLNSVNLEKLADHEDALTCSHDAFVASLDEYRDFQSTRGASLDT
RASSHSSSDTFTPSPLNCTMEPATLSPKSMRDSASDQETSWELQMFKTENVPESTTLPAV
DNNNLFDAVASPLADPLCDDIAGNSLPFDNSIDLDNWRNPEAQSGLNSFELNDFFITS


More information about the Bioperl-l mailing list