[Bioperl-l] problem with alignments and sequence locations

Chris Fields cjfields at illinois.edu
Tue Nov 10 08:58:52 EST 2009

On Nov 10, 2009, at 6:55 AM, Steffen Heyne wrote:

> Hi,
> I'm using Bioperl for my research and it is very useful! Thank you!
> Currently I have a problem with locations tags of sequences. I read  
> in seed alignments of Rfam (in stockholm format, but I think it is  
> similar to other formats).
> If the location is like:
> AB194432.1/908-846
> the start/end values are changed to
> $seq->start = 846
> $seq->end = 908
> and therefore the new location (e.g.$seq->get_nse) is:
> AB194432.1/846-908
> The $seq->strand tag is correctly set to -1 in this case, but if the  
> alignment is written out again (clustal, stockholm,...) this strand  
> info is lost and the sequences have this "wrong" location. But this  
> information is important in respect to the sequence accession number.
> Is there a way to set the location back to the original one or is  
> this behavior desired? Any manually setting with $seq->start($val)  
> failed due to automatic checking.
> I'm using bioperl 1.6.1
> Thanks!
> steffen

This is a definite bug. We recently discussed amending the NSE format  
due to this (the subject came up over the last few months or so); it's  
fallen through the cracks.  Fortunaely it is very easy to fix (the  
relevant method is in LocatableSeq).

Does anyone have a problem with me adding this in?  It will change  
output for only those instances where the strand is -1, so


would be start = 846, end = 908, strand = -1


would be start = 846, end = 908, strand = 1


More information about the Bioperl-l mailing list