Bioperl: A greedy consensus builder

Mike Cariaso
Wed, 04 Aug 1999 17:39:40 -0700

To scratch an itch I've built a small object which ISA Bio::UnivAln.
It has a single method 'greedy' which constucts a consensus sequence using a
somewhat different method. The approach does ignores alignment gaps that are
before (or after) the first (last) non-gap of a row. An example may explain it
more clearly.

Sample alignment:

row6  :-------------------------------------------CGCTCGCCTCGCTCCTC---CCTCGCTC


So for the above alignment the current technique will call the first 20 or so
bases as gaps since that is the most common char. The greedy approach assumes
that this area is outside the known region of those rows, and ignores the gaps
there. This seems useful when working with small partial fragments.

If there is interest I'll be happy (honored, actually) to contribute it to
The interface has the threshold param as well as another optional one to specify
the minimum number of rows necessary to do a base call. And if you noticed any
errors in the alignment, its totally bogus data, so the mistake is mine.

mike cariaso      --------------------
ph:510-981-3156 -------------------------- fax:510-649-3449
=========== Bioperl Project Mailing List Message Footer =======
Project URL:
For info about how to (un)subscribe, where messages are archived, etc: