[Bioperl-l] Protein interaction modules
Richard.Adams at ed.ac.uk
Thu Jul 15 05:54:30 EDT 2004
I'd like to bring to peoples' attention some modules for parsing and
analyzing protein interaction data.
They are in CVS under Bio/Graph and are :
are modules for reading /writing graph data and function in an analagous
way to the SeqIO system.
are generic graph modules written by Nat Goodman and provide
functionality for traversing and building graphs.
These are independent of BioPerl but are added here as they're not in
extend the SimpleGraph modules to deal with multiple sequence
identifiers, duplicate edges and more complex
data about the nature of an interaction.
In this implementation, nodes are Bio::Seq objects. Interactions are
represented by Bio/Graph/Edge objects
These modules are very much biologically orientated, and are written
with the following sort of tasks in mind:
How can I annotate my sequences with interaction data?
Which nodes cause the most disruption to the network if perturbed?
What happens to network properties if a node is deleted?
How can I merge 2 protein interaction data sets together, and find
How can I calculate basic graph properties of my interaction data set ?
e.g., density, clustering coefficient.
code to demonstrate some of these tasks can be found in the Synopsis of
There is test suite , t/protgraph.t which test most of the methods.
I'd be very interested in feedback, ideas for what to include in a
protein/DNA or protein/RNA interaction class,
To use these modules you need:
XML::Twig if you want to parse XML
Class::AutoClass - the SimpleGraph modules depend on this.
The test suite tests for these modules.
Obvious improvements are :
AT present the XML parser just gets the basic interaction data, not the
An psi_xml writer.
Dr Richard Adams
Psychiatric Genetics Group,
Molecular Medicine Centre,
Western General Hospital,
Crewe Rd West,
Tel: 44 131 651 1084
richard.adams at ed.ac.uk
More information about the Bioperl-l