[Bioperl-l] Extract features from GFF
barry.moore at genetics.utah.edu
Tue Oct 23 00:37:07 EDT 2007
Oh yeah, much better option if you've got a database running. Look
at the docs for the modules Chris suggested. Something like:
my $db = Bio::DB::SeqFeature::Store->new( -adaptor => 'DBI::mysql',
-write => 1 );
# load an entire GFF3 file, using the GFF3 loader...
my $loader = Bio::DB::SeqFeature::Store::GFF3Loader->new(-store =>
-verbose => 1,
-fast => 1);
@features = $db->get_features_by_location(-seq_id=>'Chr1',-
On Oct 22, 2007, at 6:43 PM, Chris Fields wrote:
> On Oct 22, 2007, at 5:30 PM, Hang wrote:
>> I have a list of about 100,000 short genomic regions with paired
>> start and end
>> coordinations on reference fly genome (R5.3). I also have GFF files
>> from the
>> same genome release. I wonder how I can extract all overlapping
>> features from
>> these regions.
>> For example:
>> region A is on chromosome 2L between 123,456 bp to 123,489 bp. What
>> code should
>> I use to extract feature, like gene, CDS etc., that overlaps with
>> this region?
>> Thank you in advance!
>> -- Hang
> Look into using Bio::DB::GFF or Bio::DB::SeqFeature::Store; this will
> depend on the GFF version of the data you have.
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
More information about the Bioperl-l