help  | about  | cite  | software

Publication : Analysis and recognition of the GAGA transcription factor binding sites in Drosophila genes.

First Author  Omelina E S Year  2011
Journal  Comput Biol Chem Volume  35
Pages  363-70 PubMed ID  22099633
Abstract Text  The transcription factor GAGA, encoded by the gene Trl, controls expression of many Drosophila melanogaster genes. We have compiled the presently largest sample (120 sites) of published nucleotide sequences with experimentally confirmed binding to GAGA protein. Analysis of the sample has demonstrated that despite an apparent structural diversity of the GAGA sites, they fall into four distinct groups, namely, (1) the sites containing two GAG trinucleotides with no more than one nucleotide substitution in each and separated by spacers with a length of 1 or 3 nucleotides (GAGnGAG and GAGnnnGAG); (2) the sites containing a single GAGAG motif; (3) (GA)(3-9) microsatellite repeats; and (4) the sites corresponding to three and more direct repeats of GAG trinucleotide homolog and its inverted repeats separated by spacers of various lengths. Using the software package SITECON, the methods were elaborated for recognizing the sites of GAGnGAG (method 1) and GAGnnnGAG (method 2) types in DNA sequences. Experimental verification confirmed the ability to interact with the GAGA factor for 72% of the sites predicted using method 1 and 94.5% of the sites predicted by method 2. Application of the experimentally verified methods to analyzing the localization of potential GAGA binding sites in the target genes of this transcription factor has demonstrated that the 5'-untranslated regions (5'UTRs) and first introns are enriched for these sites (two-threefold relative to the average occurrence frequency in the D. melanogaster genome) as compared with a moderate enrichment (not exceeding 1.5-fold) of promoter regions (-4000/+200 bp or -1000/+100 bp). Doi  10.1016/j.compbiolchem.2011.10.008
Issue  6 Month  Dec

Publication Annotations Displayer

1 Entities

14 Mesh Terms