Eukaryotic gene example
Many genomes have been sequenced, and their gene sequences are stored in general DNA sequence databases (e.g. GenBank) and in species-specific databases (e.g. The Arabidopsis Information Resource (TAIR)).
These images show views of the gene, its cDNA, and its coding sequence (CDS), which researchers use to study the gene further. Such research might involve seed germination or plant flavour.
Note that TAIR provides three views of the 'Nucleotide Sequence': 'full length CDS', 'full length cDNA' (Fig. 1), and 'full length genomic' (Fig. 2). Each of them uses the DNA alphabet, although strictly the CDS should be shown as RNA (AUG etc.).
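The difference between the DNA and RNA alphabets mentioned above amounts to writing U in place of T. A minimal Python sketch, assuming a made-up toy sequence (not a real TAIR record) and a hypothetical helper name `dna_to_rna`:

```python
def dna_to_rna(cds: str) -> str:
    """Rewrite a coding-strand DNA sequence in the RNA alphabet (T becomes U)."""
    return cds.upper().replace("T", "U")


# Hypothetical toy CDS for illustration only; it is not taken from TAIR.
toy_cds = "ATGGCTTAA"
print(dna_to_rna(toy_cds))  # AUGGCUUAA
```

The start codon ATG in the database view therefore corresponds to AUG in the mRNA.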
The mRNA is composed of a 5' UTR (red), the CDS (uppercase, yellow), and a 3' UTR (red again). All three of these regions are exonic (not just the CDS). Introns are shown in purple (lowercase). For convenience, neither the 5' cap nor the 3' tail is shown in the cDNA (Fig. 1), although the mRNA will have them. The gene sequence is also shown in a form where the codons can be read (ATG...), rather than as the template DNA strand, which is the strand actually copied into mRNA.
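The layout described above can be sketched in Python. This is an illustration under stated assumptions, not a description of how TAIR formats its records: the toy sequence is invented, the helper names `split_cdna` and `template_strand` are hypothetical, and the UTRs are assumed to be written in lowercase so that the uppercase CDS can be located by case alone.

```python
import re


def split_cdna(cdna: str):
    """Split a cDNA into (5' UTR, CDS, 3' UTR).

    Assumption: the CDS is the single uppercase run and both UTRs are lowercase.
    """
    match = re.fullmatch(r"([a-z]*)([A-Z]+)([a-z]*)", cdna)
    if match is None:
        raise ValueError("expected lowercase UTRs around one uppercase CDS")
    return match.groups()


# Translation table for complementing bases while preserving case.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def template_strand(coding: str) -> str:
    """Return the template strand (reverse complement), written 5' to 3'."""
    return coding.translate(COMPLEMENT)[::-1]


# Hypothetical toy cDNA for illustration only; it is not a real gene.
toy_cdna = "gcaATGGCTTAAtt"
five_utr, cds, three_utr = split_cdna(toy_cdna)
print(five_utr, cds, three_utr)  # gca ATGGCTTAA tt
print(template_strand(cds))      # TTAAGCCAT
```

The codons ATG, GCT, and TAA are readable in the coding-strand view, whereas the template strand, the one that is copied during transcription, reads TTAAGCCAT when written 5' to 3'.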