Promoters
Definition
A promoter is a region of DNA where transcription of a gene is initiated. Promoters are a vital component of expression vectors because they control the binding of RNA polymerase to DNA. RNA polymerase transcribes DNA to mRNA which is ultimately translated into a functional protein. Thus the promoter region controls when and where in the organism your gene of interest is expressed.
Summary
Promoters are about 100-1000 base pairs long and are adjacent and typically upstream (5’) of the sense or coding strand of the transcribed gene. The coding strand is the DNA strand that encodes codons and whose sequence corresponds to the mRNA transcript produced. The antisense strand is referred to as the template strand or non-coding strand as this is the strand that is transcribed by the RNA polymerase.
DNA sequences called response elements are located within promoter regions, and they provide a stable binding site for RNA polymerase and transcription factors. Transcription factors are proteins which recruit RNA polymerase and control and regulate the transcription of DNA into mRNA.
Promoter binding is very different in bacteria compared to eukaryotes. In bacteria, the core RNA polymerase requires an associated sigma factor for promoter recognition and binding. On the other hand, the process in eukaryotes is much more complex. Eukaryotes require a minimum of seven transcription factors in order for RNA polymerase II (a eukaryote-specific RNA polymerase) to bind to a promoter. Transcription is tightly controlled in both bacteria and eukaryotes. Promoters are controlled by various DNA regulatory sequences including enhancers, boundary elements, insulators, and silencers.
Promoter Regions
There are three main portions that make up a promoter: core promoter, proximal promoter, and distal promoter. Below describes the specifics of these regions in eukaryotic cells.
Core Promoter
The core promoter region is located most proximal to the start codon and contains the RNA polymerase binding site, TATA box, and transcription start site (TSS). RNA polymerase will bind to this core promoter region stably and transcription of the template strand can initiate. The TATA box is a DNA sequence (5'-TATAAA-3') within the core promoter region where general transcription factor proteins and histones can bind. Histones are proteins found in eukaryotic cells that package DNA into nucleosomes. Histone binding prevents the initiation of transcription whereas transcription factors promote the initiation of transcription. The most 3' portion (closest to the gene's start codon) of the core promoter is the TSS which is where transcription actually begins. Only eukaryotes and archaea, however, contain this TATA box. Most prokaryotes contain a sequence thought to be functionally equivalent called the Pribnow box which usually consists of the six nucleotides, TATAAT.
Proximal Promoter
Further upstream from the core promoter you will find the proximal promoter which contains many primary regulatory elements. The proximal promoter is found approximately 250 base pairs upstream from the TSS and it is the site where general transcription factors bind.
Distal Promoter
The final portion of the promoter region is called the distal promoter which is upstream of the proximal promoter. The distal promoter also contains transcription factor binding sites, but mostly contains regulatory elements.
Eukaryotic Promoters
Eukaryotic promoters are much more complex and diverse than prokaryotic promoters. Eukaryotic promoters span a wide range of DNA sequences. It is not unusual to have several regulatory elements such as enhancers several kilobases away from the TSS. Eukaryotic promoters are so complex in structure that the DNA tends to fold back on itself which helps to explain how many physically distant DNA sequences can affect transcription of a given gene. The TATA-binding protein binds the TATA box and helps in the subsequent binding of the RNA polymerase. A transcription complex is constructed from the RNA polymerase and several transcription factor proteins.
Common Eukaryotic Promoters Used in Research
Promoter | Expression | Description |
---|---|---|
CMV | Constitutive | Strong mammalian promoter from human cytomegalovirus |
EF1a | Constituitve | Strong mammalian promoter from human elongation factor 1 alpha |
CAG | Constitutive | Strong hybrid mammalian promoter |
PGK | Constitutive | Mammalian promoter from phospholycerate kinase gene |
TRE | Inducible | Tetracycline response element promoter |
U6 | Constitutive | Human U6 nuclear promoter for small RNA expression |
UAS | Specific | Drosophila promoter containing Gal4 binding sites |
Bacterial Promoters
Promoters in bacteria contain two short DNA sequences located at the -10 (10 bp 5' or upstream) and -35 positions from the transcription start site (TSS). Their equivalent to the eukaryotic TATA box, the Pribnow box (TATAAT) is located at the -10 position and is essential for transcription initiation. The -35 position, simply titled the -35 element, typically consists of the sequence TTGACA and this element controls the rate of transcription. Bacterial cells contain sigma factors which assist the RNA polymerase in binding to the promoter region. Each sigma factor recognizes different core promoter sequences.
Operons
Although bacterial transcription is simpler than eukaryotic transcription bacteria still have complex systems of gene regulation, like operons. Operons are a cluster of different genes that are controlled by a single promoter and operator. Operons are common in prokayotes, specifically bacteria, but have also been discovered in eukaryotes. Operons consist of a promoter, which is recognized by the RNA polymerase, an operator, a segment of DNA in which a repressor or activator can bind, and the structural genes that are transcribed together.
Operon regulation can be either negative or positive. Negative repressible operons, are normally bound by a repressor protein that prevents transcription. When an inducer molecule binds to the repressor, it changes its conformation, preventing its binding to the operator and thus allowing for transcription. The Lac operon in bacteria is an example of a negatively controlled operon.
A positive repressible operon works in the opposite way. The operon is normally transcribed until a repressor/corepressor binds to the operator preventing transcription. The trp operon involved in the production of tryptophan is an example of a positively controlled operon.
Common Bacterial Promoters used in Research
Promoter | Expression | Description |
---|---|---|
T7 | Constitutive but requires T7 RNA polymerase | Promoter from T7 bacteriophage |
Sp6 | Constitutive but requires Sp6 RNA polymerase | Promoter from Sp6 bacteriophage |
lac | Constitutive in the absense of lac repressor (lacI or lacIq). Can be induced by IPTG or lactose | Promoter from Lac operon |
araBad | Inducible by arabinose | Promoter of the arabinose metabolic operon |
trp | Repressible by tryptophan | Promoter from E. coli tryptophan operon |
Ptac | Regulated like the lac promoter | Hybrid promoter of lac and trp |
Types of RNA Polymerases
Promoters control the binding of RNA polymerase to DNA to initiate the transcription of genes. There are three types of RNA polymerases that all transcribe different genes.
RNA polymerase I transcribes genes encoding ribosomal RNA (rRNA) which is a main component of a cell’s ribosome structure. Ribosomes are the site of protein syntehsis where mRNA is translated into a protein.
RNA polymerase II transcribes messenger RNA (mRNA) which is the RNA responsible for providing a stable template for the translation of a protein.
RNA polymerase III transcribes genes encoding transfer RNAs (tRNA), the adaptor molecules that are responsible for bringing amino acids to the ribosome when proteins are being synthesized. RNA Polymerase III also transcribes small RNAs, such as shRNAs and gRNAs.
Resources
Addgene's blog, including our popular Plasmids 101 series, covers topics ranging from the newest breakthroughs in plasmid technologies and research to overviews of molecular biology basics and plasmid components.