Elongation and Termination in Eukaryotes
Elongation synthesizes pre-mRNA in a 5′ to 3′ direction, and termination occurs in response to termination sequences and signals.
Describe what is happening during transcription elongation and termination
- RNA polymerase II (RNAPII) transcribes the major share of eukaryotic genes.
- During elongation, the transcription machinery needs to move histones out of the way every time it encounters a nucleosome.
- Transcription elongation occurs in a bubble of unwound DNA, where the RNA Polymerase uses one strand of DNA as a template to catalyze the synthesis of a new RNA strand in the 5′ to 3′ direction.
- RNA Polymerase I and RNA Polymerase III terminate transcription in response to specific termination sequences in either the DNA being transcribed (RNA Polymerase I) or in the newly-synthesized RNA (RNA Polymerase III).
- RNA Polymerase II terminates transcription at random locations past the end of the gene being transcribed. The newly-synthesized RNA is cleaved at a sequence-specified location and released before transcription terminates.
- nucleosome: any of the subunits that repeat in chromatin; a coil of DNA surrounding a histone core
- histone: any of various simple water-soluble proteins that are rich in the basic amino acids lysine and arginine and are complexed with DNA in the nucleosomes of eukaryotic chromatin
- chromatin: a complex of DNA, RNA, and proteins within the cell nucleus out of which chromosomes condense during cell division
Transcription through Nucleosomes
Following the formation of the pre-initiation complex, the polymerase is released from the other transcription factors, and elongation is allowed to proceed with the polymerase synthesizing RNA in the 5′ to 3′ direction. RNA Polymerase II (RNAPII) transcribes the major share of eukaryotic genes, so this section will mainly focus on how this specific polymerase accomplishes elongation and termination.
Although the enzymatic process of elongation is essentially the same in eukaryotes and prokaryotes, the eukaryotic DNA template is more complex. When eukaryotic cells are not dividing, their genes exist as a diffuse, but still extensively packaged and compacted mass of DNA and proteins called chromatin. The DNA is tightly packaged around charged histone proteins at repeated intervals. These DNA–histone complexes, collectively called nucleosomes, are regularly spaced and include 146 nucleotides of DNA wound twice around the eight histones in a nucleosome like thread around a spool.
For polynucleotide synthesis to occur, the transcription machinery needs to move histones out of the way every time it encounters a nucleosome. This is accomplished by a special protein dimer called FACT, which stands for “facilitates chromatin transcription.” FACT partially disassembles the nucleosome immediately ahead (upstream) of a transcribing RNA Polymerase II by removing two of the eight histones (a single dimer of H2A and H2B histones is removed.) This presumably sufficiently loosens the DNA wrapped around that nucleosome so that RNA Polymerase II can transcribe through it. FACT reassembles the nucleosome behind the RNA Polymerase II by returning the missing histones to it. RNA Polymerase II will continue to elongate the newly-synthesized RNA until transcription terminates.
RNA Polymerase II is a complex of 12 protein subunits. Specific subunits within the protein allow RNA Polymerase II to act as its own helicase, sliding clamp, single-stranded DNA binding protein, as well as carry out other functions. Consequently, RNA Polymerase II does not need as many accessory proteins to catalyze the synthesis of new RNA strands during transcription elongation as DNA Polymerase does to catalyze the synthesis of new DNA strands during replication elongation.
However, RNA Polymerase II does need a large collection of accessory proteins to initiate transcription at gene promoters, but once the double-stranded DNA in the transcription start region has been unwound, the RNA Polymerase II has been positioned at the +1 initiation nucleotide, and has started catalyzing new RNA strand synthesis, RNA Polymerase II clears or “escapes” the promoter region and leaves most of the transcription initiation proteins behind.
All RNA Polymerases travel along the template DNA strand in the 3′ to 5′ direction and catalyze the synthesis of new RNA strands in the 5′ to 3′ direction, adding new nucleotides to the 3′ end of the growing RNA strand.
RNA Polymerases unwind the double stranded DNA ahead of them and allow the unwound DNA behind them to rewind. As a result, RNA strand synthesis occurs in a transcription bubble of about 25 unwound DNA basebairs. Only about 8 nucleotides of newly-synthesized RNA remain basepaired to the template DNA. The rest of the RNA molecules falls off the template to allow the DNA behind it to rewind.
RNA Polymerases use the DNA strand below them as a template to direct which nucleotide to add to the 3′ end of the growing RNA strand at each point in the sequence. The RNA Polymerase travels along the template DNA one nucleotide at at time. Whichever RNA nucleotide is capable of basepairing to the template nucleotide below the RNA Polymerase is the next nucleotide to be added. Once the addition of a new nucleotide to the 3′ end of the growing strand has been catalyzed, the RNA Polymerase moves to the next DNA nucleotide on the template below it. This process continues until transcription termination occurs.
The termination of transcription is different for the three different eukaryotic RNA polymerases.
The ribosomal rRNA genes transcribed by RNA Polymerase I contain a specific sequence of basepairs (11 bp long in humans; 18 bp in mice) that is recognized by a termination protein called TTF-1 (Transcription Termination Factor for RNA Polymerase I.) This protein binds the DNA at its recognition sequence and blocks further transcription, causing the RNA Polymerase I to disengage from the template DNA strand and to release its newly-synthesized RNA.
The protein-encoding, structural RNA, and regulatory RNA genes transcribed by RNA Polymerse II lack any specific signals or sequences that direct RNA Polymerase II to terminate at specific locations. RNA Polymerase II can continue to transcribe RNA anywhere from a few bp to thousands of bp past the actual end of the gene. However, the transcript is cleaved at an internal site before RNA Polymerase II finishes transcribing. This releases the upstream portion of the transcript, which will serve as the initial RNA prior to further processing (the pre-mRNA in the case of protein-encoding genes.) This cleavage site is considered the “end” of the gene. The remainder of the transcript is digested by a 5′-exonuclease (called Xrn2 in humans) while it is still being transcribed by the RNA Polymerase II. When the 5′-exonulease “catches up” to RNA Polymerase II by digesting away all the overhanging RNA, it helps disengage the polymerase from its DNA template strand, finally terminating that round of transcription.
In the case of protein-encoding genes, the cleavage site which determines the “end” of the emerging pre-mRNA occurs between an upstream AAUAAA sequence and a downstream GU-rich sequence separated by about 40-60 nucleotides in the emerging RNA. Once both of these sequences have been transcribed, a protein called CPSF in humans binds the AAUAAA sequence and a protein called CstF in humans binds the GU-rich sequence. These two proteins form the base of a complicated protein complex that forms in this region before CPSF cleaves the nascent pre-mRNA at a site 10-30 nucleotides downstream from the AAUAAA site. The Poly(A) Polymerase enzyme which catalyzes the addition of a 3′ poly-A tail on the pre-mRNA is part of the complex that forms with CPSF and CstF.
The tRNA, 5S rRNA, and structural RNAs genes transcribed by RNA Polymerase III have a not-entirely-understood termination signal. The RNAs transcribed by RNA Polymerase III have a short stretch of four to seven U’s at their 3′ end. This somehow triggers RNA Polymerase III to both release the nascent RNA and disengage from the template DNA strand.
The Promoter and the Transcription Machinery
When transcription factors bind to the promoter region, RNA polymerase is placed in an orientation that allows transcription to begin.
Describe the role of promoters in RNA transcription
- The purpose of the promoter is to bind transcription factors that control the initiation of transcription.
- The promoter region can be short or quite long; the longer the promoter is, the more available space for proteins to bind.
- To initiate transcription, a transcription factor (TFIID) binds to the TATA box, which causes other transcription factors to subsequently bind to the TATA box.
- Once the transcription initiation complex is assembled, RNA polymerase can bind to its upstream sequence and is then phosphorylated.
- Phosphorylation of RNA polymerase releases part of the protein from the DNA to activate the transcription initiation complex and places RNA polymerase in the correct orientation to begin transcription.
- Transcription factors respond to environmental stimuli that cause the proteins to find their binding sites and initiate transcription of the gene that is needed.
- TATA box: a DNA sequence (cis-regulatory element) found in the promoter region of genes in archaea and eukaryotes
- transcription factor: a protein that binds to specific DNA sequences, thereby controlling the flow (or transcription) of genetic information from DNA to mRNA
- promoter: the section of DNA that controls the initiation of RNA transcription
The Promoter and the Transcription Machinery
Genes are organized to make the control of gene expression easier. The promoter region is immediately upstream of the coding sequence. This region can be short (only a few nucleotides in length) or quite long (hundreds of nucleotides long). The longer the promoter, the more available space for proteins to bind. This also adds more control to the transcription process. The length of the promoter is gene-specific and can differ dramatically between genes. Consequently, the level of control of gene expression can also differ quite dramatically between genes. The purpose of the promoter is to bind transcription factors that control the initiation of transcription.
Within the promoter region, just upstream of the transcriptional start site, resides the TATA box. This box is simply a repeat of thymine and adenine dinucleotides (literally, TATA repeats). RNA polymerase binds to the transcription initiation complex, allowing transcription to occur. To initiate transcription, a transcription factor (TFIID) is the first to bind to the TATA box. Binding of TFIID recruits other transcription factors, including TFIIB, TFIIE, TFIIF, and TFIIH to the TATA box. Once this transcription initiation complex is assembled, RNA polymerase can bind to its upstream sequence. When bound along with the transcription factors, RNA polymerase is phosphorylated. This releases part of the protein from the DNA to activate the transcription initiation complex and places RNA polymerase in the correct orientation to begin transcription; DNA-bending protein brings the enhancer, which can be quite a distance from the gene, in contact with transcription factors and mediator proteins.
In addition to the general transcription factors, other transcription factors can bind to the promoter to regulate gene transcription. These transcription factors bind to the promoters of a specific set of genes. They are not general transcription factors that bind to every promoter complex, but are recruited to a specific sequence on the promoter of a specific gene. There are hundreds of transcription factors in a cell that each bind specifically to a particular DNA sequence motif. When transcription factors bind to the promoter just upstream of the encoded gene, they are referred to as cis-acting elements because they are on the same chromosome, just next to the gene. The region that a particular transcription factor binds to is called the transcription factor binding site. Transcription factors respond to environmental stimuli that cause the proteins to find their binding sites and initiate transcription of the gene that is needed.