"

14.3 Transcription: from gene to message

Elizabeth Dahlhoff and Melissa Hardy

Learning Objectives

By the end of this section, you will be able to do the following:

  • Explain the role of promoters and RNA polymerase in transcription of DNA to messenger RNA.
  • Explain the significance of transcription factors in eukaryotic transcription.
  • Be able to describe the major steps of transcription, defining template strand, reading strand, and that transcription always moves from the 5- to 3′ end of the growing mRNA.
  • Describe how transcription is terminated.

Transcription

Transcription differs between prokaryotes (single-celled organisms that lack membrane-bound nuclei and membrane-enclosed organelles) and eukaryotes (organized around histone proteins), but the fundamentals are the same. Transcription of a gene begins when the enzyme RNA polymerase binds to DNA at the promoter region of that gene. A promoter is a DNA sequence onto which the transcription machinery, including RNA polymerase, binds and initiates transcription. Promoters exist a certain distance upstream of the genes they regulate.

The σ subunit of prokaryotic RNA polymerase recognizes consensus sequences found in the promoter region upstream of the transcription start site. The σ subunit dissociates from the polymerase after transcription has been initiated. (Figure by OpenStax is used under a Creative Commons Attribution license).

Transcription requires the DNA double helix to partially unwind in the region of messenger RNA (mRNA) synthesis. The region of unwinding is called a transcription bubble. Transcription always proceeds from the same DNA strand for each gene, which is called the template strand. The mRNA product is complementary and antiparallel to the template strand and is almost identical to the other DNA strand, called the coding strand. There are two differences: 1) in mRNA, all of the T nucleotides are replaced with U nucleotides; 2) in mRNA, the sugar phosphate “backbone” of the molecule is ribose, whereas for DNA, the sugar is deoxyribose. In an mRNA-DNA strand pairing during transcription, A can bind U via two hydrogen bonds, just as in A–T pairing in a DNA double helix. Messenger RNA is always synthesized in the 5′ to 3′ direction.

Once a gene is transcribed, the prokaryotic polymerase needs to be instructed to dissociate from the DNA template and liberate the newly made mRNA. Depending on the gene being transcribed, there are two kinds of termination signals. One is protein-based and the other is RNA-based. Rho-dependent termination is controlled by the rho protein, which tracks along behind the polymerase on the growing mRNA chain. Near the end of the gene, the polymerase encounters a run of G nucleotides on the DNA template and it stalls. As a result, the rho protein collides with the polymerase. The interaction with rho releases the mRNA from the transcription bubble.

 

Transcription
Messenger RNA is a copy of protein-coding information in the coding strand of DNA, with the substitution of U in the RNA for T in the coding sequence. However, new RNA nucleotides base pair with the nucleotides of the template strand. RNA is synthesized in its 5′-3′ direction, using the enzyme RNA polymerase. As the template is read, the DNA unwinds ahead of the polymerase and then rewinds behind it. (Figure by OpenStax is used under a Creative Commons Attribution license).

Unlike prokaryotes, eukaryotes have multiple, linear chromosomes. Eukaryotic mRNAs typically specify a single protein, as opposed to the mRNA transcribed from multi-gene units called operons. The structure of a generalized eukaryotic protein-coding gene and its product is shown below.

 

Illustration of a protein-coding eukaryotic gene, the pre-mRNA, and mature mRNA that it produces
The structure of a eukaryotic protein-coding gene. Regulatory sequences control when and where expression occurs for the protein coding region (red). Promoter and enhancer regions (yellow) regulate the transcription of the gene into a pre-mRNA which is modified to add a 5′ cap and poly-A tail (grey) and remove introns. The mRNA 5′ and 3′ untranslated regions (blue) regulate translation into the final protein product. (Eukaryotic gene structure by Thomas Shafee is used under a Creative Commons Attribution license).

Prokaryotes and eukaryotes perform fundamentally the same process of transcription, with a few key differences. The most important difference between prokaryote and eukaryote transcription is due to the latter’s membrane-bound nucleus and organelles. With the genes bound in a nucleus, the eukaryotic cell must be able to transport its mRNA to the cytoplasm and must protect its mRNA from degrading before it is translated. Eukaryotes also employ three different polymerases that each transcribe a different subset of genes.

RNA Polymerase Cellular Compartment Product of Transcription
I Nucleolus All rRNAs except 5S rRNA
II Nucleus All protein-coding nuclear pre-mRNAs and most small nuclear RNAs
III Nucleus 5S rRNA, tRNAs, and some small nuclear RNAs

RNA polymerase II is located in the nucleus and synthesizes all protein-coding nuclear pre-mRNAs. Eukaryotic pre-mRNAs undergo extensive processing after transcription but before translation. For clarity, we will only use the term “mRNAs” to describe only the mature, processed molecules that are ready to be translated. RNA polymerase II is responsible for transcribing the overwhelming majority of eukaryotic genes.

RNA Polymerase II Promoters and Transcription Factors

Eukaryotic promoters are much larger and more intricate than prokaryotic promoters. However, both have a sequence similar to the -10 sequence of prokaryotes. In eukaryotes, this sequence is called the TATA box, and has the consensus sequence TATAAA on the coding strand. It is located at -25 to -35 bases relative to the initiation (+1) site.

Like prokaryotic cells, the transcription of genes in eukaryotes requires the action of an RNA polymerase to bind to a DNA sequence upstream of a gene in order to initiate transcription of that gene. However, unlike prokaryotic cells, the eukaryotic RNA polymerase requires other proteins, or transcription factors, to facilitate transcription initiation. RNA polymerase by itself cannot initiate transcription in eukaryotic cells.

 

Transcription factors and RNA polymerase II at a eukaryotic promoter.
A generalized promoter of a gene transcribed by RNA polymerase II is shown. Transcription factors recognize the promoter, then RNA polymerase II binds and forms the transcription initiation complex. (Figure by OpenStax is used under a Creative Commons Attribution license).

Eukaryotic Elongation and Termination

Following the formation of the pre-initiation complex, the polymerase is released from the promoter, and elongation is allowed to proceed as it does in prokaryotes, with the polymerase synthesizing pre-mRNA in the 5′ to 3′ direction. As discussed previously, RNA polymerase II transcribes the major share of eukaryotic genes, so in this section we will focus on how this polymerase accomplishes elongation and termination.

Although the enzymatic process of elongation is essentially the same in eukaryotes and prokaryotes, the DNA template is considerably more complex. When eukaryotic cells are not dividing, their genes exist as a diffuse mass of DNA and proteins called chromatin. The DNA is tightly packaged around charged histone proteins at repeated intervals. These DNA–histone complexes, collectively called nucleosomes, are regularly spaced and include 146 nucleotides of DNA wound around eight histones like thread around a spool. For polynucleotide synthesis to occur, the transcription machinery needs to move histones out of the way every time it encounters a nucleosome. This is accomplished by a special protein complex called FACT, which stands for “facilitates chromatin transcription.” This complex pulls histones away from the DNA template as the polymerase moves along it. Once the pre-mRNA is synthesized, the FACT complex replaces the histones to recreate the nucleosomes.

The termination of transcription is different for the different polymerases. Unlike in prokaryotes, elongation by RNA polymerase II in eukaryotes, which is the RNA polymerase responsible for protein synthesis, takes place 1,000 to 2,000 nucleotides beyond the end of the gene being transcribed. This pre-mRNA tail is subsequently removed by cleavage during mRNA processing.

Regulation of Transcription

The DNA of prokaryotes is organized into a circular chromosome. Proteins that are needed for a specific function, or that are involved in the same biochemical pathway, are often encoded together in blocks called operons. For example, all of the genes needed to use lactose as an energy source are coded next to each other in the lactose (or lac) operon, and transcribed into a single mRNA. You will learn more about operons and other aspects of prokaryotic and eukaryotic gene expression in BIOL 1C.

Section Summary

In prokaryotes, mRNA synthesis is initiated at a promoter sequence on the DNA template comprising two consensus sequences that recruit RNA polymerase. Elongation synthesizes mRNA in the 5′ to 3′ direction. Termination liberates the mRNA and occurs either by rho protein interaction or by the formation of an mRNA hairpin. Transcription in eukaryotes involves one of three types of polymerases, depending on the gene being transcribed. RNA polymerase II transcribes all of the protein-coding genes, whereas RNA polymerase I transcribes rRNA genes, and RNA polymerase III transcribes rRNA, tRNA, and small nuclear RNA genes. The initiation of transcription in eukaryotes involves the binding of several transcription factors to complex promoter sequences that are usually located upstream of the gene being copied. The mRNA is synthesized in the 5′ to 3′ direction, and the FACT complex moves and reassembles nucleosomes as the polymerase passes by. Whereas RNA polymerases I and III terminate transcription by protein- or RNA hairpin-dependent methods, RNA polymerase II transcribes for 1,000 or more nucleotides beyond the gene template and cleaves the excess during pre-mRNA processing, which is discussed in the following chapter.

Review Questions

The -10 and -35 regions of prokaryotic promoters are called consensus sequences because ________.

  1. they are identical in all bacterial species
  2. they are similar in all bacterial species
  3. they exist in all organisms
  4. they have the same function in all organisms

2. they are similar in all bacterial species

Which feature of promoters can be found in both prokaryotes and eukaryotes?

  1. GC box
  2. TATA box
  3. octamer box
  4. -10 and -35 sequences

2. TATA box

Free Response

If mRNA is complementary to the DNA template strand and the DNA template strand is complementary to the DNA non-template (coding) strand, then why are base sequences of mRNA and the DNA nontemplate strand not identical?

DNA is different from RNA in that T nucleotides in DNA are replaced with U nucleotides in RNA. Therefore, they could never be identical in base sequence.

 

Glossary

consensus
DNA sequence that is used by many species to perform the same or similar functions
core enzyme
prokaryotic RNA polymerase consisting of α, α, β, and β‘ but missing σ; this complex performs elongation
downstream
nucleotides following the initiation site in the direction of mRNA transcription; in general, sequences that are toward the 3′ end relative to a site on the mRNA
hairpin
structure of RNA when it folds back on itself and forms intramolecular hydrogen bonds between complementary nucleotides
initiation site
nucleotide from which mRNA synthesis proceeds in the 5′ to 3′ direction; denoted with a “+1”
nontemplate (coding) strand
strand of DNA that is not used to transcribe mRNA; this strand is identical to the mRNA except that T nucleotides in the DNA are replaced by U nucleotides in the mRNA
promoter
DNA sequence to which RNA polymerase and associated factors bind and initiate transcription
TATA box
conserved promoter sequence in eukaryotes and prokaryotes that helps to establish the initiation site for transcription
template strand
strand of DNA that specifies the complementary mRNA molecule
transcription bubble
region of locally unwound DNA that allows for transcription of mRNA
upstream
nucleotides preceding the initiation site; in general, sequences toward the 5′ end relative to a site on the mRNA

Text adapted from OpenStax Biology 2e and used under a Creative Commons Attribution License 4.0.
Access for free at https://openstax.org/books/biology-2e/pages/1-introduction

Media Attributions

definition

License

Icon for the Creative Commons Attribution-NonCommercial 4.0 International License

14.3 Transcription: from gene to message Copyright © by Elizabeth Dahlhoff and Melissa Hardy is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, except where otherwise noted.