This is “Protein Synthesis and the Genetic Code”, section 19.4 from the book Introduction to Chemistry: General, Organic, and Biological (v. 1.0). For details on it (including licensing), click here.

For more information on the source of this book, or why it is available for free, please see the project's home page. You can browse or download additional books there. You may also download a PDF copy of this book (72 MB) or just this chapter (5 MB), suitable for printing or most e-readers, or a .zip file containing this book's HTML files (for use in a web browser offline).

Has this book helped you? Consider passing it on:
Creative Commons supports free culture from music to education. Their licenses helped make this book available to you.
DonorsChoose.org helps people like you help teachers fund their classroom projects, from art supplies to books to calculators.

19.4 Protein Synthesis and the Genetic Code

Learning Objectives

  1. Describe the characteristics of the genetic code.
  2. Describe how a protein is synthesized from mRNA.

One of the definitions of a gene is as follows: a segment of deoxyribonucleic acid (DNA) carrying the code for a specific polypeptide. Each molecule of messenger RNA (mRNA) is a transcribed copy of a gene that is used by a cell for synthesizing a polypeptide chain. If a protein contains two or more different polypeptide chains, each chain is coded by a different gene. We turn now to the question of how the sequence of nucleotides in a molecule of ribonucleic acid (RNA) is translated into an amino acid sequence.

How can a molecule containing just 4 different nucleotides specify the sequence of the 20 amino acids that occur in proteins? If each nucleotide coded for 1 amino acid, then obviously the nucleic acids could code for only 4 amino acids. What if amino acids were coded for by groups of 2 nucleotides? There are 42, or 16, different combinations of 2 nucleotides (AA, AU, AC, AG, UU, and so on). Such a code is more extensive but still not adequate to code for 20 amino acids. However, if the nucleotides are arranged in groups of 3, the number of different possible combinations is 43, or 64. Here we have a code that is extensive enough to direct the synthesis of the primary structure of a protein molecule.

The genetic codeThe identification of each group of three nucleotides and its particular amino acid. can therefore be described as the identification of each group of three nucleotides and its particular amino acid. The sequence of these triplet groups in the mRNA dictates the sequence of the amino acids in the protein. Each individual three-nucleotide coding unit, as we have seen, is called a codon.

Protein synthesis is accomplished by orderly interactions between mRNA and the other ribonucleic acids (transfer RNA [tRNA] and ribosomal RNA [rRNA]), the ribosome, and more than 100 enzymes. The mRNA formed in the nucleus during transcription is transported across the nuclear membrane into the cytoplasm to the ribosomes—carrying with it the genetic instructions. The process in which the information encoded in the mRNA is used to direct the sequencing of amino acids and thus ultimately to synthesize a protein is referred to as translationThe process in which the information encoded in mRNA is used to direct the sequencing of amino acids to synthesize a protein..

Before an amino acid can be incorporated into a polypeptide chain, it must be attached to its unique tRNA. This crucial process requires an enzyme known as aminoacyl-tRNA synthetase (Figure 19.12 "Binding of an Amino Acid to Its tRNA"). There is a specific aminoacyl-tRNA synthetase for each amino acid. This high degree of specificity is vital to the incorporation of the correct amino acid into a protein. After the amino acid molecule has been bound to its tRNA carrier, protein synthesis can take place. Figure 19.13 "The Elongation Steps in Protein Synthesis" depicts a schematic stepwise representation of this all-important process.

Figure 19.12 Binding of an Amino Acid to Its tRNA

Figure 19.13 The Elongation Steps in Protein Synthesis

Early experimenters were faced with the task of determining which of the 64 possible codons stood for each of the 20 amino acids. The cracking of the genetic code was the joint accomplishment of several well-known geneticists—notably Har Khorana, Marshall Nirenberg, Philip Leder, and Severo Ochoa—from 1961 to 1964. The genetic dictionary they compiled, summarized in Figure 19.14 "The Genetic Code", shows that 61 codons code for amino acids, and 3 codons serve as signals for the termination of polypeptide synthesis (much like the period at the end of a sentence). Notice that only methionine (AUG) and tryptophan (UGG) have single codons. All other amino acids have two or more codons.

Figure 19.14 The Genetic Code

Example 3

A portion of an mRNA molecule has the sequence 5′‑AUGCCACGAGUUGAC‑3′. What amino acid sequence does this code for?

Solution

Use Figure 19.14 "The Genetic Code" to determine what amino acid each set of three nucleotides (codon) codes for. Remember that the sequence is read starting from the 5′ end and that a protein is synthesized starting with the N-terminal amino acid. The sequence 5′‑AUGCCACGAGUUGAC‑3′ codes for met-pro-arg-val-asp.

Skill-Building Exercise

  1. A portion of an RNA molecule has the sequence 5′‑AUGCUGAAUUGCGUAGGA‑3′. What amino acid sequence does this code for?

Further experimentation threw much light on the nature of the genetic code, as follows:

  1. The code is virtually universal; animal, plant, and bacterial cells use the same codons to specify each amino acid (with a few exceptions).
  2. The code is “degenerate”; in all but two cases (methionine and tryptophan), more than one triplet codes for a given amino acid.
  3. The first two bases of each codon are most significant; the third base often varies. This suggests that a change in the third base by a mutation may still permit the correct incorporation of a given amino acid into a protein. (For more information on mutations, see Section 19.5 "Mutations and Genetic Diseases".) The third base is sometimes called the “wobble” base.
  4. The code is continuous and nonoverlapping; there are no nucleotides between codons, and adjacent codons do not overlap.
  5. The three termination codons are read by special proteins called release factors, which signal the end of the translation process.
  6. The codon AUG codes for methionine and is also the initiation codon. Thus methionine is the first amino acid in each newly synthesized polypeptide. This first amino acid is usually removed enzymatically before the polypeptide chain is completed; the vast majority of polypeptides do not begin with methionine.

Concept Review Exercises

  1. What are the roles of mRNA and tRNA in protein synthesis?

  2. What is the initiation codon?

  3. What are the termination codons and how are they recognized?

Answers

  1. mRNA provides the code that determines the order of amino acids in the protein; tRNA transports the amino acids to the ribosome to incorporate into the growing protein chain.

  2. AUG

  3. UAA, UAG, and UGA; they are recognized by special proteins called release factors, which signal the end of the translation process.

Key Takeaways

  • In translation, the information in mRNA directs the order of amino acids in protein synthesis.
  • A set of three nucleotides (codon) codes for a specific amino acid.

Exercises

  1. Write the anticodon on tRNA that would pair with each mRNA codon.

    1. 5′‑UUU‑3′
    2. 5′‑CAU‑3′
    3. 5′‑AGC‑3′
    4. 5′‑CCG‑3′
  2. Write the codon on mRNA that would pair with each tRNA anticodon.

    1. 5′‑UUG‑3′
    2. 5′‑GAA‑3′
    3. 5′‑UCC‑3′
    4. 5′‑CAC‑3′
  3. The peptide hormone oxytocin contains 9 amino acid units. What is the minimum number of nucleotides needed to code for this peptide?

  4. Myoglobin, a protein that stores oxygen in muscle cells, has been purified from a number of organisms. The protein from a sperm whale is composed of 153 amino acid units. What is the minimum number of nucleotides that must be present in the mRNA that codes for this protein?

  5. Use Figure 19.14 "The Genetic Code" to identify the amino acids carried by each tRNA molecule in Exercise 1.

  6. Use Figure 19.14 "The Genetic Code" to identify the amino acids carried by each tRNA molecule in Exercise 2.

  7. Use Figure 19.14 "The Genetic Code" to determine the amino acid sequence produced from this mRNA sequence: 5′‑AUGAGCGACUUUGCGGGAUUA‑3′.

  8. Use Figure 19.14 "The Genetic Code" to determine the amino acid sequence produced from this mRNA sequence: 5′‑AUGGCAAUCCUCAAACGCUGU‑3′

Answers

    1. 3′‑AAA‑5′
    2. 3′‑GUA‑5′
    3. 3′‑UCG‑5′
    4. 3′‑GGC‑5′
  1. 27 nucleotides (3 nucleotides/codon)

  2. 1a: phenyalanine; 1b: histidine; 1c: serine; 1d: proline

  3. met-ser-asp-phe-ala-gly-leu