The genome and gene expressionWJEC GCSE Study Guide

    Exam Board: WJEC | Level: GCSE

    Master the instruction manual of life: from the double helix structure of DNA to the complex machinery of protein synthesis. This topic is heavily tested in exams, particularly the distinction between coding and non-coding DNA, and the step-by-step mechanism of transcription and translation.

    ![Header image for The Genome & Gene Expression](https://xnnrgnazirrqvdgfhvou.supabase.co/storage/v1/object/public/study-guide-assets/guide_8bf66825-6979-4734-9109-85b655cf52f9/header_image.png) ## Overview The genome and gene expression lie at the very heart of molecular biology. This topic explores the 'instruction manual' of life—how genetic information is stored, protected, and ultimately expressed as the proteins that build and run every living organism. For your exam, this is a high-yield topic. Examiners frequently test your understanding of DNA structure, the sequence of events in protein synthesis, and the crucial differences between coding and non-coding DNA. You must be prepared to link the microscopic structure of a nucleotide to the macroscopic characteristics of an organism. This topic synoptically links to cell division (mitosis and meiosis), genetic inheritance, and evolution. Expect questions ranging from simple factual recall of base pairing rules to complex, extended-response questions asking you to evaluate the impact of a specific mutation on protein function. ## Key Concepts ### Concept 1: The Structure of DNA and the Genome DNA (deoxyribonucleic acid) is a polymer composed of repeating monomer units called **nucleotides**. Each nucleotide consists of three fundamental components: a phosphate group, a 5-carbon deoxyribose sugar, and a nitrogenous base. There are four possible bases: Adenine (A), Thymine (T), Cytosine (C), and Guanine (G). ![Structure of a DNA nucleotide and the double helix](https://xnnrgnazirrqvdgfhvou.supabase.co/storage/v1/object/public/study-guide-assets/guide_8bf66825-6979-4734-9109-85b655cf52f9/dna_structure_diagram.png) The structure of DNA is a double helix, formed by two antiparallel polynucleotide strands. These strands are held together by hydrogen bonds between complementary base pairs. **Complementary base pairing** is a strict rule: Adenine always pairs with Thymine (forming two hydrogen bonds), and Cytosine always pairs with Guanine (forming three hydrogen bonds). This consistent pairing is what allows DNA to be replicated accurately during cell division. The **genome** is defined as the entire genetic material of an organism. In humans, this consists of approximately 3 billion base pairs packaged into 46 chromosomes (23 pairs) within the nucleus, plus a small amount of mitochondrial DNA. A **gene** is a specific, short section of DNA on a chromosome that codes for a particular sequence of amino acids, which join together to make a specific protein. **Why does this matter for the exam?** Examiners often ask candidates to distinguish between the genome, a chromosome, and a gene. Remember the hierarchy: Genome > Chromosome > Gene > DNA > Nucleotide. ### Concept 2: Coding vs Non-Coding DNA A common misconception is that all DNA codes for proteins. In reality, only about 1-2% of the human genome consists of **coding DNA** (genes). The vast majority is **non-coding DNA**. Historically dismissed as "junk DNA", we now know non-coding DNA serves vital functions: 1. **Regulatory roles**: It controls gene expression, switching genes 'on' or 'off' depending on the cell's requirements. 2. **Introns**: These are non-coding sequences *within* genes that are transcribed into mRNA but are spliced (cut) out before translation. 3. **Structural roles**: It forms telomeres (protecting chromosome ends) and centromeres. 4. **Genetic profiling**: It contains Short Tandem Repeats (STRs) used in DNA fingerprinting. **Example**: If an exam question asks about the function of non-coding DNA, *never* state that it has no function. State that it regulates gene expression or is used to synthesise RNA molecules like tRNA and rRNA. ### Concept 3: The Triplet Code The sequence of bases in a gene acts as a code for building a protein. Because there are 20 different amino acids but only 4 DNA bases, the bases are read in groups of three. This is known as the **triplet code** (or codon in mRNA). The genetic code has three key features examiners test: 1. **Degenerate**: Most amino acids are coded for by more than one triplet (e.g., both UUU and UUC code for Phenylalanine). This protects against the effects of some mutations. 2. **Non-overlapping**: Each base is read only once, as part of one specific triplet. 3. **Universal**: The same specific triplets code for the same amino acids in almost all living organisms, providing strong evidence for a common evolutionary ancestor. ### Concept 4: Protein Synthesis (Transcription and Translation) Protein synthesis is the process by which the genetic code is used to build functional proteins. It occurs in two main stages: transcription (in the nucleus) and translation (at the ribosome). ![The two stages of protein synthesis: Transcription and Translation](https://xnnrgnazirrqvdgfhvou.supabase.co/storage/v1/object/public/study-guide-assets/guide_8bf66825-6979-4734-9109-85b655cf52f9/protein_synthesis_diagram.png) **Stage 1: Transcription** Transcription is the process of making a messenger RNA (mRNA) copy of a gene. 1. The DNA double helix unwinds and the hydrogen bonds between the strands break, exposing the bases. 2. Only one strand acts as a template (the template strand). 3. The enzyme **RNA polymerase** binds to the non-coding region just before the gene. 4. RNA polymerase moves along the template strand, joining free RNA nucleotides to form an mRNA strand. 5. The mRNA strand is complementary to the template strand. Crucially, RNA contains **Uracil (U)** instead of Thymine (T). So, if the DNA template is A-T-C-G, the mRNA will be U-A-G-C. 6. The mRNA detaches and leaves the nucleus through a nuclear pore. **Stage 2: Translation** Translation is the process of converting the mRNA sequence into a polypeptide chain (protein). 1. The mRNA attaches to a **ribosome** in the cytoplasm. 2. Carrier molecules called **tRNA** (transfer RNA) bring specific amino acids to the ribosome. 3. Each tRNA molecule has an **anticodon** (three bases) that is complementary to a specific mRNA **codon**. 4. The ribosome reads the mRNA one codon at a time. The tRNA with the complementary anticodon binds to the mRNA, bringing its specific amino acid into position. 5. The ribosome joins the amino acids together with peptide bonds to form a polypeptide chain. 6. Once a 'stop' codon is reached, the polypeptide is released and folds into its unique 3D functional shape (e.g., an enzyme active site). ### Concept 5: Mutations and Genetic Profiling A **mutation** is a random, spontaneous change in the sequence of DNA bases. If a mutation occurs in a gene, it changes the sequence of bases. This changes the triplet code, which *may* change the sequence of amino acids in the polypeptide. If the amino acid sequence changes, the protein will fold differently, altering its 3D shape and potentially its function (e.g., an enzyme's active site may no longer fit its substrate). **Genetic profiling** (DNA fingerprinting) exploits the highly variable non-coding regions of DNA, specifically Short Tandem Repeats (STRs). 1. A DNA sample is extracted (e.g., from blood or saliva). 2. The DNA is cut into fragments using specific restriction enzymes. 3. The fragments are separated by size using a technique called **gel electrophoresis**. 4. This creates a unique pattern of bands (a profile) that can be compared to other samples to identify individuals in forensic science or establish paternity. Listen to our deep-dive podcast episode covering these concepts in detail: ![Biology Revision Podcast: The Genome and Gene Expression](https://xnnrgnazirrqvdgfhvou.supabase.co/storage/v1/object/public/study-guide-assets/guide_8bf66825-6979-4734-9109-85b655cf52f9/genome_and_gene_expression_podcast.mp3)
    The genome and gene expression Study Guide — WJEC GCSE | MasteryMind