What is the difference between ASCII and Unicode?

ASCII (American Standard Code for Information Interchange) uses 7 bits to represent 128 characters, mainly English letters, digits, and punctuation. Unicode is a more comprehensive standard that uses up to 32 bits per character, allowing it to represent over a million characters from virtually all writing systems worldwide. While ASCII is simpler and uses less memory, Unicode is essential for global communication and modern software.

How do you convert binary to denary?

To convert binary to denary, write the binary number with place values (powers of 2) above each digit, starting from the rightmost digit (2^0). Then add up the place values where the binary digit is 1. For example, binary 1011: place values 8,4,2,1 → 8+0+2+1 = 11 in denary. Practice with numbers up to 8 bits (255) for GCSE.

Why do computers use binary instead of decimal?

Computers use binary because it's simple and reliable for electronic circuits. Transistors can be in one of two states: on (1) or off (0), representing binary digits. This two-state system is easy to implement, resistant to noise, and allows for straightforward logic gates. Decimal would require ten distinct voltage levels, which is harder to maintain accurately.

How is sound stored on a computer?

Sound is stored digitally through a process called sampling. An analog-to-digital converter measures the amplitude of a sound wave at regular intervals (sample rate, e.g., 44.1 kHz) and assigns a binary value to each sample based on its amplitude (bit depth, e.g., 16 bits). These binary values are stored sequentially. To play back, a digital-to-analog converter reconstructs the wave. Higher sample rates and bit depths give better quality but larger file sizes.

What is the difference between resolution and colour depth in images?

Resolution refers to the number of pixels in an image, usually given as width × height (e.g., 1920×1080). Colour depth (or bit depth) is the number of bits used to represent the colour of each pixel. For example, 1-bit colour depth gives 2 colours (black and white), while 24-bit gives over 16 million colours. Both affect file size: higher resolution means more pixels, and higher colour depth means more bits per pixel.

How do you calculate the file size of an image?

File size (in bits) = width (pixels) × height (pixels) × colour depth (bits per pixel). Then convert to bytes by dividing by 8. For example, a 100×100 image with 24-bit colour: 100×100×24 = 240,000 bits = 30,000 bytes (about 29.3 KiB). Remember to use the correct units: 1 byte = 8 bits, 1 KiB = 1024 bytes.

Fundamentals of data representation

AQA

GCSE

Character encoding defines how computers represent text by mapping characters to unique binary codes. Students must understand the purpose and mechanics of 7-bit ASCII and Unicode, including how to use encoding tables to convert between characters and their binary or decimal representations.

Objectives

Exam Tips

Pitfalls

Key Terms

Mark Points

Subtopics in this area

Character Encoding

Representing images

Binary Arithmetic

Units of information

Number bases

Converting between number bases

Quantities of Bytes

Representing sound

Data compression

Character Codes

Topic Overview

Fundamentals of data representation is the cornerstone of how computers store, process, and communicate information. At its core, this topic explores how all data—whether text, numbers, images, or sound—is ultimately represented using just two symbols: 0 and 1. This binary system is the language of computers, and understanding it is essential for grasping how digital devices work. You'll learn about binary numbers, character encoding (like ASCII and Unicode), and how images and sound are digitised through sampling and colour depth.

This topic matters because it explains the bridge between the physical hardware (transistors storing electrical charges) and the abstract data we interact with daily. For example, when you type a letter on a keyboard, the computer converts it into a binary code, processes it, and then displays it on screen. Without data representation, there would be no digital text, no photos, no music—just raw electrical signals. In the AQA GCSE Computer Science course, this knowledge underpins topics like data storage, compression, and even programming, as you'll need to understand how data types work in code.

Data representation also connects to real-world applications like file formats (JPEG, MP3) and data compression, which are crucial for efficient storage and transmission. By mastering this topic, you'll be able to calculate file sizes, understand why some images are blurry or why audio files are large, and appreciate the trade-offs between quality and file size. This foundational knowledge will serve you well in more advanced topics like networking, databases, and computational thinking.

Key Concepts

Core ideas you must understand for this topic

→Binary and denary conversion: Understand how to convert between base-2 (binary) and base-10 (denary) numbers, including the use of place values (powers of 2).
→Character encoding: Know the difference between ASCII (7-bit, 128 characters) and Unicode (16-bit or more, covering global scripts), and why Unicode is necessary for international text.
→Bitmap images: Learn how images are stored as a grid of pixels, each with a binary colour code determined by colour depth (bits per pixel). Understand how resolution and colour depth affect file size.
→Sound sampling: Understand how sound waves are digitised by taking samples at regular intervals (sample rate) and storing each sample's amplitude as a binary value (bit depth). Know the impact of sample rate and bit depth on quality and file size.
→Units of data: Be able to convert between bits, bytes, kilobytes, megabytes, gigabytes, and terabytes, and understand that data is measured in powers of 2 (e.g., 1 KiB = 1024 bytes).

What You Need to Demonstrate

Key skills and knowledge for this topic

Definition of a character set
Description of 7-bit ASCII
Description of Unicode
Conversion of characters to character codes using a provided table
Conversion of character codes to characters using a provided table
Explanation of character code sequencing (e.g., 'A' is 65, 'B' is 66)
Explanation of the purpose of Unicode and its advantages over ASCII
Definition of a pixel as a picture element

Marking Points

Key points examiners look for in your answers

Definition of a character set
Description of 7-bit ASCII
Description of Unicode
Conversion of characters to character codes using a provided table
Conversion of character codes to characters using a provided table
Explanation of character code sequencing (e.g., 'A' is 65, 'B' is 66)
Explanation of the purpose of Unicode and its advantages over ASCII
Definition of a pixel as a picture element
Relationship between image size (width x height) and file size
Impact of colour depth on file size and image quality
Calculation of file size in bits using (W x H x D)
Calculation of file size in bytes using (W x H x D) / 8
Conversion of binary patterns to bitmap images
Conversion of bitmap images to binary data
Addition of up to three binary numbers using a maximum of 8 bits.
Correct application of logical binary shifts.
Understanding that binary shifts correspond to multiplication or division by powers of 2.
No carrying beyond the 8th bit in addition tasks.
No requirement to handle fractional representations in binary shifts.
A bit is the fundamental unit of information (0 or 1)
A byte is a group of 8 bits
1 kB = 1,000 bytes
1 MB = 1,000 kilobytes
1 GB = 1,000 Megabytes
1 TB = 1,000 Gigabytes
b represents bit and B represents byte
Ability to represent decimal values between 0 and 255 in binary
Ability to represent decimal values between 0 and 255 in hexadecimal
Accurate conversion between binary and decimal
Accurate conversion between binary and hexadecimal
Accurate conversion between decimal and hexadecimal
Recognition of equivalent maximum values: decimal 255, binary 1111 1111, hexadecimal FF
Correct conversion between decimal and binary for values 0-255
Correct conversion between decimal and hexadecimal for values 0-255
Correct conversion between binary and hexadecimal
Accurate representation of decimal 255 as binary 1111 1111 and hexadecimal FF
1 kB = 1,000 bytes
1 MB = 1,000 kilobytes
1 GB = 1,000 Megabytes
1 TB = 1,000 Gigabytes
A bit is the fundamental unit of information (0 or 1)
A byte is a group of 8 bits
Correct use of symbols: b for bit, B for byte
Sound is analogue and must be converted to digital form for computer processing
Analogue signals are sampled to create a digital version
A sample is a measure of amplitude at a point in time
Sampling rate is the number of samples taken per second (measured in Hertz)
Sample resolution is the number of bits per sample
File size calculation: File size (bits) = sampling rate × sample resolution × number of seconds
Explanation of why data compression is necessary or desirable
Interpretation of Huffman trees to determine compressed bit sequences
Calculation of bits required for Huffman compressed data versus uncompressed ASCII
Calculation of bits saved through compression
Representation of data using RLE frequency/data pairs
Conversion of bitmap rows into RLE pairs
Understanding that character codes are grouped and run in sequence within encoding tables
Ability to calculate codes for letters (e.g., 'B' from 'A') based on their sequential position
Describing the purpose of Unicode and its advantages over ASCII
Converting characters to character codes using a provided table
Converting character codes to characters using a provided table

Examiner Tips

Expert advice for maximising your marks

💡Always check if an exam question provides a specific character encoding table to use for conversions
💡Remember that character codes for 'A'-'Z', 'a'-'z', and '0'-'9' are sequential in standard encoding tables
💡When asked for the advantage of Unicode, focus on the ability to represent a wider range of alphabets and special symbols compared to ASCII
💡Always show your working out for file size calculations to gain method marks
💡Double-check if the question asks for the answer in bits or bytes
💡When converting binary to images, draw a grid to help visualise the pixel mapping
💡Remember that colour depth is the number of bits per pixel, not the total number of colours
💡Always double-check the number of bits used in your answer to ensure it does not exceed the 8-bit limit.
💡When performing binary addition, clearly show your working for carries to avoid simple arithmetic errors.
💡Remember that a left shift by one position is equivalent to multiplying by 2, and a right shift by one position is equivalent to dividing by 2.
💡Always check if the question asks for bits or bytes
💡Remember that AQA uses the decimal definition (1,000) for these prefixes, not the binary definition (1,024)
💡Pay close attention to the symbols b and B in exam questions
💡Practice converting between binary and hexadecimal by grouping bits into sets of four
💡Remember that hexadecimal is used as a shorthand for binary to make it easier for humans to read and debug
💡Always double-check your conversions by converting back to the original base
💡Use a table of place values (128, 64, 32, 16, 8, 4, 2, 1) to assist with binary conversions
💡Convert binary to hexadecimal by splitting the 8-bit number into two 4-bit nibbles
💡Double-check your work by converting back to the original base
💡Remember that hexadecimal digits A-F represent 10-15
💡Always check if the question asks for bits or bytes, as this changes the unit of measurement
💡Remember that AQA defines these prefixes using base 10 (1,000) rather than base 2 (1,024)
💡Use the provided symbols (b and B) carefully in your working out to avoid losing marks
💡Always check if the question asks for the file size in bits or bytes
💡Ensure you show your working out for calculations to gain method marks
💡Remember that higher sampling rates and higher sample resolutions lead to larger file sizes but better sound quality
💡When performing RLE, double-check that the sum of your frequencies equals the total number of pixels in the row
💡Always show your working for bit calculations to gain method marks
💡Remember that ASCII is typically 8 bits per character unless otherwise specified
💡Practice drawing Huffman trees from frequency tables to ensure accuracy
💡When given a character table, always check the sequence of the codes provided to identify the pattern
💡Remember that 'A' is 65 in ASCII, but always use the table provided in the exam if one is given
💡Be prepared to explain that Unicode allows for a much larger range of characters and symbols compared to ASCII
💡When calculating file sizes, always show your working step by step. For example, for an image: (width × height) × colour depth (bits) = total bits, then convert to bytes by dividing by 8. This methodical approach earns method marks even if the final answer is wrong.
💡For sound questions, remember that file size = sample rate × bit depth × duration (in seconds). Don't forget to convert to appropriate units (e.g., bytes) and watch out for units like kHz (1 kHz = 1000 Hz).
💡In binary addition questions, be careful with carries. Practice adding binary numbers like 1011 + 0111, and check your answer by converting to denary. Examiners often test overflow errors when the result exceeds the available bits.

Common Mistakes

Pitfalls to avoid in your exam answers

Confusing the number of bits in ASCII (7-bit) with a full byte (8-bit)
Failing to recognize that character codes for letters and digits run in a continuous sequence
Assuming Unicode is a specific version like UTF-8 or UTF-16 rather than a general standard
Misinterpreting the requirement to use a provided table during exam questions
Confusing bits and bytes in file size calculations
Forgetting to divide by 8 when converting bits to bytes
Incorrectly identifying the dimensions of an image from a binary grid
Misunderstanding that colour depth is measured in bits per pixel
Attempting to carry beyond the 8th bit during binary addition.
Confusing logical binary shifts with arithmetic shifts.
Incorrectly identifying the direction of a shift for multiplication versus division.
Exceeding the 8-bit limit for binary numbers.
Confusing bits (b) with bytes (B)
Using binary prefixes (1024) instead of decimal prefixes (1000) as specified by the AQA 8525 specification
Incorrectly ordering the magnitude of prefixes
Incorrectly grouping binary bits when converting to hexadecimal
Confusing the base of the number system during conversion calculations
Failing to account for the full 8-bit representation when converting small decimal numbers to binary
Errors in place value calculation during conversion
Confusing the base of the number system being used
Incorrectly handling the hexadecimal digits A-F
Exceeding the 8-bit limit (255) in calculations
Confusing bits (b) with bytes (B)
Using binary prefixes (1024) instead of the decimal prefixes (1000) specified by AQA for this topic
Incorrectly ordering the magnitude of units (e.g., thinking a TB is smaller than a GB)
Confusing sampling rate with sample resolution
Forgetting to convert bits to bytes if the question asks for the final answer in bytes
Incorrectly identifying the units for sampling rate (Hertz) or resolution (bits)
Failing to include the duration (seconds) in the file size calculation
Confusing the frequency and the data value in RLE pairs
Incorrectly calculating the total bits for uncompressed ASCII (forgetting to multiply by 8 bits per character)
Misinterpreting the path in a Huffman tree
Failing to account for all bits in a bitmap when performing RLE
Assuming that all character sets use the same number of bits
Failing to recognize that character codes for letters (A-Z, a-z) and digits (0-9) are sequential
Confusing the purpose of Unicode with specific implementations like UTF-8 or UTF-16
Misconception: 'A kilobyte is exactly 1000 bytes.' Correction: In computing, a kilobyte (KiB) is 1024 bytes (2^10), though some contexts use 1000 for simplicity. The AQA specification uses the binary definition (1024).
Misconception: 'ASCII can represent any character from any language.' Correction: ASCII only covers 128 characters (English letters, digits, punctuation, control codes). For other languages, Unicode (e.g., UTF-8) is needed.
Misconception: 'Higher resolution always means better image quality.' Correction: While higher resolution (more pixels) can improve detail, it also increases file size. Quality also depends on colour depth and the viewing distance/medium.

Frequently Asked Questions

Common questions students ask about this topic

Before You Start

Prior knowledge that will help with this topic

•Basic understanding of number systems (denary/decimal) and place value.
•Familiarity with powers of 2 (e.g., 2^0=1, 2^1=2, 2^2=4, etc.) up to at least 2^10 (1024).
•Simple arithmetic skills (addition, multiplication) as used in file size calculations.

Key Terminology

Essential terms to know

Binary representation of alphanumeric and control characters
Relationship between bit depth and character set capacity (2^n)
Evolution from ASCII to Unicode for global interoperability
Pixel-level binary representation and color encoding
Impact of resolution and color depth on image fidelity and storage requirements
Role and components of metadata in image file structures
Binary Addition Rules and Columnar Carry
Overflow Detection and Hardware Limitations
Logical Binary Shifts for Multiplication and Division
Fixed-width Integer Representation
Binary representation and the bit as a fundamental unit
Hierarchical scaling of data units (bit, nibble, byte, KB, MB, GB, TB, PB)
Calculation of storage capacity and file size requirements
Positional notation and place value systems
Conversion algorithms between Denary, Binary, and Hexadecimal
Binary arithmetic and logical shifts
The relationship between bit depth and representable range
Positional notation and place value systems
Binary to Denary and Hexadecimal mapping
Logical bitwise shifts and arithmetic implications
Hierarchy of data units (bit, nibble, byte, KB, MB, GB, TB, PB)
Decimal (SI) vs Binary (IEC) prefixes and their multipliers
File size calculation methodologies for discrete and continuous data
Analogue-to-digital conversion (ADC) mechanisms
Sampling theory: Rate and Resolution
Mathematical modeling of file size and storage
Fidelity versus compression trade-offs
Lossless vs. Lossy compression mechanisms
Algorithmic efficiency in Huffman Coding and Run-Length Encoding
Impact on storage capacity and transmission bandwidth
File format characteristics and metadata preservation
Relationship between bit depth and character set capacity
Standardization and interoperability (ASCII vs. Unicode)
Sequential and logical ordering of character codes
Storage requirements and efficiency of encoding schemes

Likely Command Words

How questions on this topic are typically asked

Describe

Explain

Convert

Understand

Calculate

Add

Apply

Know

Compare

Represent

Define

Ready to test yourself?

Practice questions tailored to this topic

Fundamentals of data representation

Subtopics in this area

Topic Overview

Key Concepts

What You Need to Demonstrate

Marking Points

Examiner Tips

Common Mistakes

Frequently Asked Questions

Before You Start

Key Terminology

Likely Command Words

Ready to test yourself?

Related Topics in AQA GCSE Computer Science

E2E stub concept

Programming

Computer systems

Programming

Topic Synopsis

Key Concepts & Core Principles

Exam Tips & Revision Strategies

Common Misconceptions & Mistakes to Avoid

Examiner Marking Points

Fundamentals of data representation

Subtopics in this area

Topic Overview

Key Concepts

What You Need to Demonstrate

Marking Points

Examiner Tips

Common Mistakes

Frequently Asked Questions

What is the difference between ASCII and Unicode?

How do you convert binary to denary?

Why do computers use binary instead of decimal?

How is sound stored on a computer?

What is the difference between resolution and colour depth in images?

How do you calculate the file size of an image?

Before You Start

Key Terminology

Likely Command Words

Ready to test yourself?

Related Topics in AQA GCSE Computer Science

E2E stub concept

Programming

Computer systems

Programming