This subtopic covers the essential skills of identifying appropriate data sources for a project, employing suitable research methods to collect primary and
Topic Synopsis
This subtopic covers the essential skills of identifying appropriate data sources for a project, employing suitable research methods to collect primary and secondary data, and effectively structuring and cleansing raw data into a format ready for analysis. In practice, these competencies enable learners to prepare reliable datasets for generating meaningful insights and making data-driven decisions within a business or research context.
Key Concepts & Core Principles
- Data types: Understand the difference between qualitative (categorical) and quantitative (numerical) data, and between discrete and continuous data.
- Measures of central tendency: Mean, median, and mode – how to calculate each and when to use them appropriately.
- Data visualisation: Creating bar charts, pie charts, line graphs, and histograms to represent data effectively and accurately.
- Data cleaning: Identifying and correcting errors, duplicates, and missing values to ensure data quality before analysis.
- Drawing conclusions: Interpreting charts and summary statistics to answer specific questions and support decision-making.
Exam Tips & Revision Strategies
- Always justify your choice of data sources and collection methods by referencing the specific needs of the analysis task or brief.
- Document every step of your data preparation process, including any assumptions made and actions taken to address data quality issues.
- Use real-world examples and terminology (e.g., ‘data cleaning’, ‘normalisation’, ‘outlier detection’) to demonstrate professional competence.
Common Misconceptions & Mistakes to Avoid
- Confusing primary and secondary data sources, often misclassifying public datasets or reports as primary data.
- Failing to consider ethical and legal constraints when collecting data, such as GDPR compliance or informed consent.
- Overlooking data cleaning steps, resulting in incomplete or erroneous datasets that undermine the validity of subsequent analysis.
Examiner Marking Points
- Award credit for accurately distinguishing between primary and secondary data sources, providing clear, context-appropriate examples of each.
- Expect evidence of selecting a research method (e.g., survey, interview, observation, web scraping) that is justified in relation to the data requirements and project objectives.
- Credit is given for demonstrating a systematic approach to data preparation, including steps such as removing duplicates, handling missing values, and standardising formats.
- Look for the production of a well-organised, structured dataset (e.g., in a spreadsheet or database) with appropriate field names, data types, and documentation of cleaning steps.