Rationale

Author

Jon Reades

Published

September 12, 2025

This is group work due Tuesday, 16 December 2025 @ 13:00 that you will undertake in a small group of no more than four students. The project is intended to resemble real-world data science ways of working: you will be part of a small team thrown together at short notice, you will need to figure out how to work effectively together, and you will need to jointly produce an output in which you all have confidence.

Applying Your Knowledge

The focus of this assessment is therefore the student’s ability to make use of concepts and methods covered in class as part of an analytical process to support decision-making by a non-expert. It is not necessary that you employ every technique covered in class. It is necessary that you justify your approach and results with reference to relevant regulation/legislation, as well as academic and ‘grey’ literature as needed. It is perfectly possible to obtain a distinction-level grade without the use of any advanced analytical techniques (e.g. clustering, NLP, or Random Forests); however, it is unlikely that you would be able to complete this assessment to a high standard without some graphs and some maps chosen for their ability to advance your argument.

New Code?

So the assessment may be completed by drawing on the code written in the practicals and the judicious use of descriptive statistics (see, for instance, Housing and Inequality in London and The suburbanisation of poverty in British cities, 2004-16: extent, processes and nature), but it is likely that a better mark will be obtained by demonstrating the capacity to go beyond exactly what was covered in class.

Two-Part Submission

The submission will have two parts and both are evaluated as part of your overall grade:

  1. A runnable QMD (Quarto Markdown Document) file that addresses the set questions provided in class. The QMD file allows us to evaluate reproducibility (25% of your module grade) by rendering your QMD file on our computer. So we are looking at whether outputs created by the group run fully and without error on a different computer, and whether they show evidence of thought in relation to the quality of coding and outputs.
  2. A rendered PDF file that is the output of your QMD file. The PDF file allows us to focus on your content (35% of your module grade), regardless of whether or not there are issues with its reproducibility. So we are looking at whether the group engages with the set questions through a mix of literature, critical thinking, and data analysis.

Please see Resources for the templates and information about how to create and render the QMD file.