2024/25 Taught Postgraduate Module Catalogue

COMP5712M Programming for Data Science

15 Credits Class Size: 400

Module manager: Dr Hui Lau
Email: H.K.Lau@leeds.ac.uk

Taught: Semester 1 (Sep to Jan) View Timetable

Year running 2024/25

This module is not approved as an Elective

Module summary

This module is designed to give those with little or no programming experience a firm foundation in programming for data analysis and AI systems, recognising a diversity of backgrounds. The module will also fully stretch those with substantial prior programming experience (e.g., computer scientists) to extend their programming and system-building knowledge through self-learning supported by on-line courseware.

Objectives

The module introduces the fundamental skills of programming and software system development. It aims to give students the skills and experience to produce simple computer-based applications for a range of sectors. It prepares students to develop and integrate systems using Artificial Intelligence and Data Analytics techniques.

Learning outcomes

By the end of the module, students will be able to:
• Design, build and test computer programs in Python
• Implement applications across a selected domain (e.g., health, finance)
• Build systems that integrate with the internet and databases
• Understand how software is used for data analysis and AI

Syllabus

Computer programming in Python: control structures, datatypes, data structures, functions and classes, importing and using libraries, implementing simple algorithms.
Use of a Python development platform.
Use of specific libraries/APIs providing data access and analysis functionality, such as: accessing information from the web or from databases, statistical analysis, ML algorithms, graphical display of data.
Students will undertake a sequence of programming exercises starting with the fundamentals of programming and building up to a system that performs significant data analysis on real data:
• Basic algorithms for representing and processing information.
• Importing, manipulating and displaying data.
• Use of basic ML algorithms (e.g., k-means, nearest neighbour and linear classification).
• Implementation of a data analysis ‘pipeline’ in which data is extracted from some source, processed, analysed and visualised. The data investigated in this exercise will be drawn from the sector of each student’s masters programme (e.g., finance, health, law).

Teaching Methods

Delivery type Number Length hours Student hours
Drop-in Session 10 1 10
Practical 10 2 20
Private study hours 120
Total Contact hours 30
Total hours (100hr per 10 credits) 150

Private study

The student will be expected to complete several programming exercises relating to specialist domains. They will undertake program development and submit their code for evaluation. For some exercises an additional written report may be required.

Methods of Assessment

Coursework
Assessment type Notes % of formal assessment
In-course Assessment Coursework 1 20
In-course Assessment Coursework 2 60
In-course Assessment Online Test 20
Total percentage (Assessment Coursework) 100

This module will be reassessed by coursework.

Reading List

The reading list is available from the Library website

Last updated: 9/25/2024

Errors, omissions, failed links etc should be notified to the Catalogue Team