Name Cr Method of study Time Location Organiser
Data Science Project I 5 Cr Lecture Course 19.1.2021 - 4.5.2021
Name Cr Method of study Time Location Organiser
Data Science Project I 5 Cr Laboratory work 14.1.2020 - 28.4.2020
Data Science Project I 5 Cr Laboratory work 12.9.2019 - 12.12.2019
Data Science Project I 5 Cr Laboratory work 15.1.2019 - 30.4.2019
Data Science Project I 5 Cr Laboratory work 10.9.2018 - 10.12.2018
Data Science Project I 5 Cr Lecture Course 16.1.2018 - 1.5.2018
Data Science Project I 5 Cr Laboratory work 11.9.2017 - 11.12.2017

Target group

Master's Programme in Data Science is responsible for the course.

The course belongs to the Data Science Methods module.

The course is primarily intended for students of the Data Science Master's program. Other students can enrol for the course, but in case it fills up preference is given for the Data Science students.


Prerequisites in terms of knowledge

Software development skills on a level that is sufficient for working as part of a larger software development team (good programming skills, version control etc), for example as obtained during Bachelor in Computer Science. Some background on modeling data; no requirements are assumed on specific set of algorithms, but one should be familiar with the basic process of learning models from data and evaluating their accuracy, and should know some practical models or algorithms that can be applied for such tasks.

Prerequisites for students in the Data Science programme, in terms of courses

DATA11002 Introduction to Machine Learning (or DATA12002 Probabilistic Graphical Models)

Prerequisites for other students in terms of courses

Good programming skills; DATA11001 Introduction to Data Science; at least one of: DATA11002 Introduction to Machine Learning, DATA20001 Deep Learning, DATA12002 Probabilistic Graphical Models

Recommended preceding courses


Learning outcomes

Solve a practical data science challenge as part of a group, taking responsibility of individual elements of a bigger project while actively interacting with the group towards solving a common goal.
Identify a need for a data-driven service in a given context (typically a data source or device that produces data), choose suitable tools for solving the problem, and deliver a functioning service that fills the need.
Deal with challenges associated with working on real data and the limitations of available data science tools.
Understand how to use practical data science tools and argue about their usability.
Apply theoretical knowledge learned during other courses in practice.


Recommended time/stage of studies for completion: First year spring.

Term/teaching period when the course will be offered: offered during both spring and fall, covering periods I-II and III-IV


Application of data science skills in producing a practical data science product or service. The detailed content, such as algorithms and tools used for creating the solution, depends on the practical problem and domain chosen by the group.

Activities and teaching methods in support of learning

The course combines instructions by the lecturer, presentations by the students, and long-term group work. The details of the supervision of the group work will be determined case-by-case. The students will write a study diary analysing and reflecting their learning during the course.

Study materials

The course material is provided as lecture notes, slides and links to external sources.

Assessment practices and criteria

Grading scale is 1...5.

The grading is based on active participation in the group work, demonstrable individual contributions in the final result, the quality and complexity of the solution and its presentation, and the quality of the individual work not carried out as a group member, such as a tool presentation and the study diary.

Recommended optional studies

The project is about applying theoretical knowledge into solving practical problems, and hence all other courses in the program support the course.

Completion methods

The course is completed as a group project. The group is together responsible for delivering a practical data science solution for a problem they have jointly identified. The group will also present the solution to the class. In addition, the course typically involves elements the student completes alone, such as analyzing a particular tool and presenting it for the rest of the course attendants as well as a study diary. The groups receive supervision from the teacher and possibly other instructors.