Instruction
Name | Cr | Method of study | Time | Location | Organiser |
---|---|---|---|---|---|
Data Science Project I | 5 Cr | Lecture Course | 19.1.2021 - 4.5.2021 | | |
Data Science Project I | 5 Cr | Laboratory work | 14.1.2020 - 28.4.2020 | | |
Data Science Project I | 5 Cr | Laboratory work | 12.9.2019 - 12.12.2019 | | |
Data Science Project I | 5 Cr | Laboratory work | 15.1.2019 - 30.4.2019 | | |
Data Science Project I | 5 Cr | Laboratory work | 10.9.2018 - 10.12.2018 | | |
Data Science Project I | 5 Cr | Lecture Course | 16.1.2018 - 1.5.2018 | | |
Data Science Project I | 5 Cr | Laboratory work | 11.9.2017 - 11.12.2017 | | |
Target group
The Master's Programme in Data Science is responsible for the course.
The course belongs to the Data Science Methods module.
The course is primarily intended for students of the Master's Programme in Data Science. Other students can enrol, but if the course fills up, preference is given to Data Science students.
Prerequisites
Prerequisites in terms of knowledge
Software development skills at a level sufficient for working as part of a larger software development team (good programming skills, version control, etc.), for example as obtained in a Bachelor's degree in Computer Science. Some background in modeling data: no specific set of algorithms is required, but students should be familiar with the basic process of learning models from data and evaluating their accuracy, and should know some practical models or algorithms that can be applied to such tasks.
Prerequisites for students in the Data Science programme, in terms of courses
DATA11002 Introduction to Machine Learning (or DATA12002 Probabilistic Graphical Models)
Prerequisites for other students in terms of courses
Good programming skills; DATA11001 Introduction to Data Science; at least one of: DATA11002 Introduction to Machine Learning, DATA20001 Deep Learning, DATA12002 Probabilistic Graphical Models
Recommended preceding courses
None
Learning outcomes
Solve a practical data science challenge as part of a group, taking responsibility for individual elements of a bigger project while actively interacting with the group towards a common goal.
Identify a need for a data-driven service in a given context (typically a data source or device that produces data), choose suitable tools for solving the problem, and deliver a functioning service that fills the need.
Deal with challenges associated with working on real data and the limitations of available data science tools.
Understand how to use practical data science tools and assess their usability with reasoned arguments.
Apply theoretical knowledge learned during other courses in practice.
Timing
Recommended time/stage of studies for completion: First year spring.
Term/teaching period when the course will be offered: offered in both autumn (periods I-II) and spring (periods III-IV).
Contents
Application of data science skills in producing a practical data science product or service. The detailed content, such as algorithms and tools used for creating the solution, depends on the practical problem and domain chosen by the group.
Activities and teaching methods in support of learning
The course combines instruction by the lecturer, presentations by the students, and long-term group work. The details of the supervision of the group work are determined case by case. The students write a study diary analysing and reflecting on their learning during the course.
Study materials
The course material is provided as lecture notes, slides and links to external sources.
Assessment practices and criteria
The grading scale is 1-5.
The grade is based on active participation in the group work, demonstrable individual contributions to the final result, the quality and complexity of the solution and its presentation, and the quality of the work carried out individually rather than as a group member, such as a tool presentation and the study diary.
Recommended optional studies
The project is about applying theoretical knowledge to solving practical problems, and hence all other courses in the programme support the course.
Completion methods
The course is completed as a group project. The group is jointly responsible for delivering a practical data science solution to a problem its members have identified together. The group also presents the solution to the class. In addition, the course typically involves elements that each student completes alone, such as analyzing a particular tool and presenting it to the other course participants, and keeping a study diary. The groups receive supervision from the teacher and possibly other instructors.