We use the Piazza forum for interaction in this course; please enrol yourself. Thanks!
The Master's Programme in Data Science is responsible for the course.
The course belongs to the CSM14000 - Software Systems study track module.
The course is available to students from other degree programmes.
Prerequisites in terms of knowledge
Good programming skills and familiarity with basic data models, such as the relational data model and semi-structured data models (e.g., JSON, XML).
Prerequisites for students in the Data Science programme, in terms of courses
Prerequisites for other students in terms of courses
TKT10002 Introduction to Programming
Recommended preceding courses
- Transaction management and query optimisation
- Big data framework
- Distributed data framework
Recommended time/stage of studies for completion: autumn of the first or second year of Master's studies
Term/teaching period when the course will be offered: autumn term, second period. The course is offered every year.
- Hadoop and MapReduce, HDFS
- data models, relational databases and SQL
- semi-structured data and JSON query with MongoDB
- data streaming and data lake
- data integration
- Hands-on experience with different systems, including Splunk, Hadoop, Gephi, PostgreSQL and MongoDB.
The grade is based on the sum of the points from the exercises (max. 50 points) and the exam (max. 50 points). At least 51 points are required to pass, which gives the lowest grade, 1; 91 points or more gives the highest grade, 5.
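
The grading arithmetic can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `course_result` and the returned keys are hypothetical, and because the boundaries for grades 2 to 4 are not given here, the sketch reports only the total, whether the course is passed, and whether the highest grade is reached.

```python
# Limits and thresholds taken from the grading rule above.
MAX_EXERCISE_POINTS = 50
MAX_EXAM_POINTS = 50
PASS_THRESHOLD = 51        # lowest passing total (grade 1)
TOP_GRADE_THRESHOLD = 91   # total needed for the highest grade (grade 5)


def course_result(exercise_points: float, exam_points: float) -> dict:
    """Hypothetical helper: combine exercise and exam points into a result.

    The boundaries for grades 2-4 are not stated in the course
    description, so they are deliberately not modelled here.
    """
    if not 0 <= exercise_points <= MAX_EXERCISE_POINTS:
        raise ValueError("exercise points must be between 0 and 50")
    if not 0 <= exam_points <= MAX_EXAM_POINTS:
        raise ValueError("exam points must be between 0 and 50")

    total = exercise_points + exam_points
    return {
        "total": total,
        "passed": total >= PASS_THRESHOLD,
        "top_grade": total >= TOP_GRADE_THRESHOLD,
    }


if __name__ == "__main__":
    # Example: 40 exercise points and 35 exam points.
    print(course_result(40, 35))
```

For instance, 25 exercise points plus 25 exam points total 50, which is one point short of passing.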