The course schedule and further information is available on the course page on https://big-data-platforms-f20.mooc.fi/ , with first lecture on 1st of September 2020 at 10:00-11:30 Finnish time.
The materials will be available on the MOOC.fi portal on the course homepage at: https://big-data-platforms-f20.mooc.fi/
See the course page on https://big-data-platforms-f20.mooc.fi/
Free of charge.
WHEN - No registration before the course begins. You will be given instructions for Open University course enrollment through email after you have completed the course assignments in the MOOC online learning environment. Course assignments, course exam, and Open University course enrollment must be completed by 31.12.2020 to be eligible for credits. Please see details on course home exercise deadlines and exam schedule in the learning environment.
WHY - Open University course enrollment is required be eligible for the ECTS credits. The ECTS entered in the University of Helsinki’s student records. Course materials (without ECTS) are available to everyone without officially enrolling on the course.
WHO - You can enroll on the course through the electronic enrollment form if you meet one of the following criteria:
- You have a Finnish personal identity number (format: ddmmyyyy-xxxx), or
- you are a student at the University of Helsinki, or
- you are a student at a HAKA member institution.
- If you enter erroneous information when enrolling, we cannot register your credits.
The course credits will be registered within 4 to 6 weeks of enrollment.
Open University reserves the right to make changes to the study programme.
The course is suitable to those who are interested in big data platforms employed in cloud computing and have previous knowledge in programming, database systems and command line tools.
Optional course in Data Science Master's Program. Also suitable for Computer Science Master's Program students.
The course is suitable to University of Helsinki exchange students.
To attend this course, you must have
- basic programming skills (Python),
- skills to work with command line tools in Linux, and
- basic knowledge in database systems (SQL).
Basic courses in programming and computer architecture are helpful to make the contents of the course easier to approach. However, apart from basic programming skills the course is self-contained.
This course focuses on big data platforms and on key algorithmic ideas and methods used to implement them.
After completing this course you are able to
- list many of the key technologies used in big data processing,
- select suitable methods for solving challenging big data processing tasks using cloud computing technologies, and
- to compare the scalability and fault tolerance implications of using the selected methodologies.
First or second year of Master's curriculum.
Autumn term, period 1.
Advanced topics in cloud computing with emphasis on scalable big data platforms employed in cloud computing. Key cloud technologies and their algorithmic background.
Main topics are
- distributed computing,
- Warehouse-Scale Computers,
- fault tolerance in distributed systems,
- distributed file systems,
- distributed batch processing with the MapReduce and the Apache Spark computing frameworks, and
- distributed cloud based databases.
The course material will consist of lecture materials and exercises provided by the lecturer.
As optional reading the students can read the following freely available textbook that unfortunately covers only a part of the course contents but also has extra material beyond the course scope:
Luiz André Barroso, Urs Hölzle, Parthasarathy Ranganathan:
The Datacenter as a Computer: Designing Warehouse-Scale Machines, Third Edition
Synthesis Lectures on Computer Architecture
October 2018, 189 pages,
Course will be a MOOC course with contents available in the Web, automated home assignment for practical hands-on use of Big Data platforms such as Spark.
Both home exercises and an exam are required to pass the course.
Taking this course does not give students access to a University of Helsinki user account. If you want to view and share information about your studies after completing the course, sign up for the Oma Opintopolku -service maintained by the Finnish National Agency for Education. The ECTS will be displayed within two days in the Oma Opintopolku -service after credits have been registered to the University of Helsinki. To register for the Oma Opintopolku – service you must identify yourself by using Finnish bank identification codes, mobile certificate or certificate card.
This course is a massive open online course (MOOC), which means that the course material is available to everyone. For further information about the course, please visit the online learning environment. The online learning environment contains the material and instructions necessary for completing the course.
The course involves assignments and an exam that are completed in the online learning environment.
How to participate?
1. Complete the assignments in the online learning environment. Course materials (without ECTS) are available to everyone without officially enrolling on the course. Please see details on course home exercise deadlines in the learning environment.
2. Complete the online exam. Please see details on exam schedule in the learning environment.
3. Enroll on the course through the Open University to receive credits. You will be given instructions for Open University course enrollment through email after you have completed the course assignments and the exam in the MOOC online learning environment.
Prof. Keijo Heljanko
Optional course in Data Science Master's Program.