Universität Wien FIND

Due to the COVID-19 pandemic, changes to courses and exams may be necessary at short notice (e.g. cancellation of on-site teaching and conversion to online exams). Register for courses/exams via u:space, find out about the current status on u:find and on the moodle learning platform. NOTE: Courses where at least one unit is on-site are currently marked "on-site" in u:find.

Further information about on-site teaching and access tests can be found at https://studieren.univie.ac.at/en/info.

053621 VU Mining Massive Data (2021S)

Continuous assessment of course work
Mo 21.06. 09:45-11:15 Digital

Registration/Deregistration

Note: The time of your registration within the registration period has no effect on the allocation of places (no first come, first serve).

Details

max. 25 participants
Language: English

Lecturers

Classes (iCal) - next class is marked with N

Monday 01.03. 09:45 - 11:15 Digital
Friday 05.03. 09:45 - 11:15 Digital
Monday 08.03. 09:45 - 11:15 Digital
Monday 15.03. 09:45 - 11:15 Digital
Friday 19.03. 09:45 - 11:15 Digital
Monday 22.03. 09:45 - 11:15 Digital
Friday 26.03. 09:45 - 11:15 Digital
Monday 12.04. 09:45 - 11:15 Digital
Friday 16.04. 09:45 - 11:15 Digital
Monday 19.04. 09:45 - 11:15 Digital
Friday 23.04. 09:45 - 11:15 Digital
Monday 26.04. 09:45 - 11:15 Digital
Friday 30.04. 09:45 - 11:15 Digital
Monday 03.05. 09:45 - 11:15 Digital
Friday 07.05. 09:45 - 11:15 Digital
Monday 10.05. 09:45 - 11:15 Digital
Friday 14.05. 09:45 - 11:15 Digital
Monday 17.05. 09:45 - 11:15 Digital
Friday 21.05. 09:45 - 11:15 Digital
Friday 28.05. 09:45 - 11:15 Digital
Monday 31.05. 09:45 - 11:15 Digital
Friday 04.06. 09:45 - 11:15 Digital
Monday 07.06. 09:45 - 11:15 Digital
Friday 11.06. 09:45 - 11:15 Digital
Monday 14.06. 09:45 - 11:15 Digital
Friday 18.06. 09:45 - 11:15 Digital
Friday 25.06. 09:45 - 11:15 Digital
Monday 28.06. 09:45 - 11:15 Digital

Information

Aims, contents and method of the course

Goals:
Upon successful participation in the course, students will understand principles of state-of-the-art techniques for learning from massive data and can apply and evaluate those techniques in practical applications.

Lecture Contents:
* Dealing with large data (e.g., Map-Reduce, Spark)
* Fast nearest neighbor methods (e.g., Locality Senistive Hashing)
* Scalable Supervised Learning, Online learning
* Active learning
* Dimension reduction
* Clustering
* Bandits
* Recommender systems

Method:
Lecture (recorded lectures will be made available via Moodle) + pen & paper exercises and their discussion
+ programming exercises

Assessment and permitted materials

Written exam
Programming exercises
Pen & paper exercises and their (live) discussion

Minimum requirements and assessment criteria

It is recommended that students attending this course have solid basic knowledge in statistics, algorithms and programming.

30% Written exam
40% Programming exercises
30% Pen & paper exercises (with minor programming exercises)

P = Average percentage on the written exam, the programming exercises, and the pen & paper exercises

85% <= P <= % Sehr Gut (1)
74% <= P < 85% Gut (2)
62% <= P < 74% Befriedigend (3)
50% <= P < 62% Genügend (4)
0% <= P < 50% Nicht Genügend (5)

At least 50% on the written exam, 50% on the programming exercises, and 50% on the pen&paper exercises must be achieved for a passing grade.

Examination topics

The presented topics in the lecture (according to slides + exercises). Referenced Literature (as indicated in detail on lecture slides).

Reading list

Ester M., Sander J. Knowledge Discovery in Databases: Techniken und Anwendungen.
J. Leskovec, A. Rajaraman, J. Ullman. Mining of Massive Datasets.
J. Han, M. Kamber, J.Pei.Data Mining: Concepts and Techniques.
I. H. Witten , E. Frank, M. A. Hall. Data Mining: Practical Machine Learning Tools and Techniques.
+ papers mentioned lecture slides

Association in the course directory

Modul: MMD

Last modified: Mo 01.03.2021 12:48