Universität Wien

050055 VU Applied Social Network Analysis using Advanced Data Mining (2016W)

Continuous assessment of course work

Der Vortragende ist Associate Professor am Department of Industrial Engineering, Universidad de Chile, Chile

Unterrichtssprache ist Englisch. Wenn alle Teilnehmerinnen über ausreichende Deutschkenntnisse verfügen, wird auf Wunsch Deutsch unterrichtet.

Registration/Deregistration

Note: The time of your registration within the registration period has no effect on the allocation of places (no first come, first served).

Details

max. 25 participants
Language: English

Lecturers

Classes (iCal) - next class is marked with N

  • Monday 09.01. 09:00 - 12:00 PC-Unterrichtsraum 1, Währinger Straße 29 1.UG
  • Monday 09.01. 14:00 - 17:00 PC-Unterrichtsraum 1, Währinger Straße 29 1.UG
  • Tuesday 10.01. 09:00 - 12:00 PC-Unterrichtsraum 1, Währinger Straße 29 1.UG
  • Wednesday 11.01. 09:00 - 12:00 PC-Unterrichtsraum 1, Währinger Straße 29 1.UG
  • Thursday 12.01. 09:00 - 12:00 PC-Unterrichtsraum 1, Währinger Straße 29 1.UG
  • Friday 13.01. 09:00 - 12:00 PC-Unterrichtsraum 1, Währinger Straße 29 1.UG
  • Monday 16.01. 09:00 - 12:00 PC-Unterrichtsraum 5, Währinger Straße 29 2.OG
  • Tuesday 17.01. 09:00 - 12:00 PC-Unterrichtsraum 5, Währinger Straße 29 2.OG
  • Wednesday 18.01. 09:00 - 12:00 PC-Unterrichtsraum 5, Währinger Straße 29 2.OG
  • Thursday 19.01. 09:00 - 12:00 PC-Unterrichtsraum 5, Währinger Straße 29 2.OG
  • Friday 20.01. 09:00 - 12:00 PC-Unterrichtsraum 6, Währinger Straße 29 2.OG

Information

Aims, contents and method of the course

"Data is the new oil" is just one of the sayings that underline the rapidly increasing importance of data and their adequate analysis for today´s businesses, but also for the entire society and mankind. Areas such as finance and marketing in companies, but also health care, security, astronomy, and environmental issues will gain a lot in the future by applying the most appropriate techniques to analyze the ever-increasing flood of available data. Techniques from different disciplines, such as statistics, artificial intelligence, social network analysis, computer science, among others are used to find useful pattern in "big data". The first part of this course (week 1) will provide students with solid knowledge about state-of-the-art techniques from data mining. Real-world applications and software tools will offer the ability to apply the respective methods. A class on privacy and ethical issues will also discuss boundaries of the respective technology.

The second part of this course (week 2) is dedicated to the analysis of Social Networks that play an increasing role in our society. Facebook and Twitter are just two such internet sites where users can network. Many traditional business decisions will be influenced by social network analysis (SNA). Loan granting or marketing campaigns are just two examples. But also less traditional areas, such as e.g. investigation of organized crime can benefit from this relatively new approach. The second part of this course first lays the foundation for social network analysis. Then the main topics related to SNA will be introduced. Applications with real-world data from social networks using the respective software tools will conclude the course.

Course syllabus

Week 1:
KDD process, data preprocessing, feature selection, data mining methods (clustering, neural networks, decision trees, Support Vector Machines), applications (marketing, fraud detection, risk analysis), privacy and ethical issues, current research in data mining (dynamic data mining, combination with game theory).

Week 2:
Representation of social networks, network centrality metrics, community detection, applications (churn detection in telecommunication, gang crime), classification and prediction in social networks, privacy issue in social network analysis.

Course objectives

This course enables the participants to analyze data effectively in different situations; special emphasis will be given to social network analysis. It introduces recent methodological developments from Data Mining as well as from Social Network Analysis (SNA) and presents representative applications from different areas. Students will get practical experience via group work with real-world cases as well as research insights based on discussions of relevant publications.

Instructional methods

Lectures, case discussions, group presentations, and general discussions.

Assessment and permitted materials

Assessment will comprise group work including student presentation (weight: 45%), an individual written exam (60 minutes) (weight: 45%) on day 10, and four short exams (total weight 10%).

Minimum requirements and assessment criteria

Prerequisites:
All the concepts that are used in the course will be defined, explained, and discussed during the course.
Solid command of English.
Basic knowledge of statistics.

Examination topics

Reading list

Week 1:

Essential Readings:

Baesens, B. (2014): Analytics in a Big Data World: The Essential Guide to Data Science and its Applications. John Wiley & Sons.
Fayyad, U., Piatetsky-Shapiro, G., Smyth, P. (1996): From Data Mining to Knowledge Discovery in Databases. AI Magazine Fall 1996, 37-54

Additional Readings:

Han, J., Kamber, M. (2006): Data Mining: Concepts and Techniques, 2nd ed., The Morgan Kaufmann Series in Data Management Systems, Jim Gray, Series Editor, Morgan Kaufmann Publishers.
Additional papers will be provided by the lecturer.

Week 2:

Essential Readings:

Bonchi, F., Castillo, C., Gionis, A., Jaimes, Al. (2011): Social Network Analysis and Mining for Business Applications. ACM Transactions on Intelligent Systems and Technology, Vol. 2, No. 3
Nettleton, D. F.(2013): Data mining of social networks represented as graphs. Computer Science Review 7,1-34
S. Wasserman, Social network analysis: Methods and applications, Vol. 8, Cambridge University Press

Additional Readings:

Pinheiro, Carlos A.R. (2011): Social Network Analysis in Telecommunications. John Wiley & Sons.
Ray M. Chang, Robert J. Kauffman, Young Ok Kwon (2014): Understanding the paradigm shift to computational social science in the presence of big data. Decision Support Systems 63, 6780.
M. Wang, W. Pan, A comparative study of network centrality metrics in identifying key classes in software, Journal of Computational Information Systems 8 (24) (2012) 1020510212.

Association in the course directory

Last modified: Mo 07.09.2020 15:29