220050 SE SE Advanced Data Analysis 2 (2024S)
Continuous assessment of course work
Labels
Registration/Deregistration
Note: The time of your registration within the registration period has no effect on the allocation of places (no first come, first served).
- Registration is open from Mo 19.02.2024 09:00 to We 21.02.2024 18:00
- Deregistration possible until We 21.02.2024 18:00
Details
max. 30 participants
Language: English
Lecturers
Classes (iCal) - next class is marked with N
We will meet on Zoom for the first session.
- Friday 01.03. 15:00 - 18:00 Seminarraum 6, Kolingasse 14-16, EG00
- Friday 15.03. 15:00 - 18:00 Seminarraum 6, Kolingasse 14-16, EG00
- Friday 22.03. 15:00 - 18:00 Seminarraum 6, Kolingasse 14-16, EG00
- Friday 19.04. 15:00 - 18:00 Seminarraum 6, Kolingasse 14-16, EG00
- Friday 03.05. 15:00 - 18:00 Seminarraum 6, Kolingasse 14-16, EG00
- Friday 28.06. 15:00 - 18:00 Seminarraum 6, Kolingasse 14-16, EG00
Information
Aims, contents and method of the course
Assessment and permitted materials
Throughout the semester, participants will engage in two coding challenges designed to reinforce their comprehension of topics covered in class and practiced during guided coding sessions.
In addition to the coding challenges, the course evaluation will be based on a group project presentation and an accompanying written report. This project involves the practical application of learned techniques to analyze a provided text dataset through secondary data analysis. Detailed instructions and supplementary information will be shared during class sessions to ensure students are well-equipped for the assignment.
Course Evaluation Breakdown:
1. Two Coding Challenges (40%)
2. Group Presentation of Data Analysis Project (30%)
3. Written Report of Data Analysis Project (30%)
In addition to the coding challenges, the course evaluation will be based on a group project presentation and an accompanying written report. This project involves the practical application of learned techniques to analyze a provided text dataset through secondary data analysis. Detailed instructions and supplementary information will be shared during class sessions to ensure students are well-equipped for the assignment.
Course Evaluation Breakdown:
1. Two Coding Challenges (40%)
2. Group Presentation of Data Analysis Project (30%)
3. Written Report of Data Analysis Project (30%)
Minimum requirements and assessment criteria
• Engage actively in class discussions and activities.
• Maintain a minimum attendance of 75% throughout the course.
• Proficiency in the programming language R and the utilization of RStudio are fundamental prerequisites for participation in this course. Ensure you possess basic practical knowledge in these tools.
• Dedicate time to independent study and exercises to successfully achieve the course objectives.
• Kindly bring your personal laptop to each class session, and ensure that both R and RStudio are installed before the commencement of the first class. This is a mandatory requirement for seamless participation in the course.
• Maintain a minimum attendance of 75% throughout the course.
• Proficiency in the programming language R and the utilization of RStudio are fundamental prerequisites for participation in this course. Ensure you possess basic practical knowledge in these tools.
• Dedicate time to independent study and exercises to successfully achieve the course objectives.
• Kindly bring your personal laptop to each class session, and ensure that both R and RStudio are installed before the commencement of the first class. This is a mandatory requirement for seamless participation in the course.
Examination topics
Examination topics consist of the content of the learning units.Required knowledge and practical skills will be conveyed during the lectures.The slides used during the lectures will be shared on Moodle.Additional readings will also be shared on Moodle.
Reading list
Resources to refresh your R skills before the course:
R for Data Science https://r4ds.hadley.nz/
Tidy Modeling with R. https://www.tmwr.org/
Intro text to read before the first class:
Grimmer, J., & Stewart, B. M. (2013). Text as data: The promise and pitfalls of automatic content analysis methods for political texts. Political analysis, 21(3), 267-297.
R for Data Science https://r4ds.hadley.nz/
Tidy Modeling with R. https://www.tmwr.org/
Intro text to read before the first class:
Grimmer, J., & Stewart, B. M. (2013). Text as data: The promise and pitfalls of automatic content analysis methods for political texts. Political analysis, 21(3), 267-297.
Association in the course directory
Last modified: Mo 05.02.2024 10:46
• Data collection and data selection
• Feature Selection
• Text Representation
• Rule-based classification
• Supervised classification
• Unsupervised approaches
• Intro advanced methods (e.g., Transformers, Embeddings, Multilingual text analysis)All topics are introduced with a lecture type approach and then illustrated with practical examples. The lecture part consists of input by the instructor (i.e., covering the basics of each topic, highlighting latest methods research, and introducing resources) and shorter interactive parts (i.e., reflection and discussions on different methods in plenary and small groups). The practical part consists of guided coding sessions, where we work together through prepared code.By the end of this course participants will be able to:• make an informed decision about a suitable method for a given application scenario.
• practically apply various basic text as data methods.
• critically evaluate results.