Universität Wien

400011 SE Advanced quantitative text analysis (2024W)

Vertiefungsseminar Methoden

Prüfungsimmanente Lehrveranstaltung

Bitte beachten Sie: Voraussetzung für den Besuch von Vertiefungsseminaren ist der Abschluss der Dissertationsvereinbarung.

An/Abmeldung

Hinweis: Ihr Anmeldezeitpunkt innerhalb der Frist hat keine Auswirkungen auf die Platzvergabe (kein "first come, first served").

Details

max. 15 Teilnehmer*innen
Sprache: Englisch

Lehrende

Termine (iCal) - nächster Termin ist mit N markiert

  • Donnerstag 28.11. 13:00 - 18:00 Seminarraum 11 Vernetzungsraum für Vienna Doctoral School of Social Sciences, Kolingasse 14-16, OG01
  • Freitag 29.11. 13:00 - 18:00 Seminarraum 11 Vernetzungsraum für Vienna Doctoral School of Social Sciences, Kolingasse 14-16, OG01
  • Montag 02.12. 13:00 - 18:00 Seminarraum 11 Vernetzungsraum für Vienna Doctoral School of Social Sciences, Kolingasse 14-16, OG01
  • Mittwoch 04.12. 13:15 - 18:00 Seminarraum 10, Kolingasse 14-16, OG01

Information

Ziele, Inhalte und Methode der Lehrveranstaltung

Lehrende: Fabienne Lind, Petro Tolochko

Facing the massive volumes of text data that are available in digital format and valuing their potential, over recent years social scientists have increasingly turned to methods that rely on the support of computer power, so- called automated text analysis methods. The text-as-data methods are used to draw reproducible and valid inferences or meanings from documents. As an enhancement of the more classical manual methods of content analysis, automated methods of text analysis are becoming prevalent in disciplines that are overall increasingly computationally oriented.
This course is aimed at people with some knowledge of automated text analysis who want to use this method in their PhD and/or want to deepen their expertise in the matter.
The course covers topics related to data collection, data processing, quality control, and the critical interpretation of results.
We cover the following topics:
What kind of questions can be answered with automated text analysis
Recap Python
Text Data Collection and Selection
Pre-processing
Causal Inference in Text
Regular Expressions and Classification with Dictionaries
Machine Learning and Classification
Neural Networks and Transformer Models
Topic modeling/k-means
Text analysis and network analysis
Multilingual text analysis
Validation
Ethics and Data Security
Critical reflection on the methods
All topics are introduced with a lecture type approach and then illustrated with practical examples. In the lecture type part, we show applied cases, explain the motivation for specific methods, illustrate the mathematical background and discuss the pros and cons of different methods in respect to different research goals. The practical part consists of guided coding sessions, where we work together through prepared code, and small coding challenges (no grades), which are worked on alone or in groups.
The guided coding sessions during class will predominantly feature examples in Python. To fully engage with the course content, it's essential to possess the ability to create and manipulate vectors, data frames, and list objects, as well as to load tabular data files such as CSVs. Proficiency in performing fundamental operations like subsetting/filtering, indexing, and creating/changing columns within data frames is also crucial. If you require a refresher or a starting point to bolster your proficiency in these Python basics, please refer to the resources provided below. Additionally, most of our code examples include an equivalent version in R, which you can employ for your personal projects.
Furthermore, when it comes to submitting your final paper, you have the flexibility to choose between Python or R.

Please note: The prerequisite for participation in advanced seminars is the conclusion of the doctoral thesis agreement.

Art der Leistungskontrolle und erlaubte Hilfsmittel

The use of AI tools is permitted as an aid to coding and writing the final paper.

Mindestanforderungen und Beurteilungsmaßstab

Final paper: application of one or several automated text analysis methods on a topic related to the PhD thesis or a topic of free choice (80%)
Continuous assessment of class participation (20%)

Prüfungsstoff

Materials provided in class.

Literatur

Python Ressources

Learn Python for Everybody: https://www.youtube.com/watch?v=8DvywoWv6fI
Data Analysis with Python: https://www.youtube.com/watch?v=GPVsHOlRBBI
VanderPlas, J. (2016). Python Data Science Handbook: Essential Tools https://github.com/jakevdp/PythonDataScienceHandbook

Zuordnung im Vorlesungsverzeichnis

Letzte Änderung: Do 22.08.2024 13:06