There are two exams, one after the first half of the semester and one at the end of the semester. Each exam lasts 80 minutes and consists of 20 questions. Each correct answer is worth 2 points, for a maximum of 40 points per exam. No supporting material is allowed.
All electronic devices must be turned off and put away before the exam starts. They must not be kept on the person or in clothing but must be packed away, e.g. in a closed bag, and must not be taken out at any point during the exam.
The better of the two exam results accounts for 40 % of the total rating.
The remaining 60 % is earned through voluntary oral presentations during the semester. There are 10 exercise sheets in total, each with problems to solve. A student can present at most one problem per exercise sheet. The student's two best presentation results each account for 30 % of the total rating.
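The weighting can be illustrated with a minimal Python sketch. It is not an official grading tool and rests on assumptions that are not stated above: each presentation result is expressed in percent (0 to 100), a missing presentation counts as 0, and the function name total_rating is hypothetical.

```python
EXAM_MAX_POINTS = 20 * 2  # 20 questions, 2 points per correct answer


def total_rating(exam_points, presentation_percents):
    """Return the total rating in percent.

    exam_points: points reached in the exams taken (each out of 40).
    presentation_percents: presentation results in percent (assumed scale).
    """
    # Only the better exam result counts, with a weight of 40 %.
    best_exam = max(exam_points, default=0) / EXAM_MAX_POINTS * 100

    # Only the two best presentations count, each with a weight of 30 %.
    best_two = sorted(presentation_percents, reverse=True)[:2]
    best_two += [0] * (2 - len(best_two))  # assumption: missing result = 0

    return 0.4 * best_exam + 0.3 * best_two[0] + 0.3 * best_two[1]


if __name__ == "__main__":
    # Exams with 30 and 36 points, presentations rated 80 %, 100 % and 60 %.
    print(total_rating([30, 36], [80, 100, 60]))
```

In this example the better exam (36 of 40 points, i.e. 90 %) and the two best presentations (100 % and 80 %) are the only results that enter the total.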
A mandatory prerequisite for this course is the successful completion of Foundations of Data Analysis.
The grading scale for the course is: 1: at least 90 %, 2: at least 80 %, 3: at least 65 %, 4: at least 50 %.
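As an illustration, the scale can be written as a small Python lookup. This is a sketch only: the function name grade is hypothetical, and since the scale lists no grade for results below 50 %, the function returns None in that case rather than guessing one.

```python
# Thresholds in percent, checked from the best grade downwards.
GRADING_SCALE = ((90, 1), (80, 2), (65, 3), (50, 4))


def grade(total_percent):
    """Map a total rating in percent to a grade according to the scale."""
    for threshold, mark in GRADING_SCALE:
        if total_percent >= threshold:
            return mark
    return None  # below 50 %: no grade is listed in the scale


if __name__ == "__main__":
    for percent in (95, 80, 64.5, 50, 49):
        print(percent, grade(percent))
```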
There are exercise sheets for the following topics: language processing and Python, accessing text corpora and lexical resources, processing raw text, writing structured programs, categorizing and tagging words, learning to classify text, extracting information from text, analyzing sentence structure, building feature-based grammars, and analyzing the meaning of sentences. The first exam covers the first five topics; the second exam covers the remaining five.
Steven Bird, Ewan Klein, and Edward Loper. Natural Language Processing with Python. O'Reilly Media, 2009. http://www.nltk.org/book/
Daniel Jurafsky and James H. Martin. Speech and Language Processing. 2nd Edition, Pearson, 2009.
Ruslan Mitkov, ed. The Oxford Handbook of Computational Linguistics. Oxford University Press, 2005.
Nitin Indurkhya and Fred J. Damerau, eds. Handbook of Natural Language Processing. 2nd Edition, Chapman and Hall/CRC, 2010.
Kai-Uwe Carstensen et al., eds. Computerlinguistik und Sprachtechnologie - Eine Einführung. 3rd Edition, Springer Spektrum, 2010 (in German).