Statistical NLP

Ling 684.02, Spring '10
MW 11:30–1:18, Journalism 387
Instructor: Michael White


In this course, students will learn the basics of probabilistic modeling and machine learning for natural language processing and computational linguistics. Along the way, students will gain experience using the Python programming language to analyze corpus data.

The course will be based primarily on the second edition of Jurafsky and Martin's textbook, Speech and Language Processing. We will also work hands-on with Bird, Klein and Loper's book, Natural Language Processing with Python, based on the Natural Language Toolkit (NLTK).


Students in the course will have the opportunity to:


Topics will include:

Time permitting, we may also look at semantic role labeling, machine translation, or statistical parsing with Combinatory Categorial Grammars (CCGs).


Prerequisite: Ling 684.01 or equivalent. The course is open to advanced undergraduate and graduate students.


Letter grades will be assigned on the standard OSU scale, based on class participation and homework assignments.


We'll be using the Carmen system for the schedule and for homework and reading assignments. There will also be discussion forums for posting questions and providing feedback (comments, complaints or ideas) during the course, anonymously if desired.

