This course gives an overview of basic techniques for working with language data. We will introduce basic linguistic notions, issues involved in building and working with language corpora, current standard techniques for preparing text for analysis, and methods for the computational processing of a subset of language phenomena. By the end of the course, students will
(1) have an understanding of key word-level, syntactic, semantic, and discourse phenomena,
(2) be aware of issues involved in building text corpora,
(3) be familiar with typical language processing tasks addressed in the NLP community and methods of addressing them, and
(4) be able to perform tasks that are part of a standard NLP pipeline.