Human language technologies

Code 649AA
Credits 9

Learning outcomes

The course presents principles, models and the state of the art techniques for the analysis of natural language, focusing mainly on statistical machine learning approaches and Deep Learning in particular. Students will learn how to apply these techniques in a wide range of applications using modern programming libraries.
- Formal and statistical approaches to NLP.
- Statistical methods: Language Model, Hidden Markov Model, Viterbi Algorithm, Generative vs Discriminative Models
- Linguistic essentials (tokenization, morphology, PoS, collocations, etc.).
- Parsing (constituency and dependency parsing).
- Processing Pipelines.
- Lexical semantics: corpora, thesauri, gazetteers.
- Distributional Semantics: Word embeddings, Character embeddings.
- Deep Learning for natural language.
- Applications: Entity recognition, Entity linking, classification, summarization.
- Opinion mining, Sentiment Analysis.
- Question answering, Language inference, Dialogic interfaces.
- Statistical Machine Translation.
- NLP libraries: NLTK, Theano, Tensorflow.