Schedule

During the morning there will be lectures focusing on the main areas of ML and their application to NLP. These areas include but are not restricted to: Classification, Structured Prediction (sequences, trees, graphs), Parsing, Information Retrieval, and their applications to practical language processing on the Web.

For each topic introduced in the morning there will be a practical session in the afternoon, where students will have the opportunity to test the concepts in practice. The practical sessions will consist in implementation exercises (using Python, Numpy, and Matplotlib) of the methods learned during the morning, testing them on real examples.

At the end of the afternoon there will be special talks of concrete applications of the these techniques being currently used in production.

All Morning Sessions and Evening Talks will be held at the Complexo Interdisciplinar. All Afternoon Labs will be held at Pavilhão de Informática. Here you can find the detailed location.

The tentative schedule is shown below.

WEDNESDAY, JULY 15TH

18:00 – 20:00 Early Bird Registration and Meet-up at Complexo Interdisciplinar
NOTE: If you can come to the early registration, do so.

THURSDAY, JULY 16TH

08:00 – 09:00 Registration at Complexo Interdisciplinar

09:00 – 10:30 Morning Session 1

BASIC TUTORIALS ON PROBABILITY THEORY AND LINEAR ALGEBRA (MARIO FIGUEIREDO)

10:30 – 11:00 Coffee Break
11:00 – 12:30 Morning Session 2

INTRODUCTION TO PYTHON (LUIS PEDRO COELHO)

12:30 – 13:30 Lunch
13:30 – 16:30 Afternoon session: Introduction to the Labs and Python

16:30 Welcome reception

FRIDAY, JULY 17TH

09:00 – 12:30 Morning Lecture (with 30 min coffee break)

LECTURE 1: INTRODUCTION TO MACHINE LEARNING: LINEAR CLASSIFIERS (SHAY COHEN)

  • Decision theory
  • Classification
  • Generative and discriminative models
  • Naive Bayes, logistic regression, support vector machines (SVMs)
  • Online learning: perceptron and passive-aggressive algorithms

12:30 – 14:00 Lunch
14:00 – 17:00 Afternoon Labs: Classification (additional material can be found here)
17:00 – 17:30 Coffee Break
17:30 – 18:30 Evening Talk

PRACTICAL TALK: LARGE-SCALE LANGUAGE GROUNDING WITH VISION: (YEJIN CHOI)

SATURDAY, JULY 18TH

09:00 – 12:30 Morning Lecture (with 30 min coffee break)

LECTURE 2: SEQUENCE MODELS (NOAH SMITH)

  • Markov models and hidden Markov models (HMMs)
  • Dynamic programming algorithms (Viterbi and sum-product)
  • Parameter learning (MLE and Baum-Welch/EM)
  • Finite state machines and finite state transducers

12:30 – 14:00 Lunch
14:00 – 17:00 Afternoon Labs: Sequence Models
17:00 – 17:30 Coffee Break
17:30 – 18:30 Evening Talk

PRACTICAL TALK: STATISTICAL MACHINE TRANSLATION (LUCIA SPECIA)

20:00 Summer School Banquet at Casa do Alentejo:

SUNDAY, JULY 19TH

Free Day!

MONDAY, JULY 20TH

09:00 – 12:30 Morning Lecture (with 30 min coffee break)

LECTURE 3: LEARNING STRUCTURED PREDICTORS (XAVIER CARRERAS)

  • From HMMs to CRFs: discriminative learning and features
  • Structured perceptron, structured SVMs and max-margin Markov networks
  • Training and optimization
  • Iterative scaling, L-BFGS, perceptron, MIRA, stochastic and batch gradient descent

12:30 – 14:00 Lunch
14:00 – 17:00 Afternoon Labs: Structured Predictors
17:00 – 17:30 Coffee Break
17:30 – 18:30 Evening Talk

PRACTICAL TALK: MEANING IN THE WILD (FERNANDO PEREIRA)

TUESDAY, JULY 21TH

09:00 – 12:30 Morning Lecture (with 30 min coffee break)

LECTURE 4: SYNTAX AND PARSING (Part1, Part2) (SLAV PETROV)

  • Context-free grammars (CFGs) and phrase-based parsing
  • Dynamic programming and CKY algorithm
  • Probabilistic CFGs, parent annotation and lexicalization
  • Dependency parsing (projective and non-projective)
  • Transition and graph-based parsers

12:30 – 14:00 Lunch
14:00 – 17:00 Afternoon Labs: Parsing
17:00 – 20:00 LxMLS Demo Day

WEDNESDAY, JULY 22ND

09:00 – 12:30 Morning Lecture (with 30 min coffee break)

LECTURE 5:LEARNING WITH BIG DATA (CHRIS DYER)

12:30 – 14:00 Lunch
14:00 – 17:00 Afternoon Labs: Big Data with Python-MrJob
17:00 – 17:30 Coffee Break
17:30 – 18:30 Evening Talk

PRACTICAL TALK: MULTILINGUAL WORD SENSE DISAMBIGUATION AND ENTITY LINKING (ROBERTO NAVIGLI)

THURSDAY, JULY 23RD

09:00 – 12:30 Morning Lecture (with 30 min coffee break)

LECTURE 6: DEEP LEARNING (YOSHUA BENGIO)

12:30 – 14:00 Lunch
14:00 – 17:00 Afternoon Labs: Deep learning with Python-Theano
17:00 – 17:30 Coffee Break
17:30 – 18:30 Evening Talk

PRACTICAL TALK: TEACHING MACHINES TO READ AND COMPREHEND (PHIL BLUNSOM)

18:30 – 19:00 Closing Remarks