Schedule
During the morning there will be lectures focusing on the main areas of ML and their application to NLP. These areas include but are not restricted to: Classification, Structured Prediction (sequences, trees, graphs), Parsing, Semi-Supervised Learning, and their applications to practical language processing on the Web.
For each topic introduced in the morning there will be a practical session in the afternoon, where students will have the opportunity to test the concepts in practice. The practical sessions will consist in implementation exercises (using Python, Numpy, and Matplotlib) of the methods learned during the morning, testing them on real examples.
At the end of the afternoon there will be special talks of concrete applications of the these techniques being currently used in production.
All Morning Sessions and Evening Talks will be held at Complexo Interdisciplinar. All Afternoon Labs will be held at Pavilhão de Informática.
The final schedule is shown below.
Wednesday, July 20th
09:00 - 10:30 Morning Session 1 [SLIDES]
Basic tutorials on probability theory and linear algebra
10:30 - 11:00 Coffee Break
11:00 - 12:30 Morning Session 2 [SLIDES]
Introduction to Python (instructions on how to install Python in your machine)
12:30 - 14:00 Lunch
14:00 - 17:00 Afternoon session: Pratical implementation exercises [PDF]
17:30 Welcome reception
Thursday, July 21st
09:00 - 12:30 Morning Lecture (with 30 min coffee break) [SLIDES] [VIDEO]
LECTURE 1: INTRODUCTION TO MACHINE LEARNING (KOBY CRAMMER)
‣Decision theory
‣Classification
‣Generative and discriminative models
‣Naive Bayes, logistic regression, support vector machines (SVMs)
‣Online learning: perceptron and passive-aggressive algorithms
12:30 - 14:00 Lunch
14:00 - 17:00 Afternoon Labs [PDF]
17:00 - 17:30 Coffee Break
18:00 - 19:30 Evening Talk
PRACTICAL TALK: SOCIAL MEDIA (MILES OSBORNE)
Friday, July 22ND
09:00 - 12:30 Morning Lecture (with 30 min coffee break) [SLIDES] [VIDEO]
LECTURE 2: SEQUENCE MODELS (NOAH SMITH)
‣Markov models and hidden Markov models (HMMs)
‣Dynamic programming algorithms (Viterbi and sum-product)
‣Parameter learning (MLE and Baum-Welch/EM)
‣Finite state machines and finite state transducers
12:30 - 14:00 Lunch
14:00 - 17:00 Afternoon Labs [PDF]
17:00 - 17:30 Coffee Break
18:00 - 19:30 Evening Talk
PRACTICAL TALK: NETWORK INFERENCE FROM CO-OCCURENCES (MÁRIO FIGUEIREDO)
Saturday, July 23RD
09:00 - 12:30 Morning Lecture (with 30 min coffee break) [SLIDES] [VIDEO]
LECTURE 3: LEARNING STRUCTURED PREDICTORS (XAVIER CARRERAS)
‣From HMMs to CRFs: discriminative learning and features
‣Structured perceptron, structured SVMs and max-margin Markov networks
‣Training and optimization
‣Iterative scaling, L-BFGS, perceptron, MIRA, stochastic and batch gradient descent
12:30 - 14:00 Lunch
14:00 - 17:00 Afternoon Labs [PDF]
17:00 - 17:30 Coffee Break
18:00 - 19:30 Evening Talk
PRACTICAL TALK: LANGUAGE IN THE WILD: LEARNING FROM THE WEB TO UNDERSTAND THE WEB (SLAV PETROV)
20:30 Summer School Banquet (at Mercado da Ribeira)
Sunday, July 24TH
09:00 - 12:30 Morning Lecture (with 30 min coffee break) [SLIDES] [VIDEO]
LECTURE 4: SYNTAX AND PARSING (SLAV PETROV)
‣Context-free grammars (CFGs) and phrase-based parsing
‣Dynamic programming and CKY algorithm
‣Probabilistic CFGs, parent annotation and lexicalization
‣Dependency parsing (projective and non-projective)
‣Transition and graph-based parsers
12:30 - 14:00 Lunch
14:00 - 17:00 Afternoon Labs [PDF]
17:00 - 17:30 Coffee Break
18:00 - 19:30 Evening Talk
PRACTICAL TALK: STATISTICAL MACHINE TRANSLATION (PHIL BLUNSOM)
Monday, July 25TH
09:00 - 12:30 Morning Lecture (with 30 min coffee break) [SLIDES] [VIDEO]
LECTURE 5: SEMI-SUPERVISED/UNSUPERVISED/TOPIC MODELING (JASON EISNER)
‣Exponential family and conjugate priors
‣The Bayesian perspective on finite state and context free models
‣Missing data and unsupervised learning
‣Topic models and nonparametric Bayes
‣Approximate inference: MCMC, variational methods, contrastive divergence
12:30 - 14:00 Lunch
14:00 - 17:00 Afternoon Labs [PDF]
17:00 - 17:30 Coffee Break
18:00 - 19:30 Evening Talk
PRACTICAL TALK: RICH PRIOR KNOWLEDGE IN LEARNING FOR NLP (JOÃO GRAÇA)
19:30 - 20:00 Closing Remarks
July
20-25
Instituto
Superior Técnico
http://www.ist.utl.pt
NEWS
July 7th
Detailed schedule!
APRIL 15th
Application notification extended to April 20th.
MARCH 13th
New speakers for evening talks!
Updated accomodation information!
SAPO Data Challenge offers scholarships!
February 24th
New speakers added!
Schedule information update!
February 14th
Schedule update
January 26th
The website is up!
Deadline for Application: February 1st - March 31th
Picture by Miguel Vieira
Sponsors: