During the morning there will be lectures focusing on the main areas of ML and their application to NLP. These areas include but are not restricted to: Classification, Structured Prediction (sequences, trees, graphs), Parsing, Information Retrieval, and their applications to practical language processing on the Web.


For each topic introduced in the morning there will be a practical session in the afternoon, where students will have the opportunity to test the concepts in practice. The practical sessions will consist in implementation exercises (using Python, Numpy, and Matplotlib) of the methods learned during the morning, testing them on real examples.


At the end of the afternoon there will be special talks of concrete applications of the these techniques being currently used in production.


All Morning Sessions and Evening Talks will be held at Complexo Interdisciplinar. All Afternoon Labs will be held at Pavilhão de Informática.


The tentative schedule is shown below.





THURSDAY, JULY 19TH


09:00 - 10:30    Morning Session 1


Basic tutorials on probability theory and linear algebra (MARIO FIGUEIREDO)


[Download PDF]


10:30 - 11:00    Coffee Break


11:00 - 12:30    Morning Session 2


Introduction to Python (LUIS PEDRO COELHO)

[Download PDF]
[instructions on how to install Python in your machine]


12:30 - 14:00    Lunch


14:00 - 17:00    Afternoon session: Pratical implementation exercises


17:00                Parallel Seminar: "Deeper QA: CMU, Watson, and the Open
                         Advancement of Question Answering"
                        (Eric Nyberg, Carnegie Mellon University)


18:00                Welcome reception





FRiday, July 20TH


09:00 - 12:30    Morning Lecture (with 30 min coffee break)


LECTURE 1: INTRODUCTION TO MACHINE LEARNING (KOBY CRAMMER)

  1. Decision theory

  2. Classification

  3. Generative and discriminative models

  4. Naive Bayes, logistic regression, support vector machines (SVMs)

  5. Online learning: perceptron and passive-aggressive algorithms

    [Download PPS part 1]
    [
    Download PPS part 2]
    [
    Download PPS part 3]


  1. [WATCH VIDEO] (2011)


12:30 - 14:00    Lunch


14:00 - 17:00    Afternoon Labs


17:00 - 17:30    Coffee Break


18:00 - 19:30    Evening Talk


PRACTICAL TALK: Text and Social Context:  Analysis and Prediction

(NOAH SMITH)

[Download PDF]


[WATCH VIDEO] (2012)





SATURDAY, JULY 21ST

09:00 - 12:30    Morning Lecture (with 30 min coffee break)


LECTURE 2: SEQUENCE MODELS (NOAH SMITH)

[Download PDF]


[WATCH VIDEO] (2011)


  1. Markov models and hidden Markov models (HMMs)

  2. Dynamic programming algorithms (Viterbi and sum-product)

  3. Parameter learning (MLE and Baum-Welch/EM)

  4. Finite state machines and finite state transducers


12:30 - 14:00    Lunch


14:00 - 17:00    Afternoon Labs


17:00 - 17:30    Coffee Break


18:00 - 19:30    Evening Talk


PRACTICAL TALK: Social meanings from social media (JACOB EISENSTEIN)

[Download PDF]


[WATCH VIDEO] (2012)


20:30                Summer School Banquet


Restaurante Casa do Alentejo

Rua das Portas Santo Antão 58  1150 Lisbon

phone: 213 405 140

(Location on Google Maps)

                        





SUNDAY, JULY 22ND

09:00 - 17:00    Free Day





MONDAY, JULY 23RD

09:00 - 12:30    Morning Lecture (with 30 min coffee break)


LECTURE 3: LEARNING STRUCTURED PREDICTORS (XAVIER CARRERAS)

[Download PDF]


[WATCH VIDEO] (2011)


  1. From HMMs to CRFs: discriminative learning and features

  2. Structured perceptron, structured SVMs and max-margin Markov networks

  3. Training and optimization

  4. Iterative scaling, L-BFGS, perceptron, MIRA, stochastic and batch gradient descent


12:30 - 14:00    Lunch


14:00 - 17:00    Afternoon Labs


17:00 - 17:30    Coffee Break


18:00 - 19:30    Evening Talk


PRACTICAL TALK: Recommender Systems at LinkedIn (PAUL OGILVIE)

[Download PDF]


[WATCH VIDEO] (2012)





TUESDAY, JULY 24TH

09:00 - 12:30    Morning Lecture (with 30 min coffee break)


LECTURE 4: SYNTAX AND PARSING (SLAV PETROV)



  1. Context-free grammars (CFGs) and phrase-based parsing

  2. Dynamic programming and CKY algorithm

  3. Probabilistic CFGs, parent annotation and lexicalization

  4. Dependency parsing (projective and non-projective)

  5. Transition and graph-based parsers


[Download PDF - Part 1]
[
Download PDF - Part 2]


[WATCH VIDEO] (2011)


12:30 - 14:00    Lunch


14:00 - 17:00    Afternoon Labs


17:00 - 17:30    Coffee Break


18:00 - 19:30    Evening Talk


PRACTICAL TALK: Understanding All the World's Languages (SLAV PETROV)





WEDNESDAY, JULY 25TH

09:00 - 12:30    Morning Lecture (with 30 min coffee break)


LECTURE 5: We Mine Your Life (MAARTEN DE RIJKE)


[WATCH VIDEO] (2012)


12:30 - 14:00    Lunch


14:00 - 17:00    Afternoon Labs


17:00 - 17:30    Coffee Break


18:00 - 19:30    Evening Talk


PRACTICAL TALK: GRAPH-BASED SEMI-SUPERVISED LEARNING (PARTHA TALUKDAR)


[Download PDF]


[WATCH VIDEO] (2012)


19:30 - 20:00    Closing Remarks









 
 

Schedule

 

July
19-25

Instituto
Superior Técnico
http://www.ist.utl.pt

Sponsors:

NEWS


JULY 18TH
In the scope of the Carnegie Mellon | Portugal joint doctoral program on Language Technologies,  Eric Nyberg will give a talk at Instituto superior Técnico entitled:


Deeper QA: CMU, Watson, and the Open Advancement of Question Answering




MAY 16TH
Notifications were sent.

Registration is open.




May 4TH
Lisbon was featured in Anthony Bourdain’s “No Reservations” food and travel show.




APRIL 16TH
Application deadline extended to April 30!




APRIL 10TH
Updated Scholarship information.




FEBRUARY 21ST
Website Online!