UC Berkeley School of Information

IS 240: Information Retrieval

Outline and Schedule

This is a preliminary outline for the course. It is expected to change during the semester.

(Week. Content -- dates -- readings)

  1. Introduction to Course & IR history -- (Jan 21) -- Readings: Preface in Manning Papers by Joyce and Needham (PDF); Luhn (PDF); Doyle (PDF)in Readings

  2. Key Concepts in IR -- (Jan 26, 28) -- Readings: Chapter 2, 3, 4 in Manning; Maron and Kuhns (PDF); Cleverdon (PDF); Salton and Lesk (PDF); Hutchins (PDF); Saracevic (PDF) in Readings from Lecture 3

  3. IR Models: Boolean and Extensions -- (Feb 2, 4) -- Readings: Ch. 1, 2, 3, 4, 5 in Manning; - McCune, Tong & Dean "RUBRIC..." (PDF)

  4. IR Models: Vector Space -- (Feb 9, 11) -- Readings: Ch. 6, 7 in Manning; - Salton, Wong & Yang "A Vector Space Model..." (PDF); Salton & Buckley "Term-Weighting Approaches..." (PDF); Salton & McGill "SMART and SIRE..." (PDF); Additional Readings: Salton (Science) (PDF); Singhal, Buckley & Mitra (Pivoted) (PDF); Raghavan & Wong (PDF)

    • Slides from Lecture 6 download
    • Slides from Lecture 7 download
    • Mini-TREC database and previous queries made available (Feb 11)
  5. Vector Space and Clustering -- (Feb 18) -- Readings: Ch. 16 and 17 in Manning;

    • Monday Feb. 16 is President's Day Holiday
    • Slides from Lecture 8 download
  6. IR Models: Probabilistic Models and Language Models -- (Feb 23, 25) -- Readings: Ch. 11 in Manning; - Robertson "The Probability Ranking Principle in IR"(PDF); Belkin, et al. "ASK for IR"(PDF); Croft & Harper "Using Probabilistic..." (PDF); Robertson & Walker "Some Simple Effective Approximations..." (PDF); Turtle & Croft "Inference Networks" (PDF);

    • Slides from Lecture 9 download
    • Note: No Class on Feb. 25
  7. Probabilistic cont & IR Evaluation and Measures -- (Mar 2, 4) -- Readings: Ch. 8 in Manning; Ch. 4 in Readings in IR Additional Readings: Cooper, Gey, Dabney "Staged Logistic Regression" (PDF); Ponte & Croft ;Lavrenko & Croft; Hiemstra, Robertson & Zaragoza from Lectures 9 & 10   

    Mini-TREC group reports on system setup

  8. No Class -- (Mar 9, 11) -- Readings: Ch. ?

  9. Evaluation -- Cont. -- (Mar 16, 18) -- Readings: Ch. 8 in Manning;& Salton and Buckley "...Relevance Feedback...", Griffiths, Luckhurst and Willett "Using Interdocument Similarity...".

    Mini-TREC New queries for testing will be made available.

  10. Spring Break: No Class -- (Mar 23, 25) -- Readings: Ch. ?

  11. IR Components - Intro to GIR -- (Mar 30, Apr 1) -- Readings: Ch. 9, 10 in Manning;

    • Slides from Lecture 14 download
    • Slides from Lecture 15 download
    • Also on Apr. 1, Guest lecture by Kazutoshi Sumiya, University of Hyogo, Japan
  12. LSI & Filtering and Routing -- (Apr 6, Apr 8) -- Readings: Ch. 18 in Manning;

  13. Digital Libraries & GIR -- (Apr 13, 15) -- Readings:

    • Slides from Lecture 18 download
    • No Class on Wednesday
  14. Web IR -- -- (Apr 20, 22) -- Readings: Ch. 19, 20, 21 in Manning; Google Description

    Mini-TREC results of runs due Thursday!

  15. Web Search, Grid IR & NLP -- (Apr 27, 29) -- Readings: Ch. 13, 14 in Manning;

    Mini-TREC Results and system rankings returned Tuesday. See the new RESULTS directory of your group directories for results data

  16. NLP & Cross-language IR -- (May 4, 6) -- Readings: Ch. 15 in Manning;

  17. MiniTREC Group reports & Wrapup -- (May 11) -- Readings: Ch. ? Mini-TREC: Group reports (Thursday)