UC Berkeley School of Information

IS 240: Information Retrieval

Outline and Schedule

This is a preliminary outline for the course. It is expected to change during the semester.

(Week. Content -- dates -- readings)

  1. Introduction to Course & IR history -- (Jan 16,18) -- Readings: Ch. 1 in Grossman & Frieder, Papers by Joyce and Needham; Luhn; Doyle in Readings

  2. Key Concepts in IR -- (Jan 23, 25) -- Readings: Ch. 1 & Maron and Kuhns; Cleverdon; Salton and Lesk; Hutchins; Saracevic in Readings from Lecture 3

    • Slides from Lecture 3 download
    • Slides from Lecture 4 download
    • Unix for Poets by Ken Church PDF
  3. IR Models: Boolean and Extensions -- (Jan 30, Feb 1) -- Readings: Ch. 2.5,2.9 - McCune, Tong & Dean "RUBRIC..."

  4. IR Models: Vector Space & Clustering -- (Feb 6, 8) -- Readings: Ch. 2.1 - Salton, Wong & Yang "A Vector Space Model..."; Salton & Buckley "Term-Weighting Approaches..."; Salton & McGill "SMART and SIRE..."; Handouts: Salton (Science) PDF; Singhal, Buckley & Mitra (Pivoted) PDF; Raghavan & Wong

  5. IR Models: Probabilistic Models and Inference Nets -- (Feb 13, 15) -- Readings: Ch. 2.2, 2.4 - Robertson "The Probability Ranking Principle in IR"; Belkin, et al. "ASK for IR"; Croft & Harper "Using Probabilistic..."; Robertson & Walker "Some Simple Effective Approximations..."; Turtle & Croft "Inference Networks"; Handout - Cooper, Gey, Dabney "Staged Logistic Regression" PDF

    Mini-TREC database and previous queries made available (Feb 15)

  6. Language Models and Intro. to Evaluation -- (Feb 20, 22) -- Readings: Ch. 2.3, 3.5 & Strzalkowski "Robust Text Processing..."; Handouts: Ponte & Croft; Lavrenko & Croft; Hiemstra, Robertson & Zaragoza from Lecture 12

    *Note: Class on Feb. 20 attended lecture by Paul Dourish -- no slides

    • Slides from Lecture 11 download

  7. IR Evaluation and Measures -- (Feb 27, Mar 1) -- Readings: Ch. 4 in Readings in IR
    Mini-TREC group reports on system setup

    No Class Thursday, March 1

  8. Evaluation Continued & Filtering and Routing -- (Mar 6, 8) -- Readings: Ch. 5 & Salton and Buckley "...Relevance Feedback..."; Griffiths, Luckhurst and Willett "Using Interdocument Similarity..."

    Mini-TREC New queries for testing will be made available.

  9. IR Components -- (Mar 13, 15) -- Readings: Ch. 3

    Note: On March 15th class will be held in room 205

  10. LSI & Filtering and Routing -- (Mar 20, 22) -- Readings: Ch. 2.6

    Note: Class may attend the lecture in 202 on Tuesday.

  11. Spring Break: No Class -- (Mar 27, 29) -- Readings: Ch. ?

  12. Digital Libraries & Geographic IR -- (Apr 3, 5) -- Readings: Ch. 8

  13. GIR & IR Components Continued -- (Apr 10, 12) -- Readings: Ch. 5, Ch. 6

  14. Web IR -- (Apr 17, 19) -- Readings: Ch. 8, Ch. 3.8 Google Description

    Mini-TREC results of runs due Thursday!

  15. Web Search, Grid IR & NLP -- (Apr 24, 26) -- Readings: Ch. 4

    Mini-TREC results and system rankings returned Tuesday. See the new RESULTS directory in your group directories for the results data.

  16. NLP & Cross-language IR -- (May 1, 3) -- Readings: Ch. 8

  17. Mini-TREC Group reports & Wrapup -- (May 8) -- Readings: Ch. ?

    Mini-TREC: Group reports (Thursday)