Readings for IS202, Part II
Fall 1997
Prof. Hearst and Prof. Larson
This reader provides supplemental material for the last seven weeks of
IS202 (information search and retrieval). Most articles are required;
the Schedule page specifies which are required.
Overviews of Information Access
The following is the first chapter of a new textbook that will
probably be used for this course instead of a reader when the book is
completed next year. This chapter serves as an introduction to issues
surrounding search and Information Retrieval, taking a more cognitive
viewpoint.
Richard K. Belew, Finding Out About: Information Retrieval and
other technologies for seeking knowledge, Cambridge University Press,
to appear. Chapter 1.
This is another overview paper, as well as a kinder, gentler
introduction to term weighting and the vector space model than the
Salton chapters in Part I of the reader:
Gerard Salton, Developments in Automatic Text Retrieval, in
Science, 253, pp. 974-980, 1991.
Evaluation
This is the famous Blair and Maron evaluation study.
David C. Blair and M. E. Maron, An Evaluation of Retrieval
Effectiveness for a Full-Text Document Retrieval System, in
Communications of the ACM, 28(3), 1985.
The following is a good introduction to TREC.
Donna Harman, The TREC Conferences, in R. Kuhlen and
M. Rittberger (Eds.), Proceedings of Hypertext, Information
Retrieval, Multimedia 95, pp. 9-28, Konstanz, Germany, 1995.
This chapter discusses some aspects of evaluation that are touched on
in class.
Querying and Ranking
William Cooper, Getting Beyond Boole, Information
Processing and Management, 24, pp. 243-248, 1988.
Donna Harman, et al., Ranking Algorithms, Chapter 14 from
Information Retrieval: Data Structures and Algorithms by Frakes and
Baeza-Yates (Eds.), pp. 363-392, Prentice-Hall, 1992.
William S. Cooper, et al., Probabilistic Retrieval Based on
Staged Logistic Regression, in the Proceedings of ACM/SIGIR,
Denmark, pp. 198-210, 1992.
Search Strategies and Cognitive Issues
Marcia J. Bates, Information Search Tactics, in Journal of
the American Society for Information Science, 30 (4), pp. 205-214,
July 1979.
Marcia J. Bates, The Berry-Picking Search: User Interface
Design, in User Interface Design, Harold Thimbleby (Ed.),
Addison-Wesley, 1990.
Vicki L. O'Day and Robin Jeffries, Orienteering in an Information
Landscape: How Information Seekers Get From Here to There, in
Proceedings of ACM InterCHI '93, pp. 438-445, 1993.
Nicholas J. Belkin et al., Cases, Scripts, and
Information-Seeking Strategies: On the Design of Interactive
Information Retrieval Systems, in Expert Systems with
Applications, 9 (3), pp. 379-395, 1995.
Daniel M. Russell et al., The Cost Structure of Sensemaking, in
the Proceedings of ACM/InterCHI '93, pp. 269-276, April 1993.
IR Systems and Implementations
Donna Harman, et al., Inverted Files, Chapter 3 from
Information Retrieval: Data Structures and Algorithms by Frakes and
Baeza-Yates (Eds.), pp. 28-43, Prentice-Hall, 1992.
Ray R. Larson, et al., Cheshire II: Designing a Next-Generation
Online Catalog, in Journal of the American Society for
Information Science, 47(7), pp. 555-567, 1996.
Web Crawling and Indexing
Michael Mauldin, Lycos: Design choices in an Internet search
service, in IEEE Expert Intelligent Systems, Trends and
Controversies feature, Craig Knoblock (Ed.), 12 (1), January-February
1997.
Erik Selberg and Oren Etzioni, The MetaCrawler architecture for
resource aggregation on the Web, in IEEE Expert Intelligent
Systems, Trends and Controversies feature, Craig Knoblock (Ed.), 12
(1), January-February 1997.
Content Analysis and Machine Learning
Tom M. Mitchell, Machine Learning, McGraw Hill, 1997. Pages 1-19 in
Chapter 1, and pages 180-184 in Chapter 6.
Relevance Feedback
Donna Harman, Relevance Feedback and Other Query Modification
Techniques, Chapter 11 from Information Retrieval: Data
Structures and Algorithms by Frakes and Baeza-Yates (Eds.),
pp. 241-263, Prentice-Hall, 1992.
Jurgen Koenemann and Nicholas J. Belkin, A Case for Interaction:
A Study of Interactive Information Retrieval Behavior and
Effectiveness, in the Proceedings of ACM/CHI, Vancouver, Canada,
pp. 205-212, 1996.
User Interfaces for Information Access
Ronald M. Baecker et al., Design and Evaluation, introduction to
Chapter 2 of Readings in Human-Computer Interaction: Toward the
Year 2000, Second Edition, Morgan Kaufmann Publishers, Inc.,
pp. 73-91, 1995.
David G. Hendry and David J. Harper, An Informal Information
Seeking Environment, in Journal of the American Society for
Information Science, 48 (11), pp. 1036-1048, November 1997.
Marti Hearst, TileBars: Visualization of Term Distribution Information
in Full Text Information Access, in the Proceedings of the ACM SIGCHI
Conference on Human Factors in Computing Systems, pp. 59-66, Denver,
CO, May 1995.
Using Metadata in Search
Elaine Svenonius, Unanswered Questions in the Design of
Controlled Vocabularies, in Journal of the American Society for
Information Science, 37 (5), pp. 331-340, 1986.
Joel L. Fagan, Automatic Phrase Indexing for Document Retrieval:
An Examination of Syntactic and Non-Syntactic Methods, in the
Proceedings of ACM/SIGIR, pp. 91-101, 1987.
Marti Hearst and Chandu Karadi, Cat-a-Cone: An Interactive
Interface for Specifying Searches and Viewing Retrieval Results using
a Large Category Hierarchy, in the Proceedings of the 20th Annual
International ACM/SIGIR Conference, Philadelphia, PA, July 1997.
Hypertext Navigation and Search
Ronald M. Baecker et al., Hypertext and Multimedia, introduction to
Chapter 13 of Readings in Human-Computer Interaction: Toward the
Year 2000, Second Edition, Morgan Kaufmann Publishers, Inc.,
pp. 833-842, 1995.
Dennis E. Egan et al., Behavioral Evaluation and Analysis of a
Hypertext Browser, in the Proceedings of ACM/CHI 89,
pp. 205-210, 1989.
F. R. Campagnoni and Kate Ehrlich, Information Retrieval Using a
Hypertext-Based Help System, in ACM Transactions on Office
Information Systems, 7 (3), pp. 271-291, July 1989.
Jakob Nielsen, The Art of Navigating Through Hypertext, in
Communications of the ACM, 33 (3), pp. 311-322, March 1990.
Collaborative Filtering
Paul Resnick and Hal R. Varian, Recommender Systems,
in Communications of the ACM, 40 (3), pp. 56-58, March 1997.
Joseph A. Konstan et al., GroupLens: Applying Collaborative
Filtering to Usenet News, in Communications of the ACM, 40 (3),
pp. 77-87, March 1997.
Upendra Shardanand and Pattie Maes, Social Information Filtering:
Algorithms for Automating "Word of Mouth", in the Proceedings
of ACM/CHI, pp. 210-217, Denver, CO, May 1995.
Multilingual IR
David Hull and Gregory Grefenstette, Experiments in Multilingual
Information Retrieval, in Proceedings of ACM/SIGIR, Zurich, 1996.