Supplemental Readings for SIMS202
Fall 1999
Prof. Hearst and Prof. Larson

This reader provides supplemental material for SIMS202 (Information Organization and Retrieval). The course textbook is Modern Information Retrieval, Baeza-Yates and Ribeiro-Neto (Eds.), Addison Wesley, 1999. (See also http://www.sims.berkeley.edu/~hearst/irbook.)


Information Overload

Jorge Luis Borges, The Library of Babel, from Labyrinths: Selected Stories & Other Writings, New Directions, 1962.

Daniel C. Dennett, Darwin's Dangerous Idea, Simon & Schuster, 1995. Excerpt from Chapter 2, The Library of Mendel. pp. 107-111.

Classification and Categorization

A general introduction to metadata.

Jennifer E. Rowley, Organizing Knowledge, Second Edition. Gower Publishing, 1996. (Chapters 1-3.)

A general introduction to information architecture.

Louis Rosenfeld and Peter Morville, Information Architecture for the World Wide Web, O'Reilly Publishing, 1998. (Chapter 2.)

An introduction to XML and DTDs.

Natanya Pitts-Moultis and Cheryl Kirk, XML Black Book, The Coriolis Group, 1999. (Chapter 5.)

This provides supplemental reading on cognitive aspects of categorization.

H. Clark and E. Clark, Psychology and Language: An Introduction to Psycholinguistics. Harcourt, Brace, Jovanovich Publishers, 1977. Excerpts: pages 462-468, 523-530, 552-554.

This article contrasts faceted and hierarchical classifications, and subject headings vs. category codes. This and the two following papers also address the use of controlled vocabulary in search.

Marcia Bates, How to Use Controlled Vocabularies more Effectively in Online Searching, Online, November 1988, 45-56.

Elaine Svenonius, Unanswered Questions in the Design of Controlled Vocabularies, in Journal of the American Society for Information Science, 37 (5), pp. 331-340, 1986.

Christine L. Borgman. Why are Online Catalogs Still Hard to Use?, Journal of the American Society for Information Science 47(7):493-503, 1996.

This is an introduction to WordNet, a lexical thesaurus.

Christiane Fellbaum (Ed.), WordNet: An Electronic Lexical Database, MIT Press, 1998. (Introduction and Chapter 1.)

Information Design

The following readings describe information design methodology applied to four tasks: web site design, product design, database design, and thesaurus design.

Darrell Sano, Designing large-scale web sites: a visual design methodology, John Wiley, 1996. (Chapter 3.)

Hauser, J. R., Clausing, D. The House of Quality. Harvard Business Review, 66 (May-June), 63-73, 1988.

Toby J. Teorey, Database Modeling and Design, Third Edition. Morgan Kaufmann Publishers, Inc. 1999. (Chapters 1, 2, 3.0-3.3, 4, 5.0-5.2)

Dagobert Soergel, Indexing Languages and Thesauri: Construction and Maintenance, Melville Publishing Company, 1974. (Chapter F.)

Information Retrieval Evaluation

Richard K. Belew and John Hatton, RAVE Reviews: Acquiring relevance assessments from multiple users, in Hearst, M. and Hirsh, H. (Eds.), Working notes of the AAAI Spring Symposium on Machine Learning in Information Access, March 1996, AAAI Press.

David C. Blair and M. E. Maron, An Evaluation of Retrieval Effectiveness for a Full-Text Document Retrieval System, in Communications of the ACM, 28(3), 1985.

Querying, Ranking, and Lexical Analysis

Kenneth W. Church and Patrick Hanks, Word Association Norms, Mutual Information, and Lexicography, Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, 1989, pages 76-83.

William Cooper, Getting Beyond Boole, Information Processing and Management, 24, 243-248, 1988.

Ray R. Larson, et al., Cheshire II: Designing a Next-Generation Online Catalog, in Journal of the American Society for Information Science, 47(7), pp. 555-567, 1996.

Search Strategies and Cognitive Issues

Marcia J. Bates, Information Search Tactics, in Journal of the American Society for Information Science, 30 (4), pp. 205-214, July 1979.

Marcia J. Bates, The design of browsing and berrypicking techniques for the on-line search interface, Online Review, 13 (5), 407-431, 1989.

Vicki L. O'Day and Robin Jeffries, Orienteering in an Information Landscape: How Information Seekers Get From Here to There, in Proceedings of ACM InterCHI '93, pp. 438-445, 1993.

Daniel M. Russell et al., The Cost Structure of Sensemaking, in the Proceedings of ACM/InterCHI '93, pp. 269-276, April 1993.

Relevance Feedback and Collaborative Filtering

Jurgen Koenemann and Nicholas J. Belkin, A Case for Interaction: A Study of Interactive Information Retrieval Behavior and Effectiveness, in the Proceedings of ACM/CHI, Vancouver, Canada, pp. 205-212, 1996.

Joseph A. Konstan et al., GroupLens: Applying Collaborative Filtering to Usenet News, in Communications of the ACM, 40 (3) pp. 77-87, March 1997.

Upendra Shardanand and Pattie Maes, Social Information Filtering: Algorithms for Automating "Word of Mouth", in the Proceedings of ACM/CHI, pp. 210-217, Denver, CO, May 1995.