School of Information Management & Systems.   Spring 2002.
245 Organization of Information in Collections.   Michael Buckland.

Description: Objects, Attributes, and Values.

Bibliographic access can be viewed as "fitting descriptions". How well does the searcher's description of what is wanted fit the "system's" descriptions provided in the bibliography, catalog, or other information system? Bibliographic descriptions should anticipate how searchers' questions will be expressed. Searchers need to pose their questions in terms of the descriptions used in the system.

A description is composed of combinations of attributes ("color of eyes") and values ("brown"). Which attributes will be most useful to provide in any given retrieval system? How can attributes and values be expressed in a mutually intelligible and mutually convenient ways? How much description is worthwhile given limited resources? Some features of attributes are:

1. "Derived" description: Parts of what is being described, e.g. title, original abstract, thumbnail copy of an image, text. Objects can represent themselves or be represented by versions of themselves (or images or copies or fragments of them).

2. "Tidiness", e.g. Can fit a fixed standard format (e.g. year of publication); use standardized codes (e.g. language: ENG SPA JAP). Examples: Social Security No.; Book ID# (e.g. International Standard Book Number ISBN; International Standard Serial Number ISSN); Catalog record ID# (e.g. Library of Congress Card Number LCCN. Each information storage system ordinarily has its own ID numbers for each record. Can it be expressed as YES or NO (Is it a biography?).

3. "Messiness", e.g.:
  --   of variable length (e.g. title, person's name)
  --   absent, present once, or present many times (e.g. authorship: Anonymous? Many authors?)
  --   complex structure (e.g. complicated names)
  --   open-ended (e.g. descriptive notes)
  --   ambiguous (e.g. many names and subject descriptions)

4. Variety, e.g. (a) Copy of (part of) the object (e.g. picture of it, text); (b) Description of the object (physical dimensions, where it came from); (c) Description of what the object signifies (what the document is "about"); (d) Description how one object is related to another (e.g. continuation of, cure for, commentary on, contradiction of).

Distinguish between (i) the description of an object, mostly a selective summary of details derived from the object, but can include description derived from other sources; and (ii) the headings, the index or "access points", used to avoid having to search through all of all of the descriptions. For a book on Albania look for the subject heading ALBANIA, then look at the descriptions of the books listed under ALBANIA. Consider alternatives. A standardized list of headings is called a "Thesaurus" or "Authority File".

245 examines conventions and standardized methods for making and handling (a) descriptions (attributes, values) and (b) headings. Different domains have different conventions. Libraries, for example, use the Anglo-American Cataloging Rules 2nd ed (AACR2), a highly stylized, agreed standard for describing documents. AACR2 incorporates the International Standard Book Description (ISBD), a standard for the visual presentation of bibliographic descriptions by imposing special punctuation on the description. MAchine-Readable Cataloging (MARC) denotes a growing family of standards for formatting and and for communication between computers of records used for various purposes in information retrieval.
Revised Jan 28, 2002.