School of Information Management & Systems   Spring 2000.   Michael Buckland.
245 Organization of Information in Collections

Description: Objects, Attributes, and Values
Revised Jan 25, 2000.

Bibliographic access can be viewed as "fitting descriptions". How well does the searcher's description of what is wanted fit the "system's" descriptions provided in the bibliography, catalog, other information system? Bibliographic descriptions should anticipate how searchers' questions will be expressed. Searchers need to pose their questions in terms of the descriptions used in the system.

A description is composed of combinations of attributes ("color of eyes") and values ("brown"). Which attributes will be most useful to provide in any given retrieval system? How can attributes and values be expressed in a mutually intelligible and mutually convenient ways? How much description is worthwhile given limited resources? Some features of attributes are:

1. "Brute facts": Parts of what is being described, e.g. title, original abstract, text. Objects can represent themselves or be represented by themselves (or images or copies or fragments of them).

2. "Tidiness", e.g. Can fit a fixed standard format (e.g. year of publication); use standardized codes (e.g. language: ENG SPA JAP). Examplesi: Book ID# (e.g. International Standard Book Number ISBN; International Standard Serial Number ISSN); Catalog record ID# (e.g. Library of Congress Card Number LCCN. Each system (MELVYL, OCLC, RLIN) ordinarily has its own ID numbers for each record. Can it be expressed as YES or NO (Is it a biography?).

3. "Messiness", e.g.:
  --   of variable length (e.g. title, author's name)
  --   absent, present once, or present many times (e.g. authorship: Anonymous? Many authors?)
  --   complex structure (e.g. complicated names)
  --   open-ended (e.g. cataloger's notes)
  --   ambiguous (e.g. many names and subject descriptions)

4. Variety, e.g. (a) Copy of (part of) the object (e.g. picture of it, text); (b) Description of the object (physical dimensions, provenance); (c) Description of what the object signifies (what the text is "about"); (d) Description how one object is related to another (e.g. continuation of, commentary on, contradiction of).

Distinguish between (i) the description of an object, mostly a selective summary of details derived from the object, but can include description derived from other sources; and (ii) the headings , the index or "access points", used to avoid having to search through all of all of the descriptions. For a book on Albania look for the subject heading ALBANIA, then look at the descriptions of the books listed under ALBANIA. Consider alternatives. A standardized list of headings is called a "Thesaurus" or "Authority File".

SIMS 245 examines conventions and standardized methods for making and handling (a) descriptions (attributes, values) and (b) headings. Different domains have different conventions. Libraries, for example, use the Anglo-American Cataloging Rules 2nd ed (AACR2), a highly stylized, agreed standard for describing documents. AACR2 incorporates the International Standard Book Description (ISBD), a standard for the visual presentation of bibliographic descriptions by imposing special punctuation on the description. MAchine-Readable Cataloging (MARC) denotes a growing family of standards for formatting and and for communication between computers of records used for various purposes in information retrieval.