School of Information Management & Systems
 Previously School of Library & Information Studies

  296a-1 Seminar: Information Access.
 ("The Friday Afternoon Seminar")
 Summaries - Spring 2003.


Fridays 3-5. 107 South Hall. Schedule.
Summaries will be added below as they become available.

Jan 24: Movie: Paul Otlet, The Man who Wanted to Classify the World.

    West Coast Premiere! Including a few seconds filmed in South Hall!!
    The Man Who Wanted to Classify the World: From the Index Card to the World City: The Visionary Life of a Belgian Utopian, Paul Otlet (1868-1944), a documentary made for public television by Francoise Levie, a professional documentary film maker in Brussels.
    Paul Otlet, an idealist young Belgian lawyer, set out in September 1895 to solve the world's information problems, both in theory and in practice. For forty years he tirelessly explored the central issues of Information Management: hypertext, faceted classification, user-friendly workstations, document formating, technical standards, etc. Already in 1925 he realised that cathode ray tubes and TV technology would allow people to read remote texts on a screen in their own homes and and tried to do a cost analysis.     His "Treatise on Documentation" (1934), although obviously obsolete with respect to technology, is arguably still the completest text. But like almost all the pioneers in our field from the period before 1945, when Europe was ahead of North America, Otlet (and others who did not write in English) were almost completely forgotten until a revival of interest in the history of information management in the 1990s.
    Francoise Levie, a documentary movie maker, heard about Otlet and set out to discover who he was. This 65 minute documentary shows how she did that and what she found.     For more on Paul Otlet, see http://www.sims.berkeley.edu/~buckland/otlet.html.

Jan 31: No seminar meeting.

Feb 7: Nicolas DUCHENEAUT: The Sociotechnical Construction of Open Source Software.
    Open Source Software (OSS) development is often characterized as a fundamentally new way to develop software. In many OSS projects developers work in geographically distributed locations, rarely or never meet face-to-face, and coordinate their activities over the Internet. In efforts to understand OSS development, social scientists have examined the socioeconomic, political and institutional aspects of this movement. However, few studies have paid close attention to the materiality of OSS projects. Indeed email, code and databases constitute the material means through which OSS projects are coordinated and also the material productions of OSS development efforts. It is crucial to take this notion seriously and make material things the focus of OSS studies in order to really understand how these online spaces function.
    I study software development by following the sociotechnical processes that produce, circulate, and transform the email, code and databases of OSS. My approach to studying OSS comprises two mutually informing activities: ethnography and the construction and use of software to visualize and explore the networks of humans and non-humans incorporated in the email, code and databases of OSS. This software's purpose is novel in that it serves both as a theoretical statement and as a tool.
    I hope to illustrate how the place of software in science and technology studies might be extended to support a practice of "technography" -- a way to embody the findings of ethnographic work in a technical artifact, which in turn can be used as the starting point for further ethnographic research or a way of strengthening the insights of ethnographic analysis. During this talk I will describe this approach further and demonstrate the workings and visualizations of such a "technographic" software.

Feb 14: Aitao CHEN, Fredric GEY, Ray LARSON, & Michael BUCKLAND: Searching Across Different Genres: Texts, Numbers, and Places.
    Utopian visions of information service include providing access to all documents everywhere, all genres, in all formats, in all languages, and on every topic. This is hard to do! We will present recently completed work on "Seamless Searching of Numeric and Textual Resources"; work recently started on searching by place; and, if time permits, additional work we now wish to undertake on searching across multilingual, multigenre collections; and some reasons for making a substantial change in strategies for search support. For background see
metadata.sims.berkeley.edu/papers/SeamlessSearchFinalReport.pdf and
http://www.sims.berkeley.edu/~buckland/catplace.pdf.

Feb 21: Emily LIGGETT & Clifford LYNCH
    Emily LIGGETT: Interfaces Designed for the Visualization of Complex Relationships.

    As more and more information gets created, it becomes more difficult for people to manage, gain insight into, and make conclusions about information sets using only the data-mining tools with which they are presented. In order to facilitate the tasks of working with large data sets, researchers have put considerable efforts into designing visualization tools to display the information in a manner where people can easily manage large data sets. These interfaces aim to display large networks of information in such a way that people can conceptualize relationships between information which cannot readily be obtained through previously existing information organization methods.
    I will introduce a few tools that have been designed in order to display large relationship sets. I will discuss the motivations for these tools and analyse the good and bad aspects of each tool.
    Clifford LYNCH: Key Research and Implementation Issues in Metadata.
    I'll examine a number of key research and implementation problems focused primarily on descriptive metadata in the networked environment.

Feb 28: Michael BUCKLAND and others: Going Places in the Catalog: Improved Geographical Access.
    Online library catalogs are not very effective in supporting searches by place and allowing searches to be limited by place. How could searching by place be improved? One approach would be to make better use of the several place-related cues in library catalog records that are currently not used. Another approach would be to use a gazetteer as a "place name authority file." Coupling a catalog with a gazetteer has additional interesting possibilities because the latitude and longitude data supplied in gazetteer entries would allow map visualizations displaying retrieval results and a way of specifying geographical aspects for a search.
    The federal Institute of Museum and Library Services recently awarded a National Library Leadership Grant to the Electronic Cultural Atlas Initiative to explore these possibilities, Fredric Gey, Ray Larson, and myself as Co-P.I.s. See http://www.sims.berkeley.edu/~buckland/catplace.pdf The project is at a very early stage and the seminar will be a very informal, preliminary discussion of this work.

Mar 7: Richard BEAUBIEN, Ralph MOON, & Lynne GRIGSBY-STANDFILL, University Library Systems Office: Publishing, Organizing, and Managing of Digital Content.
    An update on the Library's local and inter-institutional projects that involve the publishing, organizing, and managing of digital content, including a discussion of the projects themselves, of the standards work that supports them, and of the changing IT infrastructure that provides their physical context.

Mar 14: Clifford LYNCH: Two Topics: e-Gutenberg-e Report; and Collaborative Filtering / Personalization.     Two topics:
1. A report from the Gutenberg-e meeting this week. This will deal with topics in authoring and publishing in the new digital medium; and
2. A request for help and input on the research agenda for collaborative filtering and personalization for the NSF-DELOS Working Group that I am participating in.

Mar 21: Alessandro ACQUISTI: Privacy, Anonymity, and Tracking in Computer-mediated Economic Transactions
    The reduction of the cost of storing and manipulating information has led organizations to capture and use increasing amounts of information about individual behavior. New trade-offs have emerged for parties involved with privacy-enhancing or privacy-intrusive technologies: individuals want to avoid the misuse of the information they pass along to others, but they also want to share enough information to achieve satisfactory interaction; organizations want to know more about the parties with which they interact, but they do not want to alienate them with intrusive policies. Is there a "sweet spot" that satisfies the interests of all parties? Is there a combination of economic incentives and technological solutions that is acceptable for the individual and beneficial to society?
    In my dissertation, I study some of the technological and economic issues that I find most interesting in this debate. I develop cryptographic tools to protect the privacy of individuals who participate to electronic transactions, and I study the economic incentives to deploy and use anonymizing and tracking technologies. In my talk I will introduce some of the results from my work. I will then focus on technical and economic aspects of the use of cookie-like technologies for price discrimination. Finally, I will extend the economic and technical analysis by considering its policy implications and by describing how economic incentives and specific technologies can be combined to address privacy and information security concerns in a networked society.

Mar 28: Spring Recess: No Seminar Meeting

Apr 4: Jerome McDONOUGH, New York University: METS and METS Profiling: A Not So Gentle Introduction.
    METS provides an XML format for encoding descriptive, administrative and structural metadata for digital library objects. As it also provides facilities for encapsulating object content, it is well-suited to serve as any of the various forms of Information Package specified in the OAIS Reference Model. I'll provide a technical introduction to METS, and also discuss METS profiling, an effort to further support interoperability of digital library repositories.
    Jerome McDonough is Digital Library Development Team Leader, Elmer Bobst Library, New York University.

Apr 11: John KUNZE, California Digital Library: What Makes Identifiers Persistent
    Persistent identifiers at first glance. Tangled terminology. Persistence aside, What about complete identification? Persistent identifiers at second glance. So what if some URLs break? Setting user expectations. Who is breaking URLs? Persistent identifiers at third glance. The Archival Resource Key (ARK) Scheme. Persistent identifiers at the California Digital Library (CDL). Technical references. http://www.ietf.org/inte rnet-drafts/draft-kunze-ark-05.txt (38 pages).
http://jodi.ecs.soton.ac.uk/Art icles/v02/i02/Kunze/

Apr 18: Linda Cathryn MUEHLINGHAUS (formerly Everstz), Dominican University of California: The difficulty of practicing what we preach: Teaching about culture in an on-line environment in K-12 teacher preparation.
    A discussion of difficulties encountered in using a hybrid instructional model: face-to-face and online discussion boards. Ethical, communication, and access issues: shifting instructional delivery modality; support and communication (access to usable computers; ease of use). Emotional issues: Tackling these issues without interpersonal support mechanisms; anonymity as a problem: Being polite to the faceless.
    Dr Muehlinghaus is Assistant Professor of Education at Dominican University of California. She completed her doctoral dissertation on The Myth of a Democratic Education and the Politics of Curriculum in the UCBerkeley School of Education in 1998.

Apr 25: Avi RAPPOPORT, Search Engine Consultant: Notes from the CHI Search Interface Workshop and the Infonortics Search Engines Meeting.
    I found consensus on interface issues, agreement on the value of faceted metadata, interest in usability testing, practical applications for NLP, real-world implementations, and discussions of the search industry and unstructured text. It's an interesting time.
    Avi Rappoport is a Consultant at Search Tools Consulting - www.searchtools.com/

May 2: Merrilee PROFFITT, RLG: RedLightGreen: RLG Accelerates Undergraduate Research.
    With generous funding from the Andrew W. Mellon Foundation, RLG began work in early 2001 to explore how the rich information held in the RLG Union Catalog could be made available to a wider audience in a freely available Web environment. During the intervening time, we have learned volumes about what undergraduate users want from online information resources; how data mining software can uncover valuable new information hidden in the RLG Union Catalog; how to provide access to a wealth of complex information through a simple, easy-to-use interface; new opportunities for using bibliographic data to help end users find authoritative sources of research information; what is involved in the complicated process of transforming MARC records to XML; and incorporating concepts outlined in the Functional Requirements for Bibliographic Records (FRBR), an emerging standard for distinguishing between various editions.
    This presentation will feature an explanation of the history, motivations, "lessons learned," and future directions of the project, a demo of the pilot system, and outtakes of use studies.

May 9: Last seminar meeting of the semester: Two presentations.

    Ethan EISMANN: Sustainable Knowledge Communities: Learning from the bIPLog
    Final Progress Report: Our research reveals that knowledge communities supported by weblogs successfully encourage their members to consistently read, analyze, and write about a topic, which leads to deep knowledge. By writing this paper we hope to encourage the production of and interrelations among high quality weblogs of specific topical focus, which we hope will result in the proliferation of robust and sustainable knowledge communities.

    Paul ELL, Queens University Belfast: From Printed Historical Census Data, to Electronic Datasets, to a Paper Atlas, to an Electronic Atlas of the Great Irish Famine.
    This talk will describes the digitisation of one of the largest historical datasets available for Ireland - the Database of Irish Historical Statistics. It will discuss the methodological challenges and solutions in data creation, documentation and preservation.
    The paper will then proceed to discuss one project which extensively used the data in a major work to map the Great Irish Famine. In the past, much of the work by scholars concerned with the Famine employed often complex quantitative or qualitative techniques, and not infrequently references were made to the spatially differing impact of the disaster. Very few maps have actually been created however, and no attempt has been made to chart systematically the regional impact of the Famine. Several themes are discussed in the atlas including demographic patterns, emigration, celibacy, the geography of religion, pauperism, diet, changes in the use of the Irish language, housing, regional patterns of disease, changes in Irish agriculture, and employment.
    Finally the paper will discuss plans to create a historical Geographical Information System for Ireland which will be used not only as a scholarly resource but as a source of information to the lay person interested in their countries past.
Dr Paul S. ELL is founding director of the Centre for Data Digitisation and Analysis at the Queen's University of Belfast. The Centre aims to act as a centre of excellence in data digitisation, and to be a leader and in, and promoter of, spatio-temporal scholarship. He has written a number of books including a Cambridge University Press book analysing the geography of nineteenth century religion in England and Wales; an atlas of the Great Irish Famine; and a study of the use of census data in nineteenth century Britain and Ireland. He is currently working on a co-authored book on the application of historical GIS and the construction of a GIS for Ireland (Forthcoming, Cambridge Univ. Press).

The Seminar will resume on August 29.

Spring 2003 schedule.
Fall 2002 schedule and summaries.