IS290-rmm Syllabus
Resource and Metadata Management in Museums, Archives, and
Research Collections: Tools and Practices
- (Mon 16. Jan No Class - holiday)
- Wed 18. Jan Lecture 1: Intro to class, concepts, goals.
- Mon 23. Jan Lecture 2 CollectionSpace
- Wed 25. Jan Lecture/Lab 3 Basic metadata tools
- Mon 30. Jan Lecture 4 Intro to Museum, Archives, and Research Collections practices
- Wed 1. Feb Lecture 5 Intro to Museum, Archives, and Research Collections practices
- Mon 6. Feb Lecture 6 Introducing our customer for the practical projects (UC Bot Garden)
- Wed 8. Feb Lecture 7 CollectionSpace model for configuration and extensibility, and deployment tools lab
- Mon 13. Feb Lecture 8 Authorities - shared, local, models, uses, & Metadata mapping in practice: Background and principles
- Wed 15. Feb Lecture 9 Botanical Garden Visit
- Wed 22. Feb Lecture 10: Review of legacy system metadata models and authorities, for the chosen projects
- Mon 27. Feb Lecture 11: Discussion of proposed mappings, gaps, UX issues
- Wed 29. Feb (Cancelled due to illness) BPA Session 1 (Metadata mapping in practice)
- Mon 5. Mar Lecture 12: BPA Session 1 (Metadata mapping in practice)
- Wed 7. Mar Lecture 13: Review of BPA session, & ETL Intro
- Mon 12. Mar Lecture 14: Extract, Transform, and Load (ETL) tools intro
- Wed 14. Mar Lecture 15: Diving into data: Managing scope and complexity
- Mon 19. Mar Lecture 16: UX Workshop 1
- Wed 21. Mar Lecture 17: UX Workshop 2
- (Mon 26. Mar, and Wed 28. Mar No Class Spring break)
- Mon 2. Apr Lecture 18: Voucher workflows in context
- Wed 4. Apr Lecture 19: Customer session to review mappings, data issues, authorities, etc.
- Mon 9. Apr Lecture 20: Reporting Intro, examples from customer
- Wed 11. Apr Lecture 21: Reporting lab, and installing a report into CSpace
- Mon 16. Apr Lecture 22: Language and I18N issues + Lab
- Wed 18. Apr Lecture 23: Standards and models for metadata management + Lab
- Mon 23. Apr Lecture 24: Metadata interchange and sharing + Lab
- Wed 25. Apr Lecture 25: Exposing collections + Lab
- Mon 30. Apr Lecture 26: Review week
- Wed 2. May Lecture 27: Review week
- Mon 7. May Exams week
(Mon 16. Jan No Class - holiday)
Wed 18. Jan Lecture 1: Intro to class, concepts, goals.
- My background and reasons for being here
- Introduction to course themes
- Course structure and projects
Readings
I School students will already have read Bob's Chapter 1, but it is worth reviewing for those who have not.
Otherwise, just review this syllabus, etc.
Mon 23. Jan Lecture 2 CollectionSpace
- History and heritage
- Business and cultural context
- Goals and vision (for the community, and for UCB)
Readings
CollectionSpace website - Read about background (About, FAQ).
CollectionSpace wiki. This is a morass of documentation. Do explore these areas (but do not feel you have to absorb everything!):
- Technical Development. Read about the Team structure, and browse the architecture section (and linked pages).
- Design approach - read over the intro, and approaches. Explore further if you are interested.
- Functional Team - Again, focus on the intro, and get a sense of the scope.
Much of the museum jargon may be confusing right now - don't worry about that.
Wed 25. Jan Lecture/Lab 3 Basic metadata tools
- XML, XSD as basis for modern info exchange, management
- JSON in WebUI, jsonlint
- XSD as development tool (JAXB, REST, SOAP overview).
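As a rough illustration of the first thing tools like jsonlint or an XML editor check (well-formedness, not schema validity), here is a minimal Python sketch using only the standard library. The sample records and field names are invented for illustration and are not actual CollectionSpace payloads.

```python
import json
import xml.etree.ElementTree as ET


def json_is_well_formed(text):
    """Return True if text parses as JSON (roughly what jsonlint reports)."""
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


def xml_is_well_formed(text):
    """Return True if text parses as XML. This does NOT validate against an XSD."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# Hypothetical sample records, invented for this sketch.
good_json = '{"objectNumber": "2012.1.1", "title": "Sample"}'
bad_json = '{"objectNumber": "2012.1.1", "title": }'  # missing value
good_xml = "<object><objectNumber>2012.1.1</objectNumber></object>"
bad_xml = "<object><objectNumber>2012.1.1</object>"  # mismatched closing tag

for label, ok in [
    ("good_json", json_is_well_formed(good_json)),
    ("bad_json", json_is_well_formed(bad_json)),
    ("good_xml", xml_is_well_formed(good_xml)),
    ("bad_xml", xml_is_well_formed(bad_xml)),
]:
    print(label, "well-formed" if ok else "NOT well-formed")
```

Validating against an XML Schema is a separate step that the standard library does not cover; that is what the XML editor you install for the lab is for.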
Preparation (do this before class meets):
Choose and install an XML editor capable of handling XML Schemas. See resources below for XML Editor info.
Optional, but highly recommended: Choose and install a browser add-on for handling REST and HTTP data calls. See Poster and Chrome
Readings
How I Explained REST to My Wife
Resources
XML Foundations class materials - Especially if you are not familiar with XML, XSD, Processing XML. Ignore DTD stuff, XQuery, XSLT. XPath is optional.
Web Architecture class materials - If you are unfamiliar with the basics of how a web browser accesses and displays pages. Focus on topics around Web browsers, HTML, CSS, and what JavaScript is. You must have a basic understanding of URL/URI structure and HTTP.
XML Editor info from Bob's 202 class - You will need a basic XML editor. XMLSpy and Oxygen are quite good.
Accessing CollectionSpace Services from Your Programs or Scripts
REST support add-ons for your browser:
JSON utilities:
Assignment 2: XML, REST. Started in class, due next Wednesday
Mon 30. Jan Lecture 4 Intro to Museum, Archives, and Research Collections practices
- Stuff vs. Activity
- Common themes across collections, and outliers
- Internal activities, research activities, outreach and other uses
Readings
Fragments of the World, Chapters 2, 3, and 4 (through pg 62).
Dry Storeroom #1, Chapter 1 (through page 30).
A Legal Primer on Managing Museum Collections. Only the TOC is included, to provide some idea of the scope of concerns.
Wed 1. Feb Lecture 5 Intro to Museum, Archives, and Research Collections practices
Slides
(Lectures 4 and 5 share the same slides)
Readings
Dry Storeroom #1, Chapter 5 (pp 154-171).
Dry Storeroom #1, Multum in Parvo (pp 185-205).
Spectrum diagram and data requirements, from CollectionsLink UK
Resources
Spectrum 3.1. Note that this is long and very dry. I find it tilted to the arts and cultural heritage, and weak on the life sciences and physical sciences. Nevertheless, it is a useful reference.
Mon 6. Feb Lecture 6 Introducing our customer for the practical projects (UC Bot Garden)
- Museum and collections overview and history
- Larger context of migration
- Users, activities
(No slides)
Readings
UC Botanical Garden website. Read the Collections section, try the plant browser,
UCBG (BSCIT) About the Data model
BNHM Collections statistics
Query Help page (provides some idea of how diverse schemas and usage are even within Natural History domain)
Wed 8. Feb Lecture 7 CollectionSpace model for configuration and extensibility, and deployment tools lab
- Architecture and function
- Shared semantics, domain and local extensions
- IT Architecture and community dynamics
- Multi-tenancy and its implications
- Extensions, overlays, and replacements
- Communications and project workflows (using the wiki, IRC, email lists)
Readings
Successful Strategies for a Multi-tenant Architecture
HP Service Oriented Architecture White Paper. Through pg 13.
Real World SOA. pp 1-6 (you can scan the rest as you wish)
Resources
If you want to learn more about SOA, check out the series by Thomas Erl at soabooks.com. They are very good.
Mon 13. Feb Lecture 8 Authorities - shared, local, models, uses, & Metadata mapping in practice: Background and principles
Authorities
- Reference and search issues.
- Management and policy.
- Standards, common resources, ontology.
Resources:
- Getty Research Institute authorities
- CSpace standard Authority support
Metadata Mapping
- Business process analysis and UCD
- UI concepts and workflow, versus data models
- The data model, and the data model
- Active listening and critical analysis
- Mapping needs and desires to the possible
- Identify resources in CSpace wikis on project focus
Resources:
Readings
Dry Storeroom #1, Chapter 2 The Naming of Names (pp 31-72), House of the Muses (pp 304-307).
Wed 15. Feb Lecture 9
Botanical Garden Visit
- Discussion of background and tools, in the context of this museum
- Considering choices for the mapping project
(No slides)
Readings:
Review materials from Monday's discussion:
Assignment 3 discussed, due Wed 22nd Feb.
Mon 20. Feb No Class - holiday
Wed 22. Feb Lecture 10: Review of legacy system metadata models and authorities, for the chosen projects
Review of schemas, data samples, proposed procedures for attention
- Understanding the project-specific schemas
- Relationships to other resources in the model
- Managing scope for the project
Readings:
Resources:
Assignment 4 discussed, due Mon 27th Feb.
Mon 27. Feb Lecture 11: Discussion of proposed mappings, gaps, UX issues
- Review and discussion with Informatics staff
- Introduction of sample data sets for migration
Resources:
Please also note the UCBG site on the CSpace wiki, where we will begin to gather our work.
Wed 29. Feb (Cancelled due to illness) BPA Session 1 (Metadata mapping in practice)
Mon 5. Mar Lecture 12: BPA Session 1 (Metadata mapping in practice)
Interview session with collection experts (probably the UCBG collections manager and IST Informatics staff). Students will conduct a BPA session with the domain experts, and must identify the current usage, requirements, and desires for the new system. Each team will concentrate on a single procedure (activity record) or authority.
We will break into groups for interviews and for review of the proposed mappings.
Assignment 5 discussed at end of class, due Wed 7. Mar
Wed 7. Mar Lecture 13: Review of BPA session, & ETL Intro
BPA review
- Presentations of Observations
- Open Questions
- Follow-up strategies
- Embracing ambiguity
ETL Intro
- Data warehousing activity
- Metadata migration as a discipline
- Principles and techniques of mapping
Readings:
Mon 12. Mar Lecture 14: Extract, Transform, and Load (ETL) tools intro
- Intro to our tool of choice - Talend Open Studio
- Resources and tutorials
- Adding XML or XSD models
- Simple example map, mapping columns from CSV into XML schema
- String manipulation
- Advanced ETL (discussion only)
- Configuring a CSpace DB as a data source
- Merging and mapping sources
- Generating OAI-PMH from CSpace
- Limitations of the ETL tools, and alternate approaches
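To make the "simple example map" item above concrete outside of any particular ETL tool, here is a minimal Python sketch of mapping CSV columns into XML elements. This is not how Talend Open Studio does it internally; the column names, element names, and sample rows are all hypothetical and do not come from the UCBG data or a CollectionSpace schema.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical mapping from legacy CSV column names to target XML element names.
COLUMN_TO_ELEMENT = {
    "ACC_NO": "accessionNumber",
    "TAXON": "taxonName",
    "COLLECTOR": "collector",
}


def csv_to_xml(csv_text):
    """Map each CSV row to a <record> element, trimming whitespace in values."""
    root = ET.Element("records")
    for row in csv.DictReader(io.StringIO(csv_text)):
        record = ET.SubElement(root, "record")
        for column, element_name in COLUMN_TO_ELEMENT.items():
            value = (row.get(column) or "").strip()
            if value:  # skip empty cells rather than emit empty elements
                ET.SubElement(record, element_name).text = value
    return root


# Invented sample data: note the stray spaces and the empty collector cell.
sample = (
    "ACC_NO,TAXON,COLLECTOR\n"
    "90.0001, Quercus agrifolia ,J. Smith\n"
    "90.0002,Sequoia sempervirens,\n"
)

xml_text = ET.tostring(csv_to_xml(sample), encoding="unicode")
print(xml_text)
```

Even this toy version has to make decisions (trim whitespace? drop empty fields?) that come up again in the "String manipulation" and "Limitations" items.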
Preparation (must be done before class, as we will work with it in class):
1. Install TOS (free). Do not go to the Talend download page. Instead go to the Softpedia pages for Talend TOS.
  - Choose the version for your computer OS. Choose the "stable" version. Note that this is a pig, and can take a while (their servers were slow for me). The installation instructions (really simple) are embedded in section 2.2.1 of the User Manual (see #3 below).
  - Mac users: launch the only Mac app among the files extracted from the zip file.
  - You can experiment with starting it up, and getting started, but we'll cover it in the Lab on Monday.
2. View the "Data Integration" demo at http://www.talend.com/demo/. Some of the steps are done quickly and may not be clear - don't worry about this as much as the overall feel of the application, the way that they wire together components, etc.
3. In the Talend User Guide:
  - Read Appendix A. This describes the UI of the tool.
  - Read chapters 1, and 2.1 through 2.4. Sections 2.5 and 2.6 are about configuration, and you can skip them for now.
  - Chapter 3 is about Business modeling, and you can also ignore that.
  - Read Chapter 4, sections 4.1, 4.2, and 4.3 up through 4.3.1.1. You can skim the other sections if you are curious.
  - Chapter 6 will be very useful to have read, but may be slightly overwhelming before we do the lab. You might want to skim it once to see the concepts, and then review it once we have played with the tool.
  - Read Chapter 7, sections 7.1, 7.5, and 7.8.
  - Appendix B has several tutorials. Read them if you're feeling ambitious ;-)
Assignment 6 begun in class, due Mon 19. Mar.
Note changed scope!
Wed 14. Mar Lecture 15: Diving into data: Managing scope and complexity
- Deeper look at sample data and the schemas
- The implications of dirty, un-normalized data
- Noise detection and noise reduction
- Normalization as the goal - and the enemy
- Deduplication
- Ex post facto strategies (procrastination as a strategy)
- Michael and John for examples?
- Examples in the UCBG data
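One way to picture the normalization and deduplication items above is a small Python sketch that groups name variants under a normalized comparison key. The normalization rules and the sample names are assumptions made up for this illustration; real collector and taxon names need far more careful rules, and over-eager normalization can merge records that should stay distinct (normalization as the enemy).

```python
import re
from collections import defaultdict


def normalize_name(raw):
    """Build a comparison key: lowercase, drop periods, collapse whitespace."""
    key = raw.lower().replace(".", " ")
    return re.sub(r"\s+", " ", key).strip()


def group_duplicates(names):
    """Group the original spellings under their normalized key."""
    groups = defaultdict(list)
    for name in names:
        groups[normalize_name(name)].append(name)
    return dict(groups)


# Invented sample values showing typical "noise" in hand-entered data.
raw_names = ["J. Smith", "j smith", "J.  Smith ", "Jane Smith"]
groups = group_duplicates(raw_names)

for key, variants in groups.items():
    print(key, "<-", variants)
```

Note that the sketch cannot tell whether "J. Smith" and "Jane Smith" are the same person; that kind of judgment is where the collection experts come in.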
Resources:
Mon 19. Mar Lecture 16: UX Workshop 1
- How the Voucher UI fits into the CSpace context
- Overview of how UI is supported and built by the framework
- Brainstorming on the Voucher UI (collaborative, led by Leslie)
First of a two-day workshop to create wireframes, describe how the wireframes are bound to a data model (conceptually), and document this for review
Readings:
Assignment 6 DUE. Note changed scope!
Wed 21. Mar Lecture 17: UX Workshop 2
- Wind up UX brainstorming
- How to formalize UX proposals for community review.
- Goals
- Structure
- Traditions in CollectionSpace (and what we may ignore)
- Capturing requirements
- Preparing hi-fi wireframes, sharing on wiki
- Using workflow diagrams
- User stories
(Mon 26. Mar, and Wed 28. Mar No Class Spring break)
Mon 2. Apr Lecture 18: Voucher workflows in context
- Considering the context of the voucher work
- What related workflows intersect with vouchers?
- How do we think about and ensure support for this context?
- How do we relate and document the old and new workflows?
- Considering context as we migrate data from the old system to the new one
- Linking voucher imports to authorities (Person, Organization, Location)
- What's our story on Voucher labels?
- Proposal for review: prepare the voucher label with the same schema we use for Vouchers.
- Proposal for review: generate the labels with a report.
- Preparing a story for customer review
Wed 4. Apr Lecture 19: Customer session to review mappings, data issues, authorities, etc.
- Present wireframes, data mappings, behavior aspects
- Identify issues and limitations
- Negotiate scope
Mon 9. Apr Lecture 20: Reporting Intro, examples from customer
- Enterprise Reporting, whys and wherefores, issues
- Reporting use-cases in Museums and Archives
- UCBG example reports
Reading:
Additional Resources:
Preparation (do this before class meets):
Install the iReport authoring tool. See http://sourceforge.net/projects/ireport/ for downloads.
Assignment 9 discussed - be prepared to demonstrate progress.
Wed 11. Apr Lecture 21: Reporting lab, and installing a report into CSpace
- BIRT, Jasper, Commercial tools
- Configuring Jasper for CollectionSpace
- Authoring a basic report
- Handling parameters in reports, passed and default values
- User model vs. Services model vs. DB model
- Denormalizing tables to produce a report
- String manipulation (refName as example, and gathering or truncating fields for title)
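As a sketch of the kind of string manipulation mentioned in the last item, the Python below pulls a display name out of a refName-style string and truncates a long value for use as a title. It assumes the refName ends with the display name in single quotes; the sample value is invented, and in an actual report this logic would be written in the report's own expression or SQL syntax rather than Python.

```python
import re

# Assumption: the display name is the single-quoted text at the end of the string.
DISPLAY_NAME = re.compile(r"'(.*)'$")


def display_name(ref_name):
    """Return the quoted display name, or the input unchanged if none is found."""
    match = DISPLAY_NAME.search(ref_name)
    return match.group(1) if match else ref_name


def truncate_title(text, limit=30):
    """Shorten text to at most `limit` characters, marking the cut with '...'."""
    if len(text) <= limit:
        return text
    return text[: limit - 3].rstrip() + "..."


# Invented sample value for illustration only.
sample_ref = (
    "urn:cspace:example.org:personauthorities:name(person)"
    ":item:name(JaneDoe1)'Jane Doe'"
)

print(display_name(sample_ref))
print(truncate_title("A very long descriptive title for a report row", limit=20))
```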
Reading:
- iReport Ultimate Guide. Read:
- Chapter 5, intro and sections 5.1 and 5.3
- Chapter 6, up through section 6.1.2
- Scan Ch 8, ignoring the technical bits
- Intro to Ch 9 and 9.1
- Ch. 11 through 11.3
Mon 16. Apr Lecture 22: Language and I18N issues + Lab
We will spend half the class in lecture, and half in Lab (as needed) to continue with the ETL and UI work for vouchering, and consider some of the L10N issues for the vouchering UI.
- Marking language
- Considering translations
- UI versus data language
- Cross-lingual and multi-lingual search, reporting, access
- Localization in CollectionSpace
- Changing labels and titles in UI
- Changing themes and styling
- Changing layout and organization
Wed 18. Apr Lecture 23: Standards and models for metadata management (Spectrum, CDLA, OAI, CIDOC-CRM, & friends) + Lab
We will spend half the class in lecture, and half in Lab (as needed) to continue with the ETL and UI work for vouchering.
- Purpose and goals, reality and usage
- Examples and specifics
Mon 23. Apr Lecture 24: Metadata interchange and sharing (OAI-PMH, MARC/MODS, DiGiR, Dublin Core, Darwin Core, etc.) + Lab
We will spend half the class in lecture, and half in Lab (as needed) to continue with the ETL and UI work for vouchering.
- Modeling versus harvesting/sharing
- Museums vs. Archives (Mary Elings guest?)
- Domain specific standards
- UCBG examples
Readings
To come...
Wed 25. Apr Lecture 25: Exposing collections + Lab
We will spend half the class in lecture, and half in Lab (as needed) to continue with the ETL and UI work for vouchering.
- Data access for research
- Public access for service
- Portals for discovery
- Varying points of view, and implications for data models, authorities, UI
- Examples and approaches:
- Community curation and feedback
Readings
Dry Storeroom #1 (pp 174-177).
Mon 30. Apr Lecture 26: Review week
Wed 2. May Lecture 27: Review week
Mon 7. May Exams week