SIMS 247 Fall '00 Project Suggestions

Project Suggestions

In each case, I've listed the general area but not really the task that a user would be trying to accomplish in that area. It is important to identify the target task(s) before designing the visualization. It is also important to think about and describe how the visualization would fit within the context of a user interface that helps users work on their problems as a whole.

Web Logs and Web Use
Database Systems and Content
Incrementally Revealing Database Content
Add Features to XmdvTool
Evaluate Efficacy of a Visualization

Visualizing Web Logs and Web Use

In the WebTango research project, we are developing tools to help web site designers evaluate sites, both to consider alternative designs and assess what is problematic about current designs.

There are two visualization problems in this domain.

One project is to design a visual interface for displaying the results of a metrics analysis tool we are developing to help users improve their site designs. The tool does not yet do a site-level analysis, but we would like it to in the future. To discuss ideas related to this, send mail to Melody Ivory.

Another project is to help assess the usability of the site by examining usage data in the web logs. There are well-known problems associated with doing this, but it should be useful as well. The implementation of this project should use existing data, since it is time-consuming to collect new data. Some is available at the KDD Cup 2000 site, along with a discussion of the analysis of this data using data mining techniques. This problem is tough because there is a huge amount of data, it has complex structure across time and many different users, and it is incomplete.

Some students (Tim Hirzel, Andrew Volpe, and Jeff Enos) made an interesting attempt last year. Their writeup. Figure 1. Figure 2.

For other examples, see:

Using Interactive Visualizations of WWW Log Data to Characterize Access Patterns and Inform Site Design

The QUIP project (not applied to the web, but an interesting related idea).

Gerald L. Lohse (University of Pennsylvania) and Peter Spiller (McKinsey & Company). Quantifying the Effect of User Interface Design Features on Cyberstore Traffic and Sales. Proceedings of ACM CHI 98 Conference on Human Factors in Computing Systems, 1998. (This is a nice empirical evaluation of 20 online (web-based) "superstores" comparing how various aspects of their web site design and information layouts correlate with sales and traffic. Regression analysis was used to assess the impact of variables.)

A different but related problem is visualizing an individual's web usage history in the hopes of helping them navigate:

Graphical Multiscale Web Histories: A Study of PadPrints

Michael D. Byrne, Bonnie E. John, Neil S. Wehrle, and David C. Crow. The Tangled Web We Wove: A Taxonomy of WWW Use. Proceedings of ACM CHI 99 Conference on Human Factors in Computing Systems, Volume 1, 544-551, 1999.

Tauscher, L. and Greenberg, S. (1997). How People Revisit Web Pages: Empirical Findings and Implications for the Design of History Systems. International Journal of Human Computer Studies, Special issue on World Wide Web Usability, 47(1), pp. 97-138. Academic Press.

Visualizing Database Systems and Content

Anna Wichansky

Business Intelligence with multidimensional data
Object tree hierarchies
Database system topologies

We have the problem descriptions, sample data, and her talk.

Incrementally Revealing Database Content

CONTROL Project

Prof. Joe Hellerstein

an easy-to-read overview.

The goal of this class project would be to think of new kinds of visualizations that work well within this paradigm for particular types of datasets. For example, Hellerstein et al. have shown university enrollment data progressively, plotting a running average with error bars that shrink as more data becomes available. They have also shown cities on a map as "clouds" of points, starting with the densest parts of the population rather than proceeding in alphabetical order by state, so that the most important information appears first. How else can visualization of information from large datasets be done differently given the underlying mechanisms supplied by the system?
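To make the "average with shrinking error bars" idea concrete, here is a minimal sketch in Python. It is not the CONTROL implementation: the function name, the reporting interval, the normal-approximation confidence interval, and the synthetic data are all assumptions made for illustration. It also assumes the rows arrive in random order, since otherwise the early estimates would not be representative.

```python
import math
import random


def running_estimates(values, report_every=1000, z=1.96):
    """Yield (rows_seen, mean, half_width) as more rows are processed.

    half_width is the half-length of an approximate 95% confidence
    interval (normal approximation), i.e. the error bar to draw.
    """
    n = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations (Welford's method)
    for x in values:
        n += 1
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)
        if n > 1 and n % report_every == 0:
            variance = m2 / (n - 1)
            yield n, mean, z * math.sqrt(variance / n)


# Synthetic stand-in for a large table scanned in random order.
rng = random.Random(0)
rows = [rng.gauss(100.0, 15.0) for _ in range(5000)]

estimates = list(running_estimates(rows))
for n, mean, half_width in estimates:
    print(f"after {n:5d} rows: {mean:7.2f} +/- {half_width:.2f}")
```

Each yielded triple is one frame of a progressive display: a visualization could redraw the bar and its error whisker every time a new estimate arrives, and let the user stop the query once the interval is narrow enough for the task.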

Add Features to XmdvTool

Matt Ward's

XmdvTool

Process-driven brush generation: using analytic tools to suggest useful brush configurations. (Matt says you could probably get a paper out of this one.)
Add an Attribute Explorer capability
Process-driven dimension ordering: he is doing a bit of this for the SGI parallel coordinates module.
Intelligent processing of nominal dimensions: start with mapping unique strings to integers, and perhaps reorder or cluster them depending on aspects of the numeric dimensions. Some work has been done on this, but it would be a great addition to the code.
Add dimensional stacking controls: interactive control over binning and ordering of dimensions.
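As a starting point for the nominal-dimension item above, the following Python sketch maps unique strings to integers and optionally reorders the codes by the mean of a paired numeric dimension. It is independent of the XmdvTool code base: the function name `encode_nominal`, the choice of the mean as the ordering criterion, and the sample data are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def encode_nominal(labels, numeric=None):
    """Map each distinct string to an integer code.

    Without `numeric`, codes follow order of first appearance.
    With `numeric`, distinct labels are ordered by the mean of the
    numeric values paired with them, so that neighboring codes are
    similar with respect to that dimension.
    """
    if numeric is None:
        order = list(dict.fromkeys(labels))  # keeps first-seen order
    else:
        sums = defaultdict(float)
        counts = defaultdict(int)
        for label, value in zip(labels, numeric):
            sums[label] += value
            counts[label] += 1
        order = sorted(sums, key=lambda label: sums[label] / counts[label])
    mapping = {label: code for code, label in enumerate(order)}
    return mapping, [mapping[label] for label in labels]


# Made-up example: a nominal "make" column and a numeric "price" column.
makes = ["ford", "bmw", "ford", "fiat", "bmw"]
prices = [20.0, 45.0, 22.0, 12.0, 50.0]

plain_map, plain_codes = encode_nominal(makes)
ordered_map, ordered_codes = encode_nominal(makes, prices)
print(plain_map, plain_codes)
print(ordered_map, ordered_codes)
```

Ordering by a numeric summary is only one option. Clustering the labels on several numeric dimensions at once would follow the same pattern, with the sort replaced by whatever ordering the clustering produces.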

Evaluate Efficacy of a Visualization

http://otal.umd.edu/SHORE

Project Process

Submit Project Proposal Here

Project groups should have 2-3 people.

Timing:

Oct 19: Write a short description of a project proposal and submit it via the online form. I will review it and give you feedback. If it is ok, then you're done, but I might want an iteration on it. This description should include:
- Name(s) of student(s) involved
- Project goals, including what kinds of tasks the interface containing the visualization is targeted towards.
- Which tools will be used to accomplish the goals (this can change if needed).
- What steps will be required to accomplish goals.
- What kinds of results you anticipate achieving.
- What kinds of results you would like to achieve but which you probably do not have the time or the tools for.
Oct 31: Finalized version of project description, on a topic that I have approved.
Oct 31 and Nov 2: Students discuss project goals in class (not graded).
Nov 30, Dec 5 and Dec 7: Students present final projects in class.
Dec 9: Writeups due.

Grading:

Preliminary project description submitted on time (5%)
Finalized project description submitted on time (5%)
Class presentation of project results (must fit within designated time limits which are TBA) (15%)
Quality of writeup of results (25%)
Quality of actual project (50%)