Lecture 10 Heuristic Evaluation & User Centered Design

UI Hall of Fame or Shame?

Outline

Revisit Usability Heuristics
Heuristic Evaluation
User Centered Design

Usability Heuristics

Usability Heuristics (“Guidelines”)

What are they?

Rules that distill principles of effective UIs
Usually, but not always, correct
- Recognizing when to follow/ignore takes practice/experience
Help designers choose design alternatives
Help evaluators find problems in interfaces ("heuristic evaluation")

Plenty to choose from

Learnability/Efficiency/Safety
Nielsen's 10 principles
Norman's rules from Design of Everyday Things
Tognazzini's 16 principles
Shneiderman's 8 golden rules

Same general ideas, organized differently.

To understand the technique, we should start by defining what we mean by a usability heuristic or guideline. Heuristics, or usability guidelines, are rules that distill out the principles of effective user interfaces. There are plenty of sets of guidelines to choose from - sometimes it seems like every usability researcher has their own set of heuristics. Most of these guidelines overlap in important ways, however. The experts don't disagree about what constitutes good UI. They just disagree about how to organize what we know into a small set of operational rules.

Heuristics can be used in two ways: during design, to help you choose among alternative designs; and during heuristic evaluation, to find and justify problems in interfaces.

Principles from This Course

Learnability
Efficiency
Safety

To help relate these heuristics to what you already know, here are the high-level principles that have organized our readings.

Nielsen Heuristics

Match the real world (L)
Consistency & standards (L)
Help & documentation (L)
User control & freedom (S)
Visibility of system status (S)
Flexibility & efficiency (E)
Error prevention (S)
Recognition, not recall (S)
Error reporting, diagnosis, and recovery (S)
Aesthetic & minimalist design

Jakob Nielsen, who invented the technique we're talking about, has 10 heuristics. (An older version of the same heuristics, with different names but similar content, can be found in his Usability Engineering book, one of the recommended books for designers.)

We've talked about all of these in previous design principles readings (the particular reading is marked by a letter, e.g. L for Learnability).

Match System and Real World (Metaphor)

Consistency and Standards

Consistent Word/Excel/Powerpoint toolbars

User Control and Freedom

Visibility of System Status (Feedback)

Efficiency

Error prevention

Recognition over Recall

Error Recovery

Aesthetics/Graphic Design

Contrasting labels, Repeating color, Aligned text, Tags set apart (Proximity)

Padded cells, differentiated header/footer

Norman Principles

Affordances
Natural mapping
Visibility
Feedback

We've also talked about some design guidelines proposed by Don Norman: visibility, affordances, natural mapping, and feedback (all in the Learnability reading).

Shneiderman's 8 Golden Rules

Consistency
Universal Usability
Feedback
Dialog closure
Prevent and repair errors
Reversible actions
Keep user in control
Reduce short-term memory load

Finally we have Shneiderman's 8 Golden Rules of UI design, which include most of the principles we've already discussed.

Consistency

Consistent sequences of actions required in similar situations
Identical terminology in prompts, menus, and help screens
Consistent color, layout, capitalization, fonts, and so on
Exceptions should be comprehensible and limited in number
- required confirmation of the delete command
- no echoing of passwords

Universal Usability

Recognize needs of diverse users
Design for plasticity, facilitating transformation of content
- content vs. presentation on the web
- variable screen size
- users control window size, fonts, colors, language
Plan for novice vs. expert, age ranges, disabilities, international variations, and technological diversity
Features for novices and experts enrich the interface design and improve perceived quality
- explanations
- shortcuts and faster pacing

Feedback

Feedback for every user action
For frequent and minor actions, response can be modest
For infrequent and major actions, response more substantial
Visual presentation of the objects of interest (and direct manipulation) is a convenient environment for showing changes explicitly

Dialogs with Closure

Sequences of actions should be organized into groups with a beginning, middle, and end.
Informative feedback at the completion of a group of actions gives users
- the satisfaction of accomplishment
- a sense of relief
- signal to drop contingency plans from their minds
- an indicator to prepare for the next group of actions.
For example, e-commerce websites move users from selecting products to the checkout, ending with a clear confirmation page that completes the transaction.

Prevent Errors

Design the interface so that users cannot make serious errors
- gray out menu items that are not appropriate
- do not allow alphabetic characters in numeric entry fields
On error, offer simple, constructive, specific instructions for recovery.
- users shouldn't have to retype an entire address form after entering an invalid zip code
- rather, guide to repair only the faulty part.
Erroneous actions should leave the interface state unchanged
Or the interface should give instructions about restoring the state.
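To make the "repair only the faulty part" guideline concrete, here is a minimal sketch in Python. The field names, the five-digit zip code rule, and the `validate_address` function are all assumptions made for illustration; they are not taken from any particular toolkit.

```python
import re

# Hypothetical per-field rules; the 5-digit zip pattern is an assumption.
RULES = {
    "street": lambda v: bool(v.strip()),
    "city": lambda v: bool(v.strip()),
    "zip": lambda v: re.fullmatch(r"\d{5}", v) is not None,
}

# Specific, constructive messages, one per field.
MESSAGES = {
    "street": "Enter a street address.",
    "city": "Enter a city.",
    "zip": "Enter a 5-digit zip code, for example 02139.",
}


def validate_address(form):
    """Return (accepted, errors).

    Valid fields are kept so the user never retypes them; each faulty
    field gets its own recovery message.
    """
    accepted, errors = {}, {}
    for field, is_valid in RULES.items():
        value = form.get(field, "")
        if is_valid(value):
            accepted[field] = value
        else:
            errors[field] = MESSAGES[field]
    return accepted, errors


accepted, errors = validate_address(
    {"street": "77 Massachusetts Ave", "city": "Cambridge", "zip": "0213"}
)
print(accepted)  # the street and city survive
print(errors)    # only the zip code needs repair
```

The form state for the valid fields is left unchanged by the erroneous entry, which is the behavior the slide asks for.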

Reversible Actions

As much as possible, actions should be reversible.
Relieves anxiety, since users know that errors can be undone
Encourages exploration of unfamiliar options
Units of reversibility
- a single action
- a data-entry task
- a complete group of actions, such as entry of a name-address block.
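The units of reversibility above can be sketched as an undo stack in which a group of actions is undone as one unit. This is only an illustrative sketch; `UndoStack` and `set_field` are hypothetical names, and a real toolkit's undo support will differ.

```python
class UndoStack:
    """Records reversible actions; a group is undone as a single unit."""

    def __init__(self):
        self._units = []         # each unit is a list of undo callbacks
        self._open_group = None  # callbacks collected for the current group

    def begin_group(self):
        self._open_group = []

    def end_group(self):
        if self._open_group:
            self._units.append(self._open_group)
        self._open_group = None

    def record(self, undo_fn):
        if self._open_group is not None:
            self._open_group.append(undo_fn)
        else:
            self._units.append([undo_fn])

    def undo(self):
        """Reverse the most recent unit; return False if nothing is left."""
        if not self._units:
            return False
        for undo_fn in reversed(self._units.pop()):
            undo_fn()
        return True


def set_field(form, stack, key, value):
    """Set form[key] and record how to restore the previous state."""
    had_key, old = key in form, form.get(key)

    def restore():
        if had_key:
            form[key] = old
        else:
            form.pop(key, None)

    form[key] = value
    stack.record(restore)


form, stack = {}, UndoStack()
set_field(form, stack, "phone", "555-0100")   # unit: a single action
stack.begin_group()                           # unit: a name-address block
set_field(form, stack, "name", "Ada")
set_field(form, stack, "address", "1 Main St")
stack.end_group()
stack.undo()                                  # removes name and address together
print(form)
```

Choosing the unit is a design decision: undoing a whole name-address block in one step is convenient, but it also discards more work per undo than a single-action unit would.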

User Control

Experienced users strongly desire the sense that
- they are in charge of the interface
- interface responds to their actions.
Don’t want surprises or changes in familiar behavior
Annoyed by
- tedious data-entry sequences
- difficulty obtaining necessary information
- inability to produce their desired result.

Reduce Memory Load

Humans have limited capacity for information processing in short-term memory
- rule of thumb: “seven plus or minus two chunks” of memory
Avoid interfaces where users must remember information from one display and use it on another
- cellphones should not require reentry of phone numbers
- website locations should remain visible
- long forms should be compacted to fit a single display

Tog's First Principles

Aesthetics
Anticipation
Autonomy (control)
Color
Consistency
Defaults
Discoverability
Efficiency
Explorable interfaces
Fitts's Law

Human interface objects
Latency reduction (feedback)
Learnability
Metaphors
Protect users' work
Readability
Simplicity
Track state
Visible navigation

Another good list is Tog's First Principles, 16 principles from Bruce Tognazzini. We've seen most of these in previous readings. Here are the ones we haven't discussed (as such):

Autonomy: user is in control.
Human interface objects: another way of saying direct manipulation: onscreen objects should be continuously perceivable, and manipulable by physical actions
Latency reduction: minimize response time and give appropriate feedback for slow operations.

Heuristic Evaluation

Inspection technique, like code review to find bugs in software
Performed by an expert
Steps
- Inspect UI thoroughly
- Compare UI against heuristics
- List usability problems
- Explain & justify each problem, referencing heuristics

Heuristic evaluation is a usability inspection process originally invented by Nielsen. Nielsen has done a number of studies to evaluate its effectiveness. Those studies have shown that heuristic evaluation's cost-benefit ratio is quite favorable; the cost per usability problem found is generally lower than with alternative methods.

Heuristic evaluation is an inspection method. It is performed by a usability expert - someone who knows and understands the heuristics we've just discussed, and has used and thought about lots of interfaces.

The basic steps are simple: the evaluator inspects the user interface thoroughly, judges the interface on the basis of the heuristics we've just discussed, and makes a list of the usability problems found - the ways in which individual elements of the interface deviate from the usability heuristics.

The Hall of Fame and Hall of Shame discussions we have at the beginning of each class are informal heuristic evaluations. In particular, if you look back at previous readings, you'll see that many of the usability problems identified in the Hall of Fame & Shame are justified by appealing to a heuristic.

How To Do Heuristic Evaluation

Justify every problem with a heuristic
- “Too many choices on the home page (Aesthetic & Minimalist Design)”
- Don't just say “I don't like the colors”
List every problem
- Even if an interface element has multiple problems
Go through the interface at least twice
- Once to get the feel of the system
- Again to focus on particular interface elements
Don't have to limit to the 10 Nielsen heuristics
- But easy to compare against
- Our general principles (LES, user control, errors, GD) are easier still

Let's look at heuristic evaluation from the evaluator's perspective. That's the role you'll be adopting in the next homework, when you'll serve as heuristic evaluators for each others' computer prototypes.

Here are some tips for doing a good heuristic evaluation. First, your evaluation should be grounded in known usability guidelines. You should justify each problem you list by appealing to a heuristic, and explaining how the heuristic is violated. This practice helps you focus on usability and not on other system properties, like functionality or security. It also removes some of the subjectivity involved in inspections. You can't just say "that's an ugly yellow color"; you have to justify why this is a usability problem that's likely to affect *usability* for other people.

List every problem you find. If a button has several problems with it - inconsistent placement, bad color combination, bad information scent - then each of those problems should be listed separately. Some of the problems may be more severe than others, and some may be easier to fix than others. It's best to get all the problems on the table in order to make these tradeoffs.

Inspect the interface at least twice. The first time you'll get an overview and a feel for the system. The second time, you should focus carefully on individual elements of the interface, one at a time.

Finally, although you have to justify every problem with a guideline, you don't have to limit yourself to the Nielsen 10. We've seen a number of specific usability principles that can serve equally well: affordances, visibility, Fitts's Law, perceptual fusion, color guidelines, graphic design rules are a few. The Nielsen 10 are helpful in that they're a short list that covers a wide spectrum of usability problems. For each element of the interface, you can quickly look down the Nielsen list to guide your thinking. You can also use the 6 high-level principles we've discussed (learnability, visibility, user control, errors, efficiency, graphic design) to help spur your thinking.

In Class

Nielsen Heuristics

Match the real world (L)
Consistency & standards (L)
Help & documentation (L)
User control & freedom (S)
Visibility of system status (S)
Flexibility & efficiency (E)
Error prevention (S)
Recognition, not recall (S)
Error reporting, diagnosis, and recovery (S)
Aesthetic & minimalist design

Let's try it on an example. Here's a screenshot of part of a web page (an intentionally bad interface). A partial heuristic evaluation of the screen is shown below. Can you find any other usability issues?

Shopping cart icon is not balanced with its background whitespace (graphic design)
Good: user is greeted by name (feedback)
Red is used both for help messages and for error messages (consistency, match real world)
"There is a problem with your order", but no explanation or suggestions for resolution (error reporting)
ExtPrice and UnitPrice are strange labels (match real world)
Remove Hardware button inconsistent with Remove checkbox (consistency)
"Click here" is unnecessary (minimalist)
No "Continue shopping" button (user control & freedom)
Recalculate is very close to Clear Cart (error prevention)
"Check Out" button doesn't look like other buttons (consistency, both internal & external)
Uses "Cart Title" and "Cart Name" for the same concept (consistency)
Must recall and type in cart title to load (recognition not recall, error prevention, efficiency)

Formalization

Formal Evaluation Process

Training
- Meeting for design team & evaluators
- Introduce application
- Explain user population, domain, scenarios
Evaluation
- Evaluators work separately
- Generate written report, or oral comments recorded by an observer
- Focus on generating problems, not on ranking their severity yet
- 1-2 hours per evaluator
Severity Rating
- Evaluators prioritize all problems found (not just their own)
- Take the mean of the evaluators' ratings
Debriefing
- Evaluators & design team discuss results, brainstorm solutions

Here's a formal process for performing heuristic evaluation. The training meeting brings together the design team with all the evaluators, and brings the evaluators up to speed on what they need to know about the application, its domain, its target users, and scenarios of use.

The evaluators then go off and evaluate the interface separately. They may work alone, writing down their own observations, or they may be observed by a member of the design team, who records their observations (and helps them through difficult parts of the interface, as discussed in the hints for better heuristic evaluation). In this stage, the evaluators focus just on generating problems, not on how important they are or how to solve them.

Next, all the problems found by all the evaluators are compiled into a single list, and the evaluators rate the severity of each problem. We'll see one possible severity scale in the next slide. Evaluators can assign severity ratings either independently or in a meeting together. Since studies have found that severity ratings from independent evaluators tend to have a large variance, it's best to collect severity ratings from several evaluators and take the mean to get a better estimate.

Finally, the design team and the evaluators meet again to discuss the results. This meeting offers a forum for brainstorming possible solutions, focusing on the most severe (highest priority) usability problems.

When you do heuristic evaluations in this class, I suggest you follow this ordering as well: first focus on generating as many usability problems as you can, then rank their severity, and then think about solutions.

Severity Ratings

Contributing factors

Frequency: how common?
Impact: how hard to overcome?
Persistence: how often to overcome?

Severity scale

Cosmetic: need not be fixed
Minor: needs fixing but low priority
Major: needs fixing and high priority
Catastrophic: imperative to fix

Here's one scale you can use to judge the severity of usability problems found by heuristic evaluation. It helps to think about the factors that contribute to the severity of a problem: its frequency of occurrence (common or rare); its impact on users (easy or hard to overcome), and its persistence (does it need to be overcome once or repeatedly). A problem that scores highly on several contributing factors should be rated more severe than another problem that isn't so common, hard to overcome, or persistent.
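As a rough sketch of the severity-rating step, the following Python averages the ratings from several evaluators and sorts the problems by mean severity. The numeric 1-4 coding of the scale and the ratings themselves are assumptions made up for illustration.

```python
from statistics import mean

# Numeric codes for the four-level scale; the 1-4 mapping is an assumption.
SCALE = {1: "Cosmetic", 2: "Minor", 3: "Major", 4: "Catastrophic"}

# Hypothetical data: problem -> one rating per evaluator.
ratings = {
    "No explanation for order error": [4, 3, 4],
    "Recalculate too close to Clear Cart": [3, 2, 3],
    "Cart icon unbalanced": [1, 1, 2],
}


def prioritize(ratings):
    """Return (problem, mean rating, nearest label), most severe first."""
    rows = []
    for problem, scores in ratings.items():
        avg = mean(scores)
        # Clamp to the scale before looking up the nearest label.
        label = SCALE[min(4, max(1, round(avg)))]
        rows.append((problem, avg, label))
    return sorted(rows, key=lambda row: row[1], reverse=True)


ranked = prioritize(ratings)
for problem, avg, label in ranked:
    print(f"{avg:.2f}  {label:<12}  {problem}")
```

Keeping the individual ratings alongside the mean is useful in the debriefing, since a wide spread on one problem is itself worth discussing.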

Writing Good Heuristic Evaluations

Must communicate well to developers and managers
Include positive comments as well as criticisms
- “Good: Toolbar icons are simple, with good contrast and few colors (minimalist design)”
Be tactful
- Not: “the menu organization is a complete mess”
- Better: “menus are not organized by function”
Be specific
- Not: “text is unreadable”
- Better: “text is too small, and has poor contrast (black text on dark green background)”

Here are some tips on writing good heuristic evaluations.

First, remember your audience: you're trying to communicate to developers. Don't expect them to be experts on usability, and keep in mind that they have some ego investment in the user interface. Don't be unnecessarily harsh.

Although the primary purpose of heuristic evaluation is to identify problems, positive comments can be valuable too. If some part of the design is *good* for usability reasons, you want to make sure that aspect doesn't disappear in future iterations.

Suggested Report Format

What to include:
- Problem
- Heuristic
- Description
- Severity
- Recommendation (if any)
- Screenshot (if helpful)

12. Severe: User may close window without saving data (error prevention)

If the user has made changes without saving, and then closes the window using the Close button, rather than File > Exit, no confirmation dialog appears.

Recommendation: show a confirmation dialog or save automatically
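If you keep your findings in a script or spreadsheet export, the report fields above map naturally onto a small record type. The sketch below is one possible shape in Python; the `Finding` class, its field names, and the `render` layout are assumptions for illustration, not a required format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Finding:
    number: int
    problem: str
    heuristic: str
    description: str
    severity: str
    recommendation: Optional[str] = None  # include only if you have one
    screenshot: Optional[str] = None      # hypothetical file path

    def render(self):
        """Format the finding as a short plain-text report entry."""
        lines = [
            f"{self.number}. {self.severity}: {self.problem} ({self.heuristic})",
            "",
            self.description,
        ]
        if self.recommendation:
            lines += ["", f"Recommendation: {self.recommendation}"]
        if self.screenshot:
            lines += ["", f"Screenshot: {self.screenshot}"]
        return "\n".join(lines)


finding = Finding(
    number=12,
    problem="User may close window without saving data",
    heuristic="error prevention",
    description="Closing the window with the Close button after unsaved "
                "changes shows no confirmation dialog.",
    severity="Severe",
    recommendation="show a confirmation dialog or save automatically",
)
report = finding.render()
print(report)
```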

UI Hall of Fame or Shame?

A Detailed Evaluation

UC Irvine student project
Medium post
Performed user testing (study next week)
Identified flaws
Redesigned to fix

Lack of feature visibility
- Buttons for important features hidden among text, hard-to-find menus
Lack of feedback
- No signal of successful post
Consistency/standards violations
- clicking name doesn't access settings
- settings button is an icon; most others are text
Lack of user control
- Trouble moving back. Much use of back button

Students at UC Irvine evaluated Piazza through user testing, then redesigned it based on what they learned. They wrote it up in a Medium post.

Heuristic Evaluation Is Not User Testing

Evaluator is not the user either
- Maybe closer to being a typical user than you are, though
Analogy: code inspection vs. testing
HE finds problems that UT often misses
- Inconsistent fonts
- Fitts's Law problems
But UT is the gold standard for usability

Heuristic evaluation is only one way to evaluate a user interface. User testing (watching users interact with the interface) is another. User testing is really the gold standard for usability evaluation. An interface has usability problems only if real users have real problems with it, and the only sure way to know is to watch and see.

A key reason why heuristic evaluation is different is that an evaluator is not a typical user either! They may be closer to a typical user, however, in the sense that they don't know the system model to the same degree that its designers do. And a good heuristic evaluator tries to think like a typical user. But an evaluator knows too much about user interfaces, and too much about usability, to respond like a typical user.

So heuristic evaluation is not the same as user testing. A useful analogy from software engineering is the difference between code inspection and testing.

Heuristic evaluation may find problems that user testing would miss (unless the user testing was extremely expensive and comprehensive). For example, heuristic evaluators can easily detect problems like inconsistent font styles, e.g. a sans-serif font in one part of the interface, and a serif font in another. Adapting to the inconsistency slows down users slightly, but only extensive user testing would reveal it. Similarly, a heuristic evaluation might notice that buttons along the edge of the screen are not taking proper advantage of the Fitts's Law benefits of the screen boundaries, but this problem might be hard to detect in user testing.

Evaluating Prototypes

Heuristic evaluation works on:
- Sketches
- Paper prototypes
- Buggy implementations
"Missing-element" problems are harder to find on sketches
- Because you're not actually using the interface, you aren't blocked by feature's absence
- Look harder for them

A final advantage of heuristic evaluation that's worth noting: heuristic evaluation can be applied to interfaces in varying states of readiness, including unstable implementations, paper prototypes, and even just sketches. When you're evaluating an incomplete interface, however, you should be aware of one pitfall. When you're just inspecting a sketch, you're less likely to notice missing elements, like buttons or features essential to proceeding in a task. If you were actually *interacting* with an active prototype, essential missing pieces rear up as obstacles that prevent you from proceeding. With sketches, nothing prevents you from going on: you just turn the page. So you have to look harder for missing elements when you're heuristically evaluating static sketches or screenshots.

Hints for Better Heuristic Evaluation

Use multiple evaluators
- Different evaluators find different problems
- The more the better, but diminishing returns
- Nielsen recommends 3-5 evaluators
Alternate heuristic evaluation with user testing
- Each method finds different problems
- Heuristic evaluation is cheaper
It's OK for observer to help evaluator
- As long as the problem has already been noted
- This wouldn't be OK in a user test

Now let's look at heuristic evaluation from the designer's perspective. Assuming I've decided to use this technique to evaluate my interface, how do I get the most mileage out of it?

First, use more than one evaluator. Studies of heuristic evaluation have shown that no single evaluator can find all the usability problems, and some of the hardest usability problems are found by evaluators who find few problems overall (Nielsen, "Finding usability problems through heuristic evaluation", CHI '92). The more evaluators the better, but with diminishing returns: each additional evaluator finds fewer new problems. The sweet spot for cost-benefit, recommended by Nielsen based on his studies, is 3-5 evaluators.
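One simple way to see the diminishing returns is an independence model: if each evaluator finds any given problem with probability p, then n evaluators together are expected to find a fraction 1 - (1 - p)^n of the problems. Nielsen and Landauer proposed a model of this form; the value p = 0.3 used below is an assumption chosen for illustration, and real values vary by project and by evaluator.

```python
def proportion_found(evaluators, p=0.3):
    """Expected fraction of problems found by at least one evaluator.

    Assumes every evaluator independently finds each problem with
    probability p (an idealization; p=0.3 is an illustrative guess).
    """
    return 1 - (1 - p) ** evaluators


previous = 0.0
for n in range(1, 9):
    total = proportion_found(n)
    print(f"{n} evaluators: {total:.0%} found (+{total - previous:.0%})")
    previous = total
```

Under these assumptions each added evaluator contributes less than the one before, which is the shape of the curve behind the 3-5 evaluator recommendation.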

One way to get the most out of heuristic evaluation is to alternate it with user testing in subsequent trips around the iterative design cycle. Each method finds different problems in an interface, and heuristic evaluation is almost always cheaper than user testing. Heuristic evaluation is particularly useful in the tight inner loops of the iterative design cycle, when prototypes are raw and low-fidelity, and cheap, fast iteration is a must.

In heuristic evaluation, it's OK to help the evaluator when they get stuck in a confusing interface. As long as the usability problems that led to the confusion have already been noted, an observer can help the evaluator get unstuck and proceed with evaluating the rest of the interface, saving valuable time. In user testing, this kind of personal help is totally inappropriate, because you want to see how a user would really behave if confronted with the interface in the real world, without the designer of the system present to guide them. In a user test, when the user gets stuck and can't figure out how to complete a task, you usually have to abandon the task and move on to another one.

Cognitive Walkthrough: Another Inspection Technique

Expert inspection focused on learnability
Inputs:
- prototype
- task
- sequence of actions to do the task in the prototype
- user analysis

For each action, evaluator asks:
- will user know what subgoal they want to achieve?
- will user find the action in the interface?
- will user recognize that it accomplishes the subgoal?
- will user understand the feedback of the action?

Cognitive walkthrough is another kind of usability inspection technique. Unlike heuristic evaluation, which is general, a cognitive walkthrough is particularly focused on evaluating learnability - determining whether an interface supports learning how to do a task by exploration.

In addition to the inputs given to a heuristic evaluation (a prototype, typical tasks, and user profile), a cognitive walkthrough also needs an explicit sequence of actions that would perform each task. This establishes the *path* that the walkthrough process follows. The overall goal of the process is to determine whether this is an easy path for users to discover on their own.

Where heuristic evaluation focuses on individual elements in the interface, a cognitive walkthrough focuses on individual actions in the sequence, asking a number of questions about the learnability of each action.

Will the user try to achieve the right subgoal? For example, suppose the interface is an e-commerce web site, and the overall goal of the task is to create a wish list. The first action is actually to sign up for an account with the site. Will users realize that? (They might if they're familiar with the way wish lists work on other sites; or if the site displays a message telling them to do so; or if they try to invoke the Create Wish List action and the system directs them to register first.)
Will the user find the action in the interface? This question deals with visibility, navigation, and labeling of actions.
Will the user recognize that the action accomplishes their subgoal? This question addresses whether action labels and descriptions match the user's mental model and vocabulary.
If the correct action was done, will the user understand its feedback? This question concerns visibility of system state - how does the user recognize that the desired subgoal was actually achieved.

Cognitive walkthrough is a more specialized inspection technique than heuristic evaluation, but if learnability is very important in your application, then a cognitive walkthrough can produce very detailed, useful feedback, very cheaply.
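The walkthrough procedure can be sketched as a loop over the action sequence that asks the four questions at every step. The Python below is only a bookkeeping aid under stated assumptions: the `walkthrough` function, the shape of `answers`, and the sample actions are hypothetical, and the judgments themselves still come from the evaluator.

```python
QUESTIONS = (
    "Will the user know what subgoal they want to achieve?",
    "Will the user find the action in the interface?",
    "Will the user recognize that it accomplishes the subgoal?",
    "Will the user understand the feedback of the action?",
)


def walkthrough(actions, answers):
    """List (action, question, note) for every question not answered 'yes'.

    `answers` maps (action, question index) to (ok, note). A missing
    entry is reported as unassessed so that no step is silently skipped.
    """
    issues = []
    for action in actions:
        for i, question in enumerate(QUESTIONS):
            ok, note = answers.get((action, i), (False, "not yet assessed"))
            if not ok:
                issues.append((action, question, note))
    return issues


# Hypothetical action sequence for the wish-list task.
actions = ["Sign up for an account", "Invoke Create Wish List"]
answers = {(a, i): (True, "") for a in actions for i in range(len(QUESTIONS))}
answers[("Sign up for an account", 0)] = (
    False,
    "Nothing tells the user an account is needed first",
)

issues = walkthrough(actions, answers)
for action, question, note in issues:
    print(f"{action}: {question} -> {note}")
```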

User Centered Design

User-Centered Design

Iterative design
Early focus on users and tasks
Constant evaluation

Traditional Software Engineering Process: Waterfall Model

Let's contrast the iterative design process with an alternative. The waterfall model was one of the earliest carefully-articulated design processes for software development. It models the design process as a sequence of stages. Each stage results in a concrete product - a requirements document, a design, a set of coded modules - that feeds into the next stage. Each stage also includes its own validation: the design is validated against the requirements, the code is validated (unit-tested) against the design, etc.

The biggest improvement of the waterfall model over previous (chaotic) approaches to software development is the discipline it puts on developers to think first, and code second. Requirements and designs generally precede the first line of code.

If you've taken a software engineering course, you've experienced this process yourself. The course staff probably handed you a set of requirements for the software you had to build, e.g., the specification of a chat client or a pinball game. (In the real world, identifying these requirements would be part of your job as software developers.) You were then expected to meet certain milestones for each stage of your project, and each milestone had a concrete product: (1) a design document; (2) code modules that implemented certain functionality; (3) an integrated system.

Validation is not always sufficient; sometimes problems are missed until the next stage. Trying to code the design may reveal flaws in the design - e.g., that it can't be implemented in a way that meets the performance requirements. Trying to integrate may reveal bugs in the code that weren't exposed by unit tests. So the waterfall model implicitly needs feedback between stages.

The danger arises when a mistake in an early stage - such as a missing requirement - isn't discovered until a very late stage - like acceptance testing. Mistakes like this can force costly rework of the intervening stages. (That box labeled "Code" may look small, but you know from experience that it isn't!)

Waterfall Model Is Bad for UI Design

User interface design is risky
- So we're likely to get it wrong
Users are not involved in validation until acceptance testing
- So we won't find out until the end
UI flaws often cause changes in requirements and design
- So we have to throw away carefully-written and tested code

Although the waterfall model is useful for some kinds of software development, it's very poorly suited to user interface development.

First, UI development is inherently risky. UI design is hard for all the reasons we discussed in the first class. (You are not the user; the user is always right, except when the user isn't; users aren't designers either.) We don't (yet) have an easy way to predict whether a UI design will succeed.

Second, in the usual way that the waterfall model is applied, users appear in the process in only two places: requirements analysis and acceptance testing. Hopefully we asked the users what they needed at the beginning (requirements analysis), but then we code happily away and don't check back with the users until we're ready to present them with a finished system. So if we screwed up the design, the waterfall process won't tell us until the end.

Third, when UI problems arise, they often require dramatic fixes: new requirements or new design. We saw in Lecture 1 that slapping on patches doesn't fix serious usability problems.

Iterative Design

We won't get it right the first time
Evaluation will force re-design
Eventually, converge to good solution
Design guidelines help reduce number and cost of iterations
Isn't this just like a repeated waterfall?

Iterative design offers a way to manage the inherent risk in user interface design. In iterative design, the software is refined by repeated trips around a design cycle: first imagining it (design), then realizing it physically (implementation), then testing it (evaluation).

In other words, we have to admit to ourselves that we aren't going to get it right on the first try, and plan for it. Using the results of evaluation, we redesign the interface, build new prototypes, and do more evaluation. Eventually, hopefully, the process produces a sufficiently usable interface.

Sometimes you just iterate until you're satisfied or run out of time and resources, but a more principled approach is to set usability goals for your system. For example, an e-commerce web site might set a goal that users should be able to complete a purchase in less than 30 seconds.
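A usability goal like this is only useful if you check it after each round of evaluation. Here is a minimal sketch in Python of such a check; the completion times are made-up data, and the decision to count the goal as met only when every measured user finishes within the limit is an assumption (a median or percentile criterion would be just as reasonable).

```python
from statistics import median

GOAL_SECONDS = 30  # the example goal: complete a purchase in under 30 seconds

# Hypothetical completion times (seconds) from one round of evaluation.
purchase_times = [24.1, 41.5, 28.0, 33.2, 26.7]


def goal_report(times, goal):
    """Summarize how a set of measured task times compares with the goal."""
    within = sum(1 for t in times if t < goal)
    return {
        "median": median(times),
        "within_goal": within,
        "fraction_within": within / len(times),
        "goal_met": within == len(times),  # assumed criterion: every user
    }


report = goal_report(purchase_times, GOAL_SECONDS)
print(report)
```

If the goal isn't met, that is the signal to go around the design cycle again rather than stop.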

Many of the techniques we'll learn in this course are optimizations for the iterative design process: design guidelines reduce the number of iterations by helping us make better designs; cheap prototypes and discount evaluation techniques reduce the cost of each iteration. But even more important than these techniques is the basic realization that in general, you won't get it right the first time. If you learn nothing else about user interfaces from this class, I hope you learn this.

You might object to this, though. At a high level, iterative design just looks like the worst-case waterfall model, where we made it all the way from design to acceptance testing before discovering a design flaw that *forced* us to repeat the process. Is iterative design just saying that we're going to have to repeat the waterfall over and over and over? What's the trick here?

Spiral Model

Know early iterations will be discarded
So make them cheap
Storyboards, sketches, mock-ups
Low-fidelity prototypes
Just detailed enough for evaluation

The spiral model offers a way out of the dilemma. We build room for several iterations into our design process, and we do it by making the early iterations as cheap as possible.

The radial dimension of the spiral model corresponds to the cost of the iteration step - or, equivalently, its fidelity or accuracy. For example, an early implementation might be a paper sketch or mockup. It's low fidelity, only a pale shadow of what it would look and behave like as interactive software. But it's incredibly cheap to make, and we can evaluate it by showing it to users and asking them questions about it.

Early Prototyping

Sketches

Paper Prototypes

Computer Mockups

Here are some examples of early-stage prototyping for graphical user interfaces. We'll talk about these techniques and more in a future prototyping lecture.

Early Prototypes Can Detect Usability Problems

Even a sketch would have revealed many usability problems
No need for an interactive implementation

Remember this Hall of Shame candidate from the first class? This dialog's design problems would have been easy to catch if it had been tested as a simple paper sketch, in an early iteration of a spiral design. At that point, changing the design would have cost only another sketch, instead of a day of coding.

Increasing Fidelity over Iterations

Iterative Design of User Interfaces

Early iterations use cheap prototypes
- Parallel design is feasible: build & test multiple prototypes to explore design alternatives
Later iterations use richer implementations, after UI risk has been mitigated
More iterations generally mean better UI
Only mature iterations are seen by the world

Why is the spiral model a good idea? Risk is greatest in the early iterations, when we know the least. So we put our least commitment into the early implementations. Early prototypes are made to be thrown away. If we find ourselves with several design alternatives, we can build multiple prototypes (parallel design) and evaluate them, without much expense. The end of this reading will make more arguments for the value of parallel design.

After we have evaluated and redesigned several times, we have (hopefully) learned enough to avoid making a major UI design error. Then we actually implement the UI - which is to say, we build a prototype that we intend to keep. Then we evaluate it again, and refine it further.

The more iterations we can make, the more refinements in the design are possible. We're hill-climbing here, not exploring the design space randomly. We keep the parts of the design that work, and redesign the parts that don't. So we should get a better design if we can do more iterations.
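The hill-climbing analogy can be made concrete with a toy sketch. Everything here is an assumption for illustration: the design is reduced to two numbers (checkout steps and font size), and `toy_score` stands in for a real evaluation, which in practice means watching users, not computing a formula.

```python
def hill_climb(design, score, variations, max_iterations=20):
    """Repeatedly move to the best-scoring variation until nothing improves."""
    best, best_score = design, score(design)
    for _ in range(max_iterations):
        candidates = variations(best)
        if not candidates:
            break
        challenger = max(candidates, key=score)
        if score(challenger) <= best_score:
            break  # no variation is better: we are at a local optimum
        best, best_score = challenger, score(challenger)
    return best, best_score


def toy_score(design):
    # Hypothetical stand-in for a usability evaluation: higher is better.
    steps, font_size = design
    return -abs(steps - 3) - abs(font_size - 14) / 2


def toy_variations(design):
    # Change one aspect at a time, keeping the rest of the design intact.
    steps, font_size = design
    return [
        (steps - 1, font_size), (steps + 1, font_size),
        (steps, font_size - 2), (steps, font_size + 2),
    ]


final, final_score = hill_climb((7, 10), toy_score, toy_variations)
print(final, final_score)
```

Note what the sketch also shows about the limits of iteration: hill-climbing stops at a local optimum, which is one reason exploring several starting points in parallel early on can pay off.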

Case Study of User-Centered Design: The Olympic Message System

Cheap prototypes
- Scenarios
- User guides
- Simulation (Wizard of Oz)
- Prototyping tools (IBM Voice Toolkit)
Iterative design
- 200 (!) iterations for user guide
Evaluation at every step
You are not the user
- Non-English speakers had trouble with alphabetic entry on telephone keypad

The Olympic Message System is a classic demonstration of the effectiveness of user-centered design (Gould et al., “The 1984 Olympic Message System,” CACM, v30 n9, Sept 1987). The OMS designers used a variety of cheap prototypes: scenarios (stories envisioning a user interacting with the system), manuals, and simulation (in which the experimenter read the system's prompts aloud, and the user typed responses into a terminal). All of these prototypes could be (and were) shown to users to solicit reactions and feedback.

Iteration was pursued aggressively. The user guide went through 200 iterations!

A video about OMS can be found on YouTube. Check it out: it includes a mime demonstrating the system.

The OMS also has some interesting cases reinforcing the point that designers cannot rely entirely on themselves for evaluating usability. Most prompts requested numeric input ("press 1, 2, or 3"), but some prompts needed alphabetic entry ("enter your three-letter country code"). Non-English speakers, particularly those from countries whose languages do not use the Latin alphabet, found this confusing, because, as one athlete reported in an early field test, "you have to read the keys differently." The designers didn't remove the alphabetic prompts, but they did change the user guide's examples to use only uppercase letters, just like the telephone keys.
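To see what "reading the keys differently" involves, here is a small sketch of alphabetic entry on a telephone keypad. It assumes the modern ITU-T E.161 letter layout, which may differ from the keypads of 1984, and it is not a description of how the OMS itself decoded input; the names `KEYPAD` and `keys_for` and the sample code "usa" are hypothetical.

```python
# Letter groups printed on each key, assuming the modern ITU-T E.161 layout.
KEYPAD = {
    "2": "ABC", "3": "DEF", "4": "GHI", "5": "JKL",
    "6": "MNO", "7": "PQRS", "8": "TUV", "9": "WXYZ",
}

# Invert the table so each uppercase letter maps to the key it is printed on.
LETTER_TO_KEY = {
    letter: key for key, letters in KEYPAD.items() for letter in letters
}


def keys_for(code):
    """Return the digit keys a caller would press to spell `code`."""
    return "".join(LETTER_TO_KEY[ch] for ch in code.upper())


# Lowercase input must first be matched to the uppercase labels on the keys.
print(keys_for("usa"))
```

The `upper()` call is the step the user has to do in their head. A guide that prints its examples in uppercase removes that step, which is the change the designers made.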

Summary

Usability Heuristics
- Sets of usually-right rules for UIs
Heuristic Evaluation
- Process to inspect/review interfaces to identify problems
User-Centered Design
- Plan for iteration
- Low-fidelity first
- Frequent feedback

Discussion of Course

Parallel tracks: Design and Implementation
- So far evaluation; now begin doing it
- Order important?
Learning by doing
- Lab
- Homework iteration
