U141201B
U141201B
2
INDEX
1. INTRODUCTION - THE IMPORTANCE OF RESEARCH AND TESTING
2. USABILITY TESTING GOALS
Defining Your Usability Goals
Usability Metrics
Takeaway
3. CHOOSING YOUR TEST AND PARTICIPANTS
Types of Test
Finding Your Target Test Audience
Usability Test Plan
Takeaway
4. SCRIPTED TESTS
Moderated vs. Unmoderated Tests
Tree Testing
Usability Benchmark Testing
Hallway Usability Testing
Takeaway
5....DECONTEXTUALIZED TESTS & HEURISTIC REVIEWS
Card Sorting
User Interviews
Heuristics Evaluations
Takeaway
6....NATURAL & NEAR-NATURAL TESTS
A/B Testing
First Click Testing
Field and Diary Studies
Eye Tracking Test
3
Beta Testing (User Acceptance Testing)
Takeaway
7... HYBRID TESTS
Desirability Testing
Concept Testing
Participatory Testing
Takeaway
8....WEBSITE & MOBILE USABILITY TESTING
Website Usability Testing
Mobile Usability Testing
Takeaway
9....ABOUT UXPIN
4
CHAPTER ONE
Introduction
A quick note from the authors
5
T he biggest challenge designers and prod-
uct managers face isn’t how the market or
different technologies work — it’s how humans
work. What users say versus what users do are
two completely different things, and the only
way to verify is to test. Usability testing is more
than a just a checkbox on a list of product re-
quirements — it is the most convincing support
for your design decisions.
Test early and test often. Every company and product is different, so there is
no magical usability test that will tell you everything you need to know. Define
your hypothesis, pick several quantitative and qualitative methods, and get
ready to go out of your comfort zone.
Usability testing helps you see the bottom line of whether your design works or
doesn’t. We’ll look at how highly successful companies like Apple, MailChimp,
Yahoo, DirecTV, Microsoft, Buffer, among others, used different usability
testing tactics that all suited their own unique needs. We’ve also included our
own preferences, and outlined how UXPin conducts usability testing.
6
We’d love your thoughts on what we’ve written. And feel free to include anyone
else in the discussion by sharing this e-book.
Chris Bank
(co-written by Jerry Cao)
7
CHAPTER T WO
8
L ike all significant undertakings, you need to go into usability testing with a
plan. As you’ll see, a little extra time planning at the beginning can pay off
in the end. By following a few simple guidelines, you’ll know what to expect,
what to look for, and what to take away from your usability testing.
Obviously you’d like to optimize the results of your usability testing, and in
order to do that, you must first know what you’re testing for. We’ll explain how
to define your testing objectives and set your usability metrics.
There’s no question about what Waldo looks like before you open the book, but
all too often companies jump the gun with their usability tests and don’t know
what they’re looking for, or even why. For this, the first step in usability research
should always be knowing what you want to get out of it — but that’s not as easy
as it sounds. You need to categorize your testing goals and know what type of
data is most appropriate.
9
I. CATEGORIZING YOUR GOALS
• Users — Who uses your product? Who do you want to use your product?
Be as specific as possible: demographics, location, usage patterns —
whatever you can find out.
• Success — What is your idea of success for this product? Make sure the
entire team is on the same page.
10
• Timing and Scope — What time frame are you working with for collecting
your data? When is it due?
Once you’ve finished your benchmark questions, you can reverse the roles and
have your team write down their questions (that way you’ll have identified what
they know, and what they’d like to know). Becky White of Mutual Mobile talks
about a sample exercise to help you narrow down your goals. Gather your team
together and pass out sticky notes. Then, have everyone write down questions
they have about their users and the UX. Collect all the questions and stick them
to a board. Finally, try to organize all the questions based on similarity. You’ll
see that certain categories will have more questions than others — these will
likely become your testing objectives.
It also helps to make sure your testing objectives are as simple as possible. Your
objectives should be simple like “Can visitors find the information they need?”
instead of complex objectives like “Can visitors easily find our products and make
an informed purchase decision?”
11
If you think using usability testing questions as a means to set your goals,
Userium offers this helpful website usability checklist. If you notice you’re
lacking in one or more categories, those are where collecting data would be
most helpful (and are good talking points if your team gets stuck during the
initial Q&A).
”The simplest usability testing objectives lead to the deepest design insights.”
TWEET THIS
Now that you know your goals, it’s time to figure out how to apply usability
testing to accomplish them. Essentially, you’re clarifying the greater scope of
your testing.
The UserTesting e-book about user testing suggests that you must first
understand what type of feedback would be most helpful for your results. Does
your team need a graph or a rating scale? Personal user accounts or numbers?
Written responses or sound bites? The people who will read the data can impact
the best type to collect: skeptical stakeholders might be convinced by the cold,
hard numbers of a graphed quantitative rating scale, while the CEO might be
made to understand a problem if he saw a video clip of users failing at a certain
task.
This is why knowing your usability goals first is so important. If you don’t know
the overall goals and objectives, then you certainly don’t know what type of
feedback and data you need. This chart below should help give you an example
of how the type of data affects the type of testing.
12
source: User Testing Your Next Project
Once you know your goals and what type of data you’re looking for, it’s time
to begin planning the actual tests. But before we get into that, let’s talk a little
about metrics.
13
Usability Metrics
”Qualitative & quantitative data help you understand what to fix & why, and how many
TWEET THIS
problems exist.”
• Success Rate — In a given scenario, was the user able to complete the
14
assigned task? When we tested 35 users for a redesign of the Yelp website,
this was one of the most important bottom-line metrics.
• Error Rate — Which errors tripped up users most? These can be divided
into two types: critical and noncritical. Critical errors will prevent a user
from completing a task, while noncritical errors will simply lower the
efficiency with which they complete it.
• Time to Completion — How much time did it take the user to complete
the task? This can be particularly useful when determining how your
product compares with your competitors (if you’re testing both).
15
In a general overview of metrics, Jakob Nielsen, co-founder of the Nielsen
Norman Group and usability expert, states plainly, “It is easy to specify
usability metrics, but hard to collect them.” Because gathering usability metrics
can be difficult, time-consuming, and/or expensive, a lot of small-budget
companies shy away from them even though they could prove useful. So are
metrics a worthwhile investment for you? Nielsen lists several situations in
particular where metrics are the most useful:
• Tracking progress between releases — Did your newest update hit the
mark? The metrics will show you if you’ve solved your past problems or
still need to tweak your design.
Usability metrics are always helpful, but can be a costly investment since you
need to test more people for statistical significance. If you plan on gathering
quantitative data, make sure you collect qualitative data so you have a system of
checks-and-balances, otherwise you run the risk of numbers fetishism. You can
actually see how this risk could play out in the real world in a clever explanation
of margarine causing divorce by Hannah Alvarez of UserTesting.
”There’s a fine line between quant analysis and numbers fetishism. Qualitative data is
TWEET THIS
your reality check.”
16
Takeaway
In some ways, the planning phase is the most important in usability research.
When it’s done correctly, with patience and thought, you data will be accurate
and most beneficial. However, if the initial planning is glossed over — or even
ignored — your data will suffer and call into question the value of the whole
endeavor. Take to heart the items discussed in this chapter, and don’t move
forward until you’re completely confident in your objectives and how to achieve
them.
In the next chapter, we’ll start to get into the specifics of the actual test
planning, namely what kind of test will work and whom to choose to
participate. As both the type of test and the type of user can differ greatly, it’s
vital to take the time in deciding.
17
CHAPTER THREE
18
I n this chapter we’re going to discuss two of essential factors in a user test:
the users and the tests. Now that you know what your goals are, you’re ready
to hone your test planning to meet those specific goals. There are many tests to
choose from, and many types of people to recruit, so narrowing your focus will
get you closer to the results you want.
Types of Test
But no matter what type of test you choose, you should always start with a pilot
test. Many people like to gloss over this, but sacrificing a little extra time for a
pilot test almost always pays off.
I. PILOT TEST
Pilot testing is like a test run of your greater user test. In A Practical Guide to
Usability Testing, Joseph S. Dumas and Janice C. Redish call pilot tests a “dress
rehearsal for the usability test to follow.” You will conduct the test and collect
the data in the same way you would a real test, but the difference is that you
don’t analyze or include the data. You are, quite literally, testing your test.
”Before you test your users, test your test. Always run a pilot test.”
TWEET THIS
That may seem like a waste of time — and you will likely be tempted to just
jump directly into the actual tests — but pilot tests are highly recommended.
19
The reason is that, in most cases, something will go wrong with your first test.
Whether technical problems, human error, or a situational occurrence, it’s rare
that a first test session goes well, or even adequately.
The idea is that these tests should be as scientific and precise as possible. If you
want the most reliable data, run a pilot test or two until you feel you understand
the process and have removed all the kinks.
In the following four chapters, we’ll be going over the specifics of each type of
user testing method. But for now we’ll give you an overview so you know what to
expect.
20
Christian Rohrer, Chief Design Officer in the Consumer Division at McAfee,
Inc., explains in an article for the Nielsen Norman Group the distinctions
between the types of tests. While he uses a complex three-dimensional
framework to explain the intricacies of the different tests, for simplicity’s sake
we’re going to focus on his division among how the product is used.
1. Scripted use of the product — These tests focus on specific usage aspects.
The degree of scripting varies, with more scripting generating more
controlled data.
2. Decontextualized use the product — Tests that don’t use the product — at
least in the actual testing phase — are designed for broader topics like UX
or generating ideas.
Each of these types of tests — and their most common examples — will be
discussed at length in the rest of the e-book. For now, though, let’s get back to
creating a plan.
Each type of test is divided into tasks, the execution of which will affect the
validity and overall usefulness of the data collected. While each test will have
its own properties for the type of tasks, Tingting Zhao, Usability Specialist for
21
Ubuntu, shows us some distinctions to keep in mind when designing tasks.
Zhao outlines two main choices to make for each task. The first choice is
whether to phrase your tasks directly or with a scenario.
source: Zezz
22
The second distinction to make when creating tasks is between closed and
open-ended tasks.
• Closed — A closed task is one with clearly defined success or failure. These
are used for testing specific factors like success rate or time. For example, in
our Yelp redesign exercise, we gave participants the following task: “Your
friend is having a birthday this weekend. Find a fun venue that can seat up
to 15 people.”
We’ll talk more about tasks in the following chapters, but for now keep these
important distinctions in mind as you come to understand what you want out of
your usability testing.
With all this talk of data and research, it’s easy to forget that the core
component of these tests are actual people. To think of your participants
as merely test subjects is a mistake; they are all individuals with their own
personalities and their own way of doing things. Deciding the type of people
you want to provide you data is a major factor — even if ultimately you decide
you want them to be random.
23
I. YOUR TARGET USERS
Unless you’re designing the Beatles of products where everyone can enjoy it, it’s
best to narrow down your target audience to the users most likely to use your
product.
”It’s a mistake to think of participants as test subjects. They are all individuals with per-
TWEET THIS
sonalities and built-in habits.”
Knowing your target audience is not really a topic for usability testing; in
theory, this is something you should have already decided in the Product
Definition phase (as discussed in The Guide to UX Design Process &
Documentation).
24
However, depending on the complexity of your tasks, you may need more
than one user group. For example, when conducting user testing for our
Yelp redesign, we realized we needed two groups of people: those with Yelp
accounts, and those who did not. Once we knew the overall groups, we then
decided that both groups needed to have users who were located in the US,
used Yelp at most 1-2x a week, and browsed mostly on their desktops.
When focusing in on your test group, it’s also important not to obsess over
demographics. The biggest differentiator will likely be whether users have prior
experience or are knowledgeable about their domain or industry — not gender,
age, or geography. Once you know whom you’re looking for, it’s time to get
out there and find them. If you find you have more than one target group, that’s
okay; just remember to test each group independently of each other — that will
make your data more telling.
”Don’t obsess over demographics. Users’ prior experience and knowledge will likely matter
TWEET THIS
more.”
Knowing who you want for the test is only half the battle; you still need to get
them to come (or agree to let you come to them). Luckily, Jeff Sauro, founder
of Measuring Usability LLC, lists seven of the most effective methods and
usability tools for recruiting people for usability tests. Below, we’ll briefly
describe each method (we’re big fans of UserTesting and hallway testing).
1. Existing Users — By definition, these are your target users. Try self-
promoting on your website, or work with your customer service
department to locate interested users. Even if you’re researching a new
product or if your company has produced similar products in the past,
there’s a chance they both target the same type of person.
25
2. UserTesting — A website designed specifically for this, UserTesting lets
you select users by age, gender, location, and even more customizable
options. The site delivers audio and video of users actually testing your site
or app.
If you’re conducting later-stage beta testing, you can recruit beta testers from
within your existing user base, as long as it’s large enough. If, however, you need
to recruit them elsewhere, Udemy explains the best ways to find them.
You’re almost ready to dive into your testing, but before you do, there’s just
one last thing: a one-page usability checklist. As discussed in The Guide to UX
Design Process & Documentation, this succinct outline will tell stakeholders
everything they need to know about the test, but without boring them with all
the details.
source: TU Delft
27
Tomer Sharon, Author and UX Researcher at Google Search, provides a
simple outline for your synopsis:
5. Goals — Try to sum it up with one sentence, but if you have multiple goals,
use a short bulleted list.
6. Research Questions — Make it clear these are the questions you hope to
answer with the study, not the questions you’ll be asking the participants.
With the usability checklist in hand, all the key players will be on the same page,
so to speak. We’ve provided a free usability testing kit (which includes a testing
report) so that you can incorporate these points.
28
Takeaway
We can’t stress enough the importance of the pre-planning phases. The type
of test and users you go with will have the biggest impact on your results, and
going with the wrong choices will greatly reduce the accuracy. Having a solid
plan can make all the difference, and ensure that you meet your own personal
needs.
In the next chapter we’re going to start getting into the types of tests,
specifically scripted tests. With your usability goals ready, keep an eye out for
the tests that will help accomplish your plan to the fullest.
29
CHAPTER FOUR
Scripted Tests
More controlled tests for more specific results
30
A scripted test is the most controlled of the test types, and is recommended
for testing specific usage aspects, like whether or not the user can find/
access a certain feature (or how long it takes to do so). Scripted tests tend to
produce more quantitative data, beneficial for usability metrics, but can also
generate qualitative data as well, depending on the how tight or controlling the
script is.
Before we get into the specific types of scripted tests (tree testing, benchmark
tests, and hallway testing), we’ll first discuss a crucial decision in how you
conduct your test: whether to moderate it or not.
31
Moderated vs. Unmoderated Tests
Physicists understand well the observer effect — the idea that the presence of
an observer changes the behavior of what’s being observed, negating the whole
point of observing it. This may apply to photons, but what about people?
Whether or not you choose to moderate your test depends on your specific
goals and what you need to accomplish. In some instances a moderator will help
facilitate the process and aid it in going smoothly, while in other instances they
will only interrupt, not to mention the extra costs of an on-site staff. Below we’ll
talk about the pros and cons of each, so you can decide which will work best for
your user test.
I. MODERATED TESTS
32
source: An Introduction To Website Usability Testing
As you can guess, moderated testing is not recommended for all tests. The
experts at UserTesting recommend it for the following situations:
33
But even the moderation proponents admit that moderated tests have their
drawbacks, specifically convenience. Moderated tests require a knowledgeable
moderator, their time, and usually a specified location, as opposed to remote
usability testing. Coordinating the schedules of moderated tests can be
problematic, and only one can be done at a time, unless more moderators are
hired. More importantly, moderated tests can take participants out of their
comfort zone, so special care must be taken to avoid the various kinds of biases.
In UXPin, you can actually run a remote moderated usability tests quite easily.
Download the Chrome plugin, set up your tasks, and start testing. As you can see in
our testing overview, UXPin generates video clips that let you see every click, hear
user’s thoughts, and see their screens and faces.
For a moderated test, you could also let your testers participate from the
comfort of their own home. For example, Evernote actually ran a remote
usability test that was moderated in which the testers were in different
locations, but the moderators were all in the office. This offers the benefits
of moderation at lower cost (since you don’t have to worry as much about
equipment setup), but it may not be suitable if you need a controlled lab
environment due to information sensitivity. Nonetheless, this tactic is effective
34
and Evernote gained insights that helped them improve user retention by 15%.
If you have any of the special needs listed above, moderation may be the right
choice. If you do choose this route, make sure you follow these 12 tips for being
a perfect moderator to minimize the likelihood of bias.
• Cost savings — Costs are usually quite low since you don’t need to
pay for moderators or equipment setup. With usability testing tools
like UserTesting and Userlytics, tests can run as low as $49 per user.
Unmoderated tests are also scalable depending on the testing tool used.
As you’ll see in the above video from our User Testing & Design e-book, you
can get maximum value for minimum cost when the tasks are written as clearly
as possible. Users are encouraged to think out loud, and you record their on-
screen interactions. When the test is done, you can then use the video clips that
are most insightful and present them to your team for design changes.
There are downsides, however. The lack of a moderator means less control, less
personal observation, and a higher risk of confusion. Additionally, conducting
high volume, unmoderated tests using an online tool opens you to the risk of
attracting participants looking only for the incentive without putting effort into
the tasks. On the bright side, such participants can be filtered, especially by
looking at their time-to-completion or open-ended feedback.
Nonetheless, if you choose unmoderated testing, make sure you know the
criteria for picking the best usability tool. As the Nielsen Norman Group
advises, you’ll want something that offers same-day results, audiovisual
recording, and offers a broad demographic for recruiting testers.
36
Tree Testing
Tree testing allows you to test the information architecture by stripping out
the visual elements. With a tree test, you examine only the labelling and
hierarchy of your content. Martin Rosenmejer of Webcredible names it as one
of the most important steps early in the web design process. And we all know
the importance of information architecture — if the content isn’t structured
logically with a simple flow, it might as well not exist. That’s why an early tree
test can help identify and solve the problems before they actually become
problems.
37
In a nutshell, a tree test tasks participants with finding different information on
a clickable sitemap (or “tree”). Using a usability testing tool like Treejack, you
then record task success (clicking to the correct destination) and task directness
(certainty of users that they found what was needed). This shows you the
effectiveness and clarity of your information architecture.
”If your site content doesn’t flow with a nice logical structure, it might as well not exist.”
TWEET THIS
38
If tree testing seems like something that could benefit your project, Jeff Sauro,
Founding Principal of MeasuringU, goes into details about how to properly
run them. He explains that tree testing is used primarily for two reasons:
2. Validating a change — Did a recent update correctly fix the problem, or are
further revisions necessary?
Because tree testing examines the success rate of a specific task, more
participants will give you more accurate results. Check the chart below to find
the smallest margin of error within your means (we recommend aiming for 5%
error or better).
source: MeasuringU
If you’re concerned with navigational problems, see our section on card sorting
in the next chapter and compare which, if not both, would benefit you more.
One distinct benefit of tree testing is that you can also test hundreds of items (if
your site is even larger, just prioritize the most used navigation items).
39
Usability Benchmark Testing
Usability benchmark testing is the only test covered in this e-book that
measures the overall usability of a product. As its name suggests, a usability
benchmark test is a high-level report of the overall ease and success rate of
completing tasks. You can check out this benchmark report from UserFocus
and follow the discussion below.
In an essay on his website, bestselling author and speaker Scott Berkun points
out that, while other usability tests focus on specific aspects, the usability
benchmark test measures only user performance without regard to the why. In
fact, participants should even be discouraged from explaining their thoughts,
as explanations would impact the time and focus spent interacting with the
product.
40
Because benchmarks require more time and effort, Berkun outlines the optimal
times in which to run the test:
• The product is stable — To get the most out of the benchmark, make sure
the product is stable, i.e., all the errors you already know about are fixed
and it’s running at peak efficiency.
• After a major release or update — At this time, a benchmark can test how
effective the changes were, or if unforeseen problems arose in the process.
When conducting this type of test, there are a few factors to consider. Nadyne
Richmond, Researcher at VM Press, gives 5 tips for planning out your
benchmark test:
1. Select the most important tasks to the product overall — While it’s
tempting to select tasks related to the newest or experimental features, this
is not the correct test for that. A benchmark measures usability as a whole,
41
not in a specific area.
2. Use standard metrics — The most reliable data comes from success rates,
time-to-completion, error rates, and satisfaction rating.
The important thing to remember about usability benchmark tests is that they
are different than other usability tests. Think of them as a dashboard of your
product’s usability. If you’re looking to workshop a specific feature or area, you
should look elsewhere.
Hallway usability tests are the bare minimum for worthwhile usability testing,
so if you’re on a tight budget or don’t want to invest a lot in usability, this one is
for you. Joel Spolsky, CEO of Stack Exchange, describes it like this:
42
“A hallway usability test is where you grab the next person that passes by
in the hallway and force them to try to use the code you just wrote. If you
do this to five people, you will learn 95% of what there is to learn about
usability problems in your code.”
Of course you don’t need to literally grab people from the hallway, but the idea
is that any small number of random users (from within your target audience)
will give you a sufficient amount of data for your usability goals.
”Hallway testing is the bare minimum for usability testing. Grab 5 coworkers and get to
TWEET THIS
work.”
source: MeasuringU
The test itself doesn’t have to be that complex. Corinna Baldauf, Web
Developer and UX Blogger, elaborated on Spolsky’s theories. She suggests
setting up a station with your product in a public venue — she used an
office break room, while others suggest Starbucks. When someone comes
by, ask them to test the system, perhaps even adding some incentive (don’t
underestimate the power of chocolate). Give them instructions, then step
43
back and watch. Don’t forget to take notes, particularly on what is not going as
expected.
If you do this with five people, that should give you data that’s accurate enough.
Why five? Jakob Nielson, co-founder of the Nielson Norman Group, created a
formula for the amount of usability problems found in a usability test:
N (1-(1- L ) n )
You can see clearly that five people gives you all the data you need, while
anything more seems superfluous.
Hallway usability testing is one of the most popular forms due to its simplicity,
low cost, low resources, and high output. If you’re interested in conducting
your own hallway usability test, the USAJOBS Team gives these tips:
44
• Choose the right time and place — choose a location with a lot of foot
traffic, at a time when you’re not inconveniencing people.
• Come prepared — make sure you outline your plan ahead of time, and set
up 30 minutes before you’d like to start.
• Good greeters — use greeters who are outgoing and charismatic, and who
can identify your target audience.
• Look for ways to improve — learn from your experience and keep an eye
out for ways to improve your testing process.
”When observing your user test, make sure you also write down what’s not going as
TWEET THIS
expected.”
At UXPin, we’re big fans of hallway testing. When we were finishing up our
integration with Photoshop and Sketch, our product team was visiting our
California office so hallway testing happened every day. A developer or designer
would set up his computer and ask us to import a static design file and turn it into a
fully layered prototype. The product manager would then take notes and revise the
weekly sprint based on the insights.
45
Takeaway
After reading this chapter, you are now more aware of the main scripted tests:
tree testing, usability benchmark testing, and hallway usability testing. You
know that tree testing focuses specifically on navigation, usability benchmark
testing determines a product’s overall usability, and hallway usability testing is
great for a simple and low-cost usability test. You also learned the difference
between moderated and unmoderated tests, and why unmoderated tests may
be more appropriate, except when you have incomplete or otherwise confusing
setbacks to your product.
In the next chapter, we’ll talk about decontextualized tests, or tests that don’t
directly use the product.
46
CHAPTER FIVE
Decontextualized Tests
& Heuristic Reviews
Delving deeper into your product without it immediately present
47
S ometimes the best way to test a product doesn’t involve the product at all.
Decontextualized tests, or tests that don’t involve the product, are generally
geared to testing users’ attitudes on the product, or in generating ideas. But just
because they may be more conceptual doesn’t mean they’re any less valuable as
a source of data.
In this piece, we’ll focus on card sorting and interviews as two popular and
cost-effective decontextualized testing methods. On a related noted, we’ll also
discuss heuristic reviews. We’ve included it in this discussion because while
someone is interacting with the product, it’s not the end-user — instead, a
group of experts reviews the features based on established best practices.
”Sometimes the best way to test a product doesn’t involve the product at all.”
TWEET THIS
Card Sorting
The idea is so simple yet so meaningful. You write the different elements of your
product on note cards or Post-It notes, then have your participants organize
them in a way that makes the most sense to them. If you’d like to go paperless,
you can also choose usability testing tool like OptimalSort for quick analysis of
common groupings. Regardless of analog or digital, the result gives you a solid
understanding on your product’s information architecture (IA), a big term than
means simply how you organize the elements of your product.
48
source: Smashing Magazine
There’s more to card sorting than it seems. Donna Spencer, card sorting expert
and Founder of Maadmob, believes that while card sorting might not provide
a final structure, it does provide a rare glimpse into users’ mental models. For
such a simple activity, there’s plenty of variation and controls that will affect the
kind, and validity, of data you receive. For starters, there are two different styles
of how you can conduct it:
• Open Sorting — Users are provided only with the elements, and are left
to group them however they see fit. Once grouped, users are asked to give
names to the groups themselves. This is recommended for generating new
ideas since you can work backwards from users’ natural thought processes.
49
• Closed Sorting — As with open sorting, users are given the elements;
however, they are asked to categorize them into predefined groups. This
is recommended if you’re working within the restrictions of pre-existing
categories, for example, when updating an already developed website
structure.
The above image is an example of a closed card sort. In this case, you can see the
four categories in blue and the cards below. Users are then asked to place the
cards under whatever category seems best to them. If this were an open card
sort, you’d simply remove the blue categories and ask users to create their own.
Aside from open and closed, other variations include groups or individuals, and
remote or on-location. Groups allow users to work collaboratively, for better
or worse, and can help you learn about multiple users at once; however, group
dynamics might affect your results. Remote location testing — for example,
using an online software tool — allows you to test more users in a faster time,
yet you’re unable to directly observe their decision-making processes. On-
location gives you a fuller understanding of how your users came to their
decisions, but requires more planning and scheduling.
50
II. CARD SORTING GUIDELINES
While every card sort is different depending on the cards, William Hudson, UX
Strategist and Consultant, suggests some general benchmarks for card sorting.
Specifically, he lists the approximate times it will take people to sort a given
number of elements:
Using this time structure, you can plan out in advance how long the tests will
take to administer, once the cards are written or the software established. From
our experience, these guidelines can be quite generous — one of our closed
card sorts involved 47 cards and four categories, but only required an average of
three minutes to complete.
When naming the cards, simpler is better. Avoid big words (many syllables)
and technical jargon. While this is good advice in general for the language
usage of a product, it’s essential for card sorting since overly complex labeling
will disrupt the natural thought processes. Pierre Croft, IA and UX expert
for Decibel Digital believes that card sorting can help deflect the bad ideas of
HIPPOS (highest paid people in the room) who might not know how to build a
good website. Card sorting is cheap, useful, and quick, so we’ve included a few
pointers:
• Don’t mix parent and child categories — In other words, use categories
from the same level, or else you will confuse your participants.
• Provide some blank cards and pens — While this is standard procedure
for open card sorting, it’s also quite useful for closed card sorting. After
the formal testing is done, you can provide a couple blank cards for
51
participants to write down additional categories. While the information
might be “off-the-record,” it could bring to light some useful insights.
• Don’t intervene — After giving the instructions, try your best to sit back
and observe the participants (unless they ask for help). Intervention will
obscure the data.
• Set time limits — This makes scheduling easier in general, and gives the
participants an idea of how much time to spend on their tasks.
If your website has hundreds or even thousands of pages, you can choose only
first and second-level pages to keep things manageable. For example, “Contact
Us,” “Terms of Agreement,” and other utility pages can be omitted since they
can be found on almost all websites out there (so you wouldn’t really be testing
anything unique to your site).
”For card sorting, simpler is better. Avoid unnecessarily complex words and jargon.”
TWEET THIS
User Interviews
If you want to know what users think, sometimes all you have to do is ask.
Interviews directly connect you with your target audience and give you a high
degree of control over what data you collect; however, your research is mostly
qualitative and limited by the participant’s self-awareness and articulation.
52
source: User Interview Techniques
The nuances of interviews lie in what to say and how you say it. Kate Lawrence,
UX Researcher at EBSCO Publishing, offers some great insights into these
areas. When asking questions, it’s best to center around the participant’s
perspective of the environment in which your product will exist. Here are a few
great interview questions that apply to any product:
• How would you describe your ideal experience with our product? — A
little on the nose, but the answers will tell you exactly what your users like.
While you may not want to follow their responses word-for-word, try to
notice any commonalities with the answers from other interviews.
With the right questions and the right atmosphere,you can mine a lot of usable
data from interviewees. But you also need to know how to behave in a way that
won’t bias the results while putting interviewees at ease. Michael Margolis,
UX Research Partner for Google Ventures, gives 16 practical tips for running
a usability interview. For example, make sure you also write down interviewee
body language and always ask follow up questions.
When it comes to usability interviews, the same people skills you would use at
a party still apply, just with laser-focused purpose. With the right mood and the
right questions, the interview will be productive and maybe even fun.
”Everything the participant says should be fascinating, because even if it might seem
TWEET THIS
boring, it’s still valuable research.”
Heuristics Evaluations
54
Once you have a working prototype, a heuristic evaluation (or usability review)
can be a low-cost method of checking against usability best practices. Heuristic
evaluations are also helpful for competitive benchmarking since you can
compare competitors against the same criteria (as you’ll see below).
Heuristic reviews can even be carried out by people who aren’t UX experts,
as long as you’ve reviewed and walked through the usability scenarios. While
they’re cheap and usually only require a day or two, they don’t actually tell
you the usability of a system (since you’re not testing with real users) and may
suffer from inconsistency and subjectivity (since they’re carried out by different
people). That being said, they are a still a great reality check since you’ll be able
to catch glaring UX violations.
”Heuristic reviews don’t reveal if the product is actually usable - only that it should be
TWEET THIS
usable.”
55
While heuristics evaluations can be conducted by anyone, you could also
hire a team of heuristics experts to evaluate your product thoroughly and
professionally. As Foolproof Labs suggests, make sure you follow a thorough
process of completing a heuristic evaluation:
1. Plan the evaluation — Establish your usability goals so that you can
communicate them to the evaluators. If you want to know specifically
about the dialogue windows on your website, don’t be afraid to mention
that.
3. Brief the evaluators — If you choose not to go with experts, make sure you
brief your evaluators on Nielsen’s ten heuristic checkpoints so that they
know what they’re looking for. If you’re reviewing a website, you can start
with a more concrete 45-point website usability checklist.
5. Analyze the results — Unless you’re going with a professional firm, you
may need to compile and analyze your own responses. Remember that a
high score doesn’t mean your product is actually usable, only that it should
be usable.
56
To give you a better idea of how this works in real life, we’ll explain a few
examples. Oracle uses a streamlined 10-point list of heuristics gauging
everything from application complexity to frequency and helpfulness of
error messages. Usability issues are categorized as either “low,” “medium,”
or “high” severity with supporting notes. The team then isolates the top ten
most important issues for immediate fixing. If you’re curious about what a full
heuristic report may look like, check out this full heuristic evaluation of Apple
iTunes.
Takeaway
In this chapter, you learned about user tests that examine your product without
actually using it. Decontextualized tests tend to focus more on abstract and
conceptual areas, so if those are what you’re looking for, one of these tests may
be what you’re looking for.
In the next chapter, we’ll learn about a more direct testing method: testing the
product as the user would use it naturally.
57
CHAPTER SIX
58
T ests in which people use the product naturally (without a script) are the
closest you will get to seeing how your product might perform “in the
wild.” Natural and near-natural tests minimize the amount of interference from
the observer, who is more interested in what the user does of their own will.
These tests are great for broad data, especially ethnographic, but sacrifice con-
trol in exchange for greater data validity.
source: UserTesting
Because the goal is to minimize interference from the study, natural tests are
usually conducted remotely and without a moderator. The most common
natural tests — A/B testing, first click testing, field/diary studies, and eye-
tracking — are intended to understand user behavior and attitudes as close as
possible to reality.
A/B Testing
In an A/B test, different groups of participants are presented with two choices
or variations of an element. It is generally a scientific test, where only one
variable differs, while the rest are controlled. Mostly conducted with websites to
59
test if a certain layout, placement, or messaging will result in better conversions,
A/B testing is considered a natural test because users are not notified nor
provided a set of tasks.
• Call to actions — Read here about how Friendbuy more than doubled their
response rate to their CTAs using A/B tests.
• Headlines — In this A/B test, it was discovered that a single line of text for
headlines increased signups by 38% compared to longer headlines.
• Forms — A unique style of form field input, the “Mad Libs” style, has been
proven to increase conversions by 25-40%.
60
• Pricing and promotional offers — Another A/B case study shows
explicitly stating “it’s free” increased sign-up conversions by 28%.
There are also other usability testing tools like Optimizely (great for everything)
and Unbounce (more landing page focused) that make it extremely easy to
get started with A/B testing. These usability tools handle the distribution and
collection of data for you, so all you have to do is wait for the results. If you’re
interested in a comprehensive list of website elements to test, you can also
check out this detailed explanation of 71 things to A/B test.
source: WhichTestWon
61
Regardless of what you choose to test, make sure you follow these five
guidelines:
2. Test with enough people for statistical significance — As shown with this
sample size calculator, you should test each variation with enough people
for a 95% significance rate.
3. Test new users — Regular users will be confused if they see a new
variation, especially if you ultimately choose not to use it. Plus, there’s the
mere-exposure effect, in which people prefer what they’re used to.
5. Tailor your test length to statistical significance — Cancelling the test too
early will reduce accuracy. Decide your statistical significance, then you can
use this test duration calculator to get a rough timeline. Many paid online
usability tools (especially Optimizely) also have a feature for calculating
optimum time based on the goals.
To see some of these best practices put to use, check out this site containing
hundreds of free A/B test case studies. Hubspot also provides a highly visual
and easily digestible 27-page guide to A/B testing.
62
First Click Testing
In the late 2000s, Dr. Bob Bailey, UX Designer and Researcher, conducted
a series of studies around what he called the “first click” concept. The results
of the studies were surprising, and very advantageous to anyone looking to
improve their website. As it turns out, for any given task, a user’s success rate is
87% as long as their first click is correct. If their first click was not correct, the
chances for success fell to below 50%.
This type of usability testing is even more relevant if your site gets a large
volume of search traffic — because your homepage probably won’t be the first
page users find, first click testing should ideally be done across your entire site.
We would consider this a “near-natural” test because users are still assigned
tasks (instead of just using the site for whatever purpose they please), but these
tests are usually unmoderated and ran remotely in the comfort of the user’s
home.
The test itself is simple, and can be conducted with online testing tools like
Chalkmark by Optimal Workshop. The software presents the user with a
screenshot and a task, and then records their first click. For example, as we
discuss in User Testing & Design, we asked users to find a local mechanic on
Yelp and found that 24% of them first clicked on the Search bar (suggesting that
the existing information architecture may not be clear enough).
63
”When it comes to the web, first impressions are oftentimes final impressions.”
TWEET THIS
1. Write clear task scenarios — Just like you would for a scripted usability
test, make sure the participant is thinking about how to solve a problem
instead of just where to click. Detail isn’ required, but clarity is.
2. Define the best paths to success — Start from the homepage and plot all
possible paths that will correctly accomplish each task.
3. Time each task — A 90% first click rate on the correct label might
deceptively indicate that your navigation is effective, unless you timed the
test and saw it took an average of three minutes to make that first click.
4. Measure user confidence — After each task, you can ask participants to
rate on a scale of 1 to 7 regarding their confidence of task completion. Any
3s and 4s will indicate navigation problems.
When running a first click test, it also helps to ask some open-ended questions
afterward about what users liked and didn’t like about the site. We did this for
our Yelp redesign exercise and it gave us great insights, such as learning that
30% of users felt the navigation was confusing with comments like, “it’s a bit
cluttered...a lot of it quite useful, but can feel overwhelming the first time.”
For more information on how a first click test might help, the customer
experience newsletter Neo Insight wrote about the three biggest usability
problems that first click testing can help solve.
64
Field and Diary Studies
It doesn’t get more “natural” than field and diary studies. Both are designed to
observe a user as they behave naturally, without the interference of a testing
process. The beauty of these tests is that the user never leaves their natural
environment and are encouraged to act normally. The difference between the
two is that field studies involve an observer going on location, and diary studies
involve the participant recording their own thoughts, feelings, and opinions.
I. FIELD STUDY
A field study provides data you can’t find anywhere else by letting you observe
users in their own environment. Jared M. Spool, Founder of User Interface
Engineering, believes that while standard usability tests can lead to valuable
insights, the most powerful tool in the toolbox is the field study.
65
Field studies provide three main benefits:
2. Context — Users aren’t always aware of how external factors, like timing
for example, affect their decisions. Field studies mark the times and
environments of the user, and their impact can be seen during the analysis
of the data, even if the user themselves doesn’t know.
The biggest downside is primarily the cost of organization and time required
(they can last anywhere from a few weeks to several months). Workers have to
leave the office for large periods of time, plus scheduling these studies can be
troublesome.
However, if you still think field studies could help with your usability goals, take
a look at this helpful list of tips, and you can also follow this process for field
research that helped companies like Apple, Yahoo, DirecTV, and others.
A diary study captures the expectations, mindsets, moods, and social contexts
that affect the user experience. A diary study might reveal that a bad mood
or criticism read on the web impacted the user’s assessment of the product
experience, independent of the product itself.
Let’s say that you’re asked to improve a web application that helps product
managers track progress. You could provide tape recorders and/or journals to
five product managers and ask them to document anything odd or frustrating
they experienced when using the application. After a few weeks, you would
analyze the data and make specific recommendations.
While these may make the diary study seem like the perfect usability test, like
all others, it too has drawbacks:
Ruth Stalker Firth, HCI Researcher and Lecturer, believes that diary studies
are best used as a means of cultural probing and go beyond the “find out what’s
wrong” mentality that can be prevalent in usability testing. To help counter the
downsides, you can follow a few best practices:
3. Keep size in mind — The diary (whatever form) can be as small or large
as needed. On paper, space for forty entries can be overwhelming, while
ten might be more encouraging. That’s also why digital methods might be
better since users can use as much space as they want.
”Diary studies are a means of cultural probing that go beyond the ‚find out what’s wrong’
TWEET THIS
mentality.”
While diary and field studies let you see the context for how and why products
are used in everyday life, an eye tracking test goes into almost microscopic
detail. An eye tracking test is just as it sounds, tracking a user’s eye movement,
and more to the point where specifically they are looking.
69
Already, eye tracking tests have given us some general rules that apply across all
products, not just yours. Ritika Puri, co-founder of StoryHackers, writes in a
post for Crazy Egg about the five most important lessons eye tracking has taught
us so far:
1. Users are predictable — As we can see by the eye tracking patterns above,
people’s sight follows similar trends, allowing us to plan our visual layouts
for the masses. In Web UI Best Practices, we explain how to lay out a site in
accordance to the popular F pattern and Z patterns.
3. Users are drawn to visuals — Visuals like thumbnails or vibrant colors will
attract a user’s attention more than plain text, so use this accordingly.
If you’re interested in using eye tracking to help your website, it’s a lot more
achievable than it might seem. This instructive guide will explain how you can
make eye tracking work for you.
70
Beta Testing (User Acceptance Testing)
Your product is in the later stages of development, and you’re ready for some
feedback (and bug reports) before the grand public launch. Now’s the time for
beta testing, which is a type of user acceptance testing (UAT). The beta test is
when you allow access to your product to a few beta testers and collect their
feedback so that you can smooth out all remaining wrinkles before launch.
source: DevelopMentor
1. Filter your testers — Select your own beta testers. In open beta tests, too
many testers will flood you with unnecessary data or not enough data.
Take the time to select your own beta testers, and Udemy outlines the best
procedures for doing so.
2. Recruit five times as many people as you need feedback — Even if you
follow the “commitment” advice above, your numbers will still be low. Plan
accordingly.
71
3. The ratio for committed beta testers to beta reviewers should be 100:1
— One beta manager can find conclusive data in 100 beta testers, but those
with more resources or teams of beta managers can handle more. That
means you should recruit 500 people to get 100 qualified testers for each
beta manager.
4. Set apart 8-10 weeks — Don’t try to rush through the beta cycle, keep it
thorough if you want the best results.
5. Release new builds to testers around every two weeks — Any sooner
would be too much strain on your end, but you still want them to continue
reviewing the most updated versions of the product.
6. Adding a new features resets the beta cycle — It may seem harmless to
add some new tricks during the end of the beta cycle, but these often have
unforeseen consequences. If a new feature is necessary, accept that you’ll
need eight more weeks to fully test it.
Keep in mind that beta testing should be the last usability test conducted
in the development process. Make sure you’re at the right stage before you
proceed; otherwise, there will be a lot of wasted effort. To learn more about
beta testing, you can check out Chapter 7 in The Guide to UX Design Process &
Documentation and the many free e-books in Centercode’s library.
72
Takeaway
Tests that observe the users in their natural (or near-natural) environments
provide a specific type of data that other, more controlled tests can’t access.
An A/B test lets you make decisions that are informed by more thorough and
statistically significant results (since you have a huge sample size). A hallway
usability test, meanwhile, is just a quick and dirty method but not very
„scientific.”
Similarly, field and diary studies can provide you with unique information about
your target users — namely external factors such as timing, environment, mood,
etc. — that more direct card sorting or tree testing cannot. As for first click and
eye tracking tests, they literally let you see your website as your users do, but
make sure you run other types of tests for the right context. While each of the
different test types has its own advantages and disadvantages, sometimes its
best to mix-and-match them to achieve results more specific to you.
73
CHAPTER SEVEN
Hybrid Tests
If other tests don’t meet your needs, try combining them
74
T ests that incorporate elements from one or more of the previous catego-
ries (scripted, decontextualized, natural tests) fall under the label of hy-
brid tests. These tests tend to lean towards capturing attitudinal and conceptual
feedback, but nonetheless reveal insights that have very specific impact on the
usability of the final design.
Hybrid tests present the user with creative methods for discerning what kind
of experience or features they would want in a product, sometimes even
allowing users to provide direct input for the design. While they may not be
very practical for some of the later stages of product development, the testing
we’ll discuss here can make a big difference in the earlier phases by helping you
understand the minds of your target users. Specifically, we’ll cover desirability
testing, concept testing, and participatory design.
75
Desirability Testing
The idea is that asking participants directly what they want can bring misleading
results. The approaches in desirability testing seek to circumvent factors
like poor articulation, lack of self-awareness, or the apathy that comes from
answering similar questions one after another.
”What users say they want can be completely different from what will actually help
TWEET THIS
them.”
I. TRIADING
These are broad, experience-based surveys that, like other desirability tests,
focus more on the emotions of the participant rather than the statistics.
Participants are presented with statements, and then answer based on the
degree to which they agree or disagree. This format also delves deeper than
simply, “do you like our product or not.”
source: ChaiOne
77
As discussed in the Guide to UX Design Process & Documentation, using
qualitative questionnaires during your design process and feedback surveys
for iteration will also help create a customer feedback loop. While qualitative
questionnaires are fairly free-form, they can be found in three standardized
forms:
The questionnaires can be treated as starting points for your own questionnaire,
so feel free to adapt as needed.
78
III. QUICK EXPOSURE MEMORY TEST
Unusual but effective, the quick exposure memory test shows the participant a
user interface for only a moment before it is removed. The user is then asked to
recall what stood out the most in that brief amount of time, and why.
source: FiveSecondTest
Similar to first click testing, this test works well for pinpointing initial
impressions on layout design, information architecture, and content. But
because this test focuses on the user’s memory of particular elements instead of
emotional impact, it’s best used as a supplementary method. You could run this
test cheaply and manually by showing screenshots and then asking questions, or
use a scaleable online service like FiveSecondTest.
Not all desirability tests require deep and probing methods of getting into the
user’s psyche. Popularized by Microsoft, adjective cards (also known as product
79
reaction cards) are an extremely simple way to capture emotional responses to
designs and products. Simply show the design or have the user interact with
the product, then ask them to pick 3-5 cards that best capture their feelings and
explain their reasoning.
source: UX Matters
Michael Hawley, Chief Design Officer at Mad*Pow, writes about his success
with the adjective card. In his test, he gave participants a card with 118 carefully
selected adjectives, both positive and negative. He would then show the
participant a user interface and ask them to describe it with 3-5 words on the
card. This format allowed the test-taker to better articulate their emotions, and
also allowed the opportunity for the tester to follow up on why they felt as they
did.
80
Dr. David Travis, Managing Director of UserFocus, has also experienced
success with adjective cards. For him, this method stood out by giving
participants permission to criticize the system. In fact, not only did users select
negative and positive adjectives, they could also reveal negative connotations of
otherwise positive adjectives. For example, a user might select “sophisticated,”
but then explain that the interface was “too sophisticated for my tastes.”
You can run this test manually by printing out and cutting out the full list of 118
cards, or use an online service like MojoLeaf to administer the test remotely to
many participants at once.
Concept Testing
In the spirit of looking before you leap, concept testing allows you to discover
your users’ reactions to your concepts before you spend the time and resources
on development. These tests follow the same formats as the other usability tests
except they substitute concepts in place of the real product.
I. OVERVIEW
”Concept tests provide the necessary feedback to turn sketchy ideas into desirable pro-
TWEET THIS
ducts.”
82
Because testing an idea with an actual product can be tricky, concept testing
methods gravitate towards surveys, interviews, and landing pages. However, it
is the focus of these methods that set them apart from more traditional usability
tests. There are three main types of concept tests depending on the maturity of
the product:
• New Product Concept Tests — These identify the benefits that resonate
most with customers, and the features to create these benefits. Successful
tests let you to prioritize your design elements and better schedule the
development process, plus allow you to plan ahead for after the release.
• Interest — How likely they are to buy the product (or use it, if it’s free).
• Value — How they perceive the product’s benefits (compared to its price).
• Brand Fit — How closely the product fits in their existing idea of the brand.
If you’re looking for a more concrete way to test a product, the designers at
ZURB created a concept-testing app called Verify. Similar to prototyping,
Verify combines concept testing and the quick exposure memory test we
discussed above in the desirability tests section.
The app allows you to create sample screen presentations to test on your
prospective target audience through quick exposure. As the participant marks
what stood out for them, you can get an idea of what to keep or fix — all before
designing the actual product.
If you’re looking for a cheaper method, you could do a “hallway concept test” in
which you draw a few sketches, grab a colleague not associated with the project,
85
show the sketch for five seconds and then ask for what stood out. You could just
as easily replicate this process with five users or customers for quick feedback
on your concept.
Participatory Design
Sometimes if you want to design a product your users will like, it’s best to
involve them directly in the design process. Participatory design is a usability
testing method that falls right within the discipline of user centered design and
can be a great complement to the collaborative design methods we discussed
in Web UI Best Practices. It’s become quite a popular methodology with
companies like Pinterest, who incorporate it into their design process.
86
Erin Muntzert and David Sherwin, UX Consultants for Frog Design, point
out how to get the most use out of participatory design. In terms of general
guidelines, it helps to treat the session as a conversation (instead of a classroom
exercise), be crystal clear about the problem space and scenario, and record the
session (or take detailed notes). We’ll explain below how to prepare, narrate,
and conduct participatory design sessions.
”For the best results, treat participatory design sessions as conversations - not classroom
TWEET THIS
exercises.”
The first phase — framing — is kind of like the pre-planning phase, where you
figure out what you want to get out of the test. This phase handles your abstract
usability goals, and helps you narrow down the specifics of what will help.
This kind of pre-planning is what we outlined in the first two chapters, but
we’ll review its application to participatory design. There are four steps to the
framing phase:
2. Create your goals — Ask your team questions (follow the 5W & 1H
guideline) and prioritize them based on which ones you want answered
most. Your usability goals will be to answer the questions that are top
priority.
3. Define what you think you know — Create hypotheses to answer your
goal questions and jumpstart your research — but don’t get too attached,
because they might be proven wrong. If your goal is to understand what
87
young, tech-centric out-of-towners use to find the “best kept secrets” in
Los Angeles, one hypothesis could be that “users will first reach out in
social networks to find what to do.”.
In thirty minutes, you can complete the first three steps above, generating
around 10-15 hypotheses. Then as we discussed in step four, spend five minutes
categorizing hypotheses. Once you’ve finished that, here’s three areas to
consider to run a successful session:
1. Group Size — Large groups, small groups, and individuals all have their
advantages and disadvantages. Involving more people at once allows for
faster data collection, but less people may lead to more detailed results.
Because this is a qualitative method, make sure you test at least five users.
2. Location — Where will you conduct your test? Typically group size and
ease of access will determine whether you hold your test in a professional
facility, in the test-taker’s own environment, or out in the world (on the
street, coffee shop, etc.).
There are several common ways to do this, each with its own specialty:
89
• Love or Breakup Letter — Great for groups as an icebreaker, this exercise
lets users write a personal letter to a product or brand about what they
love, want, hate, or expect. This helps you see both ends of the emotional
spectrum in plain language.
Once you have the emotional context, it’s time to let users create elements for
the product in question. Seeing the kinds of things your participants come up
with of their own free can validate or disprove your hypothesis, not to mention
inspire new theories. When planning creation activities, the key component to
keep in mind is the balance between structure and interpretation.
source: UXPin
• Interface Toolkit — Using a tool like UXPin, give participants various pre-
made elements and ask them to “build” their perfect interface. Not only is
this fun, but it’s also ideal for seeing how your users prioritize features.
91
• Ideal Workflow — Participants are presented with different circumstances
and then explain their ideal workflow for each. If you’re designing a
complex system, such as e-trading platforms, this will reveal where and
what your users prioritize.
Using mainly text, images, and iconography, prioritization activities will help
you understand the ways in which your users value the product’s individual
features. These methods deal with trade-offs, connections, and hierarchy to
determine not necessarily what the user wants, but what the user wants most.
”Prioritization activities help you determine not what the user wants, but what the user
TWEET THIS
wants most.”
92
• Concept Ranking — Participants are given several options and asked to
rank them. For example, if you’re making tablet devices, you can provide
paper mock-ups of different dimensions and ask users to rank their size
preferences. This works best for culling down multiple concepts when
users can tweak existing prototypes.
V. CONTEXTUAL ACTIVITIES
By simulating the experience of using the product, users will be better able to
describe their opinions about it. Contextual activities try as best they can to
immerse the participant into what the concept or product might be.
93
• Simulating Experience — The next best thing to an actual product test, a
simulating experience creates a virtual experience of what it would be like
to use the product. For example, you can simulate a new in-car feature by
presenting the interface on an iPad and add a mock steering wheel.
Takeaway
Hybrid tests are a great way to think outside the box and collect insight that
more traditional tests can’t reach. Desirability tests go above and beyond in
understanding the target user’s psyche. Conceptual tests can save you a lot of
time by solidifying your plan before you begin development. More than any
other test, participatory design gives the target user a hands-on approach in
designing towards their needs.
We’ve just examined the most common and most useful usability tests available
today.In the next chapter, we’ll close by discussing the differences between web
and mobile usability testing.
94
CHAPTER EIGHT
95
W e’ve spent the bulk of this e-book outlining the different types of usabil-
ity tests and the strategies to use them most effectively. However, the
scope of these tests is vast and can be used on any product from cloud payment
systems to next-generation gaming consoles. In this chapter we want to narrow
our focus a little so you can best understand how usability evaluation works in-
dividually for websites and mobile devices.
User tests are mandatory for website success since Murphy’s Laws of
Technology always seem to strike at the worst times. While many of the
usability testing methods we discussed can adapt to web usability, we thought
it best to showcase a few pointers specific to website protocols and testing
criteria.
”The web is more than just your website. Test your competitors.”
TWEET THIS
96
The principles for web usability are the same as with other products, except
they are even more important considering that there are over a billion websites
as of September 2014. The bottom line is that there are so many similar websites
that visitors will simply move onto the next site if the first one they visit isn’t
usable.
”While your website might be your baby, visitors will just move to the next one if it’s ugly
TWEET THIS
and unusable.”
2. Let users complete the task how they want. If you feel your user has
misunderstood the task or is going off track, just wait. The goal is to learn
how a user interacts with your website, period. In the real world, you won’t
be there to reign them back in, so observe why they got sidetracked —
those may be your best insights.
97
3. Test competitors or peer websites. Only testing your own site robs you
of context. Including other websites will help you see “the forests and the
trees”. Try asking the participant to show you a site they use on their own,
and have them show you how they use it. It’s not just about how users
interact with your website — it’s about tailoring your website based on how
they use the web.
4. Hide which site you’re testing. Users tend to be less honest when they
know they’re talking to an employee of the company under scrutiny. Do
your best to not reveal you’re testing your site. The user may figure it out by
the end of the session, but the longer you delay it, the more accurate your
first impressions. Try asking them to assess competitor or peer websites
first — this puts them in the right critical mindset.
As a guiding principle, try not being too rigid. Keeping an open mind and a
loose attitude will put your test-taker at ease and yield better, more natural
results.
”It’s not just about how users interact with your website. It’s about tailoring your website
TWEET THIS
based on how they use the web.”
98
source: Don’t Guess, Test
When conducting a usability test for a website, there are specific criteria you
should check for that might not be relevant to other products. Jacob Gube,
founder of Six Revisions, believes that qualitative feedback alone is not enough
for websites — especially considering how simple technical tweaks to things like
site speed can drastically affect the experience. There are six criteria that must
be tested for all websites, whether it’s a personal blog or a corporate site:
99
source: User-Centered Design
• UX Design — User satisfaction can get lost in the mix when focusing
on more quantitative factors, but it’s just as important (if not more).
Interviews, field studies, diary studies, and the tests listed in the previous
hybrid chapter all get feedback on the user’s emotional responses.
Remember: being usable isn’t enough, aim to be delightful.
100
source: Organizing Content for Readability
• Speed — No one likes to wait. A website’s speed will impact the UX,
functionality, and even SEO performance. Check your file sizes and code
quality to reduce unnecessary lag. Follow these best practices, then test
your site speed with a tool like Pingdom or Google PageSpeed
With a few simple tweaks, you can adapt any of the previous usability tests to
better analyze the usability of a website. Find out where your site’s lacking,
then view the tests through the sharp lens of web usability. To see a live example
of different ways of evaluating some of these criteria, check out the e-book User
Testing & Design.
101
”For websites, usability is just the bare minimum. Delight is the new standard.”
TWEET THIS
source: Blink UX
Designing for mobile is different than designing for the web, and those same
subtleties apply to usability testing. Dr. David Travis, Founder of UserFocus,
102
believes the fundamentals of usability testing still apply — provided you make a
couple important tweaks. There are a few important changes to keep in mind:
103
Jeff Sauro, Founder of MeasuringU, adds some tips of his own that are
exclusive to mobile usability testing. If you are new to mobile testing, pay
attention — Sauro is a usability veteran, and his experience will save you time
and possibly embarrassment.
1. Have chargers ready — This really isn’t an issue with desktop tests, but
nothing ends a mobile usability test faster than a dead battery.
2. Encourage users to pick up their phone — It’s true that some people
prefer using mobile devices on a stationary surface, but there’s always a
chance they’re just doing it due to the testing environment.
While mobile testing used to be only done in a lab setting, unmoderated remote
testing (which we mentioned before) has also adapted quite well to the mobile
space. While you won’t be able to record fingers and body movements, tools
like UserZoom and UserTesting make it easier to recruit users based on very
specific usage criteria and can quickly gather much larger sample sizes. If you’d
like to conduct remote mobile testing on your own, you can follow the process
used by MailChimp to test seven people in two days.
104
II. 1-HOUR MOBILE USABILITY TESTING
While these tips can help adapt any of the previously mentioned usability tests
for mobile context, Marina Lin, Mobile Interaction Designer at Cars.com,
relates her own personal participatory design test that she uses to test the Cars.
com app. While the procedure may seem familiar at this point, it’s the alteration
to suit mobile devices that you should be aware of.
105
source: Applying Participatory Design to Mobile Usability Testing
3. Create a Feature (20 minutes) — With the app experience fresh in the
test-taker’s mind, hand them drawing utensils and ask them to design a
feature they think would help. Again, it’s best to set parameters, maybe
with previously made cards with feature attributes, stencils, shape cutouts,
or another worksheet. Refer back to the timeline and ask them when their
original features would apply.
106
source: Applying Participatory Design to Mobile Usability Testing
4. Wrap-up (5-10 minutes) — Close the session with one last round of
feedback. If one of their features interested you, ask them to elaborate; or
perhaps ask their opinion on user-designed features from previous sessions.
Some feedback on the testing process for improvements on future sessions
might also be useful.
A test such as this one retains all the benefits of the participatory design tests
discussed in Chapter 6 while requiring relatively little time. By following the
advice we’ve provided and throwing some participatory design into the mix,
you might even find that mobile usability testing is just as easy as testing for the
web.
107
Takeaway
Web usability and mobile usability may be under the single umbrella of
usability, but the approaches can seem like night and day when you think
about all the subtleties. When planning your goals, keep in mind the usability
functions special to whichever one you’re designing for, its distinct functionality
criteria, and the tests best used to study it.
Now it’s time to get started. Take your time at each step of the way and don’t
proceed if you don’t understand something. To help standardize the process,
feel free to check out the free usability testing kit created by UXPin CEO
Marcin Treder. As you’re testing, remember to always focus on your goals.
Because if you don’t know why you’re testing, then the methods are irrelevant.
”If you don’t know why you’re testing, then the methods are irrelevant.”
TWEET THIS
108
Built-in usability testing so you can create tasks and see
and hear where users get confused
Generate video clips showing all clicks, screen recording,
facial expressions, and voice calls
Collaboration and feedback for any team size
109