This year marks an important anniversary for people who moderate usability tests. In a classic study carried out exactly 30 years ago, psychologists showed that people are very poor at explaining the reasons behind their choices. This is why usability tests focus on what people do, not on what people say. So why do so many usability test moderators continue to ask participants, "Why"?
— Gret Higgins, Sept 3, 2007
When running a usability test, it's very tempting to ask participants why they behaved in a particular way. We are naturally curious. "Why did you choose that option?", "Why don't you try searching?", "Why did you pick the 'About Us' link rather than the 'Contact Us' link?"
The assumption behind these kinds of questions is that people can introspect into the reasons behind their behaviour. We think that the participant knows why he chose the 'About Us" link, and all we need to do is to help the participant elicit the reason.
In fact, there is a whole raft of evidence showing that people are very poor at introspection. In a classic study carried out thirty years ago, Richard Nisbett and Timothy Wilson carried out some research outside a bargain store in Ann Arbor, Michigan. The researchers set up a table outside the store with a sign that read, "Consumer Evaluation Survey — Which is the best quality?" On the table were four pairs of ladies' stockings, labelled A, B, C and D from left to right. Most people (40%) preferred D, and fewest people (12%) preferred A.
In fact, all the pairs of stockings were identical. The reason most people preferred D was simply a position effect: the researchers knew that people show a marked preference for items on the right side of a display. But when the researchers asked people why they preferred the stockings that they chose, people identified an attribute of their preferred pair, such as its superior knit, sheerness or elasticity. The researchers even asked people if they may have been influenced by the order of the items, but with just one exception (a psychology student who had just learnt about order effects) nobody thought this had affected their choice. Instead, people confabulated: they made up plausible reasons for their choice.
In a related study published in Science two years ago, Petter Johansson and his colleagues showed a similar finding. In this study, an experimenter showed a participant two pictures of different women, and asked the participant to point to the most attractive. If you were a participant in this study, you would have seen the experimenter hand you your chosen picture, discard the other photo, and then ask you to justify your choice.
Unknown to participants, the experimenter was a part-time magician, and using a sleight of hand technique he was really showing the participant the picture of the woman rated less attractive. He now asked participants why they had chosen that picture.
Remarkably, even when the photos weren't that similar, the majority of participants (73%) didn't spot that they were now looking at the woman they thought was less attractive. Even more curiously, participants now provided "explanations" for their choice. So for example they might say, "Well, I chose this picture because I like blondes", even though the participant had really chosen a brunette (whose picture was now face down on the table). Just as with the fake consumer evaluation survey, people confabulated: they made up reasons to justify their choice.
Let's now interpret these findings in the light of a usability test. When we ask a participant, "Why did you choose that option?", the participant will introspect and provide an answer. But the participant may not have conscious access to the reason for his or her choice. And as these studies show, participants are unlikely to say, "I don't know". They will provide an explanation for their behaviour, an explanation that they may really believe to be true but which is effectively made up.
This isn't the best basis on which to make product design decision.
Research like this shows that we shouldn't be asking "Why" in usability tests. Instead, your usability test should focus on the definition of usability in an international standard:
“Extent to which a product can be used by specified users to achieve specified goals with effectiveness, efficiency and satisfaction in a specified context of use.”
For example, this might mean:
With its focus on what people do, rather than what people say, usability testing has a unique place amongst consumer research methods. When you ask participants "Why", you are diluting the power of your usability test and you could end up changing your design for all the wrong reasons.
Listen to participant comments by all means — they can make good highlights videos — but remember to support them with behavioural data.
Nisbett, R.E. and Wilson, T.D. (1977). "Telling more than we can know: Verbal reports on mental processes". Psychological Review, Vol 84 pp 231-259.
Johansson, P., Hall, L., Sikström, S. and Olsson A. (2005). "Failure to Detect Mismatches Between Intention and Outcome in a Simple Decision Task". Science, Vol. 310. no. 5745, pp. 116-119.
You can republish this article on your own web site or intranet. You don't need to ask permission so long as you include this citation at the end of the article.
Let us notify you when we publish more articles like this. Benefits of signing up.
Focus groups continually fail to tell us what customers want. The fundamental problem is that, in spite of what conventional wisdom tells us, it is not the voice of the consumer that matters. What matters is the mind of the consumer. The big mistake is in believing that what the mind thinks, the voice speaks. It is time to start embracing methods that can deliver stronger predictive value.
Discount usability techniques are a great way to eradicate usability problems. But they can never answer the question, "How usable is this system?" We blow the dust off some techniques commonly used in the early days of usability testing to see if they can provide an answer.
See all articles and resources tagged iso 9241, moderating, standards, usability testing.