|
ABSTRACT
For 11 studies, we find that the detection of usability problems as a function of number of users tested or heuristic evaluators employed is well modeled as a Poisson process. The model can be used to plan the amount of evaluation required to achieve desired levels of thoroughness or benefits. Results of early tests can provide estimates of the number of problems left to be found and the number of additional evaluations needed to find a given fraction. With quantitative evaluation costs and detection values, the model can estimate the numbers of evaluations at which optimal cost/benefit ratios are obtained and at which marginal utility vanishes. For a “medium” example, we estimate that 16 evaluations would be worth their cost, with maximum benefit/cost ratio at four.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Dalal, S.R., and Mallows, C.L. (1988). When should one stop testing software? J. American Statistical Association 83, 403 (September), 872-879.
|
| |
2
|
Dalai, S.R., and Mallows, C.L. (1990). Some graphical aids for deciding when to stop testing software. IEEE J. Selected Areas in Communication 8, 2 (February), 169- 175.
|
| |
3
|
Erhan, S. (1975). Introduction to Stochastic Processes. Prentice Hall, Englewood Cliffs, NJ. p. 87.
|
 |
4
|
Wayne D. Gray , Bonnie E. John , Michael E. Atwood, The precis of Project Ernestine or an overview of a validation of GOMS, Proceedings of the SIGCHI conference on Human factors in computing systems, p.307-312, May 03-07, 1992, Monterey, California, United States
[doi> 10.1145/142750.142821]
|
 |
5
|
Robin Jeffries , James R. Miller , Cathleen Wharton , Kathy Uyeda, User interface evaluation in the real world: a comparison of four techniques, Proceedings of the SIGCHI conference on Human factors in computing systems: Reaching through technology, p.119-124, April 27-May 02, 1991, New Orleans, Louisiana, United States
[doi> 10.1145/108844.108862]
|
| |
6
|
|
 |
7
|
Claire-Marie Karat , Robert Campbell , Tarra Fiegel, Comparison of empirical testing and walkthrough methods in user interface evaluation, Proceedings of the SIGCHI conference on Human factors in computing systems, p.397-404, May 03-07, 1992, Monterey, California, United States
[doi> 10.1145/142750.142873]
|
| |
8
|
Lewis, C. (1982). Using the 'thinking-aloud' method in cognitive interface design. Research Report RC-9265, IBM T.J. Watson Research Center, Yorktown Heights, NY.
|
| |
9
|
|
 |
10
|
|
 |
11
|
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
 |
15
|
|
| |
16
|
|
| |
17
|
Nielsen, J. (1993). Estimating the number of subjects needed for a thinking aloud test. Intl. J. Man-Machine Studies in press.
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
 |
21
|
|
| |
22
|
Virzi, R.A. (1990). Streamlining the design process: Running fewer subjects. Proceedings of the Human Factors Society 34th Annual Meeting (Orlando, FL, 8-12 October), 291-294.
|
| |
23
|
|
| |
24
|
|
CITED BY 34
|
|
|
|
|
Niels Ebbe Jacobsen , Morten Hertzum , Bonnie E. John, The evaluator effect in usability tests, CHI 98 conference summary on Human factors in computing systems, p.255-256, April 18-23, 1998, Los Angeles, California, United States
|
|
|
|
|
|
Paul Sawyer , Alicia Flanders , Dennis Wixon, Making a difference—the impact of inspections, Proceedings of the SIGCHI conference on Human factors in computing systems: common ground, p.376-382, April 13-18, 1996, Vancouver, British Columbia, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Brad Myers , Jim Hollan , Isabel Cruz , Steve Bryson , Dick Bulterman , Tiziana Catarci , Wayne Citrin , Ephraim Glinert , Jonathan Grudin , Yannis Ioannidis, Strategic directions in human-computer interaction, ACM Computing Surveys (CSUR), v.28 n.4, p.794-809, Dec. 1996
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Jennifer Mankoff , Anind K. Dey , Gary Hsieh , Julie Kientz , Scott Lederer , Morgan Ames, Heuristic evaluation of ambient displays, Proceedings of the SIGCHI conference on Human factors in computing systems, April 05-10, 2003, Ft. Lauderdale, Florida, USA
|
|
|
Nigel Bevan , Carol Barnum , Gilbert Cockton , Jakob Nielsen , Jared Spool , Dennis Wixon, The "magic number 5": is it enough for web testing?, CHI '03 extended abstracts on Human factors in computing systems, April 05-10, 2003, Ft. Lauderdale, Florida, USA
|
|
|
Jeanette Blomberg , Mark Burrell , Greg Guest, An ethnographic approach to design, The human-computer interaction handbook: fundamentals, evolving technologies and emerging applications, Lawrence Erlbaum Associates, Inc., Mahwah, NJ, 2002
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Xerxes P. Kotval , Cheryl L. Coyle , Paulo A. Santos , Heather Vaughn , Rebecca Iden, Heuristic evaluations at bell labs: analyses of evaluator overlap and group session, CHI '07 extended abstracts on Human factors in computing systems, April 28-May 03, 2007, San Jose, CA, USA
|
|
|
|
|
|
|
|
|
Saija Lemmelä , Akos Vetek , Kaj Mäkelä , Dari Trendafilov, Designing and evaluating multimodal interaction for mobile contexts, Proceedings of the 10th international conference on Multimodal interfaces, October 20-22, 2008, Chania, Crete, Greece
|
|
|
|
|
|
Rakesh Bobba , Joe Muggli , Meenal Pant , Jim Basney , Himanshu Khurana, Usable secure mailing lists with untrusted servers, Proceedings of the 8th Symposium on Identity and Trust on the Internet, April 14-16, 2009, Gaithersburg, Maryland
|
|
|
David Akers , Matthew Simpson , Robin Jeffries , Terry Winograd, Undo and erase events as indicators of usability problems, Proceedings of the 27th international conference on Human factors in computing systems, April 04-09, 2009, Boston, MA, USA
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.1
MODELS AND PRINCIPLES
Additional Classification:
H.
Information Systems
H.5
INFORMATION INTERFACES AND PRESENTATION (I.7)
H.5.2
User Interfaces (D.2.2, H.1.2, I.3.6)
Subjects:
Evaluation/methodology
I.
Computing Methodologies
I.6
SIMULATION AND MODELING
General Terms:
Design,
Human Factors,
Performance
Keywords:
Poisson models,
cost-benefit analysis,
heuristic evaluation,
iterative design,
usability engineering,
usability problems,
user testing
|