Show simple item record

dc.contributor.authorCapra, Miranda Galadrielen
dc.date.accessioned2014-03-14T20:08:21Zen
dc.date.available2014-03-14T20:08:21Zen
dc.date.issued2006-03-13en
dc.identifier.otheretd-03222006-201913en
dc.identifier.urihttp://hdl.handle.net/10919/26477en
dc.description.abstractPrevious usability evaluation method (UEM) comparison studies have noted an evaluator effect on problem detection in heuristic evaluation, with evaluators differing in problems found and problem severity judgments. There have been few studies of the evaluator effect in usability testing (UT), task-based testing with end-users. UEM comparison studies focus on counting usability problems detected, but we also need to assess the content of usability problem descriptions (UPDs) to more fully measure evaluation effectiveness. The goals of this research were to develop UPD guidelines, explore the evaluator effect in UT, and evaluate the usefulness of the guidelines for grading UPD content.

Ten guidelines for writing UPDs were developed by consulting usability practitioners through two questionnaires and a card sort. These guidelines are (briefly): be clear and avoid jargon, describe problem severity, provide backing data, describe problem causes, describe user actions, provide a solution, consider politics and diplomacy, be professional and scientific, describe your methodology, and help the reader sympathize with the user. A fourth study compared usability reports collected from 44 evaluators, both practitioners and graduate students, watching the same 10-minute UT session recording. Three judges measured problem detection for each evaluator and graded the reports for following 6 of the UPD guidelines.

There was support for existence of an evaluator effect, even when watching pre-recorded sessions, with low to moderate individual thoroughness of problem detection across all/severe problems (22%/34%), reliability of problem detection (37%/50%) and reliability of severity judgments (57% for severe ratings). Practitioners received higher grades averaged across the 6 guidelines than students did, suggesting that the guidelines may be useful for grading reports. The grades for the guidelines were not correlated with thoroughness, suggesting that the guideline grades complement measures of problem detection.

A simulation of evaluators working in groups found a 34% increase in severe problems found by adding a second evaluator. The simulation also found that thoroughness of individual evaluators would have been overestimated if the study had included a small number of evaluators. The final recommendations are to use multiple evaluators in UT, and to assess both problem detection and description when measuring evaluation effectiveness.

en
dc.publisherVirginia Techen
dc.relation.haspartMCapra_dissertation.pdfen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectthoroughnessen
dc.subjectreliabilityen
dc.subjectevaluator effecten
dc.subjectusability testingen
dc.subjectformative usability evaluation methodsen
dc.subjectproblem descriptionsen
dc.titleUsability Problem Description and the Evaluator Effect in Usability Testingen
dc.typeDissertationen
dc.contributor.departmentIndustrial and Systems Engineeringen
dc.description.degreePh. D.en
thesis.degree.namePh. D.en
thesis.degree.leveldoctoralen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.disciplineIndustrial and Systems Engineeringen
dc.contributor.committeechairSmith-Jackson, Tonya L.en
dc.contributor.committeememberNussbaum, Maury A.en
dc.contributor.committeememberKleiner, Brian M.en
dc.contributor.committeememberBurton, John K.en
dc.contributor.committeememberHartson, H. Rexen
dc.identifier.sourceurlhttp://scholar.lib.vt.edu/theses/available/etd-03222006-201913/en
dc.date.sdate2006-03-22en
dc.date.rdate2006-04-05en
dc.date.adate2006-04-05en


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record