A short presentation given in September 2009 to finish my master's degree. The project's experiment found that evaluators working collaboratively could identify more usability problems and reach a significantly higher level of inter-evaluator agreement than with the traditional method of heuristic evaluation.
10. Still very popular
• Usability scores for supermarket websites (Chen 2005)
• Generating user protocols for medical equipment (Zhang 2003)
• Comparing library websites (Peng 2004)
15. Standard Heuristic Evaluation (SHE)
“each individual evaluator inspect the interface alone ... to ensure independent and unbiased evaluations from each evaluator” (Nielsen 1994)
16. Collaborative Heuristic Evaluation (CHE)
• Collaborative inspection to find potential problems
• Individual evaluation with secret voting
17. Experimental design

        Group 1                         Group 2
CHE     National Rail, Visit Britain    Easy Jet, British Towns
SHE     Easy Jet, British Towns         National Rail, Visit Britain
18. Research questions
• Do they find the same problems?
• Does CHE find more severe problems?
• Does CHE find problems more reliably?
• Does CHE use the heuristics better?
19. Research questions
• Do they find the same problems? - NO
• Does CHE find more severe problems? - maybe
• Does CHE find problems more reliably? - YES!
• Does CHE use the heuristics better? - NO
27. Future research
• Explore social processes
  • e.g. remote voting
• Explore how experts perform their role
  • e.g. compare heuristic use & severity ratings between experts
• Use it to train & evaluate novices
Editor's Notes
Scenarios + think-aloud user testing + heuristic evaluation
Nielsen's 10 heuristics - available on his website, useit.com
So how does it work? Take John...
...then Alesha.
She finds 25 problems; merge the two sets together...
...and that makes 56. Add Carolyn...
She finds 29; merge them together...
...and, fab, we have 78.
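Merging problem sets is, at heart, a set union: a problem found by several evaluators is counted only once. A minimal sketch with made-up problem IDs (real evaluations need careful problem matching before sets can be merged):

```python
from collections import Counter

# Illustrative problem IDs only - in practice, deciding that two reported
# problems are "the same" is the hard part (see Hvannberg & Law below).
john    = {"P01", "P02", "P03", "P05"}
alesha  = {"P02", "P04", "P05", "P06"}
carolyn = {"P03", "P06", "P07"}

merged = john | alesha | carolyn           # set union: duplicates count once
print(len(john | alesha), len(merged))     # 6 7

# How many evaluators found each problem? In the study, most problems
# were found by just one evaluator.
counts = Counter(p for s in (john, alesha, carolyn) for p in s)
print(sorted(p for p, c in counts.items() if c == 1))   # ['P01', 'P04', 'P07']
```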
Lots of studies have found that 5 evaluators/users uncover 50-75% of the problems.
These studies had the advantage of knowing how many problems there were in total.
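The 50-75% figure comes from problem-discovery curves of the kind Nielsen and Landauer fitted; a sketch of that model, where the lambda value is a commonly cited average rather than anything from this study:

```python
# Nielsen & Landauer's problem-discovery model: the proportion of problems
# found by i independent evaluators is 1 - (1 - L)**i, where L is the
# probability that a single evaluator finds any given problem.
def proportion_found(i: int, L: float = 0.31) -> float:
    return 1 - (1 - L) ** i

for i in (1, 3, 5, 10):
    print(i, round(proportion_found(i), 2))
# With L = 0.31, 5 evaluators find ~84%; smaller L values (harder-to-find
# problems) give the 50-75% range reported across studies.
```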
Add the severity ratings of all problems under each heuristic, weight them, then sum to find a global score (Sainsbury's won!)
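A minimal sketch of that weighted global-score idea, with illustrative heuristics, ratings, and weights (not the study's values):

```python
# Severity ratings of the problems filed under each heuristic (illustrative).
severities = {
    "visibility of system status": [3, 2, 4],
    "error prevention": [4, 4],
    "consistency and standards": [1, 2],
}
# Assumed per-heuristic importance weights (illustrative).
weights = {
    "visibility of system status": 1.0,
    "error prevention": 1.5,
    "consistency and standards": 0.8,
}
# Sum severities within each heuristic, weight, then total across heuristics.
global_score = sum(weights[h] * sum(ratings) for h, ratings in severities.items())
print(global_score)   # 23.4 - higher means more, or more severe, problems
```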
Infusion pumps can't be changed, but you can spot the problems and help ensure people don't make them again.
The method has been going so long - what is left to ask?
Back to the 78 usability problems...
...they are made up of three individual sets.
Most were found by just one evaluator.
If you were making a change based on a problem found by only one evaluator, would you feel confident?
Was it overlooked by the other 4 (assuming we have a quorum), or was it rejected as a problem by 4?
Confirmed in the literature - Gilbert Cockton and Alan Woolrych (Sunderland).
Problem matching - Ebba Hvannberg and Effie Law.
Why? Evaluators work independently, so they see different pages, or the same pages in different ways.
In CHE they are together in the same room and see the same interface at the same time,
and it keeps the unbiased approach: no discussion and secret voting.
2x2 design
individual differences - superstar evaluators
Compare problems between the two conditions on the same website.
Imagine a 6th evaluator - we don't expect them to overlap much, as they viewed different pages.
Two-way ANOVA, CHE vs SHE - but most problems were found by only one evaluator, so that questions the premise.
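For illustration, a minimal two-way ANOVA sketch with evaluation method and website as factors; the response variable and all data here are hypothetical, not the thesis's:

```python
import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols

# Hypothetical severity ratings, crossed by method (CHE/SHE) and website.
df = pd.DataFrame({
    "severity": [3, 2, 4, 3, 1, 2, 3, 2, 4, 3, 2, 1],
    "method":   ["CHE"] * 6 + ["SHE"] * 6,
    "website":  ["NationalRail", "EasyJet"] * 6,
})

# Fit the factorial model and print the two-way ANOVA table.
model = ols("severity ~ C(method) * C(website)", data=df).fit()
print(sm.stats.anova_lm(model, typ=2))
```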
The vague-heuristics question - no pattern of usage and very little agreement.
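Agreement on which heuristic a problem belongs to can be quantified with, for example, Fleiss' kappa for categorical ratings; a sketch under that assumption (not necessarily the measure used in the study), with made-up heuristic assignments:

```python
import numpy as np
from statsmodels.stats.inter_rater import aggregate_raters, fleiss_kappa

# Rows are problems, columns are evaluators; values are the index (0-9) of
# the Nielsen heuristic each evaluator assigned. Illustrative data only.
ratings = np.array([
    [0, 0, 1],   # problem 1: two evaluators chose heuristic 0, one chose 1
    [2, 2, 2],
    [5, 3, 5],
    [9, 9, 8],
])
table, _ = aggregate_raters(ratings)   # per-problem counts for each category
print(fleiss_kappa(table))             # near 0 = chance-level agreement
```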
National Rail
Most problems were found by only 1 evaluator, none by all.
Steven Karau - a good round-up of studies.
Irving Janis - groupthink: group discussion comes to the wrong conclusions, but we banned discussion.
Social loafing - "he said it was, so I'm just gonna agree".
Evaluators could not loaf on their severity-rating decisions, and could not know how the other evaluators were voting.