These slides cover various aspects of testing in the context of computer science and human-computer interaction (HCI). It includes topics like evaluation techniques, empirical methods in HCI, controlled experiments, and various testing approaches such as A/B testing, cognitive walkthrough, heuristic evaluation, and review-based evaluation. Additionally, it touches on issues related to internal and external validity, reliability in testing, and different experimental designs like between-subjects and within-subjects experiments. This material appears to be educational and is likely used in a course related to computer science or HCI.
All content within this presentation is the property of Royal Holloway, University of London. Unauthorized use, duplication, or distribution of the materials contained herein is strictly prohibited.
User Experiments in Human-Computer InteractionDr. Arindam Dey
This lecture covers the basics of user experiment design in human-computer interaction. Computer scientists and developers often create interfaces for a particular purpose. This lecture explains how a user experiment can be designed and conducted to systematically compare one interface with the other.
Chapter 9: Evaluation techniques
from
Dix, Finlay, Abowd and Beale (2004).
Human-Computer Interaction, third edition.
Prentice Hall. ISBN 0-13-239864-8.
http://www.hcibook.com/e3/
Introduction to Usability Testing for Survey ResearchCaroline Jarrett
The basics of how to incorporate usability testing in the development process of a survey. Workshp first presented at the SAPOR conference, Raleigh, North Carolina USA, October 2011 by Emily Geisen of RTI and Caroline Jarrett of Effortmark.
User Experiments in Human-Computer InteractionDr. Arindam Dey
This lecture covers the basics of user experiment design in human-computer interaction. Computer scientists and developers often create interfaces for a particular purpose. This lecture explains how a user experiment can be designed and conducted to systematically compare one interface with the other.
Chapter 9: Evaluation techniques
from
Dix, Finlay, Abowd and Beale (2004).
Human-Computer Interaction, third edition.
Prentice Hall. ISBN 0-13-239864-8.
http://www.hcibook.com/e3/
Introduction to Usability Testing for Survey ResearchCaroline Jarrett
The basics of how to incorporate usability testing in the development process of a survey. Workshp first presented at the SAPOR conference, Raleigh, North Carolina USA, October 2011 by Emily Geisen of RTI and Caroline Jarrett of Effortmark.
Designing for maximum usability – the goal of interaction design
Principles of usability
general understanding
Standards and guidelines
direction for design
Design patterns
capture and reuse design knowledge
Business Research Method - Unit II, AKTU, Lucknow SyllabusKartikeya Singh
Business Research Methods, Unit II, AKTU, Lucknow Syllabus.
Research Methodology, Topics Covered - Research design: Concept, Features of a good research design, Use of a good research design; Qualitative and Quantitative research approaches, Comparison – Pros and Cons of both approaches.
Exploratory Research Design: Concept, Types: Qualitative techniques – Projective Techniques, Depth Interview, Experience Survey, Focus Groups, Observation.
Descriptive Research Designs: Concept, types and uses. Concept of Cross-sectional and Longitudinal Research
Experimental Design: Concept of Cause, Causal relationships, Concept of Independent & Dependent variables, concomitant variable, extraneous variable, Treatment, Control group.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Designing for maximum usability – the goal of interaction design
Principles of usability
general understanding
Standards and guidelines
direction for design
Design patterns
capture and reuse design knowledge
Business Research Method - Unit II, AKTU, Lucknow SyllabusKartikeya Singh
Business Research Methods, Unit II, AKTU, Lucknow Syllabus.
Research Methodology, Topics Covered - Research design: Concept, Features of a good research design, Use of a good research design; Qualitative and Quantitative research approaches, Comparison – Pros and Cons of both approaches.
Exploratory Research Design: Concept, Types: Qualitative techniques – Projective Techniques, Depth Interview, Experience Survey, Focus Groups, Observation.
Descriptive Research Designs: Concept, types and uses. Concept of Cross-sectional and Longitudinal Research
Experimental Design: Concept of Cause, Causal relationships, Concept of Independent & Dependent variables, concomitant variable, extraneous variable, Treatment, Control group.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...Levi Shapiro
Letter from the Congress of the United States regarding Anti-Semitism sent June 3rd to MIT President Sally Kornbluth, MIT Corp Chair, Mark Gorenberg
Dear Dr. Kornbluth and Mr. Gorenberg,
The US House of Representatives is deeply concerned by ongoing and pervasive acts of antisemitic
harassment and intimidation at the Massachusetts Institute of Technology (MIT). Failing to act decisively to ensure a safe learning environment for all students would be a grave dereliction of your responsibilities as President of MIT and Chair of the MIT Corporation.
This Congress will not stand idly by and allow an environment hostile to Jewish students to persist. The House believes that your institution is in violation of Title VI of the Civil Rights Act, and the inability or
unwillingness to rectify this violation through action requires accountability.
Postsecondary education is a unique opportunity for students to learn and have their ideas and beliefs challenged. However, universities receiving hundreds of millions of federal funds annually have denied
students that opportunity and have been hijacked to become venues for the promotion of terrorism, antisemitic harassment and intimidation, unlawful encampments, and in some cases, assaults and riots.
The House of Representatives will not countenance the use of federal funds to indoctrinate students into hateful, antisemitic, anti-American supporters of terrorism. Investigations into campus antisemitism by the Committee on Education and the Workforce and the Committee on Ways and Means have been expanded into a Congress-wide probe across all relevant jurisdictions to address this national crisis. The undersigned Committees will conduct oversight into the use of federal funds at MIT and its learning environment under authorities granted to each Committee.
• The Committee on Education and the Workforce has been investigating your institution since December 7, 2023. The Committee has broad jurisdiction over postsecondary education, including its compliance with Title VI of the Civil Rights Act, campus safety concerns over disruptions to the learning environment, and the awarding of federal student aid under the Higher Education Act.
• The Committee on Oversight and Accountability is investigating the sources of funding and other support flowing to groups espousing pro-Hamas propaganda and engaged in antisemitic harassment and intimidation of students. The Committee on Oversight and Accountability is the principal oversight committee of the US House of Representatives and has broad authority to investigate “any matter” at “any time” under House Rule X.
• The Committee on Ways and Means has been investigating several universities since November 15, 2023, when the Committee held a hearing entitled From Ivory Towers to Dark Corners: Investigating the Nexus Between Antisemitism, Tax-Exempt Universities, and Terror Financing. The Committee followed the hearing with letters to those institutions on January 10, 202
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Macroeconomics- Movie Location
This will be used as part of your Personal Professional Portfolio once graded.
Objective:
Prepare a presentation or a paper using research, basic comparative analysis, data organization and application of economic information. You will make an informed assessment of an economic climate outside of the United States to accomplish an entertainment industry objective.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
3. Evaluation
techniques
Evaluation
• tests usability and functionalityof system
• occurs in laboratory, field and/or in collaboration
with users
• evaluates both design and implementation
• should be considered at all stages in the design life
cycle
6. Cognitive
walkthrough
(task-specific)
Based on the concept that users prefer to learn
by doing rather than read manuals
Evaluates design on how well it supports users
in learning specific tasks
Performed by a group that includes the UI
designers and developers, led by experts in
cognitive psychology
The group ‘walks through’ the design to identify
potential problems using psychological
principles
Forms are used to guide the analysis on each
step of the tasks
7. Heuristic
evaluation
(holistic)
Proposed by Nielsen and Molich
Compares the UI design against
accepted usabilityprinciples
Identify usability criteria (heuristics)
Experts check that the design meets the
criteria
Example heuristics
• Consistencyand standards
• Match between systemand the real world
• Error prevention
Heuristic evaluation ‘debugs’ design
8. Review-based
evaluation
(holistic)
Expert-based evaluation method
Results from experimental results and
empirical evidence, found in the literature to
validate design methods
Care needed to ensure results are
transferable to new design
Model-based evaluation
Cognitive models used to filter design
options
• e.g. GOMS1 prediction of user performance
Design rationale can also provide useful
evaluation information
1GOMS
"a set of Goals,a set of Operators,aset of Methods forachievingthegoals, and a set of Selectionsrules for choosing among competingmethods forgoals."
10. Empirical
methods in
HCI
Lab experiment
• Artificial, highly controlledby experimenter
Field study
• Occurs in the actual environment people use the UI
and with real tasks
Survey
• Questionnaire, conductedby paper, phone, web, or
in person
12. Controlled
experiment
Controlled evaluation of specific aspects of
interactive behaviour
Evaluator chooses hypothesis to be tested
Manipulate independent variables
• Different placement, font size, input
Measure dependent variables
• Times, #errors, #tasks done, satisfaction
Use statistical analysis to accept or reject the
hypothesis
• How changes in independent variables affect the
dependent variables – are those effects significant?
14. Designing the
experiment
Subjects/users: who – representative, sufficient
sample
Implementation: real environment, artificial
variations
Tasks
• Real tasks: word processing, e-mail, web browsing
• Artificial: users focus on a simple subset of tasks
Measuring: how to count time, #clicks, #errors
Ordering: of conditions and tasks
Hardware: physical conditions of the test,
available inputs
15. Hypothesis
Prediction of outcome
• Framed in terms of independent and dependent
variables
e.g. “error rate will increase as font size decreases”
Null hypothesis
• States no difference between conditions
• The aim is to disprove this
e.g. null hypothesis = “no change of error rate with
font size”
16. A/B Testing
Experiment based on two alternative
interfaces
• Normally A is the controland B is the variation
In Web design, this is normally used to
identify improvementsthat can maximise a
certain outcome of interest
Normally the currentversionof the interface
is associated with the null hypothesis
17. Concerns
Internal validity
Are observed results actually caused by the
independent variables?
External validity
Can observed results be generalised to the world
outside the lab?
Reliability
Will consistent results be obtained by repeating
the experiment?
18. Threats to
Internal
Validity
Ordering effects
• People learn, and people get tired
• Randomise or counterbalance ordering
Selection effects
• Avoid pre-existinggroups (unless the group is an
independent variable)
• Randomly assign users to independent variables
Experimenter bias
• Experimenters may prefer an hypothesis to be
proven valid
• Double blind experiment is quite hard for HCI
• Controlprotocol
19. Threats to
External
Validity
Population
• Draw a random sample from the real target
population
Ecological
• Make lab conditions as realistic as possible
Training
• Training should mimic how the real interface would
be encounteredand learned
Task
• Tasks for testingshould be based on task analysis
20. Threats to
Reliability
Uncontrolled variation
• User differences
• Task design
• Measurement error
Solutions
• Eliminate uncontrolled variation
Select users by experience
Give consistent training
Measure dependent variables precisely
• Repetition, repetition
Many users, many trials
Standard deviation of the mean shrinks like the square
root of N (i.e. quadrupling #users makes the mean
twice as accurate)
21. Blocking
Divide samples into subsets that are more
homogeneous than the whole set
• Example: testing wear rate of different shoe sole
material
Lots of variation between feet of different people, but
the feet on the same person are more homogeneous
Apply all conditions within each block
• Test material A on one foot, material B on the other
Measure difference within block
• Wear(A) – Wear(B)
Randomise within the block to eliminate validity
threats
• Randomly put A on left or right foot
22. Between-
subjects
experiment
Each subject performs the experiment under
only one condition
Results are compared between different
groups
• Is mean(xi) > mean (yj) ?
No transfer of learning
More users required
Variation can bias results
23. Within-
subjects
experiment
Each subject performs the experiment under
each condition
Results are compared within each user
• For user i computexi – yi
• Is mean(xi-yi) > 0 ?
Transfer of learning possible
Less costly and less likely
to sufferfrom user variation
24. Counterbalancing
Defeats ordering effects by varying order of conditions
systematically(not randomly)
Latin Square designs
• Randomly assign subjects to equal-size groups
• A, B, C, … are the experimental conditions
• Latin Square ensures that each condition occurs in every
position in the ordering for an equal number of users
• Balanced Latin Squares: http://www.yorku.ca/mack/RN-
Counterbalancing.html
G1 G2
A B
B A
G1 G2 G3
A C B
B A C
C B A
G1 G2 G3 G4
A D C B
B A D C
C B A D
D C B A
25. Kinds of
measures
Self-report
• E.g. satisfaction
Observation
• Visible vs. hidden observer
• Hawthorne effect1
Archivalrecords
• Public vs. private
Trace
• Subjects normally unaware (e.g. testing for book
read wear)
1Hawthorneeffect:thealterationof behaviourby the subjectsof a study due to theirawarenessof being observed.
27. Interviews
Analyst questionsuser on one-to -one basis
usually based on prepared questions
Informal, subjective and relatively cheap
Advantages
• Can be varied to suit context
• Issues can be explored more fully
• Can elicit user views and identify unanticipated
problems
Disadvantages
• Very subjective
• Time consuming
28. Questionnaires
Set of fixed questions given to users
Advantages
• Quick and reaches large user group
• Can be analysed more rigorously
Disadvantages
• Less flexible
• Less probing
29. Questionnaires
Need careful design
• What information is required?
• How are answers to be analysed?
Styles of question
• General
• Open-ended
• Scalar
• Multi-choice
• Ranked
31. Eye tracking
Head or desk mounted equipment tracks eye
position
Eye movement reflects the amount of
cognitive processinga display requires;
measurements include
• fixations: eye maintains stable position. Number
and duration indicate level of difficulty with display
• saccades: rapid eye movement between points of
interest
• scan paths: moving straight to a target with a short
fixation at the target is optimal
32. Physiological
measurements
Emotional responselinked to physical
changes
These may help determine a user’s reaction
to an interface; measurementsinclude:
• heart activity, including blood pressure, volume and pulse.
• activity of sweat glands: Galvanic Skin Response (GSR)
• electrical activity in muscle: electromyogram (EMG)
• electrical activity in brain: electroencephalogram (EEG)
Some difficulty in interpretingthese
physiologicalresponses- more research
needed
34. Choosing an
evaluation
method
Question Decision
When in process design vs. implementation
Style of evaluation laboratory vs. field
Level of objectivity subjective vs. objective
Type of measures qualitative vs. quantitative
Level of information high level vs. low level
Level of interference obtrusive vs. unobtrusive
Available resources
time, subjects,
equipment, expertise