Tonkin.2011.eye tracking within the packaging design workflow interaction with physical and virtual shelves
Eye Tracking Within the Packaging Design Workﬂow: Interaction with Physical and Virtual Shelves Chip Tonkin Andrew D. Ouzts Andrew T. Duchowski Clemson University Clemson University Clemson University email@example.com firstname.lastname@example.org email@example.comABSTRACT searching for a product on a store shelf. Companies such asMeasuring consumers’ overt visual attention through eye Kraft Foods, PepsiCo, and Unilever regularly employ thistracking is a useful method of assessing a package design’s technology in the development of new packaging and retailimpact on likely buyer purchase patterns. To preserve eco- strategies, however, their methodologies and results are notlogical validity, subjects should remain immersed in a shop- available to the public . For this approach to be usefulping context throughout the entire study. Immersion can be to and to be widely adopted by industry, it needs to be anachieved through proper priming, environmental cues, and integral part of the design process that is incorporated as avisual stimuli. While a complete physical store oﬀers the feedback loop attached to the creative step in the workﬂowmost realistic environment, the use of projectors in creating (as opposed to a research study performed post facto). Thea virtual environment is desirable for eﬃciency, cost, and goal is similar to performance simulation developed for eval-ﬂexibility reasons. Results are presented from a study com- uation of package design (similar to shelf life, sustainability,paring consumers’ visual behavior in the presence of either and shipping capability), and it follows that one would alsovirtual or physical shelving through eye movement perfor- want to accurately predict its consumer response.mance and process metrics and their subjective impressions. The challenge is developing the workﬂow, equipment pa-Analysis suggests a diﬀerence in visual search performance rameters, and environment that provides a realistic approx-between environments even though the perceived diﬀerence imation to the retail shopping experience. It must be usableis negligible. for a wide variety of products, remain cost eﬀective, and provide meaningful results. Most of the eye tracking equip- ment and the bulk of published research suggest use of anCategories and Subject Descriptors all-in-one monitor system. This is suﬃcient for testing webJ.4 [Computer Applications]: Social and Behavioral Sci. usability and responses to printed ads and promotions be- cause the size of the test stimuli readily ﬁt the screens and1. INTRODUCTION the seating position of the subjects is not unnatural. How- ever, this is not a realistic shopping simulation. The shift in consumer behavior over the past two decades Alternatives to the all-in-one desktop system include self-is forcing dramatic changes in the way products are designed, standing eye tracking modules that can be used in front ofpackaged and marketed. Presently 70% of consumer pur- physical samples or a projector screen. These have beenchase choices are made at the shelf, 85% are made without used for retail studies, but the requirement of standing per-even picking up a competitive item, and 90% are made af- fectly still in front of the shelf or projection screen makes thister looking at just the front face of the packaging . Thus setup less than ideal for simulating the retail shopping ex-package design clearly plays a signiﬁcant role in determin- perience. To better simulate the shopping experience, con-ing the success of a product, but as Clement points out, sumers should be allowed to “wander up and down the aisles”currently accepted design methodology understates this im- as they would do in reality.pact and does not include any objective method of assess- Obviously the most realistic consumer experience studying the product’s visual impact on buying decisions. Eye ought to be performed in an actual retail store, and intracking is a means of quantifying an observer’s overt vi- some instances this may be possible, but the logistics ofsual attention, and can be used to evaluate and compare regularly rearranging a store to meet experimental condi-visual search patterns of individuals in a variety of situa- tions, and controlling many other real-world variables maketions. This approach is readily applicable to the measure- this prospect infeasible for the majority of typical controlledment of consumer behaviour in a retail environment when studies. The CUshop consumer experience lab (depicted in Fig- ure 1) will simulate browsing freedom within a realistic en-Permission to make digital or hard copies of all or part of this work for vironment. The lab is designed as a fully self-containedpersonal or classroom use is granted without fee provided that copies are environment with sliding glass doors, re-conﬁgurable shelv-not made or distributed for proﬁt or commercial advantage and that copies ing, a refrigerated section, and appropriate signage and win-bear this notice and the full citation on the ﬁrst page. To copy otherwise, to dow treatments to create a realistic consumer shelf simu-republish, to post on servers or to redistribute to lists, requires prior speciﬁcpermission and/or a fee. lation. While it is understood that the physical environ-NGCA ’11 Karlskrona, Sweden ment is ideal, the lab will contain equipment to run studiesCopyright 2011 ACM 978-1-4503-0680-5/11/05 ...$10.00.
Figure 1: Architectural renderings of the CUshop consumer experience laboratory (to be completed June2011). The winning design was developed as part of the Spring 2011 Creative Inquiry class led by A. Hurley.within projected virtual environments because projection of- tion is key in the buying decision [16, 12]. Buying decisionsfers cheaper and faster setup while preserving a high level are based on a combination of brand recognition and whatof stimulus control. The purpose of this study is to evalu- Chandon et al. coined “visual equity” . This term refersate consumers’ visual behavior and subjective impressions to the incremental consideration given to items that attractas they perform a product search task when encountering a buyer’s attention so that while a consumer enters with aeither a virtual projected or physical shelf unit. certain amount of “memory equity” related to their needs The use of projectors to simulate an otherwise expensive and understanding of brand value, this can be changed ator diﬃcult environment is not unusual. Flight and driving the point of decision by what catches their attention. Eyesimulators have been used successfully for training and re- tracking is a good tool for measuring this eﬀect, as Johansensearch purposes, and while they are not able to achieve com- and Hansen discovered during webpage navigation, notingplete physical or photo-realism, they have served as viable that individual recollection of what attracted attention andpredictors of behavior in various situations . This paper what order things were seen in were not nearly as accuratecompares and contrasts the impact of a virtual store shelf as a record of recorded eye movements .on consumer behavior to gauge the level of realism aﬀorded. Because shoppers may not remember what they saw orAlthough patterns of information acquisition when viewing perhaps because they may not be willing to honestly di-an image are expected to be similar to those when viewing vulge their reasons for their decisions, the practice of mea-the real environment, the general level of performance in ob- suring “brand recall” as typically done in marketing stud-ject memory tests has been shown to be better in the latter ies is largely meaningless. According to Chandon et al.,. In this paper visual is search is examined between the brand recall is overwhelmingly driven by brand familiarityreal and virtual to test for performance diﬀerences, if any. and, oddly, eye ﬁxations on products within a given mar- ket segment can enhance brand recall for the target product whether it is present in the study or not . They found2. BACKGROUND that major brands tended to inhibit the recollection of mi- Russo and LeClerc describe the consumer’s product se- nor brands while conversely the viewing of minor brandslection process in three stages . The ﬁrst, dubbed “Ori- tended to enhance the recall of major brands.entation”, relies on the ability to evaluate overall patterns, Young makes the point that the most important factorcolors, and shapes in the scene. Once an interesting area in achieving applicable results is that the consumer mustis identiﬁed, the consumer transitions to “Evaluation”, in be kept in a shopping context . He stated that “when awhich the focus is on a small number of items. At this stage shopper is removed from this context, she often leaves be-information is likely processed much more intently and in a hind the shopping mindset and, instead, takes on an art di-serial fashion . The last phase of the process deﬁned by rector’s aesthetic mentality.” He compares this to a “beautyRusso and LeClerc is the “Veriﬁcation” stage. This is the contest” in which the most aesthetically pleasing packagepoint at which the consumer veriﬁes that the product meets tends to win (this is not typically the attribute that actu-their needs, makes pricing comparisons, and garners assur- ally decides purchase decisions at the shelf). This lack ofance that it was the right choice. Currently, the consumer’s realism has been a signiﬁcant problem (noted or otherwise)rationale for making product selection decisions are stud- in practically all of the consumer shelf studies thus far.ied using standard interview techniques, focus groups, and Russo and LeClerc noted that the mean decision time inobservation, but being able to objectively determine when, their experiment was well above industry norm (30 secondshow long, and what attracts attention could give much more vs. 12 seconds) and gave several likely experimental setupprecise and actionable information than the softer, subjec- reasons for their subjects’ slower behavior . In review-tive responses typically garnered from a focus group . ing these ﬁndings along with those of other experiments, Assuming recorded visual attention follows the fovea , Clement found that they presented serious validity problemseye tracking can be a valuable tool for assessing consumer because they were in laboratory experiments that poorlyattention in shopping environments . In cases where simulated real-world conditions . Subjects were sitting inthere is clear brand recognition, attention-catching packag- chairs looking at pictures of packages or viewing relativelying (imagery, colors, etc.) have less impact, but when brand small projected images that were not accurate for size oris not a major consideration, packaging that captures atten- visual angle.
4.0 m 2.5 m 1.25 m 0.75 m Figure 2: Physical and virtual viewing dimensions with example participant searching for target item. Even if subjects are shown an accurate picture with ob- 3. METHODOLOGYjects taking up the same amount of visual angle as a real The eﬀect of physical or virtual environment was mea-life product, performance on spatial tasks signiﬁcantly im- sured on performance (visual search), process (eye move-proves as the image becomes larger . This supports the ments), and subjective measures (i.e., the feeling of presencenotion that peripheral vision is used to improve search capa- within each environment and preference). The main taskbility. When searching, the consumer visually perceives the was search for a target item, with the main experimentalenvironment (i.e., the store shelf) in toto and in parallel, ori- factor consisting of environment type.enting selective attention . This can be compared to one’s Stimulus. Two shelving environments were created forability to hear and “feel” the overall surrounding sounds or the experiment. The physical shelf was a 3.6 m (141 ) Aisleselectively listening to a particular voice in a crowd, but not made with a Gondola 0.6 m (23 ) base system, constitutingdo both at the same time . The size of the visual ﬁeld is a 2 m (78 ) tall shelving system with four 0.4 m (16 ) deeptherefore likely to inﬂuence visual search, e.g., a small ﬁeld is upper shelves (this was used store shelving removed fromlikely to restrict parafoveal preview beneﬁt. Construction of a major US retailer). The shelf was populated with reala visually realistic shopping environment is likely to matter physical cereal boxes with two fabricated cereal brands usedto any studies of shoppers’ visual behavior. as search targets. Factors such as visual realism and the visual ﬁeld size need The virtual environment was a snapshot of the physicalto be considered in the CUshop design and methodology. To shelf projected on a wall. The image was captured by aachieve ecological validity, we believe it is necessary for the Canon EOS Rebel T1i 500D camera mounted on a tripodlaboratory to look and feel like a shopping environment, the approximating the eye-level of an average-height US adulteye tracking equipment must be unobtrusive and ﬂexible, (1.7 m (67 ) ). The image was then corrected for geo-and the task be structured and primed in such a way that metrical distortion caused by the lens, cropped, and resam-the subject carries the mindset of a consumer during the pled to achieve pixel dimensions of 2560×800, and displayedlength of the study. across two Epson BrightLink 450 WI projectors, chosen for To our knowledge, thus far all consumer-related eye track- their brightness and short throw distance which eliminateding studies have been restricted to environments where the shadow interference when standing in front of the display.visual display was projected on a screen. Studies most sim- In both physical and virtual presentations of the cerealilar to ours include those of Lundberg, Whitney et al., and shelf, care was taken to present the participant with theChandon et al. [9, 18, 3]. Lundberg proposed development same apparent view. In both instances the environmentof the Packaging Media Lab, in which an eye tracker would measured 4.0×1.25 m (160 ×49 ) at an elevation of 0.75 mbe used while a shopper viewed a shelf of products pro- (30 ) oﬀ the ground, as sketched in Figure 2. In both phys-jected on a screen. The lab was eventually designed by The ical and virtual search tasks, participants stood centered atPackaging Arena, Ltd., and built within the Bergvik shop- a distance of 2.5 m (98 ) from either display.ping centre in Karlstad, Sweden. Whitney et al. constructed The stimuli (see Figure 3) used as search targets werethe Balance NAVE Automatic Virtual Environment consist- cereal boxes made especially for this study to preclude fa-ing of three back-projected screens providing a wide ﬁeld of miliarity with the products. Artiﬁcial cereal boxes were cre-view projection-based system. Their purpose was not to ated to ensure that they could not have been known a prioritest shopping decisions per se, rather, it was to test the ef- to any of the participants. Each box measured 22×28 cmfect of navigation through the VR grocery environment on (8.5 ×11 ) and matched the dimensions of a box on the pro-participants with and without vestibular dysfunction (no eye jector wall. Figure 4 shows one of the physical cereal boxestracker was used). Chandon et al. used an eye tracker when matching the dimensions of its projected counterpart.looking at planograms (shown on a single 4 ×5 screen, 80 Yellow and black price tags, visible in Figure 4, were alsoaway from the viewer) to test the inﬂuence of number of artiﬁcially created for this study and displayed below ev-shelf facings and position. ery distinct cereal box. Tobii’s infra-red (IR) markers were In the present study, we measure the diﬀerences in vi- placed atop the darker portions of the price tags in an eﬀortsual behavior between a virtual environment and its physical to blend their appearance.counterpart from which the virtual is derived.
Figure 3: Artiﬁcial cereal boxes designed and con- Figure 4: Physical cereal box held against its coun-structed speciﬁcally for the experiment. terpart projected in the virtual environment. Participants. The study recruited 42 participants re- Apparatus. Eye movements were captured using Tobii cruited from Packaging Science and Computer Science clas-Glasses, a head-mounted eye tracking system resembling a ses. Ten participants were excluded from analysis due to cal-pair of glasses (see Figure 5(a)). The tracker is monocular ibration issues (speciﬁcally we found that calibration points(right eye only), sampling at 30 Hz with 56◦ × 40◦ record- on the left side of the grid were diﬃcult for these participantsing visual angle. The Tobii Glasses were used in conjunction to ﬁxate; a possible consequence of the monocular nature ofwith two other pieces of hardware: the Recording Assistant the Tobii Glasses). Four additional participants were ex-and IR markers. The Recording Assistant is a small de- cluded for incorrectly performing the task on at least onevice (4.7 ×3.1 ×1.1 ) that attaches to the glasses and is trial—data showed post facto that these participants neverused to both calibrate the eye tracker and store recorded ﬁxated the target box, their data could thus be consideredeye movement and video data on a mini-SD card. IR mark- oﬀ-target or erroneous. Analysis therefore considered onlyers (see Figure 5(b)) are used to delineate an Area of Anal- successful trials, consisting of data captured from 28 partici-ysis (AOA), a plane determined by the placement of 4 or pants (18 male, 14 female). These participants’ ages rangedmore IR markers, similar in concept to an Area/Region of from 20 to 42 (median 22).Interest (A/ROI) commonly used in eye tracking research to Procedure. Before starting the experiment, participantsdelineate sections of stimulus within which ﬁltered eye move- were asked to ﬁll out a basic demographic questionnairements, i.e., ﬁxations, are counted. The diﬀerence between an (gender, age, use and type of corrective lenses, etc.). TheyAOA and an AOI is that an AOA exists in physical space and were then walked to an unmarked, white wall for the cal-is required for data aggregation when the glasses are used. ibration process. Participants stood 1 m (39 ) from theAn IR marker serves this function only when attached to an wall and underwent the 9-point calibration procedure. TenIR marker holder; otherwise, it works in calibration mode participants could not achieve a satisfactory calibration andand emits a visible (green) light for calibration. were thanked for their participation and dismissed. Calibration. Calibration using the Tobii Glasses is some- Next, participants were given instructions for their ﬁrstwhat diﬀerent from traditional calibration procedures em-ployed with table-mounted, ﬁxed, or more commonly knownas “remote” eye trackers. To calibrate the glasses, an IRmarker is used in calibration mode. The experimenter ﬁrstasks the participant to stand at a distance of 1 m from a ﬂat,vertical surface (e.g., a wall) and begins the calibration pro-cess using the Recording Assistant. The Recording Assistantthen displays a 3×3 grid of points to the experimenter, whomust position the IR marker at each corresponding point onthe wall. During this process, the participant is instructedto hold their head steady and follow the green light emittedby the IR marker with their eyes. Experimental Design. The experiment consisted of a 2(environment) × 2 (box type) × 2 (box placement) design.The environment was either the physical or virtual cerealshelf, the box type included two versions of a cereal box(Figure 3), and box placement featured the target box atone of two locations (left vs. right). A center target position (a) (b)was avoided it is likely to be ﬁxated ﬁrst . Each participant performed two trials, with environmentand box type reversed in the second trial, counterbalancing Figure 5: (a) Tobii Glasses, Recording Assistant,trial combinations. and (b) IR marker. Courtesy of Tobii Technology.
task. If their task was the physical space task, the researcher mersion and involvement subscales and three from the sen-showed the participant one of the two target boxes. The par- sory ﬁdelity and interface quality subscales. All questionsticipant was told that their task would be to ﬁnd this box were administered along a 7-point Likert scale. Questionson a physical shelf and verbally announce its price. They relating to non-visual senses were omitted.were given as much time as desired to examine their targetbox in as much detail as they wished (no participant spentmore than 30 s). The participant was also shown examples 4. RESULTSof the price tags’ appearance. They were then told the lo- Eye movement data in the form of numbers of ﬁxationscation of the physical shelf, and asked to walk directly to a and time to ﬁrst ﬁxation of the target AOI were exportedmarker on the ground (2.5 m (98 ) from the stimulus) be- from Tobii Studio for analysis with R .fore looking up at the shelf. When ready, they were asked A repeated-measures three-way ANOVA of time to ﬁrstto look straight ahead so the glasses could auto-adjust for ﬁxation revealed signiﬁcance of the main eﬀect of environ-recording to begin. Finally, the experimenter walked with ment (F(1,27) = 22.77, p 0.01). No other signiﬁcantthe participant to the shelf and recorded eye movements un- eﬀects (of box type or placement) were detected (see Fig-til the participant announced the price of the object. The ures 6(a) and 6(b)).physical shelving area was concealed from the participant A repeated-measures three-way ANOVA of the numberprior to this task, to avoid preview beneﬁt. of ﬁxations prior to the ﬁrst ﬁxation on the target also re- For the virtual space task, a similar procedure was fol- vealed signiﬁcance of the main eﬀect of environment (F(1,27)lowed, with the only diﬀerence being that the participant = 16.56, p 0.01) but not of box type or placement (seewas walked to a marker 2.5 m (98 ) from a projector wall, Figures 6(c) and 6(d)).and the image on the projector was changed from a blank Results from the modiﬁed Witmer-Singer Presence Ques-image to the stimulus image when the participant was ready. tionnaire were analyzed following Madathil and Greenstein’s After the ﬁrst task, the participant was given a custom- analytical approach, by ﬁrst computing the mean responsestailored Witmer-Singer presence questionnaire . The of questions related to each of the four subscales used andparticipant was given the option to remove the glasses while then comparing diﬀerences between each of these means (ofthey took the questionnaire if they felt uncomfortable wear- means) via a Welch two-sample t-test between physical anding them. Those who chose to remove them had to repeat virtual trials . No signiﬁcant diﬀerences were observedthe calibration procedure before the second task; however, between the means of any of the four subscales tested (seeonly one participant elected to do so. Participants were Table 1 and Figure 9). A trend toward higher perceivedthen given their second task, with the same instructions. ﬁdelity appears to point toward the physical environment,After completion of the second task, they were again given but, on average, the eﬀect is negligible. Furthermore, modalthe presence questionnaire, but told that it referred only to responses to the subjective post-experiment questionnairetheir experience in the second task (be it physical or vir- show neutral preferential attitudes to either of the physicaltual). Finally, the participant was given a post-experiment or virtual (projector) tasks (see Table 2).questionnaire to collect subjective information (e.g., com-fort) and any comments related to the study. 5. DISCUSSION Search in the environments was counterbalanced such that Results indicate that the physical environment aﬀordedhalf the participants searched within the physical environ- signiﬁcantly faster search performance than the virtual pro-ment ﬁrst and half ﬁrst searched in the virtual. Position of jected image. The eye tracking data provides clear evidencethe target box was also counterbalanced so that one quarter of the discrepancy in performance: because the number ofof the trials contained the target at left, another quarter at ﬁxations generally coincides with time taken to complete vi-right and vice versa (corresponding images of the physical sual search, it is clear that participants took longer in theenvironment were used in the virtual projection). virtual environment because they had to issue a larger num- ber of ﬁxations. This is visualized in Figure 8 and shows the3.1 Dependent Measures reason for the diﬀerence in time to task completion which Eye Tracking Metrics. The primary metric of interest might not have been evident had this been measured with awas time to ﬁrst ﬁxation on the target box. This metric ef- stopwatch (eye tracking data provides clear evidence of ac-fectively measures time to task completion, or performance tive visual search—participants were not simply daydream-of the task. Additionally, we measured the number of ﬁx- ing or staring at a ﬁxed point).ations prior to the ﬁrst ﬁxation on target. We considered, Eye movement data also suggests that individuals maybut rejected, other eye tracking metrics such as ﬁxation du- have approached the search task in a fundamentally diﬀer-ration. In this type of visual search task, a participant’s ent way over the projected image. Heatmap visualizationseye movements typically consist mostly of saccades until the of aggregated scanpaths are shown in Figure 7. Note thattarget is found. After the target is found, the number or du- the heavily ﬁxated regions in the four corners represent theration of ﬁxations on it give us no further information—we possible locations of the boxes—the image chosen for thewere mainly interested if the time to location of the target visualization is one of the layouts used in the experiment,diﬀered between environment types. it is used in Figure 7 as a representative for visualization Presence Questionnaire. A presence questionnaire, of aggregate data from all trials. In the virtual environ-based on Witmer and Singer’s version 3.0, tailored to the ment, it appears that most viewers may have begun theirpresent experiment, was used to gauge participants’ sub- search near the center, but there is no such obvious trend injective impressions of both environments, speciﬁcally along the physical environment. What is particularly interestingfour subscales: immersion, involvement, sensory ﬁdelity, and about this result is that Chandon et al. found that objectsinterface quality. Four questions were chosen from the im- located near the center of the “shelf” can be seen more often
Table 1: Mean responses to the tailored Witmer-Singer presence questionnaire, marked on a 7-point Likertscale with 1 indicating most negative agreement and 7 indicating most positive agreement to the givenquestion regarding experiences in either virtual or physical environment. # Question Physical Virtual environment environment Involvement 1. My interactions with the shelving environment seemed natural. 6.1 4.6 3. The visual aspects of the environment involved me. 5.8 5.1 8. I was able to completely survey or search the environment using vision. 6.4 6.2 11. I felt involved in the search task. 6.0 5.8 group means (means of means) 6.1 5.4 Immersion 2. All my senses were completely engaged. 4.6 3.8 4. I was completely aware of events occurring in the real world around me. 5.9 5.1 6. The information coming from my visual sense felt inconsistent or disconnected. 2.5 3.2 12. I was distracted by display devices. 2.9 3.2 group means (means of means) 4.0 3.8 Sensory Fidelity 7. My experiences with the shelving system seemed consistent with my real-world experience. 5.9 4.6 9. I felt that I was able to examine objects closely. 5.4 4.9 10. I felt that I was able to examine objects from multiple viewpoints. 4.5 3.9 group means (means of means) 5.3 4.5 Interface Quality 5. I was completely aware of any display and control devices. 5.9 5.1 13. Visual display quality interfered or distracted me from completing my task. 2.2 3.0 14. I was able to concentrate on the search task and not on the devices used to perform the task. 6.1 5.2 group means (means of means) 4.5 4.6Table 2: Modal responses to subjective post-experiment questions, marked on a 7-point Likert scale with 1indicating strong disagreement and 7 indicating strong agreement. # Question mode 1. The eye tracking glasses felt comfortable. 6 2. The eye tracking glasses distracted me and hindered my ability to perform my tasks. 1 3. I preferred the projector search task to the physical search task. 4 4. I understood what was expected of me in each task. 7 5. I preferred the physical search task to the projector search task. 4but not actually considered (for purchase) in corresponding home cinema projector), but these projectors are usuallypercentages. Their ﬁnding did not ﬁt with other data that “long-throw” projectors and would cause shadow interfer-suggested that attention correlates fairly well with consider- ence problems in the CUshop virtual shopping experienceation. Since they did not use an actual shelf in their study being constructed.(only a projected image), they speculated that this occurred What is curious in our study is the lack of perceived diﬀer-because people might tend to orient their attention to the ences in response to post-task presence and post-experimentcenter of an image during a transition increasing the number preference questionnaires. Figure 9 summarizes the dataof ﬁxations in the area (as is seen in Figures 7(b) and 8(b)). found in Table 1 and shows that while the physical envi-Our ﬁndings suggest that this might not occur as consis- ronment appears to have been rated slightly higher in termstently in physical environments. of the presence subscales, the diﬀerences, along with modal A key reason for the observed diﬀerence in visual search responses to preference, are negligible. The projected imageperformance may be the ﬁdelity of the projected scene. Al- may have failed to provide either physical realism (in whichthough we were careful to control for apparent image size, the image provides the same visual stimulation as scene) orthe projected image clearly diﬀers from its projected coun- photo-realism (in which the image produces the same visualterpart. The projectors oﬀer relatively poor brightness and response as the scene), but the image may have containedcontrast reproduction of the physical scene. The physical suﬃcient functional realism (in which the images providesscene is much richer in terms of visual elements (color gamut, the same visual information)  to perform the task, albeitcontrast, and visual depth). The human eye can perceive consistently more slowly (note that our data analysis per-a very high dynamic range contrast ratio, e.g., 100,000:1, tains to all successful trials).with static perception of about 10,000:1 at any given time.The projectors’ lumens rating of 2,500 and contrast ratioof 2,000:11 may have impeded visual search in comparison 6. CONCLUSIONto what was seen in the physical environment. Projectors Results were presented from a study comparing consumers’are available with greater contrast ratios and spatial resolu- visual behavior when searching for an item located on a vir-tion (e.g., 12,000:1, 1080p high-deﬁnition of the PowerLite tual or physical shelf. These indicate that the physical en- vironment aﬀorded signiﬁcantly faster search performance1 http://www.epson.com/brightlink than the virtual projected image. Eye tracking data corrob-
Time to 1st fixation on target 10 Time (in seconds; with SE) 8 6 4 (a) Physical environment 2 0 Physical Virtual Environment (a) Time to 1st fixation on target 10 (b) Virtual environment Time (in seconds; with SE) 8 6 Figure 7: Heatmaps (all participants) in either env. 4 2 0 Left Right Left Right Physical Virtual Target Placement × Environment (b) No. of fixations prior to 1st fixation on target (a) Physical environment 200 Fixation Count (with SE) 150 100 50 (b) Virtual environment 0 Physical Virtual Environment Figure 8: Scanpaths (all participants) in either env. (c) No. of fixations prior to 1st fixation on target Presence metrics 200 7 Likert score means of means (with SE) physical virtual 6 Fixation Count (with SE) 150 5 100 4 3 50 2 1 0 Left Right Left Right 0 Physical Virtual Involvement Immersion Sensory Interface Target Placement × Environment Fidelity Quality (d) Figure 9: Results: presence questionnaire.Figure 6: Results: performance and process metrics.
orates this ﬁnding by indicating a signiﬁcantly larger number Extended abstracts on human factors in computingof ﬁxations made over the virtual shelf. systems, pages 923–928, New York, NY, 2006. ACM. One reason for the observed diﬀerence in visual search per-  A. F. Kramer and J. S. McCarley. Oculomotorformance may be due to the poor ﬁdelity of the projected Behaviour as a Reﬂection of Attention and Memoryscene in comparison to the physical shelf. It is possible that Processes: Neural Mechanisms and Applications tothe projectors’ relatively low contrast ratio impeded visual Human Factors. Theoretical Issues in Ergonomicssearch. Better projectors and more photo-realistic simula- Science, 4(1–2):21–55, 2003.tions may improve congruence of eye movement metrics, but  M. F. Land and B. W. Tatler. Looking and Acting:one must also consider the overall environment in which the Vision and Eye Movements in Natural Behavior.participant is immersed. Advancements in other forms of Oxford University Press, New York, NY, 2009.simulation (automotive and ﬂight for instance) have come  E. Lundberg. Packaging Media Lab: A proposal to anot from improvements in visual quality (e.g., resolution, Packaging Evaluation Environment for Conductingcontrast), but from an expanded ﬁeld of view, realistic mo- Consumer Studies. Master’s thesis, Uppsalation, and sound. Although visual ﬁdelity will continue to University, Uppsala, Sweden, Sep. 2004.play a signiﬁcant role in the shopping simulation, the re-  K. C. Madathil and J. S. Greenstein. Synchronousmaining senses must also be addressed. We believe construc- Remote Usability Testing – A New Approachtion of a physical space ﬁlled with tactile objects, rich visual Facilitated by Virtual Worlds. In CHI ’11: Proceedingselements, and sounds, through which participants navigate, of the SIGCHI Conference on Human Factors inwill go a long way toward mitigating the sense of standing Computing Systems, New York, 2011. ACM Press.in front of a projection screen.  M. A. McDowell, C. D. Fryar, C. L. Ogden, and K. M. Physical shelves oﬀer a step closer towards physical real- Flegal. Anthropometric Reference Data for Childrenism, but they are costly to set up and to stock. If there are and Adults: United States, 2003–2006. Technicalsuﬃcient resources, such shelves oﬀer better ecological valid- report, National Health Statistics, 2008.ity. However, the lack of a perceived diﬀerence between theenvironments suggests that projected replicas may be suf-  U. Orth and K. Malkewitz. Holistic package designﬁcient for consumer testing (e.g., visual search) since they and consumer brand impressions. Journal ofprovide as much visual information. Our ﬁndings suggest Marketing, 72:64–81, 2008.that virtual presentation of the stimulus oﬀers a viable al-  E. J. Russo and F. LeClerc. An eye-ﬁxation analysis ofternative to a physical mock-up so long as one maintains choice processes for consumer nondurables. Journal ofawareness of the potential eﬀect on performance in relation Consumer Research, 21(2):274, 1994.to performance in the ﬁeld. If the eﬀect is consistent, how-  D. S. Tan, D. Gergle, P. G. Scupelli, and R. Pausch.ever, then relative measurements of performance within the Physically large displays improve performance onvirtual environments are still likely to be valid. spatial tasks. In ACM Transactions on Computer - Human Interaction, volume 13, pages 71–99, New York, NY, 2006. ACM.Acknowledgments  J. Tornros. Driving behavior in a real and simulatedWe would like to thank Andrew Hurley and his Spring 2011 road tunnel – A validation study. Accident Analysiscreative inquiry team for their help in designing and brand- and Prevention, 30(4):497–503, 1998.ing the CUshop consumer experience lab. We are also grate-  R. Underwood, N. Klein, and R. Burke. Packagingful to Harris A. Smith for his generous support of the lab. communication: Attentional eﬀects of product imagery. Journal of Product and Brand Packaging,7. REFERENCES 10(7):403–422(20), 2001.  J. Baron and Y. Li. Notes on the use of R for  M. Wedel and R. Pieters. A review of eye-tracking psychology experiments and questionnaires. Online research in marketing. In Review of Marketing Notes, 09 November 2007. URL: http://www.psych. Research. Emerald Group, Bingley, UK, 2008. upenn.edu/~baron/rpsych/rpsych.html.  S. L. Whitney, P. J. Sparto, L. F. Hodges, S. V. Babu,  P. Chandon, J. W. Hutchinson, E. T. Bradlow, and J. M. Furman, and M. S. Redfern. Responses to a S. Young. Visual Marketing: From Attention to Virtual Reality Grocery Store in Persons with and Action. Lawrence Erlbaum Assoc., Mahwah, NJ, 2007. without Vestibular Dysfunction. CyberPsychology  P. Chandon, J. W. Hutchinson, E. T. Bradlow, and Behavior, 9(2), 2006. S. Young. Does in-store marketing work? Eﬀects of  B. G. Witmer and M. J. Singer. Measuring Presence the number and position of shelf facings on brand in Virtual Environments: A Presence Questionnaire. attention and evaluation at the point of purchase. Presence, 7(3):225–240, June 1998. Journal of Marketing, 73:1–17, 2009.  D. S. Wooding. Fixation Maps: Quantifying  J. Clement. Visual inﬂuence on in-store buying Eye-Movement Traces. In ETRA ’02: Proceedings of decisions: an eye-track experiment on the visual the 2002 Symposium on Eye Tracking Research inﬂuence of packaging design. Journal of Marketing Applications, pages 31–36, New York, NY, 2002. ACM. Management, 23(9):917 — 928, 2007.  S. Young. Packaging design, consumer research, and  J. A. Ferwerda. Three Varieties of Realism in business strategy: The march toward accountability. Computer Graphics. In Human Vision and Electronic Design Management Journal, 10(3):10–14, 2002. Imaging, pages 290–297, Bellingham, WA, 2003. SPIE.  S. Young. Five principles for eﬀective paging research.  S. A. Johansen and J. P. Hansen. Do we need eye Brand Packaging, 18(1):24–26, 2005. trackers to tell where people look? In CHI’06