VISUAL ANALYSIS OF
MASSIVE WEB SESSION DATA
Jack Shen
Jack Shen
¨  Architect @ Behavior and Product Experience, eBay
¨  Previously
¤  Research Scientist @ eBay Research Labs
¤  Ph.D. in Information Visualization @ UC Davis
¨  Blog (in Chinese) : www.vizinsight.com
¨  zqshen@gmail.com
Visual Analysis of Massive Web Session Data
Zeqian Shen, Jishang Wei, Neel Sundaresan, and Kwan-Liu Ma,
In IEEE Symposium on Large Data Analysis and Visualization 2012
eBay Marketplace
¨  Connecting buyers and sellers worldwide
¤  Hundreds of millions of people visit every day
¤  Millions of items are posted and purchased every day
¨  Understanding user experience is crucial for
improving our service.
Analyzing Web Session Data
¨  Sessions and events
¨  Visual analytics enables exploration and discovery.
¨  Challenges:
¤  Large-scale data: hundreds of millions of sessions, 2TB
per day.
¤  Structural complexity
Real-world Scenarios for Analyzing
User Experience
¨  Goal: Improve the success rates of the key flows
¨  Study the correlations between user behavior
patterns in the flow and the success rates.
¤  The success rates of various event sequence patterns,
i.e., ordering of events and elapsed time.
¤  The inbound/outbound, elapsed time and other statistics
of an event and its correlation with the success rate.
¤  Analysts conduct the analysis iteratively.
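A minimal sketch of the first analysis above: the success rate per event ordering. The session data, the event names, and the function `success_rate_by_pattern` are hypothetical and only illustrate the idea; they are not taken from the eBay system.

```python
from collections import defaultdict

# Hypothetical sessions: each is a list of (event_name, seconds_since_start).
sessions = [
    [("ViewCart", 0), ("Checkout", 12), ("Pay", 40)],
    [("ViewCart", 0), ("Checkout", 15)],
    [("ViewCart", 0), ("Checkout", 9), ("Pay", 31)],
    [("ViewCart", 0), ("EditCart", 20), ("Checkout", 50)],
]


def success_rate_by_pattern(sessions, success_event):
    """Map each event ordering (up to the success event) to (count, success rate)."""
    totals = defaultdict(int)
    successes = defaultdict(int)
    for events in sessions:
        names = [name for name, _ in events]
        succeeded = success_event in names
        if succeeded:
            # Keep only the events that led up to the success event.
            names = names[: names.index(success_event)]
        pattern = tuple(names)
        totals[pattern] += 1
        successes[pattern] += succeeded
    return {p: (totals[p], successes[p] / totals[p]) for p in totals}


rates = success_rate_by_pattern(sessions, "Pay")
for pattern, (count, rate) in rates.items():
    print(" > ".join(pattern), count, round(rate, 2))
```

Here a session counts as successful if it contains the assumed success event `Pay`; a real flow would define success according to the flow being studied.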
System Design
Two-tier visualization system
Tier 1: Data filtering on Hadoop
Tier 2: In-memory visualization, real-time interactive exploration
Data Filtering
Mobius, a Hadoop-based large-scale data analytics platform,
extracts event sequences based on user input.
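The filtering step can be sketched in plain Python. This is not the Mobius API, which the slides do not show. The function `extract_flow_sequences`, its parameters, and the sample events are assumptions for illustration. In the real system this logic would run as a Hadoop job over the full session logs.

```python
def extract_flow_sequences(sessions, start_event, end_events):
    """Return, for each session containing start_event, the events from the
    first start_event through the first following end event (or session end).
    Timestamps are rebased so the start event is at 0 seconds."""
    result = []
    for events in sessions:
        names = [name for name, _ in events]
        if start_event not in names:
            continue  # session never entered the flow
        begin = names.index(start_event)
        t0 = events[begin][1]
        seq = []
        for name, ts in events[begin:]:
            seq.append((name, ts - t0))
            if name in end_events:
                break
        result.append(seq)
    return result


# Hypothetical sessions: lists of (event_name, seconds_since_session_start).
sessions = [
    [("Home", 0), ("ViewCart", 5), ("Checkout", 20), ("Pay", 50), ("Home", 60)],
    [("Home", 0), ("Search", 3)],
    [("ViewCart", 2), ("Checkout", 10)],
]
flows = extract_flow_sequences(sessions, "ViewCart", {"Pay"})
print(flows)
```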
Visualization
Grouping the event sequences by ordering of the events and
elapsed time
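One way to picture this grouping is a prefix tree whose keys combine the event name with a bucketed elapsed time. The sketch below assumes that structure; the `Node` class, `build_tree`, and the sample sequences are hypothetical, and the slides do not confirm the system's actual data structure beyond calling it a tree.

```python
class Node:
    """A tree node holding how many sequences pass through it."""

    def __init__(self):
        self.count = 0
        self.children = {}


def build_tree(sequences, bucket_seconds):
    """Group sequences by event order and bucketed time since the previous event."""
    root = Node()
    for seq in sequences:
        node = root
        node.count += 1
        prev_ts = None
        for name, ts in seq:
            elapsed = 0 if prev_ts is None else ts - prev_ts
            key = (name, elapsed // bucket_seconds)
            node = node.children.setdefault(key, Node())
            node.count += 1
            prev_ts = ts
    return root


# Hypothetical sequences: lists of (event_name, seconds_since_flow_start).
sequences = [
    [("A", 0), ("B", 3)],
    [("A", 0), ("B", 7)],
    [("A", 0), ("C", 4)],
]
coarse = build_tree(sequences, bucket_seconds=10)
fine = build_tree(sequences, bucket_seconds=5)
```

With a 10-second bucket the two `A, B` sequences share one branch. With a 5-second bucket they split, because their elapsed times fall into different buckets.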
Data Sizes at Different Tiers
¨  Tier 1: 2TB on Hadoop, hundreds of millions of
sessions.
¨  Tier 2: ~50MB, 200K sequences (Checkout)
¤  In memory tree: ~30MB, bucket size for elapsed time: 1
second.
¤  Visualizing: ~5MB, 67K sequences, bucket size for
elapsed time: 10 seconds.
User Interface
User Interactions: Highlighting
User Interactions: Aligning
Rebuilding the tree is needed
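The slide does not say how aligning is done. One plausible reading is that sequences are re-anchored at a chosen event, so the grouping tree has to be rebuilt from that event outward. The sketch below follows that assumption only; `align_on_event` and the sample data are hypothetical.

```python
def align_on_event(sequences, pivot):
    """Split each sequence at the first occurrence of `pivot`.

    Returns two lists of sequences with times measured relative to the pivot:
    `before` is ordered from the pivot backward, `after` from the pivot forward.
    Sequences that never contain the pivot are skipped.
    """
    before, after = [], []
    for seq in sequences:
        names = [name for name, _ in seq]
        if pivot not in names:
            continue
        i = names.index(pivot)
        t_pivot = seq[i][1]
        before.append([(n, t_pivot - t) for n, t in reversed(seq[:i])])
        after.append([(n, t - t_pivot) for n, t in seq[i + 1:]])
    return before, after


# Hypothetical sequences: lists of (event_name, seconds_since_flow_start).
sequences = [
    [("A", 0), ("B", 5), ("C", 9)],
    [("B", 0), ("C", 2)],
    [("A", 0), ("C", 1)],
]
before, after = align_on_event(sequences, "B")
```

Under this reading, each of the two lists would feed a fresh tree build, which is why a rebuild is required whenever the alignment event changes.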
User Interactions: Zooming
Changing the bucket size for elapsed time to do semantic zooming
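A small sketch of why the bucket size acts as a zoom level: coarser buckets merge sequences that differ only slightly in timing, so fewer groups remain to draw. The function `group_patterns` and the sample data are hypothetical.

```python
from collections import Counter


def group_patterns(sequences, bucket_seconds):
    """Count sequences per pattern of (event, bucketed gap since previous event)."""
    groups = Counter()
    for seq in sequences:
        pattern = tuple((name, gap // bucket_seconds) for name, gap in seq)
        groups[pattern] += 1
    return groups


# Hypothetical sequences: lists of (event_name, seconds_since_previous_event).
sequences = [
    [("A", 0), ("B", 3)],
    [("A", 0), ("B", 7)],
    [("A", 0), ("B", 12)],
    [("A", 0), ("C", 4)],
]
for bucket in (1, 10, 60):
    print(bucket, len(group_patterns(sequences, bucket)))
```

On this toy data the number of groups drops from 4 to 3 to 2 as the bucket grows from 1 to 10 to 60 seconds.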
Case Study: Category Selection in
Listing Process
[Flow diagram: Category Selection (Browse Category, Catalog Search, Search Category), Describe Item, Review, Enhance, Congratulation]
Process for sellers listing an item on eBay
Case Study: Category Selection in
Listing Process
Understand behavior patterns of sellers who chose to find a
category through browsing.
Case Study: Category Selection in
Listing Process
Understand the impact of catalog based category selection on
user experience in listing.
Feedback
¨  The system is useful for both novice and expert
users.
¨  The response latency in data filtering is acceptable.
¨  The visualization and interaction are intuitive and
useful.
Summary
¨  Introduced a two-tier visual analytics system for
analyzing user temporal behavior patterns.
¨  Discussed the process and principles of designing
such a large-scale visualization system:
¤  Design the system in tiers based on real-world
scenarios.
¤  Moving up the tiers: less data and computation, more
interactivity.
Questions?
zqshen@gmail.com
