SlideShare a Scribd company logo
1 of 26
© 2015 Uncharted Software Inc.
http://pantera.io http://influent.io
© 2015 Uncharted Software Inc.
Visualizing
Graphs as Maps
Communities
Relationships
Spatial Model
Bollen J, Van de Sompel H, Hagberg A, Bettencourt L, Chute R, et al. (2009)
Clickstream Data Yields High-Resolution Maps of Science. PLoS ONE 4(3): e4803.
doi:10.1371/journal.pone.0004803
© 2015 Uncharted Software Inc.
Visualizing
Graphs as Maps
Communities
Relationships
Spatial Model
Bollen J, Van de Sompel H, Hagberg A, Bettencourt L, Chute R, et al. (2009)
Clickstream Data Yields High-Resolution Maps of Science. PLoS ONE 4(3): e4803.
doi:10.1371/journal.pone.0004803
© 2015 Uncharted Software Inc.
Challenges of
Big Graphs
Computation
Layout
Interaction
Labeling
© 2015 Uncharted Software Inc.
Chelsea FC Fan Communities
248,747,072 tweets
554,430 account nodes
100,700 mention links
Demo
skip
© 2015 Uncharted Software Inc.
Geo Communities
Fans Mention Graph
© 2015 Uncharted Software Inc.
Geo Communities
Fans Mention Graph
© 2015 Uncharted Software Inc.
Geo Communities
Player Sentiment
© 2015 Uncharted Software Inc.
Geo Communities
Trending Topics + Drilldown
© 2015 Uncharted Software Inc.
Community Graph
© 2015 Uncharted Software Inc.
Community Graph
© 2015 Uncharted Software Inc.
Graph Tiling Pipeline
Hierarchical
Community
Clustering
Hierarchical
Graph Layout
Tile Generation
GraphX
Distributed Louvain
Modularity
Hierarchical
Force Directed
© 2015 Uncharted Software Inc.
Graph Tiling Pipeline
Hierarchical
Community
Clustering
Hierarchical
Graph Layout
Tile Generation
GraphX
Distributed Louvain
Modularity
Hierarchical
Force Directed
© 2015 Uncharted Software Inc.
Graph Tiling Pipeline
Hierarchical
Community
Clustering
Hierarchical
Graph Layout
Tile Generation
GraphX
Distributed Louvain
Modularity
Hierarchical
Force Directed
© 2015 Uncharted Software Inc.
Graph Tiling Pipeline
Hierarchical
Community
Clustering
Hierarchical
Graph Layout
Tile Generation
GraphX
Distributed Louvain
Modularity
Hierarchical
Force Directed
© 2015 Uncharted Software Inc.
Product Affinity
Stanford collection of anonymized Amazon product reviews and
“customers who bought this also bought…” over 9 years
2,372,409 product and customer nodes
9,909,551 review and co-purchase links
© 2015 Uncharted Software Inc.
© 2015 Uncharted Software Inc.
© 2015 Uncharted Software Inc.
© 2015 Uncharted Software Inc.
© 2015 Uncharted Software Inc.
© 2015 Uncharted Software Inc.
© 2015 Uncharted Software Inc.
Meeting the
Challenges of
Big Graphs
Computation
Layout
Interaction
Labeling
Distributed Computing
Hierarchical Communities
Level-of-detail Rendering
Layered Analytics
© 2015 Uncharted Software Inc.
Other Applications
Bitcoin Network Doctor Referrals Neural Networks
© 2015 Uncharted Software Inc.
Thanks!
Rob Harper
rharper@uncharted.software
@rdharper

More Related Content

Viewers also liked

Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart KING
 
Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart KING
 
Anatomy of Eyelids & Its Clinical Correlations
Anatomy of Eyelids & Its Clinical CorrelationsAnatomy of Eyelids & Its Clinical Correlations
Anatomy of Eyelids & Its Clinical CorrelationsSarmila Acharya
 
Boimetric using 8051
Boimetric  using 8051 Boimetric  using 8051
Boimetric using 8051 Shilpa Nayak
 
RECHECK - Електронен Етикет
RECHECK - Електронен Етикет RECHECK - Електронен Етикет
RECHECK - Електронен Етикет recheck
 
Documentación requerida para la aplicación de la norma ISO 14000
Documentación requerida para la aplicación de la norma ISO 14000Documentación requerida para la aplicación de la norma ISO 14000
Documentación requerida para la aplicación de la norma ISO 14000alfredoluis27
 
Eco 405 week 10 quiz strayer
Eco 405 week 10 quiz   strayerEco 405 week 10 quiz   strayer
Eco 405 week 10 quiz strayerMaryLouis2018
 
Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart KING
 

Viewers also liked (8)

Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart
 
Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart
 
Anatomy of Eyelids & Its Clinical Correlations
Anatomy of Eyelids & Its Clinical CorrelationsAnatomy of Eyelids & Its Clinical Correlations
Anatomy of Eyelids & Its Clinical Correlations
 
Boimetric using 8051
Boimetric  using 8051 Boimetric  using 8051
Boimetric using 8051
 
RECHECK - Електронен Етикет
RECHECK - Електронен Етикет RECHECK - Електронен Етикет
RECHECK - Електронен Етикет
 
Documentación requerida para la aplicación de la norma ISO 14000
Documentación requerida para la aplicación de la norma ISO 14000Documentación requerida para la aplicación de la norma ISO 14000
Documentación requerida para la aplicación de la norma ISO 14000
 
Eco 405 week 10 quiz strayer
Eco 405 week 10 quiz   strayerEco 405 week 10 quiz   strayer
Eco 405 week 10 quiz strayer
 
Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart Doedag DA2020 Venlo 9 maart
Doedag DA2020 Venlo 9 maart
 

Similar to Untangling Influence and Desire: Visual Analysis of Massive Data

Starling Report
Starling Report  Starling Report
Starling Report Gary Henkle
 
Enhancing user engagement on mobile devices
Enhancing user engagement on mobile devicesEnhancing user engagement on mobile devices
Enhancing user engagement on mobile devicesRandall Arnold
 
Next Generation Devices
Next Generation DevicesNext Generation Devices
Next Generation DevicesGavin Bridge
 
Carri Bugbee on Social TV
Carri Bugbee on Social TVCarri Bugbee on Social TV
Carri Bugbee on Social TVCarri Bugbee
 
Data 2.0 - Harnessing New Data Visualization Tools CIL 2008
Data 2.0 - Harnessing New Data Visualization Tools CIL 2008Data 2.0 - Harnessing New Data Visualization Tools CIL 2008
Data 2.0 - Harnessing New Data Visualization Tools CIL 2008Darlene Fichter
 
The Data Behind Distributed Content
The Data Behind Distributed ContentThe Data Behind Distributed Content
The Data Behind Distributed ContentDan Valente
 
The Social Media Renaissance
The Social Media RenaissanceThe Social Media Renaissance
The Social Media Renaissancejawadshuaib
 
The Convergence of TV and Tablet Experiences by mPortal
The Convergence of TV and Tablet Experiences by mPortalThe Convergence of TV and Tablet Experiences by mPortal
The Convergence of TV and Tablet Experiences by mPortalmPortal, Inc.
 
TV and the Second Screen: A Red Bee Media and Decipher white paper
TV and the Second Screen: A Red Bee Media and Decipher white paperTV and the Second Screen: A Red Bee Media and Decipher white paper
TV and the Second Screen: A Red Bee Media and Decipher white paperMIP Markets
 
Rich Internet Applications
Rich Internet ApplicationsRich Internet Applications
Rich Internet ApplicationsDr. V Vorvoreanu
 
Healthy City Community Planning and Development webinar
Healthy City Community Planning and Development webinarHealthy City Community Planning and Development webinar
Healthy City Community Planning and Development webinarHealthy City
 
Keynote: Riding the Digital Wave
Keynote: Riding the Digital WaveKeynote: Riding the Digital Wave
Keynote: Riding the Digital WaveMerging Media
 
Which way up? Drawing and reading maps of the blogosphere
Which way up? Drawing and reading maps of the blogosphereWhich way up? Drawing and reading maps of the blogosphere
Which way up? Drawing and reading maps of the blogosphereTim Highfield
 
DIVE+ @PATCH2015 Workshop @IUI2015
DIVE+ @PATCH2015 Workshop @IUI2015DIVE+ @PATCH2015 Workshop @IUI2015
DIVE+ @PATCH2015 Workshop @IUI2015Lora Aroyo
 
Eavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging SiteEavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging SiteShalin Hai-Jew
 
Paid TV's Uncertain Future
Paid TV's Uncertain FuturePaid TV's Uncertain Future
Paid TV's Uncertain FutureYankee Group
 
Oscon Presentation.Computational Journalism
Oscon Presentation.Computational JournalismOscon Presentation.Computational Journalism
Oscon Presentation.Computational Journalismbradstenger
 
Data as a Creative Material
Data as a Creative MaterialData as a Creative Material
Data as a Creative MaterialAudree Lapierre
 

Similar to Untangling Influence and Desire: Visual Analysis of Massive Data (20)

Starling Report
Starling Report  Starling Report
Starling Report
 
Enhancing user engagement on mobile devices
Enhancing user engagement on mobile devicesEnhancing user engagement on mobile devices
Enhancing user engagement on mobile devices
 
Next Generation Devices
Next Generation DevicesNext Generation Devices
Next Generation Devices
 
Carri Bugbee on Social TV
Carri Bugbee on Social TVCarri Bugbee on Social TV
Carri Bugbee on Social TV
 
Data 2.0 - Harnessing New Data Visualization Tools CIL 2008
Data 2.0 - Harnessing New Data Visualization Tools CIL 2008Data 2.0 - Harnessing New Data Visualization Tools CIL 2008
Data 2.0 - Harnessing New Data Visualization Tools CIL 2008
 
The Data Behind Distributed Content
The Data Behind Distributed ContentThe Data Behind Distributed Content
The Data Behind Distributed Content
 
The Social Media Renaissance
The Social Media RenaissanceThe Social Media Renaissance
The Social Media Renaissance
 
The Convergence of TV and Tablet Experiences by mPortal
The Convergence of TV and Tablet Experiences by mPortalThe Convergence of TV and Tablet Experiences by mPortal
The Convergence of TV and Tablet Experiences by mPortal
 
TV and the Second Screen: A Red Bee Media and Decipher white paper
TV and the Second Screen: A Red Bee Media and Decipher white paperTV and the Second Screen: A Red Bee Media and Decipher white paper
TV and the Second Screen: A Red Bee Media and Decipher white paper
 
Rich Internet Applications
Rich Internet ApplicationsRich Internet Applications
Rich Internet Applications
 
Healthy City Community Planning and Development webinar
Healthy City Community Planning and Development webinarHealthy City Community Planning and Development webinar
Healthy City Community Planning and Development webinar
 
data, big data, open data
data, big data, open datadata, big data, open data
data, big data, open data
 
Keynote: Riding the Digital Wave
Keynote: Riding the Digital WaveKeynote: Riding the Digital Wave
Keynote: Riding the Digital Wave
 
Which way up? Drawing and reading maps of the blogosphere
Which way up? Drawing and reading maps of the blogosphereWhich way up? Drawing and reading maps of the blogosphere
Which way up? Drawing and reading maps of the blogosphere
 
DIVE+ @PATCH2015 Workshop @IUI2015
DIVE+ @PATCH2015 Workshop @IUI2015DIVE+ @PATCH2015 Workshop @IUI2015
DIVE+ @PATCH2015 Workshop @IUI2015
 
Eavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging SiteEavesdropping on the Twitter Microblogging Site
Eavesdropping on the Twitter Microblogging Site
 
Paid TV's Uncertain Future
Paid TV's Uncertain FuturePaid TV's Uncertain Future
Paid TV's Uncertain Future
 
Introducing telemetrics
Introducing telemetricsIntroducing telemetrics
Introducing telemetrics
 
Oscon Presentation.Computational Journalism
Oscon Presentation.Computational JournalismOscon Presentation.Computational Journalism
Oscon Presentation.Computational Journalism
 
Data as a Creative Material
Data as a Creative MaterialData as a Creative Material
Data as a Creative Material
 

More from Turi, Inc.

Webinar - Analyzing Video
Webinar - Analyzing VideoWebinar - Analyzing Video
Webinar - Analyzing VideoTuri, Inc.
 
Webinar - Patient Readmission Risk
Webinar - Patient Readmission RiskWebinar - Patient Readmission Risk
Webinar - Patient Readmission RiskTuri, Inc.
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Turi, Inc.
 
Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Turi, Inc.
 
Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Turi, Inc.
 
Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Turi, Inc.
 
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsScaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsTuri, Inc.
 
Pattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataPattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataTuri, Inc.
 
Intelligent Applications with Machine Learning Toolkits
Intelligent Applications with Machine Learning ToolkitsIntelligent Applications with Machine Learning Toolkits
Intelligent Applications with Machine Learning ToolkitsTuri, Inc.
 
Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine LearningTuri, Inc.
 
Machine Learning with GraphLab Create
Machine Learning with GraphLab CreateMachine Learning with GraphLab Create
Machine Learning with GraphLab CreateTuri, Inc.
 
Machine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesMachine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesTuri, Inc.
 
Machine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinMachine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinTuri, Inc.
 
Scalable data structures for data science
Scalable data structures for data scienceScalable data structures for data science
Scalable data structures for data scienceTuri, Inc.
 
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Turi, Inc.
 
Introduction to Recommender Systems
Introduction to Recommender SystemsIntroduction to Recommender Systems
Introduction to Recommender SystemsTuri, Inc.
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in productionTuri, Inc.
 
Overview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringOverview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringTuri, Inc.
 
Building Personalized Data Products with Dato
Building Personalized Data Products with DatoBuilding Personalized Data Products with Dato
Building Personalized Data Products with DatoTuri, Inc.
 

More from Turi, Inc. (20)

Webinar - Analyzing Video
Webinar - Analyzing VideoWebinar - Analyzing Video
Webinar - Analyzing Video
 
Webinar - Patient Readmission Risk
Webinar - Patient Readmission RiskWebinar - Patient Readmission Risk
Webinar - Patient Readmission Risk
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)
 
Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)
 
Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)
 
Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)
 
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsScaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
 
Pattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataPattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log Data
 
Intelligent Applications with Machine Learning Toolkits
Intelligent Applications with Machine Learning ToolkitsIntelligent Applications with Machine Learning Toolkits
Intelligent Applications with Machine Learning Toolkits
 
Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine Learning
 
Machine Learning with GraphLab Create
Machine Learning with GraphLab CreateMachine Learning with GraphLab Create
Machine Learning with GraphLab Create
 
Machine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesMachine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive Services
 
Machine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinMachine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos Guestrin
 
Scalable data structures for data science
Scalable data structures for data scienceScalable data structures for data science
Scalable data structures for data science
 
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
 
Introduction to Recommender Systems
Introduction to Recommender SystemsIntroduction to Recommender Systems
Introduction to Recommender Systems
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in production
 
Overview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringOverview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature Engineering
 
SFrame
SFrameSFrame
SFrame
 
Building Personalized Data Products with Dato
Building Personalized Data Products with DatoBuilding Personalized Data Products with Dato
Building Personalized Data Products with Dato
 

Recently uploaded

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 

Recently uploaded (20)

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 

Untangling Influence and Desire: Visual Analysis of Massive Data

  • 1.
  • 2. © 2015 Uncharted Software Inc. http://pantera.io http://influent.io
  • 3. © 2015 Uncharted Software Inc. Visualizing Graphs as Maps Communities Relationships Spatial Model Bollen J, Van de Sompel H, Hagberg A, Bettencourt L, Chute R, et al. (2009) Clickstream Data Yields High-Resolution Maps of Science. PLoS ONE 4(3): e4803. doi:10.1371/journal.pone.0004803
  • 4. © 2015 Uncharted Software Inc. Visualizing Graphs as Maps Communities Relationships Spatial Model Bollen J, Van de Sompel H, Hagberg A, Bettencourt L, Chute R, et al. (2009) Clickstream Data Yields High-Resolution Maps of Science. PLoS ONE 4(3): e4803. doi:10.1371/journal.pone.0004803
  • 5. © 2015 Uncharted Software Inc. Challenges of Big Graphs Computation Layout Interaction Labeling
  • 6. © 2015 Uncharted Software Inc. Chelsea FC Fan Communities 248,747,072 tweets 554,430 account nodes 100,700 mention links Demo skip
  • 7. © 2015 Uncharted Software Inc. Geo Communities Fans Mention Graph
  • 8. © 2015 Uncharted Software Inc. Geo Communities Fans Mention Graph
  • 9. © 2015 Uncharted Software Inc. Geo Communities Player Sentiment
  • 10. © 2015 Uncharted Software Inc. Geo Communities Trending Topics + Drilldown
  • 11. © 2015 Uncharted Software Inc. Community Graph
  • 12. © 2015 Uncharted Software Inc. Community Graph
  • 13. © 2015 Uncharted Software Inc. Graph Tiling Pipeline Hierarchical Community Clustering Hierarchical Graph Layout Tile Generation GraphX Distributed Louvain Modularity Hierarchical Force Directed
  • 14. © 2015 Uncharted Software Inc. Graph Tiling Pipeline Hierarchical Community Clustering Hierarchical Graph Layout Tile Generation GraphX Distributed Louvain Modularity Hierarchical Force Directed
  • 15. © 2015 Uncharted Software Inc. Graph Tiling Pipeline Hierarchical Community Clustering Hierarchical Graph Layout Tile Generation GraphX Distributed Louvain Modularity Hierarchical Force Directed
  • 16. © 2015 Uncharted Software Inc. Graph Tiling Pipeline Hierarchical Community Clustering Hierarchical Graph Layout Tile Generation GraphX Distributed Louvain Modularity Hierarchical Force Directed
  • 17. © 2015 Uncharted Software Inc. Product Affinity Stanford collection of anonymized Amazon product reviews and “customers who bought this also bought…” over 9 years 2,372,409 product and customer nodes 9,909,551 review and co-purchase links
  • 18. © 2015 Uncharted Software Inc.
  • 19. © 2015 Uncharted Software Inc.
  • 20. © 2015 Uncharted Software Inc.
  • 21. © 2015 Uncharted Software Inc.
  • 22. © 2015 Uncharted Software Inc.
  • 23. © 2015 Uncharted Software Inc.
  • 24. © 2015 Uncharted Software Inc. Meeting the Challenges of Big Graphs Computation Layout Interaction Labeling Distributed Computing Hierarchical Communities Level-of-detail Rendering Layered Analytics
  • 25. © 2015 Uncharted Software Inc. Other Applications Bitcoin Network Doctor Referrals Neural Networks
  • 26. © 2015 Uncharted Software Inc. Thanks! Rob Harper rharper@uncharted.software @rdharper

Editor's Notes

  1. At Uncharted specializes in creating visual analytic solutions for a variety of industries. Today drawing on elements from a few of the projects we’re working on, namely: PanTera for large multidimensional and geo data and Influent for big graph data Also be pulling from related research and open source initiatives being developed as part of a DARPA program for reasoning over big data
  2. There are many reasons to visualize node-link graph data. Often the need stems from assessing believability or trust in an ML computed answer, or seeking context for it, looking for nuance or patterns that may have been missed. But beyond that, what does visual analysis entail? Frequently we’re looking for clusters and communities; groups of nodes that are closely connected and may suggest similarity or affinity. We also look for the relationships / structures between nodes and communities …
  3. In this map of science, what is it that connects organic chemistry to international studies? Ultimately we’re creating a spatial representation or a 2d map of a complex dataset. This helps us, as spatial thinkers, build mental models of how the data is organized. We use our well-developed visual perception to see anomalies and patterns that may beg further questions or lead to new insights.
  4. As the volume of data that we’re collecting and analyzing grows, millions of nodes or billions of edges, actually seeing our graphs becomes increasingly problematic. The computational cost of laying out and rendering the graph becomes prohibitive Layouts often results in an unresolved hairball that reveals little The size and cost leads to our interactions being limited And ineffective labeling makes it hard to understand the nature of communities and relationships
  5. Look at a graph of Twitter users who tweeted about Chelsea FC soccer team during the 2014 season. Demo:
  6. Start by side-stepping two of the issues the plague big graphs – computation cost of layout and hairball results. Do this by using the actual geo-locations of the nodes (the users) as our layout. This is visualizing a graph as a map in the most literal sense and focuses on spatial communities and patterns. The arcs we see are twitter mentions between users and are drawn in a clockwise fashion to show link directionality. At this level we get an immediate sense of high-level geo structure of the data – large community in england but also high engagement in Spain, and West Africa.
  7. To address render time and interactivity costs, we use a heatmap aggregate view of all links and show appropriate level of detail at each scale. Adopting a Google Maps paradigm we serve only the data necessary for a given view keeping render speed high in the browser. So at this level our focus is on link patterns and many are immediately obvious - such as a high one-sided volume of traffic from Guatemala to Spain, also England to West Africa. And I can drill down from the global to the local level, revealing patterns at different scales.
  8. But we also need to address labeling - ways of understanding the character of the communities we see (beyond geography). Here is a layer showing player mention sentiment by region
  9. Another with trending topics that may help characterize the communities we see. In this case we see a very specific hashtag “we need west cumberland hospital” common to the users in this region. Drilling down reveals a single-day campaign around a local issue.
  10. Here we’ve plotted the same graph but laid out based on network structure. Many of the elements remain the same as the geo layout: directional arc links and approach to rendering. At this zoom level some of the most prominent communities are outlined and labeled. Just like zooming in a map reviews finer details of cities and streets, zooming here reveals additional communities and labels. Spatial proximity between clusters helps show us strength of relationship and the heatmap of links gives us a sense of structure.
  11. Approach the labeling issue by layering in additional information to help understand the nature of our clusters. In this case we have a cluster centered around BBC Sports that also has strong mention of the soap opera “Coronation Street”. Manchester United appears as a common theme in this region of the graph.
  12. Technical Approach: We start with raw source data and run it through a series of steps to produce a set of web-map-style data tiles for consumption by the client.
  13. We first detect communities in the graph. We apply a distributed Louvain Modularity algorithm which starts with the raw graph and detects increasingly higher-level community structures. We address the typically large computational costs of this by using the distributed computing platform Apache Spark using the GraphX library.
  14. We then take these communities and run a hierarchical graph layout algorithm, again using Spark. This starts by laying out the highest level aggregate community graph with a distributed force-directed layout. Once the top level communities are positioned, the next level down is laid out within the constraints of the parent communities, and so-on down to the raw nodes. In this manner we are addressing two of the challenges of big graph visualization that we avoided in the geo-case: - Computation cost, as hierarchical layout is easily data parallelizable across a cluster of computers. - Furthermore, this community-based layout is good at reducing hairball results by pulling apart communities and focusing on relationships between them.
  15. In the final stage again we borrow concepts web geographic maps and apply a “data tile” approach - the information from earlier stages is stored using a hierarchical division of space. Again, we’re addressing interaction and labeling here by efficiently delivering appropriate level-of-detail and domain-specific data representations to the client.
  16. Final Demo: This application looks at a far larger graph - a 2M node, 10M link graph of Amazon products, purchases, and reviews.
  17. We use the same community-based approach to big graph layout. Here Amazon products are our nodes and links are co-review and co-purchases. Communities tend to suggest product affinity. Despite an order of magnitude increase in data size, the speed and interactivity remains the same. But one issue I didn’t discuss with the twitter graph was how we can navigate the graph at the individual node level.
  18. By selecting a node or even a cluster of interest we can switch to a different view of the data – an interactive lens into the graph with the ability to trace a path through links of interest either individually or in aggregate. As we do so we can see the strength and temporal distribution of connections. This is all linked back to the global graph.
  19. …where I can see them in context.
  20. Zooming in ….
  21. and again, we see the big cluster from earlier break up into a constellation of smaller communities.
  22. Zooming further still, we see the position of these products in the context of other clusters showing interesting co-purchase groups by proximity – The Lord of the Rings not surprisingly near Harry Potter
  23. In summary, our approach seeks to meet some of the challenges of visualizing big graphs: Computation cost through distributed computing and data parallelism; Hairball layouts through a hierarchical community-based layout; Interaction through the application of web-map-style data tiles And finally effective labeling through the use of scale-sensitive domain-specific data stored with the tiles.
  24. Finally, we’ve applied this technology to several other domains such as financial networks, doctor referral networks, and biological data.