Clickstream analytics with Markov Chains

Alex Papageorgiou
Alex PapageorgiouMarketing analytics & data science consultant
Clickstream Analytics
Overview and practical applications
with Markov Chains
Data Science and Engineering Club
Dublin, May 2018
Alexandros Papageorgiou
Agenda
● Clickstream introduction
● Markov Chains overview
● 3 Practical applications
My journey so far
alex-papageo.com
Digital transformation
● Traditional companies undergoing digital transformation
● Increasing number of IRL startups now purely digital
● Clickstream becoming an ideal way to listen to the voices of customers
Warm-up: Wikipedia Clickstream and Network analysis
Why Clickstream
● Perform advanced types of analysis
● Go beyond standard segmentation analysis
● Get closer to the individual voices of customers
Alternatives ?
What’s the clickstream exactly ?
The Weblog
Accessing the Clickstream via Google Analytics
1. Implement Customer ID dimension
2. Implement timestamp dimension
Then for every pageview we can see the customer ID and the time stamp
How to guide: https://www.simoahava.com/analytics/improve-data-collection-with-
four-custom-dimensions/
A tidy clickstream example
Multiple models for clickstream analysis
● Network Analysis to visualise flow of
web traffic
● Clustering of customers
● Clustering of sessions
● Markov Chains for future click
prediction
● Frequent path analysis
● Hidden Markov Models to identify
user’s stage in the buying cycle.
● Association Rules to identify
bottlenecks to conversion
● Bot analysis for SEO optimisation
3 useful applications
● Frequent Path analysis
● Future Click predicition w/ Markov Chains
● Transition Probablities w/ Markov Chains
Markov Chains
● It’s a 100+ year old theory.
● Studies the evolution of dynamic systems
● Used widely in science from physics to finance, information science
● Hidden Markov Models, Markov Chain Monte Carlo, higer order
Markov Chains
Markov Chains vocabulary
Media Exposure through the Funnel: A Model of Multi-Stage Attribution
repository.cmu.edu/cgi/viewcontent.cgi?article=1399&context=heinzworks
The clickstream R package.
Package Author: Michael Scholz
- Cluster your clickstream
- Model the clickstream clusters as a markov chain
- Visualise and calculate transition probabilities
- Predict next click given a submited click sequence.
- Convert the clickstream to an object that is ready for association rules
Useful References
Markov Chains intro – when to use them, how they work
https://towardsdatascience.com/introduction-to-markov-chains-50da3645a50d
Clickstream package article on the Journal of Statistical Software
www.jstatsoft.org/article/view/v074i04
Supercharging websites with a real-time R API
http://code.markedmondson.me/predictClickOpenCPU/supercharge
Notebook on Github
https://github.com/papageorgiou/clickstream-talk/blob/master/data-sci-eng-meetup.md
Thank you!
@alpapag
analyst@alex-papageo.com
linkedin.com/in/alexandrospapageorgiou
1 of 18

Recommended

Graph-Powered Machine Learning by
Graph-Powered Machine LearningGraph-Powered Machine Learning
Graph-Powered Machine LearningDatabricks
651 views26 slides
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ... by
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...Edureka!
3.9K views33 slides
06 Community Detection by
06 Community Detection06 Community Detection
06 Community DetectionDuke Network Analysis Center
1.9K views49 slides
Link prediction 방법의 개념 및 활용 by
Link prediction 방법의 개념 및 활용Link prediction 방법의 개념 및 활용
Link prediction 방법의 개념 및 활용Kyunghoon Kim
5.7K views86 slides
MITRE ATT&CKcon 2018: Building an Atomic Testing Program, Brian Beyer, Red Ca... by
MITRE ATT&CKcon 2018: Building an Atomic Testing Program, Brian Beyer, Red Ca...MITRE ATT&CKcon 2018: Building an Atomic Testing Program, Brian Beyer, Red Ca...
MITRE ATT&CKcon 2018: Building an Atomic Testing Program, Brian Beyer, Red Ca...MITRE - ATT&CKcon
2K views20 slides
Deepwalk vs Node2vec by
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vecSiddhantVerma49
487 views20 slides

More Related Content

What's hot

K means clustering by
K means clusteringK means clustering
K means clusteringAhmedasbasb
1.3K views16 slides
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ... by
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...Simplilearn
9.6K views71 slides
Adversary Emulation using CALDERA by
Adversary Emulation using CALDERAAdversary Emulation using CALDERA
Adversary Emulation using CALDERAErik Van Buggenhout
1.4K views45 slides
K-means clustering algorithm by
K-means clustering algorithmK-means clustering algorithm
K-means clustering algorithmVinit Dantkale
965 views10 slides
Machine Learning & Cyber Security: Detecting Malicious URLs in the Haystack by
Machine Learning & Cyber Security: Detecting Malicious URLs in the HaystackMachine Learning & Cyber Security: Detecting Malicious URLs in the Haystack
Machine Learning & Cyber Security: Detecting Malicious URLs in the HaystackAlistair Gillespie
255 views35 slides
Kaggle presentation by
Kaggle presentationKaggle presentation
Kaggle presentationHJ van Veen
13.4K views58 slides

What's hot(20)

K means clustering by Ahmedasbasb
K means clusteringK means clustering
K means clustering
Ahmedasbasb1.3K views
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ... by Simplilearn
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
K Means Clustering Algorithm | K Means Clustering Example | Machine Learning ...
Simplilearn9.6K views
K-means clustering algorithm by Vinit Dantkale
K-means clustering algorithmK-means clustering algorithm
K-means clustering algorithm
Vinit Dantkale965 views
Machine Learning & Cyber Security: Detecting Malicious URLs in the Haystack by Alistair Gillespie
Machine Learning & Cyber Security: Detecting Malicious URLs in the HaystackMachine Learning & Cyber Security: Detecting Malicious URLs in the Haystack
Machine Learning & Cyber Security: Detecting Malicious URLs in the Haystack
Alistair Gillespie255 views
Kaggle presentation by HJ van Veen
Kaggle presentationKaggle presentation
Kaggle presentation
HJ van Veen13.4K views
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta... by Spark Summit
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Spark Summit4.7K views
Threat hunting and achieving security maturity by DNIF
Threat hunting and achieving security maturityThreat hunting and achieving security maturity
Threat hunting and achieving security maturity
DNIF220 views
Application of Machine Learning in Cybersecurity by Pratap Dangeti
Application of Machine Learning in CybersecurityApplication of Machine Learning in Cybersecurity
Application of Machine Learning in Cybersecurity
Pratap Dangeti527 views
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori... by Simplilearn
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
Simplilearn3.6K views
Feature Engineering for ML - Dmitry Larko, H2O.ai by Sri Ambati
Feature Engineering for ML - Dmitry Larko, H2O.aiFeature Engineering for ML - Dmitry Larko, H2O.ai
Feature Engineering for ML - Dmitry Larko, H2O.ai
Sri Ambati2.5K views
Community Detection by Kanika Kanwal
Community Detection Community Detection
Community Detection
Kanika Kanwal2.3K views
Clique by sk_klms
Clique Clique
Clique
sk_klms17.1K views
[HITCON 2020 CTI Village] Threat Hunting and Campaign Tracking Workshop.pptx by Chi En (Ashley) Shen
[HITCON 2020 CTI Village] Threat Hunting and Campaign Tracking Workshop.pptx[HITCON 2020 CTI Village] Threat Hunting and Campaign Tracking Workshop.pptx
[HITCON 2020 CTI Village] Threat Hunting and Campaign Tracking Workshop.pptx
When Cyber Security Meets Machine Learning by Lior Rokach
When Cyber Security Meets Machine LearningWhen Cyber Security Meets Machine Learning
When Cyber Security Meets Machine Learning
Lior Rokach17.7K views
Microsoft threat modeling tool 2016 by Rihab Chebbah
Microsoft threat modeling tool 2016Microsoft threat modeling tool 2016
Microsoft threat modeling tool 2016
Rihab Chebbah4.4K views

Similar to Clickstream analytics with Markov Chains

Clickstream Analytics with Markov Chains by
Clickstream Analytics with Markov Chains Clickstream Analytics with Markov Chains
Clickstream Analytics with Markov Chains Alex Papageorgiou
2.3K views16 slides
Machine Learning and Blockchain by Director of Product at Target by
Machine Learning and Blockchain by Director of Product at TargetMachine Learning and Blockchain by Director of Product at Target
Machine Learning and Blockchain by Director of Product at TargetProduct School
613 views32 slides
Stream Processing in Action by
Stream Processing in ActionStream Processing in Action
Stream Processing in ActionWSO2
781 views37 slides
Types of Blockchain, AI and its future by
Types of Blockchain, AI and its futureTypes of Blockchain, AI and its future
Types of Blockchain, AI and its futureAarthi Srinivasan
631 views29 slides
Leaning Forward: Imagine Supply Chain 2030 by
Leaning Forward: Imagine Supply Chain 2030Leaning Forward: Imagine Supply Chain 2030
Leaning Forward: Imagine Supply Chain 2030Lora Cecere
5.7K views45 slides
Supply Chain 2030: Presentation by Lora Cecere at CLX Conference by
Supply Chain 2030: Presentation by Lora Cecere at CLX ConferenceSupply Chain 2030: Presentation by Lora Cecere at CLX Conference
Supply Chain 2030: Presentation by Lora Cecere at CLX ConferenceLora Cecere
1.2K views83 slides

Similar to Clickstream analytics with Markov Chains(20)

Clickstream Analytics with Markov Chains by Alex Papageorgiou
Clickstream Analytics with Markov Chains Clickstream Analytics with Markov Chains
Clickstream Analytics with Markov Chains
Alex Papageorgiou2.3K views
Machine Learning and Blockchain by Director of Product at Target by Product School
Machine Learning and Blockchain by Director of Product at TargetMachine Learning and Blockchain by Director of Product at Target
Machine Learning and Blockchain by Director of Product at Target
Product School613 views
Stream Processing in Action by WSO2
Stream Processing in ActionStream Processing in Action
Stream Processing in Action
WSO2781 views
Leaning Forward: Imagine Supply Chain 2030 by Lora Cecere
Leaning Forward: Imagine Supply Chain 2030Leaning Forward: Imagine Supply Chain 2030
Leaning Forward: Imagine Supply Chain 2030
Lora Cecere5.7K views
Supply Chain 2030: Presentation by Lora Cecere at CLX Conference by Lora Cecere
Supply Chain 2030: Presentation by Lora Cecere at CLX ConferenceSupply Chain 2030: Presentation by Lora Cecere at CLX Conference
Supply Chain 2030: Presentation by Lora Cecere at CLX Conference
Lora Cecere1.2K views
Big Data LDN 2017: Advanced Analytics Applied to Marketing Attribution by Matt Stubbs
Big Data LDN 2017: Advanced Analytics Applied to Marketing AttributionBig Data LDN 2017: Advanced Analytics Applied to Marketing Attribution
Big Data LDN 2017: Advanced Analytics Applied to Marketing Attribution
Matt Stubbs804 views
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ... by IRJET Journal
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
IRJET Journal4 views
WSO2Con USA 2017: Analytics Patterns for Your Digital Enterprise by WSO2
WSO2Con USA 2017: Analytics Patterns for Your Digital EnterpriseWSO2Con USA 2017: Analytics Patterns for Your Digital Enterprise
WSO2Con USA 2017: Analytics Patterns for Your Digital Enterprise
WSO2587 views
Solutions Using WSO2 Analytics by WSO2
Solutions Using WSO2 AnalyticsSolutions Using WSO2 Analytics
Solutions Using WSO2 Analytics
WSO2630 views
Christian Kreuzfeld – Static vs Dynamic Stream Processing by Flink Forward
Christian Kreuzfeld – Static vs Dynamic Stream ProcessingChristian Kreuzfeld – Static vs Dynamic Stream Processing
Christian Kreuzfeld – Static vs Dynamic Stream Processing
Flink Forward7.6K views
Five performance factors you need to know about in 2018 by Fredric Lundgren
Five performance factors you need to know about in 2018Five performance factors you need to know about in 2018
Five performance factors you need to know about in 2018
Fredric Lundgren108 views
Cloud, saas and analytics driven value chain business transformation versio... by Angela Alejandro
Cloud, saas and analytics driven value chain business transformation   versio...Cloud, saas and analytics driven value chain business transformation   versio...
Cloud, saas and analytics driven value chain business transformation versio...
Angela Alejandro269 views
Everything you want to know about microservices by Youness Lasmak
Everything you want to know about microservicesEverything you want to know about microservices
Everything you want to know about microservices
Youness Lasmak148 views
Transport for London - Using Data to Keep London Moving by WSO2
Transport for London - Using Data to Keep London MovingTransport for London - Using Data to Keep London Moving
Transport for London - Using Data to Keep London Moving
WSO21.1K views
BUTLER project presentation by butler-iot
BUTLER project presentationBUTLER project presentation
BUTLER project presentation
butler-iot18.8K views

More from Alex Papageorgiou

Webinar Advanced marketing analytics by
Webinar Advanced marketing analyticsWebinar Advanced marketing analytics
Webinar Advanced marketing analyticsAlex Papageorgiou
202 views33 slides
Kaggle for digital analysts by
Kaggle for digital analystsKaggle for digital analysts
Kaggle for digital analystsAlex Papageorgiou
165 views18 slides
Kaggle for Analysts - MeasureCamp London 2019 by
Kaggle for Analysts - MeasureCamp London 2019Kaggle for Analysts - MeasureCamp London 2019
Kaggle for Analysts - MeasureCamp London 2019Alex Papageorgiou
139 views18 slides
Travel information search: the presence of social media by
Travel information search: the presence of social mediaTravel information search: the presence of social media
Travel information search: the presence of social mediaAlex Papageorgiou
384 views10 slides
The Kaggle Experience from a Digital Analysts' Perspective by
The Kaggle Experience from a Digital Analysts' PerspectiveThe Kaggle Experience from a Digital Analysts' Perspective
The Kaggle Experience from a Digital Analysts' PerspectiveAlex Papageorgiou
1.3K views24 slides
Growth Analytics: Evolution, Community and Tools by
Growth Analytics: Evolution, Community and ToolsGrowth Analytics: Evolution, Community and Tools
Growth Analytics: Evolution, Community and ToolsAlex Papageorgiou
1.3K views34 slides

More from Alex Papageorgiou(14)

Kaggle for Analysts - MeasureCamp London 2019 by Alex Papageorgiou
Kaggle for Analysts - MeasureCamp London 2019Kaggle for Analysts - MeasureCamp London 2019
Kaggle for Analysts - MeasureCamp London 2019
Alex Papageorgiou139 views
Travel information search: the presence of social media by Alex Papageorgiou
Travel information search: the presence of social mediaTravel information search: the presence of social media
Travel information search: the presence of social media
Alex Papageorgiou384 views
The Kaggle Experience from a Digital Analysts' Perspective by Alex Papageorgiou
The Kaggle Experience from a Digital Analysts' PerspectiveThe Kaggle Experience from a Digital Analysts' Perspective
The Kaggle Experience from a Digital Analysts' Perspective
Alex Papageorgiou1.3K views
Growth Analytics: Evolution, Community and Tools by Alex Papageorgiou
Growth Analytics: Evolution, Community and ToolsGrowth Analytics: Evolution, Community and Tools
Growth Analytics: Evolution, Community and Tools
Alex Papageorgiou1.3K views
The impact of search ads on organic search traffic by Alex Papageorgiou
The impact of search ads on organic search trafficThe impact of search ads on organic search traffic
The impact of search ads on organic search traffic
Alex Papageorgiou159 views
Prediciting happiness from mobile app survey data by Alex Papageorgiou
Prediciting happiness from mobile app survey dataPrediciting happiness from mobile app survey data
Prediciting happiness from mobile app survey data
Alex Papageorgiou251 views
E com conversion prediction and optimisation by Alex Papageorgiou
E com conversion prediction and optimisationE com conversion prediction and optimisation
E com conversion prediction and optimisation
Alex Papageorgiou382 views
Data science with Google Analytics @MeasureCamp by Alex Papageorgiou
Data science with Google Analytics @MeasureCampData science with Google Analytics @MeasureCamp
Data science with Google Analytics @MeasureCamp

Recently uploaded

6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf by
6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf
6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf10urkyr34
8 views259 slides
META.pptx by
META.pptxMETA.pptx
META.pptxvasanthan19012003
8 views10 slides
Pydata Global 2023 - How can a learnt model unlearn something by
Pydata Global 2023 - How can a learnt model unlearn somethingPydata Global 2023 - How can a learnt model unlearn something
Pydata Global 2023 - How can a learnt model unlearn somethingSARADINDU SENGUPTA
11 views13 slides
LIVE OAK MEMORIAL PARK.pptx by
LIVE OAK MEMORIAL PARK.pptxLIVE OAK MEMORIAL PARK.pptx
LIVE OAK MEMORIAL PARK.pptxms2332always
9 views6 slides
apple.pptx by
apple.pptxapple.pptx
apple.pptxhoneybeeqwe
8 views15 slides
GDG Community Day 2023 - Interpretable ML in production by
GDG Community Day 2023 - Interpretable ML in productionGDG Community Day 2023 - Interpretable ML in production
GDG Community Day 2023 - Interpretable ML in productionSARADINDU SENGUPTA
7 views19 slides

Recently uploaded(20)

6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf by 10urkyr34
6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf
6498-Butun_Beyinli_Cocuq-Daniel_J.Siegel-Tina_Payne_Bryson-2011-259s.pdf
10urkyr348 views
Pydata Global 2023 - How can a learnt model unlearn something by SARADINDU SENGUPTA
Pydata Global 2023 - How can a learnt model unlearn somethingPydata Global 2023 - How can a learnt model unlearn something
Pydata Global 2023 - How can a learnt model unlearn something
LIVE OAK MEMORIAL PARK.pptx by ms2332always
LIVE OAK MEMORIAL PARK.pptxLIVE OAK MEMORIAL PARK.pptx
LIVE OAK MEMORIAL PARK.pptx
ms2332always9 views
GDG Community Day 2023 - Interpretable ML in production by SARADINDU SENGUPTA
GDG Community Day 2023 - Interpretable ML in productionGDG Community Day 2023 - Interpretable ML in production
GDG Community Day 2023 - Interpretable ML in production
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx by DataScienceConferenc1
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx
[DSC Europe 23] Branka Panic - Peace in the age of artificial intelligence.pptx
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between... by DataScienceConferenc1
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...
[DSC Europe 23] Ilija Duni - How Foursquare Builds Meaningful Bridges Between...
Business administration Project File.pdf by KiranPrajapati91
Business administration Project File.pdfBusiness administration Project File.pdf
Business administration Project File.pdf
KiranPrajapati9111 views
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion by Bertram Ludäscher
Games, Queries, and Argumentation Frameworks: Time for a Family ReunionGames, Queries, and Argumentation Frameworks: Time for a Family Reunion
Games, Queries, and Argumentation Frameworks: Time for a Family Reunion
DGST Methodology Presentation.pdf by maddierlegum
DGST Methodology Presentation.pdfDGST Methodology Presentation.pdf
DGST Methodology Presentation.pdf
maddierlegum8 views
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning by SARADINDU SENGUPTA
GDG Cloud Community Day 2022 -  Managing data quality in Machine LearningGDG Cloud Community Day 2022 -  Managing data quality in Machine Learning
GDG Cloud Community Day 2022 - Managing data quality in Machine Learning
Best Home Security Systems.pptx by mogalang
Best Home Security Systems.pptxBest Home Security Systems.pptx
Best Home Security Systems.pptx
mogalang11 views
Underfunded.pptx by vgarcia19
Underfunded.pptxUnderfunded.pptx
Underfunded.pptx
vgarcia1916 views

Clickstream analytics with Markov Chains

  • 1. Clickstream Analytics Overview and practical applications with Markov Chains Data Science and Engineering Club Dublin, May 2018 Alexandros Papageorgiou
  • 2. Agenda ● Clickstream introduction ● Markov Chains overview ● 3 Practical applications
  • 3. My journey so far alex-papageo.com
  • 4. Digital transformation ● Traditional companies undergoing digital transformation ● Increasing number of IRL startups now purely digital ● Clickstream becoming an ideal way to listen to the voices of customers
  • 5. Warm-up: Wikipedia Clickstream and Network analysis
  • 6. Why Clickstream ● Perform advanced types of analysis ● Go beyond standard segmentation analysis ● Get closer to the individual voices of customers
  • 10. Accessing the Clickstream via Google Analytics 1. Implement Customer ID dimension 2. Implement timestamp dimension Then for every pageview we can see the customer ID and the time stamp How to guide: https://www.simoahava.com/analytics/improve-data-collection-with- four-custom-dimensions/
  • 12. Multiple models for clickstream analysis ● Network Analysis to visualise flow of web traffic ● Clustering of customers ● Clustering of sessions ● Markov Chains for future click prediction ● Frequent path analysis ● Hidden Markov Models to identify user’s stage in the buying cycle. ● Association Rules to identify bottlenecks to conversion ● Bot analysis for SEO optimisation
  • 13. 3 useful applications ● Frequent Path analysis ● Future Click predicition w/ Markov Chains ● Transition Probablities w/ Markov Chains
  • 14. Markov Chains ● It’s a 100+ year old theory. ● Studies the evolution of dynamic systems ● Used widely in science from physics to finance, information science ● Hidden Markov Models, Markov Chain Monte Carlo, higer order Markov Chains
  • 15. Markov Chains vocabulary Media Exposure through the Funnel: A Model of Multi-Stage Attribution repository.cmu.edu/cgi/viewcontent.cgi?article=1399&context=heinzworks
  • 16. The clickstream R package. Package Author: Michael Scholz - Cluster your clickstream - Model the clickstream clusters as a markov chain - Visualise and calculate transition probabilities - Predict next click given a submited click sequence. - Convert the clickstream to an object that is ready for association rules
  • 17. Useful References Markov Chains intro – when to use them, how they work https://towardsdatascience.com/introduction-to-markov-chains-50da3645a50d Clickstream package article on the Journal of Statistical Software www.jstatsoft.org/article/view/v074i04 Supercharging websites with a real-time R API http://code.markedmondson.me/predictClickOpenCPU/supercharge Notebook on Github https://github.com/papageorgiou/clickstream-talk/blob/master/data-sci-eng-meetup.md

Editor's Notes

  1. There is a lot of talk about digital transformation..lots of companies especially new are completely digital OR more traditional ones are moving to that direction fast. Clickstream is becoming a key data structure/resource that its critical to underand it and work with it in order not to give potential value on the table and use it for competitive advantage to better understand customer journeys.  Will talk about cls from the perspective of a startup company, that’s in line with my experience and in line with how the vast majoriy of businesses can benefit.  If you work for a company with data engineers and data science teams, this is something that you might take for granted.  Of course we record everything, we structure the web log files we put data in data bases and then analysts can access them and we build real time streaming applications on top of that data...but this is probably 1 % of companies. But even if you work there, if you are in Marketing or customer department, there is a lot you can do, without necessarily asking for dedicated engineering resources.
  2. Out of context warm up from a recent blogpost. What you see here is the result of some clickstream combined network analysis. Use network analysis to visualise association between wikipedia pages in a particular thematic area in this case Data science and the traffic that goes back and forth between them. Just one of the application of clickstream combined with network analysis…we ll see a few more. We ll go there step by step.