SlideShare a Scribd company logo
Titolo presentazione
sottotitolo
Milano, XX mese 20XX
A Big Data Analysis Framework for
Model-Based Web User Behavior
Analytics
Carlo Bernaschina, Marco Brambila, Andrea Mauri, Eric Umuhoza
6th June 2017
DEIB. Data Science LabICWE2017 – June 6th 2017
Context
Web analytics has become the tool of choice to inform both
business users and designers.
Several tools exist that support analysis of Web server logs and
extract information on application usage.
DEIB. Data Science LabICWE2017 – June 6th 2017
Context
DEIB. Data Science LabICWE2017 – June 6th 2017
Context
DEIB. Data Science LabICWE2017 – June 6th 2017
Context
Web analytics has become the tool of choice towards taking
informed business and interaction design decisions.
Several tools exist that support analysis of Web server logs and
extract information on application usage.
• Usually unaware of the design structure and the actual
content managed by the application.
DEIB. Data Science LabICWE2017 – June 6th 2017
Objective
Provide valuable insights to designers and decision makers
Integration of two approaches:
• Web Log Analytics
• Model Driven Development
DEIB. Data Science LabICWE2017 – June 6th 2017
Model Driven Development
Software development paradigm where the models are the main
artefacts of the development process.
Main benefits:
• Model once, generate for any platform of choice
• Validation of the requirements
In this specific case:
• The model includes the structure of the web application.
DEIB. Data Science LabICWE2017 – June 6th 2017
Interaction Flow Modeling Language
Interaction Flow Modeling Language (IFML) is designed for
expressing the content, user interaction and control behavior of
the front-end of software applications
DEIB. Data Science LabICWE2017 – June 6th 2017
Overview
Application Server
Data Analyser
Storage
Analyzer
(Spark)
Code
Generator
Webratio
IFML Editor
(Enriched Analytics Model)
Web Server LogRTXLog
ModelCode
Data Visualization
Analysis
(b)
(a,f)
(d)
(e)
Application
DB
Web Server
(Tomcat)
Database
(c)
Modeling
Application
Deployment
Analysis Visualization
Execution
DEIB. Data Science LabICWE2017 – June 6th 2017
Logs Integration
(d)
(c)
(b)
Model
RTXLog
Web Server Log
Database
RTXLog
WebServerLog
Model
Database
EnrichedLog
GlobalLog
FinalLog
RTX.sessionId = WebServer.sessionId AND
RTX.timestamp = WebServer.timestamp
EnrichedLog.elementId = Model.elementId
GlobalLog.tabelName = Database.tableName
AND Database.attributeName=“OID”
AND GlobalLog.instanceID = Database,value
(a)
(a)
(a)
(a)
A denormalized view of the logs
DEIB. Data Science LabICWE2017 – June 6th 2017
Navigation Based Analyses
Includes information regarding how the users navigate the Web
site
Examples:
• Entrance Rate
• Bounce Rate
• Page Visit
• Residence Time
• Link Navigation
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Content Based Analyses
Comprehends information regarding the domain entities involved
in the user interaction, their types and their semantics.
Example (e-commerce website)
• Top K Visualized Books
• Top K Visualized Authors
• Top K Clicked Books
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Structure Based Analyses
Comprehends information regarding the kind of widget,
visualization, or even navigation pattern used in the user
interaction.
Example:
• Top K elements clicked by users when shown in a map
throughout the site
• Top K elements clicked when shown in the first three positions
of a list
• Top K elements clicked when an attribute of type image is
shown in the page versus an attribute of type currency.
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Data Visualization Tool
Traditional charts like pie charts, bar charts, navigation flow
charts and so on..
DEIB. Data Science LabICWE2017 – June 6th 2017
Visual Feedback on Model Editor
Three types of visualization:
• Color: the analytics is shown through the change of color of
the corresponding model element
• Label: the analytics is shown with a label on the
corresponding model element
• Properties: the analytics is shown in a separate property panel
DEIB. Data Science LabICWE2017 – June 6th 2017
Visual Feedback on Model Editor - Color
• Residence Time
• Page visit
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Visual Feedback on Model Editor - Label
• Link Out Ratio
• Link In Ratio
• Top 1 Clicked Entity
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Visual Feedback on Model Editor - Properties
• Top k clicked
entities
• Top k visualized
entities
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Future Work
Do not stop at the visualization!
• Effects of Model Restructuring on User Behavior
• Conversion Rate
• Number of user who reach payment Module
• Conversion Path: the path (with high contribution)
leading to conversion rate
• Model Optimization
DEIB. Data Science LabICWE2017 – June 6th 2017
Thanks for your attention
Questions?
Contact: andrea.mauri@polimi.it
Tool: www.ifmledit.org
Further Info: http://datascience.deib.polimi.it/bigdata-modeling-
weblogs/

More Related Content

What's hot

Data Visualizations with D3.js
Data Visualizations with D3.jsData Visualizations with D3.js
Data Visualizations with D3.js
Brian Greig
 
Resume xiaodan(vinci)
Resume xiaodan(vinci)Resume xiaodan(vinci)
Resume xiaodan(vinci)
vinci105
 
Introduction BI Semantic Model with Sql Server Data Tools copy
Introduction BI Semantic Model with Sql Server Data Tools   copyIntroduction BI Semantic Model with Sql Server Data Tools   copy
Introduction BI Semantic Model with Sql Server Data Tools copy
Slava Kokaev
 
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
Big Data Value Association
 
This week in Neo4j - 21st October 2017
This week in Neo4j - 21st October 2017This week in Neo4j - 21st October 2017
This week in Neo4j - 21st October 2017
Neo4j
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
Denodo
 
Hobbit in a Nutshell - EDF2016
Hobbit in a Nutshell - EDF2016Hobbit in a Nutshell - EDF2016
Hobbit in a Nutshell - EDF2016
Holistic Benchmarking of Big Linked Data
 
Access database
Access databaseAccess database
Access database
phaocake
 
Lju Lazarevic
Lju LazarevicLju Lazarevic
Lju Lazarevic
Connected Data World
 
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & AnalyticsDataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
Dr. Arif Wider
 
Building business processes with content in OneDrive and SharePoint
Building business processes with content in OneDrive and SharePointBuilding business processes with content in OneDrive and SharePoint
Building business processes with content in OneDrive and SharePoint
Microsoft Tech Community
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
DATAVERSITY
 
Tracking research data footprints - slides
Tracking research data footprints - slidesTracking research data footprints - slides
Tracking research data footprints - slides
ARDC
 
Analytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Analytical Systems Evolution: From Excel to Big Data Platforms and Data LakesAnalytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Analytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Provectus
 
Edmc use cases 2018 nyc
Edmc use cases 2018   nycEdmc use cases 2018   nyc
Edmc use cases 2018 nyc
Marty Loughlin
 
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Cambridge Semantics
 
20180901 - 9th Meeting - Power BI Module 0-1
20180901 - 9th Meeting - Power BI Module 0-120180901 - 9th Meeting - Power BI Module 0-1
20180901 - 9th Meeting - Power BI Module 0-1
Duc Lai Trung Minh
 
Prague data management meetup 2017-09-26
Prague data management meetup 2017-09-26Prague data management meetup 2017-09-26
Prague data management meetup 2017-09-26
Martin Bém
 
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMSGRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
Graph-TA
 
Connected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected data meetup group - introduction & scope
Connected data meetup group - introduction & scope
Connected Data World
 

What's hot (20)

Data Visualizations with D3.js
Data Visualizations with D3.jsData Visualizations with D3.js
Data Visualizations with D3.js
 
Resume xiaodan(vinci)
Resume xiaodan(vinci)Resume xiaodan(vinci)
Resume xiaodan(vinci)
 
Introduction BI Semantic Model with Sql Server Data Tools copy
Introduction BI Semantic Model with Sql Server Data Tools   copyIntroduction BI Semantic Model with Sql Server Data Tools   copy
Introduction BI Semantic Model with Sql Server Data Tools copy
 
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
 
This week in Neo4j - 21st October 2017
This week in Neo4j - 21st October 2017This week in Neo4j - 21st October 2017
This week in Neo4j - 21st October 2017
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Hobbit in a Nutshell - EDF2016
Hobbit in a Nutshell - EDF2016Hobbit in a Nutshell - EDF2016
Hobbit in a Nutshell - EDF2016
 
Access database
Access databaseAccess database
Access database
 
Lju Lazarevic
Lju LazarevicLju Lazarevic
Lju Lazarevic
 
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & AnalyticsDataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
 
Building business processes with content in OneDrive and SharePoint
Building business processes with content in OneDrive and SharePointBuilding business processes with content in OneDrive and SharePoint
Building business processes with content in OneDrive and SharePoint
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
 
Tracking research data footprints - slides
Tracking research data footprints - slidesTracking research data footprints - slides
Tracking research data footprints - slides
 
Analytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Analytical Systems Evolution: From Excel to Big Data Platforms and Data LakesAnalytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Analytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
 
Edmc use cases 2018 nyc
Edmc use cases 2018   nycEdmc use cases 2018   nyc
Edmc use cases 2018 nyc
 
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
 
20180901 - 9th Meeting - Power BI Module 0-1
20180901 - 9th Meeting - Power BI Module 0-120180901 - 9th Meeting - Power BI Module 0-1
20180901 - 9th Meeting - Power BI Module 0-1
 
Prague data management meetup 2017-09-26
Prague data management meetup 2017-09-26Prague data management meetup 2017-09-26
Prague data management meetup 2017-09-26
 
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMSGRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
 
Connected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected data meetup group - introduction & scope
Connected data meetup group - introduction & scope
 

Similar to A Big Data Analysis Framework for Model-Based Web User Behavior Analytics

App dev and partner ecosystem for pink social connections 2017
App dev and partner ecosystem for pink   social connections 2017App dev and partner ecosystem for pink   social connections 2017
App dev and partner ecosystem for pink social connections 2017
Heath McCarthy
 
Turning the IBM Collaboration Ecosystem Pink
Turning the IBM Collaboration Ecosystem PinkTurning the IBM Collaboration Ecosystem Pink
Turning the IBM Collaboration Ecosystem Pink
LetsConnect
 
Research Paper
Research PaperResearch Paper
Research Paper
Dominic Mutai
 
Calling all Developers: Building Connections Apps and Integrating with Pink
Calling all Developers: Building Connections Apps and Integrating with PinkCalling all Developers: Building Connections Apps and Integrating with Pink
Calling all Developers: Building Connections Apps and Integrating with Pink
LetsConnect
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Kai Wähner
 
Koneksys Presentation March 2021
Koneksys Presentation March 2021Koneksys Presentation March 2021
Koneksys Presentation March 2021
Axel Reichwein
 
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
LetsConnect
 
Digital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoTDigital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoT
Dimitri Volkmann
 
SPFx- A modern development model for SharePoint
SPFx- A modern development model  for SharePointSPFx- A modern development model  for SharePoint
SPFx- A modern development model for SharePoint
Kirti Prajapati
 
Choosing a Data Visualization Tool for Data Scientists_Final
Choosing a Data Visualization Tool for Data Scientists_FinalChoosing a Data Visualization Tool for Data Scientists_Final
Choosing a Data Visualization Tool for Data Scientists_Final
Heather Choi
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at Scale
Walid Mehanna
 
Analyst Toolbox August 2017
Analyst Toolbox August 2017Analyst Toolbox August 2017
Analyst Toolbox August 2017
Boulder Equity Analytics
 
Build Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Build Business Web Applications with PHPOpenbiz Framework and Cubi PlatformBuild Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Build Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Agus Suhartono
 
Charting libraries vs BI & Analytics Platforms
Charting libraries vs BI & Analytics Platforms Charting libraries vs BI & Analytics Platforms
Charting libraries vs BI & Analytics Platforms
TIBCO Jaspersoft
 
Bridging the Gap
Bridging the GapBridging the Gap
Bridging the Gap
Dickson Lukose
 
Neo4j GraphDay Seattle- Sept19- in the enterprise
Neo4j GraphDay Seattle- Sept19-  in the enterpriseNeo4j GraphDay Seattle- Sept19-  in the enterprise
Neo4j GraphDay Seattle- Sept19- in the enterprise
Neo4j
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics Interoperability
Open Cyber University of Korea
 
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
CSols, Inc.
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
VMware Tanzu
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise Architects
Neo4j
 

Similar to A Big Data Analysis Framework for Model-Based Web User Behavior Analytics (20)

App dev and partner ecosystem for pink social connections 2017
App dev and partner ecosystem for pink   social connections 2017App dev and partner ecosystem for pink   social connections 2017
App dev and partner ecosystem for pink social connections 2017
 
Turning the IBM Collaboration Ecosystem Pink
Turning the IBM Collaboration Ecosystem PinkTurning the IBM Collaboration Ecosystem Pink
Turning the IBM Collaboration Ecosystem Pink
 
Research Paper
Research PaperResearch Paper
Research Paper
 
Calling all Developers: Building Connections Apps and Integrating with Pink
Calling all Developers: Building Connections Apps and Integrating with PinkCalling all Developers: Building Connections Apps and Integrating with Pink
Calling all Developers: Building Connections Apps and Integrating with Pink
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
 
Koneksys Presentation March 2021
Koneksys Presentation March 2021Koneksys Presentation March 2021
Koneksys Presentation March 2021
 
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
 
Digital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoTDigital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoT
 
SPFx- A modern development model for SharePoint
SPFx- A modern development model  for SharePointSPFx- A modern development model  for SharePoint
SPFx- A modern development model for SharePoint
 
Choosing a Data Visualization Tool for Data Scientists_Final
Choosing a Data Visualization Tool for Data Scientists_FinalChoosing a Data Visualization Tool for Data Scientists_Final
Choosing a Data Visualization Tool for Data Scientists_Final
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at Scale
 
Analyst Toolbox August 2017
Analyst Toolbox August 2017Analyst Toolbox August 2017
Analyst Toolbox August 2017
 
Build Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Build Business Web Applications with PHPOpenbiz Framework and Cubi PlatformBuild Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Build Business Web Applications with PHPOpenbiz Framework and Cubi Platform
 
Charting libraries vs BI & Analytics Platforms
Charting libraries vs BI & Analytics Platforms Charting libraries vs BI & Analytics Platforms
Charting libraries vs BI & Analytics Platforms
 
Bridging the Gap
Bridging the GapBridging the Gap
Bridging the Gap
 
Neo4j GraphDay Seattle- Sept19- in the enterprise
Neo4j GraphDay Seattle- Sept19-  in the enterpriseNeo4j GraphDay Seattle- Sept19-  in the enterprise
Neo4j GraphDay Seattle- Sept19- in the enterprise
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics Interoperability
 
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise Architects
 

More from Andrea Mauri

TripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the WebTripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the Web
Andrea Mauri
 
Model Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring ApplicationsModel Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring Applications
Andrea Mauri
 
Methodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applicationsMethodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applications
Andrea Mauri
 
An explorative approach for Crowdsourcing tasks design
An explorative approach for Crowdsourcing tasks design �An explorative approach for Crowdsourcing tasks design �
An explorative approach for Crowdsourcing tasks design
Andrea Mauri
 
Community Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing QueriesCommunity Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing Queries
Andrea Mauri
 
Community-based Crowdsourcing
Community-based CrowdsourcingCommunity-based Crowdsourcing
Community-based Crowdsourcing
Andrea Mauri
 

More from Andrea Mauri (6)

TripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the WebTripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the Web
 
Model Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring ApplicationsModel Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring Applications
 
Methodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applicationsMethodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applications
 
An explorative approach for Crowdsourcing tasks design
An explorative approach for Crowdsourcing tasks design �An explorative approach for Crowdsourcing tasks design �
An explorative approach for Crowdsourcing tasks design
 
Community Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing QueriesCommunity Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing Queries
 
Community-based Crowdsourcing
Community-based CrowdsourcingCommunity-based Crowdsourcing
Community-based Crowdsourcing
 

Recently uploaded

原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
wyddcwye1
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
a9qfiubqu
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
taqyea
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
Márton Kodok
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
y3i0qsdzb
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 

Recently uploaded (20)

原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 

A Big Data Analysis Framework for Model-Based Web User Behavior Analytics

  • 1. Titolo presentazione sottotitolo Milano, XX mese 20XX A Big Data Analysis Framework for Model-Based Web User Behavior Analytics Carlo Bernaschina, Marco Brambila, Andrea Mauri, Eric Umuhoza 6th June 2017
  • 2. DEIB. Data Science LabICWE2017 – June 6th 2017 Context Web analytics has become the tool of choice to inform both business users and designers. Several tools exist that support analysis of Web server logs and extract information on application usage.
  • 3. DEIB. Data Science LabICWE2017 – June 6th 2017 Context
  • 4. DEIB. Data Science LabICWE2017 – June 6th 2017 Context
  • 5. DEIB. Data Science LabICWE2017 – June 6th 2017 Context Web analytics has become the tool of choice towards taking informed business and interaction design decisions. Several tools exist that support analysis of Web server logs and extract information on application usage. • Usually unaware of the design structure and the actual content managed by the application.
  • 6. DEIB. Data Science LabICWE2017 – June 6th 2017 Objective Provide valuable insights to designers and decision makers Integration of two approaches: • Web Log Analytics • Model Driven Development
  • 7. DEIB. Data Science LabICWE2017 – June 6th 2017 Model Driven Development Software development paradigm where the models are the main artefacts of the development process. Main benefits: • Model once, generate for any platform of choice • Validation of the requirements In this specific case: • The model includes the structure of the web application.
  • 8. DEIB. Data Science LabICWE2017 – June 6th 2017 Interaction Flow Modeling Language Interaction Flow Modeling Language (IFML) is designed for expressing the content, user interaction and control behavior of the front-end of software applications
  • 9. DEIB. Data Science LabICWE2017 – June 6th 2017 Overview Application Server Data Analyser Storage Analyzer (Spark) Code Generator Webratio IFML Editor (Enriched Analytics Model) Web Server LogRTXLog ModelCode Data Visualization Analysis (b) (a,f) (d) (e) Application DB Web Server (Tomcat) Database (c) Modeling Application Deployment Analysis Visualization Execution
  • 10. DEIB. Data Science LabICWE2017 – June 6th 2017 Logs Integration (d) (c) (b) Model RTXLog Web Server Log Database RTXLog WebServerLog Model Database EnrichedLog GlobalLog FinalLog RTX.sessionId = WebServer.sessionId AND RTX.timestamp = WebServer.timestamp EnrichedLog.elementId = Model.elementId GlobalLog.tabelName = Database.tableName AND Database.attributeName=“OID” AND GlobalLog.instanceID = Database,value (a) (a) (a) (a) A denormalized view of the logs
  • 11. DEIB. Data Science LabICWE2017 – June 6th 2017 Navigation Based Analyses Includes information regarding how the users navigate the Web site Examples: • Entrance Rate • Bounce Rate • Page Visit • Residence Time • Link Navigation • …
  • 12. DEIB. Data Science LabICWE2017 – June 6th 2017 Content Based Analyses Comprehends information regarding the domain entities involved in the user interaction, their types and their semantics. Example (e-commerce website) • Top K Visualized Books • Top K Visualized Authors • Top K Clicked Books • …
  • 13. DEIB. Data Science LabICWE2017 – June 6th 2017 Structure Based Analyses Comprehends information regarding the kind of widget, visualization, or even navigation pattern used in the user interaction. Example: • Top K elements clicked by users when shown in a map throughout the site • Top K elements clicked when shown in the first three positions of a list • Top K elements clicked when an attribute of type image is shown in the page versus an attribute of type currency. • …
  • 14. DEIB. Data Science LabICWE2017 – June 6th 2017 Data Visualization Tool Traditional charts like pie charts, bar charts, navigation flow charts and so on..
  • 15. DEIB. Data Science LabICWE2017 – June 6th 2017 Visual Feedback on Model Editor Three types of visualization: • Color: the analytics is shown through the change of color of the corresponding model element • Label: the analytics is shown with a label on the corresponding model element • Properties: the analytics is shown in a separate property panel
  • 16. DEIB. Data Science LabICWE2017 – June 6th 2017 Visual Feedback on Model Editor - Color • Residence Time • Page visit • …
  • 17. DEIB. Data Science LabICWE2017 – June 6th 2017 Visual Feedback on Model Editor - Label • Link Out Ratio • Link In Ratio • Top 1 Clicked Entity • …
  • 18. DEIB. Data Science LabICWE2017 – June 6th 2017 Visual Feedback on Model Editor - Properties • Top k clicked entities • Top k visualized entities • …
  • 19. DEIB. Data Science LabICWE2017 – June 6th 2017 Future Work Do not stop at the visualization! • Effects of Model Restructuring on User Behavior • Conversion Rate • Number of user who reach payment Module • Conversion Path: the path (with high contribution) leading to conversion rate • Model Optimization
  • 20. DEIB. Data Science LabICWE2017 – June 6th 2017 Thanks for your attention Questions? Contact: andrea.mauri@polimi.it Tool: www.ifmledit.org Further Info: http://datascience.deib.polimi.it/bigdata-modeling- weblogs/

Editor's Notes

  1. Business user - > customer needs Designers -> good web applications
  2. Here there is an example of this kind of tools. It shows statistics such as: residence time, page views, bounce rate and so on..
  3. It also offers a more complex view is the ”behavior flow”, that represents how the user move inside the web application.
  4. Design structure -> what is actually present in the page -> hidden in the source code Content -> entities involved -> hidden in the database
  5. Include and expose the structure of the web application.
  6. The runtime component log (RTX) that stores events and data produced and consumed by the application runtime for serving page requests. It traces the history of the ViewContainers, ViewComponents, and operations that are executed, along with the executed queries. Enriched log: At this stage of the process we know all the elements involved in the users navigation, but only limited to their id. Global Log: information regarding to the model elements involved in the user interaction. From this global, denormalized view one can generate any desired behaviour analysis. In particular every statistics is computed with one or more SQL-like queries on the FinalLog .
  7. Color allows to have a heatmap-like view of the statistic
  8. Label allows to attach a single piece of information to a model element
  9. Properties allows to visualize complex statistics that would be too cumbersome to visualize directly on the model editor.