SlideShare a Scribd company logo
1 of 76
WEB USAGE MINING
Monu Chaudhary
071BCT522
INTRODUCTION
Web Usage mining is the process of
applying data mining techniques for the
discovery of usage patterns from Web data,
targeted towards various applications.
INTRODUCTION
Data collected at different levels:
➢ Server level
➢ Client level
➢ Proxy level
INTRODUCTION
Goal:
➢ analyze the behavioral patterns and
profiles of users interacting with a Web
site
➢ Understand and better serve the needs
of Web-based applications
INTRODUCTION
Classification based on Usage Data:
➢ Web server Data
➢ Application Server Data
➢ Application Level Data
INTRODUCTION
Importance:
➢ Growth of e-commerce
○ Provides an a cost effective way of
doing business.
➢ Hidden useful information
○ Visitors’ profile
○ Measure online marketing effort
INTRODUCTION
3 Phases:
➢ Preprocessing
➢ Pattern Discovery
➢ Pattern Analysis
PREPROCESSING
Preprocessing consists of converting the:
➢ usage information
➢ content information
➢ structure information
contained in the various available data
sources into the data abstractions necessary
for pattern discovery.
Web Usage Mining Process
Web Usage Mining Process
Preprocessing of Web Usage Mining
Preprocessing of Web Usage Mining
Data Cleaning remove irrelevant references
and fields in server logs, removes erroneous
references and adds missing references due
to caching.
Preprocessing of Web Usage Mining
Sessionization: the activities performed by a
user from the moment she enters the site
until the moment she leaves it.
Sessionization
Preprocessing of Web Usage Mining
User Identification records multiple sessions
for user. This log is called User activity
record.
User Identification
Preprocessing of Web Usage Mining
A page view consists of every file that
contributes to the display on a user's browser
at one time.
Preprocessing of Web Usage Mining
Conceptually, each Page view can be viewed
as a collection of Web objects or resources
representing a specific “user event,” e.g.,
reading an article, viewing a product page, or
adding a product to the shopping cart.
Preprocessing of Web Usage Mining
Path Completion: Client- or proxy-side
caching can often result in missing access
references to those pages or objects that
have been cached.
Preprocessing of Web Usage Mining
Path Completion: For instance,
➢ if a user returns to a page A during the
same session, the second access to A will
likely result in viewing the previously
downloaded version of A that was
cached on the client- side, and therefore,
no request is made to the server.
Preprocessing of Web Usage Mining
Path Completion:
➢ This results in the second reference to A
not being recorded on the server logs.
Path Completion
Preprocessing of Web Usage Mining
Episode is a subset or subsequence of a
session comprised of semantically or
functionally related page views.
PATTERN DISCOVERY
Pattern discovery draws upon methods and
algorithms developed from several fields such as
statistics, data mining, machine learning and
pattern recognition.
PATTERN DISCOVERY
Methods:
➢ Statistical Analysis
➢ Association Rules
➢ Clustering
➢ Classification
➢ Sequential Patterns
PATTERN ANALYSIS
The motivation behind pattern analysis is to filter
out uninteresting rules or patterns from the set
found in the pattern discovery phase.
PATTERN ANALYSIS
Methods:
➢ A knowledge query mechanism such as SQL.
➢ Another method is to load usage data into a
data cube in order to perform Online
Analytical Processing (OLAP) operations.
PATTERN ANALYSIS
Methods:
➢ Visualization techniques, such as graphing
patterns or assigning colors to different
values.
➢ content and structure information can be
used to filter out patterns containing pages of
a certain usage type, content type, or pages
that match a certain hyperlink structure.
Application of Web Usage Mining
Advantages
➢ Personalized marketing.
➢ Fight against terrorism.
➢ Customer Relationship.
➢ Increase profitability by target pricing.
COLLABORATIVE FILTERING
Subodh chandra shakya
071BCT543
What is collaborative filtering…???
Collaborative filtering is a method of making
automatic predictions about the interest of a
user by collecting preferences or taste
information from other other users users(I.e
collaborating the interest )
Application
Mostly in e-commerce recommendation
system
Amazon
Netflix
This is how it works….
1.Weight all users with respect to similarity with active user
2. Select a subset of Users to use as a set of predictors
3. Compute a prediction from a weighted combination of
selected neighbors’ ratings
Collaborative filtering types
Memory Based: uses user rating data to compute
similarity between users or items user
rating,Neighbourhood based,Item Based etc
Model Based:Uses data mining and machine learning
Bayesian networks,neural embedding
models,clustering models,latent semantic models
such as SVD.
Approaches for CF (memory based)
User-Based CF - compute similarity based on User
Item-Based CF-Compute similarity base on item
User based CF
Look for users who share the same rating
patterns with the active user(the user whom
the prediction is for)
Use the ratings from those like-minded users
to calculate a prediction for the active user
Item based CF
1. Build an item-item matrix determining
relationships between pairs of items
1. Infer the tastes of the current user by
examining the matrix and matching that
user's data
Simple similarity is cosine similarity
Pearson correlation similarity
Collaborative Filtering problem
Cold-start: There should be enough other users
already in the system to find a match.New items
need to get enough ratings
Popularity Bias:Hard to recommend items to
someone with unique tastes
RECOMMENDER SYSTEMS
Atul Khatri
071bct509
Definition
● Estimate a utility function that automatically predicts how a
user will like an item
● Based on
○ Past Behavior
○ Relations to other users
○ Item similarity
○ Context
Impact
Apparent
● Advertisement
● Restaurants, cafes
● Movies, Tv shows, Music
● Books
● News articles
● Social sites including dating services
Impact(continued)
Not so apparent
● Courses in E-learning
● Drug components
● Research papers
● Citations
● Code modules
Architecture
Types
● Collaborative Filtering system
● Content-based system
● Hybrid recommender system
○ Context-based system
○ Knowledge-based system
Paradigms of recommender
systems
Content-Based
Recommender System
● System creates a user profile based on users likes or dislikes
which are explicitly stated
● Every purchase updates the user profile.
● A content-based recommender system matches the profile of
item to user profile to decide its relevancy to the user
Storage of items in database
Content Representation
● Structured data
○ Small number of attributes
○ Each item described by same set of attributes
○ Known set of values of attributes
Content
Representation(contd...)
● Unstructured data
○ No attribute names with well defined values
○ Need to impose structure on text before use
○ Natural language complexity
■ Same word with different meaning
■ Different word with same meaning
Context-Based
Recommender Systems
● System uses additional data about context of an item
consumption.
● Example: Additional component of time may be used to
recommend restaurants to consumers i.e different restaurants
for breakfast, lunch and so on. Further, information about
whether you are going out to eat with your friends or family
should also vary the recommendation.
Major obstacles for contextual computing
● Obtain sufficient and reliable data describing user context
● Understand the impact of contextual dimensions on
personalisation process
● Computational model of contextual dimensions in more
classical recommendation technology
● For instance: How to extend Collaborative filtering to
include contextual dimensions?
Collective Intelligence
Sagun Nakarmi
071bct533
● A shared or group intelligence that emerges
from the collaboration and competition of many
individuals.
● Groups of people and computers, connected by
the Internet, collectively doing intelligent
things.
It can be understood as an emergent property from
the synergies among:
1) Data - knowledge-information
2) Software-hardware
3) Experts
For instance,
Google technology harvests knowledge generated
by millions of people creating and linking web
pages and then uses this knowledge to answer
queries in ways that often seem amazingly
intelligent.
In Wikipedia, thousands of people around the world
have collectively created a very large and high quality
intellectual product with almost no centralized control,
and almost all as volunteers!
Online multi-player games are another example
of collective intelligence. Games such as Dota 2,
Second Life and Call of Duty rely on gamers
coming together as a community to form the
game’s Identity.
Other examples:
● social networking ( perhaps the
most popular of collective intelligence.)
● Amazon, Hamrobazaar & other ecommerce sites
● etc
THANK YOU FOR YOUR
PATIENCE!!

More Related Content

What's hot

Introduction to Web Mining and Spatial Data Mining
Introduction to Web Mining and Spatial Data MiningIntroduction to Web Mining and Spatial Data Mining
Introduction to Web Mining and Spatial Data MiningAarshDhokai
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Seerat Malik
 
Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introductionnimmyjans4
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval ssilambu111
 
Intro to web scraping with Python
Intro to web scraping with PythonIntro to web scraping with Python
Intro to web scraping with PythonMaris Lemba
 
Web Mining & Text Mining
Web Mining & Text MiningWeb Mining & Text Mining
Web Mining & Text MiningHemant Sharma
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information RetrievalRoi Blanco
 
Information retrieval (introduction)
Information  retrieval (introduction) Information  retrieval (introduction)
Information retrieval (introduction) Primya Tamil
 
WEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEMWEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEMSai Kumar Ale
 
Clustering in data Mining (Data Mining)
Clustering in data Mining (Data Mining)Clustering in data Mining (Data Mining)
Clustering in data Mining (Data Mining)Mustafa Sherazi
 
Data Mining: Text and web mining
Data Mining: Text and web miningData Mining: Text and web mining
Data Mining: Text and web miningDataminingTools Inc
 

What's hot (20)

Web Content Mining
Web Content MiningWeb Content Mining
Web Content Mining
 
Introduction to Web Mining and Spatial Data Mining
Introduction to Web Mining and Spatial Data MiningIntroduction to Web Mining and Spatial Data Mining
Introduction to Web Mining and Spatial Data Mining
 
Data Mining: What is Data Mining?
Data Mining: What is Data Mining?Data Mining: What is Data Mining?
Data Mining: What is Data Mining?
 
Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introduction
 
Information retrieval s
Information retrieval sInformation retrieval s
Information retrieval s
 
Web usage-mining
Web usage-miningWeb usage-mining
Web usage-mining
 
Web data mining
Web data miningWeb data mining
Web data mining
 
Intro to web scraping with Python
Intro to web scraping with PythonIntro to web scraping with Python
Intro to web scraping with Python
 
Data Mining
Data MiningData Mining
Data Mining
 
Clustering in Data Mining
Clustering in Data MiningClustering in Data Mining
Clustering in Data Mining
 
Web mining (1)
Web mining (1)Web mining (1)
Web mining (1)
 
Web Mining & Text Mining
Web Mining & Text MiningWeb Mining & Text Mining
Web Mining & Text Mining
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Information retrieval (introduction)
Information  retrieval (introduction) Information  retrieval (introduction)
Information retrieval (introduction)
 
WEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEMWEB BASED INFORMATION RETRIEVAL SYSTEM
WEB BASED INFORMATION RETRIEVAL SYSTEM
 
Web mining
Web miningWeb mining
Web mining
 
Web mining
Web miningWeb mining
Web mining
 
Tutorial on Web Scraping in Python
Tutorial on Web Scraping in PythonTutorial on Web Scraping in Python
Tutorial on Web Scraping in Python
 
Clustering in data Mining (Data Mining)
Clustering in data Mining (Data Mining)Clustering in data Mining (Data Mining)
Clustering in data Mining (Data Mining)
 
Data Mining: Text and web mining
Data Mining: Text and web miningData Mining: Text and web mining
Data Mining: Text and web mining
 

Similar to Web Usage Mining Process and its Applications

Context Mining and Integration in Web Predictive Analytics
Context Mining and Integration in Web Predictive AnalyticsContext Mining and Integration in Web Predictive Analytics
Context Mining and Integration in Web Predictive AnalyticsJulia Kiseleva
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...inventionjournals
 
major ppt.pptx
major ppt.pptxmajor ppt.pptx
major ppt.pptxAnushaG52
 
Architecting AI Solutions in Azure for Business
Architecting AI Solutions in Azure for BusinessArchitecting AI Solutions in Azure for Business
Architecting AI Solutions in Azure for BusinessIvo Andreev
 
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...IRJET Journal
 
Study of Recommendation System Used In Tourism and Travel
Study of Recommendation System Used In Tourism and TravelStudy of Recommendation System Used In Tourism and Travel
Study of Recommendation System Used In Tourism and Travelijtsrd
 
IJRET : International Journal of Research in Engineering and TechnologyImprov...
IJRET : International Journal of Research in Engineering and TechnologyImprov...IJRET : International Journal of Research in Engineering and TechnologyImprov...
IJRET : International Journal of Research in Engineering and TechnologyImprov...eSAT Publishing House
 
Personalized E-commerce based recommendation systems using deep-learning tech...
Personalized E-commerce based recommendation systems using deep-learning tech...Personalized E-commerce based recommendation systems using deep-learning tech...
Personalized E-commerce based recommendation systems using deep-learning tech...IAESIJAI
 
Iaetsd web personalization a general survey
Iaetsd web personalization a general surveyIaetsd web personalization a general survey
Iaetsd web personalization a general surveyIaetsd Iaetsd
 
A recommender system-using novel deep network collaborative filtering
A recommender system-using novel deep network collaborative filteringA recommender system-using novel deep network collaborative filtering
A recommender system-using novel deep network collaborative filteringIAESIJAI
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation SystemsZia Babar
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE
 
Connecting social media to e commerce (2)
Connecting social media to e commerce (2)Connecting social media to e commerce (2)
Connecting social media to e commerce (2)krsenthamizhselvi
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxSatyam Sharma
 

Similar to Web Usage Mining Process and its Applications (20)

Web usage mining
Web usage miningWeb usage mining
Web usage mining
 
AN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONS
AN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONSAN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONS
AN EFFECTIVE FRAMEWORK FOR GENERATING RECOMMENDATIONS
 
Context Mining and Integration in Web Predictive Analytics
Context Mining and Integration in Web Predictive AnalyticsContext Mining and Integration in Web Predictive Analytics
Context Mining and Integration in Web Predictive Analytics
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
 
major ppt.pptx
major ppt.pptxmajor ppt.pptx
major ppt.pptx
 
Architecting AI Solutions in Azure for Business
Architecting AI Solutions in Azure for BusinessArchitecting AI Solutions in Azure for Business
Architecting AI Solutions in Azure for Business
 
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
 
Web Usage Pattern
Web Usage PatternWeb Usage Pattern
Web Usage Pattern
 
Study of Recommendation System Used In Tourism and Travel
Study of Recommendation System Used In Tourism and TravelStudy of Recommendation System Used In Tourism and Travel
Study of Recommendation System Used In Tourism and Travel
 
IJRET : International Journal of Research in Engineering and TechnologyImprov...
IJRET : International Journal of Research in Engineering and TechnologyImprov...IJRET : International Journal of Research in Engineering and TechnologyImprov...
IJRET : International Journal of Research in Engineering and TechnologyImprov...
 
clickstream analysis
 clickstream analysis clickstream analysis
clickstream analysis
 
Personalized E-commerce based recommendation systems using deep-learning tech...
Personalized E-commerce based recommendation systems using deep-learning tech...Personalized E-commerce based recommendation systems using deep-learning tech...
Personalized E-commerce based recommendation systems using deep-learning tech...
 
Iaetsd web personalization a general survey
Iaetsd web personalization a general surveyIaetsd web personalization a general survey
Iaetsd web personalization a general survey
 
A recommender system-using novel deep network collaborative filtering
A recommender system-using novel deep network collaborative filteringA recommender system-using novel deep network collaborative filtering
A recommender system-using novel deep network collaborative filtering
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
 
Connecting social media to e commerce (2)
Connecting social media to e commerce (2)Connecting social media to e commerce (2)
Connecting social media to e commerce (2)
 
Recommender system
Recommender system Recommender system
Recommender system
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
 
Al26234241
Al26234241Al26234241
Al26234241
 

Recently uploaded

Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAbhinavSharma374939
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 

Recently uploaded (20)

Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog Converter
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 

Web Usage Mining Process and its Applications

  • 1. WEB USAGE MINING Monu Chaudhary 071BCT522
  • 2. INTRODUCTION Web Usage mining is the process of applying data mining techniques for the discovery of usage patterns from Web data, targeted towards various applications.
  • 3. INTRODUCTION Data collected at different levels: ➢ Server level ➢ Client level ➢ Proxy level
  • 4. INTRODUCTION Goal: ➢ analyze the behavioral patterns and profiles of users interacting with a Web site ➢ Understand and better serve the needs of Web-based applications
  • 5. INTRODUCTION Classification based on Usage Data: ➢ Web server Data ➢ Application Server Data ➢ Application Level Data
  • 6. INTRODUCTION Importance: ➢ Growth of e-commerce ○ Provides an a cost effective way of doing business. ➢ Hidden useful information ○ Visitors’ profile ○ Measure online marketing effort
  • 7. INTRODUCTION 3 Phases: ➢ Preprocessing ➢ Pattern Discovery ➢ Pattern Analysis
  • 8. PREPROCESSING Preprocessing consists of converting the: ➢ usage information ➢ content information ➢ structure information contained in the various available data sources into the data abstractions necessary for pattern discovery.
  • 10. Web Usage Mining Process
  • 11.
  • 12. Preprocessing of Web Usage Mining
  • 13. Preprocessing of Web Usage Mining Data Cleaning remove irrelevant references and fields in server logs, removes erroneous references and adds missing references due to caching.
  • 14. Preprocessing of Web Usage Mining Sessionization: the activities performed by a user from the moment she enters the site until the moment she leaves it.
  • 16. Preprocessing of Web Usage Mining User Identification records multiple sessions for user. This log is called User activity record.
  • 18. Preprocessing of Web Usage Mining A page view consists of every file that contributes to the display on a user's browser at one time.
  • 19. Preprocessing of Web Usage Mining Conceptually, each Page view can be viewed as a collection of Web objects or resources representing a specific “user event,” e.g., reading an article, viewing a product page, or adding a product to the shopping cart.
  • 20. Preprocessing of Web Usage Mining Path Completion: Client- or proxy-side caching can often result in missing access references to those pages or objects that have been cached.
  • 21. Preprocessing of Web Usage Mining Path Completion: For instance, ➢ if a user returns to a page A during the same session, the second access to A will likely result in viewing the previously downloaded version of A that was cached on the client- side, and therefore, no request is made to the server.
  • 22. Preprocessing of Web Usage Mining Path Completion: ➢ This results in the second reference to A not being recorded on the server logs.
  • 24. Preprocessing of Web Usage Mining Episode is a subset or subsequence of a session comprised of semantically or functionally related page views.
  • 25. PATTERN DISCOVERY Pattern discovery draws upon methods and algorithms developed from several fields such as statistics, data mining, machine learning and pattern recognition.
  • 26. PATTERN DISCOVERY Methods: ➢ Statistical Analysis ➢ Association Rules ➢ Clustering ➢ Classification ➢ Sequential Patterns
  • 27. PATTERN ANALYSIS The motivation behind pattern analysis is to filter out uninteresting rules or patterns from the set found in the pattern discovery phase.
  • 28. PATTERN ANALYSIS Methods: ➢ A knowledge query mechanism such as SQL. ➢ Another method is to load usage data into a data cube in order to perform Online Analytical Processing (OLAP) operations.
  • 29. PATTERN ANALYSIS Methods: ➢ Visualization techniques, such as graphing patterns or assigning colors to different values. ➢ content and structure information can be used to filter out patterns containing pages of a certain usage type, content type, or pages that match a certain hyperlink structure.
  • 30. Application of Web Usage Mining
  • 31. Advantages ➢ Personalized marketing. ➢ Fight against terrorism. ➢ Customer Relationship. ➢ Increase profitability by target pricing.
  • 33. What is collaborative filtering…??? Collaborative filtering is a method of making automatic predictions about the interest of a user by collecting preferences or taste information from other other users users(I.e collaborating the interest )
  • 34. Application Mostly in e-commerce recommendation system Amazon Netflix
  • 35. This is how it works…. 1.Weight all users with respect to similarity with active user 2. Select a subset of Users to use as a set of predictors 3. Compute a prediction from a weighted combination of selected neighbors’ ratings
  • 36. Collaborative filtering types Memory Based: uses user rating data to compute similarity between users or items user rating,Neighbourhood based,Item Based etc Model Based:Uses data mining and machine learning Bayesian networks,neural embedding models,clustering models,latent semantic models such as SVD.
  • 37. Approaches for CF (memory based) User-Based CF - compute similarity based on User Item-Based CF-Compute similarity base on item
  • 38. User based CF Look for users who share the same rating patterns with the active user(the user whom the prediction is for) Use the ratings from those like-minded users to calculate a prediction for the active user
  • 39.
  • 40. Item based CF 1. Build an item-item matrix determining relationships between pairs of items 1. Infer the tastes of the current user by examining the matrix and matching that user's data
  • 41.
  • 42.
  • 43.
  • 44. Simple similarity is cosine similarity
  • 46. Collaborative Filtering problem Cold-start: There should be enough other users already in the system to find a match.New items need to get enough ratings Popularity Bias:Hard to recommend items to someone with unique tastes
  • 48. Definition ● Estimate a utility function that automatically predicts how a user will like an item ● Based on ○ Past Behavior ○ Relations to other users ○ Item similarity ○ Context
  • 49. Impact Apparent ● Advertisement ● Restaurants, cafes ● Movies, Tv shows, Music ● Books ● News articles ● Social sites including dating services
  • 50. Impact(continued) Not so apparent ● Courses in E-learning ● Drug components ● Research papers ● Citations ● Code modules
  • 52.
  • 53. Types ● Collaborative Filtering system ● Content-based system ● Hybrid recommender system ○ Context-based system ○ Knowledge-based system
  • 55.
  • 56.
  • 57.
  • 59. ● System creates a user profile based on users likes or dislikes which are explicitly stated ● Every purchase updates the user profile. ● A content-based recommender system matches the profile of item to user profile to decide its relevancy to the user
  • 60.
  • 61. Storage of items in database
  • 62.
  • 63. Content Representation ● Structured data ○ Small number of attributes ○ Each item described by same set of attributes ○ Known set of values of attributes
  • 64. Content Representation(contd...) ● Unstructured data ○ No attribute names with well defined values ○ Need to impose structure on text before use ○ Natural language complexity ■ Same word with different meaning ■ Different word with same meaning
  • 66. ● System uses additional data about context of an item consumption. ● Example: Additional component of time may be used to recommend restaurants to consumers i.e different restaurants for breakfast, lunch and so on. Further, information about whether you are going out to eat with your friends or family should also vary the recommendation.
  • 67. Major obstacles for contextual computing ● Obtain sufficient and reliable data describing user context ● Understand the impact of contextual dimensions on personalisation process ● Computational model of contextual dimensions in more classical recommendation technology ● For instance: How to extend Collaborative filtering to include contextual dimensions?
  • 69. ● A shared or group intelligence that emerges from the collaboration and competition of many individuals. ● Groups of people and computers, connected by the Internet, collectively doing intelligent things.
  • 70.
  • 71. It can be understood as an emergent property from the synergies among: 1) Data - knowledge-information 2) Software-hardware 3) Experts
  • 72. For instance, Google technology harvests knowledge generated by millions of people creating and linking web pages and then uses this knowledge to answer queries in ways that often seem amazingly intelligent.
  • 73. In Wikipedia, thousands of people around the world have collectively created a very large and high quality intellectual product with almost no centralized control, and almost all as volunteers!
  • 74. Online multi-player games are another example of collective intelligence. Games such as Dota 2, Second Life and Call of Duty rely on gamers coming together as a community to form the game’s Identity.
  • 75. Other examples: ● social networking ( perhaps the most popular of collective intelligence.) ● Amazon, Hamrobazaar & other ecommerce sites ● etc
  • 76. THANK YOU FOR YOUR PATIENCE!!