SlideShare a Scribd company logo
1 of 25
Download to read offline
Recommender Systems: Challenges
and Opportunities
Ayse Bener
January 22, 2015
}  Search is what you do when you're looking for
something. Discovery is when something
wonderful that you didn't know existed, or
didn’t know how to ask for, finds you.” – CNN
Money, “The race to create ‘smart’ Google.”
From Search to Recommendation
Recommender problem
the user is
} Consumer
} Subscriber
} Member
Estimate a utility function
to predict
how a user will like an item
the item is
}  Movie
}  Apps
}  Travel destinations
Recommender
}  A good recommendation
} Relevant to the user
} Personalized
} Diverse
} Expands the user's taste into neighboring
areas (serendipity – unsought finding)
Paradigm of Recommender Systems
}  Recommender systems reduce information overload by
estimating relevance
}  Collaborative filtering :What is popular in a community
}  User profile & community information
}  Content Based: Provides more of what user liked before
}  User profile & Item profile
}  Knowledge Based :What is best based on the users’ needs
}  User profile & Item profile & Knowledge Model
}  Hybrid Method: Combination of inputs and/or composition of
different methods
}  User profile & Item profile & knowledge Model & Community
Information
Recommender Systems Challenges
}  Dealing with Big Data problems
}  Lack of Useful Data
}  Unstructured data
}  Missing Data
}  New user and New Item
}  Cold Start problem
}  Temporality
}  Changing Data
}  Changing user preferences and biases
}  Negative choices
}  Evaluating Recommenders
Main Research Issues
}  Understanding the context and modeling context
}  Algorithms
}  Evaluation
}  Engineering
Bayesian Networks For Evidence-Based Decision-
Making in Software Engineering
Ayse Tosun Misirli, and Ayse Bener, IEEE Transactions on
Software Engineering, vol.40, no.6., June 2014
Recommendation systems
for software engineering (RSSE)
}  Recommendation systems/ prediction models should be
designed in a way that they are capable of integrating
evidence, i.e., facts and probabilities systematically
collected or measured from real data and observations,
into practitioners’ experience.
}  In this study, we follow the lead of computational biology
and healthcare decision-making, and investigate the
applications of BNs in SE
The Bayesian Approach
}  Provides a natural statistical framework for evidence-based
decision-making by incorporating an integrated summary of
the available evidence and associated uncertainty (of
consequences)
}  Maintaining observations, statistical distributions, prior
assumptions, and expert judgment in a single model
}  Encoding causal relationships among variables for predicting
future actions
}  “information propagation through the network”, i.e., gaming
over the network to see all possible scenarios and their
outcomes to give the best action
}  imitating the process of human thinking, while going beyond the
capabilities of human reasoning with a fact-based, error-free
intelligence through the usage of enormous amounts of
historical data
Example of a simple BN with different
variable types
Systematic Mapping of BNs in SWE
}  To investigate the
applications of BNs
in SE
}  main software
engineering
challenges addressed
}  techniques used to
learn causal
relationships among
variables
}  techniques used to
infer the
parameters
}  variable types used
as BN nodes
Empirical Analysis on Bayesian Decision-
Making
}  Hybrid Bayesian Network that would solve a specific
software engineering challenge
}  predicting software reliability in terms of post-release defects
}  a ’mixeddata’ model to represent software life cycle
phases by incorporating expert judgment (qualitative data
through surveys) into quantitative data collected from
software repositories
}  a ’hybrid’ BN that incorporates both continuous and
categorical variables
Demographics for Two Software Companies
BN Models in this Study
Model Representation
Model #1
Model #2
Model #3
Graphical Representation of BN (Co. A)
Graphical Representation of BN (Co. B)
Setting Prior Distributions
}  Model #1
}  expert knowledge
}  Model #2
}  Lilliefors significance test on all variables and on post
release defects
}  normal probability plots
}  Model #3
}  The requirements specification subnet whose
distributions were set based on expert knowledge is used,
and it is incorporated with the development and testing
subnet in Model #2 whose variables are assigned different
distributions based on the significance tests
Structure Learning
}  Expert Judgement
}  Chi-plot
}  Independence betwen two variables
}  Copula models- a transformation of data with marginal distributions
}  Prior to modeling it is necessary to chack the presence of dependence
there is a positive monotone dependence
between test cases and post release
defects as data pairs are shifted towards
right from the center
Inference
}  Bayesian learning for complex models using Monte Carlo
methods, especially Gibbs sampling
}  insufficient statistics
}  incomplete data
}  successively sample from posterior distribution of each
node in a Bayesian model given all the others as full
conditionals
}  successful when estimating the unknown parameters of
probability distributions or when conducting empirical
analysis to infer true values of a given sample
}  enables to make predictions for future scenarios even though
some of the input variables are missing
Prediction Performance of the Models
Threats to Validity
}  Internal validity
}  biases during data collection
}  Used scripts to extract data
}  Eliminated outliers
}  BNs for causality and to avoid over-fitting
}  Construct validity
}  Large set of metrics were chosen
}  Well-known performance measures are used
}  Conclusion validity
}  Non-parametric test (Mann-Whitney U-test),ANOVA, t-test were used
}  External validity
}  we aim to transfer the methodology behind BN construction to enhance the
usage of these graphical, probabilistic models in software engineering
Conclusions
}  Similar to computational biology and healthcare, we need
to make decisions under uncertainty using multiple data
sources
}  As we understand the dynamics of BNs and the
techniques used for model learning, these models would
enable us to uncover hidden relationships between
variables, which cannot be easily identified by experts
}  Understanding the theory behind BNs also gives us the
opportunity to adopt these models to different industrial
settings by changing the set of metrics, their distributions,
and causal relationships among variables
Conclusions
}  An integrated tool support (intelligent software delivery
platform)
}  Dione – to be integrated to IBM Rational

More Related Content

What's hot

Twala2007.doc
Twala2007.docTwala2007.doc
Twala2007.doc
butest
 
Report of Previous Project by Yifan Guo
Report of Previous Project by Yifan GuoReport of Previous Project by Yifan Guo
Report of Previous Project by Yifan Guo
Yifan Guo
 
GTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerce
GrubhubTech
 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Devansh16
 

What's hot (20)

Query aware determinization of uncertain objects
Query aware determinization of uncertain objectsQuery aware determinization of uncertain objects
Query aware determinization of uncertain objects
 
At4102337341
At4102337341At4102337341
At4102337341
 
Crowdsourcing predictors of behavioral outcomes
Crowdsourcing predictors of behavioral outcomesCrowdsourcing predictors of behavioral outcomes
Crowdsourcing predictors of behavioral outcomes
 
Design Presentation-CGillies
Design Presentation-CGilliesDesign Presentation-CGillies
Design Presentation-CGillies
 
F363941
F363941F363941
F363941
 
Assess data reliability from a set of criteria using the theory of belief fun...
Assess data reliability from a set of criteria using the theory of belief fun...Assess data reliability from a set of criteria using the theory of belief fun...
Assess data reliability from a set of criteria using the theory of belief fun...
 
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubsUnsupervised Distance Based Detection of Outliers by using Anti-hubs
Unsupervised Distance Based Detection of Outliers by using Anti-hubs
 
Data quality assessment in context= a cognitive perspective
Data quality assessment in context= a cognitive perspectiveData quality assessment in context= a cognitive perspective
Data quality assessment in context= a cognitive perspective
 
Survey: Biological Inspired Computing in the Network Security
Survey: Biological Inspired Computing in the Network SecuritySurvey: Biological Inspired Computing in the Network Security
Survey: Biological Inspired Computing in the Network Security
 
Twala2007.doc
Twala2007.docTwala2007.doc
Twala2007.doc
 
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
IRJET- An Extensive Study of Sentiment Analysis Techniques and its Progressio...
 
User friendly pattern search paradigm
User friendly pattern search paradigmUser friendly pattern search paradigm
User friendly pattern search paradigm
 
Cognitive automation
Cognitive automationCognitive automation
Cognitive automation
 
Research Method EMBA chapter 12
Research Method EMBA chapter 12Research Method EMBA chapter 12
Research Method EMBA chapter 12
 
Report of Previous Project by Yifan Guo
Report of Previous Project by Yifan GuoReport of Previous Project by Yifan Guo
Report of Previous Project by Yifan Guo
 
"Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm"
"Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm""Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm"
"Agro-Market Prediction by Fuzzy based Neuro-Genetic Algorithm"
 
System Adoption: Socio-Technical Integration
System Adoption: Socio-Technical IntegrationSystem Adoption: Socio-Technical Integration
System Adoption: Socio-Technical Integration
 
GTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerce
 
IRJET- Analysis of Question and Answering Recommendation System
IRJET-  	  Analysis of Question and Answering Recommendation SystemIRJET-  	  Analysis of Question and Answering Recommendation System
IRJET- Analysis of Question and Answering Recommendation System
 
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
Paper Annotated: SinGAN-Seg: Synthetic Training Data Generation for Medical I...
 

Viewers also liked

Recommender systems bener
Recommender systems   benerRecommender systems   bener
Recommender systems bener
diannepatricia
 

Viewers also liked (17)

Pagi World from RPI Licato and Bringsjord
Pagi World from RPI Licato and BringsjordPagi World from RPI Licato and Bringsjord
Pagi World from RPI Licato and Bringsjord
 
Recommender systems bener
Recommender systems   benerRecommender systems   bener
Recommender systems bener
 
2015 07-30-sysetm t.short.no.animation
2015 07-30-sysetm t.short.no.animation2015 07-30-sysetm t.short.no.animation
2015 07-30-sysetm t.short.no.animation
 
Wat eco cogins9-24pres16x9
Wat eco cogins9-24pres16x9Wat eco cogins9-24pres16x9
Wat eco cogins9-24pres16x9
 
Machine Learning for Understanding and Managing Ecosystems
Machine Learning for Understanding and Managing EcosystemsMachine Learning for Understanding and Managing Ecosystems
Machine Learning for Understanding and Managing Ecosystems
 
Pre distaster planning for transportation networks
Pre distaster planning for transportation networksPre distaster planning for transportation networks
Pre distaster planning for transportation networks
 
Design for Learning and Assessment in Virtual Worlds
Design for Learning and Assessment in Virtual WorldsDesign for Learning and Assessment in Virtual Worlds
Design for Learning and Assessment in Virtual Worlds
 
Biemann ibm cog_comp_jan2015_noanim
Biemann ibm cog_comp_jan2015_noanimBiemann ibm cog_comp_jan2015_noanim
Biemann ibm cog_comp_jan2015_noanim
 
Ibm cog institutetalk_diab
Ibm cog institutetalk_diabIbm cog institutetalk_diab
Ibm cog institutetalk_diab
 
"Km4City: Smart City Ontology Building for Effective Erogation of Services"
"Km4City: Smart City Ontology Building for Effective Erogation of Services""Km4City: Smart City Ontology Building for Effective Erogation of Services"
"Km4City: Smart City Ontology Building for Effective Erogation of Services"
 
3.2 part 2 by group mighty chondrion ( 5 curie)
3.2 part 2 by group mighty chondrion ( 5 curie)3.2 part 2 by group mighty chondrion ( 5 curie)
3.2 part 2 by group mighty chondrion ( 5 curie)
 
Martin Takac - “Solving Large-Scale Machine Learning Problems in a Distribute...
Martin Takac - “Solving Large-Scale Machine Learning Problems in a Distribute...Martin Takac - “Solving Large-Scale Machine Learning Problems in a Distribute...
Martin Takac - “Solving Large-Scale Machine Learning Problems in a Distribute...
 
From Semantic Parsing to Reasoning
From Semantic Parsing to ReasoningFrom Semantic Parsing to Reasoning
From Semantic Parsing to Reasoning
 
Creating Instructable Cognitive Agents
Creating Instructable Cognitive Agents Creating Instructable Cognitive Agents
Creating Instructable Cognitive Agents
 
Affective computing short -external
Affective computing   short -externalAffective computing   short -external
Affective computing short -external
 
"Toward Generating Domain-specific / Personalized Problem Lists from Electron...
"Toward Generating Domain-specific / Personalized Problem Lists from Electron..."Toward Generating Domain-specific / Personalized Problem Lists from Electron...
"Toward Generating Domain-specific / Personalized Problem Lists from Electron...
 
Laird ibm-small
Laird ibm-smallLaird ibm-small
Laird ibm-small
 

Similar to Recommender Systems Bener

Similar to Recommender Systems Bener (20)

Cw36587594
Cw36587594Cw36587594
Cw36587594
 
Predicting Life Expectancy of Hepatitis B Patients
Predicting Life Expectancy of Hepatitis B PatientsPredicting Life Expectancy of Hepatitis B Patients
Predicting Life Expectancy of Hepatitis B Patients
 
Research-KS-Jun2015
Research-KS-Jun2015Research-KS-Jun2015
Research-KS-Jun2015
 
Feature selection in multimodal
Feature selection in multimodalFeature selection in multimodal
Feature selection in multimodal
 
A Software Measurement Using Artificial Neural Network and Support Vector Mac...
A Software Measurement Using Artificial Neural Network and Support Vector Mac...A Software Measurement Using Artificial Neural Network and Support Vector Mac...
A Software Measurement Using Artificial Neural Network and Support Vector Mac...
 
Forecasting movie rating using k-nearest neighbor based collaborative filtering
Forecasting movie rating using k-nearest neighbor based  collaborative filteringForecasting movie rating using k-nearest neighbor based  collaborative filtering
Forecasting movie rating using k-nearest neighbor based collaborative filtering
 
Booster in High Dimensional Data Classification
Booster in High Dimensional Data ClassificationBooster in High Dimensional Data Classification
Booster in High Dimensional Data Classification
 
Context Based Classification of Reviews Using Association Rule Mining, Fuzzy ...
Context Based Classification of Reviews Using Association Rule Mining, Fuzzy ...Context Based Classification of Reviews Using Association Rule Mining, Fuzzy ...
Context Based Classification of Reviews Using Association Rule Mining, Fuzzy ...
 
Demography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendationDemography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendation
 
Resume garima garg
Resume garima gargResume garima garg
Resume garima garg
 
Resume garima garg
Resume garima gargResume garima garg
Resume garima garg
 
Software Cost Estimation Using Clustering and Ranking Scheme
Software Cost Estimation Using Clustering and Ranking SchemeSoftware Cost Estimation Using Clustering and Ranking Scheme
Software Cost Estimation Using Clustering and Ranking Scheme
 
Lecture 2.pptx
Lecture 2.pptxLecture 2.pptx
Lecture 2.pptx
 
JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
JAVA 2013 IEEE DATAMINING PROJECT Crowdsourcing predictors of behavioral outc...
 
Crowdsourcing predictors of behavioral outcomes
Crowdsourcing predictors of behavioral outcomesCrowdsourcing predictors of behavioral outcomes
Crowdsourcing predictors of behavioral outcomes
 
An Adaptive Framework for Enhancing Recommendation Using Hybrid Technique
An Adaptive Framework for Enhancing Recommendation Using Hybrid TechniqueAn Adaptive Framework for Enhancing Recommendation Using Hybrid Technique
An Adaptive Framework for Enhancing Recommendation Using Hybrid Technique
 
Datascience
DatascienceDatascience
Datascience
 
datascience.docx
datascience.docxdatascience.docx
datascience.docx
 
Performance Analysis of Selected Classifiers in User Profiling
Performance Analysis of Selected Classifiers in User ProfilingPerformance Analysis of Selected Classifiers in User Profiling
Performance Analysis of Selected Classifiers in User Profiling
 
Recommendation System using Machine Learning Techniques
Recommendation System using Machine Learning TechniquesRecommendation System using Machine Learning Techniques
Recommendation System using Machine Learning Techniques
 

More from diannepatricia

More from diannepatricia (20)

Teaching cognitive computing with ibm watson
Teaching cognitive computing with ibm watsonTeaching cognitive computing with ibm watson
Teaching cognitive computing with ibm watson
 
Cognitive systems institute talk 8 june 2017 - v.1.0
Cognitive systems institute talk   8 june 2017 - v.1.0Cognitive systems institute talk   8 june 2017 - v.1.0
Cognitive systems institute talk 8 june 2017 - v.1.0
 
Building Compassionate Conversational Systems
Building Compassionate Conversational SystemsBuilding Compassionate Conversational Systems
Building Compassionate Conversational Systems
 
“Artificial Intelligence, Cognitive Computing and Innovating in Practice”
“Artificial Intelligence, Cognitive Computing and Innovating in Practice”“Artificial Intelligence, Cognitive Computing and Innovating in Practice”
“Artificial Intelligence, Cognitive Computing and Innovating in Practice”
 
Cognitive Insights drive self-driving Accessibility
Cognitive Insights drive self-driving AccessibilityCognitive Insights drive self-driving Accessibility
Cognitive Insights drive self-driving Accessibility
 
Artificial Intellingence in the Car
Artificial Intellingence in the CarArtificial Intellingence in the Car
Artificial Intellingence in the Car
 
“Semantic PDF Processing & Document Representation”
“Semantic PDF Processing & Document Representation”“Semantic PDF Processing & Document Representation”
“Semantic PDF Processing & Document Representation”
 
Joining Industry and Students for Cognitive Solutions at Karlsruhe Services R...
Joining Industry and Students for Cognitive Solutions at Karlsruhe Services R...Joining Industry and Students for Cognitive Solutions at Karlsruhe Services R...
Joining Industry and Students for Cognitive Solutions at Karlsruhe Services R...
 
170330 cognitive systems institute speaker series mark sherman - watson pr...
170330 cognitive systems institute speaker series    mark sherman - watson pr...170330 cognitive systems institute speaker series    mark sherman - watson pr...
170330 cognitive systems institute speaker series mark sherman - watson pr...
 
“Fairness Cases as an Accelerant and Enabler for Cognitive Assistance Adoption”
“Fairness Cases as an Accelerant and Enabler for Cognitive Assistance Adoption”“Fairness Cases as an Accelerant and Enabler for Cognitive Assistance Adoption”
“Fairness Cases as an Accelerant and Enabler for Cognitive Assistance Adoption”
 
Cognitive Assistance for the Aging
Cognitive Assistance for the AgingCognitive Assistance for the Aging
Cognitive Assistance for the Aging
 
From complex Systems to Networks: Discovering and Modeling the Correct Network"
From complex Systems to Networks: Discovering and Modeling the Correct Network"From complex Systems to Networks: Discovering and Modeling the Correct Network"
From complex Systems to Networks: Discovering and Modeling the Correct Network"
 
The Role of Dialog in Augmented Intelligence
The Role of Dialog in Augmented IntelligenceThe Role of Dialog in Augmented Intelligence
The Role of Dialog in Augmented Intelligence
 
Developing Cognitive Systems to Support Team Cognition
Developing Cognitive Systems to Support Team CognitionDeveloping Cognitive Systems to Support Team Cognition
Developing Cognitive Systems to Support Team Cognition
 
Cyber-Social Learning Systems
Cyber-Social Learning SystemsCyber-Social Learning Systems
Cyber-Social Learning Systems
 
“IT Technology Trends in 2017… and Beyond”
“IT Technology Trends in 2017… and Beyond”“IT Technology Trends in 2017… and Beyond”
“IT Technology Trends in 2017… and Beyond”
 
"Curious Learning: using a mobile platform for early literacy education as a ...
"Curious Learning: using a mobile platform for early literacy education as a ..."Curious Learning: using a mobile platform for early literacy education as a ...
"Curious Learning: using a mobile platform for early literacy education as a ...
 
Embodied Cognition - Booch HICSS50
Embodied Cognition - Booch HICSS50Embodied Cognition - Booch HICSS50
Embodied Cognition - Booch HICSS50
 
KATE - a Platform for Machine Learning
KATE - a Platform for Machine LearningKATE - a Platform for Machine Learning
KATE - a Platform for Machine Learning
 
Cognitive Computing for Aging Society
Cognitive Computing for Aging SocietyCognitive Computing for Aging Society
Cognitive Computing for Aging Society
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Recommender Systems Bener

  • 1. Recommender Systems: Challenges and Opportunities Ayse Bener January 22, 2015
  • 2. }  Search is what you do when you're looking for something. Discovery is when something wonderful that you didn't know existed, or didn’t know how to ask for, finds you.” – CNN Money, “The race to create ‘smart’ Google.” From Search to Recommendation
  • 3. Recommender problem the user is } Consumer } Subscriber } Member Estimate a utility function to predict how a user will like an item the item is }  Movie }  Apps }  Travel destinations
  • 4. Recommender }  A good recommendation } Relevant to the user } Personalized } Diverse } Expands the user's taste into neighboring areas (serendipity – unsought finding)
  • 5. Paradigm of Recommender Systems }  Recommender systems reduce information overload by estimating relevance }  Collaborative filtering :What is popular in a community }  User profile & community information }  Content Based: Provides more of what user liked before }  User profile & Item profile }  Knowledge Based :What is best based on the users’ needs }  User profile & Item profile & Knowledge Model }  Hybrid Method: Combination of inputs and/or composition of different methods }  User profile & Item profile & knowledge Model & Community Information
  • 6. Recommender Systems Challenges }  Dealing with Big Data problems }  Lack of Useful Data }  Unstructured data }  Missing Data }  New user and New Item }  Cold Start problem }  Temporality }  Changing Data }  Changing user preferences and biases }  Negative choices }  Evaluating Recommenders
  • 7. Main Research Issues }  Understanding the context and modeling context }  Algorithms }  Evaluation }  Engineering
  • 8. Bayesian Networks For Evidence-Based Decision- Making in Software Engineering Ayse Tosun Misirli, and Ayse Bener, IEEE Transactions on Software Engineering, vol.40, no.6., June 2014
  • 9. Recommendation systems for software engineering (RSSE) }  Recommendation systems/ prediction models should be designed in a way that they are capable of integrating evidence, i.e., facts and probabilities systematically collected or measured from real data and observations, into practitioners’ experience. }  In this study, we follow the lead of computational biology and healthcare decision-making, and investigate the applications of BNs in SE
  • 10. The Bayesian Approach }  Provides a natural statistical framework for evidence-based decision-making by incorporating an integrated summary of the available evidence and associated uncertainty (of consequences) }  Maintaining observations, statistical distributions, prior assumptions, and expert judgment in a single model }  Encoding causal relationships among variables for predicting future actions }  “information propagation through the network”, i.e., gaming over the network to see all possible scenarios and their outcomes to give the best action }  imitating the process of human thinking, while going beyond the capabilities of human reasoning with a fact-based, error-free intelligence through the usage of enormous amounts of historical data
  • 11. Example of a simple BN with different variable types
  • 12. Systematic Mapping of BNs in SWE }  To investigate the applications of BNs in SE }  main software engineering challenges addressed }  techniques used to learn causal relationships among variables }  techniques used to infer the parameters }  variable types used as BN nodes
  • 13. Empirical Analysis on Bayesian Decision- Making }  Hybrid Bayesian Network that would solve a specific software engineering challenge }  predicting software reliability in terms of post-release defects }  a ’mixeddata’ model to represent software life cycle phases by incorporating expert judgment (qualitative data through surveys) into quantitative data collected from software repositories }  a ’hybrid’ BN that incorporates both continuous and categorical variables
  • 14. Demographics for Two Software Companies
  • 15. BN Models in this Study
  • 19. Setting Prior Distributions }  Model #1 }  expert knowledge }  Model #2 }  Lilliefors significance test on all variables and on post release defects }  normal probability plots }  Model #3 }  The requirements specification subnet whose distributions were set based on expert knowledge is used, and it is incorporated with the development and testing subnet in Model #2 whose variables are assigned different distributions based on the significance tests
  • 20. Structure Learning }  Expert Judgement }  Chi-plot }  Independence betwen two variables }  Copula models- a transformation of data with marginal distributions }  Prior to modeling it is necessary to chack the presence of dependence there is a positive monotone dependence between test cases and post release defects as data pairs are shifted towards right from the center
  • 21. Inference }  Bayesian learning for complex models using Monte Carlo methods, especially Gibbs sampling }  insufficient statistics }  incomplete data }  successively sample from posterior distribution of each node in a Bayesian model given all the others as full conditionals }  successful when estimating the unknown parameters of probability distributions or when conducting empirical analysis to infer true values of a given sample }  enables to make predictions for future scenarios even though some of the input variables are missing
  • 23. Threats to Validity }  Internal validity }  biases during data collection }  Used scripts to extract data }  Eliminated outliers }  BNs for causality and to avoid over-fitting }  Construct validity }  Large set of metrics were chosen }  Well-known performance measures are used }  Conclusion validity }  Non-parametric test (Mann-Whitney U-test),ANOVA, t-test were used }  External validity }  we aim to transfer the methodology behind BN construction to enhance the usage of these graphical, probabilistic models in software engineering
  • 24. Conclusions }  Similar to computational biology and healthcare, we need to make decisions under uncertainty using multiple data sources }  As we understand the dynamics of BNs and the techniques used for model learning, these models would enable us to uncover hidden relationships between variables, which cannot be easily identified by experts }  Understanding the theory behind BNs also gives us the opportunity to adopt these models to different industrial settings by changing the set of metrics, their distributions, and causal relationships among variables
  • 25. Conclusions }  An integrated tool support (intelligent software delivery platform) }  Dione – to be integrated to IBM Rational