Smart Personal Shopping
Assistant (SPA)
Arvind Rapaka
Sairam Bantupalli
Ravindra Nath
SpotDy Inc
www.spotdy.com
Why do we need it?
A personal shopping assistant is a person who
helps customers by giving advice and making
suggestions. They are employed by
department stores.
But you have a mobile/web e-commerce business.
How can you turn your mobile/web application
into a smart personal shopper for your customers?
Enter the Smart Personal Shopping Assistant.
Customer
Store
Assistant
Online Store
Why do we need it?
I need skinny pants that
girls like. My size is 32
inch waist and 34 length.
Here you go. Let me
know if I should filter by
price, size or brand.
I like it. My price range is
40-50 dollars.
I need skinny pants that
girls like. 32 inch waist,
34 length.
I like these pants. Let’s buy
them.
Ok, I placed the order.
You should receive your
order by tomorrow. Best
of luck.
Why do we need it?
Can you place an order
for the red skinny pants that I
ordered last year?
Do you want the same
size?
Yes
I have placed the order.
You should receive your
pants by tomorrow.
Process Overview
ASR Image Q&A
Knowledge Graph/ Image DB
● Speech Recognition
● Image Matching
● Q & A Dialogue
ASR - Automatic Speech Recognition
Q&A - Question and Answer Dialogue
SPA - System Call Flow
Q&A Dialogue and IR
ASR
Image Analysis
Engine
Pre-computed
KD
SpotDy BigAI™
Platform
Image
Text
Voice
IR - Information Retrieval
KD - Knowledge Graph DB
Dialogue/Action
Dialogue/Action
Components
Knowledge Database (KD)
● Build Product Knowledge Database
○ Classification (LDA, Existing Taxonomy)
○ NLP Analysis (CRF, Bayesian, etc.)
○ Image Analysis/ Text Attribution (SURF)
○ Ontologies
Image Matching
● Image Analysis
○ Extract Feature (SURF Feature Extraction)
■ Find keypoints
○ Grouping Descriptors (SURF Feature Descriptor)
■ Keypoints are grouped in descriptors
○ Match images in the precomputed descriptor database.
○ Post Processing
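The matching step above can be sketched as a nearest-neighbour search over precomputed descriptors. This is a minimal illustration, not SpotDy's implementation: the names `match_product` and `database`, the 0.75 ratio threshold, and the toy 2-D descriptors are all assumptions (real SURF descriptors have 64 or 128 dimensions).

```python
import math

def euclidean(a, b):
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def match_product(query, database, ratio=0.75):
    """Vote for products whose descriptors match the query descriptors.

    query:    list of descriptor vectors from the query image
    database: list of (product_id, descriptor) pairs, precomputed offline
    A query descriptor only votes if its nearest neighbour is clearly
    closer than the second nearest (ratio test), which drops ambiguous matches.
    """
    votes = {}
    for q in query:
        ranked = sorted((euclidean(q, d), pid) for pid, d in database)
        if len(ranked) >= 2 and ranked[0][0] < ratio * ranked[1][0]:
            pid = ranked[0][1]
            votes[pid] = votes.get(pid, 0) + 1
    return votes

# Hypothetical toy data for illustration only.
database = [
    ("red_pants", [0.0, 0.0]),
    ("red_pants", [1.0, 1.0]),
    ("blue_shirt", [9.0, 9.0]),
    ("blue_shirt", [8.0, 7.0]),
]
query = [[0.1, 0.0], [1.0, 1.1], [4.5, 4.5]]
votes = match_product(query, database)
```

The third query descriptor sits roughly halfway between the two products, so the ratio test rejects it; post-processing would then pick the product with the most votes.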
ASR
● Speech Recognition
○ Extract Feature vectors
○ Speech Decoder
■ Scoring (DNN)
■ Most Likely Text from Acoustic Model (HMM/Viterbi Algorithm)
○ Pass to Q & A system
Question and Answering
● Query Analysis
○ Query Processing (Stemming, Lemmatization, Gazetteer, etc.)
○ Understand User Intent (HMM)
■ Navigational/Specificity
■ Initiate dialogue if necessary
○ Query POS Labelling / Entity Extraction (CRF)
○ Query Rewrite/ Retrieval/Post Processing
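As a rough illustration of query analysis, the sketch below extracts entities from a shopping query and decides whether a dialogue is needed. It deliberately swaps the CRF/HMM models named on the slide for a plain gazetteer lookup and a regular expression; the `GAZETTEER` contents and the functions `annotate` and `next_action` are hypothetical.

```python
import re

# Hypothetical gazetteer: entity label -> known surface forms.
GAZETTEER = {
    "color": {"red", "blue", "black"},
    "style": {"skinny", "slim", "bootcut"},
    "product": {"pants", "jeans", "shirt"},
}

def annotate(query):
    """Return a dict of entities found in the query text."""
    text = query.lower()
    slots = {}
    for token in re.findall(r"[a-z0-9]+", text):
        for label, words in GAZETTEER.items():
            if token in words:
                slots[label] = token
    # Price range written as "40-50 dollars".
    match = re.search(r"(\d+)\s*-\s*(\d+)\s*dollars", text)
    if match:
        slots["price_range"] = (int(match.group(1)), int(match.group(2)))
    return slots

def next_action(slots):
    """Search when the product is known, otherwise start a dialogue."""
    if "product" in slots:
        return "search"
    return "ask: what are you looking for?"

slots = annotate("I need red skinny pants, my price range is 40-50 dollars")
```

A trained sequence model would replace the lookup loop in practice; the slot dictionary it produces would feed the same rewrite and retrieval steps.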
Q&A Personalization
● Q & A results should be personalized and
aggregated based on:
○ Past user history
○ User Geo/Demo
○ Occasions such as Christmas, Thanksgiving, etc.
SpotDy BigAI™
Query Results
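One simple way to personalize results is to re-rank them with multiplicative boosts. The sketch below is an assumption-laden example: the field names (`brand`, `tags`, `past_brands`, `occasion`) and the boost factors 1.2 and 1.1 are invented for illustration and do not come from the slides.

```python
def personalize(results, user):
    """Re-rank results using the user's history and the current occasion.

    results: list of dicts with 'id', 'score', 'brand' and 'tags'
    user:    dict with optional 'past_brands' (set) and 'occasion' (str)
    """
    def boosted(item):
        score = item["score"]
        if item["brand"] in user.get("past_brands", ()):
            score *= 1.2  # hypothetical boost for previously bought brands
        if user.get("occasion") in item.get("tags", ()):
            score *= 1.1  # hypothetical boost for seasonal relevance
        return score

    return sorted(results, key=boosted, reverse=True)

results = [
    {"id": "a", "score": 1.0, "brand": "X", "tags": []},
    {"id": "b", "score": 0.9, "brand": "Y", "tags": ["christmas"]},
]
user = {"past_brands": {"Y"}, "occasion": "christmas"}
ranked_ids = [item["id"] for item in personalize(results, user)]
```

Geo and demographic signals could be added as further boosts in the same function.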
Algorithms
SURF (Speeded-Up Robust Features)
SURF is a feature detection method that examines an
image and extracts features that are distinctive to the
objects in the image. It is based on SIFT but faster.
In our case, it helps retrieve similar products based
on images.
The process involves:
○ Build Scale Space
○ LoG Approximation
○ Key Point Extraction
○ Generate Features
LoG Approximation
● The Laplacian is a 2-D isotropic measure of the 2nd spatial derivative of an
image.
● The Laplacian of an image highlights regions of rapid intensity change to
detect edges.
● A Gaussian smoothing filter is applied first to reduce the sensitivity to
noise that comes from taking second derivatives
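To make the Laplacian of Gaussian concrete, the sketch below samples an exact LoG kernel (up to a constant scale factor) and applies it at one pixel. It does not reproduce SURF's box-filter approximation; the function names, the 5x5 size and sigma = 1 are assumptions chosen for illustration.

```python
import math

def log_kernel(size, sigma):
    """Sample the Laplacian of Gaussian on a size x size grid (size odd).

    Values are proportional to (r^2 - 2*sigma^2) / sigma^4 * exp(-r^2 / (2*sigma^2)).
    The mean is subtracted so the kernel sums to zero and flat regions give no response.
    """
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            r2 = x * x + y * y
            row.append((r2 - 2 * sigma ** 2) / sigma ** 4 * math.exp(-r2 / (2 * sigma ** 2)))
        kernel.append(row)
    mean = sum(map(sum, kernel)) / (size * size)
    return [[v - mean for v in row] for row in kernel]

def response(image, kernel, cy, cx):
    """Filter response at pixel (cy, cx); the kernel is symmetric."""
    half = len(kernel) // 2
    return sum(
        kernel[dy + half][dx + half] * image[cy + dy][cx + dx]
        for dy in range(-half, half + 1)
        for dx in range(-half, half + 1)
    )

kernel = log_kernel(5, 1.0)
flat = [[1.0] * 9 for _ in range(9)]
edge = [[0.0 if x < 5 else 1.0 for x in range(9)] for _ in range(9)]  # vertical step edge
flat_response = response(flat, kernel, 4, 4)
edge_response = response(edge, kernel, 4, 4)
```

The flat image produces a response near zero, while the pixel next to the step edge produces a clearly nonzero response, which is the behaviour the bullets above describe.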
GMM/DNN-HMM
An HMM is a generative probabilistic model that provides
a framework for modelling time-varying spectral vector
sequences. In our case, we use it for speech recognition.
● The GMM/DNN produces per-frame scores for the HMM states
(posterior probabilities in the DNN case)
● The Viterbi algorithm finds the most likely state sequence for
the observation sequence
● Sub-word HMMs are concatenated to create larger word-based HMMs
Observations (Feature vectors)
GMM/DNN
HMM States
(Senones)
Posterior Probabilities
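The decoding step can be sketched with a small log-space Viterbi implementation. This is a generic textbook version, not the decoder used in SPA; it assumes the per-frame state scores from the GMM/DNN are already available as log values, and the two-state model and its numbers are made up.

```python
import math

def viterbi(log_emit, log_trans, log_init):
    """Return the most likely state path.

    log_emit[t][s]:  log score of state s for frame t (from the GMM/DNN)
    log_trans[p][s]: log probability of moving from state p to state s
    log_init[s]:     log probability of starting in state s
    """
    n_states = len(log_init)
    score = [log_init[s] + log_emit[0][s] for s in range(n_states)]
    back = []  # back[t-1][s] = best predecessor of state s at frame t
    for t in range(1, len(log_emit)):
        new_score, pointers = [], []
        for s in range(n_states):
            prev = max(range(n_states), key=lambda p: score[p] + log_trans[p][s])
            pointers.append(prev)
            new_score.append(score[prev] + log_trans[prev][s] + log_emit[t][s])
        score = new_score
        back.append(pointers)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]

def logs(rows):
    return [[math.log(v) for v in row] for row in rows]

# Hypothetical two-state example: frames favour state 0 first, then state 1.
emit = logs([[0.9, 0.1], [0.8, 0.2], [0.2, 0.8], [0.1, 0.9]])
trans = logs([[0.7, 0.3], [0.3, 0.7]])
init = [math.log(0.5), math.log(0.5)]
path = viterbi(emit, trans, init)
```

In a real recogniser the states would be senones, and the word sequence would be read off the decoded path through the concatenated sub-word HMMs.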
NLP
The Knowledge Database (KD) is the key to query
processing and information retrieval.
● NLP is used extensively to process unstructured data when building
the KD.
Algorithms:
● Conditional Random Fields/Maxent for POS Tagging, Entity
Extraction, concept tagging etc.
● LDA for topic Analysis and Classification
Q&A Dialogue and IR
Indexed
KD
Product
Catalog
Product
metainfo
NLP Engine
Query Processing
Query
Indexed
KD
Annotators/Filters
Results
● The user query passes through
various annotators, including:
○ Gazetteer, Lemmatization,
Stemming, POS Tagging,
Entity Extraction
● Query Rewrite
● Search - Similarity (IR). Basic
algorithms include:
○ Vector Space Modelling
○ BM25F
● Result Generation
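The retrieval step can be illustrated with a small ranking function. Note that this sketch implements plain BM25 over a single text field, a simplification of the field-weighted BM25F named above; the parameter values k1 = 1.2 and b = 0.75 are common defaults, and the three-product catalog is invented.

```python
import math
from collections import Counter

def bm25_scores(query_terms, docs, k1=1.2, b=0.75):
    """Score each tokenized document against the query with plain BM25."""
    n = len(docs)
    avg_len = sum(len(d) for d in docs) / n
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

# Hypothetical product descriptions, already tokenized by the annotators.
docs = [
    ["red", "skinny", "pants"],
    ["blue", "cotton", "shirt"],
    ["black", "skinny", "jeans"],
]
scores = bm25_scores(["red", "skinny", "pants"], docs)
```

BM25F would extend this by computing term frequencies per field (for example title, brand and description) and combining them with field weights before the saturation step.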
Scalability
SPA - HA Architecture
Significant computing resources are required while
scaling to millions of requests in real time.
BigAI™
BigAI™ is purpose-built for the scalability
of applications such as SPA.
● Building KD (Knowledge Database)
● Image Repository Store
● Query Processing
● Scalable Machine Learning Models
Q&A

Personal Shopping Assistant - A Big Data Problem
