As companies continue to invest in big data, their focus is shifting from predictive analytics for reporting and business dashboards to machine learning & AI for real-time intelligent decision-making embedded in software. Many organizations are testing, exploring and piloting applications that automatically promote trending products, adjust prices or respond to alerts raised by intelligent real-time systems.
In her talk, Ms. Victoria Livschitz, founder and CTO of Grid Dynamics, will discuss common business drivers of real-time analytics applications and the emerging platforms for building such applications.
To learn more, read an adjoining blog series on this topic here : https://blog.griddynamics.com/in-stream-processing-service-blueprint
Webinar: Analytics with NoSQL: Why, for What, and When?MongoDB
Similar to Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 CIO Leadership Forum: Data Strategy & Innovation, Boston, MA (20)
Open Blueprint for Real-Time Analytics with In-Stream Processing (ISP); 2017 CIO Leadership Forum: Data Strategy & Innovation, Boston, MA
1. Privileged and confidential
Open Blueprint for Real-Time Analytics
with In-Stream Processing (ISP)
Victoria Livschitz, Founder & CTO, Grid Dynamics
03/16/2017
2. 2
Business Need
About the speaker:
CTO @Grid Dynamics: present
Founder and CEO @Grid Dynamics: 2006 – 2013
Principal engineer @Sun: 1997 – 2006
Engineering IT services company focused on digital transformation
through cloud, big data & open source for Fortune 500 clients.
Pioneer in real-time processing from company’s inception in 2006.
Architected 3 out of top-10 busiest e-commerce sites. Never had
production outage in peak season.
Frequent contributor to open source projects: Hadoop, Solr,
Lucene, Storm, others.
Victoria Livschitz
About Grid Dynamics:
3. 3
Agenda
• What is “real-time” in analytics, and why it matters?
• In-Stream Processing: emerging platform for real-time processing
• Open ISP blueprint: reference architecture, reference implementation
5. 5
What is “real-time” in analytics, machine learning,
data sciences & AI?
Receive
event
Event
Analyze
event
Act on
event
ResponseAugment
model
How long is the cycle?
What is done online vs. offline?
Learning Analysis
6. 6
Weeks Days Hours Seconds
Receive
event
Event
Analyze
event
Act on
event
ResponseAugment
model
How long is the cycle?
What is done online vs. offline?
Learning Analysis
What is “real-time” in analytics, machine learning,
data sciences & AI?
8. 2. Offline learning, real-time
analytics, online response
Event
Act on
event
Response
Receive
event
A few seconds
A day
Receive
event
Augment
model
Analyze
event
Modify
reaction
1.Offline learning/analytics, online response
Valueof“real-time”
Event
Receive
event Response
Analyze
event
Act on
event
A few
seconds
Receive
event
Augment
modelA day
9. Receive
event
Analyze
event
Act on
event
Augment
model
3. Real-time learning/analytics, online response A few seconds
2. Offline learning, real-time
analytics, online response
Event
Act on
event
Response
Receive
event
A few seconds
A day
Receive
event
Augment
model
Analyze
event
Modify
reaction
1.Offline learning/analytics, online response
Valueof“real-time”
Event
Receive
event Response
Analyze
event
Act on
event
A few
seconds
Receive
event
Augment
modelA day
Event Response
11. 11
Example: B2C retail use cases relative to “real-timeness”
Level 1: Segmented historic context: data on
what happened to all such customers before
Level 3: Situational context: where customer is,
what she wants – or might buy - right now
Level 4: Supply chain dynamics: demand surge,
product availability, competitive pricing
From time to time, send a coupon
based on a segment
Level 2: individualized historic context: 360-
degree view across personal data
On a birthday, offer a coupon based
on personal history
Right now, offer a product based on
what’s in her hands
During a storm, deliver trending
umbrella/pancho combo
Example: Personalized Offers
12. 12
Level 1: Segmented historic context: data on
what happened to all such customers before
Level 3: Situational context: where customer is,
what she wants – or might buy - right now
Level 4: Supply/demand dynamics: impact of
demand surge, shortage, competitive actions...
Level 2: individualized historic context: 360-
degree view across individual’s data Suited
for offline
ML
Requires
real-time
ML
Historic aggregated
data
Real-time
individual’s data
Historic
individual’s data
Real-time
everything
Example: B2C retail use cases relative to “real-timeness”
13. 13
Example 1: top drivers of real-time applications in retail
#3. Dynamic pricing
Determine “right price” for products
based on availability, trending,
personal context & competitive price
#1. Personalized search
Augment search hits and relevancy
ranking based on personal context
#2. Personalized offers
Motivate “buy now” behavior by
offering context-aware deals
#4. Dynamic inventory
Predict inventory needs & re-stock
products in stores based on
fluctuations in inventory & demand
#5. Intelligent sourcing
Determine what order to source from
what store to optimize delivery SLAs
& shipment costs
#6. Real-time alerts
Detect unusual patterns: fraud, surge in
demand, weather changes, shift in
brand sentiment. Respond right away
22. 22
Blueprint goals
Scalable to
100,000+
events /second
Real-time streaming;
real-time ML
Cloud-portable
Proven for mission-
critical use
Open source
(and built 100%
with open source)
Production-ready
Portable across
clouds
Extendable
24. 24
Designed as a complete platform
• No single points of failure
• No bottlenecks
• Built-in scaling
• Dockerized
• Deployable to any cloud
• Reference implementation for
AWS (open source)
• Reference demo: real-time
twitter sentiment analytics for
new movie reviews
25. 25
Where to learn more
• 7-part blog series on ISP
• 7-part blog series on Data Science Kitchen
1. Read our blog: blog.griddynamics.com
2. Let’s chat today
• Stop by our booth, chat in a corridor
3. Connect
• Twitter: @griddynamics
• Subscribe to our blog
• Drop email: info@griddynamics.com