11. NoSQL Job Skills (LinkedIn)
Source: 451 Group
“After three years it has become clear that in terms of LinkedIn member profiles there is only one trend: the total dominance of MongoDB.”
– Matt Aslett, 451 Group
15. Data Lake
• Centralized repository for analytics against data collected from operational systems
• Extension of EDW: often based on Hadoop
• 50% of organizations invested in data lakes*
* Gartner
16. Where we fit in: Operationalized Data Lake
[Diagram: data flow through an operationalized data lake]
• Data sources: Sensors, User Data, Clickstreams, Logs
• Message Queue: Raw Data, Processed Events
• Real-Time Access: Millisecond latency. Expressive querying & flexible indexing against subsets of data. Updates in place. In-database aggregations & transformations
• Batch Processing, Batch Views: Multi-minute latency with scans across TB/PB of data. No indexes. Data stored in 128MB blocks. Write-once-read-many & append-only storage model
• Distributed Processing Frameworks
• Analytics: Churn Analysis, Enriched Customer Profiles, Risk Modeling, Predictive Analytics
• Applications: Customer Data Mgmt, Mobile App, IoT App, Live Dashboards
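The contrast on this slide between the real-time and batch sides can be made concrete with a toy sketch. This is an illustration only, written in plain Python with no database involved: the class names `OperationalStore` and `AppendOnlyLog`, the `city` field, and the sample events are all hypothetical and not taken from the deck. It shows indexed lookups with in-place updates on one side and an append-only store answered by full scans on the other.

```python
from collections import defaultdict


class OperationalStore:
    """Toy stand-in for the real-time side: indexed reads, in-place updates."""

    def __init__(self):
        self.docs = {}                    # primary key -> current document
        self.by_city = defaultdict(set)   # secondary index: city -> keys

    def upsert(self, key, doc):
        old = self.docs.get(key)
        if old is not None:
            # Update in place: drop the stale index entry first.
            self.by_city[old["city"]].discard(key)
        self.docs[key] = doc
        self.by_city[doc["city"]].add(key)

    def find_by_city(self, city):
        # The index narrows the read to the matching subset of documents.
        return [self.docs[k] for k in sorted(self.by_city[city])]


class AppendOnlyLog:
    """Toy stand-in for the batch side: append-only, no indexes, full scans."""

    def __init__(self):
        self.records = []

    def append(self, key, doc):
        # Write once: earlier records are never modified.
        self.records.append((key, doc))

    def latest_by_city(self, city):
        latest = {}
        for key, doc in self.records:     # scan every record ever written
            latest[key] = doc
        return [d for _, d in sorted(latest.items()) if d["city"] == city]


# Hypothetical events: user u1 moves from Paris to Oslo.
events = [
    ("u1", {"city": "Paris", "spend": 10}),
    ("u2", {"city": "Oslo", "spend": 5}),
    ("u1", {"city": "Oslo", "spend": 12}),
]

store, log = OperationalStore(), AppendOnlyLog()
for key, doc in events:
    store.upsert(key, doc)
    log.append(key, doc)

print(store.find_by_city("Oslo"))
print(log.latest_by_city("Oslo"))
```

Both calls return the same two documents, but the store keeps two documents and answers from its index, while the log keeps all three records and must scan them on every query. That difference is the trade-off the slide describes in terms of latency.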