2. A Modern Take on Alchemy
Turning Data into Actionable Insights
3. What to Expect from this Session
Big Data architectural principles
Reference Lambda Architecture
Live demo
4. Architectural Principles
• Decoupled “data bus”
Data → Store → Process → Answers
• Use the right tool for the job
Latency, throughput, access patterns
• Apply Lambda architecture ideas
Immutable (append-only) log, batch/speed/serving layer
• Leverage AWS managed services
No/low admin
• Be cost conscious
Big data ≠ big cost
5. Simplify Big Data Processing
Ingest /
collect
store process /
analyze
consume /
visualize
Time to Answer (data freshness)
Throughput
9. Your First Big Data Application on AWS
PROCESS
STORE
ANALYZE & VISUALIZE
COLLECT
10. Your First Big Data Application on AWS
PROCESS
STORE
COLLECT:
Amazon Kinesis Firehose
ANALYZE & VISUALIZE
11. Your First Big Data Application on AWS
STORE
COLLECT:
Amazon Kinesis Firehose
ANALYZE & VISUALIZE
PROCESS:
Amazon EMR with Spark & Hive
12. Your First Big Data Application on AWS
PROCESS:
Amazon EMR with Spark & Hive
STORE
ANALYZE & VISUALIZE:
Amazon Redshift and Amazon QuickSight
COLLECT:
Amazon Kinesis Firehose
15. Back to our demo…
PROCESS:
Amazon EMR with Spark & Hive
STORE
ANALYZE & VISUALIZE:
Amazon Redshift and Amazon QuickSight
COLLECT:
Amazon Kinesis Firehose