Real-time Analytics with Presto and Apache Pinot
Xiang Fu
Sept. 24, 2020
Apache Pinot: User-Facing Applications, Business-Facing Metrics, Anomaly Detection
- Ingestion: Millions of events/sec
- Workload: Thousands of queries/sec
- Performance: Millisecond-level latency
Fact Table, Dimension Table
- Pre-Join
- Pre-Aggregation
- Pre-Cube
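The three precomputation strategies named on this slide can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the table contents, the column names (`city_id`, `device`, `clicks`, `country`) and the helper functions are hypothetical, and the code shows the general idea only, not how Pinot implements it.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical fact table: one row per event.
FACT_ROWS = [
    {"city_id": 1, "device": "ios", "clicks": 3},
    {"city_id": 1, "device": "android", "clicks": 5},
    {"city_id": 2, "device": "ios", "clicks": 2},
    {"city_id": 3, "device": "ios", "clicks": 4},
]

# Hypothetical dimension table: city_id -> country.
CITY_DIM = {1: "US", 2: "US", 3: "IN"}


def pre_join(facts, dim):
    """Pre-join: denormalize each fact row with its dimension attribute."""
    return [
        {"country": dim[r["city_id"]], "device": r["device"], "clicks": r["clicks"]}
        for r in facts
    ]


def pre_aggregate(rows, keys):
    """Pre-aggregation: sum clicks for one fixed set of grouping keys."""
    totals = defaultdict(int)
    for r in rows:
        totals[tuple(r[k] for k in keys)] += r["clicks"]
    return dict(totals)


def pre_cube(rows, dims):
    """Pre-cube: pre-aggregate for every subset of the dimensions."""
    cube = {}
    for n in range(len(dims) + 1):
        for keys in combinations(dims, n):
            cube[keys] = pre_aggregate(rows, keys)
    return cube


joined = pre_join(FACT_ROWS, CITY_DIM)
by_country_device = pre_aggregate(joined, ("country", "device"))
cube = pre_cube(joined, ("country", "device"))

print(by_country_device)
print(cube[("country",)])
```

Each step trades flexibility for speed: the pre-joined rows can still answer any query over `country` and `device`, while the cube answers only the groupings that were materialized, but does so with a single lookup.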
[Figure: Latency vs Flexibility. Presto and Pinot compared on latency (low to high) and flexibility (low to high).]
Presto + Pinot: Speed and Flexibility
Thank you
- Getting Started
https://tinyurl.com/prestoPinotTutorial
- Pinot Slack Channel
https://tinyurl.com/pinotSlackChannel
Contributors: Devesh Agrawal, Dharak Kharod, Haibo Wang, James Sun, Venki Korukanti, Xiang Fu, Zhenxiao Luo

Editor's Notes

  • #3 Real-time OLAP database: columnar, indexed storage; low-latency analytics; distributed (highly available, reliable, scalable); Lambda architecture (offline data pushes, real-time stream ingestion); open source.
  • #6 Pinot: fast single-table OLAP. Presto: powerful connector ecosystem. Complete system: covers the entire landscape. Get the best of Presto and Pinot. Proven stack at Uber and many more.