SlideShare a Scribd company logo
Providing Interactive Analytics on Excel
with Billions of Rows
Saswata Sengupta
2020.4
© Kyligence Inc. 2019, Confidential.
Excel-Superhero: The Unsung Hero of Modern Analytics
The Excel-
Superhero
Fulfilling business analytics
Generating complex reports and analysis
Building complex models
AND … keeping everyone happy, moving
business forward
VS.
The Big Data Super
Villain
Large datasets, often billions of rows, sitting in data
lake, cloud storage or in distributed databases.
Hard to do number crunching and analysis, often
crashing the Excel-Superhero.
Slow processing times lead to inefficient decision making.
Complex calculations are often difficult to perform with billions of rows of data.
Petabyte-scale datasets combined with many concurrent users often becomes too challenging for many organizations.
© Kyligence Inc. 2019, Confidential.
Challenges in Excel with Big Data
Title
Limited Scalability
Slow Response Time
Limited
Number of
Dimensions
Limited
Scalability
Slow
Response
Time
Difficult to
Access DataThe Big Data Villain
I will finish Excel and all Excel
users!
© Kyligence Inc. 2019, Confidential.
contentcontentcontentcont
entcontentcontentcontentc
ontentcontentcontent
• Excel has been a tool of choice for analysts across the board and
throughout the industry.
• Analysts have been creating complicated models in Excel to build reports
using PivotTable, PivotChart, Macros etc.
• Excel is easy to use, time tested, easily available, and a reliable
application for analysts for data consuming, crunching, and analytics.
Excel – Analytics for Every Organization
Fun Fact
© Kyligence Inc. 2019, Confidential.
Apache Kylin
TopLevel Apache Project
 The only open-source OLAP on big
data platform
BestOpen-Source Big Data Tool
 InfoWorld’s Bossies (Best of Open Source
Software Awards) in 2015 & 2016
Sub-Second Interactive
Query
 Large scale, high concurrency, multi-
dimensional, sub-second query latency
1,000+ Organizations
 Adopted by thousands of
organizations globally
© Kyligence Inc. 2019, Confidential.
Kyligence = Kylin + Intelligence
• Founded in 2016 by the creators of Apache Kylin
• Built around Kylin with augmented AI, enhanced to deliver
unprecedented enterprise analytic performance
• CRN Top-10 big data startups in 2018
• Global Presence: San Jose, Seattle, New York, Shanghai, Beijing
• VCs: Fidelity International, Shunwei Capital, Broadband Capital,
Redpoint, Cisco, Coatue
Accelerate Critical Business Decisions with AI-Augmented Data Management and
Analytics
2016
Founded Pre-A
Redpoint
Cisco
2017
Series A
CBC
Shunwei
2018
Series B
8Roads
2019
Series C
Coatue
© Kyligence Inc. 2019, Confidential.
Excel-Superhero: The Unsung Hero of Modern Analytics
The Excel-
Superhero
© Kyligence Inc. 2019, Confidential.
Trusted by Enterprises Worldwide
© Kyligence Inc. 2019, Confidential.
Kyligence MDX
OLAP
A Case for OLAP for Analytics
• More data sources
• Lower TCO
• Scalable
• High performance
• Ad hoc queries
• Flexible analysis
• Enterprise security
Business AnalysisBusiness Analysis
Data Lake DW / DM
DW / DM
Kyligence Enterprise
Scale Up
Logs
Caching
QueriesData
MDX
SQL
Complex
Modeling
Semantic
Translation
Measure
Groups
Query
Engine
Multi-Level
Accesses
Multi-
Tenants
Distributed
Arch.
Orders CRM POS
Semantic Layer
Data Service Layer
MOLAP
MDX
© Kyligence Inc. 2019, Confidential.
Automatic Model Creation
AI-augmented engine automatically
designs the most optimal model based on
past user behaviors and query patterns.
This reduces the need for manual
modeling and maintenance.
Adaptive Schema Evolution
As analytical requests change, the model
needs to reflect those changes. Our model
automatically adapts to any schema changes.
The model evolves along with your analytical
needs.
Automatic Query Optimization
The model continuously evolves and
self-optimizes as it obtains new usage
behavior. This guarantees sub-second
performance, no matter the data
volume or concurrency.
Kyligence Solution
© Kyligence Inc. 2019, Confidential.
Traditional OLAP vs. Kyligence
• Rigid schema, dependent on data warehouse
• Single node solution
• End-user analytics is limited by the OLAP cube. If the measures and
dimensions do not already exist, the query cannot be answered.
• Adaptive schema
• Distributed multi-node solution
• OLAP cube provides sub-second responses
• Smart pushdown capabilities, guaranteed query responses
© Kyligence Inc. 2019, Confidential.
AI-Powered Data Management For Most Valuable Data
ANSI SQL
MDX
REST
Semantic Layer
FinanceMarketing
Sales
Index
AI-Augmented Engine
© Kyligence Inc. 2019, Confidential.
Kyligence Architecture
Data Source
Analytics
Data Service
Data Lake Azure
Blob Storage
AWS
S3
Hadoop
Google
Cloud Storage
Azure SynapseSnowflake
Management
Query Engine Semantic Layer SQL Query Engine Smart Modeling
Scaling Maintenance Monitor
Enterprise-Level Security
TCO
Database Events Files Logs IoT
Business Insights Multidimensional Analysis 3rd-Party Applications Machine Learning
Visualization Self-Service Collaboration 3rd-Party BI Tools
© Kyligence Inc. 2019, Confidential.
Sub-Second
Query
Multi-Level Security
Semantic Layer
Seamless Integration
with Excel/BI
 Fast response time
 Supports aggregation, detail, and ad hoc queries
 Project, table, row, and column level access control
 Supports complex business logic
 Supports Excel core anlytics functionalities
 Model synchronization into BI tools
Value Proposition
 All or incremantal build
 Query while build
Cube Building
© Kyligence Inc. 2019, Confidential.
DEMO
© Kyligence Inc. 2019, Confidential.
Title
Originally built to replace Teradata 3 trillion rows of detail
100,000 concurrent users on
hand-held devices
With millisecond responses
Replaced Teradata
IBM Cognos replacement
From 1,200+ cubes down
to 2 cubes
Complete replacement of Greenplum
eBay Global Top
3 Bank
Top Global
Insurance
Company
World’s Largest
Credit Card
Processor
Title
© Kyligence Inc. 2019, Confidential.
Title
Originally built to replace Teradata 3 trillion rows of detail
100,000 concurrent users on
hand-held devices
With milli-second responses
Replaced Teradata
IBM Cognos replacement
From 1,200+ cubes down
to 2 cubes
Complete replacement of Greenplum
eBay Global Top
3 Bank
Top Global
Insurance
Company
World’s Largest
Credit Card
Processor
Excel Trivia
What number of actions can you undo in Excel ?
100
What is the earliest date allowed in Excel calculations ?
Jan 01, 1900
What is the result of =1*(0.5-0.4-0.1) in Excel?
© Kyligence Inc. 2019, Confidential.
Thank You!
#kyligence@kyligence
Connect with us on LinkedIn, Twitter & Facebook
Try Kyligence @ https://kyligence.io/download-free-trial/
www.kyligence.io

More Related Content

What's hot

The API Lie
The API LieThe API Lie
The API Lie
SnapLogic
 
Altis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data PlatformAltis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data Platform
Altis Consulting
 
Big Data Experience Sharing: Building Collaborative Data Analytics Platform -...
Big Data Experience Sharing: Building Collaborative Data Analytics Platform -...Big Data Experience Sharing: Building Collaborative Data Analytics Platform -...
Big Data Experience Sharing: Building Collaborative Data Analytics Platform -...
Amazon Web Services
 
Well Architected Framework - Data
Well Architected Framework - Data Well Architected Framework - Data
Well Architected Framework - Data
Craig Milroy
 
Allianz x api_management_servic_fabric
Allianz x api_management_servic_fabricAllianz x api_management_servic_fabric
Allianz x api_management_servic_fabric
Michele Danieli
 
Simplify Data Analytics Over the Cloud
Simplify Data Analytics Over the CloudSimplify Data Analytics Over the Cloud
Simplify Data Analytics Over the Cloud
Tyler Wishnoff
 
Turn Big Data into Big Value on Informatica and AWS
Turn Big Data into Big Value on Informatica and AWSTurn Big Data into Big Value on Informatica and AWS
Turn Big Data into Big Value on Informatica and AWS
Amazon Web Services
 
About Pragmatic Works
About Pragmatic WorksAbout Pragmatic Works
About Pragmatic Works
MILL5
 
Modern Data Platforms
Modern Data Platforms Modern Data Platforms
Modern Data Platforms
Arne Roßmann
 
Auto AI : AI used to create AI applications
Auto AI : AI used to create AI applicationsAuto AI : AI used to create AI applications
Auto AI : AI used to create AI applications
Karan Sachdeva
 
Cloud architecture patterns and pratices
Cloud architecture patterns and praticesCloud architecture patterns and pratices
Cloud architecture patterns and pratices
Gustavo Alzate Sandoval
 
Modern Data Platform Part 1: Data Ingestion
Modern Data Platform Part 1: Data IngestionModern Data Platform Part 1: Data Ingestion
Modern Data Platform Part 1: Data Ingestion
Nilesh Shah
 
Towards Lensfield
Towards LensfieldTowards Lensfield
Towards Lensfield
Jim Downing
 
Qlik sense- Technical Seminar
Qlik sense- Technical SeminarQlik sense- Technical Seminar
Qlik sense- Technical Seminar
Sanjana Gondane
 
AI-Powered Analytics: What It Is and How It’s Powering the Next Generation of...
AI-Powered Analytics: What It Is and How It’s Powering the Next Generation of...AI-Powered Analytics: What It Is and How It’s Powering the Next Generation of...
AI-Powered Analytics: What It Is and How It’s Powering the Next Generation of...
Tyler Wishnoff
 
Importance of global certifications
Importance of global certificationsImportance of global certifications
Importance of global certifications
Anjani Phuyal
 
Unlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location IntelligenceUnlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location Intelligence
Precisely
 
Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...
Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...
Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...
DataStax
 
CI/DC in MLOps by J.B. Hunt
CI/DC in MLOps by J.B. HuntCI/DC in MLOps by J.B. Hunt
CI/DC in MLOps by J.B. Hunt
Databricks
 
The Beginner's Guide to Data Lakes in AWS
The Beginner's Guide to Data Lakes in AWSThe Beginner's Guide to Data Lakes in AWS
The Beginner's Guide to Data Lakes in AWS
Guillermo A. Fisher
 

What's hot (20)

The API Lie
The API LieThe API Lie
The API Lie
 
Altis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data PlatformAltis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data Platform
 
Big Data Experience Sharing: Building Collaborative Data Analytics Platform -...
Big Data Experience Sharing: Building Collaborative Data Analytics Platform -...Big Data Experience Sharing: Building Collaborative Data Analytics Platform -...
Big Data Experience Sharing: Building Collaborative Data Analytics Platform -...
 
Well Architected Framework - Data
Well Architected Framework - Data Well Architected Framework - Data
Well Architected Framework - Data
 
Allianz x api_management_servic_fabric
Allianz x api_management_servic_fabricAllianz x api_management_servic_fabric
Allianz x api_management_servic_fabric
 
Simplify Data Analytics Over the Cloud
Simplify Data Analytics Over the CloudSimplify Data Analytics Over the Cloud
Simplify Data Analytics Over the Cloud
 
Turn Big Data into Big Value on Informatica and AWS
Turn Big Data into Big Value on Informatica and AWSTurn Big Data into Big Value on Informatica and AWS
Turn Big Data into Big Value on Informatica and AWS
 
About Pragmatic Works
About Pragmatic WorksAbout Pragmatic Works
About Pragmatic Works
 
Modern Data Platforms
Modern Data Platforms Modern Data Platforms
Modern Data Platforms
 
Auto AI : AI used to create AI applications
Auto AI : AI used to create AI applicationsAuto AI : AI used to create AI applications
Auto AI : AI used to create AI applications
 
Cloud architecture patterns and pratices
Cloud architecture patterns and praticesCloud architecture patterns and pratices
Cloud architecture patterns and pratices
 
Modern Data Platform Part 1: Data Ingestion
Modern Data Platform Part 1: Data IngestionModern Data Platform Part 1: Data Ingestion
Modern Data Platform Part 1: Data Ingestion
 
Towards Lensfield
Towards LensfieldTowards Lensfield
Towards Lensfield
 
Qlik sense- Technical Seminar
Qlik sense- Technical SeminarQlik sense- Technical Seminar
Qlik sense- Technical Seminar
 
AI-Powered Analytics: What It Is and How It’s Powering the Next Generation of...
AI-Powered Analytics: What It Is and How It’s Powering the Next Generation of...AI-Powered Analytics: What It Is and How It’s Powering the Next Generation of...
AI-Powered Analytics: What It Is and How It’s Powering the Next Generation of...
 
Importance of global certifications
Importance of global certificationsImportance of global certifications
Importance of global certifications
 
Unlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location IntelligenceUnlock Data-driven Insights in Databricks Using Location Intelligence
Unlock Data-driven Insights in Databricks Using Location Intelligence
 
Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...
Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...
Aeris + Cassandra: An IOT Solution Helping Automakers Make the Connected Car ...
 
CI/DC in MLOps by J.B. Hunt
CI/DC in MLOps by J.B. HuntCI/DC in MLOps by J.B. Hunt
CI/DC in MLOps by J.B. Hunt
 
The Beginner's Guide to Data Lakes in AWS
The Beginner's Guide to Data Lakes in AWSThe Beginner's Guide to Data Lakes in AWS
The Beginner's Guide to Data Lakes in AWS
 

Similar to Providing Interactive Analytics on Excel with Billions of Rows

Addressing the systemic shortcomings of cloud analytics
Addressing the systemic shortcomings of cloud analyticsAddressing the systemic shortcomings of cloud analytics
Addressing the systemic shortcomings of cloud analytics
SamanthaBerlant
 
Enhance Data Governance with Kyligence Unified Semantic Layer
Enhance Data Governance with Kyligence Unified Semantic LayerEnhance Data Governance with Kyligence Unified Semantic Layer
Enhance Data Governance with Kyligence Unified Semantic Layer
SamanthaBerlant
 
How Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
How Analytics Teams Using SSAS Can Embrace Big Data and the CloudHow Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
How Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
Tyler Wishnoff
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
Karan Sachdeva
 
Kyligence Cloud 4 - An Overview
Kyligence Cloud 4 - An OverviewKyligence Cloud 4 - An Overview
Kyligence Cloud 4 - An Overview
SamanthaBerlant
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Dataconomy Media
 
Architecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High PerformanceArchitecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High Performance
SamanthaBerlant
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
CCG
 
IBM & Cloudera: Hybrid Cloud & the Power of Possibilities
IBM & Cloudera: Hybrid Cloud & the Power of PossibilitiesIBM & Cloudera: Hybrid Cloud & the Power of Possibilities
IBM & Cloudera: Hybrid Cloud & the Power of Possibilities
omkar_nimbalkar
 
Jakarta keynote
Jakarta keynoteJakarta keynote
Jakarta keynote
Karan Sachdeva
 
How Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom LineHow Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom Line
Enterprise Management Associates
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
James Serra
 
ESGYN Overview
ESGYN OverviewESGYN Overview
ESGYN Overview
Rajender K Salgam
 
Extreme SSAS- SQL 2011
Extreme SSAS- SQL 2011Extreme SSAS- SQL 2011
Extreme SSAS- SQL 2011
Itay Braun
 
Democratizing AI/ML with GCP - Abishay Rao (Google) at GoDataFest 2019
Democratizing AI/ML with GCP - Abishay Rao (Google) at GoDataFest 2019Democratizing AI/ML with GCP - Abishay Rao (Google) at GoDataFest 2019
Democratizing AI/ML with GCP - Abishay Rao (Google) at GoDataFest 2019
GoDataDriven
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
CCG
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data EstateEnable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
CCG
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
tsigitnist02
 
Is your data paying you dividends?
Is your data paying you dividends? Is your data paying you dividends?
Is your data paying you dividends?
Karan Sachdeva
 
Derfor skal du bruge en DataLake
Derfor skal du bruge en DataLakeDerfor skal du bruge en DataLake
Derfor skal du bruge en DataLake
Microsoft
 

Similar to Providing Interactive Analytics on Excel with Billions of Rows (20)

Addressing the systemic shortcomings of cloud analytics
Addressing the systemic shortcomings of cloud analyticsAddressing the systemic shortcomings of cloud analytics
Addressing the systemic shortcomings of cloud analytics
 
Enhance Data Governance with Kyligence Unified Semantic Layer
Enhance Data Governance with Kyligence Unified Semantic LayerEnhance Data Governance with Kyligence Unified Semantic Layer
Enhance Data Governance with Kyligence Unified Semantic Layer
 
How Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
How Analytics Teams Using SSAS Can Embrace Big Data and the CloudHow Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
How Analytics Teams Using SSAS Can Embrace Big Data and the Cloud
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
 
Kyligence Cloud 4 - An Overview
Kyligence Cloud 4 - An OverviewKyligence Cloud 4 - An Overview
Kyligence Cloud 4 - An Overview
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
Architecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High PerformanceArchitecting Snowflake for High Concurrency and High Performance
Architecting Snowflake for High Concurrency and High Performance
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
 
IBM & Cloudera: Hybrid Cloud & the Power of Possibilities
IBM & Cloudera: Hybrid Cloud & the Power of PossibilitiesIBM & Cloudera: Hybrid Cloud & the Power of Possibilities
IBM & Cloudera: Hybrid Cloud & the Power of Possibilities
 
Jakarta keynote
Jakarta keynoteJakarta keynote
Jakarta keynote
 
How Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom LineHow Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom Line
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
ESGYN Overview
ESGYN OverviewESGYN Overview
ESGYN Overview
 
Extreme SSAS- SQL 2011
Extreme SSAS- SQL 2011Extreme SSAS- SQL 2011
Extreme SSAS- SQL 2011
 
Democratizing AI/ML with GCP - Abishay Rao (Google) at GoDataFest 2019
Democratizing AI/ML with GCP - Abishay Rao (Google) at GoDataFest 2019Democratizing AI/ML with GCP - Abishay Rao (Google) at GoDataFest 2019
Democratizing AI/ML with GCP - Abishay Rao (Google) at GoDataFest 2019
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data EstateEnable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
 
Is your data paying you dividends?
Is your data paying you dividends? Is your data paying you dividends?
Is your data paying you dividends?
 
Derfor skal du bruge en DataLake
Derfor skal du bruge en DataLakeDerfor skal du bruge en DataLake
Derfor skal du bruge en DataLake
 

More from Tyler Wishnoff

Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
Tyler Wishnoff
 
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
Tyler Wishnoff
 
Apache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Apache kylin 101 - Get Sub-Second Analytics on Massive DatasetsApache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Apache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Tyler Wishnoff
 
Analysis of the Pressure Placed on Medical Systems during the COVID-19 Pandemic
Analysis of the Pressure Placed on Medical Systems during the COVID-19 PandemicAnalysis of the Pressure Placed on Medical Systems during the COVID-19 Pandemic
Analysis of the Pressure Placed on Medical Systems during the COVID-19 Pandemic
Tyler Wishnoff
 
Augmented OLAP Analytics for Big Data
Augmented OLAP Analytics for Big DataAugmented OLAP Analytics for Big Data
Augmented OLAP Analytics for Big Data
Tyler Wishnoff
 
Apache Kylin Meetup: Berlin - With OLX Group
Apache Kylin Meetup: Berlin - With OLX GroupApache Kylin Meetup: Berlin - With OLX Group
Apache Kylin Meetup: Berlin - With OLX Group
Tyler Wishnoff
 
Apache Kylin Data Summit 2019: Kyligence Presentation
Apache Kylin Data Summit 2019: Kyligence PresentationApache Kylin Data Summit 2019: Kyligence Presentation
Apache Kylin Data Summit 2019: Kyligence Presentation
Tyler Wishnoff
 
Augmented OLAP for Big Data Analytics
Augmented OLAP for Big Data AnalyticsAugmented OLAP for Big Data Analytics
Augmented OLAP for Big Data Analytics
Tyler Wishnoff
 
Accelerating Big Data Analytics with Apache Kylin
Accelerating Big Data Analytics with Apache KylinAccelerating Big Data Analytics with Apache Kylin
Accelerating Big Data Analytics with Apache Kylin
Tyler Wishnoff
 

More from Tyler Wishnoff (9)

Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
 
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency on Mass...
 
Apache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Apache kylin 101 - Get Sub-Second Analytics on Massive DatasetsApache kylin 101 - Get Sub-Second Analytics on Massive Datasets
Apache kylin 101 - Get Sub-Second Analytics on Massive Datasets
 
Analysis of the Pressure Placed on Medical Systems during the COVID-19 Pandemic
Analysis of the Pressure Placed on Medical Systems during the COVID-19 PandemicAnalysis of the Pressure Placed on Medical Systems during the COVID-19 Pandemic
Analysis of the Pressure Placed on Medical Systems during the COVID-19 Pandemic
 
Augmented OLAP Analytics for Big Data
Augmented OLAP Analytics for Big DataAugmented OLAP Analytics for Big Data
Augmented OLAP Analytics for Big Data
 
Apache Kylin Meetup: Berlin - With OLX Group
Apache Kylin Meetup: Berlin - With OLX GroupApache Kylin Meetup: Berlin - With OLX Group
Apache Kylin Meetup: Berlin - With OLX Group
 
Apache Kylin Data Summit 2019: Kyligence Presentation
Apache Kylin Data Summit 2019: Kyligence PresentationApache Kylin Data Summit 2019: Kyligence Presentation
Apache Kylin Data Summit 2019: Kyligence Presentation
 
Augmented OLAP for Big Data Analytics
Augmented OLAP for Big Data AnalyticsAugmented OLAP for Big Data Analytics
Augmented OLAP for Big Data Analytics
 
Accelerating Big Data Analytics with Apache Kylin
Accelerating Big Data Analytics with Apache KylinAccelerating Big Data Analytics with Apache Kylin
Accelerating Big Data Analytics with Apache Kylin
 

Recently uploaded

一比一原版(CU毕业证)卡尔顿大学毕业证如何办理
一比一原版(CU毕业证)卡尔顿大学毕业证如何办理一比一原版(CU毕业证)卡尔顿大学毕业证如何办理
一比一原版(CU毕业证)卡尔顿大学毕业证如何办理
bmucuha
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docxDATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
SaffaIbrahim1
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Kaxil Naik
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
taqyea
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
a9qfiubqu
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
AlessioFois2
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
wyddcwye1
 

Recently uploaded (20)

一比一原版(CU毕业证)卡尔顿大学毕业证如何办理
一比一原版(CU毕业证)卡尔顿大学毕业证如何办理一比一原版(CU毕业证)卡尔顿大学毕业证如何办理
一比一原版(CU毕业证)卡尔顿大学毕业证如何办理
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docxDATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
DATA COMMS-NETWORKS YR2 lecture 08 NAT & CLOUD.docx
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
原版一比一弗林德斯大学毕业证(Flinders毕业证书)如何办理
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
 

Providing Interactive Analytics on Excel with Billions of Rows

  • 1. Providing Interactive Analytics on Excel with Billions of Rows Saswata Sengupta 2020.4
  • 2. © Kyligence Inc. 2019, Confidential. Excel-Superhero: The Unsung Hero of Modern Analytics The Excel- Superhero Fulfilling business analytics Generating complex reports and analysis Building complex models AND … keeping everyone happy, moving business forward VS. The Big Data Super Villain Large datasets, often billions of rows, sitting in data lake, cloud storage or in distributed databases. Hard to do number crunching and analysis, often crashing the Excel-Superhero. Slow processing times lead to inefficient decision making. Complex calculations are often difficult to perform with billions of rows of data. Petabyte-scale datasets combined with many concurrent users often becomes too challenging for many organizations.
  • 3. © Kyligence Inc. 2019, Confidential. Challenges in Excel with Big Data Title Limited Scalability Slow Response Time Limited Number of Dimensions Limited Scalability Slow Response Time Difficult to Access DataThe Big Data Villain I will finish Excel and all Excel users!
  • 4. © Kyligence Inc. 2019, Confidential. contentcontentcontentcont entcontentcontentcontentc ontentcontentcontent • Excel has been a tool of choice for analysts across the board and throughout the industry. • Analysts have been creating complicated models in Excel to build reports using PivotTable, PivotChart, Macros etc. • Excel is easy to use, time tested, easily available, and a reliable application for analysts for data consuming, crunching, and analytics. Excel – Analytics for Every Organization Fun Fact
  • 5. © Kyligence Inc. 2019, Confidential. Apache Kylin TopLevel Apache Project  The only open-source OLAP on big data platform BestOpen-Source Big Data Tool  InfoWorld’s Bossies (Best of Open Source Software Awards) in 2015 & 2016 Sub-Second Interactive Query  Large scale, high concurrency, multi- dimensional, sub-second query latency 1,000+ Organizations  Adopted by thousands of organizations globally
  • 6. © Kyligence Inc. 2019, Confidential. Kyligence = Kylin + Intelligence • Founded in 2016 by the creators of Apache Kylin • Built around Kylin with augmented AI, enhanced to deliver unprecedented enterprise analytic performance • CRN Top-10 big data startups in 2018 • Global Presence: San Jose, Seattle, New York, Shanghai, Beijing • VCs: Fidelity International, Shunwei Capital, Broadband Capital, Redpoint, Cisco, Coatue Accelerate Critical Business Decisions with AI-Augmented Data Management and Analytics 2016 Founded Pre-A Redpoint Cisco 2017 Series A CBC Shunwei 2018 Series B 8Roads 2019 Series C Coatue
  • 7. © Kyligence Inc. 2019, Confidential. Excel-Superhero: The Unsung Hero of Modern Analytics The Excel- Superhero
  • 8. © Kyligence Inc. 2019, Confidential. Trusted by Enterprises Worldwide
  • 9. © Kyligence Inc. 2019, Confidential. Kyligence MDX OLAP A Case for OLAP for Analytics • More data sources • Lower TCO • Scalable • High performance • Ad hoc queries • Flexible analysis • Enterprise security Business AnalysisBusiness Analysis Data Lake DW / DM DW / DM Kyligence Enterprise Scale Up Logs Caching QueriesData MDX SQL Complex Modeling Semantic Translation Measure Groups Query Engine Multi-Level Accesses Multi- Tenants Distributed Arch. Orders CRM POS Semantic Layer Data Service Layer MOLAP MDX
  • 10. © Kyligence Inc. 2019, Confidential. Automatic Model Creation AI-augmented engine automatically designs the most optimal model based on past user behaviors and query patterns. This reduces the need for manual modeling and maintenance. Adaptive Schema Evolution As analytical requests change, the model needs to reflect those changes. Our model automatically adapts to any schema changes. The model evolves along with your analytical needs. Automatic Query Optimization The model continuously evolves and self-optimizes as it obtains new usage behavior. This guarantees sub-second performance, no matter the data volume or concurrency. Kyligence Solution
  • 11. © Kyligence Inc. 2019, Confidential. Traditional OLAP vs. Kyligence • Rigid schema, dependent on data warehouse • Single node solution • End-user analytics is limited by the OLAP cube. If the measures and dimensions do not already exist, the query cannot be answered. • Adaptive schema • Distributed multi-node solution • OLAP cube provides sub-second responses • Smart pushdown capabilities, guaranteed query responses
  • 12. © Kyligence Inc. 2019, Confidential. AI-Powered Data Management For Most Valuable Data ANSI SQL MDX REST Semantic Layer FinanceMarketing Sales Index AI-Augmented Engine
  • 13. © Kyligence Inc. 2019, Confidential. Kyligence Architecture Data Source Analytics Data Service Data Lake Azure Blob Storage AWS S3 Hadoop Google Cloud Storage Azure SynapseSnowflake Management Query Engine Semantic Layer SQL Query Engine Smart Modeling Scaling Maintenance Monitor Enterprise-Level Security TCO Database Events Files Logs IoT Business Insights Multidimensional Analysis 3rd-Party Applications Machine Learning Visualization Self-Service Collaboration 3rd-Party BI Tools
  • 14. © Kyligence Inc. 2019, Confidential. Sub-Second Query Multi-Level Security Semantic Layer Seamless Integration with Excel/BI  Fast response time  Supports aggregation, detail, and ad hoc queries  Project, table, row, and column level access control  Supports complex business logic  Supports Excel core anlytics functionalities  Model synchronization into BI tools Value Proposition  All or incremantal build  Query while build Cube Building
  • 15. © Kyligence Inc. 2019, Confidential. DEMO
  • 16. © Kyligence Inc. 2019, Confidential. Title Originally built to replace Teradata 3 trillion rows of detail 100,000 concurrent users on hand-held devices With millisecond responses Replaced Teradata IBM Cognos replacement From 1,200+ cubes down to 2 cubes Complete replacement of Greenplum eBay Global Top 3 Bank Top Global Insurance Company World’s Largest Credit Card Processor Title
  • 17. © Kyligence Inc. 2019, Confidential. Title Originally built to replace Teradata 3 trillion rows of detail 100,000 concurrent users on hand-held devices With milli-second responses Replaced Teradata IBM Cognos replacement From 1,200+ cubes down to 2 cubes Complete replacement of Greenplum eBay Global Top 3 Bank Top Global Insurance Company World’s Largest Credit Card Processor Excel Trivia What number of actions can you undo in Excel ? 100 What is the earliest date allowed in Excel calculations ? Jan 01, 1900 What is the result of =1*(0.5-0.4-0.1) in Excel?
  • 18. © Kyligence Inc. 2019, Confidential. Thank You! #kyligence@kyligence Connect with us on LinkedIn, Twitter & Facebook Try Kyligence @ https://kyligence.io/download-free-trial/ www.kyligence.io