SlideShare a Scribd company logo
BitYota – data warehouse service !
An overview!

Dev Patel, CEO!

© 2014
BitYota: Who we are!
Problem	
  

Today’s	
  big	
  data	
  analy/cs	
  is	
  either	
  a	
  ‘Big	
  Cost’	
  or	
  a	
  ‘Big	
  Headache’	
  or	
  both	
  for	
  
companies	
  of	
  all	
  sizes.	
  Users	
  have	
  to	
  learn	
  new	
  skills	
  and	
  CEOs	
  need	
  to	
  buy	
  
uniquely	
  engineered,	
  prohibi/vely	
  expensive	
  systems.	
  	
  

Solu+on	
  

BitYota	
  offers	
  a	
  be-er	
  alterna0ve:	
  A	
  Data	
  Warehouse	
  Service	
  for	
  Big	
  Data	
  
analy0cs.	
  This	
  PaaS	
  offering	
  takes	
  away	
  both	
  big	
  cost	
  and	
  big	
  headache,	
  
making	
  analy0cs	
  accessible	
  to	
  everyone	
  at	
  scale,	
  with	
  no	
  compromise	
  on	
  
func0onality	
  or	
  service	
  levels.	
  	
  

Customers	
   Mobile	
  apps,	
  E/commerce,	
  Adver/sing/Marke/ng,	
  Games	
  
Founded	
  Sep	
  2011	
  by	
  data	
  experts:	
  Dev	
  Patel,	
  Harmeek	
  Bedi	
  and	
  Soren	
  Riise.	
  	
  

Background	
  

Opportunity	
  
Team	
  

Company	
  has	
  raised	
  $12M	
  through	
  Seed	
  &	
  Series	
  A	
  from	
  Globespan	
  Capital,	
  
Social+Capital	
  Partnership,	
  Dawn	
  Capital,	
  Andreessen	
  Horowitz,	
  Crosslink	
  
Capital,	
  Morado	
  Ventures,	
  &	
  individual	
  investors	
  Maynard	
  Webb,	
  Graham	
  
Summers,	
  Jerry	
  Yang	
  and	
  Sharmila	
  Mulligan.	
  
Companies	
  are	
  increasingly	
  looking	
  to	
  gain	
  insights	
  from	
  their	
  data	
  via	
  
analy/cs.	
  Analy/cs	
  for	
  big	
  data	
  in	
  the	
  cloud	
  is	
  BitYota’s	
  opportunity.	
  
Management:	
  Dev	
  Patel,	
  CEO;	
  Harmeek	
  Singh	
  Bedi,	
  CTO;	
  Soren	
  Riise,	
  Chief	
  
Cloud	
  Service;	
  Poulomi	
  Damany,	
  VP	
  Product.	
  
Its	
  core	
  team	
  has	
  35+	
  years	
  of	
  big	
  data	
  experience	
  at	
  Yahoo!,	
  Oracle,	
  Veritas/
Symantec,	
  Informix,	
  BMC,	
  Kabira/Tibco,	
  and	
  Microso_.	
  	
  
Does This Sound Like You?!!

OR

You’re a company
that just launched
.. And you need some
critical insights for what’s
next

OR

Your data infrastructure can’t scale
.. And you can’t spare
any more engineers or
money to maintain it

You’re a
company that
just launched
…
You have lots of data
in multiple silos
.. And it takes too long for
your analysts to get answers

Today’s Big Data = Big Cost/Big Headache!
3
What questions do you want answered?!
How can I combine social profiles, in-app
purchases, and event stream data?

Who are my
best users?

How do I
increase
engagement?

Why is the
new app
version
crashing?
Access
patterns by
OS/ device?

What’s my ROI on my marketing spend?
Where should I be spending more/less $$$?

4
BitYota: Data warehouse for next gen data!
Variety, semistructured data

Velocity, analytics on
fresh data

Velocity, “fast”
analytics

No translation to
structure & No
data modeling

Continuous extract of
changing data from
MongoDB

MPP architecture
– scale with
Compute

Cloud, easy set up
with Burst capacity

Agility & Time-toMarket

No CAPEX & low
OPEX

Elastic scale up/
down

Integration into
SQL/BI ecosystem

Managed Service &
Pay per Use

5
BitYota is focused on use cases where …!
Customers want:!
1.  Analytics over data from multiple sources!
2.  Migrate analytics from on-premise to Cloud !
3.  Analytics on data from single source NoSQL or
relational transactional systems !
4.  Analytics on “fresh data” !

6
Markets for BitYota!
Companies in!
•  Advertising/Marketing!
•  Social Media!
•  SaaS!
•  Games & Entertainment!
•  E-commerce!
•  Communication & Productivity !

7
BitYota focused in new Big Data Analytics!
•  Data from Multiple sources
in Multiple formats !
User profiles
Social data

Server Logs

!

!

Volume	
  
Variety	
  
Velocity	
  

!

Semi-structured data types (JSON,
XML), Data types for new applications –
timestamp, IP, location, etc!
Table Layout – row and column, on disk,
memory, external tables !

•  Fast time to analytics !
!

Load and explore directly, not dependent
on slow & fragile ETL!

•  Interactive analytics !
!

Inventory

Deploy in a heterogeneous environment;
scale out; scale storage & compute
independently!

•  Flexible Storage !

Sales Orders/
Returns
Website Views
& Clicks

•  Cost effective, elastic capacity!

Use ANSI SQL directly on new data
types. Leverage existing BI tools!

8
Business Analytics on data from MongoDB!
Mobile/Web
Apps
Primary
shard

SQL over JSON, and
access from BI tools

BitYota	
  Cluster	
  

Secondary
shards

Compute	
  nodes	
  

Mongo
dump
Oplog
Tail

Load
BitYota	
  
Extract	
  
Tool	
  

BSON,	
  
JSON	
  

Extract	
  

Load	
  

Data	
  nodes	
  

Schedule incremental extract and load
MongoDB extract format (BSON)

Transform	
  &	
  Analyze	
  
Joins across collections
SQL over JSON, UDFs
Transforms into Cols for performance
Views for BI tool

9
Process to Load Data into BitYota!
SOURCE	
  JSON	
  DATA	
  

LOAD	
  DATA	
  

"session":[{	
  
"u":"8927ABBCD2873CCD",	
  
"v":"1.0",	
  
"uid","TheTestUser1",	
  	
  
"dv":"Apple	
  iPhone	
  3GS",	
  	
  
"t":200	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  
}	
  

CREATE	
  TABLE	
  session(	
  
	
  jdoc	
  JSON	
  
)	
  	
  	
  

1-­‐+me	
  setup	
  	
  

•  Scheduled	
  Load	
  
•  Schema	
  auto-­‐discovered	
  
•  Table	
  auto-­‐created	
  

ANALYZE	
  DATA	
  

OPTIMIZE	
  DESIGN	
  

SELECT	
  jdoc-­‐>'u’,	
  jdoc-­‐>’t’	
  
FROM	
  session;	
  

CREATE	
  TABLE	
  session_cols	
  (	
  	
  
	
  	
  	
  u	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
   TEXT,	
  	
  
	
  	
  	
  t	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
   INT,	
  
	
  	
  	
  origjdoc	
  	
  	
  	
  	
  	
  	
  	
  	
  JSON	
  )	
  
PARTITION	
  BY	
  RANGE	
  (t)	
  
	
  (PARTITION	
  VALUES	
  ('0'),	
  
PARTITION	
  VALUES	
  ('50'),	
  
PARTITION	
  VALUES	
  ('200')	
  )	
  
COLUMNSTORE	
  STORAGE	
  
(SEGMENTSIZE	
  13102	
  
TABLESIZE	
  200000);	
  
INSERT	
  INTO	
  session_cols	
  
	
  	
  	
  SELECT	
  jdoc-­‐>'u',	
  
	
  ( jdoc-­‐>'t')::int8,	
  jdoc	
  	
  
	
  	
  	
  FROM	
  session;;	
  

Change	
  MongoDB	
  JSON	
  doc	
  structure	
  any/me	
  =	
  NO	
  extra	
  downstream	
  effort	
  
needed	
  

10
As a Service!

•  Launch cluster in minutes !
•  Removes the ‘headache’ of database management!
•  No hardware, no software installation & upgrades; no licenses !
•  Available on AWS & Rackspace!

11
Recap!
•  BitYota is a Cloud based Data Warehouse Service for
Big Data Analytics.!
•  Its core attributes are:!
•  100% Service oriented!
•  Analytics on data from multiple sources/formats!
•  Analytics on “fresh” data !

•  Customers are gaining deep insights on their business
operations!
•  Customers in Games, Mobile apps, advertising/
marketing, e/commerce!
12

More Related Content

What's hot

Make AI & BI work at Scale
Make AI & BI work at ScaleMake AI & BI work at Scale
Make AI & BI work at Scale
Steve Nouri
 
Data Lake: A simple introduction
Data Lake: A simple introductionData Lake: A simple introduction
Data Lake: A simple introduction
IBM Analytics
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Cambridge Semantics
 
Big Data & Analytics Architecture
Big Data & Analytics ArchitectureBig Data & Analytics Architecture
Big Data & Analytics Architecture
Arvind Sathi
 
IBM Governed Data Lake
IBM Governed Data LakeIBM Governed Data Lake
IBM Governed Data Lake
Karan Sachdeva
 
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Vasu S
 
How to Build a Smart Data Lake Using Semantics
How to Build a Smart Data Lake Using SemanticsHow to Build a Smart Data Lake Using Semantics
How to Build a Smart Data Lake Using Semantics
Cambridge Semantics
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
Databricks
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UK
Ulf Mattsson
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Denodo
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017
Neo4j
 
Business case for Big Data Analytics
Business case for Big Data AnalyticsBusiness case for Big Data Analytics
Business case for Big Data Analytics
Vijay Rao
 
Building enterprise advance analytics platform
Building enterprise advance analytics platformBuilding enterprise advance analytics platform
Building enterprise advance analytics platform
Haoran Du
 
ESGYN Overview
ESGYN OverviewESGYN Overview
ESGYN Overview
Rajender K Salgam
 
[XConf Brasil 2020] Data mesh
[XConf Brasil 2020] Data mesh[XConf Brasil 2020] Data mesh
[XConf Brasil 2020] Data mesh
ThoughtWorks Brasil
 
Requirements document for big data use cases
Requirements document for big data use casesRequirements document for big data use cases
Requirements document for big data use cases
Allied Consultants
 
BigData in Banking
BigData in BankingBigData in Banking
BigData in Banking
Andzhey Arshavskiy
 
Modern Manufacturing: 4 Ways Data is Transforming the Industry
Modern Manufacturing: 4 Ways Data is Transforming the IndustryModern Manufacturing: 4 Ways Data is Transforming the Industry
Modern Manufacturing: 4 Ways Data is Transforming the Industry
Tableau Software
 
National Conference - Big Data - 31 Jan 2015
National Conference - Big Data - 31 Jan 2015National Conference - Big Data - 31 Jan 2015
National Conference - Big Data - 31 Jan 2015
SANJIV VERMA - (Big Data & Data Scientist)
 
Introduction to Anzo Unstructured
Introduction to Anzo UnstructuredIntroduction to Anzo Unstructured
Introduction to Anzo Unstructured
Cambridge Semantics
 

What's hot (20)

Make AI & BI work at Scale
Make AI & BI work at ScaleMake AI & BI work at Scale
Make AI & BI work at Scale
 
Data Lake: A simple introduction
Data Lake: A simple introductionData Lake: A simple introduction
Data Lake: A simple introduction
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
 
Big Data & Analytics Architecture
Big Data & Analytics ArchitectureBig Data & Analytics Architecture
Big Data & Analytics Architecture
 
IBM Governed Data Lake
IBM Governed Data LakeIBM Governed Data Lake
IBM Governed Data Lake
 
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
 
How to Build a Smart Data Lake Using Semantics
How to Build a Smart Data Lake Using SemanticsHow to Build a Smart Data Lake Using Semantics
How to Build a Smart Data Lake Using Semantics
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UK
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017
 
Business case for Big Data Analytics
Business case for Big Data AnalyticsBusiness case for Big Data Analytics
Business case for Big Data Analytics
 
Building enterprise advance analytics platform
Building enterprise advance analytics platformBuilding enterprise advance analytics platform
Building enterprise advance analytics platform
 
ESGYN Overview
ESGYN OverviewESGYN Overview
ESGYN Overview
 
[XConf Brasil 2020] Data mesh
[XConf Brasil 2020] Data mesh[XConf Brasil 2020] Data mesh
[XConf Brasil 2020] Data mesh
 
Requirements document for big data use cases
Requirements document for big data use casesRequirements document for big data use cases
Requirements document for big data use cases
 
BigData in Banking
BigData in BankingBigData in Banking
BigData in Banking
 
Modern Manufacturing: 4 Ways Data is Transforming the Industry
Modern Manufacturing: 4 Ways Data is Transforming the IndustryModern Manufacturing: 4 Ways Data is Transforming the Industry
Modern Manufacturing: 4 Ways Data is Transforming the Industry
 
National Conference - Big Data - 31 Jan 2015
National Conference - Big Data - 31 Jan 2015National Conference - Big Data - 31 Jan 2015
National Conference - Big Data - 31 Jan 2015
 
Introduction to Anzo Unstructured
Introduction to Anzo UnstructuredIntroduction to Anzo Unstructured
Introduction to Anzo Unstructured
 

Viewers also liked

Denodo Datafest 2016: Modernizing Data Warehouse Using Real-time Data Virtual...
Denodo Datafest 2016: Modernizing Data Warehouse Using Real-time Data Virtual...Denodo Datafest 2016: Modernizing Data Warehouse Using Real-time Data Virtual...
Denodo Datafest 2016: Modernizing Data Warehouse Using Real-time Data Virtual...
Denodo
 
Suxess business intelligence worldbi catalog
Suxess business intelligence worldbi catalogSuxess business intelligence worldbi catalog
Suxess business intelligence worldbi catalog
SuXess Iş Platformu
 
From Personal BI to Managed BI with Power BI
From Personal BI to Managed BI with Power BIFrom Personal BI to Managed BI with Power BI
From Personal BI to Managed BI with Power BI
Jean-Pierre Riehl
 
Using Exam Analytics to Evaluate Student Use of Lecture Capture Recordings
Using Exam Analytics to Evaluate Student Use of Lecture Capture RecordingsUsing Exam Analytics to Evaluate Student Use of Lecture Capture Recordings
Using Exam Analytics to Evaluate Student Use of Lecture Capture Recordings
ExamSoft
 
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland Bouman
 
Pivotal Data Warehouse in the Age of Digital Transformation
Pivotal Data Warehouse in the Age of Digital TransformationPivotal Data Warehouse in the Age of Digital Transformation
Pivotal Data Warehouse in the Age of Digital Transformation
VMware Tanzu
 
IT and Business Service Catalogs
IT and Business Service CatalogsIT and Business Service Catalogs
IT and Business Service Catalogs
ITSM Academy, Inc.
 
The Modern Data Warehouse - A Hybrid Story
The Modern Data Warehouse - A Hybrid StoryThe Modern Data Warehouse - A Hybrid Story
The Modern Data Warehouse - A Hybrid Story
Perficient, Inc.
 
Defining Services for a Service Catalog
Defining Services for a Service CatalogDefining Services for a Service Catalog
Defining Services for a Service Catalog
Axios Systems
 

Viewers also liked (9)

Denodo Datafest 2016: Modernizing Data Warehouse Using Real-time Data Virtual...
Denodo Datafest 2016: Modernizing Data Warehouse Using Real-time Data Virtual...Denodo Datafest 2016: Modernizing Data Warehouse Using Real-time Data Virtual...
Denodo Datafest 2016: Modernizing Data Warehouse Using Real-time Data Virtual...
 
Suxess business intelligence worldbi catalog
Suxess business intelligence worldbi catalogSuxess business intelligence worldbi catalog
Suxess business intelligence worldbi catalog
 
From Personal BI to Managed BI with Power BI
From Personal BI to Managed BI with Power BIFrom Personal BI to Managed BI with Power BI
From Personal BI to Managed BI with Power BI
 
Using Exam Analytics to Evaluate Student Use of Lecture Capture Recordings
Using Exam Analytics to Evaluate Student Use of Lecture Capture RecordingsUsing Exam Analytics to Evaluate Student Use of Lecture Capture Recordings
Using Exam Analytics to Evaluate Student Use of Lecture Capture Recordings
 
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
Roland bouman modern_data_warehouse_architectures_data_vault_and_anchor_model...
 
Pivotal Data Warehouse in the Age of Digital Transformation
Pivotal Data Warehouse in the Age of Digital TransformationPivotal Data Warehouse in the Age of Digital Transformation
Pivotal Data Warehouse in the Age of Digital Transformation
 
IT and Business Service Catalogs
IT and Business Service CatalogsIT and Business Service Catalogs
IT and Business Service Catalogs
 
The Modern Data Warehouse - A Hybrid Story
The Modern Data Warehouse - A Hybrid StoryThe Modern Data Warehouse - A Hybrid Story
The Modern Data Warehouse - A Hybrid Story
 
Defining Services for a Service Catalog
Defining Services for a Service CatalogDefining Services for a Service Catalog
Defining Services for a Service Catalog
 

Similar to BitYota Data Warehouse Podcast

Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
Caserta
 
Take Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessTake Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven Business
Inside Analysis
 
Big Data Analytics with Microsoft
Big Data Analytics with MicrosoftBig Data Analytics with Microsoft
Big Data Analytics with Microsoft
Caserta
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Data Provisioning & Optimization
Data Provisioning & OptimizationData Provisioning & Optimization
Data Provisioning & Optimization
Ambareesh Kulkarni
 
Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?
Swiss Big Data User Group
 
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo Data Virtualization - IT Days in Luxembourg with OktopusDenodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
Denodo
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big Data
James Serra
 
The Double win business transformation and in-year ROI and TCO reduction
The Double win business transformation and in-year ROI and TCO reductionThe Double win business transformation and in-year ROI and TCO reduction
The Double win business transformation and in-year ROI and TCO reduction
MongoDB
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
SingleStore
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
cognos BI10.pptx
cognos BI10.pptxcognos BI10.pptx
cognos BI10.pptx
vishal choudhary
 
cognos BI10.pptx
cognos BI10.pptxcognos BI10.pptx
cognos BI10.pptx
vishal choudhary
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
Denodo
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
James Serra
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
Caserta
 
Crowdstar case-study
Crowdstar case-studyCrowdstar case-study
Crowdstar case-study
Satya Harish
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Dataconomy Media
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of Analytics
BigDataExpo
 

Similar to BitYota Data Warehouse Podcast (20)

Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
Take Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven BusinessTake Action: The New Reality of Data-Driven Business
Take Action: The New Reality of Data-Driven Business
 
Big Data Analytics with Microsoft
Big Data Analytics with MicrosoftBig Data Analytics with Microsoft
Big Data Analytics with Microsoft
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
Data Provisioning & Optimization
Data Provisioning & OptimizationData Provisioning & Optimization
Data Provisioning & Optimization
 
Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?
 
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo Data Virtualization - IT Days in Luxembourg with OktopusDenodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
 
Finding business value in Big Data
Finding business value in Big DataFinding business value in Big Data
Finding business value in Big Data
 
The Double win business transformation and in-year ROI and TCO reduction
The Double win business transformation and in-year ROI and TCO reductionThe Double win business transformation and in-year ROI and TCO reduction
The Double win business transformation and in-year ROI and TCO reduction
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
cognos BI10.pptx
cognos BI10.pptxcognos BI10.pptx
cognos BI10.pptx
 
cognos BI10.pptx
cognos BI10.pptxcognos BI10.pptx
cognos BI10.pptx
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
 
Crowdstar case-study
Crowdstar case-studyCrowdstar case-study
Crowdstar case-study
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of Analytics
 

More from inside-BigData.com

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
BibashShahi
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Pitangent Analytics & Technology Solutions Pvt. Ltd
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Neo4j
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
Edge AI and Vision Alliance
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
saastr
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 

Recently uploaded (20)

“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
 
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
Crafting Excellence: A Comprehensive Guide to iOS Mobile App Development Serv...
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
“How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-eff...
 
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
9 CEO's who hit $100m ARR Share Their Top Growth Tactics Nathan Latka, Founde...
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 

BitYota Data Warehouse Podcast

  • 1. BitYota – data warehouse service ! An overview! Dev Patel, CEO! © 2014
  • 2. BitYota: Who we are! Problem   Today’s  big  data  analy/cs  is  either  a  ‘Big  Cost’  or  a  ‘Big  Headache’  or  both  for   companies  of  all  sizes.  Users  have  to  learn  new  skills  and  CEOs  need  to  buy   uniquely  engineered,  prohibi/vely  expensive  systems.     Solu+on   BitYota  offers  a  be-er  alterna0ve:  A  Data  Warehouse  Service  for  Big  Data   analy0cs.  This  PaaS  offering  takes  away  both  big  cost  and  big  headache,   making  analy0cs  accessible  to  everyone  at  scale,  with  no  compromise  on   func0onality  or  service  levels.     Customers   Mobile  apps,  E/commerce,  Adver/sing/Marke/ng,  Games   Founded  Sep  2011  by  data  experts:  Dev  Patel,  Harmeek  Bedi  and  Soren  Riise.     Background   Opportunity   Team   Company  has  raised  $12M  through  Seed  &  Series  A  from  Globespan  Capital,   Social+Capital  Partnership,  Dawn  Capital,  Andreessen  Horowitz,  Crosslink   Capital,  Morado  Ventures,  &  individual  investors  Maynard  Webb,  Graham   Summers,  Jerry  Yang  and  Sharmila  Mulligan.   Companies  are  increasingly  looking  to  gain  insights  from  their  data  via   analy/cs.  Analy/cs  for  big  data  in  the  cloud  is  BitYota’s  opportunity.   Management:  Dev  Patel,  CEO;  Harmeek  Singh  Bedi,  CTO;  Soren  Riise,  Chief   Cloud  Service;  Poulomi  Damany,  VP  Product.   Its  core  team  has  35+  years  of  big  data  experience  at  Yahoo!,  Oracle,  Veritas/ Symantec,  Informix,  BMC,  Kabira/Tibco,  and  Microso_.    
  • 3. Does This Sound Like You?!! OR You’re a company that just launched .. And you need some critical insights for what’s next OR Your data infrastructure can’t scale .. And you can’t spare any more engineers or money to maintain it You’re a company that just launched … You have lots of data in multiple silos .. And it takes too long for your analysts to get answers Today’s Big Data = Big Cost/Big Headache! 3
  • 4. What questions do you want answered?! How can I combine social profiles, in-app purchases, and event stream data? Who are my best users? How do I increase engagement? Why is the new app version crashing? Access patterns by OS/ device? What’s my ROI on my marketing spend? Where should I be spending more/less $$$? 4
  • 5. BitYota: Data warehouse for next gen data! Variety, semistructured data Velocity, analytics on fresh data Velocity, “fast” analytics No translation to structure & No data modeling Continuous extract of changing data from MongoDB MPP architecture – scale with Compute Cloud, easy set up with Burst capacity Agility & Time-toMarket No CAPEX & low OPEX Elastic scale up/ down Integration into SQL/BI ecosystem Managed Service & Pay per Use 5
  • 6. BitYota is focused on use cases where …! Customers want:! 1.  Analytics over data from multiple sources! 2.  Migrate analytics from on-premise to Cloud ! 3.  Analytics on data from single source NoSQL or relational transactional systems ! 4.  Analytics on “fresh data” ! 6
  • 7. Markets for BitYota! Companies in! •  Advertising/Marketing! •  Social Media! •  SaaS! •  Games & Entertainment! •  E-commerce! •  Communication & Productivity ! 7
  • 8. BitYota focused in new Big Data Analytics! •  Data from Multiple sources in Multiple formats ! User profiles Social data Server Logs ! ! Volume   Variety   Velocity   ! Semi-structured data types (JSON, XML), Data types for new applications – timestamp, IP, location, etc! Table Layout – row and column, on disk, memory, external tables ! •  Fast time to analytics ! ! Load and explore directly, not dependent on slow & fragile ETL! •  Interactive analytics ! ! Inventory Deploy in a heterogeneous environment; scale out; scale storage & compute independently! •  Flexible Storage ! Sales Orders/ Returns Website Views & Clicks •  Cost effective, elastic capacity! Use ANSI SQL directly on new data types. Leverage existing BI tools! 8
  • 9. Business Analytics on data from MongoDB! Mobile/Web Apps Primary shard SQL over JSON, and access from BI tools BitYota  Cluster   Secondary shards Compute  nodes   Mongo dump Oplog Tail Load BitYota   Extract   Tool   BSON,   JSON   Extract   Load   Data  nodes   Schedule incremental extract and load MongoDB extract format (BSON) Transform  &  Analyze   Joins across collections SQL over JSON, UDFs Transforms into Cols for performance Views for BI tool 9
  • 10. Process to Load Data into BitYota! SOURCE  JSON  DATA   LOAD  DATA   "session":[{   "u":"8927ABBCD2873CCD",   "v":"1.0",   "uid","TheTestUser1",     "dv":"Apple  iPhone  3GS",     "t":200                           }   CREATE  TABLE  session(    jdoc  JSON   )       1-­‐+me  setup     •  Scheduled  Load   •  Schema  auto-­‐discovered   •  Table  auto-­‐created   ANALYZE  DATA   OPTIMIZE  DESIGN   SELECT  jdoc-­‐>'u’,  jdoc-­‐>’t’   FROM  session;   CREATE  TABLE  session_cols  (          u                               TEXT,          t                                 INT,        origjdoc                  JSON  )   PARTITION  BY  RANGE  (t)    (PARTITION  VALUES  ('0'),   PARTITION  VALUES  ('50'),   PARTITION  VALUES  ('200')  )   COLUMNSTORE  STORAGE   (SEGMENTSIZE  13102   TABLESIZE  200000);   INSERT  INTO  session_cols        SELECT  jdoc-­‐>'u',    ( jdoc-­‐>'t')::int8,  jdoc          FROM  session;;   Change  MongoDB  JSON  doc  structure  any/me  =  NO  extra  downstream  effort   needed   10
  • 11. As a Service! •  Launch cluster in minutes ! •  Removes the ‘headache’ of database management! •  No hardware, no software installation & upgrades; no licenses ! •  Available on AWS & Rackspace! 11
  • 12. Recap! •  BitYota is a Cloud based Data Warehouse Service for Big Data Analytics.! •  Its core attributes are:! •  100% Service oriented! •  Analytics on data from multiple sources/formats! •  Analytics on “fresh” data ! •  Customers are gaining deep insights on their business operations! •  Customers in Games, Mobile apps, advertising/ marketing, e/commerce! 12