SlideShare a Scribd company logo
1 of 30
© 2014 MapR Technologies 1© 2014 MapR Technologies
Zeta Architecture
© 2014 MapR Technologies 2
Agenda
• Current State
– History
– Moving Forward
• The Next Enterprise Architecture
• Business Implications
• Concrete Implementations
© 2014 MapR Technologies 3© 2014 MapR Technologies
Current State
© 2014 MapR Technologies 4
Study History to Prepare for the Future
• A data center was built
• The servers were statically
partitioned
• If we want to break the cycle
we have to break the
partitions and become
dynamic
© 2014 MapR Technologies 5
Understanding the Why’s
• Isolation of resources
– Assists in troubleshooting
– Prevents the analytics team from impacting production
• Maximum throughput of an application
– Guaranteed volume (maximum): compute, memory and storage
• Business Continuity
– We know exactly what is backed up, when, and where
– Difficult to perfect and to test
© 2014 MapR Technologies 6
Issues with Isolated Workloads
• Segregated servers lead to under utilized hardware
– Wasted capacity and energy
• Complicated processes to move data to the required processing
servers
– Operational impact, including extra monitoring
– Time delays moving data (not real-time)
– Troubleshooting time when there are issues
• Difficult to thoroughly test DEV vs. QA vs. Production
– Environments have different shapes and sizes
– They will not have identical configurations
© 2014 MapR Technologies 7
Goals Moving Forward
• Leverage all existing hardware
• Create isolation in a different way
• Improve production operational processes
• Fix process of moving from DEV to QA to Production
• Support real-time business continuity
© 2014 MapR Technologies 8© 2014 MapR Technologies
The Next “Last” Enterprise Architecture
© 2014 MapR Technologies 9
The Next Generation Enterprise Architecture
• Dynamic compute resources
• Common storage platform
• Real-time application support
• Flexible programming models
• Deployment management
• Solution based approach
• Applications to operate a
business
* This is a pluggable architecture
© 2014 MapR Technologies 10
Technologies That Work
© 2014 MapR Technologies 11
We Will Call This Architecture…
© 2014 MapR Technologies 12
What’s in a Name
• The letter Z is the last letter in the English
alphabet, but Zeta is not the last letter of the
Greek alphabet
– But this is the last generalized architecture you
will need.
• Sixth letter of the Greek alphabet
– Hexagon represents the 6 surrounding pieces
• Zeta represents the number 7
– 7 total components in this architecture
– Components work with a global resource
manager
© 2014 MapR Technologies 13
Origin Story of the Zeta Architecture
• Cultivated by Jim Scott
– Created the pretty diagrams
– Put a nice name on it
– Documented the concepts
• Not really a new concept
– Google pretty much pioneered these
technology concepts
– They have never really discussed it
cohesively in this way
© 2014 MapR Technologies 14
Zeta Architecture at Google
© 2014 MapR Technologies 15© 2014 MapR Technologies
Concrete Implementations
© 2014 MapR Technologies 16
Web Server Logs
• Web server generates logs
• Land on local disk
– Logs periodically rotated
• Shipped to other servers
• Run jobs on logs
© 2014 MapR Technologies 17
Web Server Logs
• Web server generates logs
• Land on DFS
– Logs still rotate
– Logs now tolerant of a server
failure prior to rotation
– Logs are instantaneously
available for computation
• Run jobs on logs
– Data locality
© 2014 MapR Technologies 18
Advertising Platform
© 2014 MapR Technologies 19
Advertising Platform - Simplified
© 2014 MapR Technologies 20
Advertising Platform on Zeta
© 2014 MapR Technologies 21© 2014 MapR Technologies
Business Implications
© 2014 MapR Technologies 22
Integration of Existing Systems
• Use standards like NFS to connect existing
systems
• Pluggable security models fit into your
companies current standards
• Database that have been tested to work
– MySQL, Vertica, Sybase IQ
– ElasticSearch, SOLR
• May work, but not tested
– Oracle, DB2, SQL Server, PostgreSQL
• Applications in this architecture can still use them
• If they start supporting these technologies things change
© 2014 MapR Technologies 23
Rethink the Data Center
• All Servers
– Run Mesos
– Participate in the Distributed File System
• Dynamic Allocation of Resources
– Spin up more web servers
– Custom Business Applications
– Big Data Analytics
• Data Locality
– No more shipping data
– Store and process the data where it was created
© 2014 MapR Technologies 24
Simplified Architecture
• Less moving parts
– Less things to go wrong
• Better resource utilization
– Scale any application up or down on demand
• Common deployment model (new isolation model)
– Repeatability between environments (dev, qa, production)
• Shared file system
– Get at the data anywhere in the cluster
– Simplifies business continuity
© 2014 MapR Technologies 25
Business Continuity
• Resilience
– Redundancy
– High Availability
– Spare Capacity
• Recovery
– Snapshots
– Disaster Recovery
• Contingency
– Protect against the unforeseen
– Multisite Capability
Production
WAN
Production Research
Datacenter 1 Datacenter 2
WAN EC2
© 2014 MapR Technologies 26
Platform-wide Security and Compliance
• Authentication, Authorization, Auditing
– Users and jobs
– All tiers
• Data protection
– Wire-level encryption between servers
– Masking
• Regulatory Compliance
– Automatic expiration of “old” data
– Data locality supported by distributed file system
© 2014 MapR Technologies 27
Net Benefit
• Reduced operating expenses (OPEX)
– Better utilization of available capacity and data center space
• Reduced capital expenses (CAPEX)
– Less total hardware needed
• Improves time to market
– Streamlined deployments
– Environments become consistent and predictable
• Delivers a competitive advantage
– Via platform scaling
– Performance improvements
© 2014 MapR Technologies 28
Recap
• Saves valuable time and money
• Enables stronger business continuity capabilities
• Google has been doing this for years
– Real-time is the crux of everything Google does
• Time for the rest of us to operate at Google scale
– The technologies are there and they play together nicely
– Process changes must occur internally to achieve this architecture
• This approach will become the “traditional” way of thinking
– Don’t get beat to it by your competitors
© 2014 MapR Technologies 29
Go Forth and Implement the Zeta Architecture
© 2014 MapR Technologies 30
Q&A
@kingmesal maprtech
jscott@mapr.com
Engage with us!
MapR
maprtech
mapr-technologies

More Related Content

What's hot

Timeless design in a cloud-native world
Timeless design in a cloud-native worldTimeless design in a cloud-native world
Timeless design in a cloud-native worldUwe Friedrichsen
 
How Financial Services Organizations Use MongoDB
How Financial Services Organizations Use MongoDBHow Financial Services Organizations Use MongoDB
How Financial Services Organizations Use MongoDBMongoDB
 
Data center proposal
Data center proposalData center proposal
Data center proposalMuhammad Ahad
 
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...Alan McSweeney
 
How to Structure the Data Organization
How to Structure the Data OrganizationHow to Structure the Data Organization
How to Structure the Data OrganizationRobyn Bollhorst
 
IT Service Management Powerpoint Presentation Slides
IT Service Management Powerpoint Presentation SlidesIT Service Management Powerpoint Presentation Slides
IT Service Management Powerpoint Presentation SlidesSlideTeam
 
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)Darshankumar Prajapati
 
InterPlanetary File System (IPFS)
InterPlanetary File System (IPFS)InterPlanetary File System (IPFS)
InterPlanetary File System (IPFS)Gene Leybzon
 
OOW16 - Technical Upgrade Best Practices for Oracle E-Business Suite 12.2 [CO...
OOW16 - Technical Upgrade Best Practices for Oracle E-Business Suite 12.2 [CO...OOW16 - Technical Upgrade Best Practices for Oracle E-Business Suite 12.2 [CO...
OOW16 - Technical Upgrade Best Practices for Oracle E-Business Suite 12.2 [CO...vasuballa
 
Design Guidelines for Data Mesh and Decentralized Data Organizations
Design Guidelines for Data Mesh and Decentralized Data OrganizationsDesign Guidelines for Data Mesh and Decentralized Data Organizations
Design Guidelines for Data Mesh and Decentralized Data OrganizationsDenodo
 
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...DATAVERSITY
 
Azure Data Factory v2
Azure Data Factory v2Azure Data Factory v2
Azure Data Factory v2inovex GmbH
 
Dealing with Azure Cosmos DB
Dealing with Azure Cosmos DBDealing with Azure Cosmos DB
Dealing with Azure Cosmos DBMihail Mateev
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault ModelingKent Graziano
 
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...Aaron Zornes
 
Oracle PPM Cloud Deployment, Made Easy
Oracle PPM Cloud Deployment, Made EasyOracle PPM Cloud Deployment, Made Easy
Oracle PPM Cloud Deployment, Made EasyMatthew Bezuidenhout
 
Data Management Services
Data Management ServicesData Management Services
Data Management ServicesBackOfficePro
 

What's hot (20)

Timeless design in a cloud-native world
Timeless design in a cloud-native worldTimeless design in a cloud-native world
Timeless design in a cloud-native world
 
Data center
Data centerData center
Data center
 
How Financial Services Organizations Use MongoDB
How Financial Services Organizations Use MongoDBHow Financial Services Organizations Use MongoDB
How Financial Services Organizations Use MongoDB
 
Database
DatabaseDatabase
Database
 
Data center proposal
Data center proposalData center proposal
Data center proposal
 
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
 
How to Structure the Data Organization
How to Structure the Data OrganizationHow to Structure the Data Organization
How to Structure the Data Organization
 
IT Service Management Powerpoint Presentation Slides
IT Service Management Powerpoint Presentation SlidesIT Service Management Powerpoint Presentation Slides
IT Service Management Powerpoint Presentation Slides
 
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
 
InterPlanetary File System (IPFS)
InterPlanetary File System (IPFS)InterPlanetary File System (IPFS)
InterPlanetary File System (IPFS)
 
OOW16 - Technical Upgrade Best Practices for Oracle E-Business Suite 12.2 [CO...
OOW16 - Technical Upgrade Best Practices for Oracle E-Business Suite 12.2 [CO...OOW16 - Technical Upgrade Best Practices for Oracle E-Business Suite 12.2 [CO...
OOW16 - Technical Upgrade Best Practices for Oracle E-Business Suite 12.2 [CO...
 
Design Guidelines for Data Mesh and Decentralized Data Organizations
Design Guidelines for Data Mesh and Decentralized Data OrganizationsDesign Guidelines for Data Mesh and Decentralized Data Organizations
Design Guidelines for Data Mesh and Decentralized Data Organizations
 
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
DataEd Online: Data Architecture and Data Modeling Differences — Achieving a ...
 
Azure Data Factory v2
Azure Data Factory v2Azure Data Factory v2
Azure Data Factory v2
 
Dealing with Azure Cosmos DB
Dealing with Azure Cosmos DBDealing with Azure Cosmos DB
Dealing with Azure Cosmos DB
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
 
Dbms presentaion
Dbms presentaionDbms presentaion
Dbms presentaion
 
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
Analyst field reports on top 20 multi domain MDM solutions - Aaron Zornes (NY...
 
Oracle PPM Cloud Deployment, Made Easy
Oracle PPM Cloud Deployment, Made EasyOracle PPM Cloud Deployment, Made Easy
Oracle PPM Cloud Deployment, Made Easy
 
Data Management Services
Data Management ServicesData Management Services
Data Management Services
 

Similar to Zeta Architecture: The Next Generation Big Data Architecture

Zeta architecture - Hive London May15
Zeta architecture - Hive London May15Zeta architecture - Hive London May15
Zeta architecture - Hive London May15MapR Technologies
 
Next Generation Enterprise Architecture
Next Generation Enterprise ArchitectureNext Generation Enterprise Architecture
Next Generation Enterprise ArchitectureMapR Technologies
 
Real time-hadoop
Real time-hadoopReal time-hadoop
Real time-hadoopTed Dunning
 
Hadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRHadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRData Con LA
 
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...SL Corporation
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR Technologies
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016Mathieu Dumoulin
 
MapR-DB – The First In-Hadoop Document Database
MapR-DB – The First In-Hadoop Document DatabaseMapR-DB – The First In-Hadoop Document Database
MapR-DB – The First In-Hadoop Document DatabaseMapR Technologies
 
Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop DataWorks Summit/Hadoop Summit
 
IoT and Big Data - Iot Asia 2014
IoT and Big Data - Iot Asia 2014IoT and Big Data - Iot Asia 2014
IoT and Big Data - Iot Asia 2014John Berns
 
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Apache Hadoop YARN - The Future of Data Processing with HadoopApache Hadoop YARN - The Future of Data Processing with Hadoop
Apache Hadoop YARN - The Future of Data Processing with HadoopHortonworks
 
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Chris Fregly
 
Knowledge Processing with Big Data and Semantic Web Technologies
Knowledge Processing with Big Data and  Semantic Web TechnologiesKnowledge Processing with Big Data and  Semantic Web Technologies
Knowledge Processing with Big Data and Semantic Web TechnologiesSyed Muhammad Ali Hasnain
 
Integrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentIntegrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentMapR Technologies
 
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...DataStax Academy
 
Lessons Learned from Building an Enterprise Big Data Platform from the Ground...
Lessons Learned from Building an Enterprise Big Data Platform from the Ground...Lessons Learned from Building an Enterprise Big Data Platform from the Ground...
Lessons Learned from Building an Enterprise Big Data Platform from the Ground...DataWorks Summit
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxjKool
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxDataStax
 

Similar to Zeta Architecture: The Next Generation Big Data Architecture (20)

Zeta architecture - Hive London May15
Zeta architecture - Hive London May15Zeta architecture - Hive London May15
Zeta architecture - Hive London May15
 
Zeta architecture -2015
Zeta architecture -2015Zeta architecture -2015
Zeta architecture -2015
 
Next Generation Enterprise Architecture
Next Generation Enterprise ArchitectureNext Generation Enterprise Architecture
Next Generation Enterprise Architecture
 
Keys for Success from Streams to Queries
Keys for Success from Streams to QueriesKeys for Success from Streams to Queries
Keys for Success from Streams to Queries
 
Real time-hadoop
Real time-hadoopReal time-hadoop
Real time-hadoop
 
Hadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapRHadoop and NoSQL joining forces by Dale Kim of MapR
Hadoop and NoSQL joining forces by Dale Kim of MapR
 
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
 
MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -MapR on Azure: Getting Value from Big Data in the Cloud -
MapR on Azure: Getting Value from Big Data in the Cloud -
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
 
MapR-DB – The First In-Hadoop Document Database
MapR-DB – The First In-Hadoop Document DatabaseMapR-DB – The First In-Hadoop Document Database
MapR-DB – The First In-Hadoop Document Database
 
Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop
 
IoT and Big Data - Iot Asia 2014
IoT and Big Data - Iot Asia 2014IoT and Big Data - Iot Asia 2014
IoT and Big Data - Iot Asia 2014
 
Apache Hadoop YARN - The Future of Data Processing with Hadoop
Apache Hadoop YARN - The Future of Data Processing with HadoopApache Hadoop YARN - The Future of Data Processing with Hadoop
Apache Hadoop YARN - The Future of Data Processing with Hadoop
 
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...
 
Knowledge Processing with Big Data and Semantic Web Technologies
Knowledge Processing with Big Data and  Semantic Web TechnologiesKnowledge Processing with Big Data and  Semantic Web Technologies
Knowledge Processing with Big Data and Semantic Web Technologies
 
Integrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environmentIntegrating Hadoop into your enterprise IT environment
Integrating Hadoop into your enterprise IT environment
 
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
ProtectWise Revolutionizes Enterprise Network Security in the Cloud with Data...
 
Lessons Learned from Building an Enterprise Big Data Platform from the Ground...
Lessons Learned from Building an Enterprise Big Data Platform from the Ground...Lessons Learned from Building an Enterprise Big Data Platform from the Ground...
Lessons Learned from Building an Enterprise Big Data Platform from the Ground...
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStax
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStax
 

More from MapR Technologies

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscapeMapR Technologies
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationMapR Technologies
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataMapR Technologies
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureMapR Technologies
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...MapR Technologies
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsMapR Technologies
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMapR Technologies
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action MapR Technologies
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsMapR Technologies
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageMapR Technologies
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionMapR Technologies
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformMapR Technologies
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...MapR Technologies
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareMapR Technologies
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsMapR Technologies
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Technologies
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data AnalyticsMapR Technologies
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsMapR Technologies
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR Technologies
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLMapR Technologies
 

More from MapR Technologies (20)

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action Data Warehouse Modernization: Accelerating Time-To-Action
Data Warehouse Modernization: Accelerating Time-To-Action
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIs
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT Better
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQL
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 

Recently uploaded (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Zeta Architecture: The Next Generation Big Data Architecture

  • 1. © 2014 MapR Technologies 1© 2014 MapR Technologies Zeta Architecture
  • 2. © 2014 MapR Technologies 2 Agenda • Current State – History – Moving Forward • The Next Enterprise Architecture • Business Implications • Concrete Implementations
  • 3. © 2014 MapR Technologies 3© 2014 MapR Technologies Current State
  • 4. © 2014 MapR Technologies 4 Study History to Prepare for the Future • A data center was built • The servers were statically partitioned • If we want to break the cycle we have to break the partitions and become dynamic
  • 5. © 2014 MapR Technologies 5 Understanding the Why’s • Isolation of resources – Assists in troubleshooting – Prevents the analytics team from impacting production • Maximum throughput of an application – Guaranteed volume (maximum): compute, memory and storage • Business Continuity – We know exactly what is backed up, when, and where – Difficult to perfect and to test
  • 6. © 2014 MapR Technologies 6 Issues with Isolated Workloads • Segregated servers lead to under utilized hardware – Wasted capacity and energy • Complicated processes to move data to the required processing servers – Operational impact, including extra monitoring – Time delays moving data (not real-time) – Troubleshooting time when there are issues • Difficult to thoroughly test DEV vs. QA vs. Production – Environments have different shapes and sizes – They will not have identical configurations
  • 7. © 2014 MapR Technologies 7 Goals Moving Forward • Leverage all existing hardware • Create isolation in a different way • Improve production operational processes • Fix process of moving from DEV to QA to Production • Support real-time business continuity
  • 8. © 2014 MapR Technologies 8© 2014 MapR Technologies The Next “Last” Enterprise Architecture
  • 9. © 2014 MapR Technologies 9 The Next Generation Enterprise Architecture • Dynamic compute resources • Common storage platform • Real-time application support • Flexible programming models • Deployment management • Solution based approach • Applications to operate a business * This is a pluggable architecture
  • 10. © 2014 MapR Technologies 10 Technologies That Work
  • 11. © 2014 MapR Technologies 11 We Will Call This Architecture…
  • 12. © 2014 MapR Technologies 12 What’s in a Name • The letter Z is the last letter in the English alphabet, but Zeta is not the last letter of the Greek alphabet – But this is the last generalized architecture you will need. • Sixth letter of the Greek alphabet – Hexagon represents the 6 surrounding pieces • Zeta represents the number 7 – 7 total components in this architecture – Components work with a global resource manager
  • 13. © 2014 MapR Technologies 13 Origin Story of the Zeta Architecture • Cultivated by Jim Scott – Created the pretty diagrams – Put a nice name on it – Documented the concepts • Not really a new concept – Google pretty much pioneered these technology concepts – They have never really discussed it cohesively in this way
  • 14. © 2014 MapR Technologies 14 Zeta Architecture at Google
  • 15. © 2014 MapR Technologies 15© 2014 MapR Technologies Concrete Implementations
  • 16. © 2014 MapR Technologies 16 Web Server Logs • Web server generates logs • Land on local disk – Logs periodically rotated • Shipped to other servers • Run jobs on logs
  • 17. © 2014 MapR Technologies 17 Web Server Logs • Web server generates logs • Land on DFS – Logs still rotate – Logs now tolerant of a server failure prior to rotation – Logs are instantaneously available for computation • Run jobs on logs – Data locality
  • 18. © 2014 MapR Technologies 18 Advertising Platform
  • 19. © 2014 MapR Technologies 19 Advertising Platform - Simplified
  • 20. © 2014 MapR Technologies 20 Advertising Platform on Zeta
  • 21. © 2014 MapR Technologies 21© 2014 MapR Technologies Business Implications
  • 22. © 2014 MapR Technologies 22 Integration of Existing Systems • Use standards like NFS to connect existing systems • Pluggable security models fit into your companies current standards • Database that have been tested to work – MySQL, Vertica, Sybase IQ – ElasticSearch, SOLR • May work, but not tested – Oracle, DB2, SQL Server, PostgreSQL • Applications in this architecture can still use them • If they start supporting these technologies things change
  • 23. © 2014 MapR Technologies 23 Rethink the Data Center • All Servers – Run Mesos – Participate in the Distributed File System • Dynamic Allocation of Resources – Spin up more web servers – Custom Business Applications – Big Data Analytics • Data Locality – No more shipping data – Store and process the data where it was created
  • 24. © 2014 MapR Technologies 24 Simplified Architecture • Less moving parts – Less things to go wrong • Better resource utilization – Scale any application up or down on demand • Common deployment model (new isolation model) – Repeatability between environments (dev, qa, production) • Shared file system – Get at the data anywhere in the cluster – Simplifies business continuity
  • 25. © 2014 MapR Technologies 25 Business Continuity • Resilience – Redundancy – High Availability – Spare Capacity • Recovery – Snapshots – Disaster Recovery • Contingency – Protect against the unforeseen – Multisite Capability Production WAN Production Research Datacenter 1 Datacenter 2 WAN EC2
  • 26. © 2014 MapR Technologies 26 Platform-wide Security and Compliance • Authentication, Authorization, Auditing – Users and jobs – All tiers • Data protection – Wire-level encryption between servers – Masking • Regulatory Compliance – Automatic expiration of “old” data – Data locality supported by distributed file system
  • 27. © 2014 MapR Technologies 27 Net Benefit • Reduced operating expenses (OPEX) – Better utilization of available capacity and data center space • Reduced capital expenses (CAPEX) – Less total hardware needed • Improves time to market – Streamlined deployments – Environments become consistent and predictable • Delivers a competitive advantage – Via platform scaling – Performance improvements
  • 28. © 2014 MapR Technologies 28 Recap • Saves valuable time and money • Enables stronger business continuity capabilities • Google has been doing this for years – Real-time is the crux of everything Google does • Time for the rest of us to operate at Google scale – The technologies are there and they play together nicely – Process changes must occur internally to achieve this architecture • This approach will become the “traditional” way of thinking – Don’t get beat to it by your competitors
  • 29. © 2014 MapR Technologies 29 Go Forth and Implement the Zeta Architecture
  • 30. © 2014 MapR Technologies 30 Q&A @kingmesal maprtech jscott@mapr.com Engage with us! MapR maprtech mapr-technologies

Editor's Notes

  1. Static partitioning was used to create isolation. This enabled people to know that when a problem occurred what was causing the issue. This is something containers can fix. Additionally, when we statically partition, we cannot fully leverage all the resources for all the most important work. Most server types are busy at different times of day, or can be. Which work is the most important? Whichever is deemed the most important now!
  2. The problem with this business continuity story is that you are still limited to generally scheduled backups and not real-time.
  3. Pluggable, because we need an architecture that platforms can be modeled after. If we have to rethink an enterprise architecture every time the next greatest thing comes out we end up redoing a lot of work.
  4. These technology concepts are now mature enough that they will stick around and the specific implementation is flexible.
  5. NOTE: The solution architecture can integrate remote platforms and is a conceptual model and thus isn’t necessarily managed by a global resource manager
  6. “Google is living a few years in the future and sends the rest of us messages” –Doug Cutting, Hadoop Founder
  7. Server crashes before a file is rolled and the data is lost.
  8. Leveraging the distributed file system means that the data is NOT lost. Scales without the pain of figuring out how to move that data, the platform handles it for you. From the solution architecture perspective this is likely going to be utilized in monitoring the operations of an environment, or perhaps revenue management.
  9. These parts are expensive. They are difficult to test because production is often drastically different from dev and qa.
  10. NOTE: The billing database is an external entity, likely an RDBMS and is still a part of the solution architecture to deliver the business objectives
  11. If it can run on Netapp, then it can run on MapR. As pointed out in the advertising example. The billing system is still in an RDBMS. It still integrates via the solution architecture, but the RDBMS is not running on this platform architecture.
  12. The benefits are plentiful They rely heavily on Borg and Omega They perform 2 billion container deployments per week
  13. Follow me on twitter to get updates for documentation on the Zeta Architecture