SlideShare a Scribd company logo
1 of 28
Google Bigtable (Bigtable: A Distributed Storage System for Structured Data) Komadinovic Vanja, Vast Platform team
Presentation overview ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Introduction ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Design ,[object Object],[object Object],[object Object],[object Object],[object Object]
Design
Design ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Design
Basic implementation
GFS - HDFS introduction ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
GFS - HDFS introduction: overview
MapReduce introduction ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
MapReduce introduction: overview
Implementation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Implementation: overview
Implementation: oragnization - two special tables: ROOT and METADATA - ROOT never gets splitted - location of ROOT is in Chubby file - estimated ROOT size 128Mb => 2^34 tablets - METADATA contains all user tablets, row key is an encoding of the tablet’s table identifier and its end row
Implementation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
HBase - Apache Bigtable solution 
HBase - Bigtable synonyms ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
HBase differences ,[object Object],[object Object],[object Object]
Performance and usage case - number of 1000-byte values read/written per second - table shows the rate per tablet server Experiment 1 TS 50 TS 250 TS 500 TS random reads 1212 593 479 241 random reads mem 10811 8511 8000 6250 random writes 8850 3745 3425 2000 sequential reads 4425 2463 2625 2469 sequential writes 8547 3623 2451 1905 scans 15385 10526 9524 7843
Performances and usage case ,[object Object],[object Object],[object Object],[object Object]
Performances and usage case ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Cooked by Vast: RecordStack ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Cooked by Vast: RecordStack ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Some thoughts for discussion ,[object Object],[object Object],[object Object]
Useful links ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
If you, maybe, want to contact me? ,[object Object],[object Object],[object Object],[object Object]
Thanks !!!

More Related Content

What's hot (20)

Bigtable
BigtableBigtable
Bigtable
 
Big table
Big tableBig table
Big table
 
Big table
Big tableBig table
Big table
 
Google - Bigtable
Google - BigtableGoogle - Bigtable
Google - Bigtable
 
Google BigTable
Google BigTableGoogle BigTable
Google BigTable
 
Bigtable
BigtableBigtable
Bigtable
 
BigTable And Hbase
BigTable And HbaseBigTable And Hbase
BigTable And Hbase
 
The Google Bigtable
The Google BigtableThe Google Bigtable
The Google Bigtable
 
Big table
Big tableBig table
Big table
 
Bigtable
BigtableBigtable
Bigtable
 
Bigtable and Dynamo
Bigtable and DynamoBigtable and Dynamo
Bigtable and Dynamo
 
The Google File System (GFS)
The Google File System (GFS)The Google File System (GFS)
The Google File System (GFS)
 
Cloud Technology: Virtualization
Cloud Technology: VirtualizationCloud Technology: Virtualization
Cloud Technology: Virtualization
 
Bigtable and Boxwood
Bigtable and BoxwoodBigtable and Boxwood
Bigtable and Boxwood
 
Google cluster architecture
Google cluster architecture Google cluster architecture
Google cluster architecture
 
Column oriented database
Column oriented databaseColumn oriented database
Column oriented database
 
02.28.13 WANdisco ApacheCon 2013
02.28.13 WANdisco ApacheCon 201302.28.13 WANdisco ApacheCon 2013
02.28.13 WANdisco ApacheCon 2013
 
Row or Columnar Database
Row or Columnar DatabaseRow or Columnar Database
Row or Columnar Database
 
Google jeff dean lessons learned while building infrastructure software at go...
Google jeff dean lessons learned while building infrastructure software at go...Google jeff dean lessons learned while building infrastructure software at go...
Google jeff dean lessons learned while building infrastructure software at go...
 
An Introduction to Hadoop
An Introduction to HadoopAn Introduction to Hadoop
An Introduction to Hadoop
 

Similar to An SEO-Optimized Title for a Document on Google Bigtable and Distributed Data Storage Systems

GCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGuang Xu
 
Apache hadoop, hdfs and map reduce Overview
Apache hadoop, hdfs and map reduce OverviewApache hadoop, hdfs and map reduce Overview
Apache hadoop, hdfs and map reduce OverviewNisanth Simon
 
Meethadoop
MeethadoopMeethadoop
MeethadoopIIIT-H
 
Kudu - Fast Analytics on Fast Data
Kudu - Fast Analytics on Fast DataKudu - Fast Analytics on Fast Data
Kudu - Fast Analytics on Fast DataRyan Bosshart
 
HBase, crazy dances on the elephant back.
HBase, crazy dances on the elephant back.HBase, crazy dances on the elephant back.
HBase, crazy dances on the elephant back.Roman Nikitchenko
 
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...Yahoo Developer Network
 
Hoodie - DataEngConf 2017
Hoodie - DataEngConf 2017Hoodie - DataEngConf 2017
Hoodie - DataEngConf 2017Vinoth Chandar
 
The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1Hassy Veldstra
 
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 2)
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 2)SQLSaturday #230 - Introduction to Microsoft Big Data (Part 2)
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 2)Sascha Dittmann
 

Similar to An SEO-Optimized Title for a Document on Google Bigtable and Distributed Data Storage Systems (20)

Gcp data engineer
Gcp data engineerGcp data engineer
Gcp data engineer
 
GCP Data Engineer cheatsheet
GCP Data Engineer cheatsheetGCP Data Engineer cheatsheet
GCP Data Engineer cheatsheet
 
Apache hadoop, hdfs and map reduce Overview
Apache hadoop, hdfs and map reduce OverviewApache hadoop, hdfs and map reduce Overview
Apache hadoop, hdfs and map reduce Overview
 
Training
TrainingTraining
Training
 
Hadoop
HadoopHadoop
Hadoop
 
Hbase: an introduction
Hbase: an introductionHbase: an introduction
Hbase: an introduction
 
Meethadoop
MeethadoopMeethadoop
Meethadoop
 
Kudu - Fast Analytics on Fast Data
Kudu - Fast Analytics on Fast DataKudu - Fast Analytics on Fast Data
Kudu - Fast Analytics on Fast Data
 
HBase introduction talk
HBase introduction talkHBase introduction talk
HBase introduction talk
 
HBase, crazy dances on the elephant back.
HBase, crazy dances on the elephant back.HBase, crazy dances on the elephant back.
HBase, crazy dances on the elephant back.
 
Hadoop - Introduction to HDFS
Hadoop - Introduction to HDFSHadoop - Introduction to HDFS
Hadoop - Introduction to HDFS
 
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
 
Hadoop Research
Hadoop Research Hadoop Research
Hadoop Research
 
Kudu Deep-Dive
Kudu Deep-DiveKudu Deep-Dive
Kudu Deep-Dive
 
Lecture 2 part 1
Lecture 2 part 1Lecture 2 part 1
Lecture 2 part 1
 
Hoodie - DataEngConf 2017
Hoodie - DataEngConf 2017Hoodie - DataEngConf 2017
Hoodie - DataEngConf 2017
 
The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1
 
Fast Analytics
Fast Analytics Fast Analytics
Fast Analytics
 
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 2)
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 2)SQLSaturday #230 - Introduction to Microsoft Big Data (Part 2)
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 2)
 
Apache Kudu
Apache KuduApache Kudu
Apache Kudu
 

An SEO-Optimized Title for a Document on Google Bigtable and Distributed Data Storage Systems