7. Google Compute Engine
Google App Engine
Google BigQuery
Data Analytics with Google’s
massively parallel query engine
IaaS with Google’s
network and computing infra
PaaS with Google’s
container technology
9. 72 hours / min
100 petabytes
500M active users
How Big is Big Data in Google?
10. In Google, we use
Google BigQuery
for processing
Big Data
SELECT
top(appId, 20) AS app,
count(*) AS count
FROM installlog.2013
ORDER BY
count DESC
Scans 100B rows
in 10 sec, without index.
13. select top(title), count(*)
from publicdata:samples.wikipedia
To scan 1 TB in 1 sec,
you need 5,000 disks in parallel
Massively Parallel Processing
14. Mixer 0
Mixer 1 Mixer 1
Shard Shard Shard Shard
ColumnIO on Colossus SELECT state, year
COUNT(*)
GROUP BY state
WHERE year >= 1980 and year < 1990
ORDER BY count_babies DESC
LIMIT 10
COUNT(*)
GROUP BY state
Tree structure for fast aggregation
18. Google confidential | Do not distribute
Google’s Container Technology
Every Google service runs in containers
Launches 2B containers every week
19. Google confidential | Do not distribute
Scaling Out by Google’s Container Technology
Instantiation
in 40 ms
20. GAE Architecture And Optmization
The Traditional Web Development
Hardware Failures
Traffic Spike
Growing Big Data
Complex Design
Complex Dev.
Complex Admin
Cost
21. GAE Architecture And Optmization
Google App Engine Encapsulates Them All
Hardware Failures
Traffic Spike
Growing Big Data
Simpler Design
Simpler Dev
No Admin
No Initial Funding
23. 700million
“App Engine enabled us to focus on developing the
application. We wouldn’t have gotten here without the
ease of development that App Engine gave us.”
Bobby Murphy, CTO
Snapchat sends
photos and videos each day Google App Engine
scaled seamlessly
during growth to
millions of users
Small team is able
to innovate quickly
and expand
globally
26. Andromeda: The latest Software Defined Network (SDN)
- Intra/Inter zone throughput = 9G bps
- Inter regional private network by default
27. Google-scale Load Balancing
region 2 region 3region 1
VMs VMs VMs
:::::::::::::::::::::::::::::::::::::::::::::::HTTP
LB
DNS :)
One global IP:
11.22.33.44
● Load Balancing infra
shared with Google.com
● Easily handles 1M req/s
● TCP anycast with single
global IP
● Fast fail-over in seconds
29. Run and manage an optimized cluster
Kubernetes
Open Source
Workload
API & Tools
Google Container Engine (GKE)
Kubernetes
● Growing Community
● Linux containers support
● Open source orchestration
create and manage
Container Engine
● クラスタをすぐに起動可能
● クラスタ環境の最適化
● Kubernetesを実行管理する
GKE
Master
GKE Nodes
Compute
Engine
VMs
Google Container Engine