Applications of
HBase/Phoenix @ 23andMe
Tulasi Paradarami, Engineering Tech Lead
Marcell Ortutay, Sr. Software Engineer
Copyright © 2017 23andMe, Inc. All rights reserved.
Agenda
● Phoenix Use Cases
○ Identical By Descent (IBD)
○ Ancestry Composition
● Hash Joins
● Cluster Configuration
● Lessons Learnt
● Q & A
BigData @ 23andMe
● Over 5 million customers, with ~1 million genotyped and tens of millions of
imputed SNPs per customer
● Close to a trillion shared segments and relationships
● Billions of customer-reported survey answers
● Support real-time access to data at scale
● Simple arbitrary joins and aggregations
● Enable internal research through fast random access to any genetic
location across the database
HBase/Phoenix Cluster
● HBase 1.3, Phoenix 4.11
● Mirror cluster for load balancing and hot failover
● Live and mirror clusters run in different AWS availability zones
● Data Pipeline
1. Spark-based computation job stages data to HDFS
2. MR cluster generates incremental HFiles from the staged data
3. HFiles are transferred to the live and mirror HBase clusters
4. HFiles are backed up to S3 for disaster recovery
Architecture
[Diagram: live cluster in us-west-2a and mirror cluster in us-west-2b; API hosts connect through phoenixdb to a PQS cluster in front of each Phoenix/HBase cluster; an import cluster transfers HFiles to both clusters and backs them up to an S3 bucket (99.999999999% durability)]
HBase
● 60 region servers (d2.2xlarge)
● 8 vCPUs, 60G RAM (30G RS heap)
● 6 * 2T HDD
● enhanced networking
PQS cluster
● 10 * c5.4xlarge
● 16 vCPUs, 32G RAM
● EBS only
● enhanced networking
Phoenix Optimizations
● FAST_DIFF block encoding
● GZ compression
● Enable ROW-level bloom filters
● Salted tables
● Tune queries for RANGE SCAN or SKIP SCAN (HBase SEEK_NEXT_USING_HINT for
intra-region scans) instead of FULL SCAN
● Apply filters to leverage predicate pushdown
Query Performance
● RELATION
● SEGMENT
[Charts: query latency for the RELATION and SEGMENT tables]
Close Relatives
SELECT r1.person_id_2
FROM relation r1
WHERE r1.person_id_1 = ?
AND r1.half > ?;
Shared Segments
SELECT *
FROM segment
WHERE person_id_2 IN (?, ?) AND
person_id_1 = ? AND
cm > ?
ORDER BY chromosome, start_
Relatives in Common
SELECT r1.person_id_2
FROM relation r1 JOIN relation r2
ON r1.person_id_2 = r2.person_id_2
WHERE r1.person_id_1 = <your-id>
AND r2.person_id_1 = <relative’s-id>
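The self-join above can be exercised end to end against a toy dataset. This sketch uses SQLite in place of Phoenix, and the ids are invented, so it illustrates the query shape rather than the production schema:

```python
import sqlite3

# Toy in-memory stand-in for the RELATION table described in the slides;
# the real data lives in Phoenix/HBase, and these ids are made up.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE relation (person_id_1 INTEGER, person_id_2 INTEGER)")
conn.executemany(
    "INSERT INTO relation VALUES (?, ?)",
    [(1, 10), (1, 11), (1, 12),   # person 1's relatives
     (2, 11), (2, 12), (2, 13)],  # person 2's relatives
)

# Same self-join as the slide: relatives shared by person 1 and person 2.
rows = conn.execute("""
    SELECT r1.person_id_2
    FROM relation r1 JOIN relation r2
      ON r1.person_id_2 = r2.person_id_2
    WHERE r1.person_id_1 = 1
      AND r2.person_id_1 = 2
""").fetchall()

print(sorted(r[0] for r in rows))  # relatives in common: [11, 12]
```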
Ancestry Composition (use case II)
● 23andMe's Ancestry Composition report is a powerful and well-tested
system for analyzing ancestry based on DNA
● Your Ancestry Composition report shows the percentage of your DNA that
comes from 31 populations
● We use Phoenix to join data from segments, relations, and
customer-reported survey answers to build the report
Ancestry Composition
[Diagram: Ancestry Composition query, with subqueries common across queries over tables holding billions of rows each]
Joins
Eye Color ID Color
1 blue
2 green
3 brown
Person ID Eye Color ID
1 2
2 1
2 2
4 3
Joins
Eye Color ID Color
1 blue
2 green
3 brown
Person ID Eye Color ID Color
1 2 green
2 1 blue
2 2 green
4 3 brown
Hash Joins
Person ID Eye Color ID
1 2
2 1
2 2
4 3
1 => “blue”
2 => “green”
3 => “brown”
- Iterate over rows
- Use hashmap to look up
joined values
- O(N) performance
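The bullets above can be sketched in a few lines of plain Python, using the eye-color tables from the previous slides (this is the textbook hash-join idea, not Phoenix internals):

```python
# Minimal hash-join sketch: build a hashmap from the small table, then
# make one pass over the big table with O(1) lookups, giving O(N) overall.
eye_color = {1: "blue", 2: "green", 3: "brown"}   # small table -> hashmap
person = [(1, 2), (2, 1), (2, 2), (4, 3)]          # (person_id, eye_color_id)

joined = [(pid, cid, eye_color[cid]) for pid, cid in person]
print(joined)
# [(1, 2, 'green'), (2, 1, 'blue'), (2, 2, 'green'), (4, 3, 'brown')]
```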
Hash Joins in Phoenix
[Diagram: client and region servers]
Example query:
SELECT * FROM person
JOIN eye_color ON eye_color.id = person.eye_color_id
Hash Joins in Phoenix
[Diagram: client and region servers]
Step 1: Build the hash join table
Step 2: Distribute the hash join table
Step 3: Execute join algorithm on Region Servers
Step 4: Release the hash join tables, return results
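The four steps can be simulated with region servers modeled as plain list partitions; the row values and partitioning below are illustrative, not Phoenix's actual wire protocol:

```python
# Toy simulation of the broadcast hash join steps above.
eye_color_rows = [(1, "blue"), (2, "green"), (3, "brown")]
region_partitions = [   # person table split across three "region servers"
    [(1, 2), (2, 1)],
    [(2, 2)],
    [(4, 3)],
]

# Step 1: client builds the hash join table from the smaller side
hash_table = dict(eye_color_rows)

# Step 2: broadcast a copy to every region server
caches = [dict(hash_table) for _ in region_partitions]

# Step 3: each region server joins its local rows against its copy
results = []
for cache, partition in zip(caches, region_partitions):
    results.extend((pid, cid, cache[cid]) for pid, cid in partition)

# Step 4: release the hash join tables and return the merged results
caches.clear()
print(sorted(results))
```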
Hash Joins in Phoenix
Step 1: Build the hash join table
Step 2: Distribute the hash join table
Step 3: Execute join algorithm on Region Servers
Step 4: Release the hash join tables, return results
Timing breakdown:
Phase 1 (generate hash join table): Step 1 is compute/IO-bound, Step 2 is network-bound
Phase 2 (execute join): Step 3 is compute/IO-bound, Step 4 returns results
Hash Joins in Phoenix - Pros/Cons
Good:
- Hash joins are fast, O(N) performance
Bad:
- Broadcasting can be slow for large hash tables
- The hash table must be rebuilt and re-broadcast on every execution
SELECT * FROM person
WHERE person.some_value IN (SELECT value FROM large_expensive_table)
AND person_id = 1234;
- In this example, only person_id changes between executions
Hash Joins in Phoenix - Pros/Cons
Performance breakdown for example query
[Chart: per-region-server timing for the query below]
SELECT * FROM person
WHERE person.some_value IN (SELECT value FROM large_expensive_table)
AND person_id = 1234;
Subquery caching to improve hash joins
SELECT * FROM person
WHERE person.some_value IN (SELECT value FROM large_expensive_table)
AND person_id = 1234;
- The subquery against “large_expensive_table” often repeats across queries
- In the example, only person_id changes
- Application-level caching is not possible when the full query is not identical
What if we keep the subquery results around?
Subquery caching to improve hash joins
New hint! /*+ USE_PERSISTENT_CACHE */
SELECT /*+ USE_PERSISTENT_CACHE */ * FROM person
WHERE person.some_value IN (SELECT value FROM large_expensive_table)
AND person_id = 1234;
- Each subquery is assigned a cache key based on the hash of its query
statement
- E.g. Hash(“SELECT value FROM large_expensive_table”) = 0xa034bf...
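A cache key of this kind can be sketched with a standard hash function; Phoenix's actual key derivation is internal and may differ from this illustration:

```python
import hashlib

# Illustrative only: derive a stable cache key from the subquery text.
def cache_key(subquery_sql: str) -> str:
    return hashlib.sha256(subquery_sql.encode("utf-8")).hexdigest()

key = cache_key("SELECT value FROM large_expensive_table")
print(key[:8])  # same statement always yields the same key
```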
Subquery caching to improve hash joins
SELECT /*+ USE_PERSISTENT_CACHE */ * FROM person
WHERE person.some_value IN (SELECT value FROM large_expensive_table)
AND person_id = 1234;
First execution:
1. Generate hash join table (query on large_expensive_table)
2. Scan person table and execute join
O(large_expensive_table) + O(person)
Second execution:
1. Scan person table and execute join
O(person)
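The first-versus-second execution cost can be demonstrated with a toy memoized subquery; the counter stands in for real query cost, and none of this is Phoenix's implementation:

```python
# Sketch of the caching effect: the expensive subquery runs once, and
# later executions only pay for the person-table scan.
subquery_runs = 0
cache = {}

def expensive_subquery():
    global subquery_runs
    subquery_runs += 1
    return {"a", "b", "c"}  # pretend result of large_expensive_table

def run_query(person_id, person_table):
    # Reuse the cached hash join table when present (USE_PERSISTENT_CACHE idea).
    key = "SELECT value FROM large_expensive_table"
    if key not in cache:
        cache[key] = expensive_subquery()
    values = cache[key]
    return [row for row in person_table if row[0] == person_id and row[1] in values]

people = [(1234, "a"), (1234, "z"), (5678, "b")]
run_query(1234, people)   # first execution: subquery + scan
run_query(5678, people)   # second execution: scan only
print(subquery_runs)      # the expensive subquery ran once
```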
Hash Joins in Phoenix - with subquery cache
Performance breakdown for example query
[Chart: per-region-server timing for the query below, without the hint]
SELECT * FROM person
WHERE person.some_value IN (SELECT value FROM large_expensive_table)
AND person_id = 1234;
Hash Joins in Phoenix - with subquery cache
Performance breakdown for example query
[Chart: per-region-server timing for the query below, with the hint]
SELECT /*+ USE_PERSISTENT_CACHE */ * FROM person
WHERE person.some_value IN (SELECT value FROM large_expensive_table)
AND person_id = 1234;
Subquery Cache at 23andMe
- Countries of Ancestry speedup:
- Before: ~3 seconds/query
- After: ~1 second/query
- Your DNA Family speedup:
- Before: 30+ seconds/query
- After: ~0.2 seconds/query
New config for subquery cache
phoenix.coprocessor.maxServerCachePersistenceTimeToLiveMs
- Sets TTL for items in persistent subquery cache
Future work for subquery cache
- Add cache key hint
- /*+ USE_PERSISTENT_CACHE(“2017-01-01”) */
- Treated as a suffix, so the cache key becomes:
Key = Hash(“...”) + “2017-01-01”
- Allows explicit cache eviction and cache warming
- Efficiently handle large caches (off-heap?)
- E.g. 1GB+
- Plug into caching systems like memcached or Redis?
Cluster Configuration
HDFS Properties
● dfs.datanode.max.transfer.threads = 4096 (formerly dfs.datanode.max.xcievers)
○ Default value of 256 is too low for production
○ Server-side threads used for data-transfer connections
○ Each thread uses about 1 MB, so tune it!
● dfs.datanode.handler.count = 36
● dfs.client.read.shortcircuit = true
○ Read directly from local disk
● dfs.namenode.avoid.read.stale.datanode = true
● dfs.namenode.avoid.write.stale.datanode = true
OS settings
● Disable transparent hugepages
● vm.swappiness = 0
Phoenix Properties
● phoenix.query.queueSize = 5000
● phoenix.coprocessor.maxServerCacheTimeToLiveMs = 90s
● phoenix.query.force.rowkeyorder = false
● phoenix.query.maxServerCacheBytes = 256M
HBase Properties
● Disable the default major compaction schedule (run major compactions on demand)
● zookeeper.session.timeout = 30s
● hbase.regionserver.handler.count = 48
○ Default value of 10 is too low for production loads
○ Size as a factor of the number of cores and disks
● hbase.hregion.memstore.flush.size
Phoenix Query Timeouts
● phoenix.query.timeoutMs
● hbase.client.scanner.caching
● hbase.client.scanner.timeout.period
● hbase.rpc.timeout
● hbase.regionserver.lease.period
Lessons learnt in production
● Enable enhanced networking in AWS
● Host all region servers and PQS hosts within the same AZ for consistent
performance
● Disable statistics collection
○ Duplicate rows
○ Impact on query performance
● Monitoring!
JIRA tickets
● PHOENIX-4666: Subquery caching for hash join
● PHOENIX-4751: Client side hash aggregation for
sort merge join
Q & A

PhoenixCon 2018 - Applications of HBase/Phoenix @ 23andMe

  • 1. Applications of HBase/Phoenix @ 23andMe Tulasi Paradarami, Engineering Tech Lead Marcell Ortutay, Sr. Software Engineer
  • 2. 2Copyright © 2017 23andMe, Inc. All rights reserved. Agenda ● Phoenix Use Cases ○ Identical By Descent (IBD) ○ Ancestry Composition ● Hash Joins ● Cluster Configuration ● Lessons Learnt ● Q & A
  • 3. 3Copyright © 2017 23andMe, Inc. All rights reserved. BigData @ 23andMe ● Over 5 million customers, 1 million genotyped and 10s of millions of imputed SNPs ● Close to trillion shared segments and relationships ● Billions of customer reported survey answers ● Support real-time access to data at scale ● Simple arbitrary joins and aggregations ● Enable internal research through fast random access to any genetic location across the database
  • 4. 4Copyright © 2017 23andMe, Inc. All rights reserved. HBase/Phoenix Cluster ● HBase 1.3, Phoenix 4.11 ● Mirror cluster for load balancing and hot failover ● Live and mirror clusters run in different AWS availability zones ● Data Pipeline 1. Spark based computation job stages data to HDFS 2. MR cluster for generating incremental HFiles from staged data 3. Transfer HFiles to live and mirror HBase clusters 4. Backup HFiles to S3 for disaster recovery
  • 5. 5Copyright © 2017 23andMe, Inc. All rights reserved. Phoenix HBase ● 60 region servers ● 8 vCPUs, 60G RAM ● 30G RS (d2.2xlarge) ● 6 * 2T HDD ● enhanced networking PQS cluster ● 10 c5.4xlarge ● 16 vCPUs, 32G RAM ● EBS only ● enhanced networking Import Cluster HFile transfer HFile transfer HFile Backup PQS cluster Phoenix HBase ... API hosts/ phoenixdb Architecture us-west-2a us-west-2b S3 bucket (99.999999999% durability) live mirror
  • 6. 6Copyright © 2017 23andMe, Inc. All rights reserved. Phoenix Optimizations ● FAST_DIFF block encoding ● GZ compression ● Enable ROW level bloomfilters ● Salted tables ● Tune query for RANGE SCAN or SKIP SCAN (HBase SEEK_NEXT_USING_HINT for intra region scan) over FULL SCAN ● Apply filters to leverage predicate push down
  • 7. 7Copyright © 2017 23andMe, Inc. All rights reserved. Query Performance ● RELATION ● SEGMENT
  • 8. 8Copyright © 2017 23andMe, Inc. All rights reserved. Close Relatives SELECT r1.person_id_2 FROM relation r1 WHERE r1.person_id_1 = ? AND r1.half > ?;
  • 9. 9Copyright © 2017 23andMe, Inc. All rights reserved. Shared Segments SELECT * FROM segment WHERE person_id_2 IN (?, ?) AND person_id_1 = ? AND cm > ? ORDER BY chromosome, start_
  • 10. 10Copyright © 2017 23andMe, Inc. All rights reserved. Relatives in Common SELECT r1.person_id_2 FROM relation r1 JOIN relation r2 ON r1.person_id_2 = r2.person_id_2 WHERE r1.person_id_1 = <your-id> AND r2.person_id_1 = <relative’s-id>
  • 11. 11Copyright © 2017 23andMe, Inc. All rights reserved. Ancestry Composition (use case II) ● 23andMe's Ancestry Composition report is a powerful and well-tested system for analyzing ancestry based on DNA ● Your Ancestry Composition report shows the percentage of your DNA that comes from 31 populations ● We use Phoenix to join data from segments, relations and customer-reported survey answers to build the report
  • 12. 12Copyright © 2017 23andMe, Inc. All rights reserved. Ancestry Composition (diagram annotations): common across queries; billions of rows per table
  • 13. 13Copyright © 2017 23andMe, Inc. All rights reserved. Joins — two input tables. Eye Color (ID, Color): (1, blue), (2, green), (3, brown). Person (ID, Eye Color ID): (1, 2), (2, 1), (2, 2), (4, 3)
  • 14. 14Copyright © 2017 23andMe, Inc. All rights reserved. Joins — joined result (Person ID, Eye Color ID, Color): (1, 2, green), (2, 1, blue), (2, 2, green), (4, 3, brown)
  • 15. 15Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins — Person (ID, Eye Color ID): (1, 2), (2, 1), (2, 2), (4, 3); hashmap: 1 => “blue”, 2 => “green”, 3 => “brown”. Algorithm: iterate over rows, use the hashmap to look up joined values; O(N) performance
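The hash join on this slide can be sketched in a few lines of Python. This is an illustrative toy, not Phoenix internals; the table and column names mirror the eye-color example from the preceding slides:

```python
# Toy hash join over the eye-color example: build a hashmap from the small
# table, then stream the large table and probe the map -- O(N) overall.
def hash_join(small, large, key_small, key_large):
    # Phase 1: build the hash table from the smaller relation
    lookup = {row[key_small]: row for row in small}
    # Phase 2: probe the hash table while scanning the larger relation
    joined = []
    for row in large:
        match = lookup.get(row[key_large])
        if match is not None:
            joined.append({**row, **match})
    return joined

eye_color = [{"id": 1, "color": "blue"},
             {"id": 2, "color": "green"},
             {"id": 3, "color": "brown"}]
person = [{"person_id": 1, "eye_color_id": 2},
          {"person_id": 2, "eye_color_id": 1},
          {"person_id": 2, "eye_color_id": 2},
          {"person_id": 4, "eye_color_id": 3}]

result = hash_join(eye_color, person, "id", "eye_color_id")
```

Running this yields the joined rows shown on slide 14, with each person row annotated with its color.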
  • 16. 16Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix Client Region servers Example query: SELECT * FROM person JOIN eye_color ON eye_color.id = person.eye_color_id
  • 17. 17Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix Client Region servers Step 1: Build the hash join table
  • 18. 18Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix Client Region servers Step 1: Build the hash join table
  • 19. 19Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix Client Region servers Step 1: Build the hash join table
  • 20. 20Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix Client Region servers Step 1: Build the hash join table Step 2: Distribute the hash join table
  • 21. 21Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix Client Region servers Step 1: Build the hash join table Step 2: Distribute the hash join table Step 3: Execute join algorithm on Region Servers
  • 22. 22Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix Client Region servers Step 1: Build the hash join table Step 2: Distribute the hash join table Step 3: Execute join algorithm on Region Servers Step 4: Release the hash join tables, return results
  • 23. 23Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix Step 1: Build the hash join table Step 2: Distribute the hash join table Step 3: Execute join algorithm on Region Servers Step 4: Release the hash join tables, return results Step 1: compute/IO Step 2: network Step 3: compute/IO Step 4 Phase 1: generate hash join table Phase 2: execute join Timing breakdown
  • 24. 24Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix - Pros/Cons Good: - Hash joins are fast, O(N) performance Bad: - Broadcasting can be slow for large hash tables - Needs to re-run every time SELECT * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234; - Only person_id changes
  • 25. 25Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix - Pros/Cons Performance breakdown for example query Region server 1 Region server 2 Region server 3 SELECT * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234;
  • 26. 26Copyright © 2017 23andMe, Inc. All rights reserved. Subquery caching to improve hash joins SELECT * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234; - Sometimes subquery to “large_expensive_table” repeats across queries - In the example, only person_id changes - Application level caching not possible if query is not identical What if we keep the subquery results around?
  • 27. 27Copyright © 2017 23andMe, Inc. All rights reserved. Subquery caching to improve hash joins New hint! /*+ USE_PERSISTENT_CACHE */ SELECT /*+ USE_PERSISTENT_CACHE */ * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234; - Each subquery gets assigned a cache key based on the hash of its query statement - Eg. Hash(“SELECT value FROM large_expensive_table”) = 0xa034bf...
  • 28. 28Copyright © 2017 23andMe, Inc. All rights reserved. Subquery caching to improve hash joins SELECT /*+ USE_PERSISTENT_CACHE */ * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234; First execution: 1. Generate hash join table (query on large_expensive_table) 2. Scan person table and execute join O(large_expensive_table) + O(person) Second execution: 1. Scan person table and execute join O(person)
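The caching behavior above can be modeled as follows. This is a toy sketch, not the Phoenix implementation: a plain dict stands in for the server-side persistent cache, and `expensive_execute` stands in for actually running the subquery:

```python
import hashlib

cache = {}  # stands in for the region-server-side persistent cache

def cache_key(subquery_sql):
    # Key is a hash of the subquery statement, e.g.
    # Hash("SELECT value FROM large_expensive_table") -> "a034bf..."
    return hashlib.sha256(subquery_sql.encode("utf-8")).hexdigest()

def run_subquery_cached(subquery_sql, execute):
    key = cache_key(subquery_sql)
    if key not in cache:            # first execution: pay the full cost
        cache[key] = execute(subquery_sql)
    return cache[key]               # later executions: served from cache

calls = []
def expensive_execute(sql):
    calls.append(sql)               # count how often the subquery really runs
    return {"v1", "v2"}

sql = "SELECT value FROM large_expensive_table"
first = run_subquery_cached(sql, expensive_execute)
second = run_subquery_cached(sql, expensive_execute)
```

The second call returns the same hash join table without touching `large_expensive_table`, which is exactly the O(large_expensive_table) term that drops out on repeat executions.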
  • 29. 29Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix - with subquery cache Performance breakdown for example query Region server 1 Region server 2 Region server 3 SELECT * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234;
  • 30. 30Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix - with subquery cache Performance breakdown for example query Region server 1 Region server 2 Region server 3 SELECT /*+ USE_PERSISTENT_CACHE */ * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234;
  • 31. 31Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix - with subquery cache Performance breakdown for example query Region server 1 Region server 2 Region server 3 SELECT /*+ USE_PERSISTENT_CACHE */ * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234;
  • 32. 32Copyright © 2017 23andMe, Inc. All rights reserved. Hash Joins in Phoenix - with subquery cache Performance breakdown for example query Region server 1 Region server 2 Region server 3 SELECT /*+ USE_PERSISTENT_CACHE */ * FROM person WHERE person.some_value IN (SELECT value FROM large_expensive_table) AND person_id = 1234;
  • 33. 33Copyright © 2017 23andMe, Inc. All rights reserved. Subquery Cache at 23andMe - Countries of Ancestry speedup: - Before: ~3 seconds/query - After: ~1 second/query - Your DNA Family speedup: - Before: 30+ seconds/query - After: ~0.2 seconds/query
  • 34. 34Copyright © 2017 23andMe, Inc. All rights reserved. New config for subquery cache phoenix.coprocessor.maxServerCachePersistenceTimeToLiveMs - Sets TTL for items in persistent subquery cache
  • 35. 35Copyright © 2017 23andMe, Inc. All rights reserved. Future work for subquery cache - Add cache key hint - /*+ USE_PERSISTENT_CACHE(“2017-01-01”) */ - Treated as a suffix, so cache key is now: Key = Hash(“...”) + “2017-01-01” - Allows explicit cache eviction, cache warming - Efficiently handle large caches (offheap?) - Eg. 1GB+ - Plug into caching systems like memcache, redis?
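The proposed cache-key hint could work along these lines (hypothetical, mirroring the proposal above — the suffix is appended to the statement hash, so rotating it evicts the old entry):

```python
import hashlib

def cache_key(subquery_sql, suffix=None):
    # Default key: hash of the statement text
    key = hashlib.sha256(subquery_sql.encode("utf-8")).hexdigest()
    # Proposed hint: user-supplied suffix appended to the key, enabling
    # explicit cache eviction/warming by rotating the suffix (e.g. a date)
    if suffix is not None:
        key = key + suffix
    return key

sql = "SELECT value FROM large_expensive_table"
k_default = cache_key(sql)
k_jan = cache_key(sql, "2017-01-01")
k_feb = cache_key(sql, "2017-02-01")
```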
  • 36. 36Copyright © 2017 23andMe, Inc. All rights reserved. Cluster Configuration. HDFS properties: ● dfs.datanode.max.xceivers = 4096 ○ Default value of 256 is too low for production ○ Server-side threads used for connections ○ 1 MB per thread, so tune it! ● dfs.datanode.handler.count = 36 ● dfs.client.read.shortcircuit = true ○ Read directly from local disk ● dfs.namenode.avoid.read.stale.datanode = true ● dfs.namenode.avoid.write.stale.datanode = true. OS settings: ● Disable transparent hugepages ● vm.swappiness = 0. Phoenix properties: ● phoenix.query.queueSize = 5000 ● phoenix.coprocessor.maxServerCacheTimeToLiveMs = 90s ● phoenix.query.force.rowkeyorder = false ● phoenix.query.maxServerCacheBytes = 256M. HBase properties: ● Disable the default major compaction schedule (run on demand) ● zookeeper.session.timeout = 30s ● hbase.regionserver.handler.count = 48 ○ Default value of 10 is too low for production loads ○ Factor of the number of cores and disks ● hbase.hregion.memstore.flush.size
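A few of the HBase/Phoenix settings above, expressed as an hbase-site.xml fragment (a sketch using the values from this slide; millisecond and byte conversions are ours — tune for your own cluster):

```xml
<!-- hbase-site.xml fragment: values from the slide above, converted to
     the units HBase expects (milliseconds, bytes) -->
<property>
  <name>hbase.regionserver.handler.count</name>
  <value>48</value>
</property>
<property>
  <name>zookeeper.session.timeout</name>
  <value>30000</value> <!-- 30s, in milliseconds -->
</property>
<property>
  <name>phoenix.query.queueSize</name>
  <value>5000</value>
</property>
<property>
  <name>phoenix.coprocessor.maxServerCacheTimeToLiveMs</name>
  <value>90000</value> <!-- 90s, in milliseconds -->
</property>
<property>
  <name>phoenix.query.maxServerCacheBytes</name>
  <value>268435456</value> <!-- 256M -->
</property>
```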
  • 37. 37Copyright © 2017 23andMe, Inc. All rights reserved. Phoenix Query Timeouts ● phoenix.query.timeoutMs ● hbase.client.scanner.caching ● hbase.client.scanner.timeout.period ● hbase.rpc.timeout ● hbase.regionserver.lease.period
  • 38. 38Copyright © 2017 23andMe, Inc. All rights reserved. Lessons learnt in production ● Enable enhanced networking in AWS ● Host all regionservers and PQS within the same AZ for consistent performance ● Disable statistics collection ○ Duplicate rows ○ Impact on query performance ● Monitoring!
  • 39. 39Copyright © 2017 23andMe, Inc. All rights reserved. JIRA tickets ● PHOENIX-4666: Subquery caching for hash join ● PHOENIX-4751: Client side hash aggregation for sort merge join
  • 40. 40Copyright © 2017 23andMe, Inc. All rights reserved. Q & A