Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

HBaseConAsia2018 Track3-6: HBase at Meituan

68 views

Published on

Chen Xu of Meituan

Published in: Internet
  • Be the first to comment

  • Be the first to like this

HBaseConAsia2018 Track3-6: HBase at Meituan

  1. 1. hosted by HBase Practics at Meituan Gehua New Century Hotel Beijing,China August 17,2018
  2. 2. hosted by Content 01 02 03 Multi-Tenancy Object Storage Large Query Isolation
  3. 3. hosted by Multi-Tenancy RSGroup - Computing Resource Isolation
 
 DNGroup - Storage Isolation
 
 Replication Isolation 01
  4. 4. hosted by Multi-Tenancy RSGroup - Compute Resource Isolation RegionServer 64 GB RegionServer 64 GB Group_A Table_A Table_B DATA_BLOCK_ENCODING BLOOMFILTER COMPRESSION GROUP_NAME : Group_A Group_B RegionServer 128 GB RegionServer 128 GB
  5. 5. hosted by Multi-Tenancy DNGroup - Storage Isolation DataNode SSD机型 Group_A Group_B DataNode SATA机型 DataNode SSD机型 DataNode SATA机型 path create time owner … group_name : Group_A
  6. 6. hosted by Multi-Tenancy Replication Isolation Groups Need
 Replication Other Groups ReplicationSource RegionServer ReplicationSource RegionServer ReplicationSink RegionServer ReplicationSink RegionServer ReplicationSink RegionServer ReplicationSource RegionServer Source Cluster Target Cluster
  7. 7. hosted by Multi-Tenancy Constraint PeerId naming specification • [GROUP]SOURCE_TARGET_INDEX
 (1)  [GROUP]  is  a  keyword
 (2)  SOURCE  is  the  group  of  source  cluster
 (3)  TARGET  is  the  group  of  target  cluster
 (4)  INDEX  used  to  determine  the  unique  identification
  8. 8. hosted by Multi-Tenancy Support Heterogeneous Storage Other  Groups SSD SATA 敏感业务 SSD SSD SSD ⼀一般业务 SATA SATA SATA
  9. 9. hosted by Multi-Tenancy Comparison with multi cluster deployment 80% 20% Machine A 20% 20% 80% 20% Group A Group B Storage(DN) Compute(RS) Machine B Machine C Resource can be used flexibly between groups
  10. 10. hosted by Multi-Tenancy Some BUGFIX About RSGroup • HBASE-18272  Fix  issue  about  RSGroupBasedLoadBalancer   #roundRobinAssignment  where  BOGUS_SERVER_NAME  is  involved   in  two  groups   • HBASE-20791  RSGroupBasedLoadBalancer#setClusterMetrics  should   pass  ClusterMetrics  to  its  internalBalancer

  11. 11. hosted by Object Storage MOB Solution
 
 BUGFIX 02
  12. 12. hosted by Object Storage YARN LOG Storage • Background              400k  applications  per  day
 200  million  log  files  need  to  upload  to  HDFS  and  bring  great   pressure  to  NN              So  consider  transferring  these  log  files  to  HBase  storage
  13. 13. hosted by Object Storage MOB Solution Client memstore IndexFile flush IndexFile DataFileDataFile HDFS DIROnline Region 拆分、整理 等涉及⼤大量IO的操作 内存每次flush会⽣生成2个⽂文件 只有索引⽂文件通过线上来托管 数据块不参与缓存 有效避免内存碎⽚片问题
  14. 14. hosted by Object Storage Some Problems About MOB Client memstore IndexFile flush IndexFile DataFileDataFile HDFS DIROnline Region 拆分、整理 等涉及⼤大量IO的操作 启⽤用MOB特性以后,Region只 能采⽤用默认的整理策略 TTL清理存在不同步问题 数据⽂文件清理后索引⽂文件还在 线下数据采⽤用单⼀一路径进⾏行存储
  15. 15. hosted by Object Storage MOB - BUGFIX Client memstore IndexFile flush IndexFile DataFileDataFile HDFS DIROnline Region 拆分、整理 等涉及⼤大量IO的操作 将索引⽂文件的清理 放在数据⽂文件清理之前进⾏行 线下⽂文件采⽤用多路径进⾏行存储
  16. 16. hosted by Object Storage MOB - BUGFIX • HBASE-19650  ExpiredMobFileCleaner  has  wrong  logic  about  TTL  check   • HBASE-19664  MOB  should  compatible  with  other  types  of  Compactor   in  addition  to  DefaultCompactor
  17. 17. hosted by Large Query Isolation03
  18. 18. hosted by Large Query Isolation • Characteristic  of  Large  Query
 (1)  Time  delay  requirements  are  not  very  sensitive
 (2)  may  occupying  Handler  thread  for  a  long  time • Large  Query  Types
 (1)  Scan  has  no  startkey  or  endkey  (sucn  as  full  table  scan)
 (2)  Client  call  ResultScanner.next()  more  than  a  certain  threshold
 (3)  Client  call  custom  coprocessor  which  involves  large  queries Large Query Concepts
  19. 19. hosted by Large Query Isolation Large Query Problems - resource may be run up Call Call Call Client … Handler Handler Handler Large  Query Common  Query Large  Query Large  Query Producer Consumer
  20. 20. hosted by Large Query Isolation Implementation Details Call Handler … Call Call Call Call Call Call Client Handler … … generalPriorityLarge Query Call Call
  21. 21. hosted by Large Query Isolation Isolation Effect - Limit the situation of resources run up
  22. 22. hosted by Thanks

×