This document discusses how Bloomberg uses HBase to serve billions of queries with millisecond latency. It covers HBase principles like being an ordered key-value store and providing ACID transactions. It also discusses modeling data for HBase, including dealing with data and query skew. Implementation details covered include caching, block size tuning, column families, and compaction. The overall goal is to optimize HBase for Bloomberg's low-latency data storage and retrieval needs.
This document discusses the rise of non-relational databases and their advantages over traditional relational databases for modern cloud applications. It outlines how characteristics like scale, data volume, and developer access have changed. It promotes the idea of using different data store technologies based on data needs, rather than relying on a single database. Examples of Amazon's non-relational database services are provided, including DynamoDB, ElastiCache, and the new Neptune graph database.
This document discusses Amazon ElastiCache, a fully managed in-memory cache and database service. It provides Redis and Memcached compatible data stores that can be used for fast databases, caches, and other use cases. The document outlines key features of ElastiCache like security, high availability, scalability, and common usage patterns. It also provides an example of how GE uses ElastiCache Redis to power its Predix platform and make it easy for developers to create Redis clusters.
This document provides an overview of Amazon Neptune and graph databases. It discusses how graph databases are optimized for storing and querying highly connected data and provides examples of property graphs and RDF graphs. It also summarizes Amazon Neptune's key features like multi-AZ deployment, continuous backups, read replicas, and online restore capabilities.
Big Data Governance in a Post-GDPR World (GPSCT310) - AWS re:Invent 2018Amazon Web Services
Come to this session to discuss the recent release of the General Data Protection Regulation (GDPR) and the California Consumer Protection Act. We review why the AWS Big Data Competency now requests information on your strategy for supporting data governance within your software (ISV) or in your architectures (SI). We also review the ISVs in our new Data Governance category and discuss how you might want to partner with them for success.
[NEW LAUNCH!] How to build and deploy Windows file system in AWS using Amazon...Amazon Web Services
If you have compute-intensive workloads like high performance computing, machine learning, and media processing then this is the workshop for you! Our new file storage service, Amazon FSx for Lustre, provides compute-optimized storage with fully managed Lustre file systems that can deliver hundreds of gigabytes of throughput and sub-millisecond latencies. You will learn how to spin up an FSx for Lustre file system in minutes, feed data to it from an S3 data lake, run analyses while writing results back to S3, and then spin down the file system once the workload is finished.
[NEW LAUNCH!] How do I know I need a ledger database? An Introduction to Amaz...Amazon Web Services
Do you need a ledger database? Let's talk about the kinds of problems that Amazon Quantum Ledger Database (QLDB) can solve, and answer your questions about when and why you would use a ledger database. We'll showcase a presentation with benefits and use cases for Amazon QLDB.
Real-Time Market Data Analytics Using Kafka Streamsconfluent
(Lei Chen, Bloomberg, L.P.) Kafka Summit SF 2018
At Bloomberg, we are building a streaming platform with Apache Kafka, Kafka Streams and Spark Streaming to handle high volume, real-time processing with rapid derivative market data. In this talk, we’ll share the experience of how we utilize Kafka Streams Processor API to build pipelines that are capable of handling millions of market movements per second with ultra-low latency, as well as performing complex analytics like outlier detection, source confidence evaluation (scoring), arbitrage detection and other financial-related processing.
We’ll cover:
-Our system architecture
-Best practices of using the Processor API and State Store API
-Dynamic gap session implementation
-Historical data re-processing practice in KStreams app
-Chaining multiple KStreams apps with Spark Streaming job
This document discusses Bloomberg's experience moving to a multi-tenant HBase cluster. It provides an overview of HBase features that support multi-tenancy like namespaces, region server groups, storage quotas, and request throttling. It also summarizes Bloomberg's implementation including creation of namespaces, region server groups, and quotas. Performance results showed region server groups improved data locality and throughput. Overall, the speaker concluded HBase's multi-tenancy story is good but could be improved further with enhancements to features like system table availability and memory quotas.
This document discusses the rise of non-relational databases and their advantages over traditional relational databases for modern cloud applications. It outlines how characteristics like scale, data volume, and developer access have changed. It promotes the idea of using different data store technologies based on data needs, rather than relying on a single database. Examples of Amazon's non-relational database services are provided, including DynamoDB, ElastiCache, and the new Neptune graph database.
This document discusses Amazon ElastiCache, a fully managed in-memory cache and database service. It provides Redis and Memcached compatible data stores that can be used for fast databases, caches, and other use cases. The document outlines key features of ElastiCache like security, high availability, scalability, and common usage patterns. It also provides an example of how GE uses ElastiCache Redis to power its Predix platform and make it easy for developers to create Redis clusters.
This document provides an overview of Amazon Neptune and graph databases. It discusses how graph databases are optimized for storing and querying highly connected data and provides examples of property graphs and RDF graphs. It also summarizes Amazon Neptune's key features like multi-AZ deployment, continuous backups, read replicas, and online restore capabilities.
Big Data Governance in a Post-GDPR World (GPSCT310) - AWS re:Invent 2018Amazon Web Services
Come to this session to discuss the recent release of the General Data Protection Regulation (GDPR) and the California Consumer Protection Act. We review why the AWS Big Data Competency now requests information on your strategy for supporting data governance within your software (ISV) or in your architectures (SI). We also review the ISVs in our new Data Governance category and discuss how you might want to partner with them for success.
[NEW LAUNCH!] How to build and deploy Windows file system in AWS using Amazon...Amazon Web Services
If you have compute-intensive workloads like high performance computing, machine learning, and media processing then this is the workshop for you! Our new file storage service, Amazon FSx for Lustre, provides compute-optimized storage with fully managed Lustre file systems that can deliver hundreds of gigabytes of throughput and sub-millisecond latencies. You will learn how to spin up an FSx for Lustre file system in minutes, feed data to it from an S3 data lake, run analyses while writing results back to S3, and then spin down the file system once the workload is finished.
[NEW LAUNCH!] How do I know I need a ledger database? An Introduction to Amaz...Amazon Web Services
Do you need a ledger database? Let's talk about the kinds of problems that Amazon Quantum Ledger Database (QLDB) can solve, and answer your questions about when and why you would use a ledger database. We'll showcase a presentation with benefits and use cases for Amazon QLDB.
Real-Time Market Data Analytics Using Kafka Streamsconfluent
(Lei Chen, Bloomberg, L.P.) Kafka Summit SF 2018
At Bloomberg, we are building a streaming platform with Apache Kafka, Kafka Streams and Spark Streaming to handle high volume, real-time processing with rapid derivative market data. In this talk, we’ll share the experience of how we utilize Kafka Streams Processor API to build pipelines that are capable of handling millions of market movements per second with ultra-low latency, as well as performing complex analytics like outlier detection, source confidence evaluation (scoring), arbitrage detection and other financial-related processing.
We’ll cover:
-Our system architecture
-Best practices of using the Processor API and State Store API
-Dynamic gap session implementation
-Historical data re-processing practice in KStreams app
-Chaining multiple KStreams apps with Spark Streaming job
This document discusses Bloomberg's experience moving to a multi-tenant HBase cluster. It provides an overview of HBase features that support multi-tenancy like namespaces, region server groups, storage quotas, and request throttling. It also summarizes Bloomberg's implementation including creation of namespaces, region server groups, and quotas. Performance results showed region server groups improved data locality and throughput. Overall, the speaker concluded HBase's multi-tenancy story is good but could be improved further with enhancements to features like system table availability and memory quotas.
Craig will discuss architecture patterns with InfluxDB Enterprise, covering an overview of InfluxDB Enterprise, features, ingestion and query rates, deployment examples, replication patterns, and general advice.
Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...confluent
The document discusses building a data subscription service using Apache Kafka Connect. It describes motivating factors like easy subscription to multiple data sources with consistent interfaces. An ideal system architecture is presented using a subscription manager and custom source connectors. Details are provided on building a source connector in Kafka Connect, including components, source connector tasks, and monitoring threads. Challenges around reconfiguration and testing connectors are also noted.
Moneyball: Using Advanced Account Insights for Effective ABM ActivationEngagio
A winning marketing team does its research and crafts a compelling gameplan to maximize at bats with their most important accounts. Figure out how your team can get the inside scoop within accounts so you can hit it out of the park with your ABM efforts.
Join TechTarget, Engagio and special guest Beth McCullough from WhiteHat Security on 8/10 as we share how to develop & activate account insights to drive deep digital consideration and sales enablement.
You’ll discover:
How to drive net new revenue streams with effective insights to target the right accounts and prospects
How to design your batting order: The most valuable insights for prioritizing account engagement strategy
How to read the right signals: Optimize efforts by monitoring and leveraging behavioral & intent insight
Examples of how to execute full-funnel account-based marketing from data to activation to account engagement
And more
Replaying KStreams Apps Using State Snapshots (Nishchay Sinha & Yan Wang, Blo...confluent
Are you a Kafka Streams application developer who needs a faster, more efficient way to reproduce a bug or issue from past events? Do you need to test a new algorithm patch near a discrete point-in-time? Do you have a standard methodology for investigating these issues, or does each team member devise an ad hoc solution? How much time does your simulation take?
It is technically impossible to solve this challenge using only changelog topics, since a compact, topic-based changelog doesn't capture the full change history. The Derivatives Data team at Bloomberg augments this through periodic snapshots. To make the snapshotted state accessible to our replay system, we built a query service that leverages Interactive Queries using the Kafka Streams API, along with gRPC-based coordination, to fetch the distributed snapshot states from different snapshotting instances.
In this talk, we will cover:
● Overview of this system architecture
● Deep dive into the mechanics of snapshots with Kafka Streams, state-store changelogs, and a query service to serve replay requests
● How two modes, normal and replay, are used on the Kafka Streams application runtime
● Some use cases that benefit from this replay system
Learning to Rank: From Theory to Production - Malvina Josephidou & Diego Cecc...Lucidworks
This document summarizes a presentation about implementing learning-to-rank (LTR) models for search relevance at Bloomberg. It discusses:
1) Collecting query-document judgments and extracting features to train LTR models;
2) Deploying trained models in Solr and applying them to re-rank search results; and
3) Optimizing the system for production use by precomputing static features, using docvalues to retrieve features faster, and evaluating performance.
This document provides an overview of HBase internals and operations. It discusses how HBase is used at Bloomberg to store over 51 TB of compressed data across billions of reads and writes per day. The document then covers key aspects of HBase including its ordered key-value store architecture, write process, read process, versioning, and ACID compliance. It also discusses HBase deployment configurations including masters, region servers, and Zookeeper coordination.
With an explosion of data, today’s emerging needs are not being met by existing technologies, which require rich skill sets and expertise. Companies that want to lead changes in highly competitive markets must optimize their storage, speed, and spending. The key is for them to augment their data management and analytics platforms with artificial intelligence and machine learning for analysts, engineers, and other users.
Big Data, Machine Learning, and AI have created new opportunities for organizations worldwide, but this has also put tremendous pressure on IT and data engineering teams to scale and maintain analytics performance on massive datasets. This presentation at Strata Data Conference London 2019 by Luke Han explains how Augmented OLAP technology solves the challenges of analytics on massive datasets while reducing IT costs. Learn more about this powerful approach to Big Data here: https://kyligence.io/
M&E Leadership Session: The State of the Industry, What's New from AWS for M&...Amazon Web Services
In this wide-ranging keynote session, first hear from AWS VP Carla Stratfold on the major forces affecting the industry, then learn from AWS Global M&E Tech Lead Usman Shakeel about the latest and most exciting releases coming out of re:Invent relevant to the M&E industry. And finally, hear how technical leaders at the forefront of the industry are responding to accelerating changes in the media landscape.
Building csm while going from on premise to saa sJessica Osborn
The document discusses a panel discussion between representatives from Flexera, Blackbaud, and N3 about their experiences transitioning from an on-premise software model to a Software as a Service (SaaS) model and how it impacted their customer success strategies. The panelists described how they evolved their customer success programs from technical support to more strategic customer success management. They also discussed how the transition to SaaS required new skills and processes that are more proactive, data-driven and focused on driving organic growth through expansion and upsells.
The document discusses using AWS services for blockchain and financial use cases. It provides examples of how companies can use AWS for risk modeling, core banking systems, data analytics, AI/ML, digital channels, and fintech strategies. Specific use cases discussed include AI voice interaction and compliance as code. The benefits of AWS for financial institutions are around scaling consistently, focusing on value, and making compliance part of day-to-day operations. The document also discusses blockchain concepts, costs and benefits, and how AWS collaborates with blockchain protocols and ecosystems.
Building SRE from Scratch at Coinbase during Hypergrowth (DEV315-S) - AWS re:...Amazon Web Services
Coinbase is a secure online platform for buying, selling, transferring, and storing digital currency. This talk covers its journey from a small band of engineers working on reliability to a centralized SRE organization, and the lessons learned along the way. We dive into the processes that we created—both technical and organizational—that enabled us to quickly build a world-class reliability engineering group. We also cover what reliability really means, and more importantly, how we measure it. This session is brought to you by AWS partner, Datadog.
Gimel at Dataworks Summit San Jose 2018Romit Mehta
Gimel is PayPal's data platform that provides a unified interface for accessing and analyzing data across different data stores and processing engines. The presentation provides an overview of Gimel, including PayPal's analytics ecosystem, the challenges Gimel addresses around data access and application lifecycle, and a demo of how Gimel simplifies a flights cancelled use case. It also discusses Gimel's open source journey and integration with ecosystems like Spark and Jupyter notebooks.
Gimel Data Platform is an analytics platform developed by PayPal that aims to simplify data access and analysis. The presentation provides an overview of Gimel, including PayPal's analytics ecosystem, the challenges Gimel addresses in data access and application lifecycle management, a demo of a sample flights cancelled use case using Gimel, and PayPal's plans to open source Gimel.
Modern Product Data Workflows: How King Crushes New Product Development using...Aggregage
Product Managers looking to leverage data to make informed product design decisions can learn a lot from renowned gaming company King, maker of Candy Crush and many other games - even if their product has seemingly no overlap with games. King uses almost a competitive launch strategy for new games, as each game has a series of KPIs that it needs to meet. King’s product managers rely heavily on analyzing product features using analytics data and visualization to improve outcomes. This session will examine how this works for King, but more importantly how this can be applied to other types of products. The key is the strategy and tools for accessing product data at the level that you'd like.
Modern Product Data Workflows: How King Crushes New Product Development using...Hannah Flynn
Product Managers looking to leverage data to make informed product design decisions can learn a lot from renowned gaming company King, maker of Candy Crush and many other games - even if their product has seemingly no overlap with games. King uses almost a competitive launch strategy for new games, as each game has a series of KPIs that it needs to meet. King’s product managers rely heavily on analyzing product features using analytics data and visualization to improve outcomes.
This session will examine how this works for King, but more importantly how this can be applied to other types of products. The key is the strategy and tools for accessing product data at the level that you'd like.
Hollywood's Cloud-Based Content Lakes: Modernized Media Archives (MAE203) - A...Amazon Web Services
Content lake architecture can evolve the media workflow by providing efficiency from content security all the way to value-added services, such as machine learning and content monetization. In this session, technical leaders from 21st Century Fox, Warner Bros., and Astro Malaysia discuss the migration of their petabyte-scale video libraries (production and distribution archives) to the cloud in order to increase the customer reach and value of their media archives. Discover some of the lessons learned, the TCO analysis around various different storage tiers, the challenges and best practices from 10s of petabytes ingest, storage, and value-added compute at scale.
hbaseconasia2019 HBase Table Monitoring and Troubleshooting System on CloudMichael Stack
Long Chen
Track 3: Applications
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Recent work on HBase at PinterestMichael Stack
Lianghong Xu
Track 3: Applications
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
More Related Content
Similar to HBaseConAsia2018 Track3-1: Serving billions of queries in millisecond latencies
Craig will discuss architecture patterns with InfluxDB Enterprise, covering an overview of InfluxDB Enterprise, features, ingestion and query rates, deployment examples, replication patterns, and general advice.
Building a Data Subscription Service with Kafka Connect (Danica Fine & Ajay V...confluent
The document discusses building a data subscription service using Apache Kafka Connect. It describes motivating factors like easy subscription to multiple data sources with consistent interfaces. An ideal system architecture is presented using a subscription manager and custom source connectors. Details are provided on building a source connector in Kafka Connect, including components, source connector tasks, and monitoring threads. Challenges around reconfiguration and testing connectors are also noted.
Moneyball: Using Advanced Account Insights for Effective ABM ActivationEngagio
A winning marketing team does its research and crafts a compelling gameplan to maximize at bats with their most important accounts. Figure out how your team can get the inside scoop within accounts so you can hit it out of the park with your ABM efforts.
Join TechTarget, Engagio and special guest Beth McCullough from WhiteHat Security on 8/10 as we share how to develop & activate account insights to drive deep digital consideration and sales enablement.
You’ll discover:
How to drive net new revenue streams with effective insights to target the right accounts and prospects
How to design your batting order: The most valuable insights for prioritizing account engagement strategy
How to read the right signals: Optimize efforts by monitoring and leveraging behavioral & intent insight
Examples of how to execute full-funnel account-based marketing from data to activation to account engagement
And more
Replaying KStreams Apps Using State Snapshots (Nishchay Sinha & Yan Wang, Blo...confluent
Are you a Kafka Streams application developer who needs a faster, more efficient way to reproduce a bug or issue from past events? Do you need to test a new algorithm patch near a discrete point-in-time? Do you have a standard methodology for investigating these issues, or does each team member devise an ad hoc solution? How much time does your simulation take?
It is technically impossible to solve this challenge using only changelog topics, since a compact, topic-based changelog doesn't capture the full change history. The Derivatives Data team at Bloomberg augments this through periodic snapshots. To make the snapshotted state accessible to our replay system, we built a query service that leverages Interactive Queries using the Kafka Streams API, along with gRPC-based coordination, to fetch the distributed snapshot states from different snapshotting instances.
In this talk, we will cover:
● Overview of this system architecture
● Deep dive into the mechanics of snapshots with Kafka Streams, state-store changelogs, and a query service to serve replay requests
● How two modes, normal and replay, are used on the Kafka Streams application runtime
● Some use cases that benefit from this replay system
Learning to Rank: From Theory to Production - Malvina Josephidou & Diego Cecc...Lucidworks
This document summarizes a presentation about implementing learning-to-rank (LTR) models for search relevance at Bloomberg. It discusses:
1) Collecting query-document judgments and extracting features to train LTR models;
2) Deploying trained models in Solr and applying them to re-rank search results; and
3) Optimizing the system for production use by precomputing static features, using docvalues to retrieve features faster, and evaluating performance.
This document provides an overview of HBase internals and operations. It discusses how HBase is used at Bloomberg to store over 51 TB of compressed data across billions of reads and writes per day. The document then covers key aspects of HBase including its ordered key-value store architecture, write process, read process, versioning, and ACID compliance. It also discusses HBase deployment configurations including masters, region servers, and Zookeeper coordination.
With an explosion of data, today’s emerging needs are not being met by existing technologies, which require rich skill sets and expertise. Companies that want to lead changes in highly competitive markets must optimize their storage, speed, and spending. The key is for them to augment their data management and analytics platforms with artificial intelligence and machine learning for analysts, engineers, and other users.
Big Data, Machine Learning, and AI have created new opportunities for organizations worldwide, but this has also put tremendous pressure on IT and data engineering teams to scale and maintain analytics performance on massive datasets. This presentation at Strata Data Conference London 2019 by Luke Han explains how Augmented OLAP technology solves the challenges of analytics on massive datasets while reducing IT costs. Learn more about this powerful approach to Big Data here: https://kyligence.io/
M&E Leadership Session: The State of the Industry, What's New from AWS for M&...Amazon Web Services
In this wide-ranging keynote session, first hear from AWS VP Carla Stratfold on the major forces affecting the industry, then learn from AWS Global M&E Tech Lead Usman Shakeel about the latest and most exciting releases coming out of re:Invent relevant to the M&E industry. And finally, hear how technical leaders at the forefront of the industry are responding to accelerating changes in the media landscape.
Building csm while going from on premise to saa sJessica Osborn
The document discusses a panel discussion between representatives from Flexera, Blackbaud, and N3 about their experiences transitioning from an on-premise software model to a Software as a Service (SaaS) model and how it impacted their customer success strategies. The panelists described how they evolved their customer success programs from technical support to more strategic customer success management. They also discussed how the transition to SaaS required new skills and processes that are more proactive, data-driven and focused on driving organic growth through expansion and upsells.
The document discusses using AWS services for blockchain and financial use cases. It provides examples of how companies can use AWS for risk modeling, core banking systems, data analytics, AI/ML, digital channels, and fintech strategies. Specific use cases discussed include AI voice interaction and compliance as code. The benefits of AWS for financial institutions are around scaling consistently, focusing on value, and making compliance part of day-to-day operations. The document also discusses blockchain concepts, costs and benefits, and how AWS collaborates with blockchain protocols and ecosystems.
Building SRE from Scratch at Coinbase during Hypergrowth (DEV315-S) - AWS re:...Amazon Web Services
Coinbase is a secure online platform for buying, selling, transferring, and storing digital currency. This talk covers its journey from a small band of engineers working on reliability to a centralized SRE organization, and the lessons learned along the way. We dive into the processes that we created—both technical and organizational—that enabled us to quickly build a world-class reliability engineering group. We also cover what reliability really means, and more importantly, how we measure it. This session is brought to you by AWS partner, Datadog.
Gimel at Dataworks Summit San Jose 2018Romit Mehta
Gimel is PayPal's data platform that provides a unified interface for accessing and analyzing data across different data stores and processing engines. The presentation provides an overview of Gimel, including PayPal's analytics ecosystem, the challenges Gimel addresses around data access and application lifecycle, and a demo of how Gimel simplifies a flights cancelled use case. It also discusses Gimel's open source journey and integration with ecosystems like Spark and Jupyter notebooks.
Gimel Data Platform is an analytics platform developed by PayPal that aims to simplify data access and analysis. The presentation provides an overview of Gimel, including PayPal's analytics ecosystem, the challenges Gimel addresses in data access and application lifecycle management, a demo of a sample flights cancelled use case using Gimel, and PayPal's plans to open source Gimel.
Modern Product Data Workflows: How King Crushes New Product Development using...Aggregage
Product Managers looking to leverage data to make informed product design decisions can learn a lot from renowned gaming company King, maker of Candy Crush and many other games - even if their product has seemingly no overlap with games. King uses almost a competitive launch strategy for new games, as each game has a series of KPIs that it needs to meet. King’s product managers rely heavily on analyzing product features using analytics data and visualization to improve outcomes. This session will examine how this works for King, but more importantly how this can be applied to other types of products. The key is the strategy and tools for accessing product data at the level that you'd like.
Modern Product Data Workflows: How King Crushes New Product Development using...Hannah Flynn
Product Managers looking to leverage data to make informed product design decisions can learn a lot from renowned gaming company King, maker of Candy Crush and many other games - even if their product has seemingly no overlap with games. King uses almost a competitive launch strategy for new games, as each game has a series of KPIs that it needs to meet. King’s product managers rely heavily on analyzing product features using analytics data and visualization to improve outcomes.
This session will examine how this works for King, but more importantly how this can be applied to other types of products. The key is the strategy and tools for accessing product data at the level that you'd like.
Hollywood's Cloud-Based Content Lakes: Modernized Media Archives (MAE203) - A...Amazon Web Services
Content lake architecture can evolve the media workflow by providing efficiency from content security all the way to value-added services, such as machine learning and content monetization. In this session, technical leaders from 21st Century Fox, Warner Bros., and Astro Malaysia discuss the migration of their petabyte-scale video libraries (production and distribution archives) to the cloud in order to increase the customer reach and value of their media archives. Discover some of the lessons learned, the TCO analysis around various different storage tiers, the challenges and best practices from 10s of petabytes ingest, storage, and value-added compute at scale.
Similar to HBaseConAsia2018 Track3-1: Serving billions of queries in millisecond latencies (20)
hbaseconasia2019 HBase Table Monitoring and Troubleshooting System on CloudMichael Stack
Long Chen
Track 3: Applications
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Recent work on HBase at PinterestMichael Stack
Lianghong Xu
Track 3: Applications
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Phoenix Practice in China Life Insurance Co., LtdMichael Stack
Yechao Chen
Track 3: Applications
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
TianHang Tang
Track 3: Applications
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 The Practice in trillion-level Video Storage and billion-lev...Michael Stack
Xu Ming
Track 3: Applications
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
Andrew Cheng
Track 3: Applications
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Spatio temporal Data Management based on Ali-HBase Ganos and...Michael Stack
Fei Xiao of Alibaba
Track 2: Ecology and Solutions
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Bridging the Gap between Big Data System Software Stack and ...Michael Stack
Huan-Ping Su (蘇桓平), Yi-Sheng Lien (連奕盛) National Cheng Kung University
Track 2: Ecology and Solutions
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Pharos as a Pluggable Secondary Index ComponentMichael Stack
Lei Wang China Everbright Bank
Track 2: Ecology and Solutions
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Phoenix Improvements and Practices on Cloud HBase at AlibabaMichael Stack
Yun Zhang
Track 2: Ecology and Solutions
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
Junhong Xu of Xiaomi
Track 2: Ecology and Solutions
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 BigData NoSQL System: ApsaraDB, HBase and SparkMichael Stack
Wei Li of Alibaba
Track 2: Ecology and Solutions
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Test-suite for Automating Data-consistency checks on HBaseMichael Stack
Pradeep S, Mallikarjun V of Flipkart
Track 1: Internals
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Distributed Bitmap Index SolutionMichael Stack
Xingjun Hao of Huawei
Track 1: Internals
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 HBase Bucket Cache on Persistent MemoryMichael Stack
Anoop Sam John, Ramkrishna S Vasudevan, and Xu Kai of Intel
Track 1: Internals
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 The Procedure v2 Implementation of WAL Splitting and ACLMichael Stack
Mei Yi of Xiaomi
Track 1: Internals
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 BDS: A data synchronization platform for HBaseMichael Stack
熊嘉男
Track 1: Internals
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 Further GC optimization for HBase 2.x: Reading HFileBlock in...Michael Stack
Anoop Sam John of Intel and Zheng Hu of Alibaba
Track 1: Internals
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
hbaseconasia2019 HBCK2: Concepts, trends, and recipes for fixing issues in HB...Michael Stack
The document discusses HBCK2, a tool for fixing issues in HBase 2. Some key points:
1. HBCK2 is simpler than HBCK1, with fewer fix commands and no diagnosis commands. It requires a deeper understanding of HBase internals.
2. HBCK2 commands are master-oriented and fix issues one at a time. Common issues include regions not online, stuck procedures, and tables in the wrong state.
3. Recipes are provided to fix specific issues like missing meta regions or regions in transition using HBCK2 commands like assigns and bypass.
4. HBCK2 is still a work in progress but contributions are welcome
Keynote given by Duo Zhang of Xiaomi and Chunhui Shen of Alibab
Track 1: Internals
https://open.mi.com/conference/hbasecon-asia-2019
THE COMMUNITY EVENT FOR APACHE HBASE™
July 20th, 2019 - Sheraton Hotel, Beijing, China
https://hbase.apache.org/hbaseconasia-2019/
Discover the benefits of outsourcing SEO to Indiadavidjhones387
"Discover the benefits of outsourcing SEO to India! From cost-effective services and expert professionals to round-the-clock work advantages, learn how your business can achieve digital success with Indian SEO solutions.
Ready to Unlock the Power of Blockchain!Toptal Tech
Imagine a world where data flows freely, yet remains secure. A world where trust is built into the fabric of every transaction. This is the promise of blockchain, a revolutionary technology poised to reshape our digital landscape.
Toptal Tech is at the forefront of this innovation, connecting you with the brightest minds in blockchain development. Together, we can unlock the potential of this transformative technology, building a future of transparency, security, and endless possibilities.
Gen Z and the marketplaces - let's translate their needsLaura Szabó
The product workshop focused on exploring the requirements of Generation Z in relation to marketplace dynamics. We delved into their specific needs, examined the specifics in their shopping preferences, and analyzed their preferred methods for accessing information and making purchases within a marketplace. Through the study of real-life cases , we tried to gain valuable insights into enhancing the marketplace experience for Generation Z.
The workshop was held on the DMA Conference in Vienna June 2024.
HijackLoader Evolution: Interactive Process HollowingDonato Onofri
CrowdStrike researchers have identified a HijackLoader (aka IDAT Loader) sample that employs sophisticated evasion techniques to enhance the complexity of the threat. HijackLoader, an increasingly popular tool among adversaries for deploying additional payloads and tooling, continues to evolve as its developers experiment and enhance its capabilities.
In their analysis of a recent HijackLoader sample, CrowdStrike researchers discovered new techniques designed to increase the defense evasion capabilities of the loader. The malware developer used a standard process hollowing technique coupled with an additional trigger that was activated by the parent process writing to a pipe. This new approach, called "Interactive Process Hollowing", has the potential to make defense evasion stealthier.