Personal Information
Organization / Workplace
Beijing City, China
Occupation
Senior Software Development Lead — Big Data Infrastructure at Hulu
Industry
Technology / Software / Internet
Website
dongxicheng.org/
About
1. 7 years of experience in the Big Data area, focusing on the Hadoop/Spark technology stack, especially distributed computing frameworks and resource management systems such as YARN, MapReduce, Spark, and Spark Streaming.
2. Technical blog at http://dongxicheng.org/, covering Hadoop-related technologies, including Flume/Kafka, HDFS/HBase, YARN, MapReduce, Spark, and Storm.
3. Author of two Hadoop books (bestsellers in the Hadoop category on jd.com, amazon.cn, dangdang.com, etc.):
(1) [MapReduce Book] Hadoop Internals: in-depth Study of MapReduce (in Chinese, Hadoop技术内幕:深入解析MapReduce架构设计与实现原理), China Machine Press, ISBN:9787111422266
(2) [YARN Book] Hadoop Internals: in-depth Study of YARN...
Tags
hadoop 2.0
yarn,spark,storm,tez,mapreduce
- Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017 (Caserta, 7 years ago)
- A Deep Dive into Stateful Stream Processing in Structured Streaming with Tathagata Das (Databricks, 5 years ago)
- Building a Recommendation Engine - An example of a product recommendation engine (NYC Predictive Analytics, 13 years ago)
- Recommender system algorithm and architecture (Liang Xiang, 11 years ago)
- How to Build a Recommendation Engine on Spark (Caserta, 9 years ago)
- Collaborative Filtering with Spark (Chris Johnson, 10 years ago)
- Hortonworks Technical Workshop: HBase and Apache Phoenix (Hortonworks, 9 years ago)
- Feb 2013 HUG: Large Scale Data Ingest Using Apache Flume (Yahoo Developer Network, 11 years ago)
- Recipes for Running Spark Streaming Applications in Production (Tathagata Das, Databricks) (Spark Summit, 8 years ago)
- Powering Interactive Data Analysis at Pinterest by Amazon Redshift (Jie Li, 10 years ago)
- Intro to Spark and Spark SQL (jeykottalam, 9 years ago)
- Cloud-based Data Stream Processing (Zbigniew Jerzak, 9 years ago)
- Advanced Introduction to Java Multi-Threading - Full (chok) (choksheak, 11 years ago)
- Kafka and Storm - event processing in realtime (Guido Schmutz, 10 years ago)