Personal Information
Organization / Workplace
Within 23 wards, Tokyo, Japan Japan
Occupation
Solutions Engineering Lead, North APAC, Hortonworks
Industry
Technology / Software / Internet
About
I am a Solutions Engineering Lead at Hortonworks.
Before joining Hortonworks, I was a Solutions Architect Amazon Web Services.
I have been a Hadoop and HBase administrator and developer at Rakuten, inc., the largest e-commerce company in Japan.
I also developed and operated Rakuten's platform as a service (aka PaaS) using Cloud Foundry.
I am the author of HBase Administration Cookbook: http://amzn.to/17gXqd1
I am now living in Japan.
I like reading, learning and going outdoor with my lovely family.
Specialties: Hadoop, HBase, Cloud Foundry, Redis, AWS
Tags
hadoop
hdp
big data
hive
spark
sql on hadoop
iot
streaming process
hdf
hive llap
hortonworks
aws
hdc
sparksql
tez
kafka
hdfs
ambari
security
flash storage
hbase
nfs
s3
spark streaming
storm
real-time system
metron
druid
mpp
financial
nifi
data flow programming
phoenix
hop
kinesis
yarn
erasure code
ec2
data science
See more
- Presentations
- Documents
- Infographics
Deep Dive into the New Features of Apache Spark 3.0
Databricks
•
3 years ago
Trillion Dollar Coach Book (Bill Campbell)
Eric Schmidt
•
5 years ago
Apache Deep Learning 201
DataWorks Summit
•
5 years ago
Making Netflix Machine Learning Algorithms Reliable
Justin Basilico
•
6 years ago
Optimizing training on Apache MXNet
Amazon Web Services
•
6 years ago
HDFS tiered storage
DataWorks Summit
•
5 years ago
Transactional operations in Apache Hive: present and future
DataWorks Summit
•
5 years ago
What's new in apache hive
DataWorks Summit
•
5 years ago
Hive acid and_2.x new_features
Alberto Romero
•
7 years ago
From Mainframe to Microservice: An Introduction to Distributed Systems
Tyler Treat
•
9 years ago
A Non-Standard use Case of Hadoop: High Scale Image Processing and Analytics
DataWorks Summit
•
8 years ago
Show me the Money! Cost & Resource Tracking for Hadoop and Storm
DataWorks Summit/Hadoop Summit
•
7 years ago
Open Source Lambda Architecture with Hadoop, Kafka, Samza and Druid
DataWorks Summit
•
8 years ago
Hadoop summit-diverse-workload
Wangda Tan
•
8 years ago
Towards SLA-based Scheduling on YARN Clusters
DataWorks Summit
•
8 years ago
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - StampedeCon 2015
StampedeCon
•
8 years ago
Apache Hadoop YARN: best practices
DataWorks Summit
•
9 years ago
Node Labels in YARN
DataWorks Summit
•
8 years ago
Scaling LinkedIn - A Brief History
Josh Clemm
•
8 years ago
Cost-based query optimization in Apache Hive 0.14
Julian Hyde
•
9 years ago