Personal Information
Organization / Workplace
Greater Los Angeles Area United States
Occupation
Big Data Evangelist, Enterprise Architect, Solution Engineer at Hortonworks
Industry
Technology / Software / Internet
About
I am a Linux geek; I am a self-taught engineer; I am a professional software developer. I believe in performance through scalability, security by up-front design, leveraging open source, using open standards, and believe there is always room for improvement.
Much of my experience has been in areas which require an emphasis on optimization, performance improvement or real-time computing. I am now focused on parallel and distributed computing for data-processing and I am currently building big data solutions that leverage Hadoop.
In addition to Hadoop and big data interests, I am also learning to develop CUDA-based software that exploits GPU hardware, and I am interested in visualization...
Tags
hdfs
yarn
mapreduce
hadoop
cluster
collaborative filtering
big data
supervised learning
data science
outliers
apache hadoop
predictive
interactive
hive
tez
sql
orc
open source
stinger
yahoo
linux
apache
hortonworks
container
See more
Presentations
(3)Likes
(3)Apache storm vs. Spark Streaming
P. Taylor Goetz
•
9 years ago
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
Geoffrey Fox
•
10 years ago
Hadoop @ eBay: Past, Present, and Future
Ryan Hennig
•
10 years ago
Personal Information
Organization / Workplace
Greater Los Angeles Area United States
Occupation
Big Data Evangelist, Enterprise Architect, Solution Engineer at Hortonworks
Industry
Technology / Software / Internet
About
I am a Linux geek; I am a self-taught engineer; I am a professional software developer. I believe in performance through scalability, security by up-front design, leveraging open source, using open standards, and believe there is always room for improvement.
Much of my experience has been in areas which require an emphasis on optimization, performance improvement or real-time computing. I am now focused on parallel and distributed computing for data-processing and I am currently building big data solutions that leverage Hadoop.
In addition to Hadoop and big data interests, I am also learning to develop CUDA-based software that exploits GPU hardware, and I am interested in visualization...
Tags
hdfs
yarn
mapreduce
hadoop
cluster
collaborative filtering
big data
supervised learning
data science
outliers
apache hadoop
predictive
interactive
hive
tez
sql
orc
open source
stinger
yahoo
linux
apache
hortonworks
container
See more