Personal Information
Organization / Workplace
Singapore, Singapore
Occupation
Data Geek
Industry
Technology / Software / Internet
About
Over 5 years specializing in big data analytics, mainly in Data Acquisition, Marketing Intelligence, Web Analytics, Fraud Detection, Recommendation, etc.
Specialties:
Machine Learning Algorithms: SVM & Neural Network & PCA & Clustering & Regression & Decision Tree & Outlier Detection;
Web Analytics & Clickstream System & Graph Analysis & Data Warehousing;
Tools: Hadoop (MapR), Spark (Scala, PySpark, MLlib, Spark SQL, GraphX, Magellan for Geospatial Analytics), Presto, HBase, Hive, Drill, Sqoop, Kafka and Storm.
DB: Greenplum & Oracle (11g & 10g) & PostgreSQL & MySQL.
Also interested in operations research, convex optimization, and stochastic optimization.
Tags
dtcc
svm
strata singapore
spark
Presentations (10)
Likes (27)
Stateful, Stateless and Serverless - Running Apache Kafka® on Kubernetes
confluent
•
5 years ago
Part 1: Lambda Architectures: Simplified by Apache Kudu
Cloudera, Inc.
•
7 years ago
Improving PySpark Performance - Spark Beyond the JVM @ PyData DC 2016
Holden Karau
•
7 years ago
林佳賢 / 20 Small Tips for Data Visualization
台灣資料科學年會
•
7 years ago
Productionizing Spark and the REST Job Server- Evan Chan
Spark Summit
•
8 years ago
Dreaming Infrastructure
kyhpudding
•
14 years ago
Handling Data Skew Adaptively In Spark Using Dynamic Repartitioning
Spark Summit
•
7 years ago
Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata Das
Databricks
•
7 years ago
Magellen: Geospatial Analytics on Spark by Ram Sriharsha
Spark Summit
•
8 years ago
Sparkcamp stratasingapore
Cheng Feng
•
8 years ago
AWSome Day Singapore Keynote 2015
Hwee Bee Tan
•
8 years ago
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Hortonworks
•
10 years ago
Introduction to Machine Learning
Lior Rokach
•
11 years ago
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
Arnaud Bonzom
•
8 years ago
Using Apache Drill
Chicago Hadoop Users Group
•
9 years ago
Parquet Hadoop Summit 2013
Julien Le Dem
•
10 years ago
Titan: The Rise of Big Graph Data
Marko Rodriguez
•
11 years ago
Intro to Graph Databases Using Tinkerpop, TitanDB, and Gremlin
Caleb Jones
•
10 years ago
Real time Analytics with Apache Kafka and Apache Spark
Rahul Jain
•
9 years ago
Open Source Lambda Architecture with Hadoop, Kafka, Samza and Druid
DataWorks Summit
•
8 years ago
Sqoop on Spark for Data Ingestion
DataWorks Summit
•
8 years ago
Enterprise Kafka: Kafka as a Service
Todd Palino
•
10 years ago
Kdd 2014 Tutorial - the recommender problem revisited
Xavier Amatriain
•
9 years ago
鹰眼下的淘宝_EagleEye with Taobao
terryice
•
10 years ago
All you wanted to know about analytics in e commerce- amazon, ebay, flipkart
Anju Gothwal
•
9 years ago
Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learnt
Eugene Yan Ziyou
•
8 years ago