Cheng Feng

3 Followers

10 SlideShares 3 Followers 51 Followings

Over 5 years specialized in big data analytic, mainly in Data Acquisition, Marketing Intelligence, Web Analytics, Fraud Detection, Recommendation, etc. Specialties: Machine Learning Algorithms: SVM & Neural Network & PCA & Clustering & Regression & Decision Tree & Outliers Detection; Web Analytics & Clickstream System & Graph Analysis & Data Warehousing; Tools: Hadoop(MapR), Spark(Scala,pyspark,MLlib,SparkSQL,Graphx, Magellan for Geospatial Analytics), Presto, HBase, Hive, Drill, Sqoop, Kafka and Storm. DB: Greenplum & Oracle(11g&10g) & PostgreSQL & Mysql. Also Interested in operation research, convex optimization, stochastic optimization.

dtcc svm strata singapore spark

Activity
About

Cheng Feng

Presentations

Epsrcws08 campbell isvm_01

Epsrcws08 campbell kbm_01

Maria db新特性剖析京东张金鹏

Inception自动审核系统设计与实现王竹峰

Maria db新特性剖析京东张金鹏

Tdsql在微众银行核心交易系统中的实践雷海林

数据库架构师做什么 58同城数据库架构设计思路-沈剑

运营商去O浅析公开版-王晓征

Sparkcamp stratasingapore

Strata singapore survey

Likes

Stateful, Stateless and Serverless - Running Apache Kafka® on Kubernetes

Part 1: Lambda Architectures: Simplified by Apache Kudu

Improving PySpark Performance - Spark Beyond the JVM @ PyData DC 2016

林佳賢/資料視覺化的 20 個小訣竅

Productionizing Spark and the REST Job Server- Evan Chan

Dreaming Infrastructure

Handling Data Skew Adaptively In Spark Using Dynamic Repartitioning

Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata Das

1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin Meetup @Shanghai

Magellen: Geospatial Analytics on Spark by Ram Sriharsha

Sparkcamp stratasingapore

AWSome Day Singapore Keynote 2015

Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data

Introduction to Machine Learning

Singapore startup ecosystem and entrepreneur toolbox - Aug 2015

Using Apache Drill

Parquet Hadoop Summit 2013

Titan: The Rise of Big Graph Data

Intro to Graph Databases Using Tinkerpop, TitanDB, and Gremlin

Real time Analytics with Apache Kafka and Apache Spark

Open Source Lambda Architecture with Hadoop, Kafka, Samza and Druid

Sqoop on Spark for Data Ingestion

Enterprise Kafka: Kafka as a Service

Kdd 2014 Tutorial - the recommender problem revisited

鹰眼下的淘宝_EagleEye with Taobao

All you wanted to know about analytics in e commerce- amazon, ebay, flipkart

Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learnt