Personal Information
Organization / Workplace
Vietnam
Occupation
Big data guy - Computer science addict - Machine learning lover.
Industry
Technology / Software / Internet
About
More than three years working with technologies related to big data, cloud computing, and distributed systems. Passionate about Apache Spark, Apache Kafka, and Scala (with some flavour of Apache Mesos, Hadoop, YARN, and DC/OS). Machine learning lover.
Solid mathematical background in statistics, modeling, and machine learning.
Experience with machine learning techniques and algorithm implementation; proficient in data manipulation and preparation.
Experienced in architecting and designing large-scale data systems and data pipelines.
Good understanding of distributed systems.
Proficient in Java and Scala; familiar with Python.
How Netflix Tunes EC2 Instances for Performance
Brendan Gregg
•
6 years ago
On the Attention Technique in Sequence-to-Sequence Models at the ACL 2017 Conference
Minh Pham
•
6 years ago
Bigdata based fraud detection
Mk Kim
•
9 years ago
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
Helena Edelson
•
8 years ago
How we built an event-time merge of two kafka-streams with spark-streaming
Ralf Sigmund
•
7 years ago
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
•
9 years ago
Strata NYC 2015: Sketching Big Data with Spark: randomized algorithms for large-scale data analytics
Databricks
•
8 years ago
Detecting Hacks: Anomaly Detection on Networking Data
DataWorks Summit
•
8 years ago
Real-Time Anomaly Detection with Spark MLlib, Akka and Cassandra
Natalino Busa
•
8 years ago
How to deploy Apache Spark to Mesos/DCOS
Legacy Typesafe (now Lightbend)
•
8 years ago
Anomaly Detection using Spark MLlib and Spark Streaming
Keira Zhou
•
8 years ago
Anomaly Detection with Apache Spark
Cloudera, Inc.
•
9 years ago
Time Series Processing with Apache Spark
QAware GmbH
•
8 years ago
Efficient Data Storage for Analytics with Apache Parquet 2.0
Cloudera, Inc.
•
9 years ago
Avro introduction
Nanda8904648951
•
9 years ago
Consumer offset management in Kafka
Joel Koshy
•
9 years ago
Hadoop - A System for Computing and Processing Big Data
Thành Thư Thái
•
9 years ago
Apache Hadoop YARN, NameNode HA, HDFS Federation
Adam Kawa
•
11 years ago
Introduction to YARN and MapReduce 2
Cloudera, Inc.
•
10 years ago
Cassandra background-and-architecture
Markus Klems
•
10 years ago