Personal Information
Organization / Workplace
Brisbane, Australia Australia
Occupation
Big Data Engineer & Data Scientist [Australian Citizen]
Industry
Technology / Software / Internet
About
( Australian citizen )
My experience summary
Design & develop scalable Apache spark based streaming application for processing clickstream.
Implemented Machine learning model using Apache Spark python & R with {Cluster of Postgresql-XL
Machine learning using Apache Spark MLlib and Naive Bayes Algo in C++ & R.
Hands on experience on batch processing, in-memory technologies, and columnar databases and good understanding of distributed Database SQL/different types of noSQL databases, sharding for scalablity & performance.
Machine Learning Modeling Techniques
Languages: R, Python, c++
Visualization tools: Mainly R, some Tableau
Hypothesis Testing
Databases: PostgreSQL-XL cluster & NoSq...
Likes
(3)Lecture 4 Decision Trees (2): Entropy, Information Gain, Gain Ratio
Marina Santini
•
8 years ago
How to Build a Recommendation Engine on Spark
Caserta
•
9 years ago
C to perl binding
Shmuel Fomberg
•
13 years ago
Personal Information
Organization / Workplace
Brisbane, Australia Australia
Occupation
Big Data Engineer & Data Scientist [Australian Citizen]
Industry
Technology / Software / Internet
About
( Australian citizen )
My experience summary
Design & develop scalable Apache spark based streaming application for processing clickstream.
Implemented Machine learning model using Apache Spark python & R with {Cluster of Postgresql-XL
Machine learning using Apache Spark MLlib and Naive Bayes Algo in C++ & R.
Hands on experience on batch processing, in-memory technologies, and columnar databases and good understanding of distributed Database SQL/different types of noSQL databases, sharding for scalablity & performance.
Machine Learning Modeling Techniques
Languages: R, Python, c++
Visualization tools: Mainly R, some Tableau
Hypothesis Testing
Databases: PostgreSQL-XL cluster & NoSq...