Personal Information
Organization / Workplace
San Francisco Bay Area United States
Industry
Electronics / Computer Hardware
About
I've built data pipelines using Apache Spark, Hadoop, and scikit-learn, and I've done data munging (cleaning up data for processing), feature engineering, and ML model creation with the cleaned and transformed data. I've solved both supervised and unsupervised ML problems using Python, Scala, and R.
Additionally, I've done sentiment analysis, text analysis, and ML projects.
As the only engineer on my team, I picked up required technologies like Hadoop and Apache Spark on my own and evaluated the Python, R, and Scala programming languages.
Automate the Sales Credit Allocation for the sales transaction. Nearly 5% of X million sales transactions need to be manually allocated to the right Sales Account Team f...
Tags
collaborative computing
hadoop
networking
apache mahout
association rule
clustering
data mining
Presentations
Likes (8)
Frustration-Reduced PySpark: Data engineering with DataFrames
Ilya Ganelin • 8 years ago
sparklyr - Jeff Allen
Sri Ambati • 7 years ago
A lightweight browser start page - 3x3 Links
Federico Elles • 15 years ago
The Secret Sauce of Successful Teams
Sven Peters • 7 years ago
Web Services Testing
Vladimir Soghoyan • 10 years ago
Clustering and Association Rule
Cisco • 9 years ago