KAUSHIK SHAKKARI
#318, 2700 Ellendale Place, Los Angeles, CA 90007 | shakkari@usc.edu | +1 (213) 477-3601 | linkedin.com/in/kaushik-shakkari/
public.tableau.com/profile/kaushik3654#!/ | github.com/kaushikData | datacamp.com/kaushikshakkari | kaushikshakkari.wixsite.com
EDUCATION
University of Southern California, Los Angeles, CA Aug 2018 - May 2020
Master of Science in Computer Science (Data Science)
Amrita University, Coimbatore, TN, India Aug 2014 - May 2018
B. Tech. in Computer Science and Engineering, CGPA: 9.18 / 10
SKILLS
Programming Languages: Python, Java, C++, C, Scala, JavaScript
Storage Systems: SQL, MySQL, Oracle DB, Cassandra, MongoDB, PostgreSQL
Visualization Tools: Tableau, Plotly
Big Data Frameworks and Technologies: Hadoop, Hive, Sqoop, Pig, Oozie
Libraries (Python): Scikit-learn, NLTK, Beautiful Soup, Urllib, Bokeh, Keras, TensorFlow
Cloud Computing: Google Cloud Platform, VMware, Hyper-V
RESEARCH
UG Research Assistant, Amrita Multidimensional Data Analysis Lab, Amrita University, India Jan 2016 - Jul 2018
• Collaborated with Dr. Vidhya Balasubramanian (PhD, UCI) to design an algorithm and create a tool, ‘Lakshya’, that analyses
user behaviour while browsing the Internet, detects the level of focus and nudges the user back in real time.
• Installed the framework as an extension on several volunteers’ systems. Usage histories show that the framework detected
diversions and alerted users appropriately. Improved model accuracy to 95% through continuous user feedback.
• Extracted insights on browsing behaviour with Plotly and Bokeh to help users understand their internet usage.
WORK EXPERIENCE
Grader for Analytics and Statistics (GSBA 537), University of Southern California Sept 2018
• Created visualization and analytics assignments and graded student submissions under strict deadlines.
Team Leader, University Cisco collaboration real-time project Nov 2016 - Dec 2017
• Developed a generic big data framework using Hadoop for a Rating and Billing Scheduling application.
• Automated scheduling of process execution with Oozie and used Hive and Sqoop for data management and data transfer.
• The code was reviewed, refined, and pushed to production.
Database Intern, APTOnline Dec 2016 - Jan 2017
• Designed, modelled and optimized a set of DML operations and cursors in collaboration with the Watershed Development Team
for the Watershed Management Project (an Andhra Pradesh State Government project).
PROJECTS
Clustering machines and detecting anomalies to find malware-affected machines Nov 2018
• Applied PCA to reduce the dataset to less than 40% of its original size while retaining 95% of the information.
• Ran several clustering algorithms, including K-Means, Mean-Shift, DBSCAN, and Agglomerative Hierarchical Clustering.
• Showed mathematically that K-Means with K = 2 is the best fit for the dataset and detected anomalies (malware-affected systems).
• Used XGBoost, an implementation of gradient-boosted decision trees, to identify the most crucial features by F-score.
Analysing products and developing pricing and product strategies using Tableau Sept 2018 - Oct 2018
• Pre-processed and analysed sales data for tablets sold by companies such as Apple, Samsung, and Kindle.
• Created a dashboard showing how Apple’s and Samsung’s products compete on sales rank, rating, and discount.
• Computed trend lines and dynamic plots for various features of each brand across weeks 1 to 24 and addressed outliers.
Predicting customer churn and finding reasons for cancellation of a bank’s term deposits Jan 2018 - Apr 2018
• Pre-processed the data by encoding features, handling null values, and reducing the dimensionality of the dataset.
• Applied k-fold cross-validation with ten splits to guard against overfitting.
• Computed accuracy for ten classification algorithms, including a multilayer perceptron (a feedforward artificial neural network)
and adaptive boosting, and visualized the data using the seaborn and Matplotlib libraries to find insights.
• Created an interactive visualization application using Bokeh to analyse relationships between features in the dataset.
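As an illustration of the ten-split cross-validation mentioned in this project, the following is a minimal standard-library sketch, not the project's actual code (which presumably relied on Scikit-learn). The helper names `k_fold_indices`, `majority_class`, and `cross_val_accuracy`, the majority-class baseline, and the toy labels are all hypothetical.

```python
from statistics import mean


def k_fold_indices(n_samples, k=10):
    """Yield (train_idx, test_idx) pairs for k contiguous, near-equal folds."""
    # The first (n_samples % k) folds get one extra sample.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


def majority_class(labels):
    """Hypothetical baseline model: always predict the most frequent training label."""
    return max(set(labels), key=labels.count)


def cross_val_accuracy(labels, k=10):
    """Average held-out accuracy of the baseline across the k folds."""
    scores = []
    for train_idx, test_idx in k_fold_indices(len(labels), k):
        prediction = majority_class([labels[i] for i in train_idx])
        hits = sum(1 for i in test_idx if labels[i] == prediction)
        scores.append(hits / len(test_idx))
    return mean(scores)


# Hypothetical toy labels (e.g. 0 = deposit kept, 1 = deposit cancelled).
labels = [0] * 70 + [1] * 30
score = cross_val_accuracy(labels, k=10)
```

Each sample is held out exactly once, so the averaged score reflects performance on data the model did not see during fitting. In practice the rows would normally be shuffled (or stratified) before splitting.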
Data tidying and cleaning Gapminder Original Dataset Dec 2017 - Feb 2018
• Applied data cleaning techniques such as melting and pivoting to prepare the data for analysis.
• Performed preliminary quality checks with assert statements and created a five-dimensional plot in Tableau.
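To make the melting, pivoting, and assert-based checks above concrete, here is a small standard-library sketch. It is an illustration under stated assumptions, not the project's code: the functions `melt` and `pivot` are hypothetical stand-ins for the pandas operations of the same names, and the two-country table is made-up sample data.

```python
def melt(rows, id_key, var_name="year", value_name="value"):
    """Turn wide rows (one column per year) into long rows (one row per year)."""
    long_rows = []
    for row in rows:
        for key, value in row.items():
            if key != id_key:
                long_rows.append({id_key: row[id_key], var_name: key, value_name: value})
    return long_rows


def pivot(long_rows, index, columns, values):
    """Inverse of melt: rebuild one wide row per distinct index value."""
    wide = {}
    for row in long_rows:
        wide.setdefault(row[index], {index: row[index]})[row[columns]] = row[values]
    return list(wide.values())


# Hypothetical sample in the wide layout (one column per year).
wide_rows = [
    {"country": "A", "1800": 34.0, "1801": 34.5},
    {"country": "B", "1800": 29.1, "1801": 29.3},
]

long_rows = melt(wide_rows, id_key="country", value_name="life_expectancy")

# Preliminary quality checks expressed as assert statements.
assert all(r["life_expectancy"] >= 0 for r in long_rows)
assert all(r["year"].isdigit() for r in long_rows)

# Pivoting the long table back should reproduce the original wide table.
round_trip = pivot(long_rows, index="country", columns="year", values="life_expectancy")
```

The long layout has one observation per row, which is the shape most plotting and analysis tools expect. The asserts stop the script early if a value falls outside the expected range.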
OOLECA (Optimization of Live Space Using Computer Algorithms) - predicting the best plant to grow in an area,
considering climate, groundwater, humidity and the purpose for growing the plant. Jul 2016 - Dec 2016
• Cleaned data from different data sources and designed the database schema for the application.
• Constructed a hybrid (content-based and collaborative) recommendation system for users of the application.
• Collaborated with the environmental science department to design the survey and model the application.
ACHIEVEMENTS AND AWARDS
Outstanding Student Award, Amrita University Apr 2018
Outstanding Contribution and Successful Project Delivery, Cisco, India Nov 2017
Earned four badges from IBM for good scores in Data Science and Big Data (youracclaim.com/user/kaushik-shakkari)
