SlideShare a Scribd company logo
1 of 1
Download to read offline
KAUSHIK SHAKKARI
#318, 2700 Ellendale Place, Los Angeles, CA 90007 | shakkari@usc.edu | +1 (213) 477-3601 | linkedin.com/in/kaushik-shakkari/
public.tableau.com/profile/kaushik3654#!/|github.com/kaushikData|datacamp.com/kaushikshakkari| kaushikshakkari.wixsite.com
EDUCATION
University of Southern California, Los Angeles, CA Aug 2018 - May 2020
Master of Science in Computer Science (Data Science)
Amrita University, Coimbatore, TN, India Aug 2014 - May 2018
B. Tech. in Computer Science and Engineering, CGPA: 9.18 / 10
SKILLS
Programming Languages Python, Java, C++, C, JavaScript
Storage Systems and Query Languages SQL, MySQL, Oracle DB, Cassandra, MongoDB, PostgreSQL
Visualization Tools Tableau, Plotly
Big Data Framework and Technologies Hadoop, Hive, Sqoop, Pig, Oozie
Libraries (Python) Scikit-learn, NLTK, Beautiful Soup, Urllib, Bokeh, Keras, TensorFlow
Cloud Computing Google Cloud Platform
RESEARCH
UG Research Assistant, Amrita Multidimensional Data Analysis Lab, Amrita University, India Jan 2016 - Jul 2018
• Created a tool ‘Lakshya’ and evangelized ‘Lakshya’ as an extension to web browsers of 50 active users to analyse user
behaviour while browsing Internet, detect the level of diversion and nudge him back in real-time.
• Application’s usage history showed ‘Lakshya’ detected diversion and alerted users appropriately.
• Improved model accuracy to 95% through continuous feedback from users.
• Extrapolated insights on browsing behaviour with Plotly and Bokeh to make users understand their internet usage.
WORK EXPERIENCE
Grader for Statistics (GSBA 537) and Database Management (CSCI 585), University of Southern California Sept 2018
• Responsible for creating visualization and analytics assignments and grading student assignments on strict deadlines.
Team Leader, University Cisco collaboration real-time project Nov 2016 - Dec 2017
• Enabled cloud level scaling for a generic big data Rating and Billing Scheduling application using Hadoop framework.
• Automated deployment and orchestration of the application using Oozie tool, reducing human interaction.
• Managed and transferred data from various sources using Hive and Sqoop.
• Implemented code meet the production quality of Cisco Systems and was deployed to production.
Database Intern, APTOnline Dec 2016 - Jan 2017
• Designed, modelled and optimized a set of DML operations and cursors in collaboration with Watershed Development Team for
Watershed Management Project (Andhra Pradesh State Government Project).
PROJECTS
Data Clustering and Anomaly Detection of Malware Affected Systems Nov 2018
• Implemented PCA to reduce dataset to more than 60% of original dataset, retaining 95% of information.
• Executed various clustering algorithms like K-Means, Mean-Shift, DBSCAN, Agglomerative Hierarchical Clustering etc.
• Mathematically stated K-Means with K = 2 is best fit for dataset and detected anomalies (malware affected systems.)
• Used XGBoost, an implementation of gradient boosted decision tree to identify the most crucial features by F-Scores.
Product Analysis and Pricing Strategies Sept 2018 – Oct 2018
• Pre-processed and analysed sales data of tablets sold by different companies like Apple, Samsung, Kindle and others.
• Created a dashboard to show how Apple’s and Samsung’s products are competing in sales rank, rating and discount etc.
• Addressed outliers and computed trend lines on Tableau for various features of each brand over a period of 24 weeks.
Customer Churn Prediction and Analysis Jan 2018 - Apr 2018
• Performed pre-processing by encoding data, dealing with null values and reducing dimension of dataset.
• Executed k-folded cross validation technique with ten splits to avoid overfitting of data.
• Computed accuracy for ten classification algorithms including multilayer perceptron, a feedforward artificial neural network and
adaptive boost and visualized data using seaborn and matplotlib libraries to find insights in data.
• Created an interactive visualization application using bokeh for analysing relations of features in dataset.
Data tidying and cleaning Gapminder Original Dataset Dec 2017 - Feb 2018
• Implemented data cleaning techniques like melting and pivoting for data to be ready for analysis.
• Performed preliminary quality diagnosis by assert statements and created five-dimensional plot in tableau.
OOLECA (Optimization of Live Space Using Computer Algorithms) - predicting the best plant that can be grown at the area
considering climate, groundwater, humidity and purpose for growing plant. Jul 2016 - Dec 2016
• Cleaned data from different data sources and designed the database schema for the application.
• Constructed hybrid (content-based and collaborative) recommendation system for users of application.
• Collaborated with environmental science department for designing survey and modelling application.
ACHIEVEMENTS AND AWARDS
Outstanding Student Award, Amrita University Apr 2018
Outstanding Contribution and Successful Project Delivery, Cisco, India Nov 2017

More Related Content

What's hot

AnupDudaniDataScience2015
AnupDudaniDataScience2015AnupDudaniDataScience2015
AnupDudaniDataScience2015Anup Dudani
 
Resume_Weixiang Ding
Resume_Weixiang DingResume_Weixiang Ding
Resume_Weixiang DingWeixiang Ding
 
Big Data - Linked In_DEEPU
Big Data - Linked In_DEEPUBig Data - Linked In_DEEPU
Big Data - Linked In_DEEPUDeepu M
 
Build Machine Learning Models Quickly & Easily with Amazon SageMaker & Perisc...
Build Machine Learning Models Quickly & Easily with Amazon SageMaker & Perisc...Build Machine Learning Models Quickly & Easily with Amazon SageMaker & Perisc...
Build Machine Learning Models Quickly & Easily with Amazon SageMaker & Perisc...Amazon Web Services
 
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and SparkReal-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and SparkSingleStore
 
Geolocation analysis using HiveQL
Geolocation analysis using HiveQLGeolocation analysis using HiveQL
Geolocation analysis using HiveQLPriyanka Kale
 
Science base usage analysis - AGU2016 - in21d08
Science base usage analysis - AGU2016 - in21d08Science base usage analysis - AGU2016 - in21d08
Science base usage analysis - AGU2016 - in21d08Sky Bristol
 
The IoT and big data
The IoT and big dataThe IoT and big data
The IoT and big dataGal Ben-Haim
 
Graph-Powered Machine Learning
Graph-Powered Machine Learning Graph-Powered Machine Learning
Graph-Powered Machine Learning GraphAware
 
ntakpe_boraud_resume
ntakpe_boraud_resumentakpe_boraud_resume
ntakpe_boraud_resumentakpe boraud
 
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Spark Summit
 
20181003 Whirlwind tour into Pyspark
20181003 Whirlwind tour into Pyspark20181003 Whirlwind tour into Pyspark
20181003 Whirlwind tour into PysparkAndrey Vykhodtsev
 
Big data hadoop titles 2015 2016
Big data hadoop titles 2015 2016Big data hadoop titles 2015 2016
Big data hadoop titles 2015 2016xtreamtechnologies
 
Raghava Prasad S Resume
Raghava Prasad S ResumeRaghava Prasad S Resume
Raghava Prasad S ResumeRaghava Prasad
 

What's hot (19)

AnupDudaniDataScience2015
AnupDudaniDataScience2015AnupDudaniDataScience2015
AnupDudaniDataScience2015
 
Resume_Weixiang Ding
Resume_Weixiang DingResume_Weixiang Ding
Resume_Weixiang Ding
 
Query O
Query OQuery O
Query O
 
Big Data - Linked In_DEEPU
Big Data - Linked In_DEEPUBig Data - Linked In_DEEPU
Big Data - Linked In_DEEPU
 
Data Science At Zillow
Data Science At ZillowData Science At Zillow
Data Science At Zillow
 
Resume analyst
Resume analystResume analyst
Resume analyst
 
Build Machine Learning Models Quickly & Easily with Amazon SageMaker & Perisc...
Build Machine Learning Models Quickly & Easily with Amazon SageMaker & Perisc...Build Machine Learning Models Quickly & Easily with Amazon SageMaker & Perisc...
Build Machine Learning Models Quickly & Easily with Amazon SageMaker & Perisc...
 
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and SparkReal-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
Real-Time Supply Chain Analytics with Machine Learning, Kafka, and Spark
 
Geolocation analysis using HiveQL
Geolocation analysis using HiveQLGeolocation analysis using HiveQL
Geolocation analysis using HiveQL
 
Science base usage analysis - AGU2016 - in21d08
Science base usage analysis - AGU2016 - in21d08Science base usage analysis - AGU2016 - in21d08
Science base usage analysis - AGU2016 - in21d08
 
MCT Summit Azure automated Machine Learning
MCT Summit Azure automated Machine Learning MCT Summit Azure automated Machine Learning
MCT Summit Azure automated Machine Learning
 
The IoT and big data
The IoT and big dataThe IoT and big data
The IoT and big data
 
Graph-Powered Machine Learning
Graph-Powered Machine Learning Graph-Powered Machine Learning
Graph-Powered Machine Learning
 
ntakpe_boraud_resume
ntakpe_boraud_resumentakpe_boraud_resume
ntakpe_boraud_resume
 
Qo comparision
Qo comparisionQo comparision
Qo comparision
 
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
 
20181003 Whirlwind tour into Pyspark
20181003 Whirlwind tour into Pyspark20181003 Whirlwind tour into Pyspark
20181003 Whirlwind tour into Pyspark
 
Big data hadoop titles 2015 2016
Big data hadoop titles 2015 2016Big data hadoop titles 2015 2016
Big data hadoop titles 2015 2016
 
Raghava Prasad S Resume
Raghava Prasad S ResumeRaghava Prasad S Resume
Raghava Prasad S Resume
 

Similar to Resume(kaushik shakkari) (20)

Resume_Vignesh_ThulasiDass
Resume_Vignesh_ThulasiDass Resume_Vignesh_ThulasiDass
Resume_Vignesh_ThulasiDass
 
Kavinya Rajendran Resume
Kavinya Rajendran ResumeKavinya Rajendran Resume
Kavinya Rajendran Resume
 
Long resume v28
Long resume v28Long resume v28
Long resume v28
 
Satwik Mishra Resume
Satwik Mishra ResumeSatwik Mishra Resume
Satwik Mishra Resume
 
Newest mmis resume
Newest mmis  resumeNewest mmis  resume
Newest mmis resume
 
Prashant s resume
Prashant s resumePrashant s resume
Prashant s resume
 
Prakash_Wagle_Resume
Prakash_Wagle_ResumePrakash_Wagle_Resume
Prakash_Wagle_Resume
 
Satwik Mishra Resume
Satwik Mishra ResumeSatwik Mishra Resume
Satwik Mishra Resume
 
Resume
ResumeResume
Resume
 
Tanaya jan 17 Resume
Tanaya jan 17 ResumeTanaya jan 17 Resume
Tanaya jan 17 Resume
 
Karanjeet Singh Resume
Karanjeet Singh ResumeKaranjeet Singh Resume
Karanjeet Singh Resume
 
Resume yanwen lin
Resume yanwen linResume yanwen lin
Resume yanwen lin
 
Kunal lalwani
Kunal lalwaniKunal lalwani
Kunal lalwani
 
PriyankaDighe_Resume_new
PriyankaDighe_Resume_newPriyankaDighe_Resume_new
PriyankaDighe_Resume_new
 
Parmanand_Sahu.pdf
Parmanand_Sahu.pdfParmanand_Sahu.pdf
Parmanand_Sahu.pdf
 
AbhijitTripathy
AbhijitTripathyAbhijitTripathy
AbhijitTripathy
 
ShantanuGuptaResume
ShantanuGuptaResumeShantanuGuptaResume
ShantanuGuptaResume
 
Gupta_Nidhi
Gupta_NidhiGupta_Nidhi
Gupta_Nidhi
 
Resume_Tabluau_R_Python_ML
Resume_Tabluau_R_Python_MLResume_Tabluau_R_Python_ML
Resume_Tabluau_R_Python_ML
 
Resume
ResumeResume
Resume
 

Recently uploaded

dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 

Recently uploaded (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 

Resume(kaushik shakkari)

  • 1. KAUSHIK SHAKKARI #318, 2700 Ellendale Place, Los Angeles, CA 90007 | shakkari@usc.edu | +1 (213) 477-3601 | linkedin.com/in/kaushik-shakkari/ public.tableau.com/profile/kaushik3654#!/|github.com/kaushikData|datacamp.com/kaushikshakkari| kaushikshakkari.wixsite.com EDUCATION University of Southern California, Los Angeles, CA Aug 2018 - May 2020 Master of Science in Computer Science (Data Science) Amrita University, Coimbatore, TN, India Aug 2014 - May 2018 B. Tech. in Computer Science and Engineering, CGPA: 9.18 / 10 SKILLS Programming Languages Python, Java, C++, C, JavaScript Storage Systems and Query Languages SQL, MySQL, Oracle DB, Cassandra, MongoDB, PostgreSQL Visualization Tools Tableau, Plotly Big Data Framework and Technologies Hadoop, Hive, Sqoop, Pig, Oozie Libraries (Python) Scikit-learn, NLTK, Beautiful Soup, Urllib, Bokeh, Keras, TensorFlow Cloud Computing Google Cloud Platform RESEARCH UG Research Assistant, Amrita Multidimensional Data Analysis Lab, Amrita University, India Jan 2016 - Jul 2018 • Created a tool ‘Lakshya’ and evangelized ‘Lakshya’ as an extension to web browsers of 50 active users to analyse user behaviour while browsing Internet, detect the level of diversion and nudge him back in real-time. • Application’s usage history showed ‘Lakshya’ detected diversion and alerted users appropriately. • Improved model accuracy to 95% through continuous feedback from users. • Extrapolated insights on browsing behaviour with Plotly and Bokeh to make users understand their internet usage. WORK EXPERIENCE Grader for Statistics (GSBA 537) and Database Management (CSCI 585), University of Southern California Sept 2018 • Responsible for creating visualization and analytics assignments and grading student assignments on strict deadlines. Team Leader, University Cisco collaboration real-time project Nov 2016 - Dec 2017 • Enabled cloud level scaling for a generic big data Rating and Billing Scheduling application using Hadoop framework. • Automated deployment and orchestration of the application using Oozie tool, reducing human interaction. • Managed and transferred data from various sources using Hive and Sqoop. • Implemented code meet the production quality of Cisco Systems and was deployed to production. Database Intern, APTOnline Dec 2016 - Jan 2017 • Designed, modelled and optimized a set of DML operations and cursors in collaboration with Watershed Development Team for Watershed Management Project (Andhra Pradesh State Government Project). PROJECTS Data Clustering and Anomaly Detection of Malware Affected Systems Nov 2018 • Implemented PCA to reduce dataset to more than 60% of original dataset, retaining 95% of information. • Executed various clustering algorithms like K-Means, Mean-Shift, DBSCAN, Agglomerative Hierarchical Clustering etc. • Mathematically stated K-Means with K = 2 is best fit for dataset and detected anomalies (malware affected systems.) • Used XGBoost, an implementation of gradient boosted decision tree to identify the most crucial features by F-Scores. Product Analysis and Pricing Strategies Sept 2018 – Oct 2018 • Pre-processed and analysed sales data of tablets sold by different companies like Apple, Samsung, Kindle and others. • Created a dashboard to show how Apple’s and Samsung’s products are competing in sales rank, rating and discount etc. • Addressed outliers and computed trend lines on Tableau for various features of each brand over a period of 24 weeks. Customer Churn Prediction and Analysis Jan 2018 - Apr 2018 • Performed pre-processing by encoding data, dealing with null values and reducing dimension of dataset. • Executed k-folded cross validation technique with ten splits to avoid overfitting of data. • Computed accuracy for ten classification algorithms including multilayer perceptron, a feedforward artificial neural network and adaptive boost and visualized data using seaborn and matplotlib libraries to find insights in data. • Created an interactive visualization application using bokeh for analysing relations of features in dataset. Data tidying and cleaning Gapminder Original Dataset Dec 2017 - Feb 2018 • Implemented data cleaning techniques like melting and pivoting for data to be ready for analysis. • Performed preliminary quality diagnosis by assert statements and created five-dimensional plot in tableau. OOLECA (Optimization of Live Space Using Computer Algorithms) - predicting the best plant that can be grown at the area considering climate, groundwater, humidity and purpose for growing plant. Jul 2016 - Dec 2016 • Cleaned data from different data sources and designed the database schema for the application. • Constructed hybrid (content-based and collaborative) recommendation system for users of application. • Collaborated with environmental science department for designing survey and modelling application. ACHIEVEMENTS AND AWARDS Outstanding Student Award, Amrita University Apr 2018 Outstanding Contribution and Successful Project Delivery, Cisco, India Nov 2017