SlideShare a Scribd company logo
Experiences
Education
➢ 7.5+ years of extensive experiences in Data Analytics/ Engineering with Business Intelligence, Data
Warehouse and Hadoop-Spark-MapR-HIVE eco-system.
➢ Process live streaming data into MapR-Hadoop filesystem using Apache-PySpark. Creating schema-
oriented Parquet files and load them into HIVE (external/internal table) and MapR DB using
Python. Creating multithreading python script for PySpark.
➢ Expertise in Huge Volume of data integration/migration and implementing Data-Warehouse
repository using SSIS, SQL Server, providing end-to-end ETL solutions with deployment to Azure
Cloud, Data Analysis and SQL DBA, Azure API Management, Azure Web Apps, converting business
requirements into technical solutions.
➢ Handling unstructured & semi-structured data in XML, JSON using Kafka streaming.
➢ Working on ML algorithms like Linear/Logistic Regression, Lasso, Ridge, Decision Tree, CNN-Neural
Network, Support Vector Machine (SVM), ARIMA Time Series, Text Mining, K-Means Cluster.
➢ An experienced team player (led a team of 6 members) of excellent interpersonal skills with ability
to work independently in projects with multiple clients’ simultaneously.
➢ Passionate, positive, can-do attitude, and willing to take ownership of problems for full resolutions.
Present
November 2017
December 2016
October 2014
April 2012
Profile Summary:
Awards & Achievements:
Skills:
Primary
Project
Big Data MapR-Hadoop, Spark, Python, HIVE, HQL, NoSQL, Mongo DB, UNIX,
Kafka, Snowflake
Business
Intelligence
SSIS / SQL Server Data Tool, SSRS, MSBI/ETL/BIDS, Data Quality
Services (DQS), Data Cleansing, Data Modelling, Dashboards
RDBMS Database Azure Cloud SQL DB, Azure BLOB Storage, Data Warehouse, SQL
Server, Netezza, T-SQL, Free Text Search, MySQL, DBA, DB Isolation
Level, Performance Tuning, SQL Automation, Backup Strategy, OLTP
Secondary
Project
Machine Learning Supervised/Unsupervised Learning, Linear/Logistic Regression,
Decision Tree, Random Forest, CNN-Neural Network, Support
Vector Machine (SVM), ARIMA Time Series, Text Mining, K-Means
Cluster
UI Framework Azure Service Bus, Web Services, ASP.NET, C#, OOPS, WCF
Other Technology Cloud Computing, Oracle, R-Studio, JavaScript, SQL Server SSAS-
Cube, MDX, Tableau/QlikView, Service-Now, JIRA, PostgreSQL
Tools Visual Studio, Version Control using TFS, GitHub
➢ Stood in Top 3 in Qualcomm Worldwide Online Machine Learning Contest in 2019.
➢ Won the WINNER title of Innovation Maestro 2018-19 in Qualcomm OneIT using ML.
➢ Qualcomm OneIT HaQkathon 2018 winner, got chance to present my prototype to Qualcomm CIO.
➢ Went to final round in Microsoft AI Challenge India 2018.
➢ Super-Qualstar awarded for rapid prototypes & executions on innovative ideas in Qualcomm.
➢ Qualstar awarded for prototyping on an innovative idea using Python & NLP in Qualcomm.
➢ QBuzz 2018 finalist. QBuzz is a platform organized by Qualcomm for new Innovative Paper-
Presentation in cutting-edge technologies. I presented an innovative idea in AI/ML domain.
➢ Certificate for my contribution to OneIT Innovation Program using Machine Learning in Qualcomm.
➢ Completed Microsoft Certification exam on “Implementing a Data Warehouse with Microsoft SQL
Server” (70-463).
➢ Client Appreciation Certificate for outstanding performance in Data Migration in PwC.
➢ Appreciation Certificate for quick successful deliverables in Data Migration in PwC.
➢ “STAR Performer” of the year awarded for outstanding performance in PwC.
➢ Received Client Appreciation Certificate for outstanding performance in Data Analytics in PwC.
Qualcomm Hackathon, Innovation Maestro, & Super-Qualstar Winner | Microsoft Certified Professional
❖ Pursuing M.Tech in
Data Science &
Engineering from
BITS, Pilani.
❖ B.Tech in Computer
Science & Engineering
in 2007-2011 from
W.B.U.T. with 7.86
CGPA
❖ Higher Secondary (12th)
in 2006 from
W.B.C.H.S.E. with
84.10%.
❖ Secondary (10th) in
2004 from W.B.B.S.E.
with 85.37%.
Shubham
Mallick
+91 - 9734389354
shubham.kolkata@gmail.com
https://www.linkedin.com/in/i-Shubham/
Hyderabad, India
USA Business Visa (B1/B2)
➢ Project: ECAP (Engineering Compute Analytics Platform) Data Processing
➢ Role: Senior Data Analyst
➢ Tools/Technology: MapR-Hadoop, Spark, Python, HIVE, Parquet, Netezza, UNIX, Kafka, ML
➢ Details:
i. Building high quality Big Data applications, data pipelines and analytics solutions.
ii. Loading huge volume of live streaming data into MapR-DB in Hadoop filesystem (HDFS)
using Apache-PySpark. Creating schema-oriented Parquet files.
iii. Loading data from MapR filesystem to HIVE (external/internal table) using Beeline/Spark.
Set-up message-broker using Kafka for live streaming of log file.
iv. Data Masking on huge volume of data by Data Dictionary using Python Parallel Processing.
v. Experiencing in building solutions using Columnar database architecture and performance.
vi. Scraping data from QlikView dashboards and loading into HIVE using PySpark.
vii. Working on Log-Stash from 'ELK' stack to centralize log files generated from different Hadoop
clusters & loading into MapR DB/HIVE using PySpark.
viii. Replacing Informatica ETL tool using Apache-Spark data pipeline scheduled on YARN.
ix. Demonstrating strong development processes in Agile. Also playing role as a Scrum Master.
x. Collaborating with multiple cross functional teams to work on solutions.
xi. Working on QCT chipset forecasting using Machine Learning models (Linear/Logistic, Lasso,
Ridge, Decision Tree, Convolutional Neural Network - CNN, SVM, ARIMA Time Series), &
plotting in matplotlib of few real-time problems which are the part of Q-Innovation team.
xii. Parameter tuning of ML models using GridSearchCV & RandomSearchCV, & deploy in Prod.
xiii. Code migration and version controlling using Github, & issue tracking in JIRA & Service Now.
xiv. Working as Mentor to interns who joined Qualcomm to make them industry ready. Helping
cross team to automate Netezza-to-SnowFlake, Netezza-to-MapR migrations using Python.
Work Experiences:
November 2017 to Present
December 2016 to November 2017
➢ Project: Project Oaktree (Azure Cloud Migration)
➢ Role: Team Lead, SME for application/module
➢ Tools/Technology: Azure Cloud, SQL Server, Web-Services, Service-Bus, WCF, SSRS, MongoDB
➢ Details:
i. Gather business requirements in AGILE/SCRUM method &design Data Model.
ii. Data migration to Azure Cloud DB from heterogeneous legacy source systems, performing
DBA tasks & end-to-end ETL solutions at Azure Cloud, with DBA tasks (Back up strategy and
SQL jobs, Recovery mode, Performance tuning, Index rebuilding, Availability groups).
iii. Create MongoDB database, migrate DataMart from SQL Server to MongoDB using SSIS
custom scripts, and process for Data Analytics.
iv. Deploy Azure Service Bus using C#.NET to interact with Broker message from client app,
create C# WCF web services interact with Cloud Database & deploy in Azure.
v. Led a team of 6 members and grooming them to make them ready for client facing.
October 2014 to December 2016
➢ Project: GCP-GDW & IPSR
➢ Role: Senior BI Developer and Module lead
➢ Tools/Technology: SQL Server, T-SQL, SSIS, ETL, DBA, C#, TFS, Free Text search, MongoDB, Python
➢ Details:
i. Data Warehouse migration in AGILE/SCRUM methodologies, understanding business
requirements & design Data Model for SQL Data Warehouse.
ii. Predictive search/Free-Text Search (like Google search) from billions of rows in just 2-3
seconds from .NET application using C# web services & jQuery.
iii. Create & deploy SSIS packages for Data Migration from heterogeneous source to SQL Server
and provide end-to-end ETL solution. Performance tuning of SSIS (with SQL tuning) to load
50 million data rows within 1.25 hours from remote data source to SQL server.
iv. Design Data-Mart and load data from SQL Server DW through SSIS packages, developing
SSRS and PowerBI report for Data Quality check, after & before Migration.
v. DBA tasks (Back up strategy and SQL jobs, Recovery mode, Performance tuning, Index
rebuilding, Availability groups).
vi. Integrate DQS (Data Quality Service) with SSIS for bulk data cleansing during migrations.
vii. Developing C# WCF web services to expose data to clients like Infosys, TCS.
April 2012 to October 2014
➢ Project: MetLife USBPM
➢ Role: Application developer
➢ Tools/Technology: Visual Studio, SQL Server, T-SQL, SSIS, C#, ADO.NET, JavaScript, jQuery
➢ Details:
i. Production Support project of 24X7 hours for MetLife, handling bulk production data &
process to SQL server through SSIS packages.
ii. Monitoring of running live Production jobs through IBM Tivoli/Maestro, solving issues
on Ticketing System on priority basis.
iii. Automation of existing process to minimize day-to-day manual efforts & errors.
iv. Upgrade of existing SSIS packages, tuning existing Stored Procedures, solving Production
issues with priority as per Business tickets.
Additional
Responsibility:
➢ Appointed as Mentor for newly joined people for internship at Qualcomm.
➢ Active member of Core technical interviewers’ panel for lateral hiring as an interviewer at
Qualcomm, IBM & PwC.
➢ Appointed as a trainer for fresh campus hired graduates/associates from India’s premier
institutes on technical competencies (Analytics, SQL, Data-Warehouse, DB Migration, ETL), &
received appreciation from Executive Director for quality training delivered at PwC.
Academic Project:
➢ Online Job Portal System – University Final Year Project
o This is a web-based application developed in JAVA and Oracle 9i in 2011.
➢ Online Air Line Reservation System – IETE Training Institute (Summer Intern)
o This is a web-based application developed in JAVA and Oracle 9i in 2011.
➢ Computer Retail Business Management – IBM Certified Academy
o This is a SQL/PL-SQL application developed in Oracle 9i in 2010.
Academic Details: Qualification University/Board Year Marks Obtained
B. Tech W.B.U.T. 2011 7.86 CGPA
12th
W.B.C.H.S.E. 2006 MATH PHYS CHEM BIOS Overall
195/200 152/200 167/200 165/200 84.10%
10th
W.B.B.S.E. 2004 Math Ph. Sc. Life Sc. Overall
91/100 95/100 88/100 85.37%
Interests: ➢ Hadoop, Big-Data, Machine Learning, Data Warehouse, Data Analytics, Business Intelligence,
Python, R, Data Mining, Blog writing
Personal Details: ➢ DOB : September 11, 1988
➢ Gender : Male
➢ Marital Status : Unmarried
➢ Language Known : English, Hindi, Bengali

More Related Content

What's hot

2020 – A Decade of Change
2020 – A Decade of Change2020 – A Decade of Change
2020 – A Decade of Change
Markus Michalewicz
 
Senior database administrator
Senior database administratorSenior database administrator
Senior database administrator
Mustafa EL-Masry
 
Resume-pradeep SQL DBA
Resume-pradeep SQL DBAResume-pradeep SQL DBA
Resume-pradeep SQL DBA
Pradeep GP
 
Dimensional modeling in oracle sql developer
Dimensional modeling in oracle sql developerDimensional modeling in oracle sql developer
Dimensional modeling in oracle sql developer
Jeff Smith
 
Oracle RAC 19c and Later - Best Practices #OOWLON
Oracle RAC 19c and Later - Best Practices #OOWLONOracle RAC 19c and Later - Best Practices #OOWLON
Oracle RAC 19c and Later - Best Practices #OOWLON
Markus Michalewicz
 
"Changing Role of the DBA" Skills to Have, to Obtain & to Nurture - Updated 2...
"Changing Role of the DBA" Skills to Have, to Obtain & to Nurture - Updated 2..."Changing Role of the DBA" Skills to Have, to Obtain & to Nurture - Updated 2...
"Changing Role of the DBA" Skills to Have, to Obtain & to Nurture - Updated 2...
Markus Michalewicz
 
Armando Marrero Resume Sql Server Database Developer
Armando Marrero Resume Sql Server Database DeveloperArmando Marrero Resume Sql Server Database Developer
Armando Marrero Resume Sql Server Database Developer
Armando Marrero
 
2010/09 - Database Architechs - Performance & Tuning Tool
2010/09 - Database Architechs - Performance & Tuning Tool2010/09 - Database Architechs - Performance & Tuning Tool
2010/09 - Database Architechs - Performance & Tuning Tool
Database Architechs
 
Make Your Application “Oracle RAC Ready” & Test For It
Make Your Application “Oracle RAC Ready” & Test For ItMake Your Application “Oracle RAC Ready” & Test For It
Make Your Application “Oracle RAC Ready” & Test For It
Markus Michalewicz
 
Why Use an Oracle Database?
Why Use an Oracle Database?Why Use an Oracle Database?
Why Use an Oracle Database?
Markus Michalewicz
 
(Oracle) DBA and Other Skills Needed in 2020
(Oracle) DBA and Other Skills Needed in 2020(Oracle) DBA and Other Skills Needed in 2020
(Oracle) DBA and Other Skills Needed in 2020
Markus Michalewicz
 
Under the Hood of the Smartest Availability Features in Oracle's Autonomous D...
Under the Hood of the Smartest Availability Features in Oracle's Autonomous D...Under the Hood of the Smartest Availability Features in Oracle's Autonomous D...
Under the Hood of the Smartest Availability Features in Oracle's Autonomous D...
Markus Michalewicz
 
The Top 5 Reasons to Deploy Your Applications on Oracle RAC
The Top 5 Reasons to Deploy Your Applications on Oracle RACThe Top 5 Reasons to Deploy Your Applications on Oracle RAC
The Top 5 Reasons to Deploy Your Applications on Oracle RAC
Markus Michalewicz
 
Database@Home : The Future is Data Driven
Database@Home : The Future is Data DrivenDatabase@Home : The Future is Data Driven
Database@Home : The Future is Data Driven
Tammy Bednar
 
MySQL Sharding: Tools and Best Practices for Horizontal Scaling
MySQL Sharding: Tools and Best Practices for Horizontal ScalingMySQL Sharding: Tools and Best Practices for Horizontal Scaling
MySQL Sharding: Tools and Best Practices for Horizontal Scaling
Mats Kindahl
 
Sql developer resume
Sql developer resumeSql developer resume
Sql developer resume
guest9db49b8
 
Using MySQL Enterprise Monitor for Continuous Performance Improvement
Using MySQL Enterprise Monitor for Continuous Performance ImprovementUsing MySQL Enterprise Monitor for Continuous Performance Improvement
Using MySQL Enterprise Monitor for Continuous Performance Improvement
Mark Matthews
 
Manish Kumar-.Net,Sql Server Developer,BI(SSIS,SSAS,SSRS)
Manish Kumar-.Net,Sql Server Developer,BI(SSIS,SSAS,SSRS)Manish Kumar-.Net,Sql Server Developer,BI(SSIS,SSAS,SSRS)
Manish Kumar-.Net,Sql Server Developer,BI(SSIS,SSAS,SSRS)
Manish Kumar Gupta
 
Top 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous DatabaseTop 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous Database
Sandesh Rao
 
Oracle EBS database upgrade to 12c
Oracle EBS database upgrade to 12cOracle EBS database upgrade to 12c
Oracle EBS database upgrade to 12c
vasuballa
 

What's hot (20)

2020 – A Decade of Change
2020 – A Decade of Change2020 – A Decade of Change
2020 – A Decade of Change
 
Senior database administrator
Senior database administratorSenior database administrator
Senior database administrator
 
Resume-pradeep SQL DBA
Resume-pradeep SQL DBAResume-pradeep SQL DBA
Resume-pradeep SQL DBA
 
Dimensional modeling in oracle sql developer
Dimensional modeling in oracle sql developerDimensional modeling in oracle sql developer
Dimensional modeling in oracle sql developer
 
Oracle RAC 19c and Later - Best Practices #OOWLON
Oracle RAC 19c and Later - Best Practices #OOWLONOracle RAC 19c and Later - Best Practices #OOWLON
Oracle RAC 19c and Later - Best Practices #OOWLON
 
"Changing Role of the DBA" Skills to Have, to Obtain & to Nurture - Updated 2...
"Changing Role of the DBA" Skills to Have, to Obtain & to Nurture - Updated 2..."Changing Role of the DBA" Skills to Have, to Obtain & to Nurture - Updated 2...
"Changing Role of the DBA" Skills to Have, to Obtain & to Nurture - Updated 2...
 
Armando Marrero Resume Sql Server Database Developer
Armando Marrero Resume Sql Server Database DeveloperArmando Marrero Resume Sql Server Database Developer
Armando Marrero Resume Sql Server Database Developer
 
2010/09 - Database Architechs - Performance & Tuning Tool
2010/09 - Database Architechs - Performance & Tuning Tool2010/09 - Database Architechs - Performance & Tuning Tool
2010/09 - Database Architechs - Performance & Tuning Tool
 
Make Your Application “Oracle RAC Ready” & Test For It
Make Your Application “Oracle RAC Ready” & Test For ItMake Your Application “Oracle RAC Ready” & Test For It
Make Your Application “Oracle RAC Ready” & Test For It
 
Why Use an Oracle Database?
Why Use an Oracle Database?Why Use an Oracle Database?
Why Use an Oracle Database?
 
(Oracle) DBA and Other Skills Needed in 2020
(Oracle) DBA and Other Skills Needed in 2020(Oracle) DBA and Other Skills Needed in 2020
(Oracle) DBA and Other Skills Needed in 2020
 
Under the Hood of the Smartest Availability Features in Oracle's Autonomous D...
Under the Hood of the Smartest Availability Features in Oracle's Autonomous D...Under the Hood of the Smartest Availability Features in Oracle's Autonomous D...
Under the Hood of the Smartest Availability Features in Oracle's Autonomous D...
 
The Top 5 Reasons to Deploy Your Applications on Oracle RAC
The Top 5 Reasons to Deploy Your Applications on Oracle RACThe Top 5 Reasons to Deploy Your Applications on Oracle RAC
The Top 5 Reasons to Deploy Your Applications on Oracle RAC
 
Database@Home : The Future is Data Driven
Database@Home : The Future is Data DrivenDatabase@Home : The Future is Data Driven
Database@Home : The Future is Data Driven
 
MySQL Sharding: Tools and Best Practices for Horizontal Scaling
MySQL Sharding: Tools and Best Practices for Horizontal ScalingMySQL Sharding: Tools and Best Practices for Horizontal Scaling
MySQL Sharding: Tools and Best Practices for Horizontal Scaling
 
Sql developer resume
Sql developer resumeSql developer resume
Sql developer resume
 
Using MySQL Enterprise Monitor for Continuous Performance Improvement
Using MySQL Enterprise Monitor for Continuous Performance ImprovementUsing MySQL Enterprise Monitor for Continuous Performance Improvement
Using MySQL Enterprise Monitor for Continuous Performance Improvement
 
Manish Kumar-.Net,Sql Server Developer,BI(SSIS,SSAS,SSRS)
Manish Kumar-.Net,Sql Server Developer,BI(SSIS,SSAS,SSRS)Manish Kumar-.Net,Sql Server Developer,BI(SSIS,SSAS,SSRS)
Manish Kumar-.Net,Sql Server Developer,BI(SSIS,SSAS,SSRS)
 
Top 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous DatabaseTop 20 FAQs on the Autonomous Database
Top 20 FAQs on the Autonomous Database
 
Oracle EBS database upgrade to 12c
Oracle EBS database upgrade to 12cOracle EBS database upgrade to 12c
Oracle EBS database upgrade to 12c
 

Similar to Shubham, 7.5+ years exp, mcp, map r spark-hive-bi-etl-azure-dataengineer-ml

DeepeshRehi
DeepeshRehiDeepeshRehi
DeepeshRehi
deepesh rehi
 
Nagarjuna_Damarla_Resume
Nagarjuna_Damarla_ResumeNagarjuna_Damarla_Resume
Nagarjuna_Damarla_Resume
Nag Arjun
 
Prashanth Kumar_Hadoop_NEW
Prashanth Kumar_Hadoop_NEWPrashanth Kumar_Hadoop_NEW
Prashanth Kumar_Hadoop_NEW
Prashanth Shankar kumar
 
Hadoop Big Data Resume
Hadoop Big Data ResumeHadoop Big Data Resume
Hadoop Big Data Resume
arbind_jha
 
Hadoop Big Data Resume
Hadoop Big Data ResumeHadoop Big Data Resume
Hadoop Big Data Resume
arbind_jha
 
Deepankar Sehdev- Resume2015
Deepankar Sehdev- Resume2015Deepankar Sehdev- Resume2015
Deepankar Sehdev- Resume2015
Deepankar Sehdev
 
AnilKumarT_Resume_latest
AnilKumarT_Resume_latestAnilKumarT_Resume_latest
AnilKumarT_Resume_latest
anil_thyagarajan
 
Rajeev kumar apache_spark & scala developer
Rajeev kumar apache_spark & scala developerRajeev kumar apache_spark & scala developer
Rajeev kumar apache_spark & scala developer
Rajeev Kumar
 
Kiran_Profile
Kiran_ProfileKiran_Profile
Kiran_Profile
Kiran Immadi
 
Hadoop training in Bangalore
Hadoop training in BangaloreHadoop training in Bangalore
Hadoop training in Bangalore
IGEEKS TECHNOLOGIES
 
Sunshine consulting mopuru babu cv_java_j2ee_spring_bigdata_scala
Sunshine consulting mopuru babu cv_java_j2ee_spring_bigdata_scalaSunshine consulting mopuru babu cv_java_j2ee_spring_bigdata_scala
Sunshine consulting mopuru babu cv_java_j2ee_spring_bigdata_scala
Mopuru Babu
 
Robin_Hadoop
Robin_HadoopRobin_Hadoop
Robin_Hadoop
Robin David
 
Maharshi_Amin_416
Maharshi_Amin_416Maharshi_Amin_416
Maharshi_Amin_416
mamin1411
 
Vaibhav_Rane
Vaibhav_RaneVaibhav_Rane
Vaibhav_Rane
Vaibhav Rane
 
Resume
ResumeResume
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
ScyllaDB
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
Provectus
 
Tarun poladi resume
Tarun poladi resumeTarun poladi resume
Tarun poladi resume
Tarun P
 
Resume
ResumeResume
Resume
nagapandu
 
Cv anith
Cv anithCv anith
Cv anith
Anith Shaji
 

Similar to Shubham, 7.5+ years exp, mcp, map r spark-hive-bi-etl-azure-dataengineer-ml (20)

DeepeshRehi
DeepeshRehiDeepeshRehi
DeepeshRehi
 
Nagarjuna_Damarla_Resume
Nagarjuna_Damarla_ResumeNagarjuna_Damarla_Resume
Nagarjuna_Damarla_Resume
 
Prashanth Kumar_Hadoop_NEW
Prashanth Kumar_Hadoop_NEWPrashanth Kumar_Hadoop_NEW
Prashanth Kumar_Hadoop_NEW
 
Hadoop Big Data Resume
Hadoop Big Data ResumeHadoop Big Data Resume
Hadoop Big Data Resume
 
Hadoop Big Data Resume
Hadoop Big Data ResumeHadoop Big Data Resume
Hadoop Big Data Resume
 
Deepankar Sehdev- Resume2015
Deepankar Sehdev- Resume2015Deepankar Sehdev- Resume2015
Deepankar Sehdev- Resume2015
 
AnilKumarT_Resume_latest
AnilKumarT_Resume_latestAnilKumarT_Resume_latest
AnilKumarT_Resume_latest
 
Rajeev kumar apache_spark & scala developer
Rajeev kumar apache_spark & scala developerRajeev kumar apache_spark & scala developer
Rajeev kumar apache_spark & scala developer
 
Kiran_Profile
Kiran_ProfileKiran_Profile
Kiran_Profile
 
Hadoop training in Bangalore
Hadoop training in BangaloreHadoop training in Bangalore
Hadoop training in Bangalore
 
Sunshine consulting mopuru babu cv_java_j2ee_spring_bigdata_scala
Sunshine consulting mopuru babu cv_java_j2ee_spring_bigdata_scalaSunshine consulting mopuru babu cv_java_j2ee_spring_bigdata_scala
Sunshine consulting mopuru babu cv_java_j2ee_spring_bigdata_scala
 
Robin_Hadoop
Robin_HadoopRobin_Hadoop
Robin_Hadoop
 
Maharshi_Amin_416
Maharshi_Amin_416Maharshi_Amin_416
Maharshi_Amin_416
 
Vaibhav_Rane
Vaibhav_RaneVaibhav_Rane
Vaibhav_Rane
 
Resume
ResumeResume
Resume
 
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
Tarun poladi resume
Tarun poladi resumeTarun poladi resume
Tarun poladi resume
 
Resume
ResumeResume
Resume
 
Cv anith
Cv anithCv anith
Cv anith
 

Recently uploaded

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Natural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptxNatural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptx
fkyes25
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 

Recently uploaded (20)

原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Natural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptxNatural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptx
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 

Shubham, 7.5+ years exp, mcp, map r spark-hive-bi-etl-azure-dataengineer-ml

  • 1. Experiences Education ➢ 7.5+ years of extensive experiences in Data Analytics/ Engineering with Business Intelligence, Data Warehouse and Hadoop-Spark-MapR-HIVE eco-system. ➢ Process live streaming data into MapR-Hadoop filesystem using Apache-PySpark. Creating schema- oriented Parquet files and load them into HIVE (external/internal table) and MapR DB using Python. Creating multithreading python script for PySpark. ➢ Expertise in Huge Volume of data integration/migration and implementing Data-Warehouse repository using SSIS, SQL Server, providing end-to-end ETL solutions with deployment to Azure Cloud, Data Analysis and SQL DBA, Azure API Management, Azure Web Apps, converting business requirements into technical solutions. ➢ Handling unstructured & semi-structured data in XML, JSON using Kafka streaming. ➢ Working on ML algorithms like Linear/Logistic Regression, Lasso, Ridge, Decision Tree, CNN-Neural Network, Support Vector Machine (SVM), ARIMA Time Series, Text Mining, K-Means Cluster. ➢ An experienced team player (led a team of 6 members) of excellent interpersonal skills with ability to work independently in projects with multiple clients’ simultaneously. ➢ Passionate, positive, can-do attitude, and willing to take ownership of problems for full resolutions. Present November 2017 December 2016 October 2014 April 2012 Profile Summary: Awards & Achievements: Skills: Primary Project Big Data MapR-Hadoop, Spark, Python, HIVE, HQL, NoSQL, Mongo DB, UNIX, Kafka, Snowflake Business Intelligence SSIS / SQL Server Data Tool, SSRS, MSBI/ETL/BIDS, Data Quality Services (DQS), Data Cleansing, Data Modelling, Dashboards RDBMS Database Azure Cloud SQL DB, Azure BLOB Storage, Data Warehouse, SQL Server, Netezza, T-SQL, Free Text Search, MySQL, DBA, DB Isolation Level, Performance Tuning, SQL Automation, Backup Strategy, OLTP Secondary Project Machine Learning Supervised/Unsupervised Learning, Linear/Logistic Regression, Decision Tree, Random Forest, CNN-Neural Network, Support Vector Machine (SVM), ARIMA Time Series, Text Mining, K-Means Cluster UI Framework Azure Service Bus, Web Services, ASP.NET, C#, OOPS, WCF Other Technology Cloud Computing, Oracle, R-Studio, JavaScript, SQL Server SSAS- Cube, MDX, Tableau/QlikView, Service-Now, JIRA, PostgreSQL Tools Visual Studio, Version Control using TFS, GitHub ➢ Stood in Top 3 in Qualcomm Worldwide Online Machine Learning Contest in 2019. ➢ Won the WINNER title of Innovation Maestro 2018-19 in Qualcomm OneIT using ML. ➢ Qualcomm OneIT HaQkathon 2018 winner, got chance to present my prototype to Qualcomm CIO. ➢ Went to final round in Microsoft AI Challenge India 2018. ➢ Super-Qualstar awarded for rapid prototypes & executions on innovative ideas in Qualcomm. ➢ Qualstar awarded for prototyping on an innovative idea using Python & NLP in Qualcomm. ➢ QBuzz 2018 finalist. QBuzz is a platform organized by Qualcomm for new Innovative Paper- Presentation in cutting-edge technologies. I presented an innovative idea in AI/ML domain. ➢ Certificate for my contribution to OneIT Innovation Program using Machine Learning in Qualcomm. ➢ Completed Microsoft Certification exam on “Implementing a Data Warehouse with Microsoft SQL Server” (70-463). ➢ Client Appreciation Certificate for outstanding performance in Data Migration in PwC. ➢ Appreciation Certificate for quick successful deliverables in Data Migration in PwC. ➢ “STAR Performer” of the year awarded for outstanding performance in PwC. ➢ Received Client Appreciation Certificate for outstanding performance in Data Analytics in PwC. Qualcomm Hackathon, Innovation Maestro, & Super-Qualstar Winner | Microsoft Certified Professional ❖ Pursuing M.Tech in Data Science & Engineering from BITS, Pilani. ❖ B.Tech in Computer Science & Engineering in 2007-2011 from W.B.U.T. with 7.86 CGPA ❖ Higher Secondary (12th) in 2006 from W.B.C.H.S.E. with 84.10%. ❖ Secondary (10th) in 2004 from W.B.B.S.E. with 85.37%. Shubham Mallick +91 - 9734389354 shubham.kolkata@gmail.com https://www.linkedin.com/in/i-Shubham/ Hyderabad, India USA Business Visa (B1/B2)
  • 2. ➢ Project: ECAP (Engineering Compute Analytics Platform) Data Processing ➢ Role: Senior Data Analyst ➢ Tools/Technology: MapR-Hadoop, Spark, Python, HIVE, Parquet, Netezza, UNIX, Kafka, ML ➢ Details: i. Building high quality Big Data applications, data pipelines and analytics solutions. ii. Loading huge volume of live streaming data into MapR-DB in Hadoop filesystem (HDFS) using Apache-PySpark. Creating schema-oriented Parquet files. iii. Loading data from MapR filesystem to HIVE (external/internal table) using Beeline/Spark. Set-up message-broker using Kafka for live streaming of log file. iv. Data Masking on huge volume of data by Data Dictionary using Python Parallel Processing. v. Experiencing in building solutions using Columnar database architecture and performance. vi. Scraping data from QlikView dashboards and loading into HIVE using PySpark. vii. Working on Log-Stash from 'ELK' stack to centralize log files generated from different Hadoop clusters & loading into MapR DB/HIVE using PySpark. viii. Replacing Informatica ETL tool using Apache-Spark data pipeline scheduled on YARN. ix. Demonstrating strong development processes in Agile. Also playing role as a Scrum Master. x. Collaborating with multiple cross functional teams to work on solutions. xi. Working on QCT chipset forecasting using Machine Learning models (Linear/Logistic, Lasso, Ridge, Decision Tree, Convolutional Neural Network - CNN, SVM, ARIMA Time Series), & plotting in matplotlib of few real-time problems which are the part of Q-Innovation team. xii. Parameter tuning of ML models using GridSearchCV & RandomSearchCV, & deploy in Prod. xiii. Code migration and version controlling using Github, & issue tracking in JIRA & Service Now. xiv. Working as Mentor to interns who joined Qualcomm to make them industry ready. Helping cross team to automate Netezza-to-SnowFlake, Netezza-to-MapR migrations using Python. Work Experiences: November 2017 to Present December 2016 to November 2017 ➢ Project: Project Oaktree (Azure Cloud Migration) ➢ Role: Team Lead, SME for application/module ➢ Tools/Technology: Azure Cloud, SQL Server, Web-Services, Service-Bus, WCF, SSRS, MongoDB ➢ Details: i. Gather business requirements in AGILE/SCRUM method &design Data Model. ii. Data migration to Azure Cloud DB from heterogeneous legacy source systems, performing DBA tasks & end-to-end ETL solutions at Azure Cloud, with DBA tasks (Back up strategy and SQL jobs, Recovery mode, Performance tuning, Index rebuilding, Availability groups). iii. Create MongoDB database, migrate DataMart from SQL Server to MongoDB using SSIS custom scripts, and process for Data Analytics. iv. Deploy Azure Service Bus using C#.NET to interact with Broker message from client app, create C# WCF web services interact with Cloud Database & deploy in Azure. v. Led a team of 6 members and grooming them to make them ready for client facing. October 2014 to December 2016 ➢ Project: GCP-GDW & IPSR ➢ Role: Senior BI Developer and Module lead ➢ Tools/Technology: SQL Server, T-SQL, SSIS, ETL, DBA, C#, TFS, Free Text search, MongoDB, Python ➢ Details: i. Data Warehouse migration in AGILE/SCRUM methodologies, understanding business requirements & design Data Model for SQL Data Warehouse. ii. Predictive search/Free-Text Search (like Google search) from billions of rows in just 2-3 seconds from .NET application using C# web services & jQuery. iii. Create & deploy SSIS packages for Data Migration from heterogeneous source to SQL Server and provide end-to-end ETL solution. Performance tuning of SSIS (with SQL tuning) to load 50 million data rows within 1.25 hours from remote data source to SQL server. iv. Design Data-Mart and load data from SQL Server DW through SSIS packages, developing SSRS and PowerBI report for Data Quality check, after & before Migration. v. DBA tasks (Back up strategy and SQL jobs, Recovery mode, Performance tuning, Index rebuilding, Availability groups). vi. Integrate DQS (Data Quality Service) with SSIS for bulk data cleansing during migrations. vii. Developing C# WCF web services to expose data to clients like Infosys, TCS.
  • 3. April 2012 to October 2014 ➢ Project: MetLife USBPM ➢ Role: Application developer ➢ Tools/Technology: Visual Studio, SQL Server, T-SQL, SSIS, C#, ADO.NET, JavaScript, jQuery ➢ Details: i. Production Support project of 24X7 hours for MetLife, handling bulk production data & process to SQL server through SSIS packages. ii. Monitoring of running live Production jobs through IBM Tivoli/Maestro, solving issues on Ticketing System on priority basis. iii. Automation of existing process to minimize day-to-day manual efforts & errors. iv. Upgrade of existing SSIS packages, tuning existing Stored Procedures, solving Production issues with priority as per Business tickets. Additional Responsibility: ➢ Appointed as Mentor for newly joined people for internship at Qualcomm. ➢ Active member of Core technical interviewers’ panel for lateral hiring as an interviewer at Qualcomm, IBM & PwC. ➢ Appointed as a trainer for fresh campus hired graduates/associates from India’s premier institutes on technical competencies (Analytics, SQL, Data-Warehouse, DB Migration, ETL), & received appreciation from Executive Director for quality training delivered at PwC. Academic Project: ➢ Online Job Portal System – University Final Year Project o This is a web-based application developed in JAVA and Oracle 9i in 2011. ➢ Online Air Line Reservation System – IETE Training Institute (Summer Intern) o This is a web-based application developed in JAVA and Oracle 9i in 2011. ➢ Computer Retail Business Management – IBM Certified Academy o This is a SQL/PL-SQL application developed in Oracle 9i in 2010. Academic Details: Qualification University/Board Year Marks Obtained B. Tech W.B.U.T. 2011 7.86 CGPA 12th W.B.C.H.S.E. 2006 MATH PHYS CHEM BIOS Overall 195/200 152/200 167/200 165/200 84.10% 10th W.B.B.S.E. 2004 Math Ph. Sc. Life Sc. Overall 91/100 95/100 88/100 85.37% Interests: ➢ Hadoop, Big-Data, Machine Learning, Data Warehouse, Data Analytics, Business Intelligence, Python, R, Data Mining, Blog writing Personal Details: ➢ DOB : September 11, 1988 ➢ Gender : Male ➢ Marital Status : Unmarried ➢ Language Known : English, Hindi, Bengali