SlideShare a Scribd company logo
PROJECT MANAGEMENT &
BIG DATA ANALYTICS
Sandeep Kumar PMP®
INTRODUCTION
IT Strategy & Business Transformation
Industries:
Media & Advertising, Telecom, BFSI,
FMCG, Manufacturing, BPO/KPO
Services:
Shared Service Delivery, GIC & Back
Office, PMO, Lean Six Sigma, Continuous
Improvement, Enterprise IT, ERP, Cloud &
Infrastructure, Development, Outsourcing
and IT Security & Governance
TWO STREAMS
Project Management of
Analytics
• Big Data
• Data Warehousing
• Lean Six Sigma
• In-memory computing
• Internet of things
• Social Media
Analytics of Project
Management
• Time-Cost-Spec Analytics
• Feasibility Analytics
• Resource Analytics
• Management of Collaboration
• Agile & SCRUM
APOLOGIES, DISCLAIMERS, ET AL
• Big Data is over-hyped
• Big Data is still evolving
• Analytics is old, the tools are new!
• Project Management solves most of the problems
• Its importance is usually understated
• The success of Big Data initiative lies primarily on the management, then
on the PM & the DS
• Hold the PM responsible only if you know what you want!
• The roles I talk about here are essentially with respect Big Data Projects
If you have
Questions,
I will try &
Answer
them!
UNDER SCANNER !
Big Data / Analytics
Myths
• It is mature and cool
• Is an extension of EDW
• Data Quality can be slightly compromised
• A single pre-built technology (e.g. Hadoop) will suffice
• Data scientists are easy to get
• Virtualization / Clustering will take care of infra needs
• If you have huge data, every solution is a Big Data solution
Project Management
Myths
• Managing only activities
• Just time and cost management
• Mere resource allocation
• Reaching the finishing-line!
• General management suffices
• Have time to learn
UNDER SCANNER !
Big Data / Analytics
Facts
• Responsible for Business Case and ROI
definitions
• Executive Sponsorship & Funds
• ‘Real’ Resource Provisioning
• Based on Enterprise Architecture
• Highly complex and iterative process
• Loads of scientific knowledge required
• Source of data increases every day
• Should be able to adapt with time
Project Management
Facts
• Responsible for Scope & acceptance by all
parties
• Direction setting & KPI-SF definitions
• ‘Real’ Resource Management
• Right to procure & deploy the appropriate
resource
• Stakeholder & Communication management
• Accountability and Responsibility for the success
(and failure)
THE DATA SCIENTIST
Key skills of a Data Scientist – the hard skills guy
• Basic Tools: Knowledge of statistical programming languages, like R or Python, and SQL
• Basic Statistics: Familiar with statistical tests, distributions, maximum likelihood estimators, etc.
• ETL Tools: Best in class like Informatica, IBM Infosphere, SAP BO, Oracle or SAS Data Integrator, Penta-ho, AB-Initio
• Machine Learning / Artificial Intelligence / Pattern Recognition: Methods for Classification and Regression like k-nearest
neighbours, random forests, etc.
• Multivariable Calculus & Linear Algebra: Specially required where data is used for predictive performance or algorithm
optimization
• Data Munging / Scrubbing or Cleanliness: For example inconsistent string formatting as ND or Del for New Delhi; date
alignment as [mm-dd-yyyy] or [dd-mm-yyyy] or [yyyy-dd-mm]
• Data Visualization & Communication tools: Principles of and tools of Data Visualization like ggplot and d3.js.
• Software Engineering: Strong software engineering background, SDLC, Agile, Scrum, DB techniques, Data intensive product development
• Software Testing skills – To make sure the output delivers what the business needs
• Basic Project Management Skills: Thinking like a Project Manager
THE PROJECT MANAGER
Key skills of a Project Manager – the soft skills guy
• Project Charter: Project Stakeholders and Objectives documented and signed-off
• Business Case: Asks the ‘whats’ and ‘whys’ of the business requirement
• Scheduling Tools: Creates a Plan of Action to answer the ‘hows’ of the project
• Vendor Management: Links up all 1st and 3rd party resources
• Risk Management: The real management tool, with the mitigant
• Communication Management: The core of collaboration and Management
• Software Engineering: Software engineering background with fair knowledge of tools
• Software Testing skills: To make sure the output delivers what the business needs
• Basic Data Management Skills: Thinking like a Data Scientist
MERGING ROLES
The Data Scientist
 The Requirements guy
 The Data Tools guy
 The Resource guy
 The Specialist
 The Enterprise Architect
 The Software guy
Breaking the Technical Barriers
The Project Manager
 The Requirements & Scope guy
 The Project Tools guy
 The Resource guy
 The Generalist
 The Program Manager
 The hardware & Software guy
Breaking the Cultural Barriers
WHY DO ANALYTICS
PROJECTS FAIL ?
When do Projects fail, in general?
• Not completed within budgets
• Not completed on time
• Not completed as per specifications
Whys and Wherefores…
• Poor scoping – unclear objective
• Inadequate resources – lack of talent
• Inappropriate Solution – wrong tool selection
• Bad planning – Insufficient analysis
• Bad execution – poor Project Management
“3 out of 4 Big Data Projects fail”
• Inaccurate Project Scope
• Lack of Talent
• Challenging Tools
• Even more Challenging Concepts
• Poor Planning
• Ownership Issues – Business
Initiative or IT Project?
Apache Hadoop vs. Apache Cassandra
55% of Big Data projects don’t get completed; in case of IT projects in general, it is only 25%.
THE SUCCESS MANTRA!
• Get a Data Scientist at the PM or SME
• Or, at least get the Data Scientist as one of the leads
• Ask very tough questions to sponsors for a Business Case
• Get the CFO and the End-User on your side – get the expectations right
• Take your time – use appropriate Project Management methodologies
• Use the most appropriate Platform / Tool
• Accept requirements’ volatility
• Scrum: accepting that the problem cannot be fully understood or defined, focusing instead on maximizing the team's ability to deliver quickly
• POC  Deployment  Acceptance  Enhancement  Deployment … …
• Agile: adaptive planning, evolutionary development, early delivery, continuous improvement
• Document and Handover to End-User at every stage
PM answers “How” as long as the Business knows “Why & What”
QUESTIONS, ANY?

More Related Content

What's hot

Pitfalls and pro-tips for effective and transparent Business Intelligence too...
Pitfalls and pro-tips for effective and transparent Business Intelligence too...Pitfalls and pro-tips for effective and transparent Business Intelligence too...
Pitfalls and pro-tips for effective and transparent Business Intelligence too...
Data Con LA
 
Webinar: Data Quality, Data Engineering, and Data Science
Webinar: Data Quality, Data Engineering, and Data ScienceWebinar: Data Quality, Data Engineering, and Data Science
Webinar: Data Quality, Data Engineering, and Data Science
DATAVERSITY
 
Leveraged Analytics at Scale
Leveraged Analytics at ScaleLeveraged Analytics at Scale
Leveraged Analytics at Scale
Domino Data Lab
 
Ashley Ohmann--Data Governance Final 011315
Ashley Ohmann--Data Governance Final 011315Ashley Ohmann--Data Governance Final 011315
Ashley Ohmann--Data Governance Final 011315Ashley Ohmann
 
Join 2017_Deep Dive_To Use or Not Use PDT's
Join 2017_Deep Dive_To Use or Not Use PDT'sJoin 2017_Deep Dive_To Use or Not Use PDT's
Join 2017_Deep Dive_To Use or Not Use PDT's
Looker
 
Partner Summit 15_AHa Moments_AshleyOhmann FINAL
Partner Summit 15_AHa Moments_AshleyOhmann FINALPartner Summit 15_AHa Moments_AshleyOhmann FINAL
Partner Summit 15_AHa Moments_AshleyOhmann FINALAshley Ohmann
 
Keys to Formulating an Effective Data Management Strategy in the Age of Data
Keys to Formulating an Effective Data Management Strategy in the Age of DataKeys to Formulating an Effective Data Management Strategy in the Age of Data
Keys to Formulating an Effective Data Management Strategy in the Age of Data
Impetus Technologies
 
How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?
DATAVERSITY
 
20220126 ARMA Calgary and Edmonton, Harnessing Your Information to Create Bus...
20220126 ARMA Calgary and Edmonton, Harnessing Your Information to Create Bus...20220126 ARMA Calgary and Edmonton, Harnessing Your Information to Create Bus...
20220126 ARMA Calgary and Edmonton, Harnessing Your Information to Create Bus...
Jesse Wilkins
 
Real-World Data Governance: Comparing World Class Solutions in Data Governanc...
Real-World Data Governance: Comparing World Class Solutions in Data Governanc...Real-World Data Governance: Comparing World Class Solutions in Data Governanc...
Real-World Data Governance: Comparing World Class Solutions in Data Governanc...DATAVERSITY
 
Competitive Intelligence and Big Data
Competitive Intelligence and Big DataCompetitive Intelligence and Big Data
Competitive Intelligence and Big Data
CID GmbH
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Thoughtworks
 
Keysto effectivedatavisualization fsfp
Keysto effectivedatavisualization fsfpKeysto effectivedatavisualization fsfp
Keysto effectivedatavisualization fsfp
DATAVERSITY
 
Data Detectives - Presentation
Data Detectives - PresentationData Detectives - Presentation
Data Detectives - PresentationClint Campbell
 
Correlation does not mean causation
Correlation does not mean causationCorrelation does not mean causation
Correlation does not mean causation
Peter Varhol
 
RWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance FrameworkRWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance Framework
DATAVERSITY
 
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
AIIM International
 
5 ways to improve performance through indexing
5 ways to improve performance through indexing5 ways to improve performance through indexing
5 ways to improve performance through indexingJason Strate
 
"Selling" Open Source 101
"Selling" Open Source 101"Selling" Open Source 101
"Selling" Open Source 101
Mohd Izhar Firdaus Ismail
 
[AIIM16] Automating Image Capture for Fun and Profit
[AIIM16] Automating Image Capture for Fun and Profit[AIIM16] Automating Image Capture for Fun and Profit
[AIIM16] Automating Image Capture for Fun and Profit
AIIM International
 

What's hot (20)

Pitfalls and pro-tips for effective and transparent Business Intelligence too...
Pitfalls and pro-tips for effective and transparent Business Intelligence too...Pitfalls and pro-tips for effective and transparent Business Intelligence too...
Pitfalls and pro-tips for effective and transparent Business Intelligence too...
 
Webinar: Data Quality, Data Engineering, and Data Science
Webinar: Data Quality, Data Engineering, and Data ScienceWebinar: Data Quality, Data Engineering, and Data Science
Webinar: Data Quality, Data Engineering, and Data Science
 
Leveraged Analytics at Scale
Leveraged Analytics at ScaleLeveraged Analytics at Scale
Leveraged Analytics at Scale
 
Ashley Ohmann--Data Governance Final 011315
Ashley Ohmann--Data Governance Final 011315Ashley Ohmann--Data Governance Final 011315
Ashley Ohmann--Data Governance Final 011315
 
Join 2017_Deep Dive_To Use or Not Use PDT's
Join 2017_Deep Dive_To Use or Not Use PDT'sJoin 2017_Deep Dive_To Use or Not Use PDT's
Join 2017_Deep Dive_To Use or Not Use PDT's
 
Partner Summit 15_AHa Moments_AshleyOhmann FINAL
Partner Summit 15_AHa Moments_AshleyOhmann FINALPartner Summit 15_AHa Moments_AshleyOhmann FINAL
Partner Summit 15_AHa Moments_AshleyOhmann FINAL
 
Keys to Formulating an Effective Data Management Strategy in the Age of Data
Keys to Formulating an Effective Data Management Strategy in the Age of DataKeys to Formulating an Effective Data Management Strategy in the Age of Data
Keys to Formulating an Effective Data Management Strategy in the Age of Data
 
How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?
 
20220126 ARMA Calgary and Edmonton, Harnessing Your Information to Create Bus...
20220126 ARMA Calgary and Edmonton, Harnessing Your Information to Create Bus...20220126 ARMA Calgary and Edmonton, Harnessing Your Information to Create Bus...
20220126 ARMA Calgary and Edmonton, Harnessing Your Information to Create Bus...
 
Real-World Data Governance: Comparing World Class Solutions in Data Governanc...
Real-World Data Governance: Comparing World Class Solutions in Data Governanc...Real-World Data Governance: Comparing World Class Solutions in Data Governanc...
Real-World Data Governance: Comparing World Class Solutions in Data Governanc...
 
Competitive Intelligence and Big Data
Competitive Intelligence and Big DataCompetitive Intelligence and Big Data
Competitive Intelligence and Big Data
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
 
Keysto effectivedatavisualization fsfp
Keysto effectivedatavisualization fsfpKeysto effectivedatavisualization fsfp
Keysto effectivedatavisualization fsfp
 
Data Detectives - Presentation
Data Detectives - PresentationData Detectives - Presentation
Data Detectives - Presentation
 
Correlation does not mean causation
Correlation does not mean causationCorrelation does not mean causation
Correlation does not mean causation
 
RWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance FrameworkRWDG Webinar: The New Non-Invasive Data Governance Framework
RWDG Webinar: The New Non-Invasive Data Governance Framework
 
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
 
5 ways to improve performance through indexing
5 ways to improve performance through indexing5 ways to improve performance through indexing
5 ways to improve performance through indexing
 
"Selling" Open Source 101
"Selling" Open Source 101"Selling" Open Source 101
"Selling" Open Source 101
 
[AIIM16] Automating Image Capture for Fun and Profit
[AIIM16] Automating Image Capture for Fun and Profit[AIIM16] Automating Image Capture for Fun and Profit
[AIIM16] Automating Image Capture for Fun and Profit
 

Viewers also liked

Project management for Big Data projects
Project management for Big Data projectsProject management for Big Data projects
Project management for Big Data projects
Sandeep Kumar, PMP®
 
BDM - project management in big data context.pptx
BDM -  project management in big data context.pptxBDM -  project management in big data context.pptx
BDM - project management in big data context.pptx
Jean-Louis Quéguiner
 
REFM's Demystifying The Excel Pro-Forma: What It Is And How You Can Begin To ...
REFM's Demystifying The Excel Pro-Forma: What It Is And How You Can Begin To ...REFM's Demystifying The Excel Pro-Forma: What It Is And How You Can Begin To ...
REFM's Demystifying The Excel Pro-Forma: What It Is And How You Can Begin To ...
Real Estate Financial Modeling
 
Big data project management
Big data project managementBig data project management
Big data project management
IMC Institute
 
Intro to Data Science for Enterprise Big Data
Intro to Data Science for Enterprise Big DataIntro to Data Science for Enterprise Big Data
Intro to Data Science for Enterprise Big Data
Paco Nathan
 
White Paper - Data Warehouse Project Management
White Paper - Data Warehouse Project ManagementWhite Paper - Data Warehouse Project Management
White Paper - Data Warehouse Project ManagementDavid Walker
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Data Science London
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
Bernard Marr
 

Viewers also liked (9)

Project management for Big Data projects
Project management for Big Data projectsProject management for Big Data projects
Project management for Big Data projects
 
BDM - project management in big data context.pptx
BDM -  project management in big data context.pptxBDM -  project management in big data context.pptx
BDM - project management in big data context.pptx
 
REFM's Demystifying The Excel Pro-Forma: What It Is And How You Can Begin To ...
REFM's Demystifying The Excel Pro-Forma: What It Is And How You Can Begin To ...REFM's Demystifying The Excel Pro-Forma: What It Is And How You Can Begin To ...
REFM's Demystifying The Excel Pro-Forma: What It Is And How You Can Begin To ...
 
Big data project management
Big data project managementBig data project management
Big data project management
 
Intro to Data Science for Enterprise Big Data
Intro to Data Science for Enterprise Big DataIntro to Data Science for Enterprise Big Data
Intro to Data Science for Enterprise Big Data
 
Healthcare IT Management
Healthcare IT ManagementHealthcare IT Management
Healthcare IT Management
 
White Paper - Data Warehouse Project Management
White Paper - Data Warehouse Project ManagementWhite Paper - Data Warehouse Project Management
White Paper - Data Warehouse Project Management
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 

Similar to Project management for Big Data projects

Big Data Project Manager
Big Data Project ManagerBig Data Project Manager
Big Data Project Manager
Mrinal Chakravarty, CEng
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture Requirements
DATAVERSITY
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
Data Blueprint
 
Lessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 yearLessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 year
Yao Yao
 
Advanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project DeliveryAdvanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project Delivery
Mark Constable
 
The Big Data Dream Team
The Big Data Dream TeamThe Big Data Dream Team
The Big Data Dream Team
Accenture Analytics
 
IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...
IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...
IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...
IWMW
 
Ajaykumar Reddy - Resume
Ajaykumar Reddy - ResumeAjaykumar Reddy - Resume
Ajaykumar Reddy - ResumeAJAY GOLLAPALLI
 
Intranet Design - How To Undertake An Intranet Redesign
Intranet Design - How To Undertake An Intranet RedesignIntranet Design - How To Undertake An Intranet Redesign
Intranet Design - How To Undertake An Intranet Redesign
Prescient Digital Media
 
Optimize your info-driven business processes. How to move from paper to digital
Optimize your info-driven business processes. How to move from paper to digitalOptimize your info-driven business processes. How to move from paper to digital
Optimize your info-driven business processes. How to move from paper to digital
Christiana Kozakou
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
Domino Data Lab
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceDr. Haxel Consult
 
Adding Hadoop to Your Analytics Mix?
Adding Hadoop to Your Analytics Mix?Adding Hadoop to Your Analytics Mix?
Adding Hadoop to Your Analytics Mix?
Think Big, a Teradata Company
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
DATAVERSITY
 
Data-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture RequirementsData-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture Requirements
DATAVERSITY
 
Technology Consulting by Prasanna
Technology Consulting by PrasannaTechnology Consulting by Prasanna
Technology Consulting by Prasanna
SupportGCI
 
Career Sessions OM SCM
Career Sessions OM SCMCareer Sessions OM SCM
Career Sessions OM SCM
SupportGCI
 
Career Conversation Technology Consulting
Career Conversation Technology ConsultingCareer Conversation Technology Consulting
Career Conversation Technology Consulting
SupportGCI
 
1.ppt
1.ppt1.ppt
Operations and Supply Chain CC
Operations and Supply Chain CCOperations and Supply Chain CC
Operations and Supply Chain CC
SupportGCI
 

Similar to Project management for Big Data projects (20)

Big Data Project Manager
Big Data Project ManagerBig Data Project Manager
Big Data Project Manager
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture Requirements
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
 
Lessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 yearLessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 year
 
Advanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project DeliveryAdvanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project Delivery
 
The Big Data Dream Team
The Big Data Dream TeamThe Big Data Dream Team
The Big Data Dream Team
 
IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...
IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...
IWMW 2004: It Always Takes Longer Than You Think (Even If You Think It Will T...
 
Ajaykumar Reddy - Resume
Ajaykumar Reddy - ResumeAjaykumar Reddy - Resume
Ajaykumar Reddy - Resume
 
Intranet Design - How To Undertake An Intranet Redesign
Intranet Design - How To Undertake An Intranet RedesignIntranet Design - How To Undertake An Intranet Redesign
Intranet Design - How To Undertake An Intranet Redesign
 
Optimize your info-driven business processes. How to move from paper to digital
Optimize your info-driven business processes. How to move from paper to digitalOptimize your info-driven business processes. How to move from paper to digital
Optimize your info-driven business processes. How to move from paper to digital
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
 
II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
Adding Hadoop to Your Analytics Mix?
Adding Hadoop to Your Analytics Mix?Adding Hadoop to Your Analytics Mix?
Adding Hadoop to Your Analytics Mix?
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
 
Data-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture RequirementsData-Ed Online Webinar: Data Architecture Requirements
Data-Ed Online Webinar: Data Architecture Requirements
 
Technology Consulting by Prasanna
Technology Consulting by PrasannaTechnology Consulting by Prasanna
Technology Consulting by Prasanna
 
Career Sessions OM SCM
Career Sessions OM SCMCareer Sessions OM SCM
Career Sessions OM SCM
 
Career Conversation Technology Consulting
Career Conversation Technology ConsultingCareer Conversation Technology Consulting
Career Conversation Technology Consulting
 
1.ppt
1.ppt1.ppt
1.ppt
 
Operations and Supply Chain CC
Operations and Supply Chain CCOperations and Supply Chain CC
Operations and Supply Chain CC
 

Recently uploaded

哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 

Recently uploaded (20)

哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 

Project management for Big Data projects

  • 1. PROJECT MANAGEMENT & BIG DATA ANALYTICS Sandeep Kumar PMP®
  • 2. INTRODUCTION IT Strategy & Business Transformation Industries: Media & Advertising, Telecom, BFSI, FMCG, Manufacturing, BPO/KPO Services: Shared Service Delivery, GIC & Back Office, PMO, Lean Six Sigma, Continuous Improvement, Enterprise IT, ERP, Cloud & Infrastructure, Development, Outsourcing and IT Security & Governance
  • 3. TWO STREAMS Project Management of Analytics • Big Data • Data Warehousing • Lean Six Sigma • In-memory computing • Internet of things • Social Media Analytics of Project Management • Time-Cost-Spec Analytics • Feasibility Analytics • Resource Analytics • Management of Collaboration • Agile & SCRUM
  • 4. APOLOGIES, DISCLAIMERS, ET AL • Big Data is over-hyped • Big Data is still evolving • Analytics is old, the tools are new! • Project Management solves most of the problems • Its importance is usually understated • The success of Big Data initiative lies primarily on the management, then on the PM & the DS • Hold the PM responsible only if you know what you want! • The roles I talk about here are essentially with respect Big Data Projects If you have Questions, I will try & Answer them!
  • 5. UNDER SCANNER ! Big Data / Analytics Myths • It is mature and cool • Is an extension of EDW • Data Quality can be slightly compromised • A single pre-built technology (e.g. Hadoop) will suffice • Data scientists are easy to get • Virtualization / Clustering will take care of infra needs • If you have huge data, every solution is a Big Data solution Project Management Myths • Managing only activities • Just time and cost management • Mere resource allocation • Reaching the finishing-line! • General management suffices • Have time to learn
  • 6. UNDER SCANNER ! Big Data / Analytics Facts • Responsible for Business Case and ROI definitions • Executive Sponsorship & Funds • ‘Real’ Resource Provisioning • Based on Enterprise Architecture • Highly complex and iterative process • Loads of scientific knowledge required • Source of data increases every day • Should be able to adapt with time Project Management Facts • Responsible for Scope & acceptance by all parties • Direction setting & KPI-SF definitions • ‘Real’ Resource Management • Right to procure & deploy the appropriate resource • Stakeholder & Communication management • Accountability and Responsibility for the success (and failure)
  • 7. THE DATA SCIENTIST Key skills of a Data Scientist – the hard skills guy • Basic Tools: Knowledge of statistical programming languages, like R or Python, and SQL • Basic Statistics: Familiar with statistical tests, distributions, maximum likelihood estimators, etc. • ETL Tools: Best in class like Informatica, IBM Infosphere, SAP BO, Oracle or SAS Data Integrator, Penta-ho, AB-Initio • Machine Learning / Artificial Intelligence / Pattern Recognition: Methods for Classification and Regression like k-nearest neighbours, random forests, etc. • Multivariable Calculus & Linear Algebra: Specially required where data is used for predictive performance or algorithm optimization • Data Munging / Scrubbing or Cleanliness: For example inconsistent string formatting as ND or Del for New Delhi; date alignment as [mm-dd-yyyy] or [dd-mm-yyyy] or [yyyy-dd-mm] • Data Visualization & Communication tools: Principles of and tools of Data Visualization like ggplot and d3.js. • Software Engineering: Strong software engineering background, SDLC, Agile, Scrum, DB techniques, Data intensive product development • Software Testing skills – To make sure the output delivers what the business needs • Basic Project Management Skills: Thinking like a Project Manager
  • 8. THE PROJECT MANAGER Key skills of a Project Manager – the soft skills guy • Project Charter: Project Stakeholders and Objectives documented and signed-off • Business Case: Asks the ‘whats’ and ‘whys’ of the business requirement • Scheduling Tools: Creates a Plan of Action to answer the ‘hows’ of the project • Vendor Management: Links up all 1st and 3rd party resources • Risk Management: The real management tool, with the mitigant • Communication Management: The core of collaboration and Management • Software Engineering: Software engineering background with fair knowledge of tools • Software Testing skills: To make sure the output delivers what the business needs • Basic Data Management Skills: Thinking like a Data Scientist
  • 9. MERGING ROLES The Data Scientist  The Requirements guy  The Data Tools guy  The Resource guy  The Specialist  The Enterprise Architect  The Software guy Breaking the Technical Barriers The Project Manager  The Requirements & Scope guy  The Project Tools guy  The Resource guy  The Generalist  The Program Manager  The hardware & Software guy Breaking the Cultural Barriers
  • 10. WHY DO ANALYTICS PROJECTS FAIL ? When do Projects fail, in general? • Not completed within budgets • Not completed on time • Not completed as per specifications Whys and Wherefores… • Poor scoping – unclear objective • Inadequate resources – lack of talent • Inappropriate Solution – wrong tool selection • Bad planning – Insufficient analysis • Bad execution – poor Project Management “3 out of 4 Big Data Projects fail” • Inaccurate Project Scope • Lack of Talent • Challenging Tools • Even more Challenging Concepts • Poor Planning • Ownership Issues – Business Initiative or IT Project? Apache Hadoop vs. Apache Cassandra 55% of Big Data projects don’t get completed; in case of IT projects in general, it is only 25%.
  • 11. THE SUCCESS MANTRA! • Get a Data Scientist at the PM or SME • Or, at least get the Data Scientist as one of the leads • Ask very tough questions to sponsors for a Business Case • Get the CFO and the End-User on your side – get the expectations right • Take your time – use appropriate Project Management methodologies • Use the most appropriate Platform / Tool • Accept requirements’ volatility • Scrum: accepting that the problem cannot be fully understood or defined, focusing instead on maximizing the team's ability to deliver quickly • POC  Deployment  Acceptance  Enhancement  Deployment … … • Agile: adaptive planning, evolutionary development, early delivery, continuous improvement • Document and Handover to End-User at every stage PM answers “How” as long as the Business knows “Why & What”