SlideShare a Scribd company logo
1 of 12
SNJB’S LATE SAU K. B. JAIN COE, CHANDWAD
DEPARTMENT OF COMPUTER ENGINEERING
ACADEMIC YEAR 2020-21
SUBJECT: DATAANALYTICS
CASE STUDY ON GINA
By
Name: Divya Prafull Wani
Roll No:41
Class: BE Computer
Date:16-08-2020
Case StudyOn GINA
1
GINA
■ Global Innovation Network and Analysis.
■ The GINA case study provides an example of how a team applied the Data Analytics
Lifecycle to analyze innovation data at EMC.
■ GINA is a group of senior technologists located in centers of excellence around the world.
■ The GINA team thought its approach would provide a means to share ideas globally and
increase knowledge sharing among members who may be separated geographically.
■ It planned to create a data repository containing both structured and unstructured data to
accomplish 3 main goals .
1. Store Formal and informal data.
2. Track research from global technologists.
3. Mine the data for patterns and insights to improve the teams operations and strategy.
Case StudyOn GINA 2
Phase 1 : Discovery
■ Team Members and Roles
 Business user, project sponsor, project manager -Vice President from Office of CTO.
 BI analyst – person from IT
 Data Engineer and DBA – people from IT
 Data Scientist – distinguished engineer.
■ The data fell into two categories
 5 years of idea submissions from internal innovation contests.
 Minutes an notes representing innovation and research activity from around the world.
■ The data fell into two categories
 5 years of idea submissions from internal innovation contests.
 Minutes an notes representing innovation and research activity from around the world.
■ Hypothesis grouped into 2 categories
 Descriptive analytics of what is happening to spark further creativity, collaboration, an asset generation
 Predictive analytics to advise executive management of where it should be investing in the future.
Case StudyOn GINA 3
Phase 2 : Data Preparation
■ Set up anAnalytical Sandbox to store and experiment on the data.
■ Discovered that certain data needed conditioning and normalization and that missing
datasets were critical.
■ Team recognized that poor quality data could impact subsequent steps.
■ They discovered many names were misspelled and problems with extra spaces.
■ Important to determine what level of data quality and cleanliness was sufficient for
the project being undertaken.
Case StudyOn GINA 4
Phase 3: Model Planning
■ Included following considerations :
 Identify the right milestones to achieve the goals
 Trace how people move ideas from each ,milestone towards the goal.
 Once this is done, trace ideas that die and others that reach the goal.Compare the
journeys of ideas that make it and those that do not.
 Compare times and outcomes using a few different methods.These could be as simple
as t-tests or perhaps involve different types of classificationAlgorithms.
Case StudyOn GINA 5
Phase 4 : Model Building
■ The GINA team employed several analytical methods.This included work by the data
scientist using Natural Language Processing (NLP) techniques on the textual
descriptions of the innovation Roadmap ideas.
■ Social Network Analysis using R and Rstudio.
■ Developed SocialGraphs andVisualizations.
Case StudyOn GINA 6
Social Graph Data
Submitters and Finalists
and Graph of top
innovation influencers
• Fig shows socai graphs that portray
relationships between idea submitters within
GINA.
• Each colour represents an innovator from a
different country.
• The large dots with red circles around them
represent hubs.
• A hub represents a person with high
connectivity and a high “betweeness” score.
• The team usedTableau software for data
visualization and exploration and used the
Pivotal Greenplum database as the main data
repository and analytics engine.
Case StudyOn GINA 7
Phase 5 : Communicate Results
■ This project was considered successful in identifying boundary spanners and hidden
innovators.
■ The GINA project promoted knowledge sharing related to innovation an researchers
spanning multiple areas within the company and outside of it.
■ The GINA also enables EMC to cultivate additional property leads to research topics
and provided opportunities to forge relationships with universities for joint academic
research in the fields of Data Science and Big Data.
■ Study was successful in identifying hidden innovators.(found high density in Cork,
Ireland)
■ The CTO office launched longitudinal studies.
Case StudyOn GINA 8
Phase 6: Operationalize
■ Deployment was not really discussed
■ Key Findings
 Need more data in future
 Some data were sensitive
 A parallel initiative needs to be created to improve basic BI activities.
 A mechanism is needed to continually reevaluate the model after deployment.
Case StudyOn GINA 9
Analytic Plan from the EMC GINA
Project
Case StudyOn GINA 10
Reference
■ Data-science-and-big-data-analy-nieizv_book
■ https://bhavanakhivsara.wordpress.com/subjects/data-analytics/
Case StudyOn GINA 11
ThankYou!!
Case StudyOn GINA 12

More Related Content

What's hot

CRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologyCRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologySergey Shelpuk
 
Privacy, security and ethics in data science
Privacy, security and ethics in data sciencePrivacy, security and ethics in data science
Privacy, security and ethics in data scienceNikolaos Vasiloglou
 
Information visualization - introduction
Information visualization - introductionInformation visualization - introduction
Information visualization - introductionKatrien Verbert
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analyticsAhmed Banafa
 
Realtime Sentiment Analysis Application Using Hadoop and HBase
Realtime Sentiment Analysis Application Using Hadoop and HBaseRealtime Sentiment Analysis Application Using Hadoop and HBase
Realtime Sentiment Analysis Application Using Hadoop and HBaseDataWorks Summit
 
Statistics and Data Mining
Statistics and  Data MiningStatistics and  Data Mining
Statistics and Data MiningR A Akerkar
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3varshakumar21
 
Dm from databases perspective u 1
Dm from databases perspective u 1Dm from databases perspective u 1
Dm from databases perspective u 1sakthyvel3
 
Big data visualization
Big data visualizationBig data visualization
Big data visualizationAnurag Gupta
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?BalaBit
 
VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...
VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...
VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...vtunotesbysree
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
DMTM Lecture 06 Classification evaluation
DMTM Lecture 06 Classification evaluationDMTM Lecture 06 Classification evaluation
DMTM Lecture 06 Classification evaluationPier Luca Lanzi
 
States, state graphs and transition testing
States, state graphs and transition testingStates, state graphs and transition testing
States, state graphs and transition testingABHISHEK KUMAR
 
Storage organization and stack allocation of space
Storage organization and stack allocation of spaceStorage organization and stack allocation of space
Storage organization and stack allocation of spaceMuhammad Haroon
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data ScienceEdureka!
 
Big Data to avoid weather related flight delays
Big Data to avoid weather related flight delaysBig Data to avoid weather related flight delays
Big Data to avoid weather related flight delaysAkshatGiri3
 

What's hot (20)

CRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologyCRISP-DM: a data science project methodology
CRISP-DM: a data science project methodology
 
Privacy, security and ethics in data science
Privacy, security and ethics in data sciencePrivacy, security and ethics in data science
Privacy, security and ethics in data science
 
Information visualization - introduction
Information visualization - introductionInformation visualization - introduction
Information visualization - introduction
 
18 Data Streams
18 Data Streams18 Data Streams
18 Data Streams
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analytics
 
Unit 2(knowledge)
Unit 2(knowledge)Unit 2(knowledge)
Unit 2(knowledge)
 
Realtime Sentiment Analysis Application Using Hadoop and HBase
Realtime Sentiment Analysis Application Using Hadoop and HBaseRealtime Sentiment Analysis Application Using Hadoop and HBase
Realtime Sentiment Analysis Application Using Hadoop and HBase
 
Statistics and Data Mining
Statistics and  Data MiningStatistics and  Data Mining
Statistics and Data Mining
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
 
Dm from databases perspective u 1
Dm from databases perspective u 1Dm from databases perspective u 1
Dm from databases perspective u 1
 
Big data visualization
Big data visualizationBig data visualization
Big data visualization
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?
 
VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...
VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...
VTU 7TH SEM CSE DATA WAREHOUSING AND DATA MINING SOLVED PAPERS OF DEC2013 JUN...
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
DMTM Lecture 06 Classification evaluation
DMTM Lecture 06 Classification evaluationDMTM Lecture 06 Classification evaluation
DMTM Lecture 06 Classification evaluation
 
States, state graphs and transition testing
States, state graphs and transition testingStates, state graphs and transition testing
States, state graphs and transition testing
 
Storage organization and stack allocation of space
Storage organization and stack allocation of spaceStorage organization and stack allocation of space
Storage organization and stack allocation of space
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
 
Data Science Tools
Data Science ToolsData Science Tools
Data Science Tools
 
Big Data to avoid weather related flight delays
Big Data to avoid weather related flight delaysBig Data to avoid weather related flight delays
Big Data to avoid weather related flight delays
 

Similar to Analyzing Innovation Data at EMC's GINA Team

IT Capstone Report Fall 2022.pptx
IT Capstone Report Fall 2022.pptxIT Capstone Report Fall 2022.pptx
IT Capstone Report Fall 2022.pptxJack Zheng
 
New Data Science Framework for Analysing and Mining Big Data - Charith Silva
New Data Science Framework for Analysing and Mining Big Data - Charith SilvaNew Data Science Framework for Analysing and Mining Big Data - Charith Silva
New Data Science Framework for Analysing and Mining Big Data - Charith SilvaInstitute of Contemporary Sciences
 
Customer Research For Product Managers - Dawn of The Data Age Lecture Series
Customer Research For Product Managers - Dawn of The Data Age Lecture SeriesCustomer Research For Product Managers - Dawn of The Data Age Lecture Series
Customer Research For Product Managers - Dawn of The Data Age Lecture SeriesLuciano Pesci, PhD
 
A Survey on Big Data Analytics
A Survey on Big Data AnalyticsA Survey on Big Data Analytics
A Survey on Big Data AnalyticsBHARATH KUMAR
 
Fealing - Improving indicators to inform policy
Fealing - Improving indicators to inform policyFealing - Improving indicators to inform policy
Fealing - Improving indicators to inform policyinnovationoecd
 
1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdfAyele40
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applicationsSubrat Swain
 
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...Joshua Gorinson
 
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...Sahilakhurana
 
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...Sarah Jones
 
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdfMuhammadZafarHasan
 
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...apidays
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceUmmeSalmaM1
 
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of TanzaniaHadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of Tanzaniaijsrd.com
 

Similar to Analyzing Innovation Data at EMC's GINA Team (20)

IT Capstone Report Fall 2022.pptx
IT Capstone Report Fall 2022.pptxIT Capstone Report Fall 2022.pptx
IT Capstone Report Fall 2022.pptx
 
Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...
 
New Data Science Framework for Analysing and Mining Big Data - Charith Silva
New Data Science Framework for Analysing and Mining Big Data - Charith SilvaNew Data Science Framework for Analysing and Mining Big Data - Charith Silva
New Data Science Framework for Analysing and Mining Big Data - Charith Silva
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
 
Customer Research For Product Managers - Dawn of The Data Age Lecture Series
Customer Research For Product Managers - Dawn of The Data Age Lecture SeriesCustomer Research For Product Managers - Dawn of The Data Age Lecture Series
Customer Research For Product Managers - Dawn of The Data Age Lecture Series
 
A Survey on Big Data Analytics
A Survey on Big Data AnalyticsA Survey on Big Data Analytics
A Survey on Big Data Analytics
 
Fealing - Improving indicators to inform policy
Fealing - Improving indicators to inform policyFealing - Improving indicators to inform policy
Fealing - Improving indicators to inform policy
 
1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf1. Overview_of_data_analytics (1).pdf
1. Overview_of_data_analytics (1).pdf
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applications
 
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
 
Ojcst vol12 n4_p_132-146
Ojcst vol12 n4_p_132-146Ojcst vol12 n4_p_132-146
Ojcst vol12 n4_p_132-146
 
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
 
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon HodsonCODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
 
Software Analytics
Software AnalyticsSoftware Analytics
Software Analytics
 
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
 
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
 
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
 
PlanetData Project Overview
PlanetData Project OverviewPlanetData Project Overview
PlanetData Project Overview
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of TanzaniaHadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
 

More from divyawani2

Case Study on Cray T3E Architecture
Case Study on Cray T3E ArchitectureCase Study on Cray T3E Architecture
Case Study on Cray T3E Architecturedivyawani2
 
Case study on Transaction in Grocery Store
Case study on Transaction in Grocery Store Case study on Transaction in Grocery Store
Case study on Transaction in Grocery Store divyawani2
 
Case study on smart card (embeded system) based on IOT
Case study on smart card (embeded system) based on IOTCase study on smart card (embeded system) based on IOT
Case study on smart card (embeded system) based on IOTdivyawani2
 
Automatic Water Dispenser using IOT
Automatic Water Dispenser using IOTAutomatic Water Dispenser using IOT
Automatic Water Dispenser using IOTdivyawani2
 
Case study on automatic washing machine based on Internet of Things(IOT)
Case study on automatic washing machine based on Internet of Things(IOT)Case study on automatic washing machine based on Internet of Things(IOT)
Case study on automatic washing machine based on Internet of Things(IOT)divyawani2
 
Flutter technology Based on Web Development
Flutter technology Based on Web Development Flutter technology Based on Web Development
Flutter technology Based on Web Development divyawani2
 

More from divyawani2 (6)

Case Study on Cray T3E Architecture
Case Study on Cray T3E ArchitectureCase Study on Cray T3E Architecture
Case Study on Cray T3E Architecture
 
Case study on Transaction in Grocery Store
Case study on Transaction in Grocery Store Case study on Transaction in Grocery Store
Case study on Transaction in Grocery Store
 
Case study on smart card (embeded system) based on IOT
Case study on smart card (embeded system) based on IOTCase study on smart card (embeded system) based on IOT
Case study on smart card (embeded system) based on IOT
 
Automatic Water Dispenser using IOT
Automatic Water Dispenser using IOTAutomatic Water Dispenser using IOT
Automatic Water Dispenser using IOT
 
Case study on automatic washing machine based on Internet of Things(IOT)
Case study on automatic washing machine based on Internet of Things(IOT)Case study on automatic washing machine based on Internet of Things(IOT)
Case study on automatic washing machine based on Internet of Things(IOT)
 
Flutter technology Based on Web Development
Flutter technology Based on Web Development Flutter technology Based on Web Development
Flutter technology Based on Web Development
 

Recently uploaded

办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 

Recently uploaded (20)

办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 

Analyzing Innovation Data at EMC's GINA Team

  • 1. SNJB’S LATE SAU K. B. JAIN COE, CHANDWAD DEPARTMENT OF COMPUTER ENGINEERING ACADEMIC YEAR 2020-21 SUBJECT: DATAANALYTICS CASE STUDY ON GINA By Name: Divya Prafull Wani Roll No:41 Class: BE Computer Date:16-08-2020 Case StudyOn GINA 1
  • 2. GINA ■ Global Innovation Network and Analysis. ■ The GINA case study provides an example of how a team applied the Data Analytics Lifecycle to analyze innovation data at EMC. ■ GINA is a group of senior technologists located in centers of excellence around the world. ■ The GINA team thought its approach would provide a means to share ideas globally and increase knowledge sharing among members who may be separated geographically. ■ It planned to create a data repository containing both structured and unstructured data to accomplish 3 main goals . 1. Store Formal and informal data. 2. Track research from global technologists. 3. Mine the data for patterns and insights to improve the teams operations and strategy. Case StudyOn GINA 2
  • 3. Phase 1 : Discovery ■ Team Members and Roles  Business user, project sponsor, project manager -Vice President from Office of CTO.  BI analyst – person from IT  Data Engineer and DBA – people from IT  Data Scientist – distinguished engineer. ■ The data fell into two categories  5 years of idea submissions from internal innovation contests.  Minutes an notes representing innovation and research activity from around the world. ■ The data fell into two categories  5 years of idea submissions from internal innovation contests.  Minutes an notes representing innovation and research activity from around the world. ■ Hypothesis grouped into 2 categories  Descriptive analytics of what is happening to spark further creativity, collaboration, an asset generation  Predictive analytics to advise executive management of where it should be investing in the future. Case StudyOn GINA 3
  • 4. Phase 2 : Data Preparation ■ Set up anAnalytical Sandbox to store and experiment on the data. ■ Discovered that certain data needed conditioning and normalization and that missing datasets were critical. ■ Team recognized that poor quality data could impact subsequent steps. ■ They discovered many names were misspelled and problems with extra spaces. ■ Important to determine what level of data quality and cleanliness was sufficient for the project being undertaken. Case StudyOn GINA 4
  • 5. Phase 3: Model Planning ■ Included following considerations :  Identify the right milestones to achieve the goals  Trace how people move ideas from each ,milestone towards the goal.  Once this is done, trace ideas that die and others that reach the goal.Compare the journeys of ideas that make it and those that do not.  Compare times and outcomes using a few different methods.These could be as simple as t-tests or perhaps involve different types of classificationAlgorithms. Case StudyOn GINA 5
  • 6. Phase 4 : Model Building ■ The GINA team employed several analytical methods.This included work by the data scientist using Natural Language Processing (NLP) techniques on the textual descriptions of the innovation Roadmap ideas. ■ Social Network Analysis using R and Rstudio. ■ Developed SocialGraphs andVisualizations. Case StudyOn GINA 6
  • 7. Social Graph Data Submitters and Finalists and Graph of top innovation influencers • Fig shows socai graphs that portray relationships between idea submitters within GINA. • Each colour represents an innovator from a different country. • The large dots with red circles around them represent hubs. • A hub represents a person with high connectivity and a high “betweeness” score. • The team usedTableau software for data visualization and exploration and used the Pivotal Greenplum database as the main data repository and analytics engine. Case StudyOn GINA 7
  • 8. Phase 5 : Communicate Results ■ This project was considered successful in identifying boundary spanners and hidden innovators. ■ The GINA project promoted knowledge sharing related to innovation an researchers spanning multiple areas within the company and outside of it. ■ The GINA also enables EMC to cultivate additional property leads to research topics and provided opportunities to forge relationships with universities for joint academic research in the fields of Data Science and Big Data. ■ Study was successful in identifying hidden innovators.(found high density in Cork, Ireland) ■ The CTO office launched longitudinal studies. Case StudyOn GINA 8
  • 9. Phase 6: Operationalize ■ Deployment was not really discussed ■ Key Findings  Need more data in future  Some data were sensitive  A parallel initiative needs to be created to improve basic BI activities.  A mechanism is needed to continually reevaluate the model after deployment. Case StudyOn GINA 9
  • 10. Analytic Plan from the EMC GINA Project Case StudyOn GINA 10