SlideShare a Scribd company logo
z
SNJB’s Late Sau K. B. Jain COE, Chandwad
Department of Computer Engineering
Academic Year- 2020-21
Subject: Data Analytics
z
 Global Innovation Network and Analytics (GINA) team is a group of senior
technologists located in centers of excellence (COEs) around the world.
 This team’s charter is to engage employees across global COEs to drive
innovation, research, and university partnerships.
 The GINA case study provides an example of how a team applied the
Data Analytics Lifecycle to analyze innovation data at EMC.
 Innovation is typically a difficult concept to measure, and this team wanted
to look for ways to use advanced analytical methods to identify key
innovators within the company.
z
 The GINA team thought its approach would provide a means to share
ideas globally and increase knowledge sharing among GINA members who
may be separated geographically .
 It planned to create a data repository containing both structured and
unstructured data to accomplish three main goals:
 Store formal and informal data.
 Track research from global technologists.
 Mine the data for patterns and insights to improve the team’s
operations and strategy
z
 In the GINA project’s discovery phase, the team began identifying data
sources.
 The Various Roles are involved in this phase :
 Business user, project sponsor, project manager :– Vice President from Office
of CTO
 BI analyst :– person from IT
 Data engineer and DBA :– people from IT
 Data scientist :– distinguished engineer
z
 The data for the project fell into two main categories :-
 Innovation Roadmap.
 Data encompassed minutes and notes representing innovation
and research activity from around the world.
 The GINA (Hypothesis) can be grouped into two categories :-
 Descriptive analytics of what is currently happening to spark further
creativity, collaboration, and asset generation.
 Predictive analytics to advise executive management of where it
should be investing in the future. Global Innovation Network.
z
 IT department to set up a new analytics sandbox to store and experiment
on the data.
 The data scientists and data engineers began to notice that certain data
needed conditioning and normalization.
 As the team explored the data, it quickly realized that if it did not have
data of sufficient quality or could not get good quality data, it would not be
able to perform the subsequent steps in the lifecycle process.
 Important to determine what level of data quality and cleanliness was
sufficient for the project being undertaken.
z
 The team made a decision to initiate a longitudinal study to begin tracking data
points over time regarding people developing new intellectual property.
 The parameters related to the scope of the study included the following
considerations:
 Identify the right milestones to achieve this goal.
 Trace how people move ideas from each milestone toward the goal.
 Once this is done, trace ideas that die, and trace others that reach the goal.
Compare the journeys of ideas that make it and those that do not.
 Compare the times and the outcomes using a few different methods (depending on
how the data is collected and assembled). These could be as simple as t-tests or
perhaps involve different types of classification algorithms.
z
 The GINA team employed several analytical methods. This included work by
the data scientist using Natural Language Processing (NLP) techniques on
the textual descriptions of the Innovation Roadmap ideas.
 Social network analysis using R and Rstudio.
Fig. Social graph [27] visualization of idea submitters and finalists.
z
 Fig shows social graphs that portray the relationships between idea
submitters within GINA.
 Each colour represents an innovator from a different country.
 The large dots with red circles around them represent hubs. A hub
represents a person with high connectivity and a high “betweenness” score.
 The team used Tableau software for data visualization and exploration and
used the Pivotal Greenplum database as the main data repository and
analytics engine.
z
 This project was considered successful in identifying boundary spanners
and hidden innovators.
 The GINA project promoted knowledge sharing related to innovation and
researchers spanning multiple areas within the company and outside of it.
 GINA also enabled EMC to cultivate additional intellectual property that
led to additional research topics and provided opportunities to forge
relationships with universities for joint academic research in the fields of
Data Science and Big Data.
z
 Study was successful in in identifying hidden innovators :
 Found high density of innovators in Cork, Ireland.
 The CTO office launched longitudinal studies to begin data collection
efforts and track innovation results over longer periods of time.
z
 Deployment was not really discussed .
 Key findings :
 Need more data in future
 Some data were sensitive
 A parallel initiative needs to be created to improve basic BI activities
 A mechanism is needed to continually reevaluate the model after
deployment
z
z
 https://data-science-and-big-data-analy-nieizv_book.pdf.
 https://bhavanakhivsara.files.wordpress.com/2018/07/data
-analytics-unit-i-1.pptx

More Related Content

What's hot

Vector clock algorithm
Vector clock algorithmVector clock algorithm
Vector clock algorithm
S. Anbu
 
key distribution in network security
key distribution in network securitykey distribution in network security
key distribution in network security
babak danyal
 
ELEMENTS OF TRANSPORT PROTOCOL
ELEMENTS OF TRANSPORT PROTOCOLELEMENTS OF TRANSPORT PROTOCOL
ELEMENTS OF TRANSPORT PROTOCOL
Shashank Rustagi
 
Genetic programming
Genetic programmingGenetic programming
Genetic programming
Meghna Singh
 
OpenGL Mini Projects With Source Code [ Computer Graphics ]
OpenGL Mini Projects With Source Code [ Computer Graphics ]OpenGL Mini Projects With Source Code [ Computer Graphics ]
OpenGL Mini Projects With Source Code [ Computer Graphics ]
Daffodil International University
 
Ecg analysis in the cloud
Ecg analysis in the cloudEcg analysis in the cloud
Ecg analysis in the cloud
gaurav jain
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
RohithND
 
Lecture #01
Lecture #01Lecture #01
Lecture #01
Konpal Darakshan
 
Semantic nets in artificial intelligence
Semantic nets in artificial intelligenceSemantic nets in artificial intelligence
Semantic nets in artificial intelligence
harshita virwani
 
Diabetic Retinopathy.pptx
Diabetic Retinopathy.pptxDiabetic Retinopathy.pptx
Diabetic Retinopathy.pptx
NGOKUL3
 
GFS & HDFS Introduction
GFS & HDFS IntroductionGFS & HDFS Introduction
GFS & HDFS Introduction
Hariharan Ganesan
 
Issues in knowledge representation
Issues in knowledge representationIssues in knowledge representation
Issues in knowledge representationSravanthi Emani
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
Knoldus Inc.
 
Stock Market Prediction using Machine Learning
Stock Market Prediction using Machine LearningStock Market Prediction using Machine Learning
Stock Market Prediction using Machine Learning
Aravind Balaji
 
I. AO* SEARCH ALGORITHM
I. AO* SEARCH ALGORITHMI. AO* SEARCH ALGORITHM
I. AO* SEARCH ALGORITHM
vikas dhakane
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computing
huda2018
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
Mohit Saini
 
And or graph
And or graphAnd or graph
And or graph
Ali A Jalil
 
Semantic net in AI
Semantic net in AISemantic net in AI
Semantic net in AI
ShahDhruv21
 
Data Mining: Association Rules Basics
Data Mining: Association Rules BasicsData Mining: Association Rules Basics
Data Mining: Association Rules Basics
Benazir Income Support Program (BISP)
 

What's hot (20)

Vector clock algorithm
Vector clock algorithmVector clock algorithm
Vector clock algorithm
 
key distribution in network security
key distribution in network securitykey distribution in network security
key distribution in network security
 
ELEMENTS OF TRANSPORT PROTOCOL
ELEMENTS OF TRANSPORT PROTOCOLELEMENTS OF TRANSPORT PROTOCOL
ELEMENTS OF TRANSPORT PROTOCOL
 
Genetic programming
Genetic programmingGenetic programming
Genetic programming
 
OpenGL Mini Projects With Source Code [ Computer Graphics ]
OpenGL Mini Projects With Source Code [ Computer Graphics ]OpenGL Mini Projects With Source Code [ Computer Graphics ]
OpenGL Mini Projects With Source Code [ Computer Graphics ]
 
Ecg analysis in the cloud
Ecg analysis in the cloudEcg analysis in the cloud
Ecg analysis in the cloud
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Lecture #01
Lecture #01Lecture #01
Lecture #01
 
Semantic nets in artificial intelligence
Semantic nets in artificial intelligenceSemantic nets in artificial intelligence
Semantic nets in artificial intelligence
 
Diabetic Retinopathy.pptx
Diabetic Retinopathy.pptxDiabetic Retinopathy.pptx
Diabetic Retinopathy.pptx
 
GFS & HDFS Introduction
GFS & HDFS IntroductionGFS & HDFS Introduction
GFS & HDFS Introduction
 
Issues in knowledge representation
Issues in knowledge representationIssues in knowledge representation
Issues in knowledge representation
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
 
Stock Market Prediction using Machine Learning
Stock Market Prediction using Machine LearningStock Market Prediction using Machine Learning
Stock Market Prediction using Machine Learning
 
I. AO* SEARCH ALGORITHM
I. AO* SEARCH ALGORITHMI. AO* SEARCH ALGORITHM
I. AO* SEARCH ALGORITHM
 
Data-Intensive Technologies for Cloud Computing
Data-Intensive Technologies for CloudComputingData-Intensive Technologies for CloudComputing
Data-Intensive Technologies for Cloud Computing
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
And or graph
And or graphAnd or graph
And or graph
 
Semantic net in AI
Semantic net in AISemantic net in AI
Semantic net in AI
 
Data Mining: Association Rules Basics
Data Mining: Association Rules BasicsData Mining: Association Rules Basics
Data Mining: Association Rules Basics
 

Similar to Case study on gina(gobal innovation network and analysis)

Application and Methods of Deep Learning in IoT
Application and Methods of Deep Learning in IoTApplication and Methods of Deep Learning in IoT
Application and Methods of Deep Learning in IoT
IJAEMSJORNAL
 
Effective Remote Design Thinking: A Basic Essential For Global Companies To D...
Effective Remote Design Thinking: A Basic Essential For Global Companies To D...Effective Remote Design Thinking: A Basic Essential For Global Companies To D...
Effective Remote Design Thinking: A Basic Essential For Global Companies To D...
Dr. Vidya Priya Rao, Founder
 
TTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining TechniqueTTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining Technique
Mehmet Beyaz
 
A Comprehensive Overview of Advance Techniques, Applications and Challenges i...
A Comprehensive Overview of Advance Techniques, Applications and Challenges i...A Comprehensive Overview of Advance Techniques, Applications and Challenges i...
A Comprehensive Overview of Advance Techniques, Applications and Challenges i...
IRJTAE
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
NIET Journal of Engineering & Technology (NIETJET)
 
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon HodsonCODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
Academy of Science of South Africa (ASSAf)
 
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
Joshua Gorinson
 
Ojcst vol12 n4_p_132-146
Ojcst vol12 n4_p_132-146Ojcst vol12 n4_p_132-146
Ojcst vol12 n4_p_132-146
Dr. Madhumala Ghosh
 
Information entanglement
Information entanglementInformation entanglement
Information entanglement
Willard Van De Bogart
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applicationsSubrat Swain
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Shiv Shakti Ghosh
 
Intro to AI.pptx
Intro to AI.pptxIntro to AI.pptx
Intro to AI.pptx
TusharGupta635112
 
Rdaeu russia_fg_1_july2014_final
Rdaeu  russia_fg_1_july2014_finalRdaeu  russia_fg_1_july2014_final
Rdaeu russia_fg_1_july2014_final
Research Data Alliance
 
10[1].1.1.115.9508
10[1].1.1.115.950810[1].1.1.115.9508
10[1].1.1.115.9508okeee
 
Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...
Technological Ecosystems for Enhancing Multiculturality
 
Data Science Demystified_ Journeying Through Insights and Innovations
Data Science Demystified_ Journeying Through Insights and InnovationsData Science Demystified_ Journeying Through Insights and Innovations
Data Science Demystified_ Journeying Through Insights and Innovations
Vaishali Pal
 
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
Kelly Taylor
 
Jisc visions: research
Jisc visions: researchJisc visions: research
Jisc visions: research
Jisc
 
New Forms of Data for e-Research
New Forms of Data for e-ResearchNew Forms of Data for e-Research
New Forms of Data for e-Research
David De Roure
 
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of TanzaniaHadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
ijsrd.com
 

Similar to Case study on gina(gobal innovation network and analysis) (20)

Application and Methods of Deep Learning in IoT
Application and Methods of Deep Learning in IoTApplication and Methods of Deep Learning in IoT
Application and Methods of Deep Learning in IoT
 
Effective Remote Design Thinking: A Basic Essential For Global Companies To D...
Effective Remote Design Thinking: A Basic Essential For Global Companies To D...Effective Remote Design Thinking: A Basic Essential For Global Companies To D...
Effective Remote Design Thinking: A Basic Essential For Global Companies To D...
 
TTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining TechniqueTTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining Technique
 
A Comprehensive Overview of Advance Techniques, Applications and Challenges i...
A Comprehensive Overview of Advance Techniques, Applications and Challenges i...A Comprehensive Overview of Advance Techniques, Applications and Challenges i...
A Comprehensive Overview of Advance Techniques, Applications and Challenges i...
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
 
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon HodsonCODATA: Open Data, FAIR Data and Open Science/Simon Hodson
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
 
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
 
Ojcst vol12 n4_p_132-146
Ojcst vol12 n4_p_132-146Ojcst vol12 n4_p_132-146
Ojcst vol12 n4_p_132-146
 
Information entanglement
Information entanglementInformation entanglement
Information entanglement
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applications
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
 
Intro to AI.pptx
Intro to AI.pptxIntro to AI.pptx
Intro to AI.pptx
 
Rdaeu russia_fg_1_july2014_final
Rdaeu  russia_fg_1_july2014_finalRdaeu  russia_fg_1_july2014_final
Rdaeu russia_fg_1_july2014_final
 
10[1].1.1.115.9508
10[1].1.1.115.950810[1].1.1.115.9508
10[1].1.1.115.9508
 
Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...Toward supporting decision-making under uncertainty in digital humanities wit...
Toward supporting decision-making under uncertainty in digital humanities wit...
 
Data Science Demystified_ Journeying Through Insights and Innovations
Data Science Demystified_ Journeying Through Insights and InnovationsData Science Demystified_ Journeying Through Insights and Innovations
Data Science Demystified_ Journeying Through Insights and Innovations
 
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
 
Jisc visions: research
Jisc visions: researchJisc visions: research
Jisc visions: research
 
New Forms of Data for e-Research
New Forms of Data for e-ResearchNew Forms of Data for e-Research
New Forms of Data for e-Research
 
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of TanzaniaHadoop and Big Data Readiness in Africa: A Case of Tanzania
Hadoop and Big Data Readiness in Africa: A Case of Tanzania
 

Recently uploaded

Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
rosedainty
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
Nguyen Thanh Tu Collection
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
Excellence Foundation for South Sudan
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
PedroFerreira53928
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
Steve Thomason
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
GeoBlogs
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 

Recently uploaded (20)

Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 

Case study on gina(gobal innovation network and analysis)

  • 1. z SNJB’s Late Sau K. B. Jain COE, Chandwad Department of Computer Engineering Academic Year- 2020-21 Subject: Data Analytics
  • 2. z  Global Innovation Network and Analytics (GINA) team is a group of senior technologists located in centers of excellence (COEs) around the world.  This team’s charter is to engage employees across global COEs to drive innovation, research, and university partnerships.  The GINA case study provides an example of how a team applied the Data Analytics Lifecycle to analyze innovation data at EMC.  Innovation is typically a difficult concept to measure, and this team wanted to look for ways to use advanced analytical methods to identify key innovators within the company.
  • 3. z  The GINA team thought its approach would provide a means to share ideas globally and increase knowledge sharing among GINA members who may be separated geographically .  It planned to create a data repository containing both structured and unstructured data to accomplish three main goals:  Store formal and informal data.  Track research from global technologists.  Mine the data for patterns and insights to improve the team’s operations and strategy
  • 4. z  In the GINA project’s discovery phase, the team began identifying data sources.  The Various Roles are involved in this phase :  Business user, project sponsor, project manager :– Vice President from Office of CTO  BI analyst :– person from IT  Data engineer and DBA :– people from IT  Data scientist :– distinguished engineer
  • 5. z  The data for the project fell into two main categories :-  Innovation Roadmap.  Data encompassed minutes and notes representing innovation and research activity from around the world.  The GINA (Hypothesis) can be grouped into two categories :-  Descriptive analytics of what is currently happening to spark further creativity, collaboration, and asset generation.  Predictive analytics to advise executive management of where it should be investing in the future. Global Innovation Network.
  • 6. z  IT department to set up a new analytics sandbox to store and experiment on the data.  The data scientists and data engineers began to notice that certain data needed conditioning and normalization.  As the team explored the data, it quickly realized that if it did not have data of sufficient quality or could not get good quality data, it would not be able to perform the subsequent steps in the lifecycle process.  Important to determine what level of data quality and cleanliness was sufficient for the project being undertaken.
  • 7. z  The team made a decision to initiate a longitudinal study to begin tracking data points over time regarding people developing new intellectual property.  The parameters related to the scope of the study included the following considerations:  Identify the right milestones to achieve this goal.  Trace how people move ideas from each milestone toward the goal.  Once this is done, trace ideas that die, and trace others that reach the goal. Compare the journeys of ideas that make it and those that do not.  Compare the times and the outcomes using a few different methods (depending on how the data is collected and assembled). These could be as simple as t-tests or perhaps involve different types of classification algorithms.
  • 8. z  The GINA team employed several analytical methods. This included work by the data scientist using Natural Language Processing (NLP) techniques on the textual descriptions of the Innovation Roadmap ideas.  Social network analysis using R and Rstudio. Fig. Social graph [27] visualization of idea submitters and finalists.
  • 9. z  Fig shows social graphs that portray the relationships between idea submitters within GINA.  Each colour represents an innovator from a different country.  The large dots with red circles around them represent hubs. A hub represents a person with high connectivity and a high “betweenness” score.  The team used Tableau software for data visualization and exploration and used the Pivotal Greenplum database as the main data repository and analytics engine.
  • 10. z  This project was considered successful in identifying boundary spanners and hidden innovators.  The GINA project promoted knowledge sharing related to innovation and researchers spanning multiple areas within the company and outside of it.  GINA also enabled EMC to cultivate additional intellectual property that led to additional research topics and provided opportunities to forge relationships with universities for joint academic research in the fields of Data Science and Big Data.
  • 11. z  Study was successful in in identifying hidden innovators :  Found high density of innovators in Cork, Ireland.  The CTO office launched longitudinal studies to begin data collection efforts and track innovation results over longer periods of time.
  • 12. z  Deployment was not really discussed .  Key findings :  Need more data in future  Some data were sensitive  A parallel initiative needs to be created to improve basic BI activities  A mechanism is needed to continually reevaluate the model after deployment
  • 13. z