SlideShare a Scribd company logo
Applying Data Analytics Approach
in Systematic Literature Review:
Master Data Management Case Study
Faizura Haneem, Nazri Kama, Rosmah Ali, Ali Selamat
Advanced Informatics School
Universiti Teknologi Malaysia
26 Sept 2017
16th International Conference on Intelligent Software Methodologies, Tools, and Techniques
2 ADVANCED INFORMATICS SCHOOL, UTM
Problem Background
What is the progress of the
research domain? Does it
still relevant to study?
Which database should I
go to search the literature
of the research domain?
What are the common/
related terms in the
research domain?
What topics that still need
an improvement of
evidence in the research
domain?
3 ADVANCED INFORMATICS SCHOOL, UTM
Systematic Literature Review
What remains unclear
- how they practically conducted the
SLR protocol; and
- what tools they employed to assist in
performing activities in each SLR stage.
Thus, the SLR would not be easily
replicable to other new researchers as
they would need to create their own
ways of doing activities in conducting
SLR.
Okoli and Schabram 2010
Therefore, this paper attempts to address the
replicability issue by presenting the SLR
procedures using the Data Analytics
approach.
4 ADVANCED INFORMATICS SCHOOL, UTM
Data
Pre-processing &
Transformation
Data Analytics Technique in SLR
Problem
Formulation
Data
Selection
Data
Analysis
.
.
.
Combine
searching
result
Remove
article
duplications
Perform
Descriptive
Analysis
Perform
Text
Analysis
Searching result
Source 1
Searching result
Source 2
Searching result
Source ..n
Combined
dataset
Master
dataset
Define
Review
Questions
Search
articles from
selected
sources
PLANNING SELECTION EXTRACTION EXECUTION
SLR Stages
Okoli, 2010
Data Analytics
Fayyad 1996;
Jo 2013;
Provost and Fawcett 2013
5 ADVANCED INFORMATICS SCHOOL, UTM
Case study: Master Data Management
Master Data Management is one of the key
functions in Data Management Body of
Knowledge.
(DAMA-DMBOK, 2009)
6 ADVANCED INFORMATICS SCHOOL, UTM
Problem
Formulation
Define
Review
Questions
Data Analytics Technique in MDM Literature Review
ID Questions Purpose
RQ1 How did the numbers of the literature of
MDM vary by year?
To know the progress and current state
of the research
RQ2 Which database contains the most MDM
research work among the selected source
of databases?
To know the primary sources of the
research
RQ3 How do the MDM publications vary in
different publication types?
To know the trend of publication types of
the research
RQ4 Which keywords are the most frequently
used in searching the MDM research
domain?
To understand the common terms and
topic of interest in the research
Aim: To understand the progress of MDM research domain
7 ADVANCED INFORMATICS SCHOOL, UTM
Problem
Formulation
Data
Selection
.
.
.
Searching result
Source 1
Searching result
Source 2
Searching result
Source ..n
Define
Review
Questions
Search
articles from
selected
sources
Data Analytics Technique in MDM Literature Review
Database No. articles
found
ACM Digital Library 15
Emerald 14
Gartner 220
IEEE 18
Science Direct 48
Scopus 145
Springer Link 293
Web of Science, 12
Google Scholar 9
Total 777
Search keyword strings are (“master data”), (“management”),
(“Master Data Management”), and (“MDM”). Then, the search
strings were joint using “AND” and “OR” Boolean operators.
8 ADVANCED INFORMATICS SCHOOL, UTM
Data Analytics Technique in MDM Literature Review
Output File from searching process of ACM Digital Library
(File name: acm_searching.csv)
SCOPUS export function
IEEE export function
9 ADVANCED INFORMATICS SCHOOL, UTM
Problem
Formulation
Data
Selection
Data
Pre-processing &
Transformation
.
.
.
Combine
searching
result
Remove
article
duplications
Searching result
Source 1
Searching result
Source 2
Searching result
Source ..n
Combined
dataset
Master
dataset
Define
Review
Questions
Search
articles from
selected
sources
Data Analytics Technique in MDM Literature Review
735 publications
777 publications
10 ADVANCED INFORMATICS SCHOOL, UTM
Data Analytics Technique in MDM Literature Review
Output File after combining and removing duplicates (File name: master.csv)
11 ADVANCED INFORMATICS SCHOOL, UTM
Data
Pre-processing &
Transformation
Problem
Formulation
Data
Selection
Data
Analysis
.
.
.
Combine
searching
result
Remove
article
duplications
Perform
Descriptive
Analysis
Perform
Text
Analysis
Searching result
Source 1
Searching result
Source 2
Searching result
Source ..n
Combined
dataset
Master
dataset
Define
Review
Questions
Search
articles from
selected
sources
Data Analytics Technique in MDM Literature Review
12 ADVANCED INFORMATICS SCHOOL, UTM
Descriptive Analysis
July 2016
RQ1: How did the numbers
of publications of MDM vary
by year?
Articles distribution by Publication Year
Slope of Enlightenment:
More instances of how the technology
can benefit the enterprise become
more widely understood.
13 ADVANCED INFORMATICS SCHOOL, UTM
Descriptive Analysis
RQ2: Which database contains
the most MDM research work
among the selected source of
databases?
Articles Distribution by sources
This analysis is important so that
in future, the researcher could
focus on the primary databases
in searching the literature.
14 ADVANCED INFORMATICS SCHOOL, UTM
Descriptive Analysis
RQ3: How do the MDM
publications vary in different
publication types?
Articles Distribution by Publication Types
This analysis could help the
researchers for their future references.
For academic papers, it is highly
recommended to cite the paper from
journals and conference proceding.
15 ADVANCED INFORMATICS SCHOOL, UTM
Text Analysis
RQ4: Which keywords are the most
frequently used in searching the
MDM research domain?
Topic of interest Frequency
master data 148
data quality 70
business intelligence 57
business process 47
data integration 41
big data 34
data governance 29
information governance 29
data management 28
product data 28
information systems 26
information management 25
business processes 21
data sources 21
best practices 19
data model 18
information quality 18
research highlights 18
customer data 17
case study 16
Word cloud of 1-gram model represents
common terms used in the research
Table of 2-gram model represents topic of
interest in the research
16 ADVANCED INFORMATICS SCHOOL, UTM
17 ADVANCED INFORMATICS SCHOOL, UTM
Text Analysis
Topics that need attention for improvement
Example:
• Risk management
• Adoption model
Master data, data quality, business intelligent
18 ADVANCED INFORMATICS SCHOOL, UTM
Contribution
Outcome-based
approach
Effective
approach
Repetitive
approach
Data Analytics Technique in
Literature Review
Structured approach in
literature review
Repeatable method for
other research domain
Descriptive and Text Analysis
helps researcher in
strategizing the research
19 ADVANCED INFORMATICS SCHOOL, UTM
Thank You

More Related Content

What's hot

A Review: Text Classification on Social Media Data
A Review: Text Classification on Social Media DataA Review: Text Classification on Social Media Data
A Review: Text Classification on Social Media Data
IOSR Journals
 
Context Driven Technique for Document Classification
Context Driven Technique for Document ClassificationContext Driven Technique for Document Classification
Context Driven Technique for Document Classification
IDES Editor
 
Ir 01
Ir   01Ir   01
Multidimensioal database
Multidimensioal  databaseMultidimensioal  database
Multidimensioal database
TPO TPO
 
I1802055259
I1802055259I1802055259
I1802055259
IOSR Journals
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
JOSEPH FRANCIS
 
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
IJDKP
 
Query-Based Retrieval of Annotated Document
Query-Based Retrieval of Annotated DocumentQuery-Based Retrieval of Annotated Document
Query-Based Retrieval of Annotated Document
IRJET Journal
 
Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document Clustering
IRJET Journal
 
Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...
Aravind Sesagiri Raamkumar
 
DATA MINING.doc
DATA MINING.docDATA MINING.doc
DATA MINING.doc
butest
 
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ijaia
 
A Federated Search Approach to Facilitate Systematic Literature Review in Sof...
A Federated Search Approach to Facilitate Systematic Literature Review in Sof...A Federated Search Approach to Facilitate Systematic Literature Review in Sof...
A Federated Search Approach to Facilitate Systematic Literature Review in Sof...
ijseajournal
 
Functional Data Analysis Ecommerce
Functional Data Analysis EcommerceFunctional Data Analysis Ecommerce
Functional Data Analysis Ecommerce
Andrés Acosta Escobar
 
P11 goonetilleke
P11 goonetillekeP11 goonetilleke
P11 goonetilleke
Rahul Yadav
 
التنقيب في البيانات - Data Mining
التنقيب في البيانات -  Data Miningالتنقيب في البيانات -  Data Mining
التنقيب في البيانات - Data Mining
nabil_alsharafi
 
Internet Search: the past, present and the future
Internet Search: the past, present and the futureInternet Search: the past, present and the future
Internet Search: the past, present and the future
PayamBarnaghi
 
Data Mining
Data MiningData Mining
Data Mining
Medicaps University
 
The profile of the management (data) scientist: Potential scenarios and skill...
The profile of the management (data) scientist: Potential scenarios and skill...The profile of the management (data) scientist: Potential scenarios and skill...
The profile of the management (data) scientist: Potential scenarios and skill...
Juan Mateos-Garcia
 
Great model a model for the automatic generation of semantic relations betwee...
Great model a model for the automatic generation of semantic relations betwee...Great model a model for the automatic generation of semantic relations betwee...
Great model a model for the automatic generation of semantic relations betwee...
ijcsity
 

What's hot (20)

A Review: Text Classification on Social Media Data
A Review: Text Classification on Social Media DataA Review: Text Classification on Social Media Data
A Review: Text Classification on Social Media Data
 
Context Driven Technique for Document Classification
Context Driven Technique for Document ClassificationContext Driven Technique for Document Classification
Context Driven Technique for Document Classification
 
Ir 01
Ir   01Ir   01
Ir 01
 
Multidimensioal database
Multidimensioal  databaseMultidimensioal  database
Multidimensioal database
 
I1802055259
I1802055259I1802055259
I1802055259
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
 
Query-Based Retrieval of Annotated Document
Query-Based Retrieval of Annotated DocumentQuery-Based Retrieval of Annotated Document
Query-Based Retrieval of Annotated Document
 
Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document Clustering
 
Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...Evolution and state-of-the art of Altmetric research: Insights from network a...
Evolution and state-of-the art of Altmetric research: Insights from network a...
 
DATA MINING.doc
DATA MINING.docDATA MINING.doc
DATA MINING.doc
 
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
 
A Federated Search Approach to Facilitate Systematic Literature Review in Sof...
A Federated Search Approach to Facilitate Systematic Literature Review in Sof...A Federated Search Approach to Facilitate Systematic Literature Review in Sof...
A Federated Search Approach to Facilitate Systematic Literature Review in Sof...
 
Functional Data Analysis Ecommerce
Functional Data Analysis EcommerceFunctional Data Analysis Ecommerce
Functional Data Analysis Ecommerce
 
P11 goonetilleke
P11 goonetillekeP11 goonetilleke
P11 goonetilleke
 
التنقيب في البيانات - Data Mining
التنقيب في البيانات -  Data Miningالتنقيب في البيانات -  Data Mining
التنقيب في البيانات - Data Mining
 
Internet Search: the past, present and the future
Internet Search: the past, present and the futureInternet Search: the past, present and the future
Internet Search: the past, present and the future
 
Data Mining
Data MiningData Mining
Data Mining
 
The profile of the management (data) scientist: Potential scenarios and skill...
The profile of the management (data) scientist: Potential scenarios and skill...The profile of the management (data) scientist: Potential scenarios and skill...
The profile of the management (data) scientist: Potential scenarios and skill...
 
Great model a model for the automatic generation of semantic relations betwee...
Great model a model for the automatic generation of semantic relations betwee...Great model a model for the automatic generation of semantic relations betwee...
Great model a model for the automatic generation of semantic relations betwee...
 

Similar to Slide 26 sept2017v2

A Survey And Taxonomy Of Distributed Data Mining Research Studies A Systemat...
A Survey And Taxonomy Of Distributed Data Mining Research Studies  A Systemat...A Survey And Taxonomy Of Distributed Data Mining Research Studies  A Systemat...
A Survey And Taxonomy Of Distributed Data Mining Research Studies A Systemat...
Sandra Long
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
IJDKP
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
IJDKP
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
IJDKP
 
Data Mining System and Applications: A Review
Data Mining System and Applications: A ReviewData Mining System and Applications: A Review
Data Mining System and Applications: A Review
ijdpsjournal
 
Data Mining Xuequn Shang NorthWestern Polytechnical University
Data Mining Xuequn Shang NorthWestern Polytechnical UniversityData Mining Xuequn Shang NorthWestern Polytechnical University
Data Mining Xuequn Shang NorthWestern Polytechnical University
butest
 
Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...
butest
 
Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...
butest
 
Paper id 26201475
Paper id 26201475Paper id 26201475
Paper id 26201475
IJRAT
 
Introduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycleIntroduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycle
Dr. Radhey Shyam
 
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
IJDKP
 
The Architecture of System for Predicting Student Performance based on the Da...
The Architecture of System for Predicting Student Performance based on the Da...The Architecture of System for Predicting Student Performance based on the Da...
The Architecture of System for Predicting Student Performance based on the Da...
Thada Jantakoon
 
Asl rof businessintelligencetechnology2019
Asl rof businessintelligencetechnology2019Asl rof businessintelligencetechnology2019
Asl rof businessintelligencetechnology2019
kamilHussain15
 
Asl rof businessintelligencetechnology2019
Asl rof businessintelligencetechnology2019Asl rof businessintelligencetechnology2019
Asl rof businessintelligencetechnology2019
kamilHussain15
 
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
IRJET Journal
 
Data Mining
Data MiningData Mining
Data Mining
AnbreenJaved
 
IRJET- Automated Document Summarization and Classification using Deep Lear...
IRJET- 	  Automated Document Summarization and Classification using Deep Lear...IRJET- 	  Automated Document Summarization and Classification using Deep Lear...
IRJET- Automated Document Summarization and Classification using Deep Lear...
IRJET Journal
 
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdfKIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
Dr. Radhey Shyam
 
The Survey of Data Mining Applications And Feature Scope
The Survey of Data Mining Applications  And Feature Scope The Survey of Data Mining Applications  And Feature Scope
The Survey of Data Mining Applications And Feature Scope
IJCSEIT Journal
 
Unit 1.pptx
Unit 1.pptxUnit 1.pptx
Unit 1.pptx
DrThenmozhiSPESUMCA
 

Similar to Slide 26 sept2017v2 (20)

A Survey And Taxonomy Of Distributed Data Mining Research Studies A Systemat...
A Survey And Taxonomy Of Distributed Data Mining Research Studies  A Systemat...A Survey And Taxonomy Of Distributed Data Mining Research Studies  A Systemat...
A Survey And Taxonomy Of Distributed Data Mining Research Studies A Systemat...
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
 
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTIONCATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
CATEGORIZATION OF FACTORS AFFECTING CLASSIFICATION ALGORITHMS SELECTION
 
Data Mining System and Applications: A Review
Data Mining System and Applications: A ReviewData Mining System and Applications: A Review
Data Mining System and Applications: A Review
 
Data Mining Xuequn Shang NorthWestern Polytechnical University
Data Mining Xuequn Shang NorthWestern Polytechnical UniversityData Mining Xuequn Shang NorthWestern Polytechnical University
Data Mining Xuequn Shang NorthWestern Polytechnical University
 
Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...
 
Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...
 
Paper id 26201475
Paper id 26201475Paper id 26201475
Paper id 26201475
 
Introduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycleIntroduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycle
 
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
 
The Architecture of System for Predicting Student Performance based on the Da...
The Architecture of System for Predicting Student Performance based on the Da...The Architecture of System for Predicting Student Performance based on the Da...
The Architecture of System for Predicting Student Performance based on the Da...
 
Asl rof businessintelligencetechnology2019
Asl rof businessintelligencetechnology2019Asl rof businessintelligencetechnology2019
Asl rof businessintelligencetechnology2019
 
Asl rof businessintelligencetechnology2019
Asl rof businessintelligencetechnology2019Asl rof businessintelligencetechnology2019
Asl rof businessintelligencetechnology2019
 
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
 
Data Mining
Data MiningData Mining
Data Mining
 
IRJET- Automated Document Summarization and Classification using Deep Lear...
IRJET- 	  Automated Document Summarization and Classification using Deep Lear...IRJET- 	  Automated Document Summarization and Classification using Deep Lear...
IRJET- Automated Document Summarization and Classification using Deep Lear...
 
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdfKIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
 
The Survey of Data Mining Applications And Feature Scope
The Survey of Data Mining Applications  And Feature Scope The Survey of Data Mining Applications  And Feature Scope
The Survey of Data Mining Applications And Feature Scope
 
Unit 1.pptx
Unit 1.pptxUnit 1.pptx
Unit 1.pptx
 

Recently uploaded

一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
AndrzejJarynowski
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 

Recently uploaded (20)

一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 

Slide 26 sept2017v2

  • 1. Applying Data Analytics Approach in Systematic Literature Review: Master Data Management Case Study Faizura Haneem, Nazri Kama, Rosmah Ali, Ali Selamat Advanced Informatics School Universiti Teknologi Malaysia 26 Sept 2017 16th International Conference on Intelligent Software Methodologies, Tools, and Techniques
  • 2. 2 ADVANCED INFORMATICS SCHOOL, UTM Problem Background What is the progress of the research domain? Does it still relevant to study? Which database should I go to search the literature of the research domain? What are the common/ related terms in the research domain? What topics that still need an improvement of evidence in the research domain?
  • 3. 3 ADVANCED INFORMATICS SCHOOL, UTM Systematic Literature Review What remains unclear - how they practically conducted the SLR protocol; and - what tools they employed to assist in performing activities in each SLR stage. Thus, the SLR would not be easily replicable to other new researchers as they would need to create their own ways of doing activities in conducting SLR. Okoli and Schabram 2010 Therefore, this paper attempts to address the replicability issue by presenting the SLR procedures using the Data Analytics approach.
  • 4. 4 ADVANCED INFORMATICS SCHOOL, UTM Data Pre-processing & Transformation Data Analytics Technique in SLR Problem Formulation Data Selection Data Analysis . . . Combine searching result Remove article duplications Perform Descriptive Analysis Perform Text Analysis Searching result Source 1 Searching result Source 2 Searching result Source ..n Combined dataset Master dataset Define Review Questions Search articles from selected sources PLANNING SELECTION EXTRACTION EXECUTION SLR Stages Okoli, 2010 Data Analytics Fayyad 1996; Jo 2013; Provost and Fawcett 2013
  • 5. 5 ADVANCED INFORMATICS SCHOOL, UTM Case study: Master Data Management Master Data Management is one of the key functions in Data Management Body of Knowledge. (DAMA-DMBOK, 2009)
  • 6. 6 ADVANCED INFORMATICS SCHOOL, UTM Problem Formulation Define Review Questions Data Analytics Technique in MDM Literature Review ID Questions Purpose RQ1 How did the numbers of the literature of MDM vary by year? To know the progress and current state of the research RQ2 Which database contains the most MDM research work among the selected source of databases? To know the primary sources of the research RQ3 How do the MDM publications vary in different publication types? To know the trend of publication types of the research RQ4 Which keywords are the most frequently used in searching the MDM research domain? To understand the common terms and topic of interest in the research Aim: To understand the progress of MDM research domain
  • 7. 7 ADVANCED INFORMATICS SCHOOL, UTM Problem Formulation Data Selection . . . Searching result Source 1 Searching result Source 2 Searching result Source ..n Define Review Questions Search articles from selected sources Data Analytics Technique in MDM Literature Review Database No. articles found ACM Digital Library 15 Emerald 14 Gartner 220 IEEE 18 Science Direct 48 Scopus 145 Springer Link 293 Web of Science, 12 Google Scholar 9 Total 777 Search keyword strings are (“master data”), (“management”), (“Master Data Management”), and (“MDM”). Then, the search strings were joint using “AND” and “OR” Boolean operators.
  • 8. 8 ADVANCED INFORMATICS SCHOOL, UTM Data Analytics Technique in MDM Literature Review Output File from searching process of ACM Digital Library (File name: acm_searching.csv) SCOPUS export function IEEE export function
  • 9. 9 ADVANCED INFORMATICS SCHOOL, UTM Problem Formulation Data Selection Data Pre-processing & Transformation . . . Combine searching result Remove article duplications Searching result Source 1 Searching result Source 2 Searching result Source ..n Combined dataset Master dataset Define Review Questions Search articles from selected sources Data Analytics Technique in MDM Literature Review 735 publications 777 publications
  • 10. 10 ADVANCED INFORMATICS SCHOOL, UTM Data Analytics Technique in MDM Literature Review Output File after combining and removing duplicates (File name: master.csv)
  • 11. 11 ADVANCED INFORMATICS SCHOOL, UTM Data Pre-processing & Transformation Problem Formulation Data Selection Data Analysis . . . Combine searching result Remove article duplications Perform Descriptive Analysis Perform Text Analysis Searching result Source 1 Searching result Source 2 Searching result Source ..n Combined dataset Master dataset Define Review Questions Search articles from selected sources Data Analytics Technique in MDM Literature Review
  • 12. 12 ADVANCED INFORMATICS SCHOOL, UTM Descriptive Analysis July 2016 RQ1: How did the numbers of publications of MDM vary by year? Articles distribution by Publication Year Slope of Enlightenment: More instances of how the technology can benefit the enterprise become more widely understood.
  • 13. 13 ADVANCED INFORMATICS SCHOOL, UTM Descriptive Analysis RQ2: Which database contains the most MDM research work among the selected source of databases? Articles Distribution by sources This analysis is important so that in future, the researcher could focus on the primary databases in searching the literature.
  • 14. 14 ADVANCED INFORMATICS SCHOOL, UTM Descriptive Analysis RQ3: How do the MDM publications vary in different publication types? Articles Distribution by Publication Types This analysis could help the researchers for their future references. For academic papers, it is highly recommended to cite the paper from journals and conference proceding.
  • 15. 15 ADVANCED INFORMATICS SCHOOL, UTM Text Analysis RQ4: Which keywords are the most frequently used in searching the MDM research domain? Topic of interest Frequency master data 148 data quality 70 business intelligence 57 business process 47 data integration 41 big data 34 data governance 29 information governance 29 data management 28 product data 28 information systems 26 information management 25 business processes 21 data sources 21 best practices 19 data model 18 information quality 18 research highlights 18 customer data 17 case study 16 Word cloud of 1-gram model represents common terms used in the research Table of 2-gram model represents topic of interest in the research
  • 16. 16 ADVANCED INFORMATICS SCHOOL, UTM
  • 17. 17 ADVANCED INFORMATICS SCHOOL, UTM Text Analysis Topics that need attention for improvement Example: • Risk management • Adoption model Master data, data quality, business intelligent
  • 18. 18 ADVANCED INFORMATICS SCHOOL, UTM Contribution Outcome-based approach Effective approach Repetitive approach Data Analytics Technique in Literature Review Structured approach in literature review Repeatable method for other research domain Descriptive and Text Analysis helps researcher in strategizing the research
  • 19. 19 ADVANCED INFORMATICS SCHOOL, UTM Thank You