This document is a final examination report submitted to fulfill the requirements for an Information Technology Research and Innovation course. It presents the results of a systematic literature review of 44 studies on machine learning technique usage from 2010 to 2014. The review aimed to identify the most common machine learning techniques, their applications, appropriate data types, and strengths and weaknesses. The techniques most frequently used were found to be support vector machines, artificial neural networks, naive Bayes, decision trees, and k-nearest neighbors. The studies covered fields such as medicine, pharmacology, agriculture, archaeology, games, and business.
Performance Analysis of Supervised Machine Learning Techniques for Sentiment ...Biswaranjan Samal
Â
Wide use of internet and web applications like, feedback collection systems are now making peoples smarter. In these applications, peoples used to give their feedback about the movies, products, services, etc through which they have gone, and this feedback are publicly available for future references. It is a tedious task for the machines to identify the feedback types, i:e positive or negative. And here Machine Learning Techniques plays vital roles to train the machine and make it intelligent so that the machine will be able to identify the feedback type which may give more benefits and features for those web applications and the users. There are many supervised machine learning techniques are available so it is a difficult task to choose the best one. In this paper, we have collected the movie review datasets of different sizes and have selected some of the widely used and popular supervised machine learning algorithms, for training the model. So that the model will be able to categorize the review. Python's NLTK package along with the WinPython and Spyder are used for processing the movie reviews. Then Python's sklearn package is used for training the model and finding the accuracy of the model.
Predictive models are quasi experimental structures used to determine the future
patterns in data. These meaningful data patterns form the building block of any
decision support system. Researchers all over the world have built many prediction
models for major industries. Research works in the educational sector has increased
steeply. This steep increase may be due to the high availability of data in the
educational domain. This survey tries to comprehend a few literary works on
academic performance prediction of engineering students with the focus on grade
predictions. Meaningful interpretations have been made and inferences are presented
at the end of this paper
Systematic review on evaluating planning process in agile development methodsTELKOMNIKA JOURNAL
Â
Agile development methods have been catering the need of faster delivery of theever-demanding domain of software engineering. These methods are able to deliver value to users and businesses via fast, reliable, and repeatable process. Planning requirements and processes takes the driving seat in a dynamic environment because the value proposition rapidly changes. This paper exhibits asystematic literature review of planning processes implementedby various agile methods in order to find the best suited agile method in terms of robust planning. Keywords: It was found that Scrum is the best suited agile method for planning processes.
Managers Perceptions towards the Success of E-performance Reporting SystemTELKOMNIKA JOURNAL
Â
Managers are the key informants in the information system (IS) success measurements. In fact, besides the determinant agents are rarely involved in the assessments, most of the measurements are also often performed by the technical stakeholders of the systems. Therefore, the results may questionable. This study was carried to explain the factors that influence the success of an e-performance reporting system in an Indonesian university by involving Âą 70% of the managers (n=66) in the sampled institution. The DeLone and McLean model was adopted and adapted here following the suggestions of the previous meta-analysis studies. The collected data was analyzed using the partial least squares-structural equation modelling (PLS-SEM) for examining the four hypotheses. Despite the findings revealed acceptances of the overall hypotheses, the weak explanation of the user satisfaction variable towards the net benefit one had been the highlighted point. Besides the study limitations, the point may also be the practical and theoretical considerations for the next studies, especially for the IS success studies in Indonesia
A Systematic Mapping Review of Software Quality Measurement: Research Trends,...IJECEIAES
Â
Software quality is a key for the success in the business of information and technology. Hence, before be marketed, it needs the software quality measurement to fulfill the user requirements. Some methods of the software quality analysis have been tested in a different perspective, and we have presented the software method in the point of view of users and experts. This study aims to map the method of software quality measurement in any models of quality. Using the method of Systematic Mapping Study, we did a searching and filtering of papers using the inclusion and exclusion criteria. 42 relevant papers have been obtained then. The result of the mapping showed that though the model of ISO SQuaRE has been widely used since the last five years and experienced the dynamics, the researchers in Indonesia still used ISO9126 until the end of 2016.The most commonly used method of the software quality measurement Method is the empirical method, and some researchers have done an AHP and Fuzzy approach in measuring the software quality.
Performance Analysis of Supervised Machine Learning Techniques for Sentiment ...Biswaranjan Samal
Â
Wide use of internet and web applications like, feedback collection systems are now making peoples smarter. In these applications, peoples used to give their feedback about the movies, products, services, etc through which they have gone, and this feedback are publicly available for future references. It is a tedious task for the machines to identify the feedback types, i:e positive or negative. And here Machine Learning Techniques plays vital roles to train the machine and make it intelligent so that the machine will be able to identify the feedback type which may give more benefits and features for those web applications and the users. There are many supervised machine learning techniques are available so it is a difficult task to choose the best one. In this paper, we have collected the movie review datasets of different sizes and have selected some of the widely used and popular supervised machine learning algorithms, for training the model. So that the model will be able to categorize the review. Python's NLTK package along with the WinPython and Spyder are used for processing the movie reviews. Then Python's sklearn package is used for training the model and finding the accuracy of the model.
Predictive models are quasi experimental structures used to determine the future
patterns in data. These meaningful data patterns form the building block of any
decision support system. Researchers all over the world have built many prediction
models for major industries. Research works in the educational sector has increased
steeply. This steep increase may be due to the high availability of data in the
educational domain. This survey tries to comprehend a few literary works on
academic performance prediction of engineering students with the focus on grade
predictions. Meaningful interpretations have been made and inferences are presented
at the end of this paper
Systematic review on evaluating planning process in agile development methodsTELKOMNIKA JOURNAL
Â
Agile development methods have been catering the need of faster delivery of theever-demanding domain of software engineering. These methods are able to deliver value to users and businesses via fast, reliable, and repeatable process. Planning requirements and processes takes the driving seat in a dynamic environment because the value proposition rapidly changes. This paper exhibits asystematic literature review of planning processes implementedby various agile methods in order to find the best suited agile method in terms of robust planning. Keywords: It was found that Scrum is the best suited agile method for planning processes.
Managers Perceptions towards the Success of E-performance Reporting SystemTELKOMNIKA JOURNAL
Â
Managers are the key informants in the information system (IS) success measurements. In fact, besides the determinant agents are rarely involved in the assessments, most of the measurements are also often performed by the technical stakeholders of the systems. Therefore, the results may questionable. This study was carried to explain the factors that influence the success of an e-performance reporting system in an Indonesian university by involving Âą 70% of the managers (n=66) in the sampled institution. The DeLone and McLean model was adopted and adapted here following the suggestions of the previous meta-analysis studies. The collected data was analyzed using the partial least squares-structural equation modelling (PLS-SEM) for examining the four hypotheses. Despite the findings revealed acceptances of the overall hypotheses, the weak explanation of the user satisfaction variable towards the net benefit one had been the highlighted point. Besides the study limitations, the point may also be the practical and theoretical considerations for the next studies, especially for the IS success studies in Indonesia
A Systematic Mapping Review of Software Quality Measurement: Research Trends,...IJECEIAES
Â
Software quality is a key for the success in the business of information and technology. Hence, before be marketed, it needs the software quality measurement to fulfill the user requirements. Some methods of the software quality analysis have been tested in a different perspective, and we have presented the software method in the point of view of users and experts. This study aims to map the method of software quality measurement in any models of quality. Using the method of Systematic Mapping Study, we did a searching and filtering of papers using the inclusion and exclusion criteria. 42 relevant papers have been obtained then. The result of the mapping showed that though the model of ISO SQuaRE has been widely used since the last five years and experienced the dynamics, the researchers in Indonesia still used ISO9126 until the end of 2016.The most commonly used method of the software quality measurement Method is the empirical method, and some researchers have done an AHP and Fuzzy approach in measuring the software quality.
City i-Tick: The android based mobile application for studentsâ attendance at...journalBEEI
Â
This paper presents City i-Tick, the android based mobile application for studentsâ attendance at a university. In this study, we developed mobile application for lecturers to take studentsâ attendance in City University, Petaling Jaya. Managing studentsâ attendance during lecture periods has become a difficult challenge. The research objectives for this study are to identify user requirement for City i-Tick, to design and develop City i-Tick, and to demonstrate the prototype of City i-Tick. The study is a narrative participatory design and exploits Design Thinking as the research methodology. City i-Tick was successfully validated by 14 lecturers and System Usability Scale (SUS) was used to determine the findings of the study. We found that City i-Tick is effective for lecturers in taking attendance because it is easy to use, easy to learn, and the users feel confident when using this application.
Data Mining Application in Advertisement Management of Higher Educational Ins...ijcax
Â
In recent years, Indian higher educational instituteâs competition grows rapidly for attracting students to
get enrollment in their institutes. To attract students educational institutes select a best advertisement
method. There are different advertisements available in the market but a selection of them is very difficult
for institutes. This paper is helpful for institutes to select a best advertisement medium using some data
mining methods.
Data Mining Application in Advertisement Management of Higher Educational Ins...ijcax
Â
In recent years, Indian higher educational instituteâs competition grows rapidly for attracting students to get enrollment in their institutes. To attract students educational institutes select a best advertisement
method. There are different advertisements available in the market but a selection of them is very difficult for institutes. This paper is helpful for institutes to select a best advertisement medium using some data mining methods.
Data Mining Application in Advertisement Management of Higher Educational Ins...ijcax
Â
In recent years, Indian higher educational instituteâs competition grows rapidly for attracting students to get enrollment in their institutes. To attract students educational institutes select a best advertisement method. There are different advertisements available in the market but a selection of them is very difficult
for institutes. This paper is helpful for institutes to select a best advertisement medium using some data mining methods.
Data Mining Application in Advertisement Management of Higher Educational Ins...ijcax
Â
In recent years, Indian higher educational instituteâs competition grows rapidly for attracting students to
get enrollment in their institutes. To attract students educational institutes select a best advertisement
method. There are different advertisements available in the market but a selection of them is very difficult
for institutes. This paper is helpful for institutes to select a best advertisement medium using some data
mining methods.
An Empirical Study of the Applications of Classification Techniques in Studen...IJERA Editor
Â
University servers and databases store a huge amount of data including personal details, registration details, evaluation assessment, performance profiles, and many more for students and lecturers alike. main problem that faces any system administration or any users is data increasing per-second, which is stored in different type and format in the servers, learning about students from a huge amount of data including personal details, registration details, evaluation assessment, performance profiles, and many more for students and lecturers alike. Graduation and academic information in the future and maintaining structure and content of the courses according to their previous results become importance. The paper objectives are extract knowledge from incomplete data structure and what the suitable method or technique of data mining to extract knowledge from a huge amount of data about students to help the administration using technology to make a quick decision. Data mining aims to discover useful information or knowledge by using one of data mining techniques, this paper used classification technique to discover knowledge from studentâs server database, where all studentsâ information were registered and stored. The classification task is used, the classifier tree C4.5, to predict the final academic results, grades, of students. We use classifier tree C4.5 as the method to classify the grades for the students .The data include four years period [2006-2009]. Experiment results show that classification process succeeded in training set. Thus, the predicted instances is similar to the training set, this proves the suggested classification model. Also the efficiency and effectiveness of C4.5 algorithm in predicting the academic results, grades, classification is very good. The model also can improve the efficiency of the academic results retrieving and evidently promote retrieval precision.
LEAN THINKING IN SOFTWARE ENGINEERING: A SYSTEMATIC REVIEWijseajournal
Â
The field of Software Engineering has suffered considerable transformation in the last decades due to the influence of the philosophy of Lean Thinking. The purpose of this systematic review is to identify practices and approaches proposed by researchers in this area in the last 5 years, who have worked under the influence of this thinking. The search strategy brought together 549 studies, 80 of which were classified as
relevant for synthesis in this review. Seventeen tools of Lean Thinking adapted to Software Engineering were catalogued, as well as 35 practices created for the development of software that has been influenced by this philosophy. The study rovides a roadmap of results with the current state of the art and the identification of gaps pointing to opportunities for further esearch.
A Novel approach for Document Clustering using Concept ExtractionAM Publications
Â
In this paper we present a novel approach to extract the concept from a document and cluster such set of documents depending on the concept extracted from each of them. We transform the corpus into vector space by using term frequencyâinverse document frequency then calculate the cosine distance between each document, followed by clustering them using K means algorithm. We also use multidimensional scaling to reduce the dimensionality within the corpus. It results in the grouping of documents which are most similar to each other with respect to their content and the genre.
Supporting Information Management in Selecting Scientific Research Projectsinventionjournals
Â
The information management in KAU (King Abdulaziz University) face a critical problem when selecting the suitable research projects. Most of faculty members in all faculties and research institutes submit scientific research proposals with the hope to be accepted. The management needs to set a scientific approach to help in selecting suitable proposals. TOPSIS (The Technique for Order of Preference by Similarity to Ideal Solution Method) is a powerful a multi-criteria approach in ranking alternatives with different criteria and selecting the best alternatives. Applying the TOPSIS solved the problem that the Information management faces.
Supporting Information Management in Selecting Scientific Research Projectsinventionjournals
Â
The information management in KAU (King Abdulaziz University) face a critical problem when selecting the suitable research projects. Most of faculty members in all faculties and research institutes submit scientific research proposals with the hope to be accepted. The management needs to set a scientific approach to help in selecting suitable proposals. TOPSIS (The Technique for Order of Preference by Similarity to Ideal Solution Method) is a powerful a multi-criteria approach in ranking alternatives with different criteria and selecting the best alternatives. Applying the TOPSIS solved the problem that the Information management faces.
How to write chapter three of your research projectEtieneIma123
Â
Chapter three of the research project or the research methodology is another significant part in the research project writing. In developing the chapter three of the research project, you state the research method you wish to adopt, the instruments to be used, where you will collect your data and how you collected it.
This chapter explains the different methods to be used in the research project. Here you mention the procedures and strategies you will employ in the study such as research design, research area (area of the study), the population of the study, etc.
You also tell the reader why you chose a particular method, how you planned to analyze your data. Your methodology should be written in a simple language such that other researchers can follow the method and arrive at the same conclusion or findings.
How to write chapter three of your research projectEtieneIma123
Â
Chapter three of the research project or the research methodology is another significant part of the research project writing. In developing the chapter three of the research project, you state the research method you wish to adopt, the instruments to be used, where you will collect your data and how you collected it.
A Federated Search Approach to Facilitate Systematic Literature Review in Sof...ijseajournal
Â
To impact industry, researchers developing technologies in academia need to provide tangible evidence of
the advantages of using them. Nowadays, Systematic Literature Review (SLR) has become a prominent
methodology in evidence-based researches. Although adopting SLR in software engineering does not go far
in practice, it has been resulted in valuable researches and is going to be more common. However, digital
libraries and scientific databases as the best research resources do not provide enough mechanism for
SLRs especially in software engineering. On the other hand, any loss of data may change the SLR results
and leads to research bias. Accordingly, the search process and evidence collection in SLR is a critical
point. This paper provides some tips to enhance the SLR process. The main contribution of this work is
presenting a federated search tool which provides an automatic integrated search mechanism in wellknown Software Engineering databases. Results of case study show that this approach not only reduces
required time to do SLR and facilitate its search process, but also improves its reliability and results in the
increasing trend to use SLRs.
City i-Tick: The android based mobile application for studentsâ attendance at...journalBEEI
Â
This paper presents City i-Tick, the android based mobile application for studentsâ attendance at a university. In this study, we developed mobile application for lecturers to take studentsâ attendance in City University, Petaling Jaya. Managing studentsâ attendance during lecture periods has become a difficult challenge. The research objectives for this study are to identify user requirement for City i-Tick, to design and develop City i-Tick, and to demonstrate the prototype of City i-Tick. The study is a narrative participatory design and exploits Design Thinking as the research methodology. City i-Tick was successfully validated by 14 lecturers and System Usability Scale (SUS) was used to determine the findings of the study. We found that City i-Tick is effective for lecturers in taking attendance because it is easy to use, easy to learn, and the users feel confident when using this application.
Data Mining Application in Advertisement Management of Higher Educational Ins...ijcax
Â
In recent years, Indian higher educational instituteâs competition grows rapidly for attracting students to
get enrollment in their institutes. To attract students educational institutes select a best advertisement
method. There are different advertisements available in the market but a selection of them is very difficult
for institutes. This paper is helpful for institutes to select a best advertisement medium using some data
mining methods.
Data Mining Application in Advertisement Management of Higher Educational Ins...ijcax
Â
In recent years, Indian higher educational instituteâs competition grows rapidly for attracting students to get enrollment in their institutes. To attract students educational institutes select a best advertisement
method. There are different advertisements available in the market but a selection of them is very difficult for institutes. This paper is helpful for institutes to select a best advertisement medium using some data mining methods.
Data Mining Application in Advertisement Management of Higher Educational Ins...ijcax
Â
In recent years, Indian higher educational instituteâs competition grows rapidly for attracting students to get enrollment in their institutes. To attract students educational institutes select a best advertisement method. There are different advertisements available in the market but a selection of them is very difficult
for institutes. This paper is helpful for institutes to select a best advertisement medium using some data mining methods.
Data Mining Application in Advertisement Management of Higher Educational Ins...ijcax
Â
In recent years, Indian higher educational instituteâs competition grows rapidly for attracting students to
get enrollment in their institutes. To attract students educational institutes select a best advertisement
method. There are different advertisements available in the market but a selection of them is very difficult
for institutes. This paper is helpful for institutes to select a best advertisement medium using some data
mining methods.
An Empirical Study of the Applications of Classification Techniques in Studen...IJERA Editor
Â
University servers and databases store a huge amount of data including personal details, registration details, evaluation assessment, performance profiles, and many more for students and lecturers alike. main problem that faces any system administration or any users is data increasing per-second, which is stored in different type and format in the servers, learning about students from a huge amount of data including personal details, registration details, evaluation assessment, performance profiles, and many more for students and lecturers alike. Graduation and academic information in the future and maintaining structure and content of the courses according to their previous results become importance. The paper objectives are extract knowledge from incomplete data structure and what the suitable method or technique of data mining to extract knowledge from a huge amount of data about students to help the administration using technology to make a quick decision. Data mining aims to discover useful information or knowledge by using one of data mining techniques, this paper used classification technique to discover knowledge from studentâs server database, where all studentsâ information were registered and stored. The classification task is used, the classifier tree C4.5, to predict the final academic results, grades, of students. We use classifier tree C4.5 as the method to classify the grades for the students .The data include four years period [2006-2009]. Experiment results show that classification process succeeded in training set. Thus, the predicted instances is similar to the training set, this proves the suggested classification model. Also the efficiency and effectiveness of C4.5 algorithm in predicting the academic results, grades, classification is very good. The model also can improve the efficiency of the academic results retrieving and evidently promote retrieval precision.
LEAN THINKING IN SOFTWARE ENGINEERING: A SYSTEMATIC REVIEWijseajournal
Â
The field of Software Engineering has suffered considerable transformation in the last decades due to the influence of the philosophy of Lean Thinking. The purpose of this systematic review is to identify practices and approaches proposed by researchers in this area in the last 5 years, who have worked under the influence of this thinking. The search strategy brought together 549 studies, 80 of which were classified as
relevant for synthesis in this review. Seventeen tools of Lean Thinking adapted to Software Engineering were catalogued, as well as 35 practices created for the development of software that has been influenced by this philosophy. The study rovides a roadmap of results with the current state of the art and the identification of gaps pointing to opportunities for further esearch.
A Novel approach for Document Clustering using Concept ExtractionAM Publications
Â
In this paper we present a novel approach to extract the concept from a document and cluster such set of documents depending on the concept extracted from each of them. We transform the corpus into vector space by using term frequencyâinverse document frequency then calculate the cosine distance between each document, followed by clustering them using K means algorithm. We also use multidimensional scaling to reduce the dimensionality within the corpus. It results in the grouping of documents which are most similar to each other with respect to their content and the genre.
Supporting Information Management in Selecting Scientific Research Projectsinventionjournals
Â
The information management in KAU (King Abdulaziz University) face a critical problem when selecting the suitable research projects. Most of faculty members in all faculties and research institutes submit scientific research proposals with the hope to be accepted. The management needs to set a scientific approach to help in selecting suitable proposals. TOPSIS (The Technique for Order of Preference by Similarity to Ideal Solution Method) is a powerful a multi-criteria approach in ranking alternatives with different criteria and selecting the best alternatives. Applying the TOPSIS solved the problem that the Information management faces.
Supporting Information Management in Selecting Scientific Research Projectsinventionjournals
Â
The information management in KAU (King Abdulaziz University) face a critical problem when selecting the suitable research projects. Most of faculty members in all faculties and research institutes submit scientific research proposals with the hope to be accepted. The management needs to set a scientific approach to help in selecting suitable proposals. TOPSIS (The Technique for Order of Preference by Similarity to Ideal Solution Method) is a powerful a multi-criteria approach in ranking alternatives with different criteria and selecting the best alternatives. Applying the TOPSIS solved the problem that the Information management faces.
How to write chapter three of your research projectEtieneIma123
Â
Chapter three of the research project or the research methodology is another significant part in the research project writing. In developing the chapter three of the research project, you state the research method you wish to adopt, the instruments to be used, where you will collect your data and how you collected it.
This chapter explains the different methods to be used in the research project. Here you mention the procedures and strategies you will employ in the study such as research design, research area (area of the study), the population of the study, etc.
You also tell the reader why you chose a particular method, how you planned to analyze your data. Your methodology should be written in a simple language such that other researchers can follow the method and arrive at the same conclusion or findings.
How to write chapter three of your research projectEtieneIma123
Â
Chapter three of the research project or the research methodology is another significant part of the research project writing. In developing the chapter three of the research project, you state the research method you wish to adopt, the instruments to be used, where you will collect your data and how you collected it.
A Federated Search Approach to Facilitate Systematic Literature Review in Sof...ijseajournal
Â
To impact industry, researchers developing technologies in academia need to provide tangible evidence of
the advantages of using them. Nowadays, Systematic Literature Review (SLR) has become a prominent
methodology in evidence-based researches. Although adopting SLR in software engineering does not go far
in practice, it has been resulted in valuable researches and is going to be more common. However, digital
libraries and scientific databases as the best research resources do not provide enough mechanism for
SLRs especially in software engineering. On the other hand, any loss of data may change the SLR results
and leads to research bias. Accordingly, the search process and evidence collection in SLR is a critical
point. This paper provides some tips to enhance the SLR process. The main contribution of this work is
presenting a federated search tool which provides an automatic integrated search mechanism in wellknown Software Engineering databases. Results of case study show that this approach not only reduces
required time to do SLR and facilitate its search process, but also improves its reliability and results in the
increasing trend to use SLRs.
Similar to A SYSTEMATIC LITERATURE REVIEW OF MACHINE LEARNING TECHNIQUE USAGE (20)
Strategies for Effective Upskilling is a presentation by Chinwendu Peace in a Your Skill Boost Masterclass organisation by the Excellence Foundation for South Sudan on 08th and 09th June 2024 from 1 PM to 3 PM on each day.
Biological screening of herbal drugs: Introduction and Need for
Phyto-Pharmacological Screening, New Strategies for evaluating
Natural Products, In vitro evaluation techniques for Antioxidants, Antimicrobial and Anticancer drugs. In vivo evaluation techniques
for Anti-inflammatory, Antiulcer, Anticancer, Wound healing, Antidiabetic, Hepatoprotective, Cardio protective, Diuretics and
Antifertility, Toxicity studies as per OECD guidelines
Delivering Micro-Credentials in Technical and Vocational Education and TrainingAG2 Design
Â
Explore how micro-credentials are transforming Technical and Vocational Education and Training (TVET) with this comprehensive slide deck. Discover what micro-credentials are, their importance in TVET, the advantages they offer, and the insights from industry experts. Additionally, learn about the top software applications available for creating and managing micro-credentials. This presentation also includes valuable resources and a discussion on the future of these specialised certifications.
For more detailed information on delivering micro-credentials in TVET, visit this https://tvettrainer.com/delivering-micro-credentials-in-tvet/
A review of the growth of the Israel Genealogy Research Association Database Collection for the last 12 months. Our collection is now passed the 3 million mark and still growing. See which archives have contributed the most. See the different types of records we have, and which years have had records added. You can also see what we have for the future.
Thinking of getting a dog? Be aware that breeds like Pit Bulls, Rottweilers, and German Shepherds can be loyal and dangerous. Proper training and socialization are crucial to preventing aggressive behaviors. Ensure safety by understanding their needs and always supervising interactions. Stay safe, and enjoy your furry friends!
Normal Labour/ Stages of Labour/ Mechanism of LabourWasim Ak
Â
Normal labor is also termed spontaneous labor, defined as the natural physiological process through which the fetus, placenta, and membranes are expelled from the uterus through the birth canal at term (37 to 42 weeks
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Â
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Â
A SYSTEMATIC LITERATURE REVIEW OF MACHINE LEARNING TECHNIQUE USAGE
1. A SYSTEMATIC LITERATURE REVIEW OF
MACHINE LEARNING TECHNIQUE USAGE
A final-term examination report
to fullfill the requirements for
Information Technology Research and Innovation
Lecture: Dr.-Ing. Ir. Suhardi
by:
Arrahman Adnani, S.ST
NIM.23514052
SCHOOL OF INFORMATICS AND ENGINEERING
INSTITUTE TECHNOLOGY BANDUNG
DECEMBER 2014
2. 2
A Systematic Literature Review of
Machine Learning Technique Usage
Arrahman Adnani, S.ST
NIM.23514052
School of Informatics and Engineering
Institute Technology Bandung
Bandung, Indonesia
adnani@s.itb.ac.id
Abstract
The development of machine learning technique is very fast now. Its usage
has spread to various fields, such as learning machines currently used in medical
science, pharmacology, agriculture, archeology, games, business and so forth.
Many researches has been performed to create a more intelligent machines that
can replace or relieve human tasks such as analyzing, communicating, learning, or
making decisions. In this research performed a systematic review of research from
2010 to 2014 in the literature about the use of the machine learning technique.
The purpose of this study is to determine the techniques and problems in the use
of machine learning that may be used as a reference for conducting research in the
future.
Keywords: artificial neural network, classification, clustering, machine learning
technique, prediction, support vector machines, systematic literature
review
3. 3
1. Introduction
Currently very rapid development of machine learning and its use has been
expanded to various fields. For example, the current machine learning is used in
medical science to measure health [1,2] or diagnose a disease such as cancer [3,4].
In the pharmacology, is not only used to find the right formula and reliable drugs
to incapacitate disease virus [5,6], machine learning is also used to determine the
effective therapeutic treatment [7]. Besides being used in medical science and
pharmacology, or better known as bioinformatics, machine learning is also used in
other fields such as agriculture, archeology, games, business and others.
The researchers have done many studies on machine learning in order to
become more intelligent machines that can replace or relieve human task. With
the use of machine learning techniques, the machine has been able to better
analyze, communicate, learn, make decisions, or make predictions. For example,
in agriculture, machine learning is used to increase agricultural production as with
predicting pest plants [8]. Another example is in the business world, where
machine learning is used to predict the stock market and stock price index
movement [9,10].
Considering the rapid use of machine learning and the many studies about
it, this research needs to be conducted. This research aims to determine the
techniques and problems in the use of machine learning. The result of this study is
expected to be a reference to conduct research in the future.
2. Methodology
The method of this research is performed by adapting the systematic
literature review procedure including planning, conducting and reporting the
review given by [11]. In the planning phase will be designed the review protocol
to be conducted including the following steps: research questions identification,
search strategy design, data collection and data analysis. The process of
systematic literature review can be seen in Figure 1.
4. 4
Figure 1. Systematic Literature Review Process (adapted from [11])
In the first step necessary to identify research questions that will be
answered in the systematic literature review. In the second step will be described
search strategy including search keyword identification and selection of sources to
be searched. In the third step will be carried out the collection of relevant studies
based on the research questions. In the next step, will be conducted an analysis of
the data which previously collected.
a. Research question
There are 4 questions to be answered on this research. Here is a list of
questions and objectives of this research.
Table 1. Research questions and objectives
ID Research Questions Objectives
RQ1
What are the techniques of machine
learning?
Identify the machine learning
techniques commonly being used
RQ2
What are the uses of machine
learning techniques?
Identify the use of machine
learning techniques
RQ3
Which data types are used for
machine learning techniques?
Identify appropriate data type for
machine learning techniques
Planning the review
- Identi fy interest of review
- Designing review protocol
Conducting the review
1. Identify research question
2. Designing search strategy
3. Collecting data
4. Data analysis
Reporting the result of review
5. 5
RQ4
What are the strengths and
weaknesses of the machine learning
techniques?
Identify the performances of
machine learning techniques.
b. Search strategy
In this study used a keyword search âmachine learning techniqueâ on three
electronic databases online that are Elsevier, ProQuest, and IEEE. Searches are
limited from 2010 through 2014. Here are the search results for the three sources.
Table 2. Electronic database and search results
ID Databases URL Results (literatures)
DS1 Elsevier http://sciencedirect.com/ 3869
DS2 ProQuest http://search.proquest.com/ 2047
DS3 IEEE http://ieeexplore.ieee.org/ 965
Research on machine learning techniques seem more rapidly seen from the
many studies that have been performed. The amount of research on machine
learning looks increasingly from year to year. Growth in the total of research on
machine learning techniques can be seen in Figure 2. Its composition per year can
be seen in Figure 3.
Figure 2. Total research year 2010-2014
0
250
500
750
1000
1250
2010 2011 2012 2013 2014
Elsevier
ProQuest
IEEE
6. 6
Figure 3. Percentage of total research year 2010-2014
c. Data collection
Details of data to be collected from each selected literature should be able
to answer all the questions of the research. Here are the fields of the data to be
collected from the literature.
Table 3. Fields of data collection
ID Fields
F1 Author (s)
F2 Title
F3 Keyword
F4 Year
F5 Type (Journal/Conference)
F6 Publisher
F7 Machine learning techniques
F8 Use of machine learning techniques
F9 Appropriate data type for machine learning techniques
12,15 13,34 16,48
15,74 16,37
18,76
17,47
23,20
18,45
22,51
26,28 27,88
32,13
20,81 18,45
0%
20%
40%
60%
80%
100%
Elsevier ProQuest IEEE
2014
2013
2012
2011
2010
7. 7
F10 Strengths of the machine learning techniques
F11 Weaknesses of the machine learning techniques
d. Data analysis
In this research as much as 44 selected literatures will be analyzed which
17 literatures taken from Elsevier, 12 from ProQuest, and 15 of the IEEE. While
the composition of the literature based on the year are as follows: 2014 as 17;
2013 as 10; 2012 as 9; 2010 as 5; and 2010 as 3 literatures. Here is a list of
literature that will be analyzed.
Table 4. Selected literatures
ID Author Year Source Type Ref
S1 Paokanta et al. 2010 IEEE Conference [1]
S2 MartÃnez et al. 2014 Elsevier Journal [2]
S3 Kourou et al. 2014 Elsevier Journal [3]
S4 Asadi et al. 2014 ProQuest Journal [4]
S5 Danger et al. 2010 Elsevier Journal [5]
S6 Urquiza et al. 2012 Elsevier Journal [6]
S7 Caravaca et al. 2013 Elsevier Journal [7]
S8 Kim et al. 2013 Elsevier Conference [8]
S9 Patel et al. 2014 Elsevier Journal [9]
S10 Patel et al. 2014 Elsevier Journal [10]
S11 Ajila et al. 2013 IEEE Conference [12]
S12 Alsri et al. 2014 IEEE Conference [13]
S13 Bal et al. 2014 ProQuest Journal [14]
S14 BÊlisle et al. 2014 Elsevier Journal [15]
S15 Betrie et al. 2012 ProQuest Journal [16]
S16 Bhutani 2014 ProQuest Journal [17]
S17 Bohn et al. 2013 Elsevier Conference [18]
8. 8
S18 Chaturvedi et al. 2012 IEEE Conference [19]
S19 Chaudhary et al. 2012 IEEE Conference [20]
S20 Costea 2014 Elsevier Conference [21]
S21 Daybelge et al. 2010 ProQuest Journal [22]
S22 Delen et al. 2012 Elsevier Journal [23]
S23 Fernandes et al. 2014 Elsevier Journal [24]
S24 Frid et al. 2014 IEEE Conference [25]
S25 Holzinger et al. 2014 IEEE Conference [26]
S26 Hosseinifard et al. 2011 IEEE Conference [27]
S27 Huang et al. 2013 ProQuest Journal [28]
S28 Kanewala et al. 2013 IEEE Conference [29]
S29 Kazemian et al. 2014 Elsevier Journal [30]
S30 Lin et al. 2014 IEEE Conference [31]
S31 Ludtke et al. 2011 Elsevier Journal [32]
S32 Oudendag et al. 2012 ProQuest Journal [33]
S33 Oztekin et al. 2013 Elsevier Journal [34]
S34 Panigrahi 2012 IEEE Conference [35]
S35 Pereira et al. 2012 Elsevier Journal [36]
S36 Sarina et al. 2011 ProQuest Journal [37]
S37 Sathyadevan et al. 2014 IEEE Conference [38]
S38 Schuster et al. 2011 IEEE Conference [39]
S39 Silva et al. 2013 IEEE Conference [40]
S40 Singh et al. 2013 IEEE Conference [41]
S41 Suh et al. 2011 ProQuest Journal [42]
S42 Talbi 2013 ProQuest Journal [43]
S43 Yuan et al. 2014 ProQuest Journal [44]
S44 Zhang et al. 2012 ProQuest Journal [45]
9. 9
Furthermore, from the selected literature can be taken keywords used. The
most frequently used keywords can be seen in Figure 4. Machine learning is used
in keyword of almost all the selected literature.
Figure 4. Keywords from selected literatures
3. Result and Discussion
After analyzing the data collected from a variety of selected literature. It
will answer some of the questions that have been the main objective of this
research. Here is the result and discussion.
a. RQ1: What are the techniques of machine learning?
Objective of this research question is to identify the machine learning
technique commonly being used. Five of the techniques often used are Support
Vector Machines (SVM), Artificial Neural Network (ANN), Naive-Bayes (NB),
Decision Trees (DT), and k-Nearest Neighbors (k-NN). The distribution of
machine learning techniques which used in several selected literature can be seen
in Table 5.
10. 10
Table 5. Distribution of machine learning techniques
No Machines Learning Techniques %
1 Support Vector Machines (SVM) 14.46
2 Artificial Neural Network (ANN) 10.84
3 Naive-Bayes (NB) 10.24
4 Decision Trees (DT) 7.83
5 k-Nearest Neighbors (k-NN) 5.42
6 Random Forest (RF) 4.22
7 Bayesian Networks (BN) 3.61
8 Multi-Layer Perceptron (MLP) 3.01
9 k-Means 2.41
10 Logistic Regression (LR) 2.41
11 Multiple Linear Regression (MLR) 1.81
12 Adaboost 1.20
13 Bootstrap Aggregation (Bagging) 1.20
14 Case Based Reasoning (CBR) 1.20
15 Classification and Regression Trees (CART) 1.20
16 Others 28.92
Total 100.00
The first position, a technique often used is SVM while predominantly
developed by Vapnik in 1998, Cherkassky and Mulier in 2007 [16]. SVM
increasingly successfully used in real world applications because it has a good
theoretical basis [20]. SVM theoretical foundation derived from statistical
learning theory which then combined with machine learning techniques [23].
11. 11
b. RQ2: What are the uses of machine learning techniques?
Machine learning techniques are used for classification, prediction,
clustering, ranking, and feature selection. The proportion of machine learning
usage can be seen in Figure 5. The five techniques are often used in the
classification is NB, SVM, DT, k-NN, BN, and MLP. Furthermore, the five
techniques that are often used in prediction are SVM, ANN, DT, RF, k-NN. While
in the clustering, k-Means is the technique most often used.
Figure 5. Proportion of machine learning usage
c. RQ3: Which data types are used for machine learning techniques?
The data type which often used in machine learning technique is structured
data that is in interval, nominal or ordinal scale. However, several studies have
been conducted to use machine learning technique in unstructured data. For
example, is used for identifying moving bodies from videos [38]. In addition,
machine learning technique is also used to analyze other data types such as image
[32,36,44], audio [40], and other e-document [31,35,37,39].
Classification
46,67%
Clustering
8,89%
Feature
Selection
2,22%
Prediction
37,78%
Rangking
4,44%
12. 12
d. RQ4: What are the strengths and weaknesses of the machine learning
techniques?
Some research indicates that SVM is a machine learning technique with
the best accuracy in comparison with other techniques [9,16,19,32,40,45]. But
besides SVM, several other studies showed that ANN is better, especially for
calculating the correlation between variables [34,35]. To get better results in the
accuracy and correlation, can be used hybrid method by combining several
techniques in several stages [10]. However, besides having strength in accuracy
and correlation, SVM and ANN technique has its own weakness where this
technique requires sufficient time to complete the process [16].
4. Conclusion
The use of machine learning technique in the analysis has been growing
rapidly especially to perform classification, prediction, and clustering. Machine
learning technique can be used in structured or unstructured data type. The results
of this research showed that the most often used technique is the SVM because
this technique gives better accuracy than other techniques. Some techniques such
as SVM and ANN can be combined into several stages of analysis to provide
better accuracy and correlation. However, this technique has limitations in terms
of time to complete the process which still requires considerable time.
5. Future Work
Direction of future work can be focused to obtain larger data. Large data
needs not only to obtain the number of samples to be used in the machine learning
process. But it is also to obtain more statistical parameters as input to find a much
better correlation. Currently, the need for larger data represented in big data
analysis. When dealing with big data, not only pay attention to the technique used
to perform the analysis, but also need to pay attention to develop the feature
selection technique in order to obtain the effective and efficient parameters that
will be used in the analysis.
13. 13
References
[1] P. Paokanta, M. Ceccarelli, S. Srichairatanakool, âThe Effeciency of Data
Types for Classification Performance of Machine Learning Techniques for
Screening β-Thalassemiaâ, International Symposium on Applied Sciences
in Biomedical and Communication Technologies, 2010.
[2] J.M. MartÃnez, P.E. Montero, C. Barbieri, E.S.Olivas, F. Mari, M.M.
Sober, C. Amato, A.J.S. LÃŗpez, M. Bassi, R.M. Benedito, A. Stopper,
J.D.M. Guerrero, E. Gatti, âPrediction of the hemoglobin level in
hemodialysis patients using machine learning techniquesâ, International
Journal of Computer Methods and Programs in Biomedicine, 2014.
[3] K. Kourou, T.P. Exarchos, K.P. Exarchos, M.V. Karamouzis, D.I.
Fotiadis, âMachine learning applications in cancer prognosis and
predictionâ, Computational and Structural Biotechnology Journal, 2014.
[4] H. Asadi, R. Dowling, B. Yan, P. Mitchell, âMachine Learning for
Outcome Prediction of Acute Ischemic Stroke Post Intra-Arterial
Therapyâ, Public Library of Science Scholarly Journal, 2014.
[5] R. Danger, I.S. Bedmar, P. MartÃnez, P. Rosso, âA comparison of machine
learning techniques for detection of drug target articlesâ, International
Journal of Biomedical Informatics, 2010.
[6] J.M. Urquiza, I.Rojas, H.Pomares, J.Herrera, J. P.Florido, O.Valenzuela,
M.Cepero, âUsing machine learning techniques and genomic/proteomic
information from known databases for defining relevant features for PPI
classificationâ, International Journal of Computers in Biology and
Medicine, 2012.
[7] J. Caravaca, E.S. Olivas, M. Bataller, A.J. Serrano, L.S. Miquel, J.V.
FrancÊs, J.F. Guerrero, âApplication of machine learning techniques to
analyse the effects of physical exercise in ventricular fibrillationâ,
International Journal of Computers in Biology and Medicine, 2013.
[8] Y.H. Kim, S.J. Yoo, Y.H. Gu, J.H. Lim, D. Han, S.W. Baik, âCrop Pests
Prediction Method Using Regression and Machine Learning Technology
Surveyâ, International Conference on Future Software Engineering and
Multimedia Engineering, 2013.
[9] J. Patel, S. Shah, P. Thakkar, K. Kotecha, âPredicting stock and stock price
index movement using Trend Deterministic Data Preparation and machine
learning techniquesâ, International Journal of Expert System with
Applications, 2014.
[10] J. Patel, S. Shah, P. Thakkar, K. Kotecha, âPredicting stock market index
using fusion of machine learning techniquesâ, International Journal of
Expert System with Applications, 2014.
14. 14
[11] B.A. Kitchenham, âGuidelines for performing systematic literature review
in software engineeringâ, Technical report EBSE-2007-001, UK, 2007.
[12] S.A. Ajila, A.A. Bankole, âCloud Client Prediction Models Using
Machine Learning Techniquesâ, Annual Computer Software and
Applications Conference, 2013.
[13] Alsri, S. Almuhammadi, S. Mahmood, âA model for work distribution in
global software development based on machine learning techniquesâ,
Science and Information Conference, 2014.
[14] M. Bal, M.F. Amasyali, H. Sever, G. Kose, A. Demirhan, âPerformance
Evaluation of the Machine Learning Algorithms Used in Inference
Mechanism of a Medical Decision Support Systemâ, The Scientific World
Journal, 2014.
[15] E. BÊlisle, Z. Huang, S.L. Digabel, A.E. Gheribi, âEvaluation of machine
learning interpolation techniques for prediction of physical propertiesâ,
International Journal of Computational Materials Science, 2014.
[16] G.D. Betrie, S. Tesfamariam, K.A. Morin, R. Sadiq, âPredicting copper
concentrations in acid mine drainage a comparative analysis of five
machine learning techniquesâ, Springer Science & Business Media
Scholarly Journals, 2012.
[17] G. Bhutani, âApplication of Machine-Learning Based Prediction
Techniques in Wireless Networksâ, International Journal of
Communications, Network and System Sciences, 2014.
[18] B. Bohn, J. Garcke, R.I. Teran, A. Paprotny, B. Peherstorfer, U.
Schepsmeier, C.A. Thole, âAnalysis of Car Crash Simulation Data with
Nonlinear Machine Learning Methodsâ, International Conference on
Computational Science, 2013.
[19] K.K. Chaturvedi, V.B. Singh, âDetermining Bug severity using machine
learning techniquesâ, International Conference on Software Engineering,
2012.
[20] Chaudhary, S. Kolhe, R. Kamal, âMachine learning techniques for Mobile
Intelligent Systems: A studyâ, International Conference on Wireless and
Optical Communications Networks, 2012.
[21] Costea, âApplying Fuzzy Logic and Machine Learning Techniques in
Financial Performance Predictionsâ, International Conference on Applied
Statistics, 2014.
[22] T. Daybelge, I. Cicekli, âA ranking method for example based machine
translation results by learning from user feedbackâ, Springer Science &
Business Media Scholarly Journals, 2010.
[23] D. Delen, H. Zaim, C. Kuzey, S. Zaim, âA comparative analysis of
machine learning systems for measuring the impact of knowledge
15. 15
management practicesâ, International Journal of Decision Support
Systems, 2012.
[24] J.A. Fernandes, X. Irigoien, J.A. Lozano, I. Inza, N. Goikoetxea, A. PÊrez,
âEvaluating machine-learning techniques for recruitment forecasting of
seven North East Atlantic fish speciesâ, International Journal of Ecological
Informatics, 2014.
[25] Frid, E.J. Safra, H. Hazan, L.L. Lokey, D. Hilu, L. Manevitz, L.O. Ramig,
S. Sapir, âComputational Diagnosis of Parkinson's Disease Directly from
Natural Speech Using Machine Learning Techniquesâ, International
Conference on Software Science, Technology and Engineering, 2014.
[26] Holzinger, D. Blanchard, M. Bloice, K. Holzinger, V. Palade, R. Rabadan,
âDarwin, Lamarck, or Baldwin: Applying Evolutionary Algorithms to
Machine Learning Techniquesâ, International Joint Conferences on Web
Intelligence and Intelligent Agent Technologies, 2014.
[27] Hosseinifard, M.H. Moradi, R. Rostami, âClassifying depression patients
and normal subjects using machine learning techniquesâ, Iranian
Conference on Electrical Engineering, 2011.
[28] M.J. Huang, H.K. Chiang, P.F. Wu, Y.J. Hsieh, âA multi-strategy machine
learning student modeling for intelligent tutoring systems Based on
blackboard approachâ, Emerald Group Publishing Limited Scholarly
Journals, 2013.
[29] U. Kanewala, J.M. Bieman, âUsing Machine Learning Techniques to
Detect Metamorphic Relations for Programs without Test Oraclesâ,
International Symposium on Software Reliability Engineering, 2013.
[30] H.B. Kazemian, S. Ahmed, âComparisons of machine learning techniques
for detecting malicious webpagesâ, International Journal of Expert System
with Applications, 2014.
[31] M. Lin, D.W. Cheung, âAutomatic Tagging Web Services Using Machine
Learning Techniquesâ, International Joint Conferences on Web
Intelligence and Intelligent Agent Technologies, 2014.
[32] Ludtke, K. Jerosch, O. Herzog, M. Schluter, âDevelopment of a machine
learning technique for automatic analysis of seafloor image data: Case
example, Pogonophora coverage at mud volcanoesâ, International Journal
of Computers & Geosciences, 2011.
[33] Oudendag, Z. SzlÃĄvik, V.V. Hennie, âThe use of Machine Learning
techniques to predict farm size change - An implementation in the Dutch
Dairy sectorâ, American Academic & Scholarly Research Journal, 2012.
[34] Oztekin, D. Delen, A. Turkyilmaz, S. Zaim, âA machine learning-based
usability evaluation method for eLearning systemsâ, International Journal
of Decision Support Systems, 2013.
16. 16
[35] P.K. Panigrahi, âA Comparative Study of Supervised Machine Learning
Techniques for Spam E-mail Filteringâ, International Conference on
Computational Intelligence and Communication Networks, 2012.
[36] L.A.M. Pereira, R.Y.M. Nakamura, G.F.S. de Souza, D. Martins, J. P.
Papa, âAquatic weed automatic classification using machine learning
techniquesâ, International Journal of Computers and Electronics in
Agriculture, 2012.
[37] S. Sarina, S.S. Mariyam, A. Ajith, S. Shahida, âIntelligent Web Caching
using Machine Learning Methodsâ, Institute of Computer Science Czech
Republic Scholarly Journals, 2011.
[38] S. Sathyadevan, A.K. Balakrishnan, S. Arya, S.A. Raghunath, âIdentifying
Moving Bodies from CCTV Videos using Machine Learning Techniquesâ,
International Conference on Networks & Soft Computing, 2014.
[39] E.W. Schuster, S. Kumar, S.E. Sarma, J.L. Willers, G.A. Milliken,
âInfrastructure for Data-Driven Agriculture: Identifying Management
Zones for Cotton using Statistical Modeling and Machine Learning
Techniquesâ, International Conference & Expo on Emerging Technologies
for a Smarter World, 2011.
[40] D.F. Silva, V.M.A. de Souza, G.E.A.P.A. Batista, E. Keogh, D.P.W. Ellis,
âApplying Machine Learning and Audio Analysis Techniques to Insect
Recognition in Intelligent Trapsâ, International Conference on Machine
Learning and Applications, 2013.
[41] S. Singh, S.P. Lal, âEducational courseware evaluation using Machine
Learning techniquesâ, Conference on e-Learning, e-Management and e-
Services, 2013.
[42] M. Suh, A. Nahapetian, J. Woodbridge, M. Rofouei, M. Sarrafzadeh,
âMachine Learning-Based Adaptive Wireless Interval Training Guidance
Systemâ, Springer Science & Business Media Scholarly Journals, 2011.
[43] E.G. Talbi, âCombining metaheuristics with mathematical programming,
constraint programming and machine learningâ, Springer Science &
Business Media Scholarly Journals, 2013.
[44] T. Yuan, X. Zheng, X. Hu, W. Zhou, W. Wang, âA Method for the
Evaluation of Image Quality According to the Recognition Effectiveness
of Objects in the Optical Remote Sensing Image Using Machine Learning
Algorithmâ, Public Library of Science Scholarly Journal, 2014.
[45] M. Zhang, G. Johnson, J. Wang, âPredicting Takeover Success Using
Machine Learning Techniquesâ, Journal of Business & Economics
Research, 2012.