SlideShare a Scribd company logo
1 of 15
A Review of Hybrid Data Mining Algorithm for
Big Data Mining
Presented By
PRASANTA KUMAR PAUL
RESEARCH SCHOLAR
AIIT
AMITY UNIVERSITY RAJASTHAN
First International Conference on Smart
Technologies in Computer and
Communication (SmartTech-2017)
Under the guidance of
DR. SONALI VYAS
ASSISTANT PROFESSOR
AIIT
AMITY UNIVERSITY RAJASTHAN
What is …… ?
• Hybrid Data Mining
‣ Hybrid data mining algorithm can be presented as a combination of different
classifiers. The classification ability of data mining algorithm are different, this why
combining them may increase the performance of the system in term of accuracy.
But they must be well chosen. There are other approach which are more general
Boosting and Bagging. They are very interesting and can be efficient. An example
of application in image processing is the face detection in real time using
Adaboost.
LITERATURE SURVEY
 P Thamilselvan Image classification using hybrid data mining algorithm.
 Deshmukh, A. P., & Pamu, K. S. (2012).Introduction to Hadoop distributed file system .
 Feilong Cao, proposed a new algorithm, combination of Extreme K-Means (EKM) and
Effective Extreme Learning Machine (EELM)
 Alireza Taravat et al, introduced a new hybrid algorithm for automatic cloud
detection in a complete-sky image.
 M.R. et al. [10] in this study they presented a hybrid algorithm using Support Vector
Machine (SVM) and K-nearest neighbor (KNN) algorithm.
RELATED HYBRID ALGORITHMS FOR
BIG DATA MINING
 Hybrid evolutionary clustering with empty clustering solution (H (EC) 2 S)
 RC Part (Representative Construction):
 EFC Part (Enhanced Fireworks algorithm for clustering):
 CSC Part (Cuckoo search for clustering):
Hybrid evolution clustering with empty clustering solution (H (EC) 2 S) indicates better precision when contrasted with other hybrid
approaches.
RELATED HYBRID ALGORITHMS FOR
BIG DATA MINING
 Hybrid Clustering Algorithm (HBCA) using BIRCH and K-Means
Hybrid Clustering Algorithm (HBCA) using BRICH and K-Means, This proposed method gives better
performance then K-Means and K-medoid. By using WEKA datamining tool.
RELATED HYBRID ALGORITHMS FOR
BIG DATA MINING
 GA/DT Hybrid data mining algorithm
GA/DT Hybrid data mining algorithm, This proposed
method gives 20 % more effective then the decision tree
and genetic programming individually.
RELATED HYBRID ALGORITHMS FOR
BIG DATA MINING
 VAMR Algorithm- Vertical-Apriori MapReduce algorithm
Initial scan
Producing frequent 1-item
set and its TID set
Producing frequent (K+1)
item set
More Applicants
END
RELATED HYBRID ALGORITHMS FOR
BIG DATA MINING
 Apriori-MapReduce Algorithm
 Apriori algorithm is redesigned into a map
reduce platform; therefore increase the
efficiency upto 15 %.
RELATED HYBRID ALGORITHMS FOR
BIG DATA MINING
 Hybrid GA-SVM model
COMPARISON OF DIFFERENT HYBRID DATA MINING
ALGORITHMS BASED ON IMAGE CLASSIFICATION
Table
1
Narration of Hybrid Algorithm (Base on Image Classification)
S.No Proposed hybrid Approach Purpose of development Draw backs
1 Genetic Algorithm and Support Vector Machine To reduce the dimensionality and
optimize the classification process
Display the high error rate.
2 Decision Tree and Naive Bayes To improve the classification
accuracy of multi class problem
Given less compact Solution.
3 Extreme K-Means and Effective Extreme
learning Machine
To improve the classification
accuracy
Process rate is very slow for Training.
4 Naïve Bayes and Support Vector machine To improve the performance of
specificity and sensitivity
Several key parameters needed to
achieve the best classification result.
5 Support Vector Machine and Classification
regression tree
To identify the age band of 2D image
face.
The regression provide highly confusion
CONCLUSION AND FUTURE WORK
 The proposed Methodology provides a comprehensive knowledge about how to
deal with large datasets. The methodology is easy but requires good knowledge of
data mining.
 From this review the hybrid method Hybrid evolution clustering with empty clustering
solution (H (EC) 2 S) indicates better precision when contrasted with other hybrid
approaches.
 In future, we means to consolidate at least two data mining methods. By applying
the proposed hybrid technique, it is planned to discover better classification precision
and besides, reduce the computational time complexity then another hybrid
method.
REFERENCES
 Cui, X., Yang, S., & Wang, D. (2016, August). An algorithm of apriori based on medical big data and cloud computing. In Cloud Computing and Intelligence
Systems (CCIS), 2016 4th International Conference on (pp. 361-365). IEEE.
 Grami, M., Gheibi, R., & Rahimi, F. (2016, September). A novel association rule mining using genetic algorithm. In Information and Knowledge Technology (IKT),
2016 Eighth International Conference on (pp. 200-204). IEEE.
 Afzali, M., Singh, N., & Kumar, S. (2016, March). Hadoop-MapReduce: A platform for mining large datasets. In Computing for Sustainable Global Development
(INDIACom), 2016 3rd International Conference on (pp. 1856-1860). IEEE.
 Azizi, N., Zemmal, N., Sellami, M., & Farah, N. (2014, April). A new hybrid method combining genetic algorithm and support vector machine classifier:
Application to CAD system for mammogram images. In Multimedia Computing and Systems (ICMCS), 2014 International Conference on (pp. 415-420). IEEE.
 Cao, F., Liu, B., & Park, D. S. (2013). Image classification based on effective extreme learning machine. Neurocomputing, 102, (pp.90-97) ELSEVIER.
 Yannick, L. L., Sebastien, P., & Djamel, M. (2013, September). Combining regression and classification methods for age band estimation from human faces.
In 2013 8th International Symposium on Image and Signal Processing and Analysis (ISPA) (pp. 136-141). IEEE.
 Taravat, A., Del Frate, F., Cornaro, C., & Vergari, S. (2015). Neural networks and support vector machine algorithms for automatic cloud classification of whole-
sky ground-based images. IEEE Geoscience and remote sensing letters, 12(3), 666-670. IEEE.
 Thamilselvan, P., & Sathiaseelan, J. G. R. (2015). A Comparative Study of Data Mining Algorithms for Image Classification. I.J. Education and Management
Engineering, Modern Education and Computer Science Press (2), 1-9. IEEE.
 Thamilselvan, P., & Sathiaseelan, J. G. R. (2015, March). Image classification using hybrid data mining algorithms-a review. In Innovations in Information,
Embedded and Communication Systems (ICIIECS), 2015 International Conference on (pp. 1-6). IEEE.
 Na, S., Xumin, L., & Yong, G. (2010, April). Research on k-means clustering algorithm: An improved k-means clustering algorithm. In Intelligent Information
Technology and Security Informatics (IITSI), 2010 Third International Symposium on (pp. 63-67). IEEE.
REFERENCES
 Joshi, R., Patidar, A., & Mishra, S. (2011, April). Scaling k-medoid algorithm for clustering large categorical dataset and its performance analysis. In Electronics
Computer Technology (ICECT), 2011 3rd International Conference on (Vol. 2, pp. 117-121). IEEE.
 Kaur, J., & Singh, H. (2015, December). Performance evaluation of a novel hybrid clustering algorithm using birch and K-means. In 2015 Annual IEEE India
Conference (INDICON) (pp. 1-6). IEEE.
 Deshmukh, A. P., & Pamu, K. S. (2012). Introduction to Hadoop distributed file system. IJEIR, 1(2), 230-236.
 Woo, J. (2012, January). Apriori-Map/Reduce Algorithm. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and
Applications (PDPTA) (p. 1). The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (World Comp).
 Karimov, J., & Ozbayoglu, M. (2015, October). High quality clustering of big data and solving empty-clustering problem with an evolutionary hybrid algorithm.
In Big Data (Big Data), 2015 IEEE International Conference on (pp. 1473-1478). IEEE.
 Kaur, J., & Singh, H. (2015, December). Performance evaluation of a novel hybrid clustering algorithm using birch and K-means. In 2015 Annual IEEE India
Conference (INDICON) (pp. 1-6). IEEE.
 Carvalho, D. R., & Freitas, A. A. (2004). A hybrid decision tree/genetic algorithm method for data mining. Information Sciences, 163(1), 13-35.
 ] Dhaka, V. S., & Vyas, S. (2014). Analysis of Server Performance with Different Techniques of Virtual Databases. Journal of Emerging Trends in Computing and
Information Sciences, 5(10).
 Vyas, S. (2015). Analyzing Performance of Virtual and Non-Virtual database. Journal of Global Research Computer Science & Technology, 3(8)Pp 32-42.
Ppt for paper id 696 a review of hybrid data mining algorithm for big data mining
Ppt for paper id 696 a review of hybrid data mining algorithm for big data mining

More Related Content

What's hot

Top-K Dominating Queries on Incomplete Data with Priorities
Top-K Dominating Queries on Incomplete Data with PrioritiesTop-K Dominating Queries on Incomplete Data with Priorities
Top-K Dominating Queries on Incomplete Data with Prioritiesijtsrd
 
IMAGE GENERATION WITH GANS-BASED TECHNIQUES: A SURVEY
IMAGE GENERATION WITH GANS-BASED TECHNIQUES: A SURVEYIMAGE GENERATION WITH GANS-BASED TECHNIQUES: A SURVEY
IMAGE GENERATION WITH GANS-BASED TECHNIQUES: A SURVEYijcsit
 
Indian Monuments Classification using Support Vector Machine
Indian Monuments Classification using Support Vector Machine Indian Monuments Classification using Support Vector Machine
Indian Monuments Classification using Support Vector Machine IJECEIAES
 
Manasa_resume
Manasa_resumeManasa_resume
Manasa_resumeManasa JM
 
Association Rule Mining using RHadoop
Association Rule Mining using RHadoopAssociation Rule Mining using RHadoop
Association Rule Mining using RHadoopIRJET Journal
 
Demand-driven Gaussian window optimization for executing preferred population...
Demand-driven Gaussian window optimization for executing preferred population...Demand-driven Gaussian window optimization for executing preferred population...
Demand-driven Gaussian window optimization for executing preferred population...IJECEIAES
 
Using Embeddings for Dynamic Diverse Summarisation in Heterogeneous Graph Str...
Using Embeddings for Dynamic Diverse Summarisation in Heterogeneous Graph Str...Using Embeddings for Dynamic Diverse Summarisation in Heterogeneous Graph Str...
Using Embeddings for Dynamic Diverse Summarisation in Heterogeneous Graph Str...Niki Pavlopoulou
 
Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2IAEME Publication
 
EDBT 2015: Summer School Overview
EDBT 2015: Summer School OverviewEDBT 2015: Summer School Overview
EDBT 2015: Summer School Overviewdgarijo
 
The MGI and AI
The MGI and AIThe MGI and AI
The MGI and AIaimsnist
 
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...Edureka!
 
10 Algorithms in data mining
10 Algorithms in data mining10 Algorithms in data mining
10 Algorithms in data miningGeorge Ang
 
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...IJECEIAES
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrievalijtsrd
 
Leveraging Deep Learning Representation for search-based Image Annotation
Leveraging Deep Learning Representation for search-based Image AnnotationLeveraging Deep Learning Representation for search-based Image Annotation
Leveraging Deep Learning Representation for search-based Image Annotationmahyamk
 
AIAA Future of Fluids 2018 Balaji
AIAA Future of Fluids 2018 BalajiAIAA Future of Fluids 2018 Balaji
AIAA Future of Fluids 2018 BalajiQiqi Wang
 

What's hot (20)

Top-K Dominating Queries on Incomplete Data with Priorities
Top-K Dominating Queries on Incomplete Data with PrioritiesTop-K Dominating Queries on Incomplete Data with Priorities
Top-K Dominating Queries on Incomplete Data with Priorities
 
IMAGE GENERATION WITH GANS-BASED TECHNIQUES: A SURVEY
IMAGE GENERATION WITH GANS-BASED TECHNIQUES: A SURVEYIMAGE GENERATION WITH GANS-BASED TECHNIQUES: A SURVEY
IMAGE GENERATION WITH GANS-BASED TECHNIQUES: A SURVEY
 
Big Data Clustering Model based on Fuzzy Gaussian
Big Data Clustering Model based on Fuzzy GaussianBig Data Clustering Model based on Fuzzy Gaussian
Big Data Clustering Model based on Fuzzy Gaussian
 
Indian Monuments Classification using Support Vector Machine
Indian Monuments Classification using Support Vector Machine Indian Monuments Classification using Support Vector Machine
Indian Monuments Classification using Support Vector Machine
 
Manasa_resume
Manasa_resumeManasa_resume
Manasa_resume
 
Association Rule Mining using RHadoop
Association Rule Mining using RHadoopAssociation Rule Mining using RHadoop
Association Rule Mining using RHadoop
 
Demand-driven Gaussian window optimization for executing preferred population...
Demand-driven Gaussian window optimization for executing preferred population...Demand-driven Gaussian window optimization for executing preferred population...
Demand-driven Gaussian window optimization for executing preferred population...
 
Using Embeddings for Dynamic Diverse Summarisation in Heterogeneous Graph Str...
Using Embeddings for Dynamic Diverse Summarisation in Heterogeneous Graph Str...Using Embeddings for Dynamic Diverse Summarisation in Heterogeneous Graph Str...
Using Embeddings for Dynamic Diverse Summarisation in Heterogeneous Graph Str...
 
Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2Dynamic approach to k means clustering algorithm-2
Dynamic approach to k means clustering algorithm-2
 
EDBT 2015: Summer School Overview
EDBT 2015: Summer School OverviewEDBT 2015: Summer School Overview
EDBT 2015: Summer School Overview
 
50120140503019
5012014050301950120140503019
50120140503019
 
50120130406022
5012013040602250120130406022
50120130406022
 
The MGI and AI
The MGI and AIThe MGI and AI
The MGI and AI
 
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
K-Means Clustering Algorithm - Cluster Analysis | Machine Learning Algorithm ...
 
10 Algorithms in data mining
10 Algorithms in data mining10 Algorithms in data mining
10 Algorithms in data mining
 
50120130406008
5012013040600850120130406008
50120130406008
 
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
Extensive Analysis on Generation and Consensus Mechanisms of Clustering Ensem...
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
Leveraging Deep Learning Representation for search-based Image Annotation
Leveraging Deep Learning Representation for search-based Image AnnotationLeveraging Deep Learning Representation for search-based Image Annotation
Leveraging Deep Learning Representation for search-based Image Annotation
 
AIAA Future of Fluids 2018 Balaji
AIAA Future of Fluids 2018 BalajiAIAA Future of Fluids 2018 Balaji
AIAA Future of Fluids 2018 Balaji
 

Similar to Ppt for paper id 696 a review of hybrid data mining algorithm for big data mining

Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsSurvey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsIRJET Journal
 
SCCAI- A Student Career Counselling Artificial Intelligence
SCCAI- A Student Career Counselling Artificial IntelligenceSCCAI- A Student Career Counselling Artificial Intelligence
SCCAI- A Student Career Counselling Artificial Intelligencevivatechijri
 
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...aciijournal
 
deeplearningpresentation-180625071236.pptx
deeplearningpresentation-180625071236.pptxdeeplearningpresentation-180625071236.pptx
deeplearningpresentation-180625071236.pptxJeetDesai14
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaIJERA Editor
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaIJERA Editor
 
A h k clustering algorithm for high dimensional data using ensemble learning
A h k clustering algorithm for high dimensional data using ensemble learningA h k clustering algorithm for high dimensional data using ensemble learning
A h k clustering algorithm for high dimensional data using ensemble learningijitcs
 
Recent Database Management Systems Research Articles - September 2020
Recent Database Management Systems Research Articles - September 2020Recent Database Management Systems Research Articles - September 2020
Recent Database Management Systems Research Articles - September 2020ijdms
 
Trends in Advanced Computing in 2020 - Advanced Computing: An International J...
Trends in Advanced Computing in 2020 - Advanced Computing: An International J...Trends in Advanced Computing in 2020 - Advanced Computing: An International J...
Trends in Advanced Computing in 2020 - Advanced Computing: An International J...acijjournal
 
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKMACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKAbhi Jit
 
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...IJERA Editor
 
Using R for Classification of Large Social Network Data
Using R for Classification of Large Social Network DataUsing R for Classification of Large Social Network Data
Using R for Classification of Large Social Network DataIJCSIS Research Publications
 
A Novel Integrated Framework to Ensure Better Data Quality in Big Data Analyt...
A Novel Integrated Framework to Ensure Better Data Quality in Big Data Analyt...A Novel Integrated Framework to Ensure Better Data Quality in Big Data Analyt...
A Novel Integrated Framework to Ensure Better Data Quality in Big Data Analyt...IJECEIAES
 
Estimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachEstimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachcsandit
 
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHcscpconf
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Classifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkClassifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkAI Publications
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET Journal
 
Qiu bosc2010
Qiu bosc2010Qiu bosc2010
Qiu bosc2010BOSC 2010
 

Similar to Ppt for paper id 696 a review of hybrid data mining algorithm for big data mining (20)

Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsSurvey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
 
SCCAI- A Student Career Counselling Artificial Intelligence
SCCAI- A Student Career Counselling Artificial IntelligenceSCCAI- A Student Career Counselling Artificial Intelligence
SCCAI- A Student Career Counselling Artificial Intelligence
 
Deep learning presentation
Deep learning presentationDeep learning presentation
Deep learning presentation
 
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
 
deeplearningpresentation-180625071236.pptx
deeplearningpresentation-180625071236.pptxdeeplearningpresentation-180625071236.pptx
deeplearningpresentation-180625071236.pptx
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social Media
 
Frequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social MediaFrequent Item set Mining of Big Data for Social Media
Frequent Item set Mining of Big Data for Social Media
 
A h k clustering algorithm for high dimensional data using ensemble learning
A h k clustering algorithm for high dimensional data using ensemble learningA h k clustering algorithm for high dimensional data using ensemble learning
A h k clustering algorithm for high dimensional data using ensemble learning
 
Recent Database Management Systems Research Articles - September 2020
Recent Database Management Systems Research Articles - September 2020Recent Database Management Systems Research Articles - September 2020
Recent Database Management Systems Research Articles - September 2020
 
Trends in Advanced Computing in 2020 - Advanced Computing: An International J...
Trends in Advanced Computing in 2020 - Advanced Computing: An International J...Trends in Advanced Computing in 2020 - Advanced Computing: An International J...
Trends in Advanced Computing in 2020 - Advanced Computing: An International J...
 
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKMACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
 
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
 
Using R for Classification of Large Social Network Data
Using R for Classification of Large Social Network DataUsing R for Classification of Large Social Network Data
Using R for Classification of Large Social Network Data
 
A Novel Integrated Framework to Ensure Better Data Quality in Big Data Analyt...
A Novel Integrated Framework to Ensure Better Data Quality in Big Data Analyt...A Novel Integrated Framework to Ensure Better Data Quality in Big Data Analyt...
A Novel Integrated Framework to Ensure Better Data Quality in Big Data Analyt...
 
Estimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachEstimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approach
 
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Classifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkClassifier Model using Artificial Neural Network
Classifier Model using Artificial Neural Network
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
 
Qiu bosc2010
Qiu bosc2010Qiu bosc2010
Qiu bosc2010
 

More from Prasanta Paul

More from Prasanta Paul (8)

Aicte-List
Aicte-List Aicte-List
Aicte-List
 
ICard-Prasanta Kumar Paul
ICard-Prasanta Kumar PaulICard-Prasanta Kumar Paul
ICard-Prasanta Kumar Paul
 
Prasanta paul
Prasanta paulPrasanta paul
Prasanta paul
 
Prasanta Kumar Paul
Prasanta Kumar PaulPrasanta Kumar Paul
Prasanta Kumar Paul
 
Techno India Ramgarh
Techno India RamgarhTechno India Ramgarh
Techno India Ramgarh
 
ICardPrasanta
ICardPrasantaICardPrasanta
ICardPrasanta
 
Sl02 2x2 (1)
Sl02 2x2 (1)Sl02 2x2 (1)
Sl02 2x2 (1)
 
Coding
CodingCoding
Coding
 

Recently uploaded

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAbhinavSharma374939
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 

Recently uploaded (20)

MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Analog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog ConverterAnalog to Digital and Digital to Analog Converter
Analog to Digital and Digital to Analog Converter
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 

Ppt for paper id 696 a review of hybrid data mining algorithm for big data mining

  • 1. A Review of Hybrid Data Mining Algorithm for Big Data Mining Presented By PRASANTA KUMAR PAUL RESEARCH SCHOLAR AIIT AMITY UNIVERSITY RAJASTHAN First International Conference on Smart Technologies in Computer and Communication (SmartTech-2017) Under the guidance of DR. SONALI VYAS ASSISTANT PROFESSOR AIIT AMITY UNIVERSITY RAJASTHAN
  • 2. What is …… ? • Hybrid Data Mining ‣ Hybrid data mining algorithm can be presented as a combination of different classifiers. The classification ability of data mining algorithm are different, this why combining them may increase the performance of the system in term of accuracy. But they must be well chosen. There are other approach which are more general Boosting and Bagging. They are very interesting and can be efficient. An example of application in image processing is the face detection in real time using Adaboost.
  • 3. LITERATURE SURVEY  P Thamilselvan Image classification using hybrid data mining algorithm.  Deshmukh, A. P., & Pamu, K. S. (2012).Introduction to Hadoop distributed file system .  Feilong Cao, proposed a new algorithm, combination of Extreme K-Means (EKM) and Effective Extreme Learning Machine (EELM)  Alireza Taravat et al, introduced a new hybrid algorithm for automatic cloud detection in a complete-sky image.  M.R. et al. [10] in this study they presented a hybrid algorithm using Support Vector Machine (SVM) and K-nearest neighbor (KNN) algorithm.
  • 4. RELATED HYBRID ALGORITHMS FOR BIG DATA MINING  Hybrid evolutionary clustering with empty clustering solution (H (EC) 2 S)  RC Part (Representative Construction):  EFC Part (Enhanced Fireworks algorithm for clustering):  CSC Part (Cuckoo search for clustering): Hybrid evolution clustering with empty clustering solution (H (EC) 2 S) indicates better precision when contrasted with other hybrid approaches.
  • 5. RELATED HYBRID ALGORITHMS FOR BIG DATA MINING  Hybrid Clustering Algorithm (HBCA) using BIRCH and K-Means Hybrid Clustering Algorithm (HBCA) using BRICH and K-Means, This proposed method gives better performance then K-Means and K-medoid. By using WEKA datamining tool.
  • 6. RELATED HYBRID ALGORITHMS FOR BIG DATA MINING  GA/DT Hybrid data mining algorithm GA/DT Hybrid data mining algorithm, This proposed method gives 20 % more effective then the decision tree and genetic programming individually.
  • 7. RELATED HYBRID ALGORITHMS FOR BIG DATA MINING  VAMR Algorithm- Vertical-Apriori MapReduce algorithm Initial scan Producing frequent 1-item set and its TID set Producing frequent (K+1) item set More Applicants END
  • 8. RELATED HYBRID ALGORITHMS FOR BIG DATA MINING  Apriori-MapReduce Algorithm  Apriori algorithm is redesigned into a map reduce platform; therefore increase the efficiency upto 15 %.
  • 9. RELATED HYBRID ALGORITHMS FOR BIG DATA MINING  Hybrid GA-SVM model
  • 10. COMPARISON OF DIFFERENT HYBRID DATA MINING ALGORITHMS BASED ON IMAGE CLASSIFICATION Table 1 Narration of Hybrid Algorithm (Base on Image Classification) S.No Proposed hybrid Approach Purpose of development Draw backs 1 Genetic Algorithm and Support Vector Machine To reduce the dimensionality and optimize the classification process Display the high error rate. 2 Decision Tree and Naive Bayes To improve the classification accuracy of multi class problem Given less compact Solution. 3 Extreme K-Means and Effective Extreme learning Machine To improve the classification accuracy Process rate is very slow for Training. 4 Naïve Bayes and Support Vector machine To improve the performance of specificity and sensitivity Several key parameters needed to achieve the best classification result. 5 Support Vector Machine and Classification regression tree To identify the age band of 2D image face. The regression provide highly confusion
  • 11. CONCLUSION AND FUTURE WORK  The proposed Methodology provides a comprehensive knowledge about how to deal with large datasets. The methodology is easy but requires good knowledge of data mining.  From this review the hybrid method Hybrid evolution clustering with empty clustering solution (H (EC) 2 S) indicates better precision when contrasted with other hybrid approaches.  In future, we means to consolidate at least two data mining methods. By applying the proposed hybrid technique, it is planned to discover better classification precision and besides, reduce the computational time complexity then another hybrid method.
  • 12. REFERENCES  Cui, X., Yang, S., & Wang, D. (2016, August). An algorithm of apriori based on medical big data and cloud computing. In Cloud Computing and Intelligence Systems (CCIS), 2016 4th International Conference on (pp. 361-365). IEEE.  Grami, M., Gheibi, R., & Rahimi, F. (2016, September). A novel association rule mining using genetic algorithm. In Information and Knowledge Technology (IKT), 2016 Eighth International Conference on (pp. 200-204). IEEE.  Afzali, M., Singh, N., & Kumar, S. (2016, March). Hadoop-MapReduce: A platform for mining large datasets. In Computing for Sustainable Global Development (INDIACom), 2016 3rd International Conference on (pp. 1856-1860). IEEE.  Azizi, N., Zemmal, N., Sellami, M., & Farah, N. (2014, April). A new hybrid method combining genetic algorithm and support vector machine classifier: Application to CAD system for mammogram images. In Multimedia Computing and Systems (ICMCS), 2014 International Conference on (pp. 415-420). IEEE.  Cao, F., Liu, B., & Park, D. S. (2013). Image classification based on effective extreme learning machine. Neurocomputing, 102, (pp.90-97) ELSEVIER.  Yannick, L. L., Sebastien, P., & Djamel, M. (2013, September). Combining regression and classification methods for age band estimation from human faces. In 2013 8th International Symposium on Image and Signal Processing and Analysis (ISPA) (pp. 136-141). IEEE.  Taravat, A., Del Frate, F., Cornaro, C., & Vergari, S. (2015). Neural networks and support vector machine algorithms for automatic cloud classification of whole- sky ground-based images. IEEE Geoscience and remote sensing letters, 12(3), 666-670. IEEE.  Thamilselvan, P., & Sathiaseelan, J. G. R. (2015). A Comparative Study of Data Mining Algorithms for Image Classification. I.J. Education and Management Engineering, Modern Education and Computer Science Press (2), 1-9. IEEE.  Thamilselvan, P., & Sathiaseelan, J. G. R. (2015, March). Image classification using hybrid data mining algorithms-a review. In Innovations in Information, Embedded and Communication Systems (ICIIECS), 2015 International Conference on (pp. 1-6). IEEE.  Na, S., Xumin, L., & Yong, G. (2010, April). Research on k-means clustering algorithm: An improved k-means clustering algorithm. In Intelligent Information Technology and Security Informatics (IITSI), 2010 Third International Symposium on (pp. 63-67). IEEE.
  • 13. REFERENCES  Joshi, R., Patidar, A., & Mishra, S. (2011, April). Scaling k-medoid algorithm for clustering large categorical dataset and its performance analysis. In Electronics Computer Technology (ICECT), 2011 3rd International Conference on (Vol. 2, pp. 117-121). IEEE.  Kaur, J., & Singh, H. (2015, December). Performance evaluation of a novel hybrid clustering algorithm using birch and K-means. In 2015 Annual IEEE India Conference (INDICON) (pp. 1-6). IEEE.  Deshmukh, A. P., & Pamu, K. S. (2012). Introduction to Hadoop distributed file system. IJEIR, 1(2), 230-236.  Woo, J. (2012, January). Apriori-Map/Reduce Algorithm. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA) (p. 1). The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (World Comp).  Karimov, J., & Ozbayoglu, M. (2015, October). High quality clustering of big data and solving empty-clustering problem with an evolutionary hybrid algorithm. In Big Data (Big Data), 2015 IEEE International Conference on (pp. 1473-1478). IEEE.  Kaur, J., & Singh, H. (2015, December). Performance evaluation of a novel hybrid clustering algorithm using birch and K-means. In 2015 Annual IEEE India Conference (INDICON) (pp. 1-6). IEEE.  Carvalho, D. R., & Freitas, A. A. (2004). A hybrid decision tree/genetic algorithm method for data mining. Information Sciences, 163(1), 13-35.  ] Dhaka, V. S., & Vyas, S. (2014). Analysis of Server Performance with Different Techniques of Virtual Databases. Journal of Emerging Trends in Computing and Information Sciences, 5(10).  Vyas, S. (2015). Analyzing Performance of Virtual and Non-Virtual database. Journal of Global Research Computer Science & Technology, 3(8)Pp 32-42.