Submit Search
Upload
Improving Association Rule Mining by Defining a Novel Data Structure
•
0 likes
•
26 views
IRJET Journal
Follow
https://irjet.net/archives/V4/i7/IRJET-V4I7415.pdf
Read less
Read more
Engineering
Report
Share
Report
Share
1 of 4
Download now
Download to read offline
Recommended
Literature Survey on Buliding Confidential and Efficient Query Processing Usi...
Literature Survey on Buliding Confidential and Efficient Query Processing Usi...
paperpublications3
Mining Of Big Data Using Map-Reduce Theorem
Mining Of Big Data Using Map-Reduce Theorem
IOSR Journals
IRJET- Top-K Query Processing using Top Order Preserving Encryption (TOPE)
IRJET- Top-K Query Processing using Top Order Preserving Encryption (TOPE)
IRJET Journal
Analysis of Pattern Transformation Algorithms for Sensitive Knowledge Protect...
Analysis of Pattern Transformation Algorithms for Sensitive Knowledge Protect...
IOSR Journals
Fault tolerance on cloud computing
Fault tolerance on cloud computing
www.pixelsolutionbd.com
Range Query on Big Data Based on Map Reduce
Range Query on Big Data Based on Map Reduce
IJMER
Fast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data Analysis
IRJET Journal
Web Oriented FIM for large scale dataset using Hadoop
Web Oriented FIM for large scale dataset using Hadoop
dbpublications
Recommended
Literature Survey on Buliding Confidential and Efficient Query Processing Usi...
Literature Survey on Buliding Confidential and Efficient Query Processing Usi...
paperpublications3
Mining Of Big Data Using Map-Reduce Theorem
Mining Of Big Data Using Map-Reduce Theorem
IOSR Journals
IRJET- Top-K Query Processing using Top Order Preserving Encryption (TOPE)
IRJET- Top-K Query Processing using Top Order Preserving Encryption (TOPE)
IRJET Journal
Analysis of Pattern Transformation Algorithms for Sensitive Knowledge Protect...
Analysis of Pattern Transformation Algorithms for Sensitive Knowledge Protect...
IOSR Journals
Fault tolerance on cloud computing
Fault tolerance on cloud computing
www.pixelsolutionbd.com
Range Query on Big Data Based on Map Reduce
Range Query on Big Data Based on Map Reduce
IJMER
Fast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data Analysis
IRJET Journal
Web Oriented FIM for large scale dataset using Hadoop
Web Oriented FIM for large scale dataset using Hadoop
dbpublications
An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...
Editor IJMTER
Qo s aware scientific application scheduling algorithm in cloud environment
Qo s aware scientific application scheduling algorithm in cloud environment
Alexander Decker
Data characterization towards modeling frequent pattern mining algorithms
Data characterization towards modeling frequent pattern mining algorithms
csandit
Privacy Preserving Data Mining Technique to Recover Association Rules Using H...
Privacy Preserving Data Mining Technique to Recover Association Rules Using H...
IJSRED
Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...
redpel dot com
An effective classification approach for big data with parallel generalized H...
An effective classification approach for big data with parallel generalized H...
riyaniaes
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming data
Alexander Decker
Performance Analysis of Hashing Mathods on the Employment of App
Performance Analysis of Hashing Mathods on the Employment of App
IJECEIAES
Improved Map reduce Framework using High Utility Transactional Databases
Improved Map reduce Framework using High Utility Transactional Databases
International Journal of Engineering Inventions www.ijeijournal.com
Framework for reversible data hiding using cost-effective encoding system for...
Framework for reversible data hiding using cost-effective encoding system for...
IJECEIAES
Berlin Hadoop Get Together Apache Drill
Berlin Hadoop Get Together Apache Drill
MapR Technologies
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
ijgca
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
IRJET Journal
IJET-V3I2P24
IJET-V3I2P24
IJET - International Journal of Engineering and Techniques
Role of Operational System Design in Data Warehouse Implementation: Identifyi...
Role of Operational System Design in Data Warehouse Implementation: Identifyi...
iosrjce
PRIVACY PRESERVING DATA MINING BASED ON VECTOR QUANTIZATION
PRIVACY PRESERVING DATA MINING BASED ON VECTOR QUANTIZATION
ijdms
Understanding Big Data Platform from Patents
Understanding Big Data Platform from Patents
Alex G. Lee, Ph.D. Esq. CLP
Scalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehouses
IRJET Journal
Performance evaluation of Map-reduce jar pig hive and spark with machine lear...
Performance evaluation of Map-reduce jar pig hive and spark with machine lear...
IJECEIAES
Association Rule Mining using RHadoop
Association Rule Mining using RHadoop
IRJET Journal
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET Journal
Mining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce Framework
IRJET Journal
More Related Content
What's hot
An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...
Editor IJMTER
Qo s aware scientific application scheduling algorithm in cloud environment
Qo s aware scientific application scheduling algorithm in cloud environment
Alexander Decker
Data characterization towards modeling frequent pattern mining algorithms
Data characterization towards modeling frequent pattern mining algorithms
csandit
Privacy Preserving Data Mining Technique to Recover Association Rules Using H...
Privacy Preserving Data Mining Technique to Recover Association Rules Using H...
IJSRED
Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...
redpel dot com
An effective classification approach for big data with parallel generalized H...
An effective classification approach for big data with parallel generalized H...
riyaniaes
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming data
Alexander Decker
Performance Analysis of Hashing Mathods on the Employment of App
Performance Analysis of Hashing Mathods on the Employment of App
IJECEIAES
Improved Map reduce Framework using High Utility Transactional Databases
Improved Map reduce Framework using High Utility Transactional Databases
International Journal of Engineering Inventions www.ijeijournal.com
Framework for reversible data hiding using cost-effective encoding system for...
Framework for reversible data hiding using cost-effective encoding system for...
IJECEIAES
Berlin Hadoop Get Together Apache Drill
Berlin Hadoop Get Together Apache Drill
MapR Technologies
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
ijgca
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
IRJET Journal
IJET-V3I2P24
IJET-V3I2P24
IJET - International Journal of Engineering and Techniques
Role of Operational System Design in Data Warehouse Implementation: Identifyi...
Role of Operational System Design in Data Warehouse Implementation: Identifyi...
iosrjce
PRIVACY PRESERVING DATA MINING BASED ON VECTOR QUANTIZATION
PRIVACY PRESERVING DATA MINING BASED ON VECTOR QUANTIZATION
ijdms
Understanding Big Data Platform from Patents
Understanding Big Data Platform from Patents
Alex G. Lee, Ph.D. Esq. CLP
Scalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehouses
IRJET Journal
Performance evaluation of Map-reduce jar pig hive and spark with machine lear...
Performance evaluation of Map-reduce jar pig hive and spark with machine lear...
IJECEIAES
What's hot
(19)
An incremental mining algorithm for maintaining sequential patterns using pre...
An incremental mining algorithm for maintaining sequential patterns using pre...
Qo s aware scientific application scheduling algorithm in cloud environment
Qo s aware scientific application scheduling algorithm in cloud environment
Data characterization towards modeling frequent pattern mining algorithms
Data characterization towards modeling frequent pattern mining algorithms
Privacy Preserving Data Mining Technique to Recover Association Rules Using H...
Privacy Preserving Data Mining Technique to Recover Association Rules Using H...
Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...
An effective classification approach for big data with parallel generalized H...
An effective classification approach for big data with parallel generalized H...
A fuzzy clustering algorithm for high dimensional streaming data
A fuzzy clustering algorithm for high dimensional streaming data
Performance Analysis of Hashing Mathods on the Employment of App
Performance Analysis of Hashing Mathods on the Employment of App
Improved Map reduce Framework using High Utility Transactional Databases
Improved Map reduce Framework using High Utility Transactional Databases
Framework for reversible data hiding using cost-effective encoding system for...
Framework for reversible data hiding using cost-effective encoding system for...
Berlin Hadoop Get Together Apache Drill
Berlin Hadoop Get Together Apache Drill
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
IJET-V3I2P24
IJET-V3I2P24
Role of Operational System Design in Data Warehouse Implementation: Identifyi...
Role of Operational System Design in Data Warehouse Implementation: Identifyi...
PRIVACY PRESERVING DATA MINING BASED ON VECTOR QUANTIZATION
PRIVACY PRESERVING DATA MINING BASED ON VECTOR QUANTIZATION
Understanding Big Data Platform from Patents
Understanding Big Data Platform from Patents
Scalable scheduling of updates in streaming data warehouses
Scalable scheduling of updates in streaming data warehouses
Performance evaluation of Map-reduce jar pig hive and spark with machine lear...
Performance evaluation of Map-reduce jar pig hive and spark with machine lear...
Similar to Improving Association Rule Mining by Defining a Novel Data Structure
Association Rule Mining using RHadoop
Association Rule Mining using RHadoop
IRJET Journal
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET Journal
Mining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce Framework
IRJET Journal
IRJET- An Integrity Auditing &Data Dedupe withEffective Bandwidth in Cloud St...
IRJET- An Integrity Auditing &Data Dedupe withEffective Bandwidth in Cloud St...
IRJET Journal
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET Journal
Privacy Preserving Multi-keyword Top-K Search based on Cosine Similarity Clus...
Privacy Preserving Multi-keyword Top-K Search based on Cosine Similarity Clus...
IRJET Journal
Data Partitioning in Mongo DB with Cloud
Data Partitioning in Mongo DB with Cloud
IJAAS Team
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET Journal
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & Python
IRJET Journal
The Overview of Discovery and Reconciliation of LTE Network
The Overview of Discovery and Reconciliation of LTE Network
IRJET Journal
A Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association Mining
IOSR Journals
STOCK MARKET PREDICTION USING NEURAL NETWORKS
STOCK MARKET PREDICTION USING NEURAL NETWORKS
IRJET Journal
Distributed Feature Selection for Efficient Economic Big Data Analysis
Distributed Feature Selection for Efficient Economic Big Data Analysis
IRJET Journal
Entity resolution for hierarchical data using attributes value comparison ove...
Entity resolution for hierarchical data using attributes value comparison ove...
IAEME Publication
IRJET- Review of Existing Methods in K-Means Clustering Algorithm
IRJET- Review of Existing Methods in K-Means Clustering Algorithm
IRJET Journal
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
IRJET Journal
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET Journal
Survey of streaming data warehouse update scheduling
Survey of streaming data warehouse update scheduling
eSAT Journals
Predicting Stock Price Movements with Low Power Consumption LSTM
Predicting Stock Price Movements with Low Power Consumption LSTM
IRJET Journal
Efficient Privacy Preserving Clustering Based Multi Keyword Search
Efficient Privacy Preserving Clustering Based Multi Keyword Search
IRJET Journal
Similar to Improving Association Rule Mining by Defining a Novel Data Structure
(20)
Association Rule Mining using RHadoop
Association Rule Mining using RHadoop
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
Mining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce Framework
IRJET- An Integrity Auditing &Data Dedupe withEffective Bandwidth in Cloud St...
IRJET- An Integrity Auditing &Data Dedupe withEffective Bandwidth in Cloud St...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
Privacy Preserving Multi-keyword Top-K Search based on Cosine Similarity Clus...
Privacy Preserving Multi-keyword Top-K Search based on Cosine Similarity Clus...
Data Partitioning in Mongo DB with Cloud
Data Partitioning in Mongo DB with Cloud
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
IRJET - A Prognosis Approach for Stock Market Prediction based on Term Streak...
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & Python
The Overview of Discovery and Reconciliation of LTE Network
The Overview of Discovery and Reconciliation of LTE Network
A Quantified Approach for large Dataset Compression in Association Mining
A Quantified Approach for large Dataset Compression in Association Mining
STOCK MARKET PREDICTION USING NEURAL NETWORKS
STOCK MARKET PREDICTION USING NEURAL NETWORKS
Distributed Feature Selection for Efficient Economic Big Data Analysis
Distributed Feature Selection for Efficient Economic Big Data Analysis
Entity resolution for hierarchical data using attributes value comparison ove...
Entity resolution for hierarchical data using attributes value comparison ove...
IRJET- Review of Existing Methods in K-Means Clustering Algorithm
IRJET- Review of Existing Methods in K-Means Clustering Algorithm
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
IRJET- Analysis for EnhancedForecastof Expense Movement in Stock Exchange
Survey of streaming data warehouse update scheduling
Survey of streaming data warehouse update scheduling
Predicting Stock Price Movements with Low Power Consumption LSTM
Predicting Stock Price Movements with Low Power Consumption LSTM
Efficient Privacy Preserving Clustering Based Multi Keyword Search
Efficient Privacy Preserving Clustering Based Multi Keyword Search
More from IRJET Journal
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
IRJET Journal
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
IRJET Journal
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
IRJET Journal
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
IRJET Journal
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
IRJET Journal
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
IRJET Journal
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
IRJET Journal
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
IRJET Journal
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
IRJET Journal
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
IRJET Journal
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
IRJET Journal
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
IRJET Journal
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
IRJET Journal
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
IRJET Journal
React based fullstack edtech web application
React based fullstack edtech web application
IRJET Journal
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
IRJET Journal
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
IRJET Journal
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
IRJET Journal
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
IRJET Journal
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
IRJET Journal
More from IRJET Journal
(20)
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
React based fullstack edtech web application
React based fullstack edtech web application
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Recently uploaded
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
ranjana rawat
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur High Profile
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
ranjana rawat
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
Asutosh Ranjan
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
João Esperancinha
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
upamatechverse
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
misbanausheenparvam
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
hassan khalil
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
9953056974 Low Rate Call Girls In Saket, Delhi NCR
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
ranjana rawat
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
Tsuyoshi Horigome
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
ranjana rawat
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
Call Girls in Nagpur High Profile
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
SIVASHANKAR N
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
null - The Open Security Community
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
slot gacor bisa pakai pulsa
Extrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
120cr0395
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
rehmti665
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
9953056974 Low Rate Call Girls In Saket, Delhi NCR
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Dr.Costas Sachpazis
Recently uploaded
(20)
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
Extrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Improving Association Rule Mining by Defining a Novel Data Structure
1.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 04 Issue: 07 | July -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1988 Improving Association Rule Mining By Defining A Novel Data Structure Vinayak Suresh Shukla, Prof.Dr.Mrs.S.A.Itkar 1Student,PES’s Modern College Of Engineering,Pune 5. 2HOD,Computer Engineering Dept, PES’s Modern College Of Engineering, Pune 5. ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - In recent years, growth in digital data storagein rapidly increased due to ease of use and lower coast digital storage media. This data is high dimensional and heterogeneous in nature. The process of knowledge discovery is being affected due to high dimensional and heterogeneous data. This process can be abbreviated as association rule mining (ARM). Though, many association rule mining algorithms have been proposed in recent years to deal with large volume of data, theminingprocessunder-performswhen the data size is very large in terms of records. Hence the aim of this work is not to design a new algorithm for mining, butto design a new data structure to store data reliably .The original data is simplified, recognized and access time increased for that data, to meet up efficiency in terms of time and main memory requirements. Lower main memory requirements and faster data access are achieved by means of Shuffling, Inverted Index Mapping and Run Length Encoding. Hence the resulting data structure can be used along with the existing association rule miningalgorithmstospeedupmining and reducing main memory requirements, without changing original algorithms. This is further improvedbyreplacingRun Length Encoding by Modified Run Length EncodingAlgorithm for better memory utilization and efficiency of mining algorithms. Key Words: Association Rule Mining (ARM), Data Compression, Data Structure, Index Compression, Knowledge Discovery, Modified RLE,. 1.INTRODUCTION In early years, digitization of all technical and non-technical fields has leaded to the production of large amount of digital data every day[1]. Storage of this large data is efficient since the cost of storage on media is less thanearlystoragemedias . Hence the cost of storage has negligible association with amount and heterogeneity of this data. But it has tremendously affected the mining process. Because of large amount of data, mining has become interesting but time consuming. Association rule mining[2] is well known and researched field for mining rules form given data. Many algorithms were introduced in order to speed up data analysis. Different strategies were used like reduce either number of candidate sets, numberoftransactionsornumber of comparisons of both. But any mining process with extremely high dimensional data [1]can be a hard process. So instead of changing existing algorithms or finding new one to speed up mining, it is better to introduce a new data structure [3]to store large data in compressed manner. So here we implemented a new compressed data structure which will store the original data in compressed manner without losing original content. This new compressed data structure is obtained by applying three sequential techniques on the original data. viz Shuffling Inverted index mapping Run length encoding. Then it is further improved by Modified Run Length Encoding This new data structure will help in handling, speeding up access to data and fast computing when used with any of the existing mining algorithm. 2.REVIEW OF LITERATURE Association rule mining [2]is the most important technique used for mining the rules. This aims to extract strong and relative items within input data set. Basically it was designed for market basket analysis which would help shopkeepers arranging the sale items in order to grow the business. Apriori[4] is the first algorithm forassociation rulemining.It is based on comprehensive search. It works in two steps as I. finding all frequent item sets from data set and II. Deriving association rules from those frequents item sets. If data set has k single items, and N number of transactions, then [2^k- 1] item sets can be generated. And of the data set size is huge, the processing is hard, becausethecomplexityofabove operation will be O(N X M X k) just to obtain candidate sets. The frequentpatternFP-growth[5]algorithmwasintroduced to reduce the number of transactions and related comparisons. Data needs to be scanned only once since FP- growth stored frequent itemsinatreestructure.Butitsuffers with large number if I/O and large number of memory required storing all sets. To overcome the existing drawback, like large memory requirements and long computational times, many evolutionary association rule mining algorithms were proposed [6],[7],[8],[9]. But, even after considering mining rules from different views, working with high-dimensional data was hard. So as a solution, it is found that creating a new data structure in order to reduce the original data set size would be advantageous, and that can be used for speeding up the mining, with current algorithms without changing their original functioning.
2.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 04 Issue: 07 | July -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1989 The data structureintroducedinthepaperismeanttoreduce the data set size and produce faster data access. Despite this guarantees the data validity and doesn’t change the original data values. The conversion of data into new data structure comprises of three steps viz. 1. Shuffling, 2. Inverted Index Mapping and finally and 3. Data Compression. The Modified Run Length Encoding technique helps in[10] more compression of input data than the original Run Length Encoding algorithm, that eventually helps in improving performance of ARMs. 3.SYSTEM ARCHITECTURE / SYSTEM OVERVIEW As the ultimate goal of this work is to device a new data structure which will reduce the data size and speed up the operating. So as shown in figure the new data structure is obtained by performing three functions on original data. Vizshuffling, inverted index mapping and run length encoding. Each of these will work as follows. 1.Shuffling: Similarity between records is the characteristic of specific data set i.e. two or more records share similar feature values. This characteristic is used to sort records among data using Hamming Distance (HD). HD can be used as similarity metricinordertominimizenumber of changes in the feature values from record to record. Sorted data is compressed by merging features in subsequent records. Lower the HD between two records implies higher probability of clustering those into same group. Original Data Set is input for this function which is modified to sorted data after completing this function. 2.Inverted Index Mapping: This takes the sorted data as an input and aims to construct the efficient structure i.e. index based structure. Attribute valuesareindexed based on satisfied transaction and then indexed attribute values can be joined with consecutive transactions to share same values. This inverted index methodology assigns key-value pair to each list of attribute, and each attributehaspointer to transaction index satisfied by that attribute value. 3.RLE encoding: This step is responsible for data compression and faster data access. This takes inverted index structure as an input and groups consecutive indices for each attribute into a 2-tuple that defines a starting index and the total number of consecutive transactions containing the same feature value {index, displacement}. 4.Modified RLE: The modified run length encoding scheme gives a major improvement in compression ratiofor any kind of data. Analysing the input data is the first and core step. Data is analysed to highlight if there are any largest numbers of sequences that may increase thenumber of bits to represent the length of each run. In this method if input data contains nearest value with its adjacent data then both value are considered as same data. The resulting data structure is now compressed without losing values. 4.SYSTEM ANALYSIS 1.DatasetInformationHere,wehaveuseddifferentdatasets like Adult and Mushroom each having variant number of Instances and attribute. 2.Software and Hardware Requirements The above system is implemented using RAM of 1GB and above, Speed of 1.1 GHz. The processor used is intel - CORE i3 and above.A Hard Disk of 20 is used. Operating System Windows 7 with Java 1.8 is used. Development Environment considered here is Netbeans 8.0.2. 3.Performance Parameters 1.Data Size: After analyzing universal compressors like Zip we state that the proposed structure represents the data into a compressed structure according to the resulting file size. 2.LoadingTime:asthedatasizeisnowcompressed, loading time for compressed data will be less than that of loading the original data. 3.MemoryRequirements:AnalyzingtheAdultdata set of 465 KB, it only requires 53 KB of main memory by using the new data structure, hence processing the compressed data will ultimately require less memory space. Table -1: Data Compression Data Structure Adult Mushroom Original 465 372 RLE 72 68 MRLE 53 46
3.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 04 Issue: 07 | July -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1990 Table-2: Time Required to process Apriori Algorithm Dataset Original RLE MRLE Adult 2663 1204 964 Mushroom 2358 1146 893 Graph 1 Graph 2 4.Results: The data sets Adult and mushroom is compressed using shuffling, mapping and RLE and then Modified RLE compression techniques. Then the result are obtained after applying Apriori algorithm on compressed datasets. The observations and results are then plotted in graphinorderto compare size and time required to store and process respectively for both original and compressed data structures. From the graph and table it can be observed that, the compressed data set reduce the size required to store and time required to process. 5. CONCLUSIONS Association rule mining is being much interesting due to rapid growth of volume of data which. But the mining process is being time consuming for such data using traditional mining algorithms. Hence data compression technique can improve the performance of existing algorithms without changing their original schema.Thedata compression technique used above compresses the data to be used by mining algorithms and hence improves the performance in terms of data size, loading time/processing time and main memory requirement efficiently. The mining process can be further improved in above terms using MP- Graph for better utilization of memory space and efficiency when large data sets are used. ACKNOWLEDGEMENT Every orientation work has a mark of many people and it is the responsibility of author to convey an intense gratitude for the same. I feel colossal pleasure to express deep sense of gratitude and indebtedness to my guide Prof. (Ms) Dr. S. A. Itkar, for constant uplifting and virtuous guidance. I also express my sincere gratitude to the Department of Computer Engineering and Library of my college. Last but not the least; I am grateful tomyfriendsand my parents for their best wishes.. REFERENCES [1]V. Marx, “Biology: The big challenges of big data,” Nature, vol. 498, no. 7453, pp. 255–260, 2013. [2]R. Agrawal and R. Srikant, “Fast algorithms for mining association rules in large databases,” in Proc. 20th Int. Conf. Very Large Data Bases (VLDB), Santiago, Chile, 1994, pp. 487–499. [3]José María Luna, Alberto Cano, Mykola Pechenizkiy, “ Speeding-Up Association Rule Mining With Inverted Index Compression”, IEEE TRANSACTIONS ON CYBERNETICS, 2016. [4]R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, and A. I. Verkamo, “Fast discovery of association rules,” in Advances in Knowledge Discovery and Data Mining. Menlo Park, CA, USA: Amer. Assoc. Artif. Intell., 1996, pp. 307–328. [5]J. Han, J. Pei, Y. Yin, and R. Mao, “Mining frequent patterns without candidate generation: A frequent-pattern tree approach,” Data Min. Knowl. Disc., vol. 8, no. 1, pp. 53–87, 2004. [6]B. Goethals and M. J. Zaki, “Advances in frequent itemset mining implementations: Report on FIMI’03,” ACM SIGKDD Explor. Newslett., vol. 6, no. 1, pp. 109–117, 2004. [7]C. Lucchese, S. Orlando, and R. Perego, “DCI closed: A fast and memory efficient algorithm to mine frequent closed itemsets,” in Proc. IEEE ICDM Workshop Frequent Itemset Min. Implement. (FIMI), Brighton, U.K., 2004, pp. 20–28.
4.
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 04 Issue: 07 | July -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1991 [8]J. M. Luna, J. R. Romero, C. Romero, andS.Ventura,“On the use of genetic programming for mining comprehensible rules in subgroup discovery,” IEEE Trans. Cybern., vol. 44, no. 12, pp. 2329–2341, Dec. 2014. [9]C.Borgelt,“Efficient Impleme -ntations of Apriori and Eclat,” in Proc. 1st IEEE ICDM Workshop Frequent Item Set Min. Implement. (FIMI), Melbourne, FL, USA, 2003, pp. 90– 99. [10] S. Joseph, N. Srikanth, J. E. N. Abhilash, “A Novel Approach of Modified Run Length EncodingSchemeforHigh Speed Data Communication Application ,” International Journal of Science and Research Volume, 2 Issue 12, December 2013
Download now