SlideShare a Scribd company logo
1 of 4
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
Fuzzy-folded Bloom Filter-as-a-Service for Big Data Storage in the Cloud
Abstract:
With the ongoing trend of smart and Internet-connected objects being deployed across a
broad range of applications, there is also a corresponding increase in the amount of data
movement across different geographical regions. This, in turn, poses a number of challenges
with respect to big data storage across multiple locations, including cloud computing
platform. For example, the underlying distributed file system has a large number of
directories and files in the form of gigantic trees, which are difficult to parse in polynomial
time. Moreover, with the exponential increase of (big) data streams (i.e. unbounded sets of
continuous data flows), challenges associated with indexing and membership queries are
compounded. The capability to process such significant amount of data with high accuracy
can have significant impact on decision-making and formulation of business and risk-related
strategies, particularly in our current Industrial Internet of Things environment (IIoT).
However, existing storage solutions are deterministic in nature. In other words, they tend to
consume considerable memory and CPU time to yield accurate results. This necessitates the
design of efficient quality of service (QoS)-aware IIoT applications that are able to deal with
the challenges of data storage and retrieval in the cloud computing environment. In this
paper, we present an effective space-effective strategy for massive data storage using bloom
filter (BF). Specifically, in the proposed scheme, the standard BF is extended to incorporate
fuzzy-enabled folding approach, hereafter referred to as Fuzzy Folded BF (FFBF). In FFBF,
fuzzy operations are used to accommodate the hashed data of one BF into another to reduce
storage requirements. Evaluations on UCI ML AReM and Facebook datasets demonstrate the
efficacy of FFBF, in terms of dealing with approximately 1.9 times more data as compared to
using the standard BF. This is also achieved without affecting the false positive rate and
query time.
Existing System:
BFs is that query complexity increases as the size grows. Initial size of filter is an important
factor in dynamic BFs as the small initial sized array may lead to computational overhead,
slice addition and query complexity overhead. On the other hand, a larger initial dynamic BF
size may result in memory wastage. Further, streaming applications, such as-approximate
cache, duplicate detection, and membership query, require one-pass processing of data. In
such applications, results are required within a stipulated time-bound. Thus, to serve this
purpose, BF size should be small and constant to be optimally mapped with cache. In order to
accommodate new data, some data needs to be deleted from the BF. Thus, staling of data is
required to manage the trade-off between false positives and false negatives [21].
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
Proposed System:
We propose a novel technique of compression of two BFs into one filter without losing any
data. The proposed approach uses fuzzy logic to store data optimally and efficiently use the
storage capacity: Compression of two BFs into one BF using fuzzy fold operation, wherein
large number of elements are accommodated in a single BF of size m. Slow decay of data
which allows streaming data to reside in memory for substantial amount of time. Efficient
and optimal utilization of storage space without any loss of accuracy. Significant reduction
in computational cost by leveraging double hashing to compute the k hash functions. False
positives in the proposed FFBF are not affected by the use of compression operation.
CONCLUSION:
IIoT is likely to be increasingly the norm in our society, particularly in our critical
infrastructure sectors such as the Chemical Sector, the Commercial Facilities Sector, the
Communications Sector, the Critical Manufacturing Sector, the Dams Sector, the Defense
Industrial Base Sector, the Emergency Services Sector, the Energy Sector, the Food and
Agriculture Sector, the Government Facilities Sector, and so on. IIoT also has applications in
a conflict and adversarial environment such as Industrial Internet of Military Things. Hence,
there is a pressing need to address some of the existing challenges, including the challenge
we were seeking to address in this paper. Specifically in this paper, our proposed filter uses a
novel fuzzy based technique to resolve the space requirement problem in BF. We
demonstrated that the proposed approach can accommodate a higher number of elements in
the same space, as compared to SBF. The cost of folding and operations associated with it is
almost negligible because the proposed filter only contains simple fuzzy operation on binary
sets. The false positive rate in compressed, and representation remains the same as that of the
standard BF. The computational time in hashing is also significantly reduced due to the use of
double hashing technique, since it uses only two hash functions to generate k hash functions.
The query complexity of FFBF is dependent on the number of blocks in which BF is divided.
Searching an element from a m sized BF and same sized compressed representation remains
unchanged (i.e., O(k)). Findings from our evaluations using both UCI ML AReM and
Facebook datasets also demonstrated the efficiency of FFBF.
REFERENCES
[1] A. Rajaraman and J. D. Ullman, Mining of Massive Datasets. New York, NY, USA:
Cambridge University Press, 2011.
[2] S. Al-Rubaye, E. Kadhum, Q. Ni, and A. Anpalagan, “Industrial Internet of Things
Driven by SDN Platform for Smart Grid Resiliency,” IEEE Internet of Things Journal, 2017.
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
[3] S. Mumtaz, A. Alsohaily, Z. Pang, A. Rayes, K. F. Tsang, and J. Rodriguez, “Massive
Internet of Things for Industrial Applications: Addressing Wireless IIoT Connectivity
Challenges and Ecosystem Fragmentation,” IEEE Industrial Electronics Magazine, vol. 11,
no. 1, pp. 28–33, 2017.
[4] L. Jiang, L. D. Xu, H. Cai, Z. Jiang, F. Bu, and B. Xu, “An IoTOriented Data Storage
Framework in Cloud Computing Platform,” IEEE Transactions on Industrial Informatics, vol.
10, no. 2, pp. 1443–1451, May 2014.
[5] F. Tao, J. Cheng, and Q. Qi, “IIHub: an Industrial Internetof-Things Hub Towards Smart
Manufacturing Based on CyberPhysical System,” IEEE Transactions on Industrial
Informatics, 2017.
[6] A. R. Sfar, E. Natalizio, Y. Challal, and Z. Chtourou, “A roadmap for security challenges
in the internet of things,” Digital Communications and Networks, 2017.
[7] “Gartner says a thirty-fold increase in internet-connected physical devices by 2020 will
significantly alter how the supply chain operates,” Gartner, Mar. 2014, [Accessed on: Oct
2017]. [Online]. Available: {http://www.gartner.com/newsroom/id/2688717}
[8] A. Velosa, “Internet of things — architecture remains a core opportunity and challenge: A
gartner trend insight report,” Gartner, vol. G00317007, 2017.
[9] “Big data and cloud computing-challenges and opportunities,” Big Data Made Simple,
Jun. 2017, [Accessed on: Mar. 2018]. [Online]. Available: http://bigdata-madesimple.com/
big-data-and-cloud-computing-challenges-and-opportunities/
[10] X. Liu, R. Deng, K.-K. R. Choo, Y. Yang, and H. Pang, “Privacypreserving outsourced
calculation toolkit in the cloud,” IEEE Transactions on Dependable and Secure Computing,
2018.
[11] S. Kaisler, F. Armour, J. A. Espinosa, and W. Money, “Big data: issues and challenges
moving forward,” in System Sciences (HICSS), 2013 46th Hawaii International Conference
on. IEEE, 2013, pp. 995– 1004.
[12] A. Broder and M. Mitzenmacher, “Network applications of bloom filters: A survey,”
Internet mathematics, vol. 1, no. 4, pp. 485–509, 2004.
[13] S. Tarkoma, C. E. Rothenberg, and E. Lagerspetz, “Theory and Practice of Bloom Filters
for Distributed Systems,” IEEE Communications Surveys Tutorials, vol. 14, no. 1, pp. 131–
155, First 2012.
CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249)
MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com
Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com
[14] “What are the best applications of bloom filters?” https://www.quora.com/What-are-the-
best-applications-ofBloom-filters, [Online].

More Related Content

What's hot

Cloud Computing Role in Information technology
Cloud Computing  Role in Information technologyCloud Computing  Role in Information technology
Cloud Computing Role in Information technologyKHakash
 
Big data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsBig data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsIJERA Editor
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor networkparry prabhu
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Onyebuchi nosiri
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Onyebuchi nosiri
 
A Data Decomposition Method for Stepwise Migration of Complex Legacy Data
A Data Decomposition Method for Stepwise Migration of Complex Legacy DataA Data Decomposition Method for Stepwise Migration of Complex Legacy Data
A Data Decomposition Method for Stepwise Migration of Complex Legacy DataAndy Martens
 
Top 10 Read articles in Web & semantic technology
 Top  10 Read articles in Web & semantic technology Top  10 Read articles in Web & semantic technology
Top 10 Read articles in Web & semantic technologydannyijwest
 
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...AnthonyOtuonye
 

What's hot (10)

Cloud Computing Role in Information technology
Cloud Computing  Role in Information technologyCloud Computing  Role in Information technology
Cloud Computing Role in Information technology
 
Big data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsBig data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing Platforms
 
Integrated Information System for Construction Operations
Integrated Information System for Construction OperationsIntegrated Information System for Construction Operations
Integrated Information System for Construction Operations
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor network
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
 
A Data Decomposition Method for Stepwise Migration of Complex Legacy Data
A Data Decomposition Method for Stepwise Migration of Complex Legacy DataA Data Decomposition Method for Stepwise Migration of Complex Legacy Data
A Data Decomposition Method for Stepwise Migration of Complex Legacy Data
 
Top 10 Read articles in Web & semantic technology
 Top  10 Read articles in Web & semantic technology Top  10 Read articles in Web & semantic technology
Top 10 Read articles in Web & semantic technology
 
Token
TokenToken
Token
 
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
Overcomming Big Data Mining Challenges for Revolutionary Breakthroughs in Com...
 

Similar to Fuzzy folded bloom filter-as-a-service for big data storage in the cloud

Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docxRunning Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docxglendar3
 
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docxRunning Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docxtodd581
 
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...IJCSIS Research Publications
 
EDGE COMPUTING: VISION AND CHALLENGES
EDGE COMPUTING: VISION AND CHALLENGESEDGE COMPUTING: VISION AND CHALLENGES
EDGE COMPUTING: VISION AND CHALLENGESsuthi
 
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...IRJET Journal
 
Efficient Image Compression Technique using Clustering and Random Permutation
Efficient Image Compression Technique using Clustering and Random PermutationEfficient Image Compression Technique using Clustering and Random Permutation
Efficient Image Compression Technique using Clustering and Random PermutationIJERA Editor
 
Efficient Image Compression Technique using Clustering and Random Permutation
Efficient Image Compression Technique using Clustering and Random PermutationEfficient Image Compression Technique using Clustering and Random Permutation
Efficient Image Compression Technique using Clustering and Random PermutationIJERA Editor
 
Big Data in Bioinformatics & the Era of Cloud Computing
Big Data in Bioinformatics & the Era of Cloud ComputingBig Data in Bioinformatics & the Era of Cloud Computing
Big Data in Bioinformatics & the Era of Cloud ComputingIOSR Journals
 
A Special Report on Infrastructure Futures: Keeping Pace in the Era of Big Da...
A Special Report on Infrastructure Futures: Keeping Pace in the Era of Big Da...A Special Report on Infrastructure Futures: Keeping Pace in the Era of Big Da...
A Special Report on Infrastructure Futures: Keeping Pace in the Era of Big Da...IBM India Smarter Computing
 
Big Data and Next Generation Network Challenges - Phdassistance
Big Data and Next Generation Network Challenges - PhdassistanceBig Data and Next Generation Network Challenges - Phdassistance
Big Data and Next Generation Network Challenges - PhdassistancePhD Assistance
 
Service Level Comparison for Online Shopping using Data Mining
Service Level Comparison for Online Shopping using Data MiningService Level Comparison for Online Shopping using Data Mining
Service Level Comparison for Online Shopping using Data MiningIIRindia
 
Virtual Machine Allocation Policy in Cloud Computing Environment using CloudSim
Virtual Machine Allocation Policy in Cloud Computing Environment using CloudSim Virtual Machine Allocation Policy in Cloud Computing Environment using CloudSim
Virtual Machine Allocation Policy in Cloud Computing Environment using CloudSim IJECEIAES
 
An efficient approach on spatial big data related to wireless networks and it...
An efficient approach on spatial big data related to wireless networks and it...An efficient approach on spatial big data related to wireless networks and it...
An efficient approach on spatial big data related to wireless networks and it...eSAT Journals
 
Campus realities: forecasting user bandwidth utilization using Monte Carlo si...
Campus realities: forecasting user bandwidth utilization using Monte Carlo si...Campus realities: forecasting user bandwidth utilization using Monte Carlo si...
Campus realities: forecasting user bandwidth utilization using Monte Carlo si...IJECEIAES
 
Fog Computing: A Platform for Internet of Things and Analytics
Fog Computing: A Platform for Internet of Things and AnalyticsFog Computing: A Platform for Internet of Things and Analytics
Fog Computing: A Platform for Internet of Things and AnalyticsHarshitParkar6677
 
Big Data and Internet of Things: A Roadmap For Smart Environments, Fog Comput...
Big Data and Internet of Things: A Roadmap For Smart Environments, Fog Comput...Big Data and Internet of Things: A Roadmap For Smart Environments, Fog Comput...
Big Data and Internet of Things: A Roadmap For Smart Environments, Fog Comput...Jiang Zhu
 
IRJET- Deduplication of Encrypted Bigdata on Cloud
IRJET- Deduplication of Encrypted Bigdata on CloudIRJET- Deduplication of Encrypted Bigdata on Cloud
IRJET- Deduplication of Encrypted Bigdata on CloudIRJET Journal
 
Efficient Cost Minimization for Big Data Processing
Efficient Cost Minimization for Big Data ProcessingEfficient Cost Minimization for Big Data Processing
Efficient Cost Minimization for Big Data ProcessingIRJET Journal
 

Similar to Fuzzy folded bloom filter-as-a-service for big data storage in the cloud (20)

Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docxRunning Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
 
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docxRunning Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
Running Head MOBILE COMPUTING INTEGRATION ON IT INFRASTRUCTURE1.docx
 
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
An Efficient and Fault Tolerant Data Replica Placement Technique for Cloud ba...
 
EDGE COMPUTING: VISION AND CHALLENGES
EDGE COMPUTING: VISION AND CHALLENGESEDGE COMPUTING: VISION AND CHALLENGES
EDGE COMPUTING: VISION AND CHALLENGES
 
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...
IRJET - A Research on Eloquent Salvation and Productive Outsourcing of Massiv...
 
Efficient Image Compression Technique using Clustering and Random Permutation
Efficient Image Compression Technique using Clustering and Random PermutationEfficient Image Compression Technique using Clustering and Random Permutation
Efficient Image Compression Technique using Clustering and Random Permutation
 
Efficient Image Compression Technique using Clustering and Random Permutation
Efficient Image Compression Technique using Clustering and Random PermutationEfficient Image Compression Technique using Clustering and Random Permutation
Efficient Image Compression Technique using Clustering and Random Permutation
 
Big Data in Bioinformatics & the Era of Cloud Computing
Big Data in Bioinformatics & the Era of Cloud ComputingBig Data in Bioinformatics & the Era of Cloud Computing
Big Data in Bioinformatics & the Era of Cloud Computing
 
A Special Report on Infrastructure Futures: Keeping Pace in the Era of Big Da...
A Special Report on Infrastructure Futures: Keeping Pace in the Era of Big Da...A Special Report on Infrastructure Futures: Keeping Pace in the Era of Big Da...
A Special Report on Infrastructure Futures: Keeping Pace in the Era of Big Da...
 
TOWARDS A MACHINE LEARNING BASED ARTIFICIALLY INTELLIGENT SYSTEM FOR ENERGY E...
TOWARDS A MACHINE LEARNING BASED ARTIFICIALLY INTELLIGENT SYSTEM FOR ENERGY E...TOWARDS A MACHINE LEARNING BASED ARTIFICIALLY INTELLIGENT SYSTEM FOR ENERGY E...
TOWARDS A MACHINE LEARNING BASED ARTIFICIALLY INTELLIGENT SYSTEM FOR ENERGY E...
 
TOWARDS A MACHINE LEARNING BASED ARTIFICIALLY INTELLIGENT SYSTEM FOR ENERGY E...
TOWARDS A MACHINE LEARNING BASED ARTIFICIALLY INTELLIGENT SYSTEM FOR ENERGY E...TOWARDS A MACHINE LEARNING BASED ARTIFICIALLY INTELLIGENT SYSTEM FOR ENERGY E...
TOWARDS A MACHINE LEARNING BASED ARTIFICIALLY INTELLIGENT SYSTEM FOR ENERGY E...
 
Big Data and Next Generation Network Challenges - Phdassistance
Big Data and Next Generation Network Challenges - PhdassistanceBig Data and Next Generation Network Challenges - Phdassistance
Big Data and Next Generation Network Challenges - Phdassistance
 
Service Level Comparison for Online Shopping using Data Mining
Service Level Comparison for Online Shopping using Data MiningService Level Comparison for Online Shopping using Data Mining
Service Level Comparison for Online Shopping using Data Mining
 
Virtual Machine Allocation Policy in Cloud Computing Environment using CloudSim
Virtual Machine Allocation Policy in Cloud Computing Environment using CloudSim Virtual Machine Allocation Policy in Cloud Computing Environment using CloudSim
Virtual Machine Allocation Policy in Cloud Computing Environment using CloudSim
 
An efficient approach on spatial big data related to wireless networks and it...
An efficient approach on spatial big data related to wireless networks and it...An efficient approach on spatial big data related to wireless networks and it...
An efficient approach on spatial big data related to wireless networks and it...
 
Campus realities: forecasting user bandwidth utilization using Monte Carlo si...
Campus realities: forecasting user bandwidth utilization using Monte Carlo si...Campus realities: forecasting user bandwidth utilization using Monte Carlo si...
Campus realities: forecasting user bandwidth utilization using Monte Carlo si...
 
Fog Computing: A Platform for Internet of Things and Analytics
Fog Computing: A Platform for Internet of Things and AnalyticsFog Computing: A Platform for Internet of Things and Analytics
Fog Computing: A Platform for Internet of Things and Analytics
 
Big Data and Internet of Things: A Roadmap For Smart Environments, Fog Comput...
Big Data and Internet of Things: A Roadmap For Smart Environments, Fog Comput...Big Data and Internet of Things: A Roadmap For Smart Environments, Fog Comput...
Big Data and Internet of Things: A Roadmap For Smart Environments, Fog Comput...
 
IRJET- Deduplication of Encrypted Bigdata on Cloud
IRJET- Deduplication of Encrypted Bigdata on CloudIRJET- Deduplication of Encrypted Bigdata on Cloud
IRJET- Deduplication of Encrypted Bigdata on Cloud
 
Efficient Cost Minimization for Big Data Processing
Efficient Cost Minimization for Big Data ProcessingEfficient Cost Minimization for Big Data Processing
Efficient Cost Minimization for Big Data Processing
 

Recently uploaded

psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIShubhangi Sonawane
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 

Recently uploaded (20)

psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 

Fuzzy folded bloom filter-as-a-service for big data storage in the cloud

  • 1. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com Fuzzy-folded Bloom Filter-as-a-Service for Big Data Storage in the Cloud Abstract: With the ongoing trend of smart and Internet-connected objects being deployed across a broad range of applications, there is also a corresponding increase in the amount of data movement across different geographical regions. This, in turn, poses a number of challenges with respect to big data storage across multiple locations, including cloud computing platform. For example, the underlying distributed file system has a large number of directories and files in the form of gigantic trees, which are difficult to parse in polynomial time. Moreover, with the exponential increase of (big) data streams (i.e. unbounded sets of continuous data flows), challenges associated with indexing and membership queries are compounded. The capability to process such significant amount of data with high accuracy can have significant impact on decision-making and formulation of business and risk-related strategies, particularly in our current Industrial Internet of Things environment (IIoT). However, existing storage solutions are deterministic in nature. In other words, they tend to consume considerable memory and CPU time to yield accurate results. This necessitates the design of efficient quality of service (QoS)-aware IIoT applications that are able to deal with the challenges of data storage and retrieval in the cloud computing environment. In this paper, we present an effective space-effective strategy for massive data storage using bloom filter (BF). Specifically, in the proposed scheme, the standard BF is extended to incorporate fuzzy-enabled folding approach, hereafter referred to as Fuzzy Folded BF (FFBF). In FFBF, fuzzy operations are used to accommodate the hashed data of one BF into another to reduce storage requirements. Evaluations on UCI ML AReM and Facebook datasets demonstrate the efficacy of FFBF, in terms of dealing with approximately 1.9 times more data as compared to using the standard BF. This is also achieved without affecting the false positive rate and query time. Existing System: BFs is that query complexity increases as the size grows. Initial size of filter is an important factor in dynamic BFs as the small initial sized array may lead to computational overhead, slice addition and query complexity overhead. On the other hand, a larger initial dynamic BF size may result in memory wastage. Further, streaming applications, such as-approximate cache, duplicate detection, and membership query, require one-pass processing of data. In such applications, results are required within a stipulated time-bound. Thus, to serve this purpose, BF size should be small and constant to be optimally mapped with cache. In order to accommodate new data, some data needs to be deleted from the BF. Thus, staling of data is required to manage the trade-off between false positives and false negatives [21].
  • 2. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com Proposed System: We propose a novel technique of compression of two BFs into one filter without losing any data. The proposed approach uses fuzzy logic to store data optimally and efficiently use the storage capacity: Compression of two BFs into one BF using fuzzy fold operation, wherein large number of elements are accommodated in a single BF of size m. Slow decay of data which allows streaming data to reside in memory for substantial amount of time. Efficient and optimal utilization of storage space without any loss of accuracy. Significant reduction in computational cost by leveraging double hashing to compute the k hash functions. False positives in the proposed FFBF are not affected by the use of compression operation. CONCLUSION: IIoT is likely to be increasingly the norm in our society, particularly in our critical infrastructure sectors such as the Chemical Sector, the Commercial Facilities Sector, the Communications Sector, the Critical Manufacturing Sector, the Dams Sector, the Defense Industrial Base Sector, the Emergency Services Sector, the Energy Sector, the Food and Agriculture Sector, the Government Facilities Sector, and so on. IIoT also has applications in a conflict and adversarial environment such as Industrial Internet of Military Things. Hence, there is a pressing need to address some of the existing challenges, including the challenge we were seeking to address in this paper. Specifically in this paper, our proposed filter uses a novel fuzzy based technique to resolve the space requirement problem in BF. We demonstrated that the proposed approach can accommodate a higher number of elements in the same space, as compared to SBF. The cost of folding and operations associated with it is almost negligible because the proposed filter only contains simple fuzzy operation on binary sets. The false positive rate in compressed, and representation remains the same as that of the standard BF. The computational time in hashing is also significantly reduced due to the use of double hashing technique, since it uses only two hash functions to generate k hash functions. The query complexity of FFBF is dependent on the number of blocks in which BF is divided. Searching an element from a m sized BF and same sized compressed representation remains unchanged (i.e., O(k)). Findings from our evaluations using both UCI ML AReM and Facebook datasets also demonstrated the efficiency of FFBF. REFERENCES [1] A. Rajaraman and J. D. Ullman, Mining of Massive Datasets. New York, NY, USA: Cambridge University Press, 2011. [2] S. Al-Rubaye, E. Kadhum, Q. Ni, and A. Anpalagan, “Industrial Internet of Things Driven by SDN Platform for Smart Grid Resiliency,” IEEE Internet of Things Journal, 2017.
  • 3. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com [3] S. Mumtaz, A. Alsohaily, Z. Pang, A. Rayes, K. F. Tsang, and J. Rodriguez, “Massive Internet of Things for Industrial Applications: Addressing Wireless IIoT Connectivity Challenges and Ecosystem Fragmentation,” IEEE Industrial Electronics Magazine, vol. 11, no. 1, pp. 28–33, 2017. [4] L. Jiang, L. D. Xu, H. Cai, Z. Jiang, F. Bu, and B. Xu, “An IoTOriented Data Storage Framework in Cloud Computing Platform,” IEEE Transactions on Industrial Informatics, vol. 10, no. 2, pp. 1443–1451, May 2014. [5] F. Tao, J. Cheng, and Q. Qi, “IIHub: an Industrial Internetof-Things Hub Towards Smart Manufacturing Based on CyberPhysical System,” IEEE Transactions on Industrial Informatics, 2017. [6] A. R. Sfar, E. Natalizio, Y. Challal, and Z. Chtourou, “A roadmap for security challenges in the internet of things,” Digital Communications and Networks, 2017. [7] “Gartner says a thirty-fold increase in internet-connected physical devices by 2020 will significantly alter how the supply chain operates,” Gartner, Mar. 2014, [Accessed on: Oct 2017]. [Online]. Available: {http://www.gartner.com/newsroom/id/2688717} [8] A. Velosa, “Internet of things — architecture remains a core opportunity and challenge: A gartner trend insight report,” Gartner, vol. G00317007, 2017. [9] “Big data and cloud computing-challenges and opportunities,” Big Data Made Simple, Jun. 2017, [Accessed on: Mar. 2018]. [Online]. Available: http://bigdata-madesimple.com/ big-data-and-cloud-computing-challenges-and-opportunities/ [10] X. Liu, R. Deng, K.-K. R. Choo, Y. Yang, and H. Pang, “Privacypreserving outsourced calculation toolkit in the cloud,” IEEE Transactions on Dependable and Secure Computing, 2018. [11] S. Kaisler, F. Armour, J. A. Espinosa, and W. Money, “Big data: issues and challenges moving forward,” in System Sciences (HICSS), 2013 46th Hawaii International Conference on. IEEE, 2013, pp. 995– 1004. [12] A. Broder and M. Mitzenmacher, “Network applications of bloom filters: A survey,” Internet mathematics, vol. 1, no. 4, pp. 485–509, 2004. [13] S. Tarkoma, C. E. Rothenberg, and E. Lagerspetz, “Theory and Practice of Bloom Filters for Distributed Systems,” IEEE Communications Surveys Tutorials, vol. 14, no. 1, pp. 131– 155, First 2012.
  • 4. CONTACT: PRAVEEN KUMAR. L (, +91 – 9791938249) MAIL ID: sunsid1989@gmail.com, praveen@nexgenproject.com Web: www.nexgenproject.com, www.finalyear-ieeeprojects.com [14] “What are the best applications of bloom filters?” https://www.quora.com/What-are-the- best-applications-ofBloom-filters, [Online].