SlideShare a Scribd company logo
1 of 20
BESTPEER++: A PEER-TO-PEER BASED
LARGE-SCALE DATA PROCESSING
PLATFORM
Submitted by,
PRABHUDEV R
4NI12IS416
1
AGENDA
1. Introduction.
2. Overview of the BestPeer++ system.
3. Bootstrap peer.
4. Normal peer.
5. Benchmarking.
6. Advantages.
7. Conclusion.
2
1. INTRODUCTION
 Corporate network shares information with participating
companies of a common interest.
 Companies reduce their operational cost and increase
the revenue.
 Delivers elastic data sharing services.
 Provides economical , flexible and scalable platform.
 Based on pay as you go business model. 3
2. OVERVIEW OF THE BESTPEER++
SYSTEM
BestPeer data management platform
 Adaptive join query processing
 distributed online aggregation techniques to provide efficient
query processing.
BestPeer++, cloud enabled evolution of BestPeer
 Distributed access control
 Multiple types of indexes
 Pay-as-you-go query processing for delivering elastic data
sharing services in the cloud.
 The software components of BestPeer++ are separated
into two parts:-
1. Core.
2. Adapter. 4
AMAZON CLOUD ADAPTER
 Elastic hardware infrastructure for BestPeer++ to
operate on by using Amazon Cloud services.
 Launching/terminating dedicated MySQL database
servers and monitoring/ backup/auto-scaling those
servers.
 Finally, the Amazon Cloud Adapter also provides
automatic fail-over service.
5
THE BESTPEER++ CORE
 Platform-independent
logic, including query
processing and P2P
overlay.
 Cloud adapter and
consists of two software
components:
1. Bootstrap peer.
2. Normal peer.
6
3. BOOTSTRAP PEER
 The bootstrap peer is run by the BestPeer++ service
provider, and its main functionality is to manage the
BestPeer+ + network
1. Managing Normal Peer Join/Departure.
2. Auto Fail-Over and Auto-Scaling.
7
MANAGING NORMAL PEER JOIN/DEPARTURE
 Each normal peer intends to join an existing
corporate network must first connect to the
bootstrap peer.
 The joined peer will receive the corporate network
information including the current participants, global
schema, role definitions, and an issued certificate.
 When a normal peer needs to leave the network, it
also notifies the bootstrap peer first.
8
AUTO FAIL-OVER AND AUTO-SCALING
 In addition to managing peer join and peer
departure.
 The bootstrap peer spends most of its running-time
on monitoring the health of normal peers.
 Scheduling fail-over and auto-scaling events.
9
4.NORMAL PEER
Offline data flow
The data are extracted
periodically by a data
loader from the business
production system to the
normal peer instance.
Online data flow
 The query processor
performs user queries
using a fetch and
process strategy.
10
SCHEMA MAPPING
 Defines the mapping between the local schema of
each production system and the global shared
schema.
 The mapping consists of metadata mappings and
value mappings and also support instance level
mapping.
11
DATA LOADER
 Extracts data from production systems to normal peer
instances according to the result of schema mapping.
 The data loader also creates a snapshot of the newly
inserted data.
 At interval times, re-extracts data from the production system
to create a new snapshot.
 This snapshot is then compared to the previously stored one
to detect data changes.
 Finally, the changes are used to update the MySQL database
hosted in the normal peer. 12
DATA INDEXER
 BATON
 The first range, R0, is the subdomain maintained by
the node.
 The second range, R1, is the domain of the sub tree
rooted at the node. 13
DISTRIBUTED ACCESS CONTROL
 The basic idea is to use roles as templates to
capture common data access privileges and allow
businesses to override these privileges to meet
their specific needs.
 The information of the users created at one peer is
forwarded to the bootstrap peer and then
broadcasted to other normal peers also.
 The local administrator at this peer can easily
define the role-based access control for any user.
14
PAY-AS-YOU-GO QUERY PROCESSING
 BestPeer++ provides two services for the
participants:
1. Storage service
2. Search service
 After data are exported from the local business
system into a BestPeer++ instance, we apply the
schema mapping rules to transform them into the
predefined formats.
15
5.BENCHMARKING
 This section evaluates the performance and throughput
of BestPeer++ on Amazon cloud platform.
1. For the performance benchmark, they compare the
query latency of BestPeer++ with HadoopDB using
five queries selected from typical corporate network
applications workloads.
2. For the throughput benchmark, they create a simple
supply-chain network consisting of suppliers and
retailers and study the query throughput of the
system.
16
6.ADVANTAGES OF BESTPEER++
1. Deliver near linear query throughput as the number of
normal peers grows.
2. BestPeer++ adopts the pay-as-you-go business model
popularized by cloud computing.
3. The role-based access control for the inherent distributed
environment of corporate networks.
4. P2P technology to retrieve data between business
partners.
5. Efficient data sharing within corporate networks. 17
7.CONCLUSION
 The benchmark conducted on Amazon EC2 cloud
platform shows that our system can efficiently
handle typical workloads in a corporate network
and can deliver near linear query throughput as the
number of normal peers grows.
 Therefore, BestPeer++ is a promising solution for
efficient data sharing within corporate networks.
18
REFERENCES
1. S. Wu, Q.H. Vu, J. Li, and K.-L. Tan, “Adaptive Multi-Join
Query Processing in PDBMS,” Proc. IEEE Int’l Conf. Data
Eng. (ICDE ’09), pp. 1239-1242, 2009.
2. D. Bermbach and S. Tai, “Eventual Consistency: How Soon
is Eventual? An Evaluation of Amazon s3’s Consistency
Behavior,” in Proc. 6th Workshop Middleware Serv. Oriented
Comput. (MW4SOC ’11), pp. 1:1-1:6, NY, USA, 2011.
3. B. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R.
Sears, “Benchmarking Cloud Serving Systems with YCSB,”
Proc. First ACM Symp. Cloud Computing, pp. 143-154,
2010.
4. Oracle Inc., “Achieving the Cloud Computing Vision,” White
Paper, 2010. 19
20

More Related Content

What's hot

Building a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with DatabricksBuilding a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with DatabricksDatabricks
 
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance Senturus
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachDataWorks Summit
 
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts EMC
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewDataWorks Summit/Hadoop Summit
 
Building and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache OozieBuilding and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache OozieDataWorks Summit/Hadoop Summit
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersDataWorks Summit
 

What's hot (8)

Building a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with DatabricksBuilding a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with Databricks
 
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
 
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
Building and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache OozieBuilding and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache Oozie
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against Disasters
 

Viewers also liked

Introdution to HTML
Introdution to HTMLIntrodution to HTML
Introdution to HTMLyashh1402
 
Alfabeto de nomes c
Alfabeto de nomes   cAlfabeto de nomes   c
Alfabeto de nomes cDário Reis
 
Alfabeto de nomes v
Alfabeto de nomes   vAlfabeto de nomes   v
Alfabeto de nomes vDário Reis
 
The Institutional Capital Model - macro economics
The Institutional Capital Model - macro economics  The Institutional Capital Model - macro economics
The Institutional Capital Model - macro economics Ayush Parekh
 
State of Retail & CRM - Time to Re-Imagine
State of Retail & CRM - Time to Re-ImagineState of Retail & CRM - Time to Re-Imagine
State of Retail & CRM - Time to Re-ImagineRobert Eastwood
 
University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...geraintduck
 
The effect of TAP pipeline for the Balkans, Turkey and Italian gas markets
The effect of TAP pipeline for the Balkans, Turkey and Italian gas marketsThe effect of TAP pipeline for the Balkans, Turkey and Italian gas markets
The effect of TAP pipeline for the Balkans, Turkey and Italian gas marketsARERA
 
La regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La regolazione per i Sistemi di Distribuzione Chiusi (SDC)La regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La regolazione per i Sistemi di Distribuzione Chiusi (SDC)ARERA
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useSwiss Big Data User Group
 
Estrategia de daytrading para mercados de alta volatilidad basada en gaps
Estrategia de daytrading para mercados de alta volatilidad basada en gapsEstrategia de daytrading para mercados de alta volatilidad basada en gaps
Estrategia de daytrading para mercados de alta volatilidad basada en gapsRaul Canessa
 
GPU power consumption and performance trends
GPU power consumption and performance trendsGPU power consumption and performance trends
GPU power consumption and performance trendsAlessio Villardita
 
CPU vs. GPU presentation
CPU vs. GPU presentationCPU vs. GPU presentation
CPU vs. GPU presentationVishal Singh
 
Cultural Times - The first global map of cultural and creative industries
Cultural Times - The first global map of cultural and creative industriesCultural Times - The first global map of cultural and creative industries
Cultural Times - The first global map of cultural and creative industriesEY
 
Real-Time Analytics with Apache Cassandra and Apache Spark
Real-Time Analytics with Apache Cassandra and Apache SparkReal-Time Analytics with Apache Cassandra and Apache Spark
Real-Time Analytics with Apache Cassandra and Apache SparkGuido Schmutz
 

Viewers also liked (18)

Introdution to HTML
Introdution to HTMLIntrodution to HTML
Introdution to HTML
 
Part 2
Part 2Part 2
Part 2
 
Alfabeto de nomes c
Alfabeto de nomes   cAlfabeto de nomes   c
Alfabeto de nomes c
 
Alfabeto de nomes v
Alfabeto de nomes   vAlfabeto de nomes   v
Alfabeto de nomes v
 
Example dr shriniwas kashalikar
Example dr shriniwas kashalikarExample dr shriniwas kashalikar
Example dr shriniwas kashalikar
 
The Institutional Capital Model - macro economics
The Institutional Capital Model - macro economics  The Institutional Capital Model - macro economics
The Institutional Capital Model - macro economics
 
State of Retail & CRM - Time to Re-Imagine
State of Retail & CRM - Time to Re-ImagineState of Retail & CRM - Time to Re-Imagine
State of Retail & CRM - Time to Re-Imagine
 
University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...
 
The effect of TAP pipeline for the Balkans, Turkey and Italian gas markets
The effect of TAP pipeline for the Balkans, Turkey and Italian gas marketsThe effect of TAP pipeline for the Balkans, Turkey and Italian gas markets
The effect of TAP pipeline for the Balkans, Turkey and Italian gas markets
 
La regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La regolazione per i Sistemi di Distribuzione Chiusi (SDC)La regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La regolazione per i Sistemi di Distribuzione Chiusi (SDC)
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to use
 
Estrategia de daytrading para mercados de alta volatilidad basada en gaps
Estrategia de daytrading para mercados de alta volatilidad basada en gapsEstrategia de daytrading para mercados de alta volatilidad basada en gaps
Estrategia de daytrading para mercados de alta volatilidad basada en gaps
 
Cluster computing
Cluster computingCluster computing
Cluster computing
 
GPU power consumption and performance trends
GPU power consumption and performance trendsGPU power consumption and performance trends
GPU power consumption and performance trends
 
Cluster computing
Cluster computingCluster computing
Cluster computing
 
CPU vs. GPU presentation
CPU vs. GPU presentationCPU vs. GPU presentation
CPU vs. GPU presentation
 
Cultural Times - The first global map of cultural and creative industries
Cultural Times - The first global map of cultural and creative industriesCultural Times - The first global map of cultural and creative industries
Cultural Times - The first global map of cultural and creative industries
 
Real-Time Analytics with Apache Cassandra and Apache Spark
Real-Time Analytics with Apache Cassandra and Apache SparkReal-Time Analytics with Apache Cassandra and Apache Spark
Real-Time Analytics with Apache Cassandra and Apache Spark
 

Similar to Best peer++

Bestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platformBestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platformPapitha Velumani
 
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing PlatformJPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platformchennaijp
 
best peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platformbest peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platformswathi78
 
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...IEEEFINALYEARSTUDENTPROJECTS
 
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...IEEEMEMTECHSTUDENTSPROJECTS
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataEMC
 
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for CloudIDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for CloudAngel Villar Garea
 
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...AboutYouGmbH
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesDataWorks Summit
 
Tuning database performance
Tuning database performanceTuning database performance
Tuning database performanceBinay Acharya
 
TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16Terence White
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Karl Roche
 
Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Denodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Denodo
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Precisely
 
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM Events
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionAppfluent Technology
 

Similar to Best peer++ (20)

Bestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platformBestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platform
 
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing PlatformJPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
 
About CDAP
About CDAPAbout CDAP
About CDAP
 
best peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platformbest peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platform
 
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
 
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big Data
 
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for CloudIDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
 
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data Lakes
 
Tuning database performance
Tuning database performanceTuning database performance
Tuning database performance
 
TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.
 
Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
IBM Cloud pak for data brochure
IBM Cloud pak for data   brochureIBM Cloud pak for data   brochure
IBM Cloud pak for data brochure
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?
 
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
 
MKHCV2
MKHCV2MKHCV2
MKHCV2
 

Recently uploaded

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 

Best peer++

  • 1. BESTPEER++: A PEER-TO-PEER BASED LARGE-SCALE DATA PROCESSING PLATFORM Submitted by, PRABHUDEV R 4NI12IS416 1
  • 2. AGENDA 1. Introduction. 2. Overview of the BestPeer++ system. 3. Bootstrap peer. 4. Normal peer. 5. Benchmarking. 6. Advantages. 7. Conclusion. 2
  • 3. 1. INTRODUCTION  Corporate network shares information with participating companies of a common interest.  Companies reduce their operational cost and increase the revenue.  Delivers elastic data sharing services.  Provides economical , flexible and scalable platform.  Based on pay as you go business model. 3
  • 4. 2. OVERVIEW OF THE BESTPEER++ SYSTEM BestPeer data management platform  Adaptive join query processing  distributed online aggregation techniques to provide efficient query processing. BestPeer++, cloud enabled evolution of BestPeer  Distributed access control  Multiple types of indexes  Pay-as-you-go query processing for delivering elastic data sharing services in the cloud.  The software components of BestPeer++ are separated into two parts:- 1. Core. 2. Adapter. 4
  • 5. AMAZON CLOUD ADAPTER  Elastic hardware infrastructure for BestPeer++ to operate on by using Amazon Cloud services.  Launching/terminating dedicated MySQL database servers and monitoring/ backup/auto-scaling those servers.  Finally, the Amazon Cloud Adapter also provides automatic fail-over service. 5
  • 6. THE BESTPEER++ CORE  Platform-independent logic, including query processing and P2P overlay.  Cloud adapter and consists of two software components: 1. Bootstrap peer. 2. Normal peer. 6
  • 7. 3. BOOTSTRAP PEER  The bootstrap peer is run by the BestPeer++ service provider, and its main functionality is to manage the BestPeer+ + network 1. Managing Normal Peer Join/Departure. 2. Auto Fail-Over and Auto-Scaling. 7
  • 8. MANAGING NORMAL PEER JOIN/DEPARTURE  Each normal peer intends to join an existing corporate network must first connect to the bootstrap peer.  The joined peer will receive the corporate network information including the current participants, global schema, role definitions, and an issued certificate.  When a normal peer needs to leave the network, it also notifies the bootstrap peer first. 8
  • 9. AUTO FAIL-OVER AND AUTO-SCALING  In addition to managing peer join and peer departure.  The bootstrap peer spends most of its running-time on monitoring the health of normal peers.  Scheduling fail-over and auto-scaling events. 9
  • 10. 4.NORMAL PEER Offline data flow The data are extracted periodically by a data loader from the business production system to the normal peer instance. Online data flow  The query processor performs user queries using a fetch and process strategy. 10
  • 11. SCHEMA MAPPING  Defines the mapping between the local schema of each production system and the global shared schema.  The mapping consists of metadata mappings and value mappings and also support instance level mapping. 11
  • 12. DATA LOADER  Extracts data from production systems to normal peer instances according to the result of schema mapping.  The data loader also creates a snapshot of the newly inserted data.  At interval times, re-extracts data from the production system to create a new snapshot.  This snapshot is then compared to the previously stored one to detect data changes.  Finally, the changes are used to update the MySQL database hosted in the normal peer. 12
  • 13. DATA INDEXER  BATON  The first range, R0, is the subdomain maintained by the node.  The second range, R1, is the domain of the sub tree rooted at the node. 13
  • 14. DISTRIBUTED ACCESS CONTROL  The basic idea is to use roles as templates to capture common data access privileges and allow businesses to override these privileges to meet their specific needs.  The information of the users created at one peer is forwarded to the bootstrap peer and then broadcasted to other normal peers also.  The local administrator at this peer can easily define the role-based access control for any user. 14
  • 15. PAY-AS-YOU-GO QUERY PROCESSING  BestPeer++ provides two services for the participants: 1. Storage service 2. Search service  After data are exported from the local business system into a BestPeer++ instance, we apply the schema mapping rules to transform them into the predefined formats. 15
  • 16. 5.BENCHMARKING  This section evaluates the performance and throughput of BestPeer++ on Amazon cloud platform. 1. For the performance benchmark, they compare the query latency of BestPeer++ with HadoopDB using five queries selected from typical corporate network applications workloads. 2. For the throughput benchmark, they create a simple supply-chain network consisting of suppliers and retailers and study the query throughput of the system. 16
  • 17. 6.ADVANTAGES OF BESTPEER++ 1. Deliver near linear query throughput as the number of normal peers grows. 2. BestPeer++ adopts the pay-as-you-go business model popularized by cloud computing. 3. The role-based access control for the inherent distributed environment of corporate networks. 4. P2P technology to retrieve data between business partners. 5. Efficient data sharing within corporate networks. 17
  • 18. 7.CONCLUSION  The benchmark conducted on Amazon EC2 cloud platform shows that our system can efficiently handle typical workloads in a corporate network and can deliver near linear query throughput as the number of normal peers grows.  Therefore, BestPeer++ is a promising solution for efficient data sharing within corporate networks. 18
  • 19. REFERENCES 1. S. Wu, Q.H. Vu, J. Li, and K.-L. Tan, “Adaptive Multi-Join Query Processing in PDBMS,” Proc. IEEE Int’l Conf. Data Eng. (ICDE ’09), pp. 1239-1242, 2009. 2. D. Bermbach and S. Tai, “Eventual Consistency: How Soon is Eventual? An Evaluation of Amazon s3’s Consistency Behavior,” in Proc. 6th Workshop Middleware Serv. Oriented Comput. (MW4SOC ’11), pp. 1:1-1:6, NY, USA, 2011. 3. B. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears, “Benchmarking Cloud Serving Systems with YCSB,” Proc. First ACM Symp. Cloud Computing, pp. 143-154, 2010. 4. Oracle Inc., “Achieving the Cloud Computing Vision,” White Paper, 2010. 19
  • 20. 20