SlideShare a Scribd company logo
BESTPEER++: A PEER-TO-PEER BASED
LARGE-SCALE DATA PROCESSING
PLATFORM
Submitted by,
PRABHUDEV R
4NI12IS416
1
AGENDA
1. Introduction.
2. Overview of the BestPeer++ system.
3. Bootstrap peer.
4. Normal peer.
5. Benchmarking.
6. Advantages.
7. Conclusion.
2
1. INTRODUCTION
 Corporate network shares information with participating
companies of a common interest.
 Companies reduce their operational cost and increase
the revenue.
 Delivers elastic data sharing services.
 Provides economical , flexible and scalable platform.
 Based on pay as you go business model. 3
2. OVERVIEW OF THE BESTPEER++
SYSTEM
BestPeer data management platform
 Adaptive join query processing
 distributed online aggregation techniques to provide efficient
query processing.
BestPeer++, cloud enabled evolution of BestPeer
 Distributed access control
 Multiple types of indexes
 Pay-as-you-go query processing for delivering elastic data
sharing services in the cloud.
 The software components of BestPeer++ are separated
into two parts:-
1. Core.
2. Adapter. 4
AMAZON CLOUD ADAPTER
 Elastic hardware infrastructure for BestPeer++ to
operate on by using Amazon Cloud services.
 Launching/terminating dedicated MySQL database
servers and monitoring/ backup/auto-scaling those
servers.
 Finally, the Amazon Cloud Adapter also provides
automatic fail-over service.
5
THE BESTPEER++ CORE
 Platform-independent
logic, including query
processing and P2P
overlay.
 Cloud adapter and
consists of two software
components:
1. Bootstrap peer.
2. Normal peer.
6
3. BOOTSTRAP PEER
 The bootstrap peer is run by the BestPeer++ service
provider, and its main functionality is to manage the
BestPeer+ + network
1. Managing Normal Peer Join/Departure.
2. Auto Fail-Over and Auto-Scaling.
7
MANAGING NORMAL PEER JOIN/DEPARTURE
 Each normal peer intends to join an existing
corporate network must first connect to the
bootstrap peer.
 The joined peer will receive the corporate network
information including the current participants, global
schema, role definitions, and an issued certificate.
 When a normal peer needs to leave the network, it
also notifies the bootstrap peer first.
8
AUTO FAIL-OVER AND AUTO-SCALING
 In addition to managing peer join and peer
departure.
 The bootstrap peer spends most of its running-time
on monitoring the health of normal peers.
 Scheduling fail-over and auto-scaling events.
9
4.NORMAL PEER
Offline data flow
The data are extracted
periodically by a data
loader from the business
production system to the
normal peer instance.
Online data flow
 The query processor
performs user queries
using a fetch and
process strategy.
10
SCHEMA MAPPING
 Defines the mapping between the local schema of
each production system and the global shared
schema.
 The mapping consists of metadata mappings and
value mappings and also support instance level
mapping.
11
DATA LOADER
 Extracts data from production systems to normal peer
instances according to the result of schema mapping.
 The data loader also creates a snapshot of the newly
inserted data.
 At interval times, re-extracts data from the production system
to create a new snapshot.
 This snapshot is then compared to the previously stored one
to detect data changes.
 Finally, the changes are used to update the MySQL database
hosted in the normal peer. 12
DATA INDEXER
 BATON
 The first range, R0, is the subdomain maintained by
the node.
 The second range, R1, is the domain of the sub tree
rooted at the node. 13
DISTRIBUTED ACCESS CONTROL
 The basic idea is to use roles as templates to
capture common data access privileges and allow
businesses to override these privileges to meet
their specific needs.
 The information of the users created at one peer is
forwarded to the bootstrap peer and then
broadcasted to other normal peers also.
 The local administrator at this peer can easily
define the role-based access control for any user.
14
PAY-AS-YOU-GO QUERY PROCESSING
 BestPeer++ provides two services for the
participants:
1. Storage service
2. Search service
 After data are exported from the local business
system into a BestPeer++ instance, we apply the
schema mapping rules to transform them into the
predefined formats.
15
5.BENCHMARKING
 This section evaluates the performance and throughput
of BestPeer++ on Amazon cloud platform.
1. For the performance benchmark, they compare the
query latency of BestPeer++ with HadoopDB using
five queries selected from typical corporate network
applications workloads.
2. For the throughput benchmark, they create a simple
supply-chain network consisting of suppliers and
retailers and study the query throughput of the
system.
16
6.ADVANTAGES OF BESTPEER++
1. Deliver near linear query throughput as the number of
normal peers grows.
2. BestPeer++ adopts the pay-as-you-go business model
popularized by cloud computing.
3. The role-based access control for the inherent distributed
environment of corporate networks.
4. P2P technology to retrieve data between business
partners.
5. Efficient data sharing within corporate networks. 17
7.CONCLUSION
 The benchmark conducted on Amazon EC2 cloud
platform shows that our system can efficiently
handle typical workloads in a corporate network
and can deliver near linear query throughput as the
number of normal peers grows.
 Therefore, BestPeer++ is a promising solution for
efficient data sharing within corporate networks.
18
REFERENCES
1. S. Wu, Q.H. Vu, J. Li, and K.-L. Tan, “Adaptive Multi-Join
Query Processing in PDBMS,” Proc. IEEE Int’l Conf. Data
Eng. (ICDE ’09), pp. 1239-1242, 2009.
2. D. Bermbach and S. Tai, “Eventual Consistency: How Soon
is Eventual? An Evaluation of Amazon s3’s Consistency
Behavior,” in Proc. 6th Workshop Middleware Serv. Oriented
Comput. (MW4SOC ’11), pp. 1:1-1:6, NY, USA, 2011.
3. B. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R.
Sears, “Benchmarking Cloud Serving Systems with YCSB,”
Proc. First ACM Symp. Cloud Computing, pp. 143-154,
2010.
4. Oracle Inc., “Achieving the Cloud Computing Vision,” White
Paper, 2010. 19
20

More Related Content

What's hot

Building a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with DatabricksBuilding a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with Databricks
Databricks
 
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Senturus
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
DataWorks Summit
 
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
EMC
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
DataWorks Summit/Hadoop Summit
 
Building and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache OozieBuilding and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache Oozie
DataWorks Summit/Hadoop Summit
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
DataWorks Summit/Hadoop Summit
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against Disasters
DataWorks Summit
 

What's hot (8)

Building a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with DatabricksBuilding a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with Databricks
 
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
 
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
Building and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache OozieBuilding and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache Oozie
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against Disasters
 

Viewers also liked

श्रीराम समर्थ Dr shriniwas kashalikar
श्रीराम समर्थ  Dr shriniwas kashalikarश्रीराम समर्थ  Dr shriniwas kashalikar
श्रीराम समर्थ Dr shriniwas kashalikar
shriniwas kashalikar
 
Project Management Experience - Anthony D. Braunscheidel
Project Management Experience - Anthony D. BraunscheidelProject Management Experience - Anthony D. Braunscheidel
Project Management Experience - Anthony D. Braunscheidel
Anthony Braunscheidel
 
Six Tips for Great Customer Interviews
Six Tips for Great Customer InterviewsSix Tips for Great Customer Interviews
Six Tips for Great Customer Interviews
Anne Piessens
 
Superliving (concise edition) dr. shriniwas kashalikar
Superliving (concise edition) dr. shriniwas kashalikarSuperliving (concise edition) dr. shriniwas kashalikar
Superliving (concise edition) dr. shriniwas kashalikar
shriniwas kashalikar
 
Alfabeto de nomes h
Alfabeto de nomes   hAlfabeto de nomes   h
Alfabeto de nomes hDário Reis
 
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
SERVIER POLSKA
 
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
ARERA
 
Financing the water sector
Financing the water sectorFinancing the water sector
Financing the water sector
ARERA
 
Broker de Forex RoboForex
Broker de Forex RoboForexBroker de Forex RoboForex
Broker de Forex RoboForex
Raul Canessa
 
Estrategia de trading Nihilist para Forex basada en MT4
Estrategia de trading Nihilist para Forex basada en MT4Estrategia de trading Nihilist para Forex basada en MT4
Estrategia de trading Nihilist para Forex basada en MT4
Raul Canessa
 
PPt on internet
PPt on internetPPt on internet
PPt on internet
Deepansh Goel
 
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
ARERA
 
An Overview of Scenario Planning - Introduction, Overview and Examples
An Overview of Scenario Planning - Introduction, Overview and ExamplesAn Overview of Scenario Planning - Introduction, Overview and Examples
An Overview of Scenario Planning - Introduction, Overview and Examples
Axiom EPM
 
Artificial Intelligence and the Law of Armed Conflict
Artificial Intelligence and the Law of Armed ConflictArtificial Intelligence and the Law of Armed Conflict
Artificial Intelligence and the Law of Armed Conflict
Dr. Lydia Kostopoulos
 

Viewers also liked (19)

श्रीराम समर्थ Dr shriniwas kashalikar
श्रीराम समर्थ  Dr shriniwas kashalikarश्रीराम समर्थ  Dr shriniwas kashalikar
श्रीराम समर्थ Dr shriniwas kashalikar
 
The Presentation Secrets of Steve Jobs
The Presentation Secrets of Steve JobsThe Presentation Secrets of Steve Jobs
The Presentation Secrets of Steve Jobs
 
Edi
EdiEdi
Edi
 
Project Management Experience - Anthony D. Braunscheidel
Project Management Experience - Anthony D. BraunscheidelProject Management Experience - Anthony D. Braunscheidel
Project Management Experience - Anthony D. Braunscheidel
 
Ebay
EbayEbay
Ebay
 
Six Tips for Great Customer Interviews
Six Tips for Great Customer InterviewsSix Tips for Great Customer Interviews
Six Tips for Great Customer Interviews
 
Joy of life
Joy of lifeJoy of life
Joy of life
 
Superliving (concise edition) dr. shriniwas kashalikar
Superliving (concise edition) dr. shriniwas kashalikarSuperliving (concise edition) dr. shriniwas kashalikar
Superliving (concise edition) dr. shriniwas kashalikar
 
Alfabeto de nomes h
Alfabeto de nomes   hAlfabeto de nomes   h
Alfabeto de nomes h
 
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
 
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
 
Financing the water sector
Financing the water sectorFinancing the water sector
Financing the water sector
 
Broker de Forex RoboForex
Broker de Forex RoboForexBroker de Forex RoboForex
Broker de Forex RoboForex
 
Estrategia de trading Nihilist para Forex basada en MT4
Estrategia de trading Nihilist para Forex basada en MT4Estrategia de trading Nihilist para Forex basada en MT4
Estrategia de trading Nihilist para Forex basada en MT4
 
PPt on internet
PPt on internetPPt on internet
PPt on internet
 
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
 
An Overview of Scenario Planning - Introduction, Overview and Examples
An Overview of Scenario Planning - Introduction, Overview and ExamplesAn Overview of Scenario Planning - Introduction, Overview and Examples
An Overview of Scenario Planning - Introduction, Overview and Examples
 
Artificial Intelligence and the Law of Armed Conflict
Artificial Intelligence and the Law of Armed ConflictArtificial Intelligence and the Law of Armed Conflict
Artificial Intelligence and the Law of Armed Conflict
 
Sustainability Hub Presentation
Sustainability Hub PresentationSustainability Hub Presentation
Sustainability Hub Presentation
 

Similar to Best peer++

Bestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platformBestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platform
Papitha Velumani
 
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing PlatformJPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
chennaijp
 
About CDAP
About CDAPAbout CDAP
About CDAP
Cask Data
 
best peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platformbest peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platform
swathi78
 
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
IEEEMEMTECHSTUDENTSPROJECTS
 
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEEFINALYEARSTUDENTPROJECTS
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataEMC
 
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for CloudIDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
Angel Villar Garea
 
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
AboutYouGmbH
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data Lakes
DataWorks Summit
 
Tuning database performance
Tuning database performanceTuning database performance
Tuning database performanceBinay Acharya
 
TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16Terence White
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.
Karl Roche
 
Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)
Denodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Denodo
 
IBM Cloud pak for data brochure
IBM Cloud pak for data   brochureIBM Cloud pak for data   brochure
IBM Cloud pak for data brochure
Simon Harrison ACMA CGMA
 
SAP-Cloud-and-SAP-On-Premise-Integration.pptx
SAP-Cloud-and-SAP-On-Premise-Integration.pptxSAP-Cloud-and-SAP-On-Premise-Integration.pptx
SAP-Cloud-and-SAP-On-Premise-Integration.pptx
snakconsultancy
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?
Precisely
 
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM Events
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Appfluent Technology
 

Similar to Best peer++ (20)

Bestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platformBestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platform
 
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing PlatformJPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
 
About CDAP
About CDAPAbout CDAP
About CDAP
 
best peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platformbest peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platform
 
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
 
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big Data
 
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for CloudIDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
 
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data Lakes
 
Tuning database performance
Tuning database performanceTuning database performance
Tuning database performance
 
TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.
 
Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
IBM Cloud pak for data brochure
IBM Cloud pak for data   brochureIBM Cloud pak for data   brochure
IBM Cloud pak for data brochure
 
SAP-Cloud-and-SAP-On-Premise-Integration.pptx
SAP-Cloud-and-SAP-On-Premise-Integration.pptxSAP-Cloud-and-SAP-On-Premise-Integration.pptx
SAP-Cloud-and-SAP-On-Premise-Integration.pptx
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?
 
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
 

Recently uploaded

The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 

Recently uploaded (20)

The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 

Best peer++

  • 1. BESTPEER++: A PEER-TO-PEER BASED LARGE-SCALE DATA PROCESSING PLATFORM Submitted by, PRABHUDEV R 4NI12IS416 1
  • 2. AGENDA 1. Introduction. 2. Overview of the BestPeer++ system. 3. Bootstrap peer. 4. Normal peer. 5. Benchmarking. 6. Advantages. 7. Conclusion. 2
  • 3. 1. INTRODUCTION  Corporate network shares information with participating companies of a common interest.  Companies reduce their operational cost and increase the revenue.  Delivers elastic data sharing services.  Provides economical , flexible and scalable platform.  Based on pay as you go business model. 3
  • 4. 2. OVERVIEW OF THE BESTPEER++ SYSTEM BestPeer data management platform  Adaptive join query processing  distributed online aggregation techniques to provide efficient query processing. BestPeer++, cloud enabled evolution of BestPeer  Distributed access control  Multiple types of indexes  Pay-as-you-go query processing for delivering elastic data sharing services in the cloud.  The software components of BestPeer++ are separated into two parts:- 1. Core. 2. Adapter. 4
  • 5. AMAZON CLOUD ADAPTER  Elastic hardware infrastructure for BestPeer++ to operate on by using Amazon Cloud services.  Launching/terminating dedicated MySQL database servers and monitoring/ backup/auto-scaling those servers.  Finally, the Amazon Cloud Adapter also provides automatic fail-over service. 5
  • 6. THE BESTPEER++ CORE  Platform-independent logic, including query processing and P2P overlay.  Cloud adapter and consists of two software components: 1. Bootstrap peer. 2. Normal peer. 6
  • 7. 3. BOOTSTRAP PEER  The bootstrap peer is run by the BestPeer++ service provider, and its main functionality is to manage the BestPeer+ + network 1. Managing Normal Peer Join/Departure. 2. Auto Fail-Over and Auto-Scaling. 7
  • 8. MANAGING NORMAL PEER JOIN/DEPARTURE  Each normal peer intends to join an existing corporate network must first connect to the bootstrap peer.  The joined peer will receive the corporate network information including the current participants, global schema, role definitions, and an issued certificate.  When a normal peer needs to leave the network, it also notifies the bootstrap peer first. 8
  • 9. AUTO FAIL-OVER AND AUTO-SCALING  In addition to managing peer join and peer departure.  The bootstrap peer spends most of its running-time on monitoring the health of normal peers.  Scheduling fail-over and auto-scaling events. 9
  • 10. 4.NORMAL PEER Offline data flow The data are extracted periodically by a data loader from the business production system to the normal peer instance. Online data flow  The query processor performs user queries using a fetch and process strategy. 10
  • 11. SCHEMA MAPPING  Defines the mapping between the local schema of each production system and the global shared schema.  The mapping consists of metadata mappings and value mappings and also support instance level mapping. 11
  • 12. DATA LOADER  Extracts data from production systems to normal peer instances according to the result of schema mapping.  The data loader also creates a snapshot of the newly inserted data.  At interval times, re-extracts data from the production system to create a new snapshot.  This snapshot is then compared to the previously stored one to detect data changes.  Finally, the changes are used to update the MySQL database hosted in the normal peer. 12
  • 13. DATA INDEXER  BATON  The first range, R0, is the subdomain maintained by the node.  The second range, R1, is the domain of the sub tree rooted at the node. 13
  • 14. DISTRIBUTED ACCESS CONTROL  The basic idea is to use roles as templates to capture common data access privileges and allow businesses to override these privileges to meet their specific needs.  The information of the users created at one peer is forwarded to the bootstrap peer and then broadcasted to other normal peers also.  The local administrator at this peer can easily define the role-based access control for any user. 14
  • 15. PAY-AS-YOU-GO QUERY PROCESSING  BestPeer++ provides two services for the participants: 1. Storage service 2. Search service  After data are exported from the local business system into a BestPeer++ instance, we apply the schema mapping rules to transform them into the predefined formats. 15
  • 16. 5.BENCHMARKING  This section evaluates the performance and throughput of BestPeer++ on Amazon cloud platform. 1. For the performance benchmark, they compare the query latency of BestPeer++ with HadoopDB using five queries selected from typical corporate network applications workloads. 2. For the throughput benchmark, they create a simple supply-chain network consisting of suppliers and retailers and study the query throughput of the system. 16
  • 17. 6.ADVANTAGES OF BESTPEER++ 1. Deliver near linear query throughput as the number of normal peers grows. 2. BestPeer++ adopts the pay-as-you-go business model popularized by cloud computing. 3. The role-based access control for the inherent distributed environment of corporate networks. 4. P2P technology to retrieve data between business partners. 5. Efficient data sharing within corporate networks. 17
  • 18. 7.CONCLUSION  The benchmark conducted on Amazon EC2 cloud platform shows that our system can efficiently handle typical workloads in a corporate network and can deliver near linear query throughput as the number of normal peers grows.  Therefore, BestPeer++ is a promising solution for efficient data sharing within corporate networks. 18
  • 19. REFERENCES 1. S. Wu, Q.H. Vu, J. Li, and K.-L. Tan, “Adaptive Multi-Join Query Processing in PDBMS,” Proc. IEEE Int’l Conf. Data Eng. (ICDE ’09), pp. 1239-1242, 2009. 2. D. Bermbach and S. Tai, “Eventual Consistency: How Soon is Eventual? An Evaluation of Amazon s3’s Consistency Behavior,” in Proc. 6th Workshop Middleware Serv. Oriented Comput. (MW4SOC ’11), pp. 1:1-1:6, NY, USA, 2011. 3. B. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears, “Benchmarking Cloud Serving Systems with YCSB,” Proc. First ACM Symp. Cloud Computing, pp. 143-154, 2010. 4. Oracle Inc., “Achieving the Cloud Computing Vision,” White Paper, 2010. 19
  • 20. 20