SlideShare a Scribd company logo
1 of 30
BranchReduce	
  
Distributed	
  Branch-­‐and-­‐Bound	
  on	
  YARN	
  
June	
  14,	
  2012	
  
	
  
About	
  Me	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
     2	
  
Hadoop	
  Distributed	
  Processing	
  Frameworks	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Lots	
  of	
  Other	
  Parallel	
  Processing	
  PlaIorms	
  




                     Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Hadoop	
  2.0:	
  Resource	
  Scheduling	
  with	
  YARN	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
The	
  Data	
  Deluge	
  and	
  the	
  Cambrian	
  Explosion	
  




                     Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Parallel	
  Distributed	
  Processing	
  For	
  Everyone	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Building	
  a	
  New	
  Processing	
  Framework	
  on	
  YARN	
  




                 Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
A	
  Terrifyingly	
  Accurate	
  Paraphrasing	
  of	
  JWZ	
  

	
  
	
  
Some	
  people,	
  when	
  confronted	
  with	
  a	
  tedious
                                                            	
  
problem,	
  say,	
  “I	
  know,	
  I’ll	
  write	
  a	
  
framework.”	
  Now	
  they	
  have	
  two	
  tedious	
  
problems.	
  



                     Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
On	
  Designing	
  Frameworks	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
The	
  Example	
  YARN	
  App:	
  Distributed	
  Shell	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Do	
  We	
  Need	
  a	
  New	
  Programming	
  Language	
  for	
  
            Developing	
  YARN	
  ApplicaUons?     	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Do	
  We	
  Need	
  a	
  New	
  Programming	
  Language	
  for	
  
            Developing	
  YARN	
  ApplicaUons?     	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Leverage	
  ExisUng	
  Frameworks	
  

 •  Popular	
  RPC	
  libraries	
  
    with	
  support	
  for	
  
    mul@ple	
  languages	
  
     •  C++,	
  Java,	
  Python	
  


 •  We	
  need	
  to	
  make	
  it	
  
    easy	
  to	
  deploy	
  exisUng	
  
    applicaUons	
  on	
  YARN	
  



                          Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
KiXen:	
  Playing	
  with	
  YARN	
  




                     Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Design	
  PaXern:	
  The	
  Unified	
  ApplicaUon	
  Master	
  

                                                                •  Contains	
  business	
  logic	
  
                                                                   and	
  YARN	
  logic	
  
                                                                •  Primary	
  reason:	
  
                                                                   Communica@on	
  
                                                                          •  Also:	
  dynamic	
  resource	
  
                                                                             alloca@on	
  
                                                                •  Develop	
  our	
  master/
                                                                   worker	
  applicaUons	
  
                                                                   locally	
  and	
  then	
  deploy	
  
                                                                   them	
  on	
  YARN	
  
                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
YARN	
  Lifecycle	
  Management	
  as	
  a	
  Service	
  

 •  Specifically,	
  extensions	
  
    of	
  Guava’s	
  Service	
  
    interface	
  
      •  YarnClientService	
  
      •  AppMasterService	
  
 •  Contains	
  all	
  of	
  the	
  logic	
  
    for	
  crea@ng	
  applica@ons	
  
    and	
  keeping	
  an	
  eye	
  on	
  
    them	
  


                          Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Moving	
  the	
  ConfiguraUon	
  Logic	
  Out	
  of	
  Java	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Lua	
  as	
  a	
  ConfiguraUon	
  Language	
  

 •  Small	
  and	
  Simple	
  
     •  Looks	
  like	
  a	
  
        configura@on	
  file	
  
     •  Func@ons	
  are	
  there	
  
        when/if	
  you	
  need	
  them	
  
 •  Inheritance	
  
     •  Don’t	
  Repeat	
  Yourself	
  
 •  Forgiving	
  of	
  undefined	
  
    values	
  
 •  Java/C++	
  Integra@on	
  
                        Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
First	
  KiXen	
  UUlity:	
  The	
  cat	
  FuncUon	
  




                      Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Second	
  KiXen	
  UUlity:	
  The	
  yarn	
  FuncUon	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
BranchReduce	
  




Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Branch-­‐and-­‐Bound	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
The	
  Challenge	
  of	
  Parallel	
  Branch	
  and	
  Bound:	
  
Unbalanced	
  Search	
  Space	
  
                                                                  •  Some	
  branches	
  are	
  
                                                                     pruned	
  quickly	
  

                                                                  •  Can	
  be	
  difficult	
  to	
  
                                                                     determine	
  the	
  best	
  
                                                                     splits	
  a	
  priori	
  

                                                                  •  Easy	
  to	
  revert	
  to	
  a	
  de	
  
                                                                     facto	
  single-­‐threaded	
  
                                                                     search	
  
                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
The	
  SoluUon:	
  Work	
  Stealing	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
You	
  Write	
  Three	
  Classes	
  

•  A	
  Task	
  class	
  that	
  implements	
  Writable	
  

•  A	
  GlobalState	
  class	
  that	
  implements	
  Writable	
  and	
  has	
  a	
  
   mergeWith(GlobalState	
  other)	
  method	
  

•  A	
  Processor	
  class	
  that	
  defines:	
  
    •  execute(T	
  task,	
  BranchReduceContext<T,	
  GlobalState>	
  ctxt);	
  
    •  With	
  op@onal	
  iniUalize	
  and	
  cleanup	
  methods	
  


•  Configura@on	
  is	
  done	
  via	
  BranchReduceJob	
  
                          Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Example:	
  The	
  Knapsack	
  Problem	
  




       Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
0-­‐1	
  Integer	
  Programming	
  Problems	
  


 •  NP-­‐Hard	
  Resource	
  
    Alloca@on	
  Problem	
  

 •  Por_olio	
  Op@miza@on	
  

 •  Asset	
  Securi@za@on	
  



                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Problem	
  FormulaUon:	
  (Simplified)	
  LP	
  Format	
  




                   Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
QuesUons?	
  
@josh_wills	
  

More Related Content

What's hot

Oracle cloud story short
Oracle cloud story   shortOracle cloud story   short
Oracle cloud story shortYuri Grinshteyn
 
Cloud Computing - Making IT Simple
 Cloud Computing - Making IT Simple Cloud Computing - Making IT Simple
Cloud Computing - Making IT SimpleBob Rhubart
 
IO Resource Management on Exadata
IO Resource Management on ExadataIO Resource Management on Exadata
IO Resource Management on ExadataEnkitec
 
Engineered Systems: Oracle’s Vision for the Future
Engineered Systems: Oracle’s Vision for the FutureEngineered Systems: Oracle’s Vision for the Future
Engineered Systems: Oracle’s Vision for the FutureBob Rhubart
 
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...Novell
 
Accelerate to the Cloud
Accelerate to the CloudAccelerate to the Cloud
Accelerate to the CloudNovell
 
Improving h base availability and repair
Improving h base availability and repairImproving h base availability and repair
Improving h base availability and repairDataWorks Summit
 
Advanced DNS/DHCP for Novell eDirectory Environments
Advanced DNS/DHCP for Novell eDirectory EnvironmentsAdvanced DNS/DHCP for Novell eDirectory Environments
Advanced DNS/DHCP for Novell eDirectory EnvironmentsNovell
 
Managed Services Seminar Presentation
Managed Services Seminar PresentationManaged Services Seminar Presentation
Managed Services Seminar Presentationgerrymark
 
How Cloud PaaS Improves Performance while Reducing Costs
How Cloud PaaS Improves Performance while Reducing CostsHow Cloud PaaS Improves Performance while Reducing Costs
How Cloud PaaS Improves Performance while Reducing CostsAcquia
 
Distributed Block-level Storage Management for OpenStack, by Danile lee
Distributed Block-level Storage Management for OpenStack, by Danile leeDistributed Block-level Storage Management for OpenStack, by Danile lee
Distributed Block-level Storage Management for OpenStack, by Danile leeHui Cheng
 
Oracle VM Consolidation and Path to the Cloud
Oracle VM Consolidation and Path to the CloudOracle VM Consolidation and Path to the Cloud
Oracle VM Consolidation and Path to the CloudBob Rhubart
 
Shannon McFarland OpenStack/Cisco Intro
Shannon McFarland OpenStack/Cisco IntroShannon McFarland OpenStack/Cisco Intro
Shannon McFarland OpenStack/Cisco IntroShannon McFarland
 
Novell Identity Manager Troubleshooting
Novell Identity Manager TroubleshootingNovell Identity Manager Troubleshooting
Novell Identity Manager TroubleshootingNovell
 
Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012Benoit Hudzia
 
Applying Novell Identity Manager to Your Everyday Problems
Applying Novell Identity Manager to Your Everyday ProblemsApplying Novell Identity Manager to Your Everyday Problems
Applying Novell Identity Manager to Your Everyday ProblemsNovell
 
OSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suckOSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suckAaron Walker
 
Common and unique use cases for Apache Hadoop
Common and unique use cases for Apache HadoopCommon and unique use cases for Apache Hadoop
Common and unique use cases for Apache HadoopBrock Noland
 
Tim Mackinnon Agile And Beyond
Tim Mackinnon Agile And BeyondTim Mackinnon Agile And Beyond
Tim Mackinnon Agile And Beyonddeimos
 

What's hot (19)

Oracle cloud story short
Oracle cloud story   shortOracle cloud story   short
Oracle cloud story short
 
Cloud Computing - Making IT Simple
 Cloud Computing - Making IT Simple Cloud Computing - Making IT Simple
Cloud Computing - Making IT Simple
 
IO Resource Management on Exadata
IO Resource Management on ExadataIO Resource Management on Exadata
IO Resource Management on Exadata
 
Engineered Systems: Oracle’s Vision for the Future
Engineered Systems: Oracle’s Vision for the FutureEngineered Systems: Oracle’s Vision for the Future
Engineered Systems: Oracle’s Vision for the Future
 
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
 
Accelerate to the Cloud
Accelerate to the CloudAccelerate to the Cloud
Accelerate to the Cloud
 
Improving h base availability and repair
Improving h base availability and repairImproving h base availability and repair
Improving h base availability and repair
 
Advanced DNS/DHCP for Novell eDirectory Environments
Advanced DNS/DHCP for Novell eDirectory EnvironmentsAdvanced DNS/DHCP for Novell eDirectory Environments
Advanced DNS/DHCP for Novell eDirectory Environments
 
Managed Services Seminar Presentation
Managed Services Seminar PresentationManaged Services Seminar Presentation
Managed Services Seminar Presentation
 
How Cloud PaaS Improves Performance while Reducing Costs
How Cloud PaaS Improves Performance while Reducing CostsHow Cloud PaaS Improves Performance while Reducing Costs
How Cloud PaaS Improves Performance while Reducing Costs
 
Distributed Block-level Storage Management for OpenStack, by Danile lee
Distributed Block-level Storage Management for OpenStack, by Danile leeDistributed Block-level Storage Management for OpenStack, by Danile lee
Distributed Block-level Storage Management for OpenStack, by Danile lee
 
Oracle VM Consolidation and Path to the Cloud
Oracle VM Consolidation and Path to the CloudOracle VM Consolidation and Path to the Cloud
Oracle VM Consolidation and Path to the Cloud
 
Shannon McFarland OpenStack/Cisco Intro
Shannon McFarland OpenStack/Cisco IntroShannon McFarland OpenStack/Cisco Intro
Shannon McFarland OpenStack/Cisco Intro
 
Novell Identity Manager Troubleshooting
Novell Identity Manager TroubleshootingNovell Identity Manager Troubleshooting
Novell Identity Manager Troubleshooting
 
Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012
 
Applying Novell Identity Manager to Your Everyday Problems
Applying Novell Identity Manager to Your Everyday ProblemsApplying Novell Identity Manager to Your Everyday Problems
Applying Novell Identity Manager to Your Everyday Problems
 
OSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suckOSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suck
 
Common and unique use cases for Apache Hadoop
Common and unique use cases for Apache HadoopCommon and unique use cases for Apache Hadoop
Common and unique use cases for Apache Hadoop
 
Tim Mackinnon Agile And Beyond
Tim Mackinnon Agile And BeyondTim Mackinnon Agile And Beyond
Tim Mackinnon Agile And Beyond
 

Viewers also liked

Grow Revenue with Big Data
Grow Revenue with Big DataGrow Revenue with Big Data
Grow Revenue with Big DataLattice Engines
 
Big_data for marketing and sales
Big_data for marketing and salesBig_data for marketing and sales
Big_data for marketing and salesCMR WORLD TECH
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data SolutionJames Serra
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutionssolarisyougood
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionTop 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionDataStax
 

Viewers also liked (7)

Grow Revenue with Big Data
Grow Revenue with Big DataGrow Revenue with Big Data
Grow Revenue with Big Data
 
Big_data for marketing and sales
Big_data for marketing and salesBig_data for marketing and sales
Big_data for marketing and sales
 
Oracle's BigData solutions
Oracle's BigData solutionsOracle's BigData solutions
Oracle's BigData solutions
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutions
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionTop 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data Solution
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

Similar to BranchReduce Distributed Branch-and-Bound on YARN

Machine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and FutureMachine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and FutureData Science London
 
Building Blocks for Private and Hybrid Clouds
Building Blocks for Private and Hybrid CloudsBuilding Blocks for Private and Hybrid Clouds
Building Blocks for Private and Hybrid CloudsRightScale
 
Hadoop and Machine Learning
Hadoop and Machine LearningHadoop and Machine Learning
Hadoop and Machine Learningjoshwills
 
Machine Learning and Hadoop: Present and future
Machine Learning and Hadoop: Present and futureMachine Learning and Hadoop: Present and future
Machine Learning and Hadoop: Present and futureCloudera, Inc.
 
Rightscale Webinar: Building Blocks for Private and Hybrid Clouds
Rightscale Webinar: Building Blocks for Private and Hybrid CloudsRightscale Webinar: Building Blocks for Private and Hybrid Clouds
Rightscale Webinar: Building Blocks for Private and Hybrid CloudsRightScale
 
VMware - Snapshot sessions - Deploy and manage tomorrow's applications today
VMware - Snapshot sessions  - Deploy and manage tomorrow's applications todayVMware - Snapshot sessions  - Deploy and manage tomorrow's applications today
VMware - Snapshot sessions - Deploy and manage tomorrow's applications todayAnnSteyaert_vmware
 
Dell OpenStack Powered Cloud Solution and Case Sharing
Dell OpenStack Powered Cloud Solution and Case SharingDell OpenStack Powered Cloud Solution and Case Sharing
Dell OpenStack Powered Cloud Solution and Case SharingHui Cheng
 
VA Smalltalk Update
VA Smalltalk UpdateVA Smalltalk Update
VA Smalltalk UpdateESUG
 
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql ClusterSanto Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql ClusterSanto Leto
 
Keynote -henry xu--dell open stack powered cloud solution and case sharing-
Keynote -henry xu--dell open stack powered cloud solution and case sharing-Keynote -henry xu--dell open stack powered cloud solution and case sharing-
Keynote -henry xu--dell open stack powered cloud solution and case sharing-OpenCity Community
 
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Extending The Value Of Oracle Crm On Demand Through Cloud Based ExtensibilityExtending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Extending The Value Of Oracle Crm On Demand Through Cloud Based ExtensibilityJerome Leonard
 
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...JAX London
 
Yarns about YARN: Migrating to MapReduce v2
Yarns about YARN: Migrating to MapReduce v2Yarns about YARN: Migrating to MapReduce v2
Yarns about YARN: Migrating to MapReduce v2DataWorks Summit
 
Open stack powered_cloud_solution_interop
Open stack powered_cloud_solution_interopOpen stack powered_cloud_solution_interop
Open stack powered_cloud_solution_interopKamesh Pemmaraju
 
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2Cloudera, Inc.
 
Houston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Houston Hadoop Meetup Presentation by Vikram Oberoi of ClouderaHouston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Houston Hadoop Meetup Presentation by Vikram Oberoi of ClouderaMark Kerzner
 
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012Arun Gupta
 
CARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenterCARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCentercara4oraclewebcenter
 
Oracle my sql cluster cge
Oracle my sql cluster cgeOracle my sql cluster cge
Oracle my sql cluster cgeseungdon1
 

Similar to BranchReduce Distributed Branch-and-Bound on YARN (20)

Machine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and FutureMachine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and Future
 
Building Blocks for Private and Hybrid Clouds
Building Blocks for Private and Hybrid CloudsBuilding Blocks for Private and Hybrid Clouds
Building Blocks for Private and Hybrid Clouds
 
Hadoop and Machine Learning
Hadoop and Machine LearningHadoop and Machine Learning
Hadoop and Machine Learning
 
Machine Learning and Hadoop: Present and future
Machine Learning and Hadoop: Present and futureMachine Learning and Hadoop: Present and future
Machine Learning and Hadoop: Present and future
 
Rightscale Webinar: Building Blocks for Private and Hybrid Clouds
Rightscale Webinar: Building Blocks for Private and Hybrid CloudsRightscale Webinar: Building Blocks for Private and Hybrid Clouds
Rightscale Webinar: Building Blocks for Private and Hybrid Clouds
 
VMware - Snapshot sessions - Deploy and manage tomorrow's applications today
VMware - Snapshot sessions  - Deploy and manage tomorrow's applications todayVMware - Snapshot sessions  - Deploy and manage tomorrow's applications today
VMware - Snapshot sessions - Deploy and manage tomorrow's applications today
 
Dell OpenStack Powered Cloud Solution and Case Sharing
Dell OpenStack Powered Cloud Solution and Case SharingDell OpenStack Powered Cloud Solution and Case Sharing
Dell OpenStack Powered Cloud Solution and Case Sharing
 
VA Smalltalk Update
VA Smalltalk UpdateVA Smalltalk Update
VA Smalltalk Update
 
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql ClusterSanto Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
 
Keynote -henry xu--dell open stack powered cloud solution and case sharing-
Keynote -henry xu--dell open stack powered cloud solution and case sharing-Keynote -henry xu--dell open stack powered cloud solution and case sharing-
Keynote -henry xu--dell open stack powered cloud solution and case sharing-
 
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Extending The Value Of Oracle Crm On Demand Through Cloud Based ExtensibilityExtending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
 
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
 
Yarns about YARN: Migrating to MapReduce v2
Yarns about YARN: Migrating to MapReduce v2Yarns about YARN: Migrating to MapReduce v2
Yarns about YARN: Migrating to MapReduce v2
 
YARN
YARNYARN
YARN
 
Open stack powered_cloud_solution_interop
Open stack powered_cloud_solution_interopOpen stack powered_cloud_solution_interop
Open stack powered_cloud_solution_interop
 
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
 
Houston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Houston Hadoop Meetup Presentation by Vikram Oberoi of ClouderaHouston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Houston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
 
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
 
CARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenterCARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenter
 
Oracle my sql cluster cge
Oracle my sql cluster cgeOracle my sql cluster cge
Oracle my sql cluster cge
 

More from DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 

Recently uploaded (20)

Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 

BranchReduce Distributed Branch-and-Bound on YARN

  • 1. BranchReduce   Distributed  Branch-­‐and-­‐Bound  on  YARN   June  14,  2012    
  • 2. About  Me   Copyright  2012  Cloudera  Inc.  All  rights  reserved   2  
  • 3. Hadoop  Distributed  Processing  Frameworks   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 4. Lots  of  Other  Parallel  Processing  PlaIorms   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 5. Hadoop  2.0:  Resource  Scheduling  with  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 6. The  Data  Deluge  and  the  Cambrian  Explosion   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 7. Parallel  Distributed  Processing  For  Everyone   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 8. Building  a  New  Processing  Framework  on  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 9. A  Terrifyingly  Accurate  Paraphrasing  of  JWZ       Some  people,  when  confronted  with  a  tedious   problem,  say,  “I  know,  I’ll  write  a   framework.”  Now  they  have  two  tedious   problems.   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 10. On  Designing  Frameworks   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 11. The  Example  YARN  App:  Distributed  Shell   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 12. Do  We  Need  a  New  Programming  Language  for   Developing  YARN  ApplicaUons?   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 13. Do  We  Need  a  New  Programming  Language  for   Developing  YARN  ApplicaUons?   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 14. Leverage  ExisUng  Frameworks   •  Popular  RPC  libraries   with  support  for   mul@ple  languages   •  C++,  Java,  Python   •  We  need  to  make  it   easy  to  deploy  exisUng   applicaUons  on  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 15. KiXen:  Playing  with  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 16. Design  PaXern:  The  Unified  ApplicaUon  Master   •  Contains  business  logic   and  YARN  logic   •  Primary  reason:   Communica@on   •  Also:  dynamic  resource   alloca@on   •  Develop  our  master/ worker  applicaUons   locally  and  then  deploy   them  on  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 17. YARN  Lifecycle  Management  as  a  Service   •  Specifically,  extensions   of  Guava’s  Service   interface   •  YarnClientService   •  AppMasterService   •  Contains  all  of  the  logic   for  crea@ng  applica@ons   and  keeping  an  eye  on   them   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 18. Moving  the  ConfiguraUon  Logic  Out  of  Java   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 19. Lua  as  a  ConfiguraUon  Language   •  Small  and  Simple   •  Looks  like  a   configura@on  file   •  Func@ons  are  there   when/if  you  need  them   •  Inheritance   •  Don’t  Repeat  Yourself   •  Forgiving  of  undefined   values   •  Java/C++  Integra@on   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 20. First  KiXen  UUlity:  The  cat  FuncUon   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 21. Second  KiXen  UUlity:  The  yarn  FuncUon   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 22. BranchReduce   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 23. Branch-­‐and-­‐Bound   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 24. The  Challenge  of  Parallel  Branch  and  Bound:   Unbalanced  Search  Space   •  Some  branches  are   pruned  quickly   •  Can  be  difficult  to   determine  the  best   splits  a  priori   •  Easy  to  revert  to  a  de   facto  single-­‐threaded   search   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 25. The  SoluUon:  Work  Stealing   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 26. You  Write  Three  Classes   •  A  Task  class  that  implements  Writable   •  A  GlobalState  class  that  implements  Writable  and  has  a   mergeWith(GlobalState  other)  method   •  A  Processor  class  that  defines:   •  execute(T  task,  BranchReduceContext<T,  GlobalState>  ctxt);   •  With  op@onal  iniUalize  and  cleanup  methods   •  Configura@on  is  done  via  BranchReduceJob   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 27. Example:  The  Knapsack  Problem   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 28. 0-­‐1  Integer  Programming  Problems   •  NP-­‐Hard  Resource   Alloca@on  Problem   •  Por_olio  Op@miza@on   •  Asset  Securi@za@on   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 29. Problem  FormulaUon:  (Simplified)  LP  Format   Copyright  2012  Cloudera  Inc.  All  rights  reserved