SlideShare a Scribd company logo
BranchReduce	
  
Distributed	
  Branch-­‐and-­‐Bound	
  on	
  YARN	
  
June	
  14,	
  2012	
  
	
  
About	
  Me	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
     2	
  
Hadoop	
  Distributed	
  Processing	
  Frameworks	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Lots	
  of	
  Other	
  Parallel	
  Processing	
  PlaIorms	
  




                     Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Hadoop	
  2.0:	
  Resource	
  Scheduling	
  with	
  YARN	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
The	
  Data	
  Deluge	
  and	
  the	
  Cambrian	
  Explosion	
  




                     Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Parallel	
  Distributed	
  Processing	
  For	
  Everyone	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Building	
  a	
  New	
  Processing	
  Framework	
  on	
  YARN	
  




                 Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
A	
  Terrifyingly	
  Accurate	
  Paraphrasing	
  of	
  JWZ	
  

	
  
	
  
Some	
  people,	
  when	
  confronted	
  with	
  a	
  tedious
                                                            	
  
problem,	
  say,	
  “I	
  know,	
  I’ll	
  write	
  a	
  
framework.”	
  Now	
  they	
  have	
  two	
  tedious	
  
problems.	
  



                     Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
On	
  Designing	
  Frameworks	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
The	
  Example	
  YARN	
  App:	
  Distributed	
  Shell	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Do	
  We	
  Need	
  a	
  New	
  Programming	
  Language	
  for	
  
            Developing	
  YARN	
  ApplicaUons?     	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Do	
  We	
  Need	
  a	
  New	
  Programming	
  Language	
  for	
  
            Developing	
  YARN	
  ApplicaUons?     	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Leverage	
  ExisUng	
  Frameworks	
  

 •  Popular	
  RPC	
  libraries	
  
    with	
  support	
  for	
  
    mul@ple	
  languages	
  
     •  C++,	
  Java,	
  Python	
  


 •  We	
  need	
  to	
  make	
  it	
  
    easy	
  to	
  deploy	
  exisUng	
  
    applicaUons	
  on	
  YARN	
  



                          Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
KiXen:	
  Playing	
  with	
  YARN	
  




                     Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Design	
  PaXern:	
  The	
  Unified	
  ApplicaUon	
  Master	
  

                                                                •  Contains	
  business	
  logic	
  
                                                                   and	
  YARN	
  logic	
  
                                                                •  Primary	
  reason:	
  
                                                                   Communica@on	
  
                                                                          •  Also:	
  dynamic	
  resource	
  
                                                                             alloca@on	
  
                                                                •  Develop	
  our	
  master/
                                                                   worker	
  applicaUons	
  
                                                                   locally	
  and	
  then	
  deploy	
  
                                                                   them	
  on	
  YARN	
  
                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
YARN	
  Lifecycle	
  Management	
  as	
  a	
  Service	
  

 •  Specifically,	
  extensions	
  
    of	
  Guava’s	
  Service	
  
    interface	
  
      •  YarnClientService	
  
      •  AppMasterService	
  
 •  Contains	
  all	
  of	
  the	
  logic	
  
    for	
  crea@ng	
  applica@ons	
  
    and	
  keeping	
  an	
  eye	
  on	
  
    them	
  


                          Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Moving	
  the	
  ConfiguraUon	
  Logic	
  Out	
  of	
  Java	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Lua	
  as	
  a	
  ConfiguraUon	
  Language	
  

 •  Small	
  and	
  Simple	
  
     •  Looks	
  like	
  a	
  
        configura@on	
  file	
  
     •  Func@ons	
  are	
  there	
  
        when/if	
  you	
  need	
  them	
  
 •  Inheritance	
  
     •  Don’t	
  Repeat	
  Yourself	
  
 •  Forgiving	
  of	
  undefined	
  
    values	
  
 •  Java/C++	
  Integra@on	
  
                        Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
First	
  KiXen	
  UUlity:	
  The	
  cat	
  FuncUon	
  




                      Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Second	
  KiXen	
  UUlity:	
  The	
  yarn	
  FuncUon	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
BranchReduce	
  




Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Branch-­‐and-­‐Bound	
  




                  Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
The	
  Challenge	
  of	
  Parallel	
  Branch	
  and	
  Bound:	
  
Unbalanced	
  Search	
  Space	
  
                                                                  •  Some	
  branches	
  are	
  
                                                                     pruned	
  quickly	
  

                                                                  •  Can	
  be	
  difficult	
  to	
  
                                                                     determine	
  the	
  best	
  
                                                                     splits	
  a	
  priori	
  

                                                                  •  Easy	
  to	
  revert	
  to	
  a	
  de	
  
                                                                     facto	
  single-­‐threaded	
  
                                                                     search	
  
                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
The	
  SoluUon:	
  Work	
  Stealing	
  




                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
You	
  Write	
  Three	
  Classes	
  

•  A	
  Task	
  class	
  that	
  implements	
  Writable	
  

•  A	
  GlobalState	
  class	
  that	
  implements	
  Writable	
  and	
  has	
  a	
  
   mergeWith(GlobalState	
  other)	
  method	
  

•  A	
  Processor	
  class	
  that	
  defines:	
  
    •  execute(T	
  task,	
  BranchReduceContext<T,	
  GlobalState>	
  ctxt);	
  
    •  With	
  op@onal	
  iniUalize	
  and	
  cleanup	
  methods	
  


•  Configura@on	
  is	
  done	
  via	
  BranchReduceJob	
  
                          Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Example:	
  The	
  Knapsack	
  Problem	
  




       Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
0-­‐1	
  Integer	
  Programming	
  Problems	
  


 •  NP-­‐Hard	
  Resource	
  
    Alloca@on	
  Problem	
  

 •  Por_olio	
  Op@miza@on	
  

 •  Asset	
  Securi@za@on	
  



                    Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
Problem	
  FormulaUon:	
  (Simplified)	
  LP	
  Format	
  




                   Copyright	
  2012	
  Cloudera	
  Inc.	
  All	
  rights	
  reserved	
  
QuesUons?	
  
@josh_wills	
  

More Related Content

What's hot

Oracle cloud story short
Oracle cloud story   shortOracle cloud story   short
Oracle cloud story short
Yuri Grinshteyn
 
Cloud Computing - Making IT Simple
 Cloud Computing - Making IT Simple Cloud Computing - Making IT Simple
Cloud Computing - Making IT Simple
Bob Rhubart
 
IO Resource Management on Exadata
IO Resource Management on ExadataIO Resource Management on Exadata
IO Resource Management on Exadata
Enkitec
 
Engineered Systems: Oracle’s Vision for the Future
Engineered Systems: Oracle’s Vision for the FutureEngineered Systems: Oracle’s Vision for the Future
Engineered Systems: Oracle’s Vision for the Future
Bob Rhubart
 
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Novell
 
Accelerate to the Cloud
Accelerate to the CloudAccelerate to the Cloud
Accelerate to the Cloud
Novell
 
Improving h base availability and repair
Improving h base availability and repairImproving h base availability and repair
Improving h base availability and repair
DataWorks Summit
 
Advanced DNS/DHCP for Novell eDirectory Environments
Advanced DNS/DHCP for Novell eDirectory EnvironmentsAdvanced DNS/DHCP for Novell eDirectory Environments
Advanced DNS/DHCP for Novell eDirectory Environments
Novell
 
Managed Services Seminar Presentation
Managed Services Seminar PresentationManaged Services Seminar Presentation
Managed Services Seminar Presentation
gerrymark
 
How Cloud PaaS Improves Performance while Reducing Costs
How Cloud PaaS Improves Performance while Reducing CostsHow Cloud PaaS Improves Performance while Reducing Costs
How Cloud PaaS Improves Performance while Reducing Costs
Acquia
 
Distributed Block-level Storage Management for OpenStack, by Danile lee
Distributed Block-level Storage Management for OpenStack, by Danile leeDistributed Block-level Storage Management for OpenStack, by Danile lee
Distributed Block-level Storage Management for OpenStack, by Danile lee
Hui Cheng
 
Oracle VM Consolidation and Path to the Cloud
Oracle VM Consolidation and Path to the CloudOracle VM Consolidation and Path to the Cloud
Oracle VM Consolidation and Path to the Cloud
Bob Rhubart
 
Shannon McFarland OpenStack/Cisco Intro
Shannon McFarland OpenStack/Cisco IntroShannon McFarland OpenStack/Cisco Intro
Shannon McFarland OpenStack/Cisco Intro
Shannon McFarland
 
Novell Identity Manager Troubleshooting
Novell Identity Manager TroubleshootingNovell Identity Manager Troubleshooting
Novell Identity Manager Troubleshooting
Novell
 
Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012
Benoit Hudzia
 
Applying Novell Identity Manager to Your Everyday Problems
Applying Novell Identity Manager to Your Everyday ProblemsApplying Novell Identity Manager to Your Everyday Problems
Applying Novell Identity Manager to Your Everyday Problems
Novell
 
OSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suckOSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suck
Aaron Walker
 
Common and unique use cases for Apache Hadoop
Common and unique use cases for Apache HadoopCommon and unique use cases for Apache Hadoop
Common and unique use cases for Apache Hadoop
Brock Noland
 
Tim Mackinnon Agile And Beyond
Tim Mackinnon Agile And BeyondTim Mackinnon Agile And Beyond
Tim Mackinnon Agile And Beyond
deimos
 

What's hot (19)

Oracle cloud story short
Oracle cloud story   shortOracle cloud story   short
Oracle cloud story short
 
Cloud Computing - Making IT Simple
 Cloud Computing - Making IT Simple Cloud Computing - Making IT Simple
Cloud Computing - Making IT Simple
 
IO Resource Management on Exadata
IO Resource Management on ExadataIO Resource Management on Exadata
IO Resource Management on Exadata
 
Engineered Systems: Oracle’s Vision for the Future
Engineered Systems: Oracle’s Vision for the FutureEngineered Systems: Oracle’s Vision for the Future
Engineered Systems: Oracle’s Vision for the Future
 
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
Protection against Lost or Stolen Data with Novell ZENworks Endpoint Security...
 
Accelerate to the Cloud
Accelerate to the CloudAccelerate to the Cloud
Accelerate to the Cloud
 
Improving h base availability and repair
Improving h base availability and repairImproving h base availability and repair
Improving h base availability and repair
 
Advanced DNS/DHCP for Novell eDirectory Environments
Advanced DNS/DHCP for Novell eDirectory EnvironmentsAdvanced DNS/DHCP for Novell eDirectory Environments
Advanced DNS/DHCP for Novell eDirectory Environments
 
Managed Services Seminar Presentation
Managed Services Seminar PresentationManaged Services Seminar Presentation
Managed Services Seminar Presentation
 
How Cloud PaaS Improves Performance while Reducing Costs
How Cloud PaaS Improves Performance while Reducing CostsHow Cloud PaaS Improves Performance while Reducing Costs
How Cloud PaaS Improves Performance while Reducing Costs
 
Distributed Block-level Storage Management for OpenStack, by Danile lee
Distributed Block-level Storage Management for OpenStack, by Danile leeDistributed Block-level Storage Management for OpenStack, by Danile lee
Distributed Block-level Storage Management for OpenStack, by Danile lee
 
Oracle VM Consolidation and Path to the Cloud
Oracle VM Consolidation and Path to the CloudOracle VM Consolidation and Path to the Cloud
Oracle VM Consolidation and Path to the Cloud
 
Shannon McFarland OpenStack/Cisco Intro
Shannon McFarland OpenStack/Cisco IntroShannon McFarland OpenStack/Cisco Intro
Shannon McFarland OpenStack/Cisco Intro
 
Novell Identity Manager Troubleshooting
Novell Identity Manager TroubleshootingNovell Identity Manager Troubleshooting
Novell Identity Manager Troubleshooting
 
Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012Lego Cloud SAP Virtualization Week 2012
Lego Cloud SAP Virtualization Week 2012
 
Applying Novell Identity Manager to Your Everyday Problems
Applying Novell Identity Manager to Your Everyday ProblemsApplying Novell Identity Manager to Your Everyday Problems
Applying Novell Identity Manager to Your Everyday Problems
 
OSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suckOSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suck
 
Common and unique use cases for Apache Hadoop
Common and unique use cases for Apache HadoopCommon and unique use cases for Apache Hadoop
Common and unique use cases for Apache Hadoop
 
Tim Mackinnon Agile And Beyond
Tim Mackinnon Agile And BeyondTim Mackinnon Agile And Beyond
Tim Mackinnon Agile And Beyond
 

Viewers also liked

Grow Revenue with Big Data
Grow Revenue with Big DataGrow Revenue with Big Data
Grow Revenue with Big Data
Lattice Engines
 
Big_data for marketing and sales
Big_data for marketing and salesBig_data for marketing and sales
Big_data for marketing and sales
CMR WORLD TECH
 
Oracle's BigData solutions
Oracle's BigData solutionsOracle's BigData solutions
Oracle's BigData solutions
Swiss Big Data User Group
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
James Serra
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutions
solarisyougood
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionTop 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data Solution
DataStax
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
Nasrin Hussain
 

Viewers also liked (7)

Grow Revenue with Big Data
Grow Revenue with Big DataGrow Revenue with Big Data
Grow Revenue with Big Data
 
Big_data for marketing and sales
Big_data for marketing and salesBig_data for marketing and sales
Big_data for marketing and sales
 
Oracle's BigData solutions
Oracle's BigData solutionsOracle's BigData solutions
Oracle's BigData solutions
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutions
 
Top 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data SolutionTop 5 Considerations for a Big Data Solution
Top 5 Considerations for a Big Data Solution
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

Similar to BranchReduce Distributed Branch-and-Bound on YARN

Machine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and FutureMachine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and Future
Data Science London
 
Building Blocks for Private and Hybrid Clouds
Building Blocks for Private and Hybrid CloudsBuilding Blocks for Private and Hybrid Clouds
Building Blocks for Private and Hybrid Clouds
RightScale
 
Hadoop and Machine Learning
Hadoop and Machine LearningHadoop and Machine Learning
Hadoop and Machine Learning
joshwills
 
Machine Learning and Hadoop: Present and future
Machine Learning and Hadoop: Present and futureMachine Learning and Hadoop: Present and future
Machine Learning and Hadoop: Present and future
Cloudera, Inc.
 
Rightscale Webinar: Building Blocks for Private and Hybrid Clouds
Rightscale Webinar: Building Blocks for Private and Hybrid CloudsRightscale Webinar: Building Blocks for Private and Hybrid Clouds
Rightscale Webinar: Building Blocks for Private and Hybrid Clouds
RightScale
 
VMware - Snapshot sessions - Deploy and manage tomorrow's applications today
VMware - Snapshot sessions  - Deploy and manage tomorrow's applications todayVMware - Snapshot sessions  - Deploy and manage tomorrow's applications today
VMware - Snapshot sessions - Deploy and manage tomorrow's applications today
AnnSteyaert_vmware
 
Dell OpenStack Powered Cloud Solution and Case Sharing
Dell OpenStack Powered Cloud Solution and Case SharingDell OpenStack Powered Cloud Solution and Case Sharing
Dell OpenStack Powered Cloud Solution and Case Sharing
Hui Cheng
 
VA Smalltalk Update
VA Smalltalk UpdateVA Smalltalk Update
VA Smalltalk Update
ESUG
 
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql ClusterSanto Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto
 
Keynote -henry xu--dell open stack powered cloud solution and case sharing-
Keynote -henry xu--dell open stack powered cloud solution and case sharing-Keynote -henry xu--dell open stack powered cloud solution and case sharing-
Keynote -henry xu--dell open stack powered cloud solution and case sharing-
OpenCity Community
 
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Extending The Value Of Oracle Crm On Demand Through Cloud Based ExtensibilityExtending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Jerome Leonard
 
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
JAX London
 
Yarns about YARN: Migrating to MapReduce v2
Yarns about YARN: Migrating to MapReduce v2Yarns about YARN: Migrating to MapReduce v2
Yarns about YARN: Migrating to MapReduce v2
DataWorks Summit
 
YARN
YARNYARN
Open stack powered_cloud_solution_interop
Open stack powered_cloud_solution_interopOpen stack powered_cloud_solution_interop
Open stack powered_cloud_solution_interop
Kamesh Pemmaraju
 
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Cloudera, Inc.
 
Houston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Houston Hadoop Meetup Presentation by Vikram Oberoi of ClouderaHouston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Houston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Mark Kerzner
 
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
Arun Gupta
 
CARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenterCARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenter
cara4oraclewebcenter
 
Oracle my sql cluster cge
Oracle my sql cluster cgeOracle my sql cluster cge
Oracle my sql cluster cge
seungdon1
 

Similar to BranchReduce Distributed Branch-and-Bound on YARN (20)

Machine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and FutureMachine Learning and Hadoop: Present and Future
Machine Learning and Hadoop: Present and Future
 
Building Blocks for Private and Hybrid Clouds
Building Blocks for Private and Hybrid CloudsBuilding Blocks for Private and Hybrid Clouds
Building Blocks for Private and Hybrid Clouds
 
Hadoop and Machine Learning
Hadoop and Machine LearningHadoop and Machine Learning
Hadoop and Machine Learning
 
Machine Learning and Hadoop: Present and future
Machine Learning and Hadoop: Present and futureMachine Learning and Hadoop: Present and future
Machine Learning and Hadoop: Present and future
 
Rightscale Webinar: Building Blocks for Private and Hybrid Clouds
Rightscale Webinar: Building Blocks for Private and Hybrid CloudsRightscale Webinar: Building Blocks for Private and Hybrid Clouds
Rightscale Webinar: Building Blocks for Private and Hybrid Clouds
 
VMware - Snapshot sessions - Deploy and manage tomorrow's applications today
VMware - Snapshot sessions  - Deploy and manage tomorrow's applications todayVMware - Snapshot sessions  - Deploy and manage tomorrow's applications today
VMware - Snapshot sessions - Deploy and manage tomorrow's applications today
 
Dell OpenStack Powered Cloud Solution and Case Sharing
Dell OpenStack Powered Cloud Solution and Case SharingDell OpenStack Powered Cloud Solution and Case Sharing
Dell OpenStack Powered Cloud Solution and Case Sharing
 
VA Smalltalk Update
VA Smalltalk UpdateVA Smalltalk Update
VA Smalltalk Update
 
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql ClusterSanto Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
Santo Leto - MySQL Connect 2012 - Getting Started with Mysql Cluster
 
Keynote -henry xu--dell open stack powered cloud solution and case sharing-
Keynote -henry xu--dell open stack powered cloud solution and case sharing-Keynote -henry xu--dell open stack powered cloud solution and case sharing-
Keynote -henry xu--dell open stack powered cloud solution and case sharing-
 
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Extending The Value Of Oracle Crm On Demand Through Cloud Based ExtensibilityExtending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
Extending The Value Of Oracle Crm On Demand Through Cloud Based Extensibility
 
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
Achieving genuine elastic multitenancy with the Waratek Cloud VM for Java : J...
 
Yarns about YARN: Migrating to MapReduce v2
Yarns about YARN: Migrating to MapReduce v2Yarns about YARN: Migrating to MapReduce v2
Yarns about YARN: Migrating to MapReduce v2
 
YARN
YARNYARN
YARN
 
Open stack powered_cloud_solution_interop
Open stack powered_cloud_solution_interopOpen stack powered_cloud_solution_interop
Open stack powered_cloud_solution_interop
 
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
Strata + Hadoop World 2012: High Availability for the HDFS NameNode Phase 2
 
Houston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Houston Hadoop Meetup Presentation by Vikram Oberoi of ClouderaHouston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
Houston Hadoop Meetup Presentation by Vikram Oberoi of Cloudera
 
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
WebLogic 12c Developer Deep Dive at Oracle Develop India 2012
 
CARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenterCARA User Interface for Oracle WebCenter
CARA User Interface for Oracle WebCenter
 
Oracle my sql cluster cge
Oracle my sql cluster cgeOracle my sql cluster cge
Oracle my sql cluster cge
 

More from DataWorks Summit

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
DataWorks Summit
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
DataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
Pixlogix Infotech
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Zilliz
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 

Recently uploaded (20)

Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 

BranchReduce Distributed Branch-and-Bound on YARN

  • 1. BranchReduce   Distributed  Branch-­‐and-­‐Bound  on  YARN   June  14,  2012    
  • 2. About  Me   Copyright  2012  Cloudera  Inc.  All  rights  reserved   2  
  • 3. Hadoop  Distributed  Processing  Frameworks   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 4. Lots  of  Other  Parallel  Processing  PlaIorms   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 5. Hadoop  2.0:  Resource  Scheduling  with  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 6. The  Data  Deluge  and  the  Cambrian  Explosion   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 7. Parallel  Distributed  Processing  For  Everyone   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 8. Building  a  New  Processing  Framework  on  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 9. A  Terrifyingly  Accurate  Paraphrasing  of  JWZ       Some  people,  when  confronted  with  a  tedious   problem,  say,  “I  know,  I’ll  write  a   framework.”  Now  they  have  two  tedious   problems.   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 10. On  Designing  Frameworks   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 11. The  Example  YARN  App:  Distributed  Shell   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 12. Do  We  Need  a  New  Programming  Language  for   Developing  YARN  ApplicaUons?   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 13. Do  We  Need  a  New  Programming  Language  for   Developing  YARN  ApplicaUons?   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 14. Leverage  ExisUng  Frameworks   •  Popular  RPC  libraries   with  support  for   mul@ple  languages   •  C++,  Java,  Python   •  We  need  to  make  it   easy  to  deploy  exisUng   applicaUons  on  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 15. KiXen:  Playing  with  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 16. Design  PaXern:  The  Unified  ApplicaUon  Master   •  Contains  business  logic   and  YARN  logic   •  Primary  reason:   Communica@on   •  Also:  dynamic  resource   alloca@on   •  Develop  our  master/ worker  applicaUons   locally  and  then  deploy   them  on  YARN   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 17. YARN  Lifecycle  Management  as  a  Service   •  Specifically,  extensions   of  Guava’s  Service   interface   •  YarnClientService   •  AppMasterService   •  Contains  all  of  the  logic   for  crea@ng  applica@ons   and  keeping  an  eye  on   them   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 18. Moving  the  ConfiguraUon  Logic  Out  of  Java   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 19. Lua  as  a  ConfiguraUon  Language   •  Small  and  Simple   •  Looks  like  a   configura@on  file   •  Func@ons  are  there   when/if  you  need  them   •  Inheritance   •  Don’t  Repeat  Yourself   •  Forgiving  of  undefined   values   •  Java/C++  Integra@on   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 20. First  KiXen  UUlity:  The  cat  FuncUon   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 21. Second  KiXen  UUlity:  The  yarn  FuncUon   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 22. BranchReduce   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 23. Branch-­‐and-­‐Bound   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 24. The  Challenge  of  Parallel  Branch  and  Bound:   Unbalanced  Search  Space   •  Some  branches  are   pruned  quickly   •  Can  be  difficult  to   determine  the  best   splits  a  priori   •  Easy  to  revert  to  a  de   facto  single-­‐threaded   search   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 25. The  SoluUon:  Work  Stealing   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 26. You  Write  Three  Classes   •  A  Task  class  that  implements  Writable   •  A  GlobalState  class  that  implements  Writable  and  has  a   mergeWith(GlobalState  other)  method   •  A  Processor  class  that  defines:   •  execute(T  task,  BranchReduceContext<T,  GlobalState>  ctxt);   •  With  op@onal  iniUalize  and  cleanup  methods   •  Configura@on  is  done  via  BranchReduceJob   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 27. Example:  The  Knapsack  Problem   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 28. 0-­‐1  Integer  Programming  Problems   •  NP-­‐Hard  Resource   Alloca@on  Problem   •  Por_olio  Op@miza@on   •  Asset  Securi@za@on   Copyright  2012  Cloudera  Inc.  All  rights  reserved  
  • 29. Problem  FormulaUon:  (Simplified)  LP  Format   Copyright  2012  Cloudera  Inc.  All  rights  reserved