Your SlideShare is downloading. ×
3. ami big data hadoop on ucs seminar may 2013
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

3. ami big data hadoop on ucs seminar may 2013

584
views

Published on

Published in: Technology, News & Politics

0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
584
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
20
Comments
0
Likes
1
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Cisco Confidential NDA Required. 1© 2013 Cisco and/or its affiliates. All rights reserved.Designing HadoopInfrastructure with CiscoData Center Solutions,Blueprint for Success.AmiBen-Amram,amib@cisco.comDataCenterArchitectureLeader,Cisco
  • 2. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 2Massively ParallelProcessing; RDBMSfor EDWUnstructuredKey-Value StoreDatabaseDocument DatabaseApache Opens Source ProjectManage and Process Massive Amounts of DataNo SQL MPP DatabasesHadoopCisco has partnered with leading software providers to offer a comprehensiveinfrastructure and management solution to Big Data..
  • 3. 3© 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required.DatabaseNoSQL DatabaseTested and Validated Reference Architectures, Joint engineering LabSolution BundlesTechnical CollateralsApache-Hadoop reengineeredUCS is the exclusive hardwarereferenceSeveral joint engagementsMPP Column storeUCS is exclusive hardware referenceUCS is the only partner platformCommercial, distributed key-valuedatabase.MPP row storeApache-Hadoop software and servicesFew 100 node production cluster (UCSM)CommercialDocument-oriented database
  • 4. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 4
  • 5. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 5Small Flows/Messaging(Heart-beats, Keep-alive, delay sensitiveapplication messaging)Small – Medium Incast(Hadoop Shuffle, Scatter-Gather, Distributed Storage)Large Flows(HDFS Insert, File Copy)Large Incast(Hadoop Replication, Distributed Storage)
  • 6. 6Many-to-Many Traffic PatternMap 1 Map 2 Map NMap 3Reducer 1 Reducer 2 Reducer 3 Reducer NHDFSShuffleOutputReplicationNameNodeJobTrackerZooKeeper
  • 7. AnalyzeSimulated withShakespeareWordcount[ 10s-20s Mgbps]Extract Transform Load(ETL)Simulated withYahoo TeraSort[ Larger than 1 Gbps]Extract Transform Load(ETL)Simulated withYahoo TeraSort with outputreplication[ 2 – 4 Gbps]Job Patterns have varying impact on network utilizationJob Pattern - network graph of data coming into one node.
  • 8. 8Map 1 Map 2 Map NMap 3Reducer1Reducer2Reducer3ReducerNHDFSShuffleOutputReplicationRegionServerRegionServerClient ClientMajorCompactionReadReadReadUpdateUpdateReadMajorCompaction
  • 9. 9Hbase During Major Compaction.0100020003000400050006000700080009000Latency(us)TimeUPDATE - Average Latency (us) READ - Average Latency (us) QoS - UPDATE - Average Latency (us) QoS - READ - Average Latency (us)Read/UpdateLatencyComparison ofNon-QoS vs. QoSPolicy~45% for ReadImprovementSwitch BufferUsageWith NetworkQoS Policy toprioritize HbaseUpdate/ReadOperationsevery 24 hours HBase wakes up and has this stampede of elephants that does thismassive push into HDFS.
  • 10. 10© 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required.Validated 96 Node Hadoop Cluster• NetworkThree Racks each with 32 nodesDistribution Layer – Nexus 7000 orNexus 5000ToR – FEX or Nexus 30002 FEX per RackEach Rack with either 32 single ordual attached host• Hadoop FrameworkApache 0.20.2Linux 6.2Slots – 10 Maps & 2 Reducers per node• Compute – UCS C200 M2Cores: 12Processor: 2 x Intel(R) Xeon(R)CPU X5670 @ 2.93GHzDisk: 4 x 2TB (7.2K RPM)Network: 1G: LOM, 10G: Cisco UCS P81EName NodeCisco UCS C200Single NIC2248TP-ENexus 5548 Nexus 5548Data Nodes 1 – 48Cisco UCS C 200 Single NIC…Data Nodes 49- 96Cisco UCS 200 Single NIC…Traditional DC Design Nexus 55xx/22482248TP-EName NodeCisco UCS C 200Single NICNexus 7000 Nexus 7000Data Nodes 1 – 48Cisco UCS C 200 Single NIC…Data Nodes 49 - 96Cisco UCS C 200 Single NIC…Nexus 3000Nexus 3000Nexus 7K-N3K based Topology
  • 11. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 11
  • 12. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 12• companies are often challenged by the complexities of traditionalserver solutions.• Big data solutions must enable high performance and scale as thebusiness demands.• To meet these requirements Cisco designed a comprehensivesolution: Cisco® Common Platform Architecture (CPA) for BigData.• Cisco CPA for Big Data includes compute, storage, connectivity,and unified management features that enable rapid deployment,predictable performance, and reduced total cost of ownership(TCO).• In addition to these benefits, Cisco CPA for Big Data offers uniquedata and management integration with enterprise applicationshosted on the Cisco Unified Computing System™ (Cisco UCS®)
  • 13. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 1313TECHNICAL LEADERSHIP MARKET MOMENTUM• Unified Infrastructure• Management Automation• Design Flexibility• Optimize for virtualization.• Best Cloud Infrastructure• 61 industry benchmark worldrecords• $2 billion revenue run rate• 20,000 customers: almost50% of Fortune 500• #2 US blade server marketshare by revenue• #3 WW blade server marketshare by revenue• More than 200 customers inIsrael.
  • 14. © 2012 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 14UCS 6200 SeriesFabric Internments:High speed connectivity andmanagement, integrationwith enterprise applicationon bladesNexus 2232 Fabric Extenders:Scalability at lower costUCS ManagerUCS 240 M3 Servers:Compute, storageLAN, SAN, ManagementBuilding BlocksCisco Big Data Common Platform (CPA) is a highly scalable architecturedesigned to meet variety of scale-out application demandsUCS Central
  • 15. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 15Big DataHigh PerformanceRackUCS-EZ-BD-HP(2) UCS 96-Port 6296Fabric Interconnect(2) Nexus 2232 PP(16) UCS C240 M3 Serversw/ dual Intel Xeon E5-26652.4 GHz Processors, 256GBof Memory, 1 x Mega RAID9266-CV-8i Card, 24 x 1TB7.2K SATA HDDsMPPHigh PerformanceHalf-RackUCS-EZ-BD-MPP(2) UCS 96-Port 6248 FabricInterconnect(2) Nexus 2232 PP(8) UCS C240 M3 Servers w/dual Intel Xeon E5-2690 2.9 GHzProcessors, 256GB of Memory, 1x Mega RAID 9266-CV-8i Card,24 x 600GB 10K SAS HDDsSolution BundlesBig DataHigh CapacityRackUCS-EZ-BD-HC(2) UCS 96-Port 6296Fabric Interconnect(2) Nexus 2232 PP(16) UCS C240 M3Servers w/ dual Intel XeonE5-2640 2.5 GHzProcessors, 128GB ofMemory, 1 x Mega RAID9266-CV-8i Card, 12 x3TB 7.2K SAS HDDsStorage Density Optimized;Low $/TB (under $500/TB)Balanced Compute and IO Bandwidth;Price-Performance OptimizedHigh Performance Compute and IOBandwidth and IOPS (under $10K/GBPS)Optimized for Cost, Tested and Validated for Performance and Rapid DeploymentsAdditional Racks2 x N2K-UCS2232PF16 x UCS-EZ-C240-2665Additional Racks2 x N2K-UCS2232PF16 x UCS-EZ-C240-2640Additional ServersUCS-EZ-C240-2690
  • 16. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 16PerformanceOptimized (SAS)CapacityOptimized (SAS)Price-PerformanceOptimized (SATA)C240 M3 (SFF) C240 M3 (SFF) C240 M3 (LFF) C220 M3 (SFF)RU 2 2 2 2CPU E5-2690 E5-2665 E5-2640 E5-2680Cores 16 16 12 16Memory 256GB 256GB 128GB 256GBDisk Drives24 x (300 GB 15K,600GB 10K,900GB 15K)24 x 1TB 7.2K 12 x 3TB 7.2K ExternalCompute UnitsNOSHCompute
  • 17. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 17Differentiation 0:Big Data Benefits• Unified Management - UCS Manager• Unified Fabric - “Single Wire Management”• Seamless management integration and data integration• Direct SAN access6200Fabric A6200Fabric BB200 B200CNAFEXACNA CNAFEXBFEXAFEXBSAN A SAN BETH 1 ETH 2MGMTMGMTChassis 1 Chassis 2Fabric SwitchFabric ExtendersUplinkPortsCompute BladesHalf/Full WidthOOBMgmtServer PortsVirtualized Adapters6200Fabric A6200Fabric BB200 BladeCNAFEXBFEXASAN A SAN BETH 1 ETH 2MGMTMGMTChassis 1FabricSwitchFabric ExtendersUplinkPortsCompute BladesHalf/Full WidthOOBMgmtServer PortsVirtualized AdaptersC240 RackMountCNAFEX A FEX B
  • 18. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 18Big Data• Dozens to 100s of severs are typical• 20–50% annual growthUCSM Enables• Global view of the cluster• Proactive monitoring of health• 1 Click software bios and firmware upgrades• 1 Click bios setting• 1 Click tunables like jumbo framesUCS Central Enables• Scaling to large cluster• Application isolationUnified ManagementA Single Unified SystemFor Blade and Rack ServersC-Series RackOptimized ServersDifferentiation 1:
  • 19. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 19Big Data Benefits• Optimized service profile template for CPA enable quick and consistentdeployments• One click power shell script to configure CPA.LANSAN•RAID settings•Disk scrub actions•Number of vHBAs•HBA WWN assignments•FC Boot Parameters•HBA firmware•FC Fabric assignments forHBAs•QoS settings•Border port assignment pervNIC•NIC Transmit/Receive RateLimiting•VLAN assignments for NICs•VLAN tagging config for NICs•Number of vNICs•PXE settings•NIC firmware•Advanced feature settings•Remote KVM IP settings•Call Home behavior•Remote KVM firmware•Server UUID•Serial over LAN settings•Boot order•IPMI settings•BIOS scrub actions•BIOS firmware•BIOS SettingsLANSANTraditional UCS Service ProfileDifferentiation 2:
  • 20. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 20Big Data Benefits• “Single Wire Management”• Fully redundant active-active fabric cluster interconnect• Can be configured for direct SAN accessTraditional Unified Fabric10 GE EthernetCisco VIC Technology66% Less Switch Ports and CablesDifferentiation 3:
  • 21. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 21Data Center Applications Big Data ApplicationsUnified FabricUnified ManagementIntegratedDataManagementData Integration Using ConnectorsData FeedsCisco Big Data Common PlatformArchitectureUsing C-Series Rack-Mount ServersCisco UCS B-SeriesBlade ServersSANArrayCisco UCS Big Data Common Platform Architecture:Extending Enterprise Application Ecosystem to Big DataHadoopNoSQLMPP DatabaseRNDifferentiation 4:
  • 22. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 22No Additional Switching for up to 10 Racks (160 Servers)10,000 using UCS CentralExample Configuration:Servers Per Domain(Pair of Fabric Interconnects)North-Bound Bandwidth(GBits/sec)Any Node to Any Node Bandwidth(GBits/sec)160 320 10144 480 10128 640 10Differentiation 5:
  • 23. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 23
  • 24. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 24• Tested and Validated Reference Architectures• Joint engineering Lab• Solution Bundles• Technical CollateralsNoSQL DatabaseNo SQL MPP DatabasesHadoop
  • 25. workload automation facilitates the flow of datacostsTwitterFeedsMap ReduceHiveBI AnalyticsSQLSqoopMap ReduceMap ReduceCall logsWeb ClicksGather Data Data IntegrationLoad Data Data AnalysisReport Generationand DistributionWeb ServicesSSHDB/JDBCERP/CRMData MoverSqoopMapReduceInformaticaHiveSqoopInformaticaBusiness ObjectsCognosWeb Services
  • 26. © 2012 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 26ReportsDashboardsAnalyticsOLAPAlertsERP ApplicationsDBCRM ApplicationsDBDWDWERP/CRM Apps& DatabasesData ExchangeSystem(s)ETL/DW/Big Data/BISystems & ApplicationsManages Enterprise WorkloadsDWDataIntegrationBusiness IntelligenceApplication(s)File Drop BoxFTP/SFTP/FTPSSaas, AWSFTP ServerDBAPI Feeds(Twitter, FB, LI etc)BigData
  • 27. © 2012 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 27DataAcquisitionData load Analysis ofSales DataExport toEnterpriseGenerateReport112233 44
  • 28. © 2012 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 28IntegratedCisco UCSServerManagementIntegratedNetworkManagement w/FabricInterconnectand NexusSwitchesIntegratedDataManagementCisco UCSB-seriesCisco UCSC-seriesw/ Direct AttachStorageData Center Applications Big Data ApplicationsCisco Workload Automation Delivers AutomatedBusiness Processing Abstraction LayerData FeedsBig DataJobsData Center ApplicationsAutomated Backup andStorageIn/out of Big Data GridsRapid error free deployment – service profileMaintenance activities like BIOS, FW upgrade across the clusterMonitoring the health, powerSeamless data movement
  • 29. Thank you.
  • 30. © 2013 Cisco and/or its affiliates. All rights reserved. Cisco Confidential NDA Required. 30• Hadoop has many building blocks…At the core it is an architecture tostore and process unstructured and semi-structured data…Hadoop Distributed File System(HDFS)At the base is aSelf-healingclustered storagesystem.Map-ReduceDistributed DataProcessingPIG Hive SqoopTop levelabstractionsTop levelInterfaces ETL ToolsBIReportingRDBMSHBASEDatabase withReal-timeaccess
  • 31. © 2012 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 31Extreme PerformanceOptimized for fast queryexecution and unmatcheddata loadingElastic ScalabilityExpand capacity andperformanceHighly AvailableFully redundant andreliable configurationUnified NetworkingConverged data andmanagement planenetworkingRapidly DeployablePre-validated configuration,rapid deployment viaservice profilesUnified ManagementPower of UCS Managerto manage the compute,networking, I/OIndustry LeadingPartnershipsJoint solutions with majorsoftware playersEnterprise ApplicationIntegrationSeamless integrationwith enterpriseapplications on blades

×