SlideShare a Scribd company logo
1 of 33
1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Taming the Elephant:
Efficient and Effective
Apache Hadoop
Management
Paul Codding
2016 Hadoop Summit Dublin, Ireland
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Presenters
Paul Codding
Senior Product Manager, Cloud & Operations
Apache Ambari, SmartSense
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
 Introduction
 Observations & Recommendations
– Observations from analyzing ~1000 customer bundles
– Common operational mistakes
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Introduction (Like 2 minutes)
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
SmartSense Architecture
L A N D I N G Z O N E
S E RV E R
G AT E WAY
A M B A R I
A G E N T A G E N T
A G E N TA G E N TA G E N T
A G E N T
B U N D L E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
WO R K E R
N O D E
S m a r t S e n s e
A n a l y t i c s
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Introduction
Obligatory Poll
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Introduction
Obligatory Poll
Observations & Recommendations
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
EVERY node counts…
Common difficult to diagnose issues
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Operation System Configuration: Locale
 /etc/localtime – Dictates which timezone your machine & the JDK thinks it’s in
 Hive
– unix_timestamp(…)
– current_date()
SELECT sum(amount) from sales
where sale_date >
unix_timestamp('2016-03-01 00:00:00')
“default timezone and the default locale”
Inconsistent Locale Configuration
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Operating System Configuration: Transparent Huge Pages (THP)
 THP is an abstraction layer that automates creating, managing, and using huge pages
 Pages == memory managed in blocks by the Linux Kernel
 Huge pages are pages that come in larger sizes 2MB-1GB.
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Operating System Configuration: NSCD/SSSD
 Name Service Cache Daemon
– getpwnam
– getpwuid
– getgrnam
– getgrid
– gethostbyname
cp10005.xxxxxx.com:1
cp10006.xxxxxx.com:5
cp10007.xxxxxx.com:1
cp10008.xxxxxx.com:0
cp10009.xxxxxx.com:1
cp10010.xxxxxx.com:3
cp10011.xxxxxx.com:0
cp10012.xxxxxx.com:1
cp10013.xxxxxx.com:0
cp10014.xxxxxx.com:2
cp10015.xxxxxx.com:0
cp10005.xxxxxx.com:0
cp10006.xxxxxx.com:0
cp10007.xxxxxx.com:0
cp10008.xxxxxx.com:0
cp10009.xxxxxx.com:0
cp10010.xxxxxx.com:0
cp10011.xxxxxx.com:0
cp10012.xxxxxx.com:0
cp10013.xxxxxx.com:0
cp10014.xxxxxx.com:0
cp10015.xxxxxx.com:0
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Operating System Configuration: NTPD
 Network Time Protocol daemon
2016-03-31 18:40:28,585 FATAL [regionserver/ip-10-0-x-x.ec2.internal/10.0.x.x:16020]
regionserver.HRegionServer: Master rejected startup because clock is out of
syncorg.apache.hadoop.hbase.ClockOutOfSyncException:
org.apache.hadoop.hbase.ClockOutOfSyncException: Server ip-10-0-x-
x.ec2.internal,16020,1459449626477 has been rejected; Reported time is too far out of sync
with master. Time difference of 74097ms > max allowed of 30000ms
$ kinit -kt
/etc/security/keytabs/hdfs.headless.keytab
hdfs-HDP1@HORTONWORKS.LOCAL
kinit: Clock skew too great while getting
initial credentials
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Operating System: Legacy Kernel Issues
 Specific NIC’s & Kernel Versions
– Broadcom bnx2x module prior to RHEL 5.7 (kernel earlier than 2.6.18-274.el5)
– QLogic NetXen netxen_nic module prior to RHEL 5.9 (kernel earlier than 2.6.18-348.el5)
– Intel 10Gbps ixgbe module prior to RHEL 6.4 (kernel earlier than 2.6.32-358.el6)
– Intel 10Gbps ixgbe module from RHEL 5.6 (kernel version 2.6.18-238.el5 and later)
 Symptoms
– NFS transfers over 10Gbps links are only transferring at 100MiB/sec (i.e. 1Gbps)
– TCP connections never reach anywhere near wirespeed
– TCP Window size reduced 720 bytesnic.generic-receive-offload
 Workaround
– nic.large-receive-offload
– nic.generic-receive-offload RHEL Knowledgebase Solution: 20278
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
The Core Counts
HDFS & YARN
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS
NameNode Configuration
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS: NameNode Group Mapping Lookup Implementations
 org.apache.hadoop.security.ShellBasedUnixGroupsMapping
 org.apache.hadoop.security.LdapGroupsMapping
 org.apache.hadoop.security.CompositeGroupsMapping
 org.apache.hadoop.security.JniBasedUnixGroupsMappingWithFallback
hadoop.security.group.mapping
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS: NameNode Metadata Directories
 Multiple Entries – Each directory gets a replica of the fsimage data
 Very common “second directory” is an NFS Mount
 soft mount vs hard mount
dfs.namenode.name.dir
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS: NameNode Handler Count
 Math.log(${currentDataNodeCount}) * 20
 10 node cluster – 46
 100 node cluster – 92
 1000 node cluster - 138
dfs.namenode.handler.count
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS: HA Retry Policy
 When primary NameNode is killed, clients can retry for up to 10 minutes instead of
failing over
dfs.client.retry.policy.enabled = true
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS
DataNode Configuration
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS: DataNode Failed Volumes
 dmesg
 smartctl
dfs.datanode.failed.volumes.tolerated
ata1.00: failed to IDENTIFY (I/O error, err_mask=0x4)
=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours)
LBA_of_first_error
# 1 Short offline Completed: read failure 20% 717
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
HDFS: DataNode
 Default: 4096
 Increase depends on other services deployed in the cluster and workload type
dfs.datanode.max.transfer.threads
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
YARN
Resource Manager Configuration
24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
YARN: ResourceManager Min/Max Container Size Allocation
yarn.scheduler.minimum-allocation-mb & yarn.scheduler.maximum-allocation-mb
25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
YARN: NodeManager Memory
yarn.nodemanager.resource.memory-mb
RAM
Operating System
DataNode
Region Server
NodeManager
26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
YARN: NodeManager Local Directories
yarn.nodemanager.local-dirs
27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
YARN ATS: Rolling LevelDB Timeline store
 org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore
 org.apache.hadoop.yarn.server.timeline.LeveldbTimelineStore
 org.apache.hadoop.yarn.server.timeline.RollingLevelDBTimelineStore
yarn.timeline-service.store-class
28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
YARN ATS: TTL
yarn.timeline-service.ttl-enable & yarn.timeline-service.ttl-ms
29 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Introduction
Obligatory Poll
Observations & Recommendations
Summary
30 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
SmartSense Recommendations
 We’ve covered 16 of ~250 rules
 Built into Support Case close/Sev1 postmortem process
 Onramp into core products and Apache Ambari
– Stack Advisor
– New Defaults
– New Alerts
hbase_tcp_nodelay
hdfs_check_point_period
hdfs_dn_suboptimal_mounts
hdfs_dn_volume_tolerance
hdfs_enable_security_check
hdfs_mount_options
hdfs_nn_checkpoint_txns
hdfs_nn_handler_count
hdfs_nn_protect_imp_dirs
hdfs_nn_soft_mount
hdfs_nn_super_user_group
hdfs_short_circuit
hive_enable_cbo
hive_vectorized_exec
jvm_optsmr_min_split_size
mr_reduce_parallel_copies
mr_slow_start
os_cpu_scaling
os_ssd_tuning
tez_enable_reuse
tez_session_release_delay
tez_shuffle_buffer
yarn_ats_security
yarn_nm_black_listed_mount_logdir
31 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
All Bundles are:
• Encrypted and Anonymized by default
Configurable options to:
• Exclude properties within specific Hadoop configuration files
• Global REGEX replacements across all configuration, metrics, and logs
By default:
• Ambari clear text passwords are not collected
• Hive and Oozie database properties are not collected
• All IP addresses and host names are anonymized
Bundle Security
32 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
SmartSense Stack Support
HDP 2.4 HDP 2.3 HDP 2.2 HDP 2.1 HDP 2.0
SmartSense 1.x
Ambari 2.2
Built-In!
Ambari 2.1
Plug-In
Ambari 2.0
Plug-In
Ambari 1.7 Ambari 1.6
SmartSense 1.x
33 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Questions?

More Related Content

What's hot

HDFS Tiered Storage: Mounting Object Stores in HDFS
HDFS Tiered Storage: Mounting Object Stores in HDFSHDFS Tiered Storage: Mounting Object Stores in HDFS
HDFS Tiered Storage: Mounting Object Stores in HDFSDataWorks Summit
 
Storage and-compute-hdfs-map reduce
Storage and-compute-hdfs-map reduceStorage and-compute-hdfs-map reduce
Storage and-compute-hdfs-map reduceChris Nauroth
 
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...DataWorks Summit/Hadoop Summit
 
Managing Hadoop, HBase and Storm Clusters at Yahoo Scale
Managing Hadoop, HBase and Storm Clusters at Yahoo ScaleManaging Hadoop, HBase and Storm Clusters at Yahoo Scale
Managing Hadoop, HBase and Storm Clusters at Yahoo ScaleDataWorks Summit/Hadoop Summit
 
Hadoop Operations - Best Practices from the Field
Hadoop Operations - Best Practices from the FieldHadoop Operations - Best Practices from the Field
Hadoop Operations - Best Practices from the FieldDataWorks Summit
 
HBase Read High Availability Using Timeline-Consistent Region Replicas
HBase Read High Availability Using Timeline-Consistent Region ReplicasHBase Read High Availability Using Timeline-Consistent Region Replicas
HBase Read High Availability Using Timeline-Consistent Region ReplicasHBaseCon
 
Operating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsOperating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsDataWorks Summit/Hadoop Summit
 
Evolving HDFS to a Generalized Distributed Storage Subsystem
Evolving HDFS to a Generalized Distributed Storage SubsystemEvolving HDFS to a Generalized Distributed Storage Subsystem
Evolving HDFS to a Generalized Distributed Storage SubsystemDataWorks Summit/Hadoop Summit
 
HBaseCon 2015: HBase and Spark
HBaseCon 2015: HBase and SparkHBaseCon 2015: HBase and Spark
HBaseCon 2015: HBase and SparkHBaseCon
 
Large-scale Web Apps @ Pinterest
Large-scale Web Apps @ PinterestLarge-scale Web Apps @ Pinterest
Large-scale Web Apps @ PinterestHBaseCon
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Flexible and Real-Time Stream Processing with Apache Flink
Flexible and Real-Time Stream Processing with Apache FlinkFlexible and Real-Time Stream Processing with Apache Flink
Flexible and Real-Time Stream Processing with Apache FlinkDataWorks Summit
 
A Survey of HBase Application Archetypes
A Survey of HBase Application ArchetypesA Survey of HBase Application Archetypes
A Survey of HBase Application ArchetypesHBaseCon
 

What's hot (20)

LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
HDFS Tiered Storage: Mounting Object Stores in HDFS
HDFS Tiered Storage: Mounting Object Stores in HDFSHDFS Tiered Storage: Mounting Object Stores in HDFS
HDFS Tiered Storage: Mounting Object Stores in HDFS
 
Evolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage SubsystemEvolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage Subsystem
 
Achieving 100k Queries per Hour on Hive on Tez
Achieving 100k Queries per Hour on Hive on TezAchieving 100k Queries per Hour on Hive on Tez
Achieving 100k Queries per Hour on Hive on Tez
 
Storage and-compute-hdfs-map reduce
Storage and-compute-hdfs-map reduceStorage and-compute-hdfs-map reduce
Storage and-compute-hdfs-map reduce
 
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
Near Real-Time Network Anomaly Detection and Traffic Analysis using Spark bas...
 
Evolving HDFS to a Generalized Storage Subsystem
Evolving HDFS to a Generalized Storage SubsystemEvolving HDFS to a Generalized Storage Subsystem
Evolving HDFS to a Generalized Storage Subsystem
 
Managing Hadoop, HBase and Storm Clusters at Yahoo Scale
Managing Hadoop, HBase and Storm Clusters at Yahoo ScaleManaging Hadoop, HBase and Storm Clusters at Yahoo Scale
Managing Hadoop, HBase and Storm Clusters at Yahoo Scale
 
Hadoop Operations - Best Practices from the Field
Hadoop Operations - Best Practices from the FieldHadoop Operations - Best Practices from the Field
Hadoop Operations - Best Practices from the Field
 
HBase Read High Availability Using Timeline-Consistent Region Replicas
HBase Read High Availability Using Timeline-Consistent Region ReplicasHBase Read High Availability Using Timeline-Consistent Region Replicas
HBase Read High Availability Using Timeline-Consistent Region Replicas
 
Operating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsOperating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and Improvements
 
Evolving HDFS to a Generalized Distributed Storage Subsystem
Evolving HDFS to a Generalized Distributed Storage SubsystemEvolving HDFS to a Generalized Distributed Storage Subsystem
Evolving HDFS to a Generalized Distributed Storage Subsystem
 
HBaseCon 2015: HBase and Spark
HBaseCon 2015: HBase and SparkHBaseCon 2015: HBase and Spark
HBaseCon 2015: HBase and Spark
 
Empower Data-Driven Organizations with HPE and Hadoop
Empower Data-Driven Organizations with HPE and HadoopEmpower Data-Driven Organizations with HPE and Hadoop
Empower Data-Driven Organizations with HPE and Hadoop
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
 
Large-scale Web Apps @ Pinterest
Large-scale Web Apps @ PinterestLarge-scale Web Apps @ Pinterest
Large-scale Web Apps @ Pinterest
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Flexible and Real-Time Stream Processing with Apache Flink
Flexible and Real-Time Stream Processing with Apache FlinkFlexible and Real-Time Stream Processing with Apache Flink
Flexible and Real-Time Stream Processing with Apache Flink
 
A Survey of HBase Application Archetypes
A Survey of HBase Application ArchetypesA Survey of HBase Application Archetypes
A Survey of HBase Application Archetypes
 
Curb your insecurity with HDP
Curb your insecurity with HDPCurb your insecurity with HDP
Curb your insecurity with HDP
 

Viewers also liked

Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchDataWorks Summit/Hadoop Summit
 
Presentation from physical to virtual to cloud emc
Presentation   from physical to virtual to cloud emcPresentation   from physical to virtual to cloud emc
Presentation from physical to virtual to cloud emcxKinAnx
 
It's not the size of your cluster, it's how you use it
It's not the size of your cluster, it's how you use itIt's not the size of your cluster, it's how you use it
It's not the size of your cluster, it's how you use itDataWorks Summit/Hadoop Summit
 
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...DataWorks Summit/Hadoop Summit
 
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan GatesApache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan GatesBig Data Spain
 
Advanced Hadoop Tuning and Optimization - Hadoop Consulting
Advanced Hadoop Tuning and Optimization - Hadoop ConsultingAdvanced Hadoop Tuning and Optimization - Hadoop Consulting
Advanced Hadoop Tuning and Optimization - Hadoop ConsultingImpetus Technologies
 

Viewers also liked (18)

HDFS: Optimization, Stabilization and Supportability
HDFS: Optimization, Stabilization and SupportabilityHDFS: Optimization, Stabilization and Supportability
HDFS: Optimization, Stabilization and Supportability
 
SmartSense Suite
SmartSense SuiteSmartSense Suite
SmartSense Suite
 
Starting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer ResearchStarting the Hadoop Journey at a Global Leader in Cancer Research
Starting the Hadoop Journey at a Global Leader in Cancer Research
 
Presentation from physical to virtual to cloud emc
Presentation   from physical to virtual to cloud emcPresentation   from physical to virtual to cloud emc
Presentation from physical to virtual to cloud emc
 
It's not the size of your cluster, it's how you use it
It's not the size of your cluster, it's how you use itIt's not the size of your cluster, it's how you use it
It's not the size of your cluster, it's how you use it
 
Tame that Beast
Tame that BeastTame that Beast
Tame that Beast
 
Contributing to Open Source - A Beginners Guide
Contributing to Open Source - A Beginners GuideContributing to Open Source - A Beginners Guide
Contributing to Open Source - A Beginners Guide
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 
Rocking the World of Big Data at Centrica
Rocking the World of Big Data at CentricaRocking the World of Big Data at Centrica
Rocking the World of Big Data at Centrica
 
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
Keep your Hadoop Cluster at its Best
Keep your Hadoop Cluster at its BestKeep your Hadoop Cluster at its Best
Keep your Hadoop Cluster at its Best
 
Powering a Virtual Power Station with Big Data
Powering a Virtual Power Station with Big DataPowering a Virtual Power Station with Big Data
Powering a Virtual Power Station with Big Data
 
Running Spark in Production
Running Spark in ProductionRunning Spark in Production
Running Spark in Production
 
Apache Hive on ACID
Apache Hive on ACIDApache Hive on ACID
Apache Hive on ACID
 
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan GatesApache Hive 2.0 SQL, Speed, Scale by Alan Gates
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
 
On Demand HDP Clusters using Cloudbreak and Ambari
On Demand HDP Clusters using Cloudbreak and AmbariOn Demand HDP Clusters using Cloudbreak and Ambari
On Demand HDP Clusters using Cloudbreak and Ambari
 
Advanced Hadoop Tuning and Optimization - Hadoop Consulting
Advanced Hadoop Tuning and Optimization - Hadoop ConsultingAdvanced Hadoop Tuning and Optimization - Hadoop Consulting
Advanced Hadoop Tuning and Optimization - Hadoop Consulting
 

Similar to Taming the Elephant: Efficient and Effective Apache Hadoop Management

How YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopHow YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopPOSSCON
 
Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...DataWorks Summit
 
Apache Spark and Object Stores
Apache Spark and Object StoresApache Spark and Object Stores
Apache Spark and Object StoresSteve Loughran
 
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionDataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionWangda Tan
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionDataWorks Summit
 
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.02013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0Adam Muise
 
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San Jose
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San JoseCloudy with a chance of Hadoop - DataWorks Summit 2017 San Jose
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San JoseMingliang Liu
 
Cloudy with a chance of Hadoop - real world considerations
Cloudy with a chance of Hadoop - real world considerationsCloudy with a chance of Hadoop - real world considerations
Cloudy with a chance of Hadoop - real world considerationsDataWorks Summit
 
Druid Scaling Realtime Analytics
Druid Scaling Realtime AnalyticsDruid Scaling Realtime Analytics
Druid Scaling Realtime AnalyticsAaron Brooks
 
Improving Hadoop Resiliency and Operational Efficiency with EMC Isilon
Improving Hadoop Resiliency and Operational Efficiency with EMC IsilonImproving Hadoop Resiliency and Operational Efficiency with EMC Isilon
Improving Hadoop Resiliency and Operational Efficiency with EMC IsilonDataWorks Summit/Hadoop Summit
 
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)Abdelkrim Hadjidj
 
Apache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateApache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateDataWorks Summit
 
Operating and supporting HBase Clusters
Operating and supporting HBase ClustersOperating and supporting HBase Clusters
Operating and supporting HBase Clustersenissoz
 
Ambari Management Packs (Apache Ambari Meetup 2018)
Ambari Management Packs (Apache Ambari Meetup 2018)Ambari Management Packs (Apache Ambari Meetup 2018)
Ambari Management Packs (Apache Ambari Meetup 2018)Swapan Shridhar
 
Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseStreamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseHortonworks
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseDataWorks Summit
 
Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017alanfgates
 

Similar to Taming the Elephant: Efficient and Effective Apache Hadoop Management (20)

How YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in HadoopHow YARN Enables Multiple Data Processing Engines in Hadoop
How YARN Enables Multiple Data Processing Engines in Hadoop
 
Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...Dancing elephants - efficiently working with object stores from Apache Spark ...
Dancing elephants - efficiently working with object stores from Apache Spark ...
 
Apache Spark and Object Stores
Apache Spark and Object StoresApache Spark and Object Stores
Apache Spark and Object Stores
 
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The UnionDataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
Dataworks Berlin Summit 18' - Apache hadoop YARN State Of The Union
 
Apache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the unionApache Hadoop YARN: state of the union
Apache Hadoop YARN: state of the union
 
Druid deep dive
Druid deep diveDruid deep dive
Druid deep dive
 
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.02013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
2013 Nov 20 Toronto Hadoop User Group (THUG) - Hadoop 2.2.0
 
Apache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and FutureApache Hadoop YARN: Past, Present and Future
Apache Hadoop YARN: Past, Present and Future
 
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San Jose
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San JoseCloudy with a chance of Hadoop - DataWorks Summit 2017 San Jose
Cloudy with a chance of Hadoop - DataWorks Summit 2017 San Jose
 
Hadoop 3 in a Nutshell
Hadoop 3 in a NutshellHadoop 3 in a Nutshell
Hadoop 3 in a Nutshell
 
Cloudy with a chance of Hadoop - real world considerations
Cloudy with a chance of Hadoop - real world considerationsCloudy with a chance of Hadoop - real world considerations
Cloudy with a chance of Hadoop - real world considerations
 
Druid Scaling Realtime Analytics
Druid Scaling Realtime AnalyticsDruid Scaling Realtime Analytics
Druid Scaling Realtime Analytics
 
Improving Hadoop Resiliency and Operational Efficiency with EMC Isilon
Improving Hadoop Resiliency and Operational Efficiency with EMC IsilonImproving Hadoop Resiliency and Operational Efficiency with EMC Isilon
Improving Hadoop Resiliency and Operational Efficiency with EMC Isilon
 
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
 
Apache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community UpdateApache Hadoop 3.0 Community Update
Apache Hadoop 3.0 Community Update
 
Operating and supporting HBase Clusters
Operating and supporting HBase ClustersOperating and supporting HBase Clusters
Operating and supporting HBase Clusters
 
Ambari Management Packs (Apache Ambari Meetup 2018)
Ambari Management Packs (Apache Ambari Meetup 2018)Ambari Management Packs (Apache Ambari Meetup 2018)
Ambari Management Packs (Apache Ambari Meetup 2018)
 
Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseStreamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
 
An Apache Hive Based Data Warehouse
An Apache Hive Based Data WarehouseAn Apache Hive Based Data Warehouse
An Apache Hive Based Data Warehouse
 
Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017Hive edw-dataworks summit-eu-april-2017
Hive edw-dataworks summit-eu-april-2017
 

More from DataWorks Summit/Hadoop Summit

Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerDataWorks Summit/Hadoop Summit
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformDataWorks Summit/Hadoop Summit
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDataWorks Summit/Hadoop Summit
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...DataWorks Summit/Hadoop Summit
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...DataWorks Summit/Hadoop Summit
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLDataWorks Summit/Hadoop Summit
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)DataWorks Summit/Hadoop Summit
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...DataWorks Summit/Hadoop Summit
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesDataWorks Summit/Hadoop Summit
 

More from DataWorks Summit/Hadoop Summit (20)

Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in ProductionRunning Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
 
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache ZeppelinState of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
 
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Apache Spark Crash Course
Apache Spark Crash CourseApache Spark Crash Course
Apache Spark Crash Course
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
Schema Registry - Set you Data Free
Schema Registry - Set you Data FreeSchema Registry - Set you Data Free
Schema Registry - Set you Data Free
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
 
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
 
HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage SchemesScaling HDFS to Manage Billions of Files with Distributed Storage Schemes
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
 

Recently uploaded

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 

Recently uploaded (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 

Taming the Elephant: Efficient and Effective Apache Hadoop Management

  • 1. 1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Taming the Elephant: Efficient and Effective Apache Hadoop Management Paul Codding 2016 Hadoop Summit Dublin, Ireland
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Presenters Paul Codding Senior Product Manager, Cloud & Operations Apache Ambari, SmartSense
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda  Introduction  Observations & Recommendations – Observations from analyzing ~1000 customer bundles – Common operational mistakes
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Introduction (Like 2 minutes)
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved SmartSense Architecture L A N D I N G Z O N E S E RV E R G AT E WAY A M B A R I A G E N T A G E N T A G E N TA G E N TA G E N T A G E N T B U N D L E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E WO R K E R N O D E S m a r t S e n s e A n a l y t i c s
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Introduction Obligatory Poll
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Introduction Obligatory Poll Observations & Recommendations
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved EVERY node counts… Common difficult to diagnose issues
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Operation System Configuration: Locale  /etc/localtime – Dictates which timezone your machine & the JDK thinks it’s in  Hive – unix_timestamp(…) – current_date() SELECT sum(amount) from sales where sale_date > unix_timestamp('2016-03-01 00:00:00') “default timezone and the default locale” Inconsistent Locale Configuration
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Operating System Configuration: Transparent Huge Pages (THP)  THP is an abstraction layer that automates creating, managing, and using huge pages  Pages == memory managed in blocks by the Linux Kernel  Huge pages are pages that come in larger sizes 2MB-1GB.
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Operating System Configuration: NSCD/SSSD  Name Service Cache Daemon – getpwnam – getpwuid – getgrnam – getgrid – gethostbyname cp10005.xxxxxx.com:1 cp10006.xxxxxx.com:5 cp10007.xxxxxx.com:1 cp10008.xxxxxx.com:0 cp10009.xxxxxx.com:1 cp10010.xxxxxx.com:3 cp10011.xxxxxx.com:0 cp10012.xxxxxx.com:1 cp10013.xxxxxx.com:0 cp10014.xxxxxx.com:2 cp10015.xxxxxx.com:0 cp10005.xxxxxx.com:0 cp10006.xxxxxx.com:0 cp10007.xxxxxx.com:0 cp10008.xxxxxx.com:0 cp10009.xxxxxx.com:0 cp10010.xxxxxx.com:0 cp10011.xxxxxx.com:0 cp10012.xxxxxx.com:0 cp10013.xxxxxx.com:0 cp10014.xxxxxx.com:0 cp10015.xxxxxx.com:0
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Operating System Configuration: NTPD  Network Time Protocol daemon 2016-03-31 18:40:28,585 FATAL [regionserver/ip-10-0-x-x.ec2.internal/10.0.x.x:16020] regionserver.HRegionServer: Master rejected startup because clock is out of syncorg.apache.hadoop.hbase.ClockOutOfSyncException: org.apache.hadoop.hbase.ClockOutOfSyncException: Server ip-10-0-x- x.ec2.internal,16020,1459449626477 has been rejected; Reported time is too far out of sync with master. Time difference of 74097ms > max allowed of 30000ms $ kinit -kt /etc/security/keytabs/hdfs.headless.keytab hdfs-HDP1@HORTONWORKS.LOCAL kinit: Clock skew too great while getting initial credentials
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Operating System: Legacy Kernel Issues  Specific NIC’s & Kernel Versions – Broadcom bnx2x module prior to RHEL 5.7 (kernel earlier than 2.6.18-274.el5) – QLogic NetXen netxen_nic module prior to RHEL 5.9 (kernel earlier than 2.6.18-348.el5) – Intel 10Gbps ixgbe module prior to RHEL 6.4 (kernel earlier than 2.6.32-358.el6) – Intel 10Gbps ixgbe module from RHEL 5.6 (kernel version 2.6.18-238.el5 and later)  Symptoms – NFS transfers over 10Gbps links are only transferring at 100MiB/sec (i.e. 1Gbps) – TCP connections never reach anywhere near wirespeed – TCP Window size reduced 720 bytesnic.generic-receive-offload  Workaround – nic.large-receive-offload – nic.generic-receive-offload RHEL Knowledgebase Solution: 20278
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved The Core Counts HDFS & YARN
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS NameNode Configuration
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS: NameNode Group Mapping Lookup Implementations  org.apache.hadoop.security.ShellBasedUnixGroupsMapping  org.apache.hadoop.security.LdapGroupsMapping  org.apache.hadoop.security.CompositeGroupsMapping  org.apache.hadoop.security.JniBasedUnixGroupsMappingWithFallback hadoop.security.group.mapping
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS: NameNode Metadata Directories  Multiple Entries – Each directory gets a replica of the fsimage data  Very common “second directory” is an NFS Mount  soft mount vs hard mount dfs.namenode.name.dir
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS: NameNode Handler Count  Math.log(${currentDataNodeCount}) * 20  10 node cluster – 46  100 node cluster – 92  1000 node cluster - 138 dfs.namenode.handler.count
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS: HA Retry Policy  When primary NameNode is killed, clients can retry for up to 10 minutes instead of failing over dfs.client.retry.policy.enabled = true
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS DataNode Configuration
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS: DataNode Failed Volumes  dmesg  smartctl dfs.datanode.failed.volumes.tolerated ata1.00: failed to IDENTIFY (I/O error, err_mask=0x4) === START OF READ SMART DATA SECTION === SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed: read failure 20% 717
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved HDFS: DataNode  Default: 4096  Increase depends on other services deployed in the cluster and workload type dfs.datanode.max.transfer.threads
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved YARN Resource Manager Configuration
  • 24. 24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved YARN: ResourceManager Min/Max Container Size Allocation yarn.scheduler.minimum-allocation-mb & yarn.scheduler.maximum-allocation-mb
  • 25. 25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved YARN: NodeManager Memory yarn.nodemanager.resource.memory-mb RAM Operating System DataNode Region Server NodeManager
  • 26. 26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved YARN: NodeManager Local Directories yarn.nodemanager.local-dirs
  • 27. 27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved YARN ATS: Rolling LevelDB Timeline store  org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore  org.apache.hadoop.yarn.server.timeline.LeveldbTimelineStore  org.apache.hadoop.yarn.server.timeline.RollingLevelDBTimelineStore yarn.timeline-service.store-class
  • 28. 28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved YARN ATS: TTL yarn.timeline-service.ttl-enable & yarn.timeline-service.ttl-ms
  • 29. 29 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Introduction Obligatory Poll Observations & Recommendations Summary
  • 30. 30 © Hortonworks Inc. 2011 – 2016. All Rights Reserved SmartSense Recommendations  We’ve covered 16 of ~250 rules  Built into Support Case close/Sev1 postmortem process  Onramp into core products and Apache Ambari – Stack Advisor – New Defaults – New Alerts hbase_tcp_nodelay hdfs_check_point_period hdfs_dn_suboptimal_mounts hdfs_dn_volume_tolerance hdfs_enable_security_check hdfs_mount_options hdfs_nn_checkpoint_txns hdfs_nn_handler_count hdfs_nn_protect_imp_dirs hdfs_nn_soft_mount hdfs_nn_super_user_group hdfs_short_circuit hive_enable_cbo hive_vectorized_exec jvm_optsmr_min_split_size mr_reduce_parallel_copies mr_slow_start os_cpu_scaling os_ssd_tuning tez_enable_reuse tez_session_release_delay tez_shuffle_buffer yarn_ats_security yarn_nm_black_listed_mount_logdir
  • 31. 31 © Hortonworks Inc. 2011 – 2016. All Rights Reserved All Bundles are: • Encrypted and Anonymized by default Configurable options to: • Exclude properties within specific Hadoop configuration files • Global REGEX replacements across all configuration, metrics, and logs By default: • Ambari clear text passwords are not collected • Hive and Oozie database properties are not collected • All IP addresses and host names are anonymized Bundle Security
  • 32. 32 © Hortonworks Inc. 2011 – 2016. All Rights Reserved SmartSense Stack Support HDP 2.4 HDP 2.3 HDP 2.2 HDP 2.1 HDP 2.0 SmartSense 1.x Ambari 2.2 Built-In! Ambari 2.1 Plug-In Ambari 2.0 Plug-In Ambari 1.7 Ambari 1.6 SmartSense 1.x
  • 33. 33 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Questions?

Editor's Notes

  1. /sys/kernel/mm/redhat_transparent_hugepage/enabled [always] never
  2. Usernames, User ID’s, Group Names, Group ID’s, as well as DNS positive-time-to-live
  3. If the NFS file system is hard mounted, the NFS daemons try repeatedly to contact the server. The NFS daemon retries will not time out unless nfstimeout value is set and reached
  4. SmartSense bundles include configuration, and metrics, and bundles used for Support Case troubleshooting included configuration, metrics, and log files. This data is captured for the Operating System of cluster nodes, as well as for all of the installed HDP services. The capture process can be configured to exclude specific files from capture, or specific Hadoop properties within HDP configuration files. In order to provide protection to organization-specific data, such as customer ID’s, patient ID’s, Credit Card #’s, etc. We provide the capability to specify a regular expression that can be removed or replaced in any file that is captured by SmartSense. This allows protection of sensitive data in the event that data is unintentionally leaked into log files. By default we remove all properties associated with clear text passwords. Ambari, Hive, and Oozie by default store DB credentials as cleartext, unless they’ve been configured to encrypt them. Just in case Hadoop Operators have not taken the time to do so, we exclude those properties by default.