3. Hadoop @ ebay
10+ large Hadoop clusters
10,000+ nodes
50,000+ jobs per day
50,000,000+ tasks per day
500+ types of Hadoop/Hbase metrics
Billions of audit events per day
3
4. Dedicated clusters
• Very specific use case like index building (Near Real Time Indexing)
• Tight SLAs for jobs (seconds to few minutes)
• Immediate revenue impact
• TSDB clusters for monitoring
Shared clusters
• Used primarily for analytics of user behavior and inventory
• Batch and ad-hoc jobs
• YARN, Hbase, Hive, Pig, Hue, Spark, etc.
• Security enabled with Kerberos
HAAS clusters
• Used primarily for DEV and QA
Hadoop @ ebay
5. 5
5.5PBof data generates a 650
million item index in only 2.5
hours
1.68 million items
processed in
3 minutes
Hadoop Platform Use Case: Search Backend
8. Eagle Data Activity Monitoring
Perimeter Security
Enable 2FA
Data Loss Prevention
Authorization and Access Control
Deploy Ranger for centralized Access Control
Kerberoized cluster
10. Use Case:
Analyze HDFS file/directory metadata to find anomalies
in users' HDFS usage patterns in pseudo real time
Retrieve HDFS Metadata like permissions, size, block
level properties etc which are not visible in HDFS Audit
logs
Block unauthorized operations and send alerts
Challenges:
RPC to Namenode for this is an overhead !
OIV is SLOW !!
Real Time Incremental Hadoop Image Processing
14. What’s Eagle
The uniform monitoring and alerting framework
to monitor large-scale distributed system like
hadoop, spark, cloud, etc. in real time.
15. Eagle Ecosystem
Apps
DAM
JPA
Interface
Web Portal
REST Services
Ambari Plugin
Integration
Kafka
Storm
HBase
Druid
Elastic Search
Eagle Framework
Provide full-stack monitoring framework for efficiently
developing highly scalable real-time monitoring
applications.
Eagle Apps
Provide built-in monitoring applications for domains like
hadoop, storm and cloud.
Eagle Integration
Integrate with distributed real-time execution environment
like storm, message bus like kafka and storage layer like
hbase, and also support extensions.
Eagle Interface
Allow to access or manage eagle through REST service,
web UI or Ambari plugin.
Eagle
Framework
16. JPA: Job Performance Analyser
Historical job analysis
Running job analysis
Anomaly host detection
Job data skew detection
Job performance suggestion
Anomaly Prediction based on machine learning
Monitor and analyze job performance in real-time