SlideShare a Scribd company logo
1 of 15
Download to read offline
We Make Hadoop Fly
The TeamNinja is the SolutionOur	TeamWho	we	are?
Avkash	Chauhan,	Founder- Deep	expertise	in	distributed	data	platforms	including	Hadoop,	with	various	publications	
and	speaking	engagements.	Worked	at	Microsoft,	Platfora	&	experience	includes:
§17+	years	in	tech	industry,	worked	with	15+	fortune	#100	companies	worldwide		
§Large	scale	distributed	application	development	and	performance	expert
§Core	Developer	and	product	implementation	specialist	for	Microsoft	Azure,	HDInsight	&	Platfora.
§Microsoft	distinguished	engineer,	key	architect	on	Hadoop	implementation	for	Azure.		
Henry	Ohara,	Architect	(Backend	engineer)- Has	a	career	of	designing	and	developing	scalable	backend	
systems	for	over	21+	years.	
§Founding	member	of	Yahoo’s	mobile	search	team	servicing	over	30	million	users.	
§Developed	a	recommendation	system	for	Yahoo's	media	site.
§Developed	top	#1	application	to	manage	secure	enterprise	communication	in	Japan	for	mobile	
Hamid	Behnam,	UX	Engineer	–
• Over	7+	years	of	front	end	engineering	and	development	experience	with	a	strong	interest	in	JavaScript
• Expert	in	creating	complex	&	multilayer	web	application	using	JavaScript	frameworks
• Amazing	work	ethic
Sal	Sferlazza,	principle	investor	&	chairman	of	board	– LaunchCapital.co
• Overall	21+	years	in	tech	industry	from	developer	to	CTO	to	CEO
• Worked	at	Anderson	Consulting,	Quest	Software,	SonicWALL,	Dell	
• Total	5	exits	in	last	10	years,	3	of	the	exits	are	in	MSP	space
Board
Engineering
Ninja is the SolutionWhat	we	have	built?
Operational	Analytics
for	Big	Data	
Heterogeneous	Monitoring			
Smart	Thresholds	
Unified	Alerting	
Forecasting	
Recommendations
BIG	DATA	APM
Ninja is the SolutionWhy?
There	is	no	APM	for	
big	data	stack	
Enterprise
BI	Vendors
Valley	Startups
IT	Organizations
3rd Party	Applications
Ninja is the SolutionTrue	IntegrationsMarket	Placement
Data	Perpetration	
Data	Quality
Data	Lake
Hadoop
ETL
Batch	Processing
Business	
Intelligence
Machine	Learning
Big	Data	
Analytics
Platform	Monitoring	&	Analytics
Big	Data	Platforms
Environments	where	Big	Data	platform	is	deployed
Big	Data
Ninja is the SolutionTrue	IntegrationsSupported	Components
Platform	Monitoring	&	Analytics
Ec2	Container
Service
• Multi Vendor Monitoring
• Smart Thresholds
• Unified Alerting
• Forecasting
• Recommendations
Ninja is the Solution
Proprietary	And	Confidential
Platform	Features
The TeamNinja is the SolutionSummaryArchitecture	Complete
Hadoop	Cluster
System	
Collection
Server
Hadoop	V1
Collection
Server
Hadoop	V2
Collection
Server
Web	Server Time	Series	DB
Scheduler
Server
Analysis
Server
RESTful Interface
AJAX
• Major	Hadoop	Distributions
• Amazon	&	HDInsight
• Cluster	Deployment	platform
• On	Premise
• Any	Hadoop	Farm	
HTML5,	CSS,	
JavaScript,
Knockout
And	D3
Postgres	DB
Server	interaction	over	REST	(admin/communication)
KAFKA
Streams
Messaging
Server
Non	invasive	appliance	design	&	Self	serving	deployment	model
Jetty	+	JAX-RS
Ninja is the SolutionIntegrations	Complete
Hadoop	
Distributions
Cloud	Based	
Hadoop
Distributed	File
Systems
Distributed	data
Processing
Distributed	
Services
Cluster &	container	
Management
Machine
Learning
*
*Prototyping	Phase
*
*
Ninja is the SolutionOur	TeamFeatures	Completed
• Data	Collection	Micro-services	connected	over	REST	interface
• Micro	servers	for	data	collection	
• Micro	servers	for	tunneling	over	SSH
• Distributed	server	architecture	to	support	distributed	deployment
• Hadoop	Distributions
• Apache	Hadoop
• Hortonworks
• Pivotal
• Dynamic	Views
• Cross-cluster	and	individual	cluster	views	through	deep-dive	navigation
• Dynamic	widgets	creation	and	deployment	for	any	collected	data
• Dynamic	page	generation	for	various	Hadoop	distributions	and	versions
• Dynamic	graph	details	for	both	system	and	Hadoop	data	combined
• Dashboards
• Dynamic	dashboards	generation	from	any	data	point	
• Dashboard	sharing
• Dashboard	scheduled	delivery	as	PDF/Image	over	email
• Reporting
• Report	generation	from	any	page	or	dashboard	in	PDF	or	image	format
• Custom	Complex	reporting	for	multi	point	data	across	multi	cluster	environment
• Scheduled	delivery	for	any	report	over	email	in	PDF/Image	format
• Process	data	analysis	per	process
• Example	process	i.e.	Kafka,	Zookeeper	etc
Ninja is the SolutionOur	TeamFeatures	Completed	(Cont..)
• Query	engine	to	access	any	data	from	any	table	at	any	duration,	example	query	includes:
• Display	System	CPU	load	average	last	2	hours
• Display	System	Memory	Heap	Memory		top	last	1	day
• Display	System	Network	Interface	rx Bytes	eth0	today
• Display	System	Disk	Space	Usage	Average	this	week
• Display	System	Disk	IO	Time	Spent	IO	xvda1	last	2	hours
• MapReduce
• MapReduce	jobs	analysis	individual	or	batch
• MapReduce	jobs	batch	mode	SLA	
• Data	collection	for	MapReduce	2	and	Spark	over	YARN*
• Cluster	Utilization	
• In-depth	cluster	utilization	metrics
• Dynamic	scheduled	delivery	for	cluster	utilization	reports	and	alert
• HDFS
• HDFS	data	analysis,	visualization	&	reporting	at	folder	level
• Alerting
• Scheduled	alerts	and	notification	for	any	data	point	collected
• Deployment
• Self	serving	deployment	&	Remote	Patching
• System	Data	Analysis	per	machine
• CPU
• Memory
• Network	data	at	interface	level
• Disk	I/O	data	at	at	partition	level
• Disk	Space	at	mounted	disk	level *	Only	data	collection
Ninja is the SolutionIntegrations	Roadmap
Hadoop	
Distributions
Cloud	Based	
Hadoop
Distributed	File
Systems
Distributed	data
Processing
Distributed	
Services
Cluster &	container	
Management
Machine
Learning
*
*Prototyping	Phase
*
*
Ninja is the SolutionOur	TeamFeatures	Roadmap
• Use	ODPi (http://www.odpi.org)	to	connect	with	supported	big	data	platform
• Self	service	model	to	slice	&	dice	any	data	collection
• MapReduce	&	Spark	Support	for	YARN
• EMR/AMI	Support
• Other	Hadoop	vendors	Support	(Cloudera	 &	MapR)
• Cartridge	design	for	Cassandra	data	collection
• Recommendation
• Docker	based	application	deployment
• Cost	Analysis	Matrix
• Forecasting
The TeamNinja is the SolutionSummaryDemo
Cluster	Monitoring HDFS	Monitoring
MapReduce	Job	
Monitoring	&	Analysis
Data	Analysis,	Graphs	
(D3/C3)
Alerts	&	Notifications
On-demand	&	
Scheduled	Reporting
Dashboards, HOD,
UI,	Control Panel,	
Time	Span
Data	Collection	
System
(Total	500+	data	
points)
Server	design	and	
communication
Cluster	Utilization Self	Management Troubleshooting
Ninja is the SolutionSummaryContact
Founder	&	Principal
Big	Data	Perspective	LLC
657	Mission	St.	Suite	602,	San	Francisco,	CA	94105
E:	avkash@bigdataperspective.com M:	650-713-9055
Avkash	Chauhan

More Related Content

Similar to Big Data Perspective (Company Information)

Prototype And Test Eh
Prototype And Test EhPrototype And Test Eh
Prototype And Test Eh
novo_ed
 
Cloud_DevOps_Data-Center_Infrastructure_Security_Compliancey_&_Emerging_Tech-...
Cloud_DevOps_Data-Center_Infrastructure_Security_Compliancey_&_Emerging_Tech-...Cloud_DevOps_Data-Center_Infrastructure_Security_Compliancey_&_Emerging_Tech-...
Cloud_DevOps_Data-Center_Infrastructure_Security_Compliancey_&_Emerging_Tech-...
Jarrett Neil Ridlinghafer
 
Sureshundley_Principal_webdesiger
Sureshundley_Principal_webdesigerSureshundley_Principal_webdesiger
Sureshundley_Principal_webdesiger
Suresh Undley
 
CVOscarMendez2016
CVOscarMendez2016CVOscarMendez2016
CVOscarMendez2016
omendez85
 

Similar to Big Data Perspective (Company Information) (20)

Full Stack Developers Hire.pptx
Full Stack Developers Hire.pptxFull Stack Developers Hire.pptx
Full Stack Developers Hire.pptx
 
Discover the World of Full Stack Development and Ignite Your Career.pdf
Discover the World of Full Stack Development and Ignite Your Career.pdfDiscover the World of Full Stack Development and Ignite Your Career.pdf
Discover the World of Full Stack Development and Ignite Your Career.pdf
 
State of Drupal keynote, DrupalCon New Orleans
State of Drupal keynote, DrupalCon New OrleansState of Drupal keynote, DrupalCon New Orleans
State of Drupal keynote, DrupalCon New Orleans
 
Ruby on Rails Specialists - TkXel
Ruby on Rails Specialists - TkXelRuby on Rails Specialists - TkXel
Ruby on Rails Specialists - TkXel
 
What Is Full Stack Development? A Complete Guide
What Is Full Stack Development? A Complete GuideWhat Is Full Stack Development? A Complete Guide
What Is Full Stack Development? A Complete Guide
 
Full Stack Engineer.docx
Full Stack Engineer.docxFull Stack Engineer.docx
Full Stack Engineer.docx
 
Prototype And Test Eh
Prototype And Test EhPrototype And Test Eh
Prototype And Test Eh
 
Prototype And Test Eh5
Prototype And Test Eh5Prototype And Test Eh5
Prototype And Test Eh5
 
Make Web, Not War - Keynote: Embracing Chaos, David Crow
Make Web, Not War - Keynote: Embracing Chaos, David CrowMake Web, Not War - Keynote: Embracing Chaos, David Crow
Make Web, Not War - Keynote: Embracing Chaos, David Crow
 
Cloud_DevOps_Data-Center_Infrastructure_Security_Compliancey_&_Emerging_Tech-...
Cloud_DevOps_Data-Center_Infrastructure_Security_Compliancey_&_Emerging_Tech-...Cloud_DevOps_Data-Center_Infrastructure_Security_Compliancey_&_Emerging_Tech-...
Cloud_DevOps_Data-Center_Infrastructure_Security_Compliancey_&_Emerging_Tech-...
 
resume-michael-garvin
resume-michael-garvinresume-michael-garvin
resume-michael-garvin
 
Outsourcing Full-stack Developer for Web Application Development? Here’s What...
Outsourcing Full-stack Developer for Web Application Development? Here’s What...Outsourcing Full-stack Developer for Web Application Development? Here’s What...
Outsourcing Full-stack Developer for Web Application Development? Here’s What...
 
Tau_Technologies_RhoMobile_and_services
Tau_Technologies_RhoMobile_and_servicesTau_Technologies_RhoMobile_and_services
Tau_Technologies_RhoMobile_and_services
 
Resume_20160508
Resume_20160508Resume_20160508
Resume_20160508
 
Google Developer Student Clubs.pdf
Google Developer Student Clubs.pdfGoogle Developer Student Clubs.pdf
Google Developer Student Clubs.pdf
 
Trucknet VP RnD
Trucknet VP RnDTrucknet VP RnD
Trucknet VP RnD
 
The Roles & Responsibility Of Freelance Dot Net Web Developer.docx
The Roles & Responsibility Of Freelance Dot Net Web Developer.docxThe Roles & Responsibility Of Freelance Dot Net Web Developer.docx
The Roles & Responsibility Of Freelance Dot Net Web Developer.docx
 
5 Benefits of attaining Full-Stack Development Skills.pdf
5 Benefits of attaining Full-Stack Development Skills.pdf5 Benefits of attaining Full-Stack Development Skills.pdf
5 Benefits of attaining Full-Stack Development Skills.pdf
 
Sureshundley_Principal_webdesiger
Sureshundley_Principal_webdesigerSureshundley_Principal_webdesiger
Sureshundley_Principal_webdesiger
 
CVOscarMendez2016
CVOscarMendez2016CVOscarMendez2016
CVOscarMendez2016
 

More from Avkash Chauhan

More from Avkash Chauhan (15)

AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
 
AI Expo - AI Revolution in Silicon Valley
AI Expo - AI Revolution in Silicon ValleyAI Expo - AI Revolution in Silicon Valley
AI Expo - AI Revolution in Silicon Valley
 
Nikkei xTech coverage on macnica.ai announcement
Nikkei xTech coverage on macnica.ai announcementNikkei xTech coverage on macnica.ai announcement
Nikkei xTech coverage on macnica.ai announcement
 
H2O Core Introduction
H2O Core IntroductionH2O Core Introduction
H2O Core Introduction
 
Creating AnswerBot with Keras and TensorFlow (TensorBeat)
Creating AnswerBot with Keras and TensorFlow (TensorBeat)Creating AnswerBot with Keras and TensorFlow (TensorBeat)
Creating AnswerBot with Keras and TensorFlow (TensorBeat)
 
Applied Machine learning using H2O, python and R Workshop
Applied Machine learning using H2O, python and R WorkshopApplied Machine learning using H2O, python and R Workshop
Applied Machine learning using H2O, python and R Workshop
 
Data 360 Conference: Introduction to Big Data, Hadoop and Big Data Analytics
Data 360 Conference: Introduction to Big Data, Hadoop and Big Data AnalyticsData 360 Conference: Introduction to Big Data, Hadoop and Big Data Analytics
Data 360 Conference: Introduction to Big Data, Hadoop and Big Data Analytics
 
Introduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache HadoopIntroduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache Hadoop
 
The concept of Datalake with Hadoop
The concept of Datalake with HadoopThe concept of Datalake with Hadoop
The concept of Datalake with Hadoop
 
Developing Hadoop strategy for your Enterprise
Developing Hadoop strategy for your EnterpriseDeveloping Hadoop strategy for your Enterprise
Developing Hadoop strategy for your Enterprise
 
Introduction to Hadoop at Data-360 Conference
Introduction to Hadoop at Data-360 ConferenceIntroduction to Hadoop at Data-360 Conference
Introduction to Hadoop at Data-360 Conference
 
Introduction to Apache Hive
Introduction to Apache HiveIntroduction to Apache Hive
Introduction to Apache Hive
 
Introduction to Apache Sqoop
Introduction to Apache SqoopIntroduction to Apache Sqoop
Introduction to Apache Sqoop
 
Introduction to Apache Pig
Introduction to Apache PigIntroduction to Apache Pig
Introduction to Apache Pig
 
Introduction to HBase
Introduction to HBaseIntroduction to HBase
Introduction to HBase
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 

Big Data Perspective (Company Information)

  • 2. The TeamNinja is the SolutionOur TeamWho we are? Avkash Chauhan, Founder- Deep expertise in distributed data platforms including Hadoop, with various publications and speaking engagements. Worked at Microsoft, Platfora & experience includes: §17+ years in tech industry, worked with 15+ fortune #100 companies worldwide §Large scale distributed application development and performance expert §Core Developer and product implementation specialist for Microsoft Azure, HDInsight & Platfora. §Microsoft distinguished engineer, key architect on Hadoop implementation for Azure. Henry Ohara, Architect (Backend engineer)- Has a career of designing and developing scalable backend systems for over 21+ years. §Founding member of Yahoo’s mobile search team servicing over 30 million users. §Developed a recommendation system for Yahoo's media site. §Developed top #1 application to manage secure enterprise communication in Japan for mobile Hamid Behnam, UX Engineer – • Over 7+ years of front end engineering and development experience with a strong interest in JavaScript • Expert in creating complex & multilayer web application using JavaScript frameworks • Amazing work ethic Sal Sferlazza, principle investor & chairman of board – LaunchCapital.co • Overall 21+ years in tech industry from developer to CTO to CEO • Worked at Anderson Consulting, Quest Software, SonicWALL, Dell • Total 5 exits in last 10 years, 3 of the exits are in MSP space Board Engineering
  • 3. Ninja is the SolutionWhat we have built? Operational Analytics for Big Data Heterogeneous Monitoring Smart Thresholds Unified Alerting Forecasting Recommendations BIG DATA APM
  • 4. Ninja is the SolutionWhy? There is no APM for big data stack Enterprise BI Vendors Valley Startups IT Organizations
  • 5. 3rd Party Applications Ninja is the SolutionTrue IntegrationsMarket Placement Data Perpetration Data Quality Data Lake Hadoop ETL Batch Processing Business Intelligence Machine Learning Big Data Analytics Platform Monitoring & Analytics Big Data Platforms Environments where Big Data platform is deployed Big Data
  • 6. Ninja is the SolutionTrue IntegrationsSupported Components Platform Monitoring & Analytics Ec2 Container Service
  • 7. • Multi Vendor Monitoring • Smart Thresholds • Unified Alerting • Forecasting • Recommendations Ninja is the Solution Proprietary And Confidential Platform Features
  • 8. The TeamNinja is the SolutionSummaryArchitecture Complete Hadoop Cluster System Collection Server Hadoop V1 Collection Server Hadoop V2 Collection Server Web Server Time Series DB Scheduler Server Analysis Server RESTful Interface AJAX • Major Hadoop Distributions • Amazon & HDInsight • Cluster Deployment platform • On Premise • Any Hadoop Farm HTML5, CSS, JavaScript, Knockout And D3 Postgres DB Server interaction over REST (admin/communication) KAFKA Streams Messaging Server Non invasive appliance design & Self serving deployment model Jetty + JAX-RS
  • 9. Ninja is the SolutionIntegrations Complete Hadoop Distributions Cloud Based Hadoop Distributed File Systems Distributed data Processing Distributed Services Cluster & container Management Machine Learning * *Prototyping Phase * *
  • 10. Ninja is the SolutionOur TeamFeatures Completed • Data Collection Micro-services connected over REST interface • Micro servers for data collection • Micro servers for tunneling over SSH • Distributed server architecture to support distributed deployment • Hadoop Distributions • Apache Hadoop • Hortonworks • Pivotal • Dynamic Views • Cross-cluster and individual cluster views through deep-dive navigation • Dynamic widgets creation and deployment for any collected data • Dynamic page generation for various Hadoop distributions and versions • Dynamic graph details for both system and Hadoop data combined • Dashboards • Dynamic dashboards generation from any data point • Dashboard sharing • Dashboard scheduled delivery as PDF/Image over email • Reporting • Report generation from any page or dashboard in PDF or image format • Custom Complex reporting for multi point data across multi cluster environment • Scheduled delivery for any report over email in PDF/Image format • Process data analysis per process • Example process i.e. Kafka, Zookeeper etc
  • 11. Ninja is the SolutionOur TeamFeatures Completed (Cont..) • Query engine to access any data from any table at any duration, example query includes: • Display System CPU load average last 2 hours • Display System Memory Heap Memory top last 1 day • Display System Network Interface rx Bytes eth0 today • Display System Disk Space Usage Average this week • Display System Disk IO Time Spent IO xvda1 last 2 hours • MapReduce • MapReduce jobs analysis individual or batch • MapReduce jobs batch mode SLA • Data collection for MapReduce 2 and Spark over YARN* • Cluster Utilization • In-depth cluster utilization metrics • Dynamic scheduled delivery for cluster utilization reports and alert • HDFS • HDFS data analysis, visualization & reporting at folder level • Alerting • Scheduled alerts and notification for any data point collected • Deployment • Self serving deployment & Remote Patching • System Data Analysis per machine • CPU • Memory • Network data at interface level • Disk I/O data at at partition level • Disk Space at mounted disk level * Only data collection
  • 12. Ninja is the SolutionIntegrations Roadmap Hadoop Distributions Cloud Based Hadoop Distributed File Systems Distributed data Processing Distributed Services Cluster & container Management Machine Learning * *Prototyping Phase * *
  • 13. Ninja is the SolutionOur TeamFeatures Roadmap • Use ODPi (http://www.odpi.org) to connect with supported big data platform • Self service model to slice & dice any data collection • MapReduce & Spark Support for YARN • EMR/AMI Support • Other Hadoop vendors Support (Cloudera & MapR) • Cartridge design for Cassandra data collection • Recommendation • Docker based application deployment • Cost Analysis Matrix • Forecasting
  • 14. The TeamNinja is the SolutionSummaryDemo Cluster Monitoring HDFS Monitoring MapReduce Job Monitoring & Analysis Data Analysis, Graphs (D3/C3) Alerts & Notifications On-demand & Scheduled Reporting Dashboards, HOD, UI, Control Panel, Time Span Data Collection System (Total 500+ data points) Server design and communication Cluster Utilization Self Management Troubleshooting
  • 15. Ninja is the SolutionSummaryContact Founder & Principal Big Data Perspective LLC 657 Mission St. Suite 602, San Francisco, CA 94105 E: avkash@bigdataperspective.com M: 650-713-9055 Avkash Chauhan