1 ©	Hortonworks	Inc.	2011	– 2018.	All	Rights	Reserved
IBM	Cloud	Paris	Meetup
Abdelkrim	Hadjidj
Solutions	Engineering	@	Hortonworks
Hortonworks:	Enabling	the	Modern	Data	Architecture
Our	mission	continues…
→ Make Hadoop an enterprise-viable data platform
→ Bring all data under management – all sources and types
→ Extend to Global Data Management
Hortonworks' consistent and continuous track record of innovation
[Diagram: Hortonworks product portfolio]
DataPlane Service (DPS): manage, govern, secure; multiple clusters and sources; multi, hybrid
Extensible services: Data Lifecycle Manager, Data Steward Studio*, Data Analytics Studio*, Cloudbreak*, IBM DSX*, ISV services
*not yet available, coming soon
Connected data platforms: Hortonworks Data Platform (HDP®) for data-at-rest; Hortonworks DataFlow (HDF™) for data-in-motion
Modern data use cases: EDW optimization, cyber security, data science, advanced analytics, IoT/streaming analytics, partner solutions
Hortonworks Connection: enterprise support, premier support, educational services, professional services, community connection
Hortonworks Platform Services: operational services, SmartSense™
Global	Data	Management	With	Hortonworks
Data	Lake	3.0:	Scale	&	Cut	TCO;	Deploy	in	Minutes	
Pre-Data Lake 3.0
-Siloed: multiple HDP deployments w/ low utilization
-Longer time to deployment (days)
Data Lake 3.0
-Scale: 10s of billions of files, thousands of nodes
-Data efficiency: 2x storage reduction
-Compute efficiency: Dockerized micro-services on demand (separate compute & storage)
-Generic file system: Hadoop APIs, S3 APIs, block
-3rd party: Kubernetes, ISVs such as IBM DSX
[Diagram: bare metal shared services in the physical cluster]
HDP 3.0: Dockerized ephemeral HDP clusters (Ambari, YARN, HDFS), Dockerized multi-version, Dockerized stateless microservices; Hadoop APIs; YARN and HDFS as shared services
Post HDP 3.0: Dockerized HDP clusters (Ambari, YARN, HDFS), Dockerized Kubernetes clusters, Dockerized stateful microservices; CBS (Container Block Storage), Hadoop APIs, S3 APIs; YARN and Ozone as shared services
Data	Lake	3.0	Architecture
[Diagram: architecture layers, top to bottom]
-Hortonworks Data Plane Service (DPS) user interface
-Application/container registry: templatized app #1, app #2, app #3; ISVs (modern data applications), CyberSecurity, IoT apps
-YARN API
-Container scheduler for data-intensive apps in a shared cluster (YARN): queue/app priority, user pre-emption, SLA guarantee, GPU isolation/pooling
-Shared services: service discovery, app logs, app metrics & monitoring, app security & governance
-Hot tier: fast SSD (3x replica)
-Cold tier: erasure coding with 1.5x overhead, archive HDD
-GPU resources
-Other labels: EC2, Google, IBM
Hadoop	3.x
HDP	3.0	Focus	Areas
• Business Agility
  • Allow Hive / Spark / Zeppelin to innovate faster than core
• 3.0 Adoption
  • Time to market – Hadoop 3.0
  • Upgradability – painless for users to take on 3.0
• Scalability, Containerized Micro-Services & Storage Efficient Archive
  • NN Federation/ViewFS gives a way to scale the Hadoop solution
  • Containerization lets more workloads run on the solution, and data science (DS) workloads can control their own dependencies
  • Erasure Coding allows for keeping more data longer
• GPUs for AI & Deep Learning
  • GPU scheduling
  • Docker container example with DL libraries
• 3rd Party tools
  • DSX
HDP	3.0	–Erasure	Coding	Enables	Active	Archive
⬢ Erasure Coding in HDP 3.0
– Default: Reed-Solomon, 6 data + 3 parity
– 2x storage savings (1.5x overhead vs. 3x w/ replication)
– Tolerates up to 3 node failures (continues writing w/ 6 data nodes)
– Supports Hive/Tez queries on data written directly to an EC zone
⬢ Other Notable Items
– Intra-node disk balancing
– Data tiering with policy
– Optimize around slow disk/network
– Support for more than 2 NameNodes
[Diagram: storage tiers for files/directories across DataNodes and media types (SSD – solid state drive, Disk – hard drive, Archive – high density drives)]
Hot (default): N replicas (SSD: 3)
Warm: N replicas (Disk: 1, Archive: 2)
Cold: erasure coding (Archive: 1.5), 6 data blocks (b1 to b6) + 3 parity blocks (P1 to P3)
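The storage figures on this slide follow from simple arithmetic, sketched below. The function names are illustrative only and are not part of any Hadoop API; the sketch just reproduces the 1.5x overhead, the 2x saving against 3x replication, and the 3-failure tolerance of a 6 data + 3 parity layout.

```python
from fractions import Fraction


def replication_overhead(replicas: int) -> Fraction:
    """Raw bytes stored per logical byte when keeping N full replicas."""
    return Fraction(replicas)


def erasure_coding_overhead(data_units: int, parity_units: int) -> Fraction:
    """Raw bytes stored per logical byte for a stripe of data + parity units."""
    return Fraction(data_units + parity_units, data_units)


def tolerated_losses(parity_units: int) -> int:
    """A Reed-Solomon stripe can be rebuilt after losing up to `parity_units` units."""
    return parity_units


rs_overhead = erasure_coding_overhead(6, 3)       # (6 + 3) / 6 = 3/2
replica_overhead = replication_overhead(3)        # 3
savings_factor = replica_overhead / rs_overhead   # 3 / (3/2) = 2

print(f"RS(6,3) overhead: {float(rs_overhead)}x")
print(f"3-way replication overhead: {float(replica_overhead)}x")
print(f"Storage savings factor: {float(savings_factor)}x")
print(f"Unit losses tolerated per stripe: {tolerated_losses(3)}")
```

The same functions can be used to compare other layouts, for example a 10 data + 4 parity stripe, by changing the arguments.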
Ozone	– Why	an	Object	Store?
• With workloads like IoT, we are looking at scaling to trillions of objects.
• Apache HDFS is designed for large objects – not for many small objects
  • Small files create memory pressure on the NameNode.
  • Each small file creates a block in the DataNode.
  • DataNodes send all block information to the NameNode in BlockReports.
  • Both of these create scalability issues on the NameNode.
• Metadata in memory is the strength of the original GFS and HDFS design, but also its weakness in scaling the number of files and blocks.
• An object store has simpler semantics than a file system and is easier to scale
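A rough sketch of why small files hurt: the number of metadata entries the NameNode must hold depends on the file count, not on the data volume. The 128 MiB block size and the file sizes below are assumptions chosen for illustration, and the model (one entry per file plus one per block) is a simplification, not the NameNode's actual accounting.

```python
BLOCK_SIZE_BYTES = 128 * 1024 * 1024  # assumed block size, not taken from the slides


def ceil_div(a: int, b: int) -> int:
    """Integer ceiling division without floating point."""
    return -(-a // b)


def namespace_objects(total_bytes: int, file_size_bytes: int,
                      block_size_bytes: int = BLOCK_SIZE_BYTES) -> int:
    """Count file entries plus block entries needed to store `total_bytes`."""
    files = ceil_div(total_bytes, file_size_bytes)
    blocks_per_file = max(1, ceil_div(file_size_bytes, block_size_bytes))
    return files + files * blocks_per_file


ONE_TIB = 1024 ** 4
large_files = namespace_objects(ONE_TIB, file_size_bytes=1024 ** 3)  # 1 GiB files
small_files = namespace_objects(ONE_TIB, file_size_bytes=1024 ** 2)  # 1 MiB files

print(f"1 TiB as 1 GiB files: {large_files} metadata entries")
print(f"1 TiB as 1 MiB files: {small_files} metadata entries")
```

Under these assumptions the same terabyte needs a few thousand entries as large files and about two million as small files, all of which live in NameNode memory and appear in block reports.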
Ozone	– Why	an	Object	Store	(continued)
• Ozone attempts to scale to trillions of objects
  • Ozone is built on a distributed metadata store.
  • Avoids any single server becoming a bottleneck
  • More parallelism possible in both data and metadata operations
• Built on well-tested components and understood protocols
  • Raft for consensus
    • Raft is a protocol for reaching consensus among a set of machines in an unreliable environment where machines and the network may fail.
  • Off-the-shelf key-value store like RocksDB
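The core rule that lets Raft survive machine and network failures is majority agreement. The sketch below shows only that rule (quorum size, tolerated failures, and when a log entry counts as committed); it does not model leader election or log replication, and the function names are illustrative rather than taken from Ozone or any Raft library.

```python
def quorum_size(cluster_size: int) -> int:
    """Smallest number of servers that forms a strict majority."""
    return cluster_size // 2 + 1


def tolerated_failures(cluster_size: int) -> int:
    """Servers that can fail while a majority can still be reached."""
    return cluster_size - quorum_size(cluster_size)


def is_committed(acknowledgements: int, cluster_size: int) -> bool:
    """An entry is committed once a majority (leader included) has stored it."""
    return acknowledgements >= quorum_size(cluster_size)


for n in (3, 4, 5):
    print(f"{n} servers: quorum={quorum_size(n)}, "
          f"tolerates {tolerated_failures(n)} failure(s)")
```

Note that going from 3 to 4 servers does not increase the number of failures tolerated, which is why odd cluster sizes are the usual choice.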
HDP	3.0	- HDFS	Federation	Enables	Linear	Scale
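[Diagram: two layers, Namespace and Block Storage]
Namespace: NS1 ... NS k, served by NameNodes NN-1 ... NN-k, plus a future NS (e.g. Ozone FS)
Block Storage: block management layer with Block Pool 1 ... Block Pool k ... Block Pool n
Common storage: DataNodes DN 1, DN 2 ... DN m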
⬢ Extends HDFS to multiple volumes
– But the volumes can share the physical storage of the DNs
– You still get the Hadoop data locality for compute
⬢ Optionally, can mount a Hadoop Compatible FS volume
– Commonly used in cloud for cloud storage – S3, WASB, ADLS
– Use with upcoming OzoneFS
[Diagram labels: hot data, hot data, cold data]
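The idea of presenting several namespaces and mounted file systems as one client-side view can be sketched as a prefix lookup. This is a generic illustration, not the ViewFS implementation: real mount tables are defined in Hadoop configuration, and the mount points, namespace names, and bucket name below are hypothetical.

```python
from typing import Dict

# Hypothetical mount table: client-visible prefix -> backing file system URI.
MOUNT_TABLE: Dict[str, str] = {
    "/hot": "hdfs://ns1/hot",
    "/warehouse": "hdfs://ns2/warehouse",
    "/archive": "s3a://example-bucket/archive",
}


def resolve(path: str, mounts: Dict[str, str] = MOUNT_TABLE) -> str:
    """Map a client path to its backing URI using the longest matching prefix."""
    best = None
    for prefix in mounts:
        if path == prefix or path.startswith(prefix + "/"):
            if best is None or len(prefix) > len(best):
                best = prefix
    if best is None:
        raise KeyError(f"no mount point covers {path}")
    return mounts[best] + path[len(best):]


print(resolve("/hot/events/2018/02"))
print(resolve("/archive/logs"))
```

Clients keep using one path space while hot data stays on an HDFS namespace and cold data can sit on a mounted cloud store.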
Details	YARN	Functionalities
Building	Blocks
RAS
-Application	Timeline	Services	v2
-Delete	queues	without	RM	restart
-Efficient	resource	handling	on	queue	stop
-Easier	YARN	UI
Powerful	YARN
-Disk	isolation
-Network	IO	throttling
-Application priority in Capacity Scheduler
-Intra-queue	pre-emption
-Support for admin-configurable resource types (e.g. GPU)
-Affinity/anti-affinity	support
New	Features
-Docker	containers	on	YARN
-Seamless	upgrade	of	apps	on	YARN
Assembly of Micro-Services
• Consumable micro-services (Metron)
• Deployed and managed by Ambari
Long	Running	Apps
• Spark,	HBase,	Kafka,	Solr,	Storm	on	Docker
• Deployed	and	managed	by	Ambari
Multiple	Ephemeral	HDPs
• Deployed by Cloudbreak; individual clusters managed by Ambari
Built-in	support	for	long	running	Service	in	YARN
→ A native YARN framework (YARN-4692)
• Abstract common framework (similar to Slider) to support long-running services
• Simpler API (to manage the service lifecycle)
• Better support for long-running services
→ Recognition of long-running services
• Affects the policy for preemption, container reservation, etc.
• Auto-restart of containers
• Containers for a long-running service are retried on the same node when they hold local state
→ Service/application upgrade support – YARN-4726
• In general, services are expected to run long enough to cross versions
→ Dynamic container configuration
More	Powerful	YARN
→ Resource Isolation
– Resource isolation support for disk and network
• YARN-2619 (disk), YARN-2140 (network)
• Containers get a fair share of disk and network resources using cgroups
– Docker support in LinuxContainerExecutor
• YARN-3611
• Support for launching Docker containers alongside process-based containers
• Packaging and resource isolation
• Complements YARN's support for long-running services
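One way to read "fair share" is proportional sharing by weight, which is how cgroup weight-based controllers divide a contended resource. The sketch below shows only that arithmetic; the container names, weights, and bandwidth figure are hypothetical, and it does not show how YARN itself configures cgroups.

```python
from typing import Dict


def fair_shares(weights: Dict[str, int], capacity: float) -> Dict[str, float]:
    """Split `capacity` among containers in proportion to their weights."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return {name: capacity * weight / total for name, weight in weights.items()}


# Hypothetical containers competing for 400 MB/s of disk bandwidth.
shares = fair_shares({"container_a": 1, "container_b": 1, "container_c": 2}, 400.0)
for name, share in shares.items():
    print(f"{name}: {share} MB/s")
```

With equal weights every container gets the same share; raising one weight shifts bandwidth toward that container only while the resource is contended.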
Thanks

More Related Content

What's hot

Appplications – Driving Expansion In The Cloud
Appplications – Driving Expansion In The CloudAppplications – Driving Expansion In The Cloud
Appplications – Driving Expansion In The CloudNetAppUK
 
Part 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchPart 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchCloudera, Inc.
 
DataOps or how I learned to love production - Michael Hausenblas
DataOps or how I learned to love production  - Michael HausenblasDataOps or how I learned to love production  - Michael Hausenblas
DataOps or how I learned to love production - Michael HausenblasEvention
 
IBM Big Data Analytics Concepts and Use Cases
IBM Big Data Analytics Concepts and Use CasesIBM Big Data Analytics Concepts and Use Cases
IBM Big Data Analytics Concepts and Use CasesTony Pearson
 
Data Science with Hadoop: A Primer
Data Science with Hadoop: A PrimerData Science with Hadoop: A Primer
Data Science with Hadoop: A PrimerDataWorks Summit
 
Beyond Batch: Is ETL still relevant in the API economy?
Beyond Batch: Is ETL still relevant in the API economy?Beyond Batch: Is ETL still relevant in the API economy?
Beyond Batch: Is ETL still relevant in the API economy?SnapLogic
 
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...Alluxio, Inc.
 
Using Neo4j and Machine Learning to Create a Decision Engine, CluedIn
Using Neo4j and Machine Learning  to Create a Decision Engine, CluedInUsing Neo4j and Machine Learning  to Create a Decision Engine, CluedIn
Using Neo4j and Machine Learning to Create a Decision Engine, CluedInNeo4j
 
NetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show GuideNetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show GuideNetAppUK
 
Use .NET Core to create IoT Solutions
Use .NET Core to create IoT SolutionsUse .NET Core to create IoT Solutions
Use .NET Core to create IoT SolutionsJohn Chang
 
NVIDIA Supply Chain Finance CAPSTONE
NVIDIA Supply Chain Finance CAPSTONENVIDIA Supply Chain Finance CAPSTONE
NVIDIA Supply Chain Finance CAPSTONEParam Parikh
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes John Archer
 
10 Good Reasons: NetApp for Healthcare
10 Good Reasons: NetApp for Healthcare10 Good Reasons: NetApp for Healthcare
10 Good Reasons: NetApp for HealthcareNetApp
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsIgor José F. Freitas
 
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Precisely
 
Benefits of Transferring Real-Time Data to Hadoop at Scale
Benefits of Transferring Real-Time Data to Hadoop at ScaleBenefits of Transferring Real-Time Data to Hadoop at Scale
Benefits of Transferring Real-Time Data to Hadoop at ScaleHortonworks
 

What's hot (20)

Appplications – Driving Expansion In The Cloud
Appplications – Driving Expansion In The CloudAppplications – Driving Expansion In The Cloud
Appplications – Driving Expansion In The Cloud
 
Part 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science WorkbenchPart 1: Introducing the Cloudera Data Science Workbench
Part 1: Introducing the Cloudera Data Science Workbench
 
DataOps or how I learned to love production - Michael Hausenblas
DataOps or how I learned to love production  - Michael HausenblasDataOps or how I learned to love production  - Michael Hausenblas
DataOps or how I learned to love production - Michael Hausenblas
 
IBM Big Data Analytics Concepts and Use Cases
IBM Big Data Analytics Concepts and Use CasesIBM Big Data Analytics Concepts and Use Cases
IBM Big Data Analytics Concepts and Use Cases
 
Destroying Data Silos
Destroying Data SilosDestroying Data Silos
Destroying Data Silos
 
Hadoop dev 01
Hadoop dev 01Hadoop dev 01
Hadoop dev 01
 
Data Science with Hadoop: A Primer
Data Science with Hadoop: A PrimerData Science with Hadoop: A Primer
Data Science with Hadoop: A Primer
 
Beyond Batch: Is ETL still relevant in the API economy?
Beyond Batch: Is ETL still relevant in the API economy?Beyond Batch: Is ETL still relevant in the API economy?
Beyond Batch: Is ETL still relevant in the API economy?
 
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with D...
 
Using Neo4j and Machine Learning to Create a Decision Engine, CluedIn
Using Neo4j and Machine Learning  to Create a Decision Engine, CluedInUsing Neo4j and Machine Learning  to Create a Decision Engine, CluedIn
Using Neo4j and Machine Learning to Create a Decision Engine, CluedIn
 
NetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show GuideNetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show Guide
 
Big Data: Myths and Realities
Big Data: Myths and RealitiesBig Data: Myths and Realities
Big Data: Myths and Realities
 
Use .NET Core to create IoT Solutions
Use .NET Core to create IoT SolutionsUse .NET Core to create IoT Solutions
Use .NET Core to create IoT Solutions
 
NVIDIA Supply Chain Finance CAPSTONE
NVIDIA Supply Chain Finance CAPSTONENVIDIA Supply Chain Finance CAPSTONE
NVIDIA Supply Chain Finance CAPSTONE
 
AI meets Big Data
AI meets Big DataAI meets Big Data
AI meets Big Data
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
 
10 Good Reasons: NetApp for Healthcare
10 Good Reasons: NetApp for Healthcare10 Good Reasons: NetApp for Healthcare
10 Good Reasons: NetApp for Healthcare
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systems
 
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
 
Benefits of Transferring Real-Time Data to Hadoop at Scale
Benefits of Transferring Real-Time Data to Hadoop at ScaleBenefits of Transferring Real-Time Data to Hadoop at Scale
Benefits of Transferring Real-Time Data to Hadoop at Scale
 

Similar to IBM Cloud Paris meetup 20180213 - Hortonworks

Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks
 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopHortonworks
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudHortonworks
 
Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Innovative Management Services
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to HadoopPOSSCON
 
Hortonworks - IBM - Cloud Event
Hortonworks - IBM - Cloud EventHortonworks - IBM - Cloud Event
Hortonworks - IBM - Cloud EventThiago Santiago
 
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUGReal-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUGskumpf
 
Supporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataSupporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataWANdisco Plc
 
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motionRaúl Marín
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Mac Moore
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course WorkshopDataWorks Summit
 
Internet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitInternet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitDataWorks Summit
 
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3Hortonworks
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksHortonworks
 
Supporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataSupporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataHortonworks
 
Edw Optimization Solution
Edw Optimization Solution Edw Optimization Solution
Edw Optimization Solution Hortonworks
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Pactera_US
 
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)Abdelkrim Hadjidj
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...Hortonworks
 

Similar to IBM Cloud Paris meetup 20180213 - Hortonworks (20)

Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks - What's Possible with a Modern Data Architecture?
 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache HadoopRescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open Cloud
 
Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks Data In Motion Webinar Series Pt. 2
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
 
Introduction to Hadoop
Introduction to HadoopIntroduction to Hadoop
Introduction to Hadoop
 
Hortonworks - IBM - Cloud Event
Hortonworks - IBM - Cloud EventHortonworks - IBM - Cloud Event
Hortonworks - IBM - Cloud Event
 
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUGReal-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
 
Supporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataSupporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big Data
 
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course Workshop
 
Internet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitInternet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop Summit
 
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
Discover Red Hat and Apache Hadoop for the Modern Data Architecture - Part 3
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and Hortonworks
 
Supporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataSupporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big Data
 
Edw Optimization Solution
Edw Optimization Solution Edw Optimization Solution
Edw Optimization Solution
 
Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks Transform Your Business with Big Data and Hortonworks
Transform Your Business with Big Data and Hortonworks
 
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)FOD Paris Meetup -  Global Data Management with DataPlane Services (DPS)
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
 
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks ...
 

More from IBM France Lab

20200113 - IBM Cloud Côte d'Azur - DeepDive Kubernetes
20200113 - IBM Cloud Côte d'Azur - DeepDive Kubernetes20200113 - IBM Cloud Côte d'Azur - DeepDive Kubernetes
20200113 - IBM Cloud Côte d'Azur - DeepDive KubernetesIBM France Lab
 
20200114 - IBM Cloud Paris Meetup - DevOps
20200114 - IBM Cloud Paris Meetup - DevOps20200114 - IBM Cloud Paris Meetup - DevOps
20200114 - IBM Cloud Paris Meetup - DevOpsIBM France Lab
 
20200128 - Meetup Nice Côte d'Azur - Agile Mindset
20200128 - Meetup Nice Côte d'Azur - Agile Mindset20200128 - Meetup Nice Côte d'Azur - Agile Mindset
20200128 - Meetup Nice Côte d'Azur - Agile MindsetIBM France Lab
 
Défis de l'IA : droits, devoirs, enjeux économiques et éthiques
Défis de l'IA : droits, devoirs, enjeux économiques et éthiquesDéfis de l'IA : droits, devoirs, enjeux économiques et éthiques
Défis de l'IA : droits, devoirs, enjeux économiques et éthiquesIBM France Lab
 
Meetup ibm abakus banque postale
Meetup ibm abakus banque postaleMeetup ibm abakus banque postale
Meetup ibm abakus banque postaleIBM France Lab
 
20190613 - IBM Cloud Côte d'Azur meetup - "Cloud & Containers"
20190613 - IBM Cloud Côte d'Azur meetup - "Cloud & Containers"20190613 - IBM Cloud Côte d'Azur meetup - "Cloud & Containers"
20190613 - IBM Cloud Côte d'Azur meetup - "Cloud & Containers"IBM France Lab
 
20190613 - IBM Cloud Côte d'Azur meetup - "Cloud & Containers"
20190613 - IBM Cloud Côte d'Azur meetup - "Cloud & Containers"20190613 - IBM Cloud Côte d'Azur meetup - "Cloud & Containers"
20190613 - IBM Cloud Côte d'Azur meetup - "Cloud & Containers"IBM France Lab
 
IBM Watson IOT - Acoustic or Visual Insights
IBM Watson IOT - Acoustic or Visual InsightsIBM Watson IOT - Acoustic or Visual Insights
IBM Watson IOT - Acoustic or Visual InsightsIBM France Lab
 
Retour expérience Track & Trace - IBM using Sigfox.
Retour expérience Track & Trace - IBM using Sigfox.Retour expérience Track & Trace - IBM using Sigfox.
Retour expérience Track & Trace - IBM using Sigfox.IBM France Lab
 
20190520 - IBM Cloud Paris-Saclay Meetup - Hardis Group
20190520  - IBM Cloud Paris-Saclay Meetup - Hardis Group20190520  - IBM Cloud Paris-Saclay Meetup - Hardis Group
20190520 - IBM Cloud Paris-Saclay Meetup - Hardis GroupIBM France Lab
 
IBM Cloud Paris Meetup - 20190520 - IA & Power
IBM Cloud Paris Meetup - 20190520 - IA & PowerIBM Cloud Paris Meetup - 20190520 - IA & Power
IBM Cloud Paris Meetup - 20190520 - IA & PowerIBM France Lab
 
IBM Cloud Côte d'Azur Meetup - 20190328 - Optimisation
IBM Cloud Côte d'Azur Meetup - 20190328 - OptimisationIBM Cloud Côte d'Azur Meetup - 20190328 - Optimisation
IBM Cloud Côte d'Azur Meetup - 20190328 - OptimisationIBM France Lab
 
IBM Cloud Côte d'Azur Meetup - 20190328 - Optimisation
IBM Cloud Côte d'Azur Meetup - 20190328 - OptimisationIBM Cloud Côte d'Azur Meetup - 20190328 - Optimisation
IBM Cloud Côte d'Azur Meetup - 20190328 - OptimisationIBM France Lab
 
IBM Cloud Bordeaux Meetup - 20190325 - Software Factory
IBM Cloud Bordeaux Meetup - 20190325 - Software FactoryIBM Cloud Bordeaux Meetup - 20190325 - Software Factory
IBM Cloud Bordeaux Meetup - 20190325 - Software FactoryIBM France Lab
 
IBM Cloud Paris Meetup - 20190129 - Assima
IBM Cloud Paris Meetup - 20190129 - AssimaIBM Cloud Paris Meetup - 20190129 - Assima
IBM Cloud Paris Meetup - 20190129 - AssimaIBM France Lab
 
IBM Cloud Paris Meetup - 20190129 - Myrtea
IBM Cloud Paris Meetup - 20190129 - MyrteaIBM Cloud Paris Meetup - 20190129 - Myrtea
IBM Cloud Paris Meetup - 20190129 - MyrteaIBM France Lab
 
IBM Cloud Paris Meetup - 20181016 - L'agilité à l'échelle
IBM Cloud Paris Meetup - 20181016 - L'agilité à l'échelleIBM Cloud Paris Meetup - 20181016 - L'agilité à l'échelle
IBM Cloud Paris Meetup - 20181016 - L'agilité à l'échelleIBM France Lab
 
IBM Cloud Côte d'Azur Meetup - Blockchain Business Processes & Rule-based Sm...
IBM Cloud Côte d'Azur Meetup - Blockchain Business Processes &  Rule-based Sm...IBM Cloud Côte d'Azur Meetup - Blockchain Business Processes &  Rule-based Sm...
IBM Cloud Côte d'Azur Meetup - Blockchain Business Processes & Rule-based Sm...IBM France Lab
 
IBM Cloud Côte D'Azur Meetup - 20181004 - Blockchain Hyperledger Workshop
IBM Cloud Côte D'Azur Meetup - 20181004 - Blockchain Hyperledger WorkshopIBM Cloud Côte D'Azur Meetup - 20181004 - Blockchain Hyperledger Workshop
IBM Cloud Côte D'Azur Meetup - 20181004 - Blockchain Hyperledger WorkshopIBM France Lab
 
IBM Cloud Paris Meetup - 20180911 - Common Ledger for Public Administration
IBM Cloud Paris Meetup - 20180911 - Common Ledger for Public AdministrationIBM Cloud Paris Meetup - 20180911 - Common Ledger for Public Administration
IBM Cloud Paris Meetup - 20180911 - Common Ledger for Public AdministrationIBM France Lab
 

IBM Cloud Paris meetup 20180213 - Hortonworks

  • 2. Hortonworks: Enabling the Modern Data Architecture
    © Hortonworks Inc. 2011 – 2016. All Rights Reserved
    Our mission continues…
    → Make Hadoop an enterprise-viable data platform
    → Bring all data under management: all sources and types
    → Extend to Global Data Management
    Hortonworks' consistent and continuous track record of innovation
  • 3. Global Data Management With Hortonworks (diagram labels)
    – Multiple clusters and sources: MULTIHYBRID
    – DataPlane Service (DPS): manage, govern, secure
    – Extensible services: Data Lifecycle Manager, Data Steward Studio*, Data Analytics Studio*, Cloudbreak*, IBM DSX*, ISV services (*not yet available, coming soon)
    – Connected data platforms: Hortonworks Data Platform (HDP®) for data-at-rest; Hortonworks DataFlow (HDF™) for data-in-motion
    – Modern data use cases: EDW optimization, cyber security, data science, advanced analytics, IoT/streaming analytics, partner solutions
    – Hortonworks Connection: enterprise support, premier support, educational services, professional services, community connection
    – Hortonworks Platform Services: operational services, SmartSense™
  • 4. Data Lake 3.0: Scale & Cut TCO; Deploy in Minutes
    Pre-Data Lake 3.0:
    – Siloed: multiple deployments w/ low utilization
    – Longer time to deployment (days)
    – Diagram labels: HDP, HDP, HDP 1.6
    Data Lake 3.0:
    – Scale: 10s of billions of files, thousands of nodes
    – Data efficiency: 2x storage reduction
    – Compute efficiency: Dockerized micro-services on demand (separate compute & storage)
    – Generic file system: Hadoop APIs, S3 APIs, Block
    – 3rd party: Kubernetes, ISVs such as IBM DSX
    Two stack diagrams, captioned "HDP 3.0" and "Post HDP 3.0" (labels in order of appearance):
    – Dockerized HDP Clusters (Ambari, YARN, HDFS); Dockerized Kubernetes Clusters; Dockerized Stateful Microservices; CBS (Container Block Storage); Hadoop APIs; S3 APIs; YARN; Ozone; Bare Metal Shared Services in the Physical Cluster
    – Dockerized Ephemeral HDP Clusters (Ambari, YARN, HDFS); Dockerized Multi-Version; Dockerized Stateless Microservices; Hadoop APIs; YARN; HDFS; Bare Metal Shared Services in the Physical Cluster
  • 5. Data Lake 3.0 Architecture (diagram labels)
    – Hortonworks Data Plane Service (DPS); User Interface; EC2; Google
    – Application/Container Registry: Templatized app#1, Templatized app#2, Templatized app#3
    – YARN API; ISVs (Modern Data Applications); CyberSecurity; IoT apps
    – Container Scheduler for Data Intensive Apps in a Shared Cluster (YARN): queue/app priority, user pre-emption, SLA guarantee, GPU isolation/pooling
    – Service Discovery; App Logs; App Metrics & Monitoring; App Security & Governance
    – Cold tier: erasure coding with 1.5x OH, archive HDD
    – Hot tier: fast SSD (3x replica)
    – GPU resources
    – IBM
  • 7. HDP 3.0 Focus Areas
    • Business Agility
      • Allow Hive / Spark / Zeppelin to innovate faster than core
    • 3.0 Adoption
      • Time to market: Hadoop 3.0
      • Upgradability: painless for users to take on 3.0
    • Scalability, Containerized Micro-Services & Storage-Efficient Archive
      • NN Federation/ViewFS gives a way to scale the Hadoop solution
      • Containerization puts more workloads on the solution, and DS workloads can control dependencies
      • Erasure Coding allows for keeping more data longer
    • GPUs for AI & Deep Learning
      • GPU scheduling
      • Docker container example with DL libraries
    • 3rd-party tools
      • DSX
  • 8. HDP 3.0 – Erasure Coding Enables Active Archive
    ⬢ Erasure Coding in HDP 3.0
      – Default: Reed-Solomon, 6 data + 3 parity
      – 2x storage savings (1.5x vs. 3x w/ replica)
      – Tolerates up to 3 node failures (continues writing w/ 6 data nodes)
      – Supports Hive/Tez queries on data written directly to an EC zone
    ⬢ Other Notable Items
      – Intra-node disk balancing
      – Data tiering with policy
      – Optimize around slow disk/network
      – Support for more than 2 NameNodes
    Diagram labels (Files/Directories mapped to storage tiers on the Data Nodes):
      – Media: SSD (Solid State Drive), Disk (Hard Drive), Archive (High Density Drives)
      – Hot (default): N replicas (SSD: 3)
      – Warm: N replicas (Disk: 1, Archive: 2)
      – Cold: erasure coding (Archive: 1.5), 6 data blocks (b1 to b6) plus 3 parity blocks (P1 to P3)
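The storage figures on this slide (1.5x for Reed-Solomon 6+3 versus 3x for replication, hence the "2x storage savings") follow from simple arithmetic. The sketch below only reproduces that arithmetic; the function names are illustrative and are not part of any Hadoop API.

```python
def ec_overhead(data_blocks: int, parity_blocks: int) -> float:
    """Raw bytes stored per byte of user data under a (data, parity) erasure code."""
    return (data_blocks + parity_blocks) / data_blocks


def replication_overhead(replicas: int) -> float:
    """Raw bytes stored per byte of user data under N-way replication."""
    return float(replicas)


def savings_factor(replicas: int, data_blocks: int, parity_blocks: int) -> float:
    """How many times less raw storage erasure coding needs than replication."""
    return replication_overhead(replicas) / ec_overhead(data_blocks, parity_blocks)


def tolerated_block_losses(parity_blocks: int) -> int:
    """A Reed-Solomon stripe can be rebuilt after losing up to `parity_blocks` blocks."""
    return parity_blocks


if __name__ == "__main__":
    print(ec_overhead(6, 3))          # overhead of RS(6, 3)
    print(savings_factor(3, 6, 3))    # savings relative to 3x replication
    print(tolerated_block_losses(3))  # losses survivable per stripe
```

With 6 data blocks and 3 parity blocks, 9 blocks are stored for every 6 blocks of data, which is where the 1.5x overhead comes from.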
  • 9. Ozone – Why an Object Store?
    • With workloads like IoT, we are looking at scaling to trillions of objects.
    • Apache HDFS is designed for large objects, not for many small objects.
      • Small files create memory pressure on the NameNode.
      • Each small file creates a block in the DataNode.
      • DataNodes send all block information to the NameNode in BlockReports.
      • Both of these create scalability issues on the NameNode.
    • Metadata in memory is the strength of the original GFS and HDFS design, but also its weakness in scaling the number of files and blocks.
    • An object store has simpler semantics than a file system and is easier to scale.
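To make the small-files point concrete, here is a rough back-of-the-envelope sketch. It assumes an often-quoted rule of thumb of about 150 bytes of NameNode heap per namespace object (file or block) and a 128 MiB block size; neither number comes from the slides, and real heap usage varies by version and configuration.

```python
import math

# Assumptions for illustration only (not from the slides):
BYTES_PER_OBJECT = 150          # rough rule of thumb per file or block object
BLOCK_SIZE = 128 * 1024 ** 2    # 128 MiB block size


def namenode_objects(num_files: int, file_size: int) -> int:
    """Count namespace objects: one per file plus one per block of each file."""
    blocks_per_file = max(1, math.ceil(file_size / BLOCK_SIZE))
    return num_files * (1 + blocks_per_file)


def namenode_heap_bytes(num_files: int, file_size: int) -> int:
    """Estimated NameNode heap needed to track the given files."""
    return namenode_objects(num_files, file_size) * BYTES_PER_OBJECT


if __name__ == "__main__":
    one_tib = 1024 ** 4
    large = 128 * 1024 ** 2   # 128 MiB files
    small = 1024 ** 2         # 1 MiB files
    # The same 1 TiB of data, stored as large files versus small files.
    print(namenode_heap_bytes(one_tib // large, large))
    print(namenode_heap_bytes(one_tib // small, small))
```

Under these assumptions, the same terabyte stored as 1 MiB files needs 128 times more NameNode metadata than when stored as 128 MiB files, which is the scaling pressure the slide describes.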
  • 10. Ozone – Why an Object Store? (continued)
    • Ozone attempts to scale to trillions of objects.
    • Ozone is built on a distributed metadata store.
      • Avoids any single server becoming a bottleneck
      • More parallelism possible in both data and metadata operations
    • Built on well-tested components and well-understood protocols
      • RAFT for consensus
        • RAFT is a protocol for reaching consensus between a set of machines in an unreliable environment where machines and the network may fail.
      • Off-the-shelf key-value store like RocksDB
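The reason RAFT keeps working when machines fail is its majority rule: a log entry counts as committed once a majority of the cluster has stored it. The sketch below shows only that quorum arithmetic, not leader election or log replication, and the function names are illustrative rather than taken from Ozone or any Raft library.

```python
def quorum_size(cluster_size: int) -> int:
    """Smallest number of servers that forms a strict majority."""
    return cluster_size // 2 + 1


def tolerated_failures(cluster_size: int) -> int:
    """Servers that can fail while a majority remains reachable."""
    return cluster_size - quorum_size(cluster_size)


def is_committed(acks: int, cluster_size: int) -> bool:
    """An entry is committed once a majority (leader included) has stored it."""
    return acks >= quorum_size(cluster_size)


if __name__ == "__main__":
    for n in (3, 5, 7):
        print(n, quorum_size(n), tolerated_failures(n))
```

A 3-server group therefore survives one failure and a 5-server group survives two, which is why consensus groups are usually sized with odd numbers.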
  • 11. HDP 3.0 – HDFS Federation Enables Linear Scale
    ⬢ Extends HDFS to multiple volumes
      – But the volumes can share the physical storage of the DNs
      – You still get the Hadoop data locality for compute
    ⬢ Optionally, can mount a Hadoop Compatible FS volume
      – Commonly used in the cloud for cloud storage: S3, WASB, ADLS
      – Use with upcoming OzoneFS
    Diagram labels:
      – Namespace: NN-1 … NN-k serving NS1 … NS k, plus a future NS (e.g. Ozone FS)
      – Block Storage: Block Management Layer with Block Pool 1 … Block Pool k … Block Pool n
      – Common Storage: DN 1, DN 2, … DN m
      – Hot data, Hot data, Cold data
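With several namespaces and optional cloud volumes, clients need a way to map one logical path to the right volume; in Hadoop this is what a ViewFS-style client-side mount table does. The sketch below illustrates the idea with a longest-prefix lookup. The mount points, namespace names, and bucket name are hypothetical, and this is not the Hadoop implementation.

```python
# Hypothetical mount table: logical mount point -> target file system URI.
MOUNT_TABLE = {
    "/hot": "hdfs://ns1/hot",
    "/cold": "hdfs://ns2/cold",
    "/cloud": "s3a://example-bucket/data",
}


def resolve(path: str, mounts: dict = MOUNT_TABLE) -> str:
    """Map a logical path to its backing URI using the longest matching mount point."""
    best = None
    for mount in mounts:
        # Match the mount point itself or anything below it ("/hot/x", not "/hotel").
        if path == mount or path.startswith(mount + "/"):
            if best is None or len(mount) > len(best):
                best = mount
    if best is None:
        raise KeyError(f"no mount point covers {path!r}")
    return mounts[best] + path[len(best):]


if __name__ == "__main__":
    print(resolve("/hot/logs/a.txt"))
    print(resolve("/cloud/archive/2018"))
```

Because resolution happens on the client, adding a namespace or a cloud volume only requires a new mount-table entry; applications keep using the same logical paths.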
  • 12. Building Blocks, YARN Functionalities, Details
    YARN functionalities:
      – RAS: Application Timeline Services v2; delete queues without RM restart; efficient resource handling on queue stop; easier YARN UI
      – Powerful YARN: disk isolation; network IO throttling; application priority in Capacity Scheduler; intra-queue pre-emption; support for admin-configurable resource types (e.g. GPU); affinity/anti-affinity support
      – New features: Docker containers on YARN; seamless upgrade of apps on YARN
    Building blocks and details:
      – Assembly of Micro-Services
        • Consumable micro-services (Metron)
        • Deployed and managed by Ambari
      – Long Running Apps
        • Spark, HBase, Kafka, Solr, Storm on Docker
        • Deployed and managed by Ambari
      – Multiple Ephemeral HDPs
        • Deployed by Cloudbreak; individual clusters managed by Ambari
  • 13. Built-in Support for Long-Running Services in YARN
    → A native YARN framework (YARN-4692)
      • Abstract common framework (similar to Slider) to support long-running services
      • Simplified API (to manage the service lifecycle)
      • Better support for long-running services
    → Recognition of long-running services
      • Affects the policy of preemption, container reservation, etc.
      • Auto-restart of containers
      • Containers for long-running services are retried on the same node in case of local state
    → Service/application upgrade support (YARN-4726)
      • In general, services are expected to run long enough to cross versions
    → Dynamic container configuration
  • 14. More Powerful YARN
    → Resource Isolation
      – Resource isolation support for disk and network
        • YARN-2619 (disk), YARN-2140 (network)
        • Containers get a fair share of disk and network resources using Cgroups
      – Docker support in LinuxContainerExecutor
        • YARN-3611
        • Support to launch Docker containers alongside processes
        • Packaging and resource isolation
        • Complements YARN's support for long-running services
  • 15. Data Lake 3.0 Architecture (diagram labels)
    – Hortonworks Data Plane Service (DPS); User Interface; EC2; Google
    – Application/Container Registry: Templatized app#1, Templatized app#2, Templatized app#3
    – YARN API; ISVs (Modern Data Applications); CyberSecurity; IoT apps
    – Container Scheduler for Data Intensive Apps in a Shared Cluster (YARN): queue/app priority, user pre-emption, SLA guarantee, GPU isolation/pooling
    – Service Discovery; App Logs; App Metrics & Monitoring; App Security & Governance
    – Cold tier: erasure coding with 1.5x OH, archive HDD
    – Hot tier: fast SSD (3x replica)
    – GPU resources
    – IBM
  • 16. Thanks