SlideShare a Scribd company logo
1 of 12
@dez_blanchfield
Distributed & Parallel computing – a slice of my personal journey
• Mainframes
• Mini-Computers ( i.e. PDP / VAX )
• Micro-Computers ( 386 PC Servers )
• Desktop Servers ( UltraSPARC 2 )
• High End Proprietary Clusters
• Home Built COTS Clusters
• OpenSource platforms
• Software vs Hardware
@dez_blanchfield
Distributed & Parallel computing – my own NUMA circa 1994 !!
@dez_blanchfield
The Challenges Of Large Scale Parallel and Distributed Computing
• Parallel vs Distributed
• Parallel Computing is Rocket Science
• Filesystems evolved to deliver
demand for distributed storage
• Distributed became more accessible
• Distributed also won the cost battle
• Open Frameworks are king
• Distributed increasing rapidly
• Everyone wants their own Hadoop
• DIY is fun but comes with wide
ranging costs / risks
@dez_blanchfield
Closed vs Open Platforms – Where have we come from
• Proprietary Systems
• IBM SP/2 ( AIX )
• Sun Solaris
• Sun Grid Engine ( Solaris SPARC + x86 )
• A+ Edition ( Fujitsu Mainframes )
• E10k -> E15k big iron
• Custom Systems ( Top 500 list )
• Nations rule the roost here!!
• Open Frameworks
• PVM / MPI / MPI-CH (Chameleon Lib)
• SMP
• Beowulf linux clusters
• Rocks Cluster Distribution
• Hadoop v1 / Hadoop 2 & YARN
@dez_blanchfield
Searching for little green men at Home ( aka SETI@Home )
• Projects like BOINC and the
SETI@Home project, launched in
1999, put distributed computing on
millions of screens and introduced
many of the core ideas and concepts
we now take for granted with
distributed computing
• SETI@Home put a tiny piece of a
super computer on your desktop, and
let you participate in a global project
for social good, and for the most part
it cost you nothing to participate as it
only uses spare CPU cycles that
would otherwise go to waste
@dez_blanchfield
1943 – “5 x Computers should about do it”
“I think there is
a world market
for maybe five
computers.”
Thomas J. Watson
@dez_blanchfield
What Happened – How did the shift happen so quickly
• Open Source & the FSF
• BSD, Minix & Linux
• Network & Clustered Filesystems
• Networking & In Particular Ethernet
• COTS Hardware
• Disks & RAM got a lot cheaper
• Modern CPU Design
• Multi-threading architectures
• Multi-core backplanes
• Smarter Memory Designs
@dez_blanchfield
What Happened – Yellow Elephants & a big Yahoo
• Doug Cutting
• Mike Cafarella
• The internet got very big very fast
• 2nd generation search engines
• Google’s MapReduce and the
Google File System paper
• 2006 & The Nutch Search Engine
• Distributing Index & Search at scale
• Yahoo became home to open
source Big Data
@dez_blanchfield
The cost of DISK has plummeted in the last 20 years
@dez_blanchfield
The cost of RAM has plummeted in the last 20 years
@dez_blanchfield
Where are we today – Pitfalls & Brick Walls
• Distributed Computing is Hard !!
• Now everyone CAN try it
• But not everyone SHOULD do it
• Rocket science does still apply
• Small clusters with one or two
workloads are safe
• Once you scale out to hundreds of
workloads and thousands of users
you will find pain, and lots of it
• You will not solve your
performance issues in a timely or
cost effective manner on your own
@dez_blanchfield
Big Hadoop Systems need automated Performance Management
• Automated Performance Management
is a must, humans can’t respond fast
enough to resolve multi-workload
issues, Ganglia just isn’t enough
• Systems are required to manage
systems, to monitor, discover, and
respond instantly, to deliver good
outcomes from investments in Hadoop
• Even sophisticated large Hadoop
implementations in Govt. & Large
Enterprise who do have rocket
scientists struggle with performance
issues which arise from multi-tenant
multi-workload use of Hadoop

More Related Content

Viewers also liked

Viewers also liked (8)

Parallel sorting Algorithms
Parallel  sorting AlgorithmsParallel  sorting Algorithms
Parallel sorting Algorithms
 
Parallel Algorithms
Parallel AlgorithmsParallel Algorithms
Parallel Algorithms
 
parallel Merging
parallel Mergingparallel Merging
parallel Merging
 
Parallel Algorithm Models
Parallel Algorithm ModelsParallel Algorithm Models
Parallel Algorithm Models
 
Parallel Algorithms
Parallel AlgorithmsParallel Algorithms
Parallel Algorithms
 
Parallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and DisadvantagesParallel Algorithms Advantages and Disadvantages
Parallel Algorithms Advantages and Disadvantages
 
Parallel Computing
Parallel Computing Parallel Computing
Parallel Computing
 
Applications of paralleL processing
Applications of paralleL processingApplications of paralleL processing
Applications of paralleL processing
 

Similar to Tech lab 2016-ep01-pepper-data-dez-slides-20160303-final

Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)Eric Baldeschwieler
 
Things Every Oracle DBA Needs to Know about the Hadoop Ecosystem
Things Every Oracle DBA Needs to Know about the Hadoop EcosystemThings Every Oracle DBA Needs to Know about the Hadoop Ecosystem
Things Every Oracle DBA Needs to Know about the Hadoop EcosystemZohar Elkayam
 
Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527
Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527
Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527Zohar Elkayam
 
What ya gonna do?
What ya gonna do?What ya gonna do?
What ya gonna do?CQD
 
MarkLittle_EnterpriseMiddlewareForThe21stCentury
MarkLittle_EnterpriseMiddlewareForThe21stCenturyMarkLittle_EnterpriseMiddlewareForThe21stCentury
MarkLittle_EnterpriseMiddlewareForThe21stCenturyKostas Mavridis
 
How Open Source is Transforming the Internet. Again.
How Open Source is Transforming the Internet. Again.How Open Source is Transforming the Internet. Again.
How Open Source is Transforming the Internet. Again.Steve Hoffman
 
Things Every Oracle DBA Needs To Know About The Hadoop Ecosystem
Things Every Oracle DBA Needs To Know About The Hadoop EcosystemThings Every Oracle DBA Needs To Know About The Hadoop Ecosystem
Things Every Oracle DBA Needs To Know About The Hadoop EcosystemZohar Elkayam
 
Rapid Cluster Computing with Apache Spark 2016
Rapid Cluster Computing with Apache Spark 2016Rapid Cluster Computing with Apache Spark 2016
Rapid Cluster Computing with Apache Spark 2016Zohar Elkayam
 
Apache Druid: The Foundation of Fortune 500 “Analytical Decision-Making"
Apache Druid: The Foundation of Fortune 500 “Analytical Decision-Making"Apache Druid: The Foundation of Fortune 500 “Analytical Decision-Making"
Apache Druid: The Foundation of Fortune 500 “Analytical Decision-Making"Rommel Garcia
 
Platform Clouds, Containers, Immutable Infrastructure Oh My!
Platform Clouds, Containers, Immutable Infrastructure Oh My!Platform Clouds, Containers, Immutable Infrastructure Oh My!
Platform Clouds, Containers, Immutable Infrastructure Oh My!Stuart Charlton
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoopMohit Tare
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-HadoopNagarjuna D.N
 
IoT is Something to Figure Out
IoT is Something to Figure OutIoT is Something to Figure Out
IoT is Something to Figure OutPeter Hoddie
 
Bi 2.0 hadoop everywhere
Bi 2.0   hadoop everywhereBi 2.0   hadoop everywhere
Bi 2.0 hadoop everywhereDmitry Tolpeko
 
Open stack jobs avoiding the axe
Open stack jobs   avoiding the axeOpen stack jobs   avoiding the axe
Open stack jobs avoiding the axeJim Leitch
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 

Similar to Tech lab 2016-ep01-pepper-data-dez-slides-20160303-final (20)

Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
 
Intro to Big Data
Intro to Big DataIntro to Big Data
Intro to Big Data
 
Things Every Oracle DBA Needs to Know about the Hadoop Ecosystem
Things Every Oracle DBA Needs to Know about the Hadoop EcosystemThings Every Oracle DBA Needs to Know about the Hadoop Ecosystem
Things Every Oracle DBA Needs to Know about the Hadoop Ecosystem
 
Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527
Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527
Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527
 
What ya gonna do?
What ya gonna do?What ya gonna do?
What ya gonna do?
 
MarkLittle_EnterpriseMiddlewareForThe21stCentury
MarkLittle_EnterpriseMiddlewareForThe21stCenturyMarkLittle_EnterpriseMiddlewareForThe21stCentury
MarkLittle_EnterpriseMiddlewareForThe21stCentury
 
How Open Source is Transforming the Internet. Again.
How Open Source is Transforming the Internet. Again.How Open Source is Transforming the Internet. Again.
How Open Source is Transforming the Internet. Again.
 
Things Every Oracle DBA Needs To Know About The Hadoop Ecosystem
Things Every Oracle DBA Needs To Know About The Hadoop EcosystemThings Every Oracle DBA Needs To Know About The Hadoop Ecosystem
Things Every Oracle DBA Needs To Know About The Hadoop Ecosystem
 
Rapid Cluster Computing with Apache Spark 2016
Rapid Cluster Computing with Apache Spark 2016Rapid Cluster Computing with Apache Spark 2016
Rapid Cluster Computing with Apache Spark 2016
 
Apache Druid: The Foundation of Fortune 500 “Analytical Decision-Making"
Apache Druid: The Foundation of Fortune 500 “Analytical Decision-Making"Apache Druid: The Foundation of Fortune 500 “Analytical Decision-Making"
Apache Druid: The Foundation of Fortune 500 “Analytical Decision-Making"
 
Platform Clouds, Containers, Immutable Infrastructure Oh My!
Platform Clouds, Containers, Immutable Infrastructure Oh My!Platform Clouds, Containers, Immutable Infrastructure Oh My!
Platform Clouds, Containers, Immutable Infrastructure Oh My!
 
Big data and hadoop
Big data and hadoopBig data and hadoop
Big data and hadoop
 
002 Introduction to hadoop v3
002   Introduction to hadoop v3002   Introduction to hadoop v3
002 Introduction to hadoop v3
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
IoT is Something to Figure Out
IoT is Something to Figure OutIoT is Something to Figure Out
IoT is Something to Figure Out
 
Bi 2.0 hadoop everywhere
Bi 2.0   hadoop everywhereBi 2.0   hadoop everywhere
Bi 2.0 hadoop everywhere
 
Open stack jobs avoiding the axe
Open stack jobs   avoiding the axeOpen stack jobs   avoiding the axe
Open stack jobs avoiding the axe
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 

More from Dez Blanchfield

Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slidesHot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slidesDez Blanchfield
 
CDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
CDO Summit 2017 - Sydney - 20170315 - Dez BlanchfieldCDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
CDO Summit 2017 - Sydney - 20170315 - Dez BlanchfieldDez Blanchfield
 
Briefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
Briefing Room 20161213 - ep019 - Red Hat - Modern Business StorageBriefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
Briefing Room 20161213 - ep019 - Red Hat - Modern Business StorageDez Blanchfield
 
Young it - digital transformation on a personal level
Young it - digital transformation on a personal levelYoung it - digital transformation on a personal level
Young it - digital transformation on a personal levelDez Blanchfield
 
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BIHot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BIDez Blanchfield
 
Smart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
Smart Cities Expo - World Forum - Sydney - 2016 - Dez BlanchfieldSmart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
Smart Cities Expo - World Forum - Sydney - 2016 - Dez BlanchfieldDez Blanchfield
 
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...Dez Blanchfield
 
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...Dez Blanchfield
 
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...Dez Blanchfield
 
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...Dez Blanchfield
 
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...Dez Blanchfield
 
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...Dez Blanchfield
 
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...Dez Blanchfield
 
OpenStack Australia Government Day 2016 - Dez Blanchfield
OpenStack Australia Government Day 2016 - Dez BlanchfieldOpenStack Australia Government Day 2016 - Dez Blanchfield
OpenStack Australia Government Day 2016 - Dez BlanchfieldDez Blanchfield
 
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...Dez Blanchfield
 
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...Dez Blanchfield
 
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...Dez Blanchfield
 
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...Dez Blanchfield
 
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...Dez Blanchfield
 
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...Dez Blanchfield
 

More from Dez Blanchfield (20)

Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slidesHot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
Hot tech 20170329-idera - health check - maintaining enterprise bi-dez-slides
 
CDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
CDO Summit 2017 - Sydney - 20170315 - Dez BlanchfieldCDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
CDO Summit 2017 - Sydney - 20170315 - Dez Blanchfield
 
Briefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
Briefing Room 20161213 - ep019 - Red Hat - Modern Business StorageBriefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
Briefing Room 20161213 - ep019 - Red Hat - Modern Business Storage
 
Young it - digital transformation on a personal level
Young it - digital transformation on a personal levelYoung it - digital transformation on a personal level
Young it - digital transformation on a personal level
 
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BIHot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
Hot tech 20161221 - ep0022 - IDERA - an ounce of prevention - Forging Healthy BI
 
Smart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
Smart Cities Expo - World Forum - Sydney - 2016 - Dez BlanchfieldSmart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
Smart Cities Expo - World Forum - Sydney - 2016 - Dez Blanchfield
 
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
Hot tech 20161207 - ep0021 - IDERA - Protect your database - High availabilit...
 
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
Hot tech 20161116-ep0019-idera - data modeling in an agile environment-dez-sl...
 
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
Hot tech 20160602-ep008-idera - forward momentum - moving relational beyond t...
 
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
Hot tech 20161005-ep0016-idera - index insanity - how to avoid database chaos...
 
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
Hot tech 20160510-ep005-magnitude software-the_biggest_picture-knowing_your_c...
 
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
Hot tech 20160922-ep0015-dell statistica - edge analytics - the io_t economy ...
 
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
Briefing room 20160920-ep017-striim - a real-time version of the truth-dez-sl...
 
OpenStack Australia Government Day 2016 - Dez Blanchfield
OpenStack Australia Government Day 2016 - Dez BlanchfieldOpenStack Australia Government Day 2016 - Dez Blanchfield
OpenStack Australia Government Day 2016 - Dez Blanchfield
 
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
Hot tech 20161102 - ep0018 - idera - application acceleration - faster perfor...
 
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
Health Data Management - Clear Data - 5 reasons hospital CIOs are extending t...
 
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
Hot tech 20160602 - ep007 - sync sort - big iron meet big data - liberating m...
 
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
Hot tech 20160914-ep0014-idera - who what where and how - why you want to kno...
 
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
Briefing room 20160913-ep0016-sap-anomalies-or-alerts-streaming-analytics-to-...
 
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
Hot tech 20160825-ep0012-dell statistica-embed-analytics-everywhere-enabling-...
 

Recently uploaded

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 

Recently uploaded (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 

Tech lab 2016-ep01-pepper-data-dez-slides-20160303-final

  • 1. @dez_blanchfield Distributed & Parallel computing – a slice of my personal journey • Mainframes • Mini-Computers ( i.e. PDP / VAX ) • Micro-Computers ( 386 PC Servers ) • Desktop Servers ( UltraSPARC 2 ) • High End Proprietary Clusters • Home Built COTS Clusters • OpenSource platforms • Software vs Hardware
  • 2. @dez_blanchfield Distributed & Parallel computing – my own NUMA circa 1994 !!
  • 3. @dez_blanchfield The Challenges Of Large Scale Parallel and Distributed Computing • Parallel vs Distributed • Parallel Computing is Rocket Science • Filesystems evolved to deliver demand for distributed storage • Distributed became more accessible • Distributed also won the cost battle • Open Frameworks are king • Distributed increasing rapidly • Everyone wants their own Hadoop • DIY is fun but comes with wide ranging costs / risks
  • 4. @dez_blanchfield Closed vs Open Platforms – Where have we come from • Proprietary Systems • IBM SP/2 ( AIX ) • Sun Solaris • Sun Grid Engine ( Solaris SPARC + x86 ) • A+ Edition ( Fujitsu Mainframes ) • E10k -> E15k big iron • Custom Systems ( Top 500 list ) • Nations rule the roost here!! • Open Frameworks • PVM / MPI / MPI-CH (Chameleon Lib) • SMP • Beowulf linux clusters • Rocks Cluster Distribution • Hadoop v1 / Hadoop 2 & YARN
  • 5. @dez_blanchfield Searching for little green men at Home ( aka SETI@Home ) • Projects like BOINC and the SETI@Home project, launched in 1999, put distributed computing on millions of screens and introduced many of the core ideas and concepts we now take for granted with distributed computing • SETI@Home put a tiny piece of a super computer on your desktop, and let you participate in a global project for social good, and for the most part it cost you nothing to participate as it only uses spare CPU cycles that would otherwise go to waste
  • 6. @dez_blanchfield 1943 – “5 x Computers should about do it” “I think there is a world market for maybe five computers.” Thomas J. Watson
  • 7. @dez_blanchfield What Happened – How did the shift happen so quickly • Open Source & the FSF • BSD, Minix & Linux • Network & Clustered Filesystems • Networking & In Particular Ethernet • COTS Hardware • Disks & RAM got a lot cheaper • Modern CPU Design • Multi-threading architectures • Multi-core backplanes • Smarter Memory Designs
  • 8. @dez_blanchfield What Happened – Yellow Elephants & a big Yahoo • Doug Cutting • Mike Cafarella • The internet got very big very fast • 2nd generation search engines • Google’s MapReduce and the Google File System paper • 2006 & The Nutch Search Engine • Distributing Index & Search at scale • Yahoo became home to open source Big Data
  • 9. @dez_blanchfield The cost of DISK has plummeted in the last 20 years
  • 10. @dez_blanchfield The cost of RAM has plummeted in the last 20 years
  • 11. @dez_blanchfield Where are we today – Pitfalls & Brick Walls • Distributed Computing is Hard !! • Now everyone CAN try it • But not everyone SHOULD do it • Rocket science does still apply • Small clusters with one or two workloads are safe • Once you scale out to hundreds of workloads and thousands of users you will find pain, and lots of it • You will not solve your performance issues in a timely or cost effective manner on your own
  • 12. @dez_blanchfield Big Hadoop Systems need automated Performance Management • Automated Performance Management is a must, humans can’t respond fast enough to resolve multi-workload issues, Ganglia just isn’t enough • Systems are required to manage systems, to monitor, discover, and respond instantly, to deliver good outcomes from investments in Hadoop • Even sophisticated large Hadoop implementations in Govt. & Large Enterprise who do have rocket scientists struggle with performance issues which arise from multi-tenant multi-workload use of Hadoop