2016-06-29
Beyond TCO
Architecting Hadoop for adoption and data applications
Reid Levesque – Head, Solution Engineering
Introduction
Topics
Technology • Use cases • Deployment • Impact • Next steps
Technology – Let’s talk Hadoop
Every company is a technology company… some just don’t know it yet.
Traditional systems under pressure
Conventional wisdom
• Put the code on an Application Server
• Move the data to/from database
• Move the data to/from NAS
Reality check
• This works well for small amounts of data
• As data volumes increase, this design falls apart
Hadoop to the rescue
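Hadoop’s answer to the pressure above is to move the code to the data instead of shuttling data to an application server. A minimal sketch of that pattern, assuming PySpark and using hypothetical paths and column names (an illustration, not the actual workload from this rollout):

    # Minimal PySpark sketch: the aggregation runs on the cluster nodes that
    # hold the data, instead of pulling every record back to an app server.
    # Paths and column names below are illustrative assumptions.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("code-to-data-sketch").getOrCreate()

    trades = spark.read.parquet("hdfs:///data/trades/2016/")  # hypothetical dataset

    daily_totals = (trades
                    .groupBy("trade_date", "desk")
                    .agg(F.sum("notional").alias("total_notional")))

    daily_totals.write.mode("overwrite").parquet("hdfs:///data/reports/daily_totals/")

Only the small aggregated result ever leaves the cluster; the heavy scan happens where the blocks already live.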
How do we get Hadoop into the organization?
How about these use cases?
File archive + Hadoop
• Data is online; no need for tape backup
• Cheaper than NAS / SAN
• Increased performance / scalability
• Metadata is easier to get; all the data is in one spot

Database replacement + Hadoop
• Improved performance
• Lower TCO
• Reduced dependence on proprietary software
• Reduce RDBMS licensing

ETL off-load + Hadoop
• Reduced operational cost for analysis
• Improved functionality with stored XML
• Lower TCO

Data-intensive grid compute analytics + Hadoop
• Additional analytic capability
• Better hardware utilization
• Lower platform management
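As one illustration of the ETL off-load use case above: land raw extracts in HDFS and run the transform on the cluster instead of in the RDBMS. A hedged sketch, assuming PySpark; the paths and columns are hypothetical:

    # Illustrative ETL off-load sketch (hypothetical paths and columns):
    # read raw landed files, standardize them, publish partitioned Parquet
    # for downstream analysis rather than transforming inside the database.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("etl-offload-sketch").getOrCreate()

    raw = spark.read.option("header", True).csv("hdfs:///landing/positions/*.csv")

    clean = (raw
             .withColumn("as_of_date", F.to_date("as_of_date", "yyyy-MM-dd"))
             .withColumn("market_value", F.col("market_value").cast("double"))
             .dropDuplicates(["account_id", "as_of_date", "instrument_id"]))

    (clean.write
          .mode("overwrite")
          .partitionBy("as_of_date")
          .parquet("hdfs:///warehouse/positions/"))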
Not so much
• File archive + Hadoop
• Data-intensive grid compute analytics + Hadoop
• Database replacement + Hadoop
• ETL off-load + Hadoop
Every one of these pitches comes down to the same thing: TCO.
Which use case did work?
• The current batch run was taking 4 hours, which limited the way users did their jobs
• Users wanted interactive response times to design and test their financial models
• This was net new functionality that could only be achieved in Hadoop (see the sketch below)
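The pattern behind that win is fanning the model out across the cluster so scenarios are evaluated in parallel rather than serially in a multi-hour batch. A toy sketch of the idea, assuming PySpark; the model, names, and sizes are illustrative, not the bank’s actual code:

    # Hedged grid-compute sketch: distribute a stand-in pricing function
    # across the cluster so many scenarios run in parallel.
    from pyspark.sql import SparkSession

    def price_scenario(scenario):
        """Stand-in for a financial model; returns (scenario_id, value)."""
        rate_shift, fx_shift = scenario["rate_shift"], scenario["fx_shift"]
        value = 1_000_000 * (1 + rate_shift) * (1 + fx_shift)  # toy calculation
        return (scenario["id"], value)

    spark = SparkSession.builder.appName("scenario-grid-sketch").getOrCreate()

    scenarios = [{"id": i, "rate_shift": i * 0.0001, "fx_shift": -i * 0.00005}
                 for i in range(100_000)]

    results = spark.sparkContext.parallelize(scenarios).map(price_scenario).collect()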
Now TCO makes more sense
• File archive + Hadoop
• Data-intensive grid compute analytics + Hadoop
• Database replacement + Hadoop
• ETL off-load + Hadoop
With Hadoop TCO covered, previous use cases are now more compelling.
How do we deploy this?
Which distribution?
Pick one:
Time to pick the hardware
Is this true?
Commodity hardware + commodity networking = bad architecture
Before there was Hadoop, there were enterprise IT standards
To name a few conflicts during the rollout…
• Local account UID / names
• OS settings
• Root access
• File locations
• Standard mount sizes
• Enterprise Active Directory
• Monitoring systems
Hadoop is NOT flexible on deployment requirements
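One way to live with that rigidity is to check each node against Hadoop’s expectations before it joins the cluster. A rough preflight sketch in Python; the thresholds, file paths, and mount points are assumptions for illustration, not the standards from this rollout:

    # Illustrative node preflight check: verify a few OS settings that Hadoop
    # commonly expects and that often conflict with default enterprise builds.
    import os

    def read_setting(path):
        with open(path) as f:
            return f.read().strip()

    checks = [
        ("vm.swappiness <= 10",
         int(read_setting("/proc/sys/vm/swappiness")) <= 10),
        ("transparent hugepages disabled",
         "[never]" in read_setting("/sys/kernel/mm/transparent_hugepage/enabled")),
        ("data disks mounted",  # assumes 12 JBOD mounts at /data/1 .. /data/12
         all(os.path.ismount(f"/data/{i}") for i in range(1, 13))),
    ]

    for name, ok in checks:
        print(f"{'PASS' if ok else 'FAIL'}: {name}")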
Who does the work?
Single team including:
• Dedicated infrastructure team (Compute, Network, Data Center, Operations)
• Dedicated Hadoop team (sysadmin/operations, engineering)
• Hardware vendor engineers
• Hadoop distribution engineers
Into production we go!
What was the impact?
Changing perceptions
Impact across the organization
Infrastructure
• Networking / Data Center designs
• Relationship with storage, cloud, virtualization capabilities
• Generating analytic use cases
Development
• Mega-attractor for talent
• Application consolidation
• Shifting from IT to business focus
Management
• Understanding (or accepting) the new paradigm
• Cross-department architecture alignment
• Data focus rather than application focus
Business
• Continuously evolving understanding of capabilities / possibilities
• Next-generation IT with a rapidly evolving ecosystem
• Self-service innovation for business users
Lessons Learned
Hadoop doesn’t remove hardware maintenance
Hadoop development is still development!
New paradigm – requires skilled developers
A whole new set of error messages to decode
There aren’t that many experts
Where do we go next?
Self-service tools
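Self-service started with simple things, such as letting users query files (a 2 GB CSV, for example) that desktop tools cannot open; BI tools like Arcadia or Datameer drive roughly this kind of query on the users’ behalf. A minimal sketch of the underlying idea, assuming Spark SQL and hypothetical file and column names:

    # Self-service sketch: expose a large CSV as a table that can be queried
    # with plain SQL. File name and columns are hypothetical.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("self-service-sketch").getOrCreate()

    df = (spark.read
          .option("header", True)
          .option("inferSchema", True)
          .csv("hdfs:///user/shared/transactions_2016.csv"))
    df.createOrReplaceTempView("transactions")

    spark.sql("""
        SELECT branch, COUNT(*) AS txn_count, SUM(amount) AS total_amount
        FROM transactions
        GROUP BY branch
        ORDER BY total_amount DESC
    """).show(20)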
Selling Hadoop internally
• This journey has taught me a lot about Hadoop; more than most people at the organization
• The biggest tasks are educating the organization and doing simple things as a first step
Thank You


Editor's Notes

  • #2 Beyond Total Cost of Ownership
  • #3 I’m Reid Levesque. I work at RBC and this is the story of how we did the impossible: We convinced a Canadian Bank to deploy Hadoop.
  • #4 I want to tell you about my experiences getting Hadoop set up in a bank.
  • #5 I want to make sure that everyone is on the same page with why Hadoop is so awesome.
  • #6 Big Data was and is one of the hottest buzzwords.
  • #7 We had a lot of “traditional” applications.
  • #8 Hadoop is clearly the solution and I’m not going to bore you with how Hadoop works. As a techie it’s easier to find an appropriate technology than it is to get it embedded into an organization and change the culture.
  • #11 Doing the same thing with a different tool is never attractive enough. All these use cases focus on lowering the Total Cost of Ownership. By the time you add up all the project costs of switching to Hadoop and setting it up from scratch, there are no cost savings left.
  • #12 In the end we got it down to 5 minutes
  • #15 At a bank, we are not in a position to run our own Hadoop cluster from open source components. We need help and that’s what the Hadoop distros do. Turns out they’re all about the same. Each has its own special sauce but it didn’t really matter for our use case. We picked Hortonworks and off we went.
  • #17 Remember when I said “Hadoop moves the code to the data”? Well, that’s not entirely accurate. There is enough data movement between nodes in a Hadoop cluster that, if not managed correctly, it can bring down your network. (Story about browning out the cluster) For that reason we opted for fairly commodity hardware but not commodity networking.
  • #18 Current standards: specific names for accounts; RAID 5; specific folders (e.g. /app); weeks to get AD accounts. Guess what? Hadoop doesn’t fit into those standards very well. Hadoop has a very specific way of being set up, or it just won’t work. (Story about RAID 0 on all disks)
  • #19 Traditional teams would be set up with developers in one team, operations in another, and infrastructure in yet another. This doesn’t work for Hadoop; too many moving parts. We had all of those as well as vendor support in one team.
  • #22 During the initial rollout there were many naysayers and teams who were interested but didn’t want to bet the farm on it. For 2 weeks after the rollout, our success was the thing no one wanted to talk about. But then the floodgates opened and everyone was coming to get a piece of Big Data.
  • #24 The frameworks in Hadoop (MapReduce, Spark, Apex, etc.) take care of a lot of the hard bits like parallelization, elasticity, etc. However, we still need to do development to get any new functionality.
  • #26 When you start digging into the day-to-day operations of the business users, you see simple things that are difficult because of traditional tools. Things like looking at the contents of a 2 GB CSV file can be impossible. This has led us to use BI tools on top of Hadoop to let the users get value out of their data without IT support; Arcadia and Datameer are great examples. Once the users can see their data, they come up with great use cases and really start getting value out of the platform.
  • #27 It ends up that the first use case is quite simple: get data into Hadoop and do something like query across many datasets. It turns out that this is so revolutionary to so many business users that they soon want to do more work on Hadoop. Talk about use cases