Strata+Hadoop World | New York, NY | Sept 29-Oct 1, 2015
About the talk:
Hadoop is often used as a cost-effective augmentation of an enterprise data warehouse, or a new business intelligence tool. A greater opportunity exists if a broader range of developers could harness the power of Hadoop analytics not just for BI, but also for operational applications. The developer challenges to implement solutions on Hadoop are significant.
In this presentation, we will provide a brief background on the Cask Data Application Platform (CDAP), and then delve into three customer case studies. The case studies will show how organizations can accelerate Hadoop solution development and deployment across a range of use cases and vertical markets. In all of the examples, we will show how a data application platform can help organizations more effectively use existing talent, minimizing the cost of specialized Hadoop developers or consultants.
Each of these customers dealt with a common set of challenges driven by the evolution of Hadoop. Tremendous innovation around Hadoop and NoSQL technologies has resulted in a proliferation of data storage and processing options that require unique skills. The tight coupling of data, logic, and infrastructure makes reuse difficult. Finally, any operational systems require life cycle management, which is largely absent from the current Hadoop ecosystem.
Much like application servers such as Weblogic opened the door for application innovation on relational databases, the Hadoop ecosystem will progress faster with a data application platform. It has been about a year since the CDAP was contributed to open source to address the needs of developers. Since then, thousands of developers have downloaded the platform or cloned the GitHub repo, highlighting the project’s role as the leading platform for developers on Hadoop.
Jonathan Gray, founder and CEO of Cask, is an entrepreneur and software engineer with a background in startups, open source, and all things data. Prior to founding Cask, Jonathan was a software engineer at Facebook where he helped drive HBase engineering efforts, including Facebook Messages and several other large-scale projects, from inception to production.
An open source evangelist, Jonathan was responsible for helping build the Facebook engineering brand through developer outreach and refocusing the open source strategy of the company. Prior to Facebook, Jonathan founded Streamy.com, where he became an early adopter of Hadoop and HBase and is now a core contributor and active committer in the community.
Jonathan holds a bachelor’s degree in electrical and computer engineering from Carnegie Mellon University.