Mike Miller is the Co-Founder and Chief Scientist of Cloudant, a company that provides a globally distributed data layer for web applications. He has a background in machine learning, analysis, big data, and distributed systems. Cloudant was founded in 2009 by MIT data scientists and provides a hyper-scalable document database and analytics platform that runs across multiple data centers.
5. The face of big data
“The future is stranger and sooner than you think”
Reid Hoffman, LinkedIn/Greylock
Cloudant, 9-26-2012 5
6. Perfect Storm
Parallel
Processing
Big Data
HTML5/JS
Mobile 9M Trained
Developers
Cloudant, 9-26-2012 6
7. Focus on your Application
not data operations
Cloudant, 9-26-2012 7
8. If your data is stuck in the warehouse...
... you’re losing
Cloudant, 9-26-2012 8
9. Data Layer for the Web
Founded (2009) by leading MIT data
scientists
Funded by Y Combinator & Avalon
Global network of 20+ data centers
-- Application Data Network (ADN)
Built on leading NoSQL standard:
most durable data store on planet
10,000 users and growing.
Cloudant: Akamai of dynamic content
Cloudant, 9-26-2012 9
10. Cloudant Product Line
• Application State
Hyper-Scalable Document Store (JSON+HTTP)
MVCC
Secondary indexes for flexible query
• Application Data Security
Accounts/API keys, data sharing, permission roles
• Application Analytics
Fully Integrated (Incremental) MapReduce engine
• Application Search
Fully Integrated (Incremental) Lucene + Geospatial
API Compatible
• Application Object Storage
images, audio, video...
• Application State Distribution
cloud <==> tablet <==> PC <==> mobile
Cloudant, 9-26-2012 10
11. Cloudant Install
You do this:
We give you:
That’s It
Cloudant, 9-26-2012 11
12. API Examples
Write a doc...from the browser
No client install necessary
Cloudant, 9-26-2012 12
13. API Examples
Create Secondary Indexes
Query Those indexes
Cloudant, 9-26-2012 13
30. Big and Getting Bigger
• And of course, we are hiring
Languages
erlang, scala, c, javascript, python, clojure, html5, iOS, Android, ruby/chef
Sample problems in the Seattle office
Create file format optimized for (huge) structured time-series data
Integrate Cubism into two-tier application stack
Profile creation of 100M databases (real customer)
PIG / HIVE integration
Prototype read-in-place Hadoop connector
Cloudant, 9-26-2012 30