Your SlideShare is downloading. ×
  • Like
Upcoming SlideShare
Loading in...5

Thanks for flagging this SlideShare!

Oops! An error has occurred.


Now you can save presentations on your phone or tablet

Available for both IPhone and Android

Text the download link to your phone

Standard text messaging rates apply


Published in Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads


Total Views
On SlideShare
From Embeds
Number of Embeds



Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

    No notes for slide
  • - put examples on this,e.g. disconnected, etc - mention ‘to the N’ capacity


  • 1.
    • Mike Miller
    • Cofounder, Chief Scientist
    Integrated Data Management, Search, and Analytics
  • 2. Our background
    • Goal: understand universe at most fundamental of levels
    • 27 km, 100B+ sensors, 100+ PB/sec raw
    • Architected, commissioned global data infrastructure
    Large Hadron Collider
  • 3. The problem
    • 150+ Data Centers
    • 30+ countries
    • 150k+ cores Text Things break. Distribution/access must be transparent. Data is king!
  • 4. What’s needed
    • Scalable : volume, rate, concurrency
    • Distributed : within and between data centers. Offline devices.
    • Flexible : data format evolves. structured and unstructured
    • Integrated : remove artificial boundaries between storage, analyses, and search
    Applies to businesses from the smallest to largest of scales dbcore storage Analytics Search Other API DSL Visualization
  • 5. How: send Compute to the data
    • Distributed Analytics
    • parallel algorithms
    • dynamically provisioned resources
    • in-DB map reduce
    • multi-variate analysis
    • machine learning
    • text and metadata search
    • artificial intellegence
    ‘ Node’
    • Networked Computing Element
    • commodity servers
    • mobile devices
    • switches
    • heterogeneous hardware and OS
    write record Horizontally Scalable DB filtered replication
    • Secondary
    • Deployment
    • disaster recovery
    • geographic distributed access
    • filtered data
    disconnected devices read results Edge Database Cluster results at the edge
  • 6. History and team
    • 5+ years as a team (MIT/Cloudant)
    • Broad expertise in sensor data, multivariate analyses, distributed systems, global data management
    • Record of success in great challenges
    • Bring new vision to the problem space
    • Series A Dec. 2008
  • 7. go-to-market
    • Hosted self-signup service : Amazon EC2. Target rapidly growing data-driven companies. Consumption based pricing model.
    • Dedicated Hosting : Cloud providers and/or private data center.
    • Enterprise package : software, services, support at the largest of scales.
  • 8. From development to revenue Real-Time Search and Data Aggregation Advertising and Clickstream Analytics Web Applications & Games Cloud Configuration System Integration 400+ beta users. 1B+ documents. ~500M transactions / day
  • 9. Going forward
    • Core technology largely established.
    • Runway through 2010
    • Aggressively scale current offering for small-medium businesses
    • Leverage key strengths to expand into enterprise markets. Energy, clean tech and smart grid are excellent fits
    • Our ask: help finding the right problem set in the enterprise
  • 10.
    • Changing the way you manage, share, and analyze data
    Integrated Data Management, Search, and Analytics