Claremont Report on Database Research: Research Directions (Raghu Ramakrishnan)

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    2 Favorites

    Claremont Report on Database Research: Research Directions (Raghu Ramakrishnan) - Presentation Transcript

    1. Web Data Management Raghu Ramakrishnan
    2. QUIQ Lessons
      • Structured data management powers scalable collaboration environments
      • ASP
      • Multi-tenancy
      • Massively distributed
      • Fine-grained permissions, hierarchical acls
      • RDBMSs were a lousy fit
    3. Cloud Computing: Computing as a Service Cloud Computing CPU Intensive Data Intensive Analytic E.g., SSDS, Hadoop Packaged Software High-throughput E.g., Condor “ Transactional” Storage & Serving E.g., PNUTS, S3, SSDS, UDB
    4. Implications
      • Data management as a service
        • Scientists and others who’ve resisted (installing, maintaining, and) using DBMSs will find it much easier to reap the benefits
        • “ Data centers” and “Computing Centers” will come into vogue again
      • Hosted back-ends and RAD tools will make Web application development accessible to all
        • The Web is becoming open
          • E.g., OpenSocial, OpenID
          • Ideas will be the most valuable currency, not the wherewithal to build complex systems
      • Paradigm shifts possible for how we do research in many fields
        • Build applications that embed your algorithms and test them directly in the field—Computer Scientists can interact directly with users (ironically, this would still be a breakthrough of sorts after four decades!)
        • Many other disciplines (e.g., Sociology, microeconomics) can design and conduct online experiments involving unprecedented numbers of participants
    5. PNUTS: DB in the Cloud CREATE TABLE Parts ( ID VARCHAR, StockNumber INT, Status VARCHAR … ) Parallel database Geographic replication Indexes and views Structured, flexible schema Hosted, managed infrastructure E 75656 C A 42342 E B 42521 W C 66354 W D 12352 E F 15677 E E 75656 C A 42342 E B 42521 W C 66354 W D 12352 E F 15677 E E 75656 C A 42342 E B 42521 W C 66354 W D 12352 E F 15677 E
      • Goal:
      • Make it easier for applications to reason about updates and cope with asynchrony—alternative to “transactions” in an asynchronous world
      • What happens to a record with primary key “Brian”?
      • Guarantees:
      • Every reader will always see some consistent, but possibly stale version
      • Readers can request a more up-to-date version, but may pay extra latency
        • Special case: Critical read (writer/readers see their own writes)
      • Writers can verify that the record is still at the version they expect
      Basic Consistency Model Time Record inserted Update Update Delete v. 1 v. 2 v. 3 Generation 1 Record inserted Update Update Delete v. 1 v. 2 v. 4 Generation 2 Update v. 3 Record inserted Delete v. 1 Generation 3
    6. Lots of Issues to Re-think
      • Massive distribution & replication
        • Asynchrony
        • Availability
        • Consistency
      • DBA to the world
        • Auto-tuning
        • Multi-tenancy
        • Access control (granularity, online ids)
        • Encryption
      • App-support
        • Caching
    7. Querying the Web
      • Search will become more semantic—best-effort match-making between:
        • Query intent (NLP, query logs …)
        • Interpreted web content
      • Deep web has a lot of structured data
        • How we get a handle on it is an interesting problem
        • But this is only part of the problem … lots of data not here
      • Semantic web isn’t working
      • Site-wrapping doesn’t scale
      • Solutions?
        • Domain-wrapping
        • Mass collaboration
        • ??

    + infobloginfoblog, 11 months ago

    custom

    666 views, 2 favs, 3 embeds more stats

    This is a set of slides from the Claremont Report o more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 666
      • 566 on SlideShare
      • 100 from embeds
    • Comments 0
    • Favorites 2
    • Downloads 0
    Most viewed embeds
    • 98 views on http://infoblog.stanford.edu
    • 1 views on http://static.slideshare.net
    • 1 views on http://infoblog.stanford.edu.sixxs.org

    more

    All embeds
    • 98 views on http://infoblog.stanford.edu
    • 1 views on http://static.slideshare.net
    • 1 views on http://infoblog.stanford.edu.sixxs.org

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories