The Web of Data as a NoSQL Database                                Sam Tunnicliffe                                     @be...
entity retrievalusing xDBC & ORM  or custom SQL       version 1.0
entity retrievalusing store specific   protocols and      clients                       schema-last
sharding strategy                           may be encapsulated schema knowledge          by clients/servers orresides in ...
What if you could use the Web as a database?
loose couplinghttp://www.flickr.com/photos/11950mike/4707805552
http://www.flickr.com/photos/juniorvelo/2861770108 outsource dataacquisition costs
proven, extreme scalabilityhttp://www.flickr.com/photos/krayker/2268587409
http://www.flickr.com/photos/ranjithsiji/4897513366                                                      leverage existing...
http://www.flickr.com/photos/mandy_pantz/2512569926   more and more diverse data
serendipityhttp://www.flickr.com/photos/sylvar/3291628571
http://www.flickr.com/photos/zivkovic/5850008238                   high latency
giving away control                      http://www.flickr.com/photos/kecko/4052526123
variable availabilityhttp://www.flickr.com/photos/numberstumper/3057162582
global names
1969-059Aglobal names
1969-059A               spacecraft/1969-059Aglobal names
1969-059A                                      spacecraft/1969-059A               nasa.dataincubator.org/spacecraft/1969-0...
1969-059A                                          spacecraft/1969-059A                   nasa.dataincubator.org/spacecraf...
mass 28801.1things have attributes
mass 28801.1                         name “Apollo 11 CSM”things have attributes
mass 28801.1                           name “Apollo 11 CSM”                         launch launch/1969-059things have attr...
http://purl.org/net/schemas/space/mass 28801.1              http://xmlns.com/foaf/0.1/name “Apollo 11 CSM”    http://purl....
http://www.flickr.com/photos/juniorvelo/457197656links
dereference to get data
DNS is your routing componenthttp://www.flickr.com/photos/cjschmit/4623783487
subject                      predicate                                  objectRDF and linked data
1969-59A                      launch                               launch/1969-59RDF and linked data
1969-59Amass: 28801.1name: Apollo 11 CSM         launch                                     launch/1969-59                ...
nasa.gov                    1969-059A                                     Apollo 11                                       ...
routes betweenlinked entities is explicit in data DNS does the  hard work             entity lookups                      ...
realtime discovery   of additional   data sourcesweb enabled data
expandeddata universe                simplified access                    protocol                but some things         ...
http://www.flickr.com/photos/vhanes/3722327096                                                 local caches
outcomes           http://www.flickr.com/photos/carbonnyc/293733099
shared efforthttp://www.flickr.com/photos/toffehoff/244870160/
more simple data integrationhttp://www.flickr.com/photos/thedailyenglishshow/3947409618/
more linked data                   http://www.flickr.com/photos/ninjanoodles/114033269
http://www.flickr.com/photos/asurroca/66225176network effects
use the web as a database by...●  using global names  ●    for entities  ●    for attributes●  using standard formats●  ma...
http://talis.comthank you
Upcoming SlideShare
Loading in …5
×

The Web of Data as a Massively Scalable NoSQL Database

828 views
677 views

Published on

Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods. It leverages fundamental characteristics of Web architecture (loose coupling, decentralization, simple and well defined access patterns) and builds on RDF (a W3C standard data model). We'll give a brief overview of RDF and show how Linked Data principles decouple its use for interoperability and data modelling from the "heavyweight" Semantic Web baggage that has long been considered a barrier to entry.

The characteristics that allowed the Web to scale so quickly and widely include decentralization, a massively distributed architecture, an absence of integrity constraints, and weak guarantees about consistency. The Web of data aims to achieve the same end for data, promoting it to a first class Web citizen and making linking data as easy and ubiquitous as linking HTML documents. Many of the same characteristics that make the Web so successful and scalable also apply to the Web of Data.

The rise of NoSQL databases is a response to the changing requirements of Web scale data. Typically these databases deliver performance at scale by relaxing consistency guarantees, eschewing transactions, using flexible data models and distributed architectures, and placing constraints on access patterns. Linked Data and RDF turn the Web itself into a decentralized and massively scalable sparse column store with globally identifiable column names; an enormous, globally distributed repository of linked, structured data.

In this talk we will highlight the common characteristics of various flavors of NoSQL database and the Web of Data. We will also discuss important differences, and outline the trade-offs involved when deciding on a storage solution for your application data, such as the importance of query performance, availability or ACID transactions. We will be delving into concerns around:

Scalability
Data portability
Common query languages
Tool chain interoperability

Published in: Technology, Sports
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
828
On SlideShare
0
From Embeds
0
Number of Embeds
8
Actions
Shares
0
Downloads
19
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

The Web of Data as a Massively Scalable NoSQL Database

  1. 1. The Web of Data as a NoSQL Database Sam Tunnicliffe @beobal Talis Systems Ltd http://talis.com http://github.com/talis NoSQL Now! 2011
  2. 2. entity retrievalusing xDBC & ORM or custom SQL version 1.0
  3. 3. entity retrievalusing store specific protocols and clients schema-last
  4. 4. sharding strategy may be encapsulated schema knowledge by clients/servers orresides in application may require the or access layer application to handle routing/addressing as well as managing store specific protocols and clients sharded, polyglot storage
  5. 5. What if you could use the Web as a database?
  6. 6. loose couplinghttp://www.flickr.com/photos/11950mike/4707805552
  7. 7. http://www.flickr.com/photos/juniorvelo/2861770108 outsource dataacquisition costs
  8. 8. proven, extreme scalabilityhttp://www.flickr.com/photos/krayker/2268587409
  9. 9. http://www.flickr.com/photos/ranjithsiji/4897513366 leverage existing infrastructure
  10. 10. http://www.flickr.com/photos/mandy_pantz/2512569926 more and more diverse data
  11. 11. serendipityhttp://www.flickr.com/photos/sylvar/3291628571
  12. 12. http://www.flickr.com/photos/zivkovic/5850008238 high latency
  13. 13. giving away control http://www.flickr.com/photos/kecko/4052526123
  14. 14. variable availabilityhttp://www.flickr.com/photos/numberstumper/3057162582
  15. 15. global names
  16. 16. 1969-059Aglobal names
  17. 17. 1969-059A spacecraft/1969-059Aglobal names
  18. 18. 1969-059A spacecraft/1969-059A nasa.dataincubator.org/spacecraft/1969-059Aglobal names
  19. 19. 1969-059A spacecraft/1969-059A nasa.dataincubator.org/spacecraft/1969-059A http://nasa.dataincubator.org/spacecraft/1969-059AURIs for entity names
  20. 20. mass 28801.1things have attributes
  21. 21. mass 28801.1 name “Apollo 11 CSM”things have attributes
  22. 22. mass 28801.1 name “Apollo 11 CSM” launch launch/1969-059things have attributes
  23. 23. http://purl.org/net/schemas/space/mass 28801.1 http://xmlns.com/foaf/0.1/name “Apollo 11 CSM” http://purl.org/net/schemas/space/launch launch/1969-059URIs for attribute names
  24. 24. http://www.flickr.com/photos/juniorvelo/457197656links
  25. 25. dereference to get data
  26. 26. DNS is your routing componenthttp://www.flickr.com/photos/cjschmit/4623783487
  27. 27. subject predicate objectRDF and linked data
  28. 28. 1969-59A launch launch/1969-59RDF and linked data
  29. 29. 1969-59Amass: 28801.1name: Apollo 11 CSM launch launch/1969-59 launch date: 16 July 1969 launch vehicle: Saturn V RDF and linked data weather: clear, dry
  30. 30. nasa.gov 1969-059A Apollo 11 geonames.org Capelaunch/1969-59 Canaveral Washington D.C. launch date: 16 July 1969 launch vehicle: Saturn V weather: clear, dry United Mexico States alternate name: Stati Uniti alternate name: Estados Unidos alternate name: アメリカ合衆国 population: 311,874,000 RDF and linked data Canada
  31. 31. routes betweenlinked entities is explicit in data DNS does the hard work entity lookups come from authoritative sources web enabled data
  32. 32. realtime discovery of additional data sourcesweb enabled data
  33. 33. expandeddata universe simplified access protocol but some things are now outside of your control web enabled data
  34. 34. http://www.flickr.com/photos/vhanes/3722327096 local caches
  35. 35. outcomes http://www.flickr.com/photos/carbonnyc/293733099
  36. 36. shared efforthttp://www.flickr.com/photos/toffehoff/244870160/
  37. 37. more simple data integrationhttp://www.flickr.com/photos/thedailyenglishshow/3947409618/
  38. 38. more linked data http://www.flickr.com/photos/ninjanoodles/114033269
  39. 39. http://www.flickr.com/photos/asurroca/66225176network effects
  40. 40. use the web as a database by...● using global names ● for entities ● for attributes● using standard formats● making data dereferenceable● linking to other data http://www.flickr.com/photos/ryanwick/3461847552
  41. 41. http://talis.comthank you

×