Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Techniques                 used in    RDF Data Publishing                      atNature Publishing Group           Tony Ha...
Nature Publishing Group● NPG a division of Macmillan (a privately  owned company)● Publishes ~120 titles in all  ● 34 Natu...
Semantic Publishing at NPG• Prior Work  •   RSS 1.0 webfeeds  •   HTML metadata  •   PDF metadata (XMP)  •   Urchin – RSS ...
Public Data         4
NPG by Numbers                 5
NPG Ontology               6
Cloud Hosting•   TSO OpenUp® SaaS platform•   Offers 5store as a triplestore•   Scale-out architecture (C/C++)•   Supports...
data.nature.com                  8
data.nature.com/query                9
Hub      10
Hub: Problem               11
Hub: Solution                12
Hub: Method              13
XMP      14
Building the Graph                15
Local Hosting•   Apache TDB•   Single-node architecture (Java)•   Supports up to ~1.5b triples (tested)•   SPARQL 1.1     ...
Data Publishing                  17
Hub Finder             18
Hub Finder: Results                19
Techniques        20
Naming Architecture                21
Naming Policynpg:     http://ns.nature.com/terms/npgg:    http://ns.nature.com/graphs/Object     Example          UsageGra...
Publishing             23
Monitoring             24
ETL Process              25
Datastore: Imports                26
Datastore: Exports                27
Contractsnpgg:affiliations                                                        void:property vcard:region ;  a npg:Grap...
Linked Data API•   ./api/articles [.json, .rdf, .xml]•   ./api/articles?hasProduct.pcode=ng•   ./api/contributors?familyNa...
Closing          30
Positions Available                   goo.gl/bYIt8   www.linkedin.com/jobs?jobId=4890057&viewJob                       31
Information                data.nature.com         developers.nature.com/docs              datahub.io/group/npg           ...
Upcoming SlideShare
Loading in …5
×

Techniques used in RDF Data Publishing at Nature Publishing Group

12,540 views

Published on

Lotico London Semweb Meetup - March 2013

Published in: Technology
  • If you are looking for trusted essay writing service I highly recommend ⇒⇒⇒WRITE-MY-PAPER.net ⇐⇐⇐ The service I received was great. I got an A on my final paper which really helped my grade. Knowing that I can count on them in the future has really helped relieve the stress, anxiety and workload. I recommend everyone to give them a try. You'll be glad you did.
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Don't forget another good way of simplifying your writing is using external resources (such as ⇒ www.HelpWriting.net ⇐ ). This will definitely make your life more easier
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Don't forget another good way of simplifying your writing is using external resources (such as ⇒ www.WritePaper.info ⇐ ). This will definitely make your life more easier
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Follow the link, new dating source: ♥♥♥ http://bit.ly/2u6xbL5 ♥♥♥
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Dating for everyone is here: ❤❤❤ http://bit.ly/2u6xbL5 ❤❤❤
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here

Techniques used in RDF Data Publishing at Nature Publishing Group

  1. Techniques used in RDF Data Publishing atNature Publishing Group Tony Hammond Data Architect, NPG March 5, 2013
  2. Nature Publishing Group● NPG a division of Macmillan (a privately owned company)● Publishes ~120 titles in all ● 34 Nature branded titles ● 53 academic and society journals ● 16 magazines (incl. Scientific American)● ~1000 employees,17 offices (5 continents)● ~30 society partners● Databases, conferences/events, multimedia 2
  3. Semantic Publishing at NPG• Prior Work • RSS 1.0 webfeeds • HTML metadata • PDF metadata (XMP) • Urchin – RSS aggregator • OAI-PMH, OpenSearch (SRU), OpenURL• Linked Data Apps • Public Data: test viability of data publishing • Hub: application of technology internally 3
  4. Public Data 4
  5. NPG by Numbers 5
  6. NPG Ontology 6
  7. Cloud Hosting• TSO OpenUp® SaaS platform• Offers 5store as a triplestore• Scale-out architecture (C/C++)• Supports up to a trillion triples• 150,000tps load speed• SPARQL 1.0, with 1.1 features (aggregates, etc) 7
  8. data.nature.com 8
  9. data.nature.com/query 9
  10. Hub 10
  11. Hub: Problem 11
  12. Hub: Solution 12
  13. Hub: Method 13
  14. XMP 14
  15. Building the Graph 15
  16. Local Hosting• Apache TDB• Single-node architecture (Java)• Supports up to ~1.5b triples (tested)• SPARQL 1.1 16
  17. Data Publishing 17
  18. Hub Finder 18
  19. Hub Finder: Results 19
  20. Techniques 20
  21. Naming Architecture 21
  22. Naming Policynpg: http://ns.nature.com/terms/npgg: http://ns.nature.com/graphs/Object Example UsageGraph npgg:gadgets gadgets:33 ex:title "Title" npgg:gadgets .Class npg:Gadget gadgets:33 a npg:Gadget npgg:gadgets .Object npg:hasGadget _:12 npg:hasGadget gadgets:33 npgg:_ .PropertyData ex:title gadgets:33 ex:title "Title" npgg:gadgets .PropertyInstance gadgets:33 gadgets:33 ex:title "Title" npgg:gadgets . 22
  23. Publishing 23
  24. Monitoring 24
  25. ETL Process 25
  26. Datastore: Imports 26
  27. Datastore: Exports 27
  28. Contractsnpgg:affiliations void:property vcard:region ; a npg:Graph, void:Dataset ; void:triples "183483"^^xsd:int dcterms:description "Graph of npg:Affiliation objects" ; ], [ dcterms:issued "2013-02-15"^^xsd:date ; void:property vcard:organisation-name ; dcterms:modified "2013-02-15"^^xsd:date ; void:triples "694290"^^xsd:int dcterms:publisher [ ], [ a foaf:Organization ; void:property vcard:locality ; foaf:mbox <mailto:developers@nature.com> ; void:triples "412042"^^xsd:int foaf:name "Nature Publishing Group" ], [ ]; void:property vcard:email ; dcterms:source "extractor-xml" ; void:triples "21650"^^xsd:int dcterms:title "npgg:affiliations" ; ], [ rdfs:label "npgg:affiliations" ; void:property vcard:country-name ; void:classPartition [ void:triples 0 void:class npg:Affiliation ; ], [ void:entities "973208"^^xsd:int void:property rdfs:label ; ]; void:triples "973208"^^xsd:int void:propertyPartition [ ], [ void:property vcard:url ; void:property rdf:type ; void:triples "326"^^xsd:int void:triples "973208"^^xsd:int ], [ ]; void:property vcard:street-address ; void:triples "3340845"^^xsd:int ; void:triples "82638"^^xsd:int void:vocabulary npg:, rdf:, rdfs:, void: . ], [ 28
  29. Linked Data API• ./api/articles [.json, .rdf, .xml]• ./api/articles?hasProduct.pcode=ng• ./api/contributors?familyName=Smith• ./api/products.json?pcode=ng&_page=2• ./api/products?_view=none&_properties=pcode• ./api/search?title=black+hole• ./api/tree/subjects/children.xml?_sort=title 29
  30. Closing 30
  31. Positions Available goo.gl/bYIt8 www.linkedin.com/jobs?jobId=4890057&viewJob 31
  32. Information data.nature.com developers.nature.com/docs datahub.io/group/npg prefix.cc/npg 32

×