LinkChains: Decentralised Trustworthy Linked Data

John Domingue
John DomingueProfessor at Knowledge Media Institute, The Open University
LinkChains:
Decentralised
Trustworthy Linked
Data
Allan Third & John Domingue (@johndmk)
Knowledge Media Institute
The Open University, UK
kmi.open.ac.uk
blockchain.open.ac.uk
Background and
Blockchains
Handling Sensitive Linked
Data
• Need to ensure that Linked Data Set is
unchanged
• E.g.
– Healthcare
– Education
– Finance
– Scholarly publications
Cryptographic Hash Function
https://en.wikipedia.org/wiki/Cryptographic_hash_function
Blockchain is a Linked List
A blockchain can be thought of as a linked list of transactions
that is built with hash pointers instead of pointers
Source: Bitcoin and Cryptocurrency Technologies - Arvind Narayanan, Joseph Bonneau, Edward Felten, Andrew Miller, Steven Goldfeder
Peer to Peer Network
http://www.terndrup.net/2015/10/27/Building-a-P2P-Peer-Client-with-Node-js/
Add everyone has a complete copy of the data
Who Next?
Proof of Work
• Hard to outpace the entire rest of the network… a 51% attack could
do it, but otherwise it is like buying thousands of lottery tickets –
doesn’t help you that much!
Source: Marc Eisenstadt ‘What is the genius behind Bitcoin’
Ethereum Blockchain Platform
Sources: Ethereum Development Tutorial
Ethereum Virtual Machine
Sources: Ethereum Development Tutorial
The Ethereum Virtual Machine can be thought of as a large decentralized computer
containing millions of objects, called "accounts", which have the ability to maintain
an internal database, execute code and talk to each other.
There are 2 types of Accounts:
Externally owned account (EOA):
an account controlled by a private
key that has the ability to send
ether and messages from it.
‘Smart’ Contract: an account
that has its own code, and is
controlled by code.
Any user can trigger an action by sending a transaction from an EOA, setting
Ethereum's wheels in motion.
If the destination of the transaction
is another EOA, then the transaction
may transfer some ether but
otherwise does nothing
However, if the destination is a
‘Smart’ Contract, then the
contract in turn activates, and
automatically runs its code.
Interplanetary File System
(IPFS)
• Content-addressed distributed storage
(CADS)
• Files identified by hash of contents
• Shared across BitTorrent-based network
Exploring the DL/LD Decentralisation/Trust
Space
Decentralised Linked Data on
Distributed Ledgers
• Guarantees of immutability
– Data cannot be changed once published
• Integrity of valuable data
– Financial
– Medical
– Political/politically-sensitive
• e.g., climate science data
– Academic Publishing
Dimensions of decentralisation
for Linked Data
• Decentralised
– Data storage
– Querying
– Verification
• Other criteria
– Storage costs
– Query costs
– Level of integrity guarantee
Decentralising LOD storage &
querying
• Identified 5 approaches
– CADS
– CADS + distributed ledger
– Standard LOD + distributed ledger verifier
– Standard LOD + distributed ledger backend
– “Pure” distributed ledger
• Compared with base case of standard LOD
– SPARQL/Linked Data Fragments querying
Base case
• Centralised storage & querying
• No verification
Query = Linked Data Fragments
Linked Data Fragments
Ruben Verborgh’s
Linked Data Fragments
CADS
CADS
• Data decentralised (copy-on-demand)
• Queries centralised
• Verification
– Centralised (central source of IPFS hash)
– Weak (need to trust source of IPFS hash)
– Need to re-compute hash over entire data set
– No timestamping
CADS + Distributed Ledger
CADS + DL (2)
• Data decentralised
– But copy-on-demand
• Queries centralised
• Verification
– Decentralised (blockchain source of IPFS
hash)
– Strong (IPFS hash immutable, signable)
– Need to re-compute hash over entire data set
– Timestamping
Base case + DL Verifier
• Centralised storage & querying
• Verify query results with copy of original data on
blockchain
Base case + DL Backend
• Semi-decentralised queries - any node can be a
query frontend
• Decentralised data verified directly from blockchain
Linked Data Fragments
Ruben Verborgh’s
Linked Data Fragments
Base case + DL Backend
“Pure” Distributed Ledger
• Decentralised storage & querying
• Data comes directly from blockchain
Summary
Implementation
• Fully implemented:
– Base case with
• Blockchain verifier
• Blockchain backend
– ”Pure” distributed ledger
• In progress
– IPFS-based approaches
Issues
• Performance
– Cap on data per-block
– Speed of Ethereum
• Cost
– Experiments on (free) private chain so far
• Metadata
– How do we tell clients the verification
status/provenance of query results in
SPARQL/Linked Data Fragments queries?
Future Work
• Performance analysis and improvements
• Cost analysis
• Extension (in progress) of Linked Data
Fragments server
– Extensible query result metadata
• Extension of LDF client
– Display verification status of results to user
blockchain.open.ac.uk
1 of 30

Recommended

Blockchains and new educational models v 2.0 by
Blockchains and new educational models v 2.0Blockchains and new educational models v 2.0
Blockchains and new educational models v 2.0John Domingue
36.2K views59 slides
Blockchains and Adult Education by
Blockchains and Adult EducationBlockchains and Adult Education
Blockchains and Adult EducationJohn Domingue
58.2K views59 slides
Blockchains a new platform for semantically enabled transactions public by
Blockchains  a new platform for semantically enabled transactions publicBlockchains  a new platform for semantically enabled transactions public
Blockchains a new platform for semantically enabled transactions publicJohn Domingue
62.1K views60 slides
Blockchains as a Component of the Next generation Internet by
Blockchains as a Component of the Next generation InternetBlockchains as a Component of the Next generation Internet
Blockchains as a Component of the Next generation InternetJohn Domingue
56.5K views68 slides
How Blockchains Are Transforming Adult Education by
How Blockchains Are Transforming Adult EducationHow Blockchains Are Transforming Adult Education
How Blockchains Are Transforming Adult EducationJohn Domingue
5.8K views69 slides
Towards the decentralisation of personal data through blockchains and linked ... by
Towards the decentralisation of personal data through blockchains and linked ...Towards the decentralisation of personal data through blockchains and linked ...
Towards the decentralisation of personal data through blockchains and linked ...John Domingue
44K views78 slides

More Related Content

More from John Domingue

Linked data indexing of distributed ledgers by
Linked data indexing of distributed ledgersLinked data indexing of distributed ledgers
Linked data indexing of distributed ledgersJohn Domingue
2.4K views39 slides
1st workshop on linked data and distributed ledgers introduction v 1.0 by
1st workshop on linked data and distributed ledgers introduction v 1.01st workshop on linked data and distributed ledgers introduction v 1.0
1st workshop on linked data and distributed ledgers introduction v 1.0John Domingue
633 views13 slides
The Potential of Blockchain in Higher Education by
The Potential of Blockchain in Higher EducationThe Potential of Blockchain in Higher Education
The Potential of Blockchain in Higher EducationJohn Domingue
9.8K views41 slides
Transforming Education through FORGE by
Transforming Education through FORGETransforming Education through FORGE
Transforming Education through FORGEJohn Domingue
3.4K views65 slides
FIRE in a Book by
FIRE in a BookFIRE in a Book
FIRE in a BookJohn Domingue
416 views25 slides
Developing rich interactive eBooks to teach linked open data to professionals... by
Developing rich interactive eBooks to teach linked open data to professionals...Developing rich interactive eBooks to teach linked open data to professionals...
Developing rich interactive eBooks to teach linked open data to professionals...John Domingue
1.4K views44 slides

More from John Domingue(14)

Linked data indexing of distributed ledgers by John Domingue
Linked data indexing of distributed ledgersLinked data indexing of distributed ledgers
Linked data indexing of distributed ledgers
John Domingue2.4K views
1st workshop on linked data and distributed ledgers introduction v 1.0 by John Domingue
1st workshop on linked data and distributed ledgers introduction v 1.01st workshop on linked data and distributed ledgers introduction v 1.0
1st workshop on linked data and distributed ledgers introduction v 1.0
John Domingue633 views
The Potential of Blockchain in Higher Education by John Domingue
The Potential of Blockchain in Higher EducationThe Potential of Blockchain in Higher Education
The Potential of Blockchain in Higher Education
John Domingue9.8K views
Transforming Education through FORGE by John Domingue
Transforming Education through FORGETransforming Education through FORGE
Transforming Education through FORGE
John Domingue3.4K views
Developing rich interactive eBooks to teach linked open data to professionals... by John Domingue
Developing rich interactive eBooks to teach linked open data to professionals...Developing rich interactive eBooks to teach linked open data to professionals...
Developing rich interactive eBooks to teach linked open data to professionals...
John Domingue1.4K views
Linked services: Connecting services to the Web of Data by John Domingue
Linked services: Connecting services to the Web of DataLinked services: Connecting services to the Web of Data
Linked services: Connecting services to the Web of Data
John Domingue2.4K views
Beyond sparql linked data, software, services and applications. Keynote at D... by John Domingue
Beyond sparql  linked data, software, services and applications. Keynote at D...Beyond sparql  linked data, software, services and applications. Keynote at D...
Beyond sparql linked data, software, services and applications. Keynote at D...
John Domingue1.3K views
Linked services for the Web of Data by John Domingue
Linked services for the Web of DataLinked services for the Web of Data
Linked services for the Web of Data
John Domingue884 views
Linking Services and Linked Data: Keynote for AIMSA 2012 by John Domingue
Linking Services and Linked Data: Keynote for AIMSA 2012Linking Services and Linked Data: Keynote for AIMSA 2012
Linking Services and Linked Data: Keynote for AIMSA 2012
John Domingue855 views
Inaugural Lecture on Future Internet by John Domingue
Inaugural Lecture on Future InternetInaugural Lecture on Future Internet
Inaugural Lecture on Future Internet
John Domingue1.3K views
Soa4 all technical achievements final by John Domingue
Soa4 all technical achievements finalSoa4 all technical achievements final
Soa4 all technical achievements final
John Domingue1K views
Super ontology stack_review_m36_051 by John Domingue
Super ontology stack_review_m36_051Super ontology stack_review_m36_051
Super ontology stack_review_m36_051
John Domingue1.2K views
ESWC 2012 Dinner Keynote: Improving the Impact and Takeup of Semantic Web Res... by John Domingue
ESWC 2012 Dinner Keynote: Improving the Impact and Takeup of Semantic Web Res...ESWC 2012 Dinner Keynote: Improving the Impact and Takeup of Semantic Web Res...
ESWC 2012 Dinner Keynote: Improving the Impact and Takeup of Semantic Web Res...
John Domingue697 views

Recently uploaded

SAP Automation Using Bar Code and FIORI.pdf by
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdfVirendra Rai, PMP
25 views38 slides
Uni Systems for Power Platform.pptx by
Uni Systems for Power Platform.pptxUni Systems for Power Platform.pptx
Uni Systems for Power Platform.pptxUni Systems S.M.S.A.
58 views21 slides
Microsoft Power Platform.pptx by
Microsoft Power Platform.pptxMicrosoft Power Platform.pptx
Microsoft Power Platform.pptxUni Systems S.M.S.A.
61 views38 slides
MVP and prioritization.pdf by
MVP and prioritization.pdfMVP and prioritization.pdf
MVP and prioritization.pdfrahuldharwal141
37 views8 slides
20231123_Camunda Meetup Vienna.pdf by
20231123_Camunda Meetup Vienna.pdf20231123_Camunda Meetup Vienna.pdf
20231123_Camunda Meetup Vienna.pdfPhactum Softwareentwicklung GmbH
45 views73 slides
virtual reality.pptx by
virtual reality.pptxvirtual reality.pptx
virtual reality.pptxG036GaikwadSnehal
18 views15 slides

Recently uploaded(20)

SAP Automation Using Bar Code and FIORI.pdf by Virendra Rai, PMP
SAP Automation Using Bar Code and FIORI.pdfSAP Automation Using Bar Code and FIORI.pdf
SAP Automation Using Bar Code and FIORI.pdf
Five Things You SHOULD Know About Postman by Postman
Five Things You SHOULD Know About PostmanFive Things You SHOULD Know About Postman
Five Things You SHOULD Know About Postman
Postman38 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software317 views
"Running students' code in isolation. The hard way", Yurii Holiuk by Fwdays
"Running students' code in isolation. The hard way", Yurii Holiuk "Running students' code in isolation. The hard way", Yurii Holiuk
"Running students' code in isolation. The hard way", Yurii Holiuk
Fwdays24 views
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors by sugiuralab
TouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective SensorsTouchLog: Finger Micro Gesture Recognition  Using Photo-Reflective Sensors
TouchLog: Finger Micro Gesture Recognition Using Photo-Reflective Sensors
sugiuralab23 views
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院 by IttrainingIttraining
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
"Surviving highload with Node.js", Andrii Shumada by Fwdays
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada
Fwdays33 views
"Node.js Development in 2024: trends and tools", Nikita Galkin by Fwdays
"Node.js Development in 2024: trends and tools", Nikita Galkin "Node.js Development in 2024: trends and tools", Nikita Galkin
"Node.js Development in 2024: trends and tools", Nikita Galkin
Fwdays17 views
Piloting & Scaling Successfully With Microsoft Viva by Richard Harbridge
Piloting & Scaling Successfully With Microsoft VivaPiloting & Scaling Successfully With Microsoft Viva
Piloting & Scaling Successfully With Microsoft Viva

LinkChains: Decentralised Trustworthy Linked Data

  • 1. LinkChains: Decentralised Trustworthy Linked Data Allan Third & John Domingue (@johndmk) Knowledge Media Institute The Open University, UK kmi.open.ac.uk blockchain.open.ac.uk
  • 3. Handling Sensitive Linked Data • Need to ensure that Linked Data Set is unchanged • E.g. – Healthcare – Education – Finance – Scholarly publications
  • 5. Blockchain is a Linked List A blockchain can be thought of as a linked list of transactions that is built with hash pointers instead of pointers Source: Bitcoin and Cryptocurrency Technologies - Arvind Narayanan, Joseph Bonneau, Edward Felten, Andrew Miller, Steven Goldfeder
  • 6. Peer to Peer Network http://www.terndrup.net/2015/10/27/Building-a-P2P-Peer-Client-with-Node-js/ Add everyone has a complete copy of the data Who Next?
  • 7. Proof of Work • Hard to outpace the entire rest of the network… a 51% attack could do it, but otherwise it is like buying thousands of lottery tickets – doesn’t help you that much! Source: Marc Eisenstadt ‘What is the genius behind Bitcoin’
  • 8. Ethereum Blockchain Platform Sources: Ethereum Development Tutorial
  • 9. Ethereum Virtual Machine Sources: Ethereum Development Tutorial The Ethereum Virtual Machine can be thought of as a large decentralized computer containing millions of objects, called "accounts", which have the ability to maintain an internal database, execute code and talk to each other. There are 2 types of Accounts: Externally owned account (EOA): an account controlled by a private key that has the ability to send ether and messages from it. ‘Smart’ Contract: an account that has its own code, and is controlled by code. Any user can trigger an action by sending a transaction from an EOA, setting Ethereum's wheels in motion. If the destination of the transaction is another EOA, then the transaction may transfer some ether but otherwise does nothing However, if the destination is a ‘Smart’ Contract, then the contract in turn activates, and automatically runs its code.
  • 10. Interplanetary File System (IPFS) • Content-addressed distributed storage (CADS) • Files identified by hash of contents • Shared across BitTorrent-based network
  • 11. Exploring the DL/LD Decentralisation/Trust Space
  • 12. Decentralised Linked Data on Distributed Ledgers • Guarantees of immutability – Data cannot be changed once published • Integrity of valuable data – Financial – Medical – Political/politically-sensitive • e.g., climate science data – Academic Publishing
  • 13. Dimensions of decentralisation for Linked Data • Decentralised – Data storage – Querying – Verification • Other criteria – Storage costs – Query costs – Level of integrity guarantee
  • 14. Decentralising LOD storage & querying • Identified 5 approaches – CADS – CADS + distributed ledger – Standard LOD + distributed ledger verifier – Standard LOD + distributed ledger backend – “Pure” distributed ledger • Compared with base case of standard LOD – SPARQL/Linked Data Fragments querying
  • 15. Base case • Centralised storage & querying • No verification Query = Linked Data Fragments
  • 16. Linked Data Fragments Ruben Verborgh’s Linked Data Fragments
  • 17. CADS
  • 18. CADS • Data decentralised (copy-on-demand) • Queries centralised • Verification – Centralised (central source of IPFS hash) – Weak (need to trust source of IPFS hash) – Need to re-compute hash over entire data set – No timestamping
  • 20. CADS + DL (2) • Data decentralised – But copy-on-demand • Queries centralised • Verification – Decentralised (blockchain source of IPFS hash) – Strong (IPFS hash immutable, signable) – Need to re-compute hash over entire data set – Timestamping
  • 21. Base case + DL Verifier • Centralised storage & querying • Verify query results with copy of original data on blockchain
  • 22. Base case + DL Backend • Semi-decentralised queries - any node can be a query frontend • Decentralised data verified directly from blockchain
  • 23. Linked Data Fragments Ruben Verborgh’s Linked Data Fragments
  • 24. Base case + DL Backend
  • 25. “Pure” Distributed Ledger • Decentralised storage & querying • Data comes directly from blockchain
  • 27. Implementation • Fully implemented: – Base case with • Blockchain verifier • Blockchain backend – ”Pure” distributed ledger • In progress – IPFS-based approaches
  • 28. Issues • Performance – Cap on data per-block – Speed of Ethereum • Cost – Experiments on (free) private chain so far • Metadata – How do we tell clients the verification status/provenance of query results in SPARQL/Linked Data Fragments queries?
  • 29. Future Work • Performance analysis and improvements • Cost analysis • Extension (in progress) of Linked Data Fragments server – Extensible query result metadata • Extension of LDF client – Display verification status of results to user

Editor's Notes

  1. By default, the Ethereum execution environment is lifeless; nothing happens and the state of every account remains the same. However, any user can trigger an action by sending a transaction from an externally owned account, setting Ethereum's wheels in motion.