SlideShare a Scribd company logo
1
Mind My Value:
A decentralised infrastructure
for fair and trusted IoT data trading
Paolo Missier
Shaimaa Bajoudah
{firstname.lastname}@ncl.ac.uk
School of Computing, Newcastle
University, UK
Angelo Capossele
Andrea Gaglione
Michele Nati
{firstname.lastname}@digitalcatapult.org.uk
Digital Catapult Centre
London, UK
IoT Conference
Linz, Austria
Oct. 24th, 2017
2
Motivation: monetising our own IoT data streams
Focus on IoT data streams that carry information about people
• Wearables - Health care, Personal fitness, “quantified self”
• Smart home hubs / smart vehicles
Three questions:
• Business: Do these data streams have a value as digital assets?
• Technical: How do unlock the value / enable trading of such assets?
• Legal: Who is entitled to trade them?
3
Primary and secondary IoT data markets
Working assumptions:
1. VAS (Value Added Services) that aggregate granular IoT data streams exist
2. They have both technical capabilities and business incentives
3. There will exist both primary and secondary markets for IoT data
Native
Value Added Services
Data Generators
- Personal / home / vehicle devices
Secondary
Value Added Services
Edge network
IoT data flows
VAS
VAS
VAS
VAS
VAS
Secondary markets
Primary market
P1
PN
…
P2
G1
Pi
……
…
Edge / Trusted zone
Network server /
Broker
Network infrastructure
C1
…
CN
Ci
…
Trusted zone
IoT data tracking
Pi
Tk
Nijk(W)
Nik(W)
Pi
Cj
Tk
Blockchain ecosystem
Interface
Subscriber app /
application server
(VAS)
…
GN
Gi
4
Re-selling Scenario – Traffic For London
Tube stations gates
TFL
Primary marketOyster card
Data
User
Data
Taxi
company
Secondary markets
Targeted
Ads
Card + user
Data
TFL is a re-seller (do tube users get a cut of their extra profits?)
5
Marketplace Scenario
“I am diabetic and I monitor my own fitness regime.
Can I trade my data for a better health insurance deal?”
Primary IoT data flows
Strava
Secondary markets
Primary VAS
Garmin server
Edge network
IoT data marketplace
Health
Insurance
Follow-your-runner
portal
“I am a (really good) runner.
How do Iicense my live data feed on the marketplace?”
IoT device owners may trade
data streams directly:
6
A new type of data marketplace?
Goal:
To develop a blueprint for the next generation IoT data marketplaces and
demonstrate its technical feasibility
(Ideal) Requirements:
1. Dynamic and flexible. Enable un-anticipated business relationships
• Quickly establish and fulfil new primary–VAS contracts
2. Guaranteed compliance and fairness
3. Granular: Should allow individuals to gain value from their data
4. Decentralized: Governance rules are defined, but
• No central trusted authority is appointed to enforce those rules
5. “Big Data” challenges: high Volume, high Velocity, high Variety
7
Initial scenario: brokered IoT traffic
Observable IoT data streams
- Simply count messages: <from, to, topic>
An IoT pub/sub setting (eg MQTT)
8
Solution: broker-centric traffic metering
message count cubes
Only contain metadata
Example Topics:
- Heart rate
- Speed
- GPS trace
- Glucose level
- Gait
- Home water consumption
- Vehicle driving data
- …
9
Settlement
At the end of each W:
Total fee owed by each cj to each pi computed by aggregating counts in the cube:
Unit cost
for topic k messagespi revenue after W:
if we assume:
- Complete and correct traffic metering
- A trusted authority to compute revenues
Problem solved?
10
Marketplace with central trusted components
D
D
D
IoT data flows
VAS
VAS
Centralised Traffic Metering
Contract Compliance
Data Discovery
Service
Transaction settlement
Vocabulary /
Ontology
Negotiation and
Agreement
Service
Registration /
Identity Service
Dispute arbitration
msg count cubes
Observed
data flow
Contracts
DB
Participants
DB
Setup
Exec
Analyse
Low density
High density
(MQTT)
(Fees)
11
Removing trust and de-centralising control
Can we remove the trust assumption and still fulfill these functions?
Trusted broker provides
• Accountability (accurate, complete counts)
• Dispute resolution
• Revenue distribution
12
Approach
1) Accountability: Each participant is responsible for reporting their own counts of
messages sent / received
2) Transparency: Reports become part of blockchain transactions
Unilateral count cubes:
- Provided by each participant
- Partial – reflect a participant’s point of view
Count of messages sent by pi about tk
Provider cube:
Subscriber cube:
Count of messages received by cj from pi about tk
13
Consistency and settlement with unilateral cubes
Cubes collected at the end of W:
Consistency constraint:
For each tk, for each subscriber cj to tk:
Number of messages sent by pi = number of messages received by cj from pi
But:
• Producers have an incentive to over-report the data they send
• VAS have an incentive to under-report the data they receive
Thus for some combination of pi, cj, tk we may expect:
14
Blockchain + smart contracts
1. Associate identity to marketplace participants
2. Agree on contract specification
3. Settlement of contractual disputes given unilaterally generated traffic reports
“A blockchain is a globally shared, transactional database”
Smart contracts (Ethereum blockchain)
Smart contract is a term used to describe computer
program code that is capable of facilitating, executing, and
enforcing the negotiation or performance of an agreement
(i.e. contract) using blockchain technology.
Contract logic triggered by data input  the batches of (unilateral) cubes
15
Removing central trust: approach
broker-controlled “message count cubes”  each participant unilaterally reports data sent / received
D
D
D
IoT data flows
VAS
VAS
Contract Compliance
Data Discovery
Service
Vocabulary /
Ontology
Negotiation and
Agreement
Service
Registration /
Identity Service
Settlement /
Reputation Management
Setup
Exec
Analyse
D
D
D
IoT data flows
VAS
VAS
Contract Compliance
Data Discovery
Service
Vocabulary /
Ontology
Negotiation and
Agreement
Service
Registration /
Identity Service
Settlement /
Reputation Management
Setup
Exec
Analyse
- Identities
- Contracts
- Unilateral
traffic reports
Sm
art
C
ontract
Sm
art
C
ontract
Sm
art
C
ontract
Sm
art
C
ontract
Unilateral cubes
cubep cubes
16
Evaluation
Where should cubes data live?
- Off chain: cubes remain natively located within participants’ trusted zones
- Oraclize (Ethereum-specfic mechanism)
- Adds to cost of Smart Contract execution
- guarantees the authenticity and integrity of the retrieved data.
- Oraclize requires a query fee: 0.01$ to 0.04$
- On chain: transactions embed the cubes in the blockchain
- No cost but adds to transaction size
Execution cost of cube
settlement operations
How much does it cost to run the settlement smart contract?
17
Evaluation
Overhead: (cost of contract execution) x (settlement rate)
- cost of contract execution  cube size)  #PROD X #CONS x #TOPICS
(possibly compressed)
1
10
100
1000
10000
0 5 10 15 20 25
Datatransferrate
Cube settlement operations
0.0000010
0.0000038
0.0000153
0.0000610
0.0002441
0.0009766
0.0039063
0.0156250
0.0625000
0.2500000
Dataprice[ETH]
Impact of overhead on unit cost for varying settlement rate and data transfer rate
18
Evaluation
Cost per message for varying data volume, settlement rate, gas price
0.0000002
0.0000010
0.0000038
0.0000153
0.0000610
0.0002441
0.0009766
0.0039063
200 400 600 800 1000 1200 1400 1600 1800 2000
0.00006
0.00024
0.00098
0.00391
0.01563
0.06250
0.25000
1.00000
Cubesettlementcost[ETH]
Cubesettlementcost[USD]
Data transfer rate
1 settlement at low gas price
5 settlements at low gas price
1 settlement at avg gas price
5 settlements at avg gas price
off-chain data
(retrieved using Oraclize with
additional cost overhead)
Estimated data prices
0.0000002
0.0000010
0.0000038
0.0000153
0.0000610
0.0002441
0.0009766
0.0039063
200 400 600 800 1000 1200 1400 1600 1800 2000
0.00006
0.00024
0.00098
0.00391
0.01563
0.06250
0.25000
1.00000
Cubesettlementcost[ETH]
Cubesettlementcost[USD]
Data transfer rate
1 settlement at low gas price
5 settlements at low gas price
1 settlement at avg gas price
5 settlements at avg gas price
on-chain data
19
Open challenges
Fairness in the presence of malicious behaviour
 reputation model:
based on history of disagreements on past transactions
What’s in a trading agreement?
• From atomic data trading (single message) to complex SLA
Think “follow-your-runner”
System challenges:
• Evolving Smart Contract technology: Ethereum vs Hyperledger
• Public vs permissioned blockchains
• Scalability
20
Thank you
Contact:
Paolo Missier
School of Computing
Newcastle University, UK
paolo.missier@ncl.ac.uk
http://tinyurl.com/paolomissier

More Related Content

What's hot

Cri big data
Cri big dataCri big data
Cri big data
Putchong Uthayopas
 
Big Data
Big Data Big Data
Comparison Between WEKA and Salford System in Data Mining Software
Comparison Between WEKA and Salford System in Data Mining SoftwareComparison Between WEKA and Salford System in Data Mining Software
Comparison Between WEKA and Salford System in Data Mining Software
Universitas Pembangunan Panca Budi
 
Association rule visualization technique
Association rule visualization techniqueAssociation rule visualization technique
Association rule visualization technique
mustafasmart
 
Utilities White Paper Final Versant
Utilities White Paper Final VersantUtilities White Paper Final Versant
Utilities White Paper Final Versant
Bert Taube
 
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERSOPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
Anastasija Nikiforova
 
Data mining seminar report
Data mining seminar reportData mining seminar report
Data mining seminar report
mayurik19
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET Journal
 

What's hot (8)

Cri big data
Cri big dataCri big data
Cri big data
 
Big Data
Big Data Big Data
Big Data
 
Comparison Between WEKA and Salford System in Data Mining Software
Comparison Between WEKA and Salford System in Data Mining SoftwareComparison Between WEKA and Salford System in Data Mining Software
Comparison Between WEKA and Salford System in Data Mining Software
 
Association rule visualization technique
Association rule visualization techniqueAssociation rule visualization technique
Association rule visualization technique
 
Utilities White Paper Final Versant
Utilities White Paper Final VersantUtilities White Paper Final Versant
Utilities White Paper Final Versant
 
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERSOPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
 
Data mining seminar report
Data mining seminar reportData mining seminar report
Data mining seminar report
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
 

Similar to Mind My Value: A decentralised infrastructure for fair and trusted IoT data trading

Block-Chain technology to boost Port Community System
Block-Chain technology to boost Port Community SystemBlock-Chain technology to boost Port Community System
Block-Chain technology to boost Port Community System
md. tanvir hossain
 
TCXC IoT AAA Integration Whitepaper
TCXC IoT AAA Integration WhitepaperTCXC IoT AAA Integration Whitepaper
TCXC IoT AAA Integration Whitepaper
TelecomsXChange
 
Distributed Intelligence
Distributed IntelligenceDistributed Intelligence
Distributed Intelligence
Nuri Cankaya
 
Ocean Protocol Presentation by CEO Bruce Pon 20171129
Ocean Protocol Presentation by CEO Bruce Pon 20171129Ocean Protocol Presentation by CEO Bruce Pon 20171129
Ocean Protocol Presentation by CEO Bruce Pon 20171129
Team AI
 
Blockchain as a new cyber strategy for your business
Blockchain as a new cyber strategy for your businessBlockchain as a new cyber strategy for your business
Blockchain as a new cyber strategy for your business
David Joao Vieira Carvalho
 
Alternative Consensus & Enterprise Blockchain
Alternative Consensus & Enterprise BlockchainAlternative Consensus & Enterprise Blockchain
Alternative Consensus & Enterprise Blockchain
Tobias Disse
 
Blockchain - Primer for City CIOs v05 01 22.pdf
Blockchain - Primer for City CIOs v05 01 22.pdfBlockchain - Primer for City CIOs v05 01 22.pdf
Blockchain - Primer for City CIOs v05 01 22.pdf
ssusera441c2
 
Blockchain in industry 4.0
Blockchain in industry 4.0Blockchain in industry 4.0
Blockchain in industry 4.0
Mujahid Hussain
 
Block chain technology in pcs
Block chain technology in pcsBlock chain technology in pcs
Block chain technology in pcs
MOHIMENUL
 
Cryptocurrenty and Blockchain - SSMRV.pptx
Cryptocurrenty and Blockchain - SSMRV.pptxCryptocurrenty and Blockchain - SSMRV.pptx
Cryptocurrenty and Blockchain - SSMRV.pptx
ChristopherDevakumar1
 
First-North - EUSN Presentation (November 16 2016) Final-v1 Yogi Notes 2016-1...
First-North - EUSN Presentation (November 16 2016) Final-v1 Yogi Notes 2016-1...First-North - EUSN Presentation (November 16 2016) Final-v1 Yogi Notes 2016-1...
First-North - EUSN Presentation (November 16 2016) Final-v1 Yogi Notes 2016-1...
Yogi Golle
 
Blockchain & Telecommunication Services Provider
Blockchain & Telecommunication Services ProviderBlockchain & Telecommunication Services Provider
Blockchain & Telecommunication Services Provider
Samuel Liu
 
Blockchain point of view for the telco, media and entertainment industry
Blockchain point of view for the telco, media and entertainment industryBlockchain point of view for the telco, media and entertainment industry
Blockchain point of view for the telco, media and entertainment industry
IBM Blockchain
 
Nov 2 security for blockchain and analytics ulf mattsson 2020 nov 2b
Nov 2 security for blockchain and analytics   ulf mattsson 2020 nov 2bNov 2 security for blockchain and analytics   ulf mattsson 2020 nov 2b
Nov 2 security for blockchain and analytics ulf mattsson 2020 nov 2b
Ulf Mattsson
 
TGC12 e book
TGC12 e bookTGC12 e book
TGC12 e book
Sadiq Malik
 
Introduction to Blockchain and Smart Contracts
Introduction to Blockchain and Smart ContractsIntroduction to Blockchain and Smart Contracts
Introduction to Blockchain and Smart Contracts
Saad Zaher
 
Blockchains For The IOT - EVRYTHNG
Blockchains For The IOT - EVRYTHNGBlockchains For The IOT - EVRYTHNG
Blockchains For The IOT - EVRYTHNG
Rids Vazi
 
SCALES: Supply Chain Architecture Leading to Enhanced Services
SCALES: Supply Chain Architecture Leading to Enhanced Services SCALES: Supply Chain Architecture Leading to Enhanced Services
SCALES: Supply Chain Architecture Leading to Enhanced Services
Fabio Massimi
 
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Bernhard Haslhofer
 
Applying Blockchain for P2P Energy Trading
Applying Blockchain for P2P Energy TradingApplying Blockchain for P2P Energy Trading
Applying Blockchain for P2P Energy Trading
Tsinghua University
 

Similar to Mind My Value: A decentralised infrastructure for fair and trusted IoT data trading (20)

Block-Chain technology to boost Port Community System
Block-Chain technology to boost Port Community SystemBlock-Chain technology to boost Port Community System
Block-Chain technology to boost Port Community System
 
TCXC IoT AAA Integration Whitepaper
TCXC IoT AAA Integration WhitepaperTCXC IoT AAA Integration Whitepaper
TCXC IoT AAA Integration Whitepaper
 
Distributed Intelligence
Distributed IntelligenceDistributed Intelligence
Distributed Intelligence
 
Ocean Protocol Presentation by CEO Bruce Pon 20171129
Ocean Protocol Presentation by CEO Bruce Pon 20171129Ocean Protocol Presentation by CEO Bruce Pon 20171129
Ocean Protocol Presentation by CEO Bruce Pon 20171129
 
Blockchain as a new cyber strategy for your business
Blockchain as a new cyber strategy for your businessBlockchain as a new cyber strategy for your business
Blockchain as a new cyber strategy for your business
 
Alternative Consensus & Enterprise Blockchain
Alternative Consensus & Enterprise BlockchainAlternative Consensus & Enterprise Blockchain
Alternative Consensus & Enterprise Blockchain
 
Blockchain - Primer for City CIOs v05 01 22.pdf
Blockchain - Primer for City CIOs v05 01 22.pdfBlockchain - Primer for City CIOs v05 01 22.pdf
Blockchain - Primer for City CIOs v05 01 22.pdf
 
Blockchain in industry 4.0
Blockchain in industry 4.0Blockchain in industry 4.0
Blockchain in industry 4.0
 
Block chain technology in pcs
Block chain technology in pcsBlock chain technology in pcs
Block chain technology in pcs
 
Cryptocurrenty and Blockchain - SSMRV.pptx
Cryptocurrenty and Blockchain - SSMRV.pptxCryptocurrenty and Blockchain - SSMRV.pptx
Cryptocurrenty and Blockchain - SSMRV.pptx
 
First-North - EUSN Presentation (November 16 2016) Final-v1 Yogi Notes 2016-1...
First-North - EUSN Presentation (November 16 2016) Final-v1 Yogi Notes 2016-1...First-North - EUSN Presentation (November 16 2016) Final-v1 Yogi Notes 2016-1...
First-North - EUSN Presentation (November 16 2016) Final-v1 Yogi Notes 2016-1...
 
Blockchain & Telecommunication Services Provider
Blockchain & Telecommunication Services ProviderBlockchain & Telecommunication Services Provider
Blockchain & Telecommunication Services Provider
 
Blockchain point of view for the telco, media and entertainment industry
Blockchain point of view for the telco, media and entertainment industryBlockchain point of view for the telco, media and entertainment industry
Blockchain point of view for the telco, media and entertainment industry
 
Nov 2 security for blockchain and analytics ulf mattsson 2020 nov 2b
Nov 2 security for blockchain and analytics   ulf mattsson 2020 nov 2bNov 2 security for blockchain and analytics   ulf mattsson 2020 nov 2b
Nov 2 security for blockchain and analytics ulf mattsson 2020 nov 2b
 
TGC12 e book
TGC12 e bookTGC12 e book
TGC12 e book
 
Introduction to Blockchain and Smart Contracts
Introduction to Blockchain and Smart ContractsIntroduction to Blockchain and Smart Contracts
Introduction to Blockchain and Smart Contracts
 
Blockchains For The IOT - EVRYTHNG
Blockchains For The IOT - EVRYTHNGBlockchains For The IOT - EVRYTHNG
Blockchains For The IOT - EVRYTHNG
 
SCALES: Supply Chain Architecture Leading to Enhanced Services
SCALES: Supply Chain Architecture Leading to Enhanced Services SCALES: Supply Chain Architecture Leading to Enhanced Services
SCALES: Supply Chain Architecture Leading to Enhanced Services
 
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
 
Applying Blockchain for P2P Energy Trading
Applying Blockchain for P2P Energy TradingApplying Blockchain for P2P Energy Trading
Applying Blockchain for P2P Energy Trading
 

More from Paolo Missier

(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
Paolo Missier
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
Paolo Missier
 
Towards explanations for Data-Centric AI using provenance records
Towards explanations for Data-Centric AI using provenance recordsTowards explanations for Data-Centric AI using provenance records
Towards explanations for Data-Centric AI using provenance records
Paolo Missier
 
Interpretable and robust hospital readmission predictions from Electronic Hea...
Interpretable and robust hospital readmission predictions from Electronic Hea...Interpretable and robust hospital readmission predictions from Electronic Hea...
Interpretable and robust hospital readmission predictions from Electronic Hea...
Paolo Missier
 
Data-centric AI and the convergence of data and model engineering: opportunit...
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
Paolo Missier
 
Realising the potential of Health Data Science: opportunities and challenges ...
Realising the potential of Health Data Science:opportunities and challenges ...Realising the potential of Health Data Science:opportunities and challenges ...
Realising the potential of Health Data Science: opportunities and challenges ...
Paolo Missier
 
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Paolo Missier
 
A Data-centric perspective on Data-driven healthcare: a short overview
A Data-centric perspective on Data-driven healthcare: a short overviewA Data-centric perspective on Data-driven healthcare: a short overview
A Data-centric perspective on Data-driven healthcare: a short overview
Paolo Missier
 
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Paolo Missier
 
Tracking trajectories of multiple long-term conditions using dynamic patient...
Tracking trajectories of  multiple long-term conditions using dynamic patient...Tracking trajectories of  multiple long-term conditions using dynamic patient...
Tracking trajectories of multiple long-term conditions using dynamic patient...
Paolo Missier
 
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Paolo Missier
 
Digital biomarkers for preventive personalised healthcare
Digital biomarkers for preventive personalised healthcareDigital biomarkers for preventive personalised healthcare
Digital biomarkers for preventive personalised healthcare
Paolo Missier
 
Digital biomarkers for preventive personalised healthcare
Digital biomarkers for preventive personalised healthcareDigital biomarkers for preventive personalised healthcare
Digital biomarkers for preventive personalised healthcare
Paolo Missier
 
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Paolo Missier
 
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
Quo vadis, provenancer? Cui prodest? our own trajectory: provenance of data...Quo vadis, provenancer? Cui prodest? our own trajectory: provenance of data...
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
Paolo Missier
 
Data Science for (Health) Science: tales from a challenging front line, and h...
Data Science for (Health) Science:tales from a challenging front line, and h...Data Science for (Health) Science:tales from a challenging front line, and h...
Data Science for (Health) Science: tales from a challenging front line, and h...
Paolo Missier
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Paolo Missier
 
ReComp: optimising the re-execution of analytics pipelines in response to cha...
ReComp: optimising the re-execution of analytics pipelines in response to cha...ReComp: optimising the re-execution of analytics pipelines in response to cha...
ReComp: optimising the re-execution of analytics pipelines in response to cha...
Paolo Missier
 
ReComp, the complete story: an invited talk at Cardiff University
ReComp, the complete story:  an invited talk at Cardiff UniversityReComp, the complete story:  an invited talk at Cardiff University
ReComp, the complete story: an invited talk at Cardiff University
Paolo Missier
 
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Paolo Missier
 

More from Paolo Missier (20)

(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Towards explanations for Data-Centric AI using provenance records
Towards explanations for Data-Centric AI using provenance recordsTowards explanations for Data-Centric AI using provenance records
Towards explanations for Data-Centric AI using provenance records
 
Interpretable and robust hospital readmission predictions from Electronic Hea...
Interpretable and robust hospital readmission predictions from Electronic Hea...Interpretable and robust hospital readmission predictions from Electronic Hea...
Interpretable and robust hospital readmission predictions from Electronic Hea...
 
Data-centric AI and the convergence of data and model engineering: opportunit...
Data-centric AI and the convergence of data and model engineering:opportunit...Data-centric AI and the convergence of data and model engineering:opportunit...
Data-centric AI and the convergence of data and model engineering: opportunit...
 
Realising the potential of Health Data Science: opportunities and challenges ...
Realising the potential of Health Data Science:opportunities and challenges ...Realising the potential of Health Data Science:opportunities and challenges ...
Realising the potential of Health Data Science: opportunities and challenges ...
 
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
Provenance Week 2023 talk on DP4DS (Data Provenance for Data Science)
 
A Data-centric perspective on Data-driven healthcare: a short overview
A Data-centric perspective on Data-driven healthcare: a short overviewA Data-centric perspective on Data-driven healthcare: a short overview
A Data-centric perspective on Data-driven healthcare: a short overview
 
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
 
Tracking trajectories of multiple long-term conditions using dynamic patient...
Tracking trajectories of  multiple long-term conditions using dynamic patient...Tracking trajectories of  multiple long-term conditions using dynamic patient...
Tracking trajectories of multiple long-term conditions using dynamic patient...
 
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
Delivering on the promise of data-driven healthcare: trade-offs, challenges, ...
 
Digital biomarkers for preventive personalised healthcare
Digital biomarkers for preventive personalised healthcareDigital biomarkers for preventive personalised healthcare
Digital biomarkers for preventive personalised healthcare
 
Digital biomarkers for preventive personalised healthcare
Digital biomarkers for preventive personalised healthcareDigital biomarkers for preventive personalised healthcare
Digital biomarkers for preventive personalised healthcare
 
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...Capturing and querying fine-grained provenance of preprocessing pipelines in ...
Capturing and querying fine-grained provenance of preprocessing pipelines in ...
 
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
Quo vadis, provenancer? Cui prodest? our own trajectory: provenance of data...Quo vadis, provenancer? Cui prodest? our own trajectory: provenance of data...
Quo vadis, provenancer?  Cui prodest?  our own trajectory: provenance of data...
 
Data Science for (Health) Science: tales from a challenging front line, and h...
Data Science for (Health) Science:tales from a challenging front line, and h...Data Science for (Health) Science:tales from a challenging front line, and h...
Data Science for (Health) Science: tales from a challenging front line, and h...
 
Analytics of analytics pipelines: from optimising re-execution to general Dat...
Analytics of analytics pipelines:from optimising re-execution to general Dat...Analytics of analytics pipelines:from optimising re-execution to general Dat...
Analytics of analytics pipelines: from optimising re-execution to general Dat...
 
ReComp: optimising the re-execution of analytics pipelines in response to cha...
ReComp: optimising the re-execution of analytics pipelines in response to cha...ReComp: optimising the re-execution of analytics pipelines in response to cha...
ReComp: optimising the re-execution of analytics pipelines in response to cha...
 
ReComp, the complete story: an invited talk at Cardiff University
ReComp, the complete story:  an invited talk at Cardiff UniversityReComp, the complete story:  an invited talk at Cardiff University
ReComp, the complete story: an invited talk at Cardiff University
 
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
Efficient Re-computation of Big Data Analytics Processes in the Presence of C...
 

Recently uploaded

Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
Rohit Gautam
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 

Recently uploaded (20)

Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 

Mind My Value: A decentralised infrastructure for fair and trusted IoT data trading

  • 1. 1 Mind My Value: A decentralised infrastructure for fair and trusted IoT data trading Paolo Missier Shaimaa Bajoudah {firstname.lastname}@ncl.ac.uk School of Computing, Newcastle University, UK Angelo Capossele Andrea Gaglione Michele Nati {firstname.lastname}@digitalcatapult.org.uk Digital Catapult Centre London, UK IoT Conference Linz, Austria Oct. 24th, 2017
  • 2. 2 Motivation: monetising our own IoT data streams Focus on IoT data streams that carry information about people • Wearables - Health care, Personal fitness, “quantified self” • Smart home hubs / smart vehicles Three questions: • Business: Do these data streams have a value as digital assets? • Technical: How do unlock the value / enable trading of such assets? • Legal: Who is entitled to trade them?
  • 3. 3 Primary and secondary IoT data markets Working assumptions: 1. VAS (Value Added Services) that aggregate granular IoT data streams exist 2. They have both technical capabilities and business incentives 3. There will exist both primary and secondary markets for IoT data Native Value Added Services Data Generators - Personal / home / vehicle devices Secondary Value Added Services Edge network IoT data flows VAS VAS VAS VAS VAS Secondary markets Primary market P1 PN … P2 G1 Pi …… … Edge / Trusted zone Network server / Broker Network infrastructure C1 … CN Ci … Trusted zone IoT data tracking Pi Tk Nijk(W) Nik(W) Pi Cj Tk Blockchain ecosystem Interface Subscriber app / application server (VAS) … GN Gi
  • 4. 4 Re-selling Scenario – Traffic For London Tube stations gates TFL Primary marketOyster card Data User Data Taxi company Secondary markets Targeted Ads Card + user Data TFL is a re-seller (do tube users get a cut of their extra profits?)
  • 5. 5 Marketplace Scenario “I am diabetic and I monitor my own fitness regime. Can I trade my data for a better health insurance deal?” Primary IoT data flows Strava Secondary markets Primary VAS Garmin server Edge network IoT data marketplace Health Insurance Follow-your-runner portal “I am a (really good) runner. How do Iicense my live data feed on the marketplace?” IoT device owners may trade data streams directly:
  • 6. 6 A new type of data marketplace? Goal: To develop a blueprint for the next generation IoT data marketplaces and demonstrate its technical feasibility (Ideal) Requirements: 1. Dynamic and flexible. Enable un-anticipated business relationships • Quickly establish and fulfil new primary–VAS contracts 2. Guaranteed compliance and fairness 3. Granular: Should allow individuals to gain value from their data 4. Decentralized: Governance rules are defined, but • No central trusted authority is appointed to enforce those rules 5. “Big Data” challenges: high Volume, high Velocity, high Variety
  • 7. 7 Initial scenario: brokered IoT traffic Observable IoT data streams - Simply count messages: <from, to, topic> An IoT pub/sub setting (eg MQTT)
  • 8. 8 Solution: broker-centric traffic metering message count cubes Only contain metadata Example Topics: - Heart rate - Speed - GPS trace - Glucose level - Gait - Home water consumption - Vehicle driving data - …
  • 9. 9 Settlement At the end of each W: Total fee owed by each cj to each pi computed by aggregating counts in the cube: Unit cost for topic k messagespi revenue after W: if we assume: - Complete and correct traffic metering - A trusted authority to compute revenues Problem solved?
  • 10. 10 Marketplace with central trusted components D D D IoT data flows VAS VAS Centralised Traffic Metering Contract Compliance Data Discovery Service Transaction settlement Vocabulary / Ontology Negotiation and Agreement Service Registration / Identity Service Dispute arbitration msg count cubes Observed data flow Contracts DB Participants DB Setup Exec Analyse Low density High density (MQTT) (Fees)
  • 11. 11 Removing trust and de-centralising control Can we remove the trust assumption and still fulfill these functions? Trusted broker provides • Accountability (accurate, complete counts) • Dispute resolution • Revenue distribution
  • 12. 12 Approach 1) Accountability: Each participant is responsible for reporting their own counts of messages sent / received 2) Transparency: Reports become part of blockchain transactions Unilateral count cubes: - Provided by each participant - Partial – reflect a participant’s point of view Count of messages sent by pi about tk Provider cube: Subscriber cube: Count of messages received by cj from pi about tk
  • 13. 13 Consistency and settlement with unilateral cubes Cubes collected at the end of W: Consistency constraint: For each tk, for each subscriber cj to tk: Number of messages sent by pi = number of messages received by cj from pi But: • Producers have an incentive to over-report the data they send • VAS have an incentive to under-report the data they receive Thus for some combination of pi, cj, tk we may expect:
  • 14. 14 Blockchain + smart contracts 1. Associate identity to marketplace participants 2. Agree on contract specification 3. Settlement of contractual disputes given unilaterally generated traffic reports “A blockchain is a globally shared, transactional database” Smart contracts (Ethereum blockchain) Smart contract is a term used to describe computer program code that is capable of facilitating, executing, and enforcing the negotiation or performance of an agreement (i.e. contract) using blockchain technology. Contract logic triggered by data input  the batches of (unilateral) cubes
  • 15. 15 Removing central trust: approach broker-controlled “message count cubes”  each participant unilaterally reports data sent / received D D D IoT data flows VAS VAS Contract Compliance Data Discovery Service Vocabulary / Ontology Negotiation and Agreement Service Registration / Identity Service Settlement / Reputation Management Setup Exec Analyse D D D IoT data flows VAS VAS Contract Compliance Data Discovery Service Vocabulary / Ontology Negotiation and Agreement Service Registration / Identity Service Settlement / Reputation Management Setup Exec Analyse - Identities - Contracts - Unilateral traffic reports Sm art C ontract Sm art C ontract Sm art C ontract Sm art C ontract Unilateral cubes cubep cubes
  • 16. 16 Evaluation Where should cubes data live? - Off chain: cubes remain natively located within participants’ trusted zones - Oraclize (Ethereum-specfic mechanism) - Adds to cost of Smart Contract execution - guarantees the authenticity and integrity of the retrieved data. - Oraclize requires a query fee: 0.01$ to 0.04$ - On chain: transactions embed the cubes in the blockchain - No cost but adds to transaction size Execution cost of cube settlement operations How much does it cost to run the settlement smart contract?
  • 17. 17 Evaluation Overhead: (cost of contract execution) x (settlement rate) - cost of contract execution  cube size)  #PROD X #CONS x #TOPICS (possibly compressed) 1 10 100 1000 10000 0 5 10 15 20 25 Datatransferrate Cube settlement operations 0.0000010 0.0000038 0.0000153 0.0000610 0.0002441 0.0009766 0.0039063 0.0156250 0.0625000 0.2500000 Dataprice[ETH] Impact of overhead on unit cost for varying settlement rate and data transfer rate
  • 18. 18 Evaluation Cost per message for varying data volume, settlement rate, gas price 0.0000002 0.0000010 0.0000038 0.0000153 0.0000610 0.0002441 0.0009766 0.0039063 200 400 600 800 1000 1200 1400 1600 1800 2000 0.00006 0.00024 0.00098 0.00391 0.01563 0.06250 0.25000 1.00000 Cubesettlementcost[ETH] Cubesettlementcost[USD] Data transfer rate 1 settlement at low gas price 5 settlements at low gas price 1 settlement at avg gas price 5 settlements at avg gas price off-chain data (retrieved using Oraclize with additional cost overhead) Estimated data prices 0.0000002 0.0000010 0.0000038 0.0000153 0.0000610 0.0002441 0.0009766 0.0039063 200 400 600 800 1000 1200 1400 1600 1800 2000 0.00006 0.00024 0.00098 0.00391 0.01563 0.06250 0.25000 1.00000 Cubesettlementcost[ETH] Cubesettlementcost[USD] Data transfer rate 1 settlement at low gas price 5 settlements at low gas price 1 settlement at avg gas price 5 settlements at avg gas price on-chain data
  • 19. 19 Open challenges Fairness in the presence of malicious behaviour  reputation model: based on history of disagreements on past transactions What’s in a trading agreement? • From atomic data trading (single message) to complex SLA Think “follow-your-runner” System challenges: • Evolving Smart Contract technology: Ethereum vs Hyperledger • Public vs permissioned blockchains • Scalability
  • 20. 20 Thank you Contact: Paolo Missier School of Computing Newcastle University, UK paolo.missier@ncl.ac.uk http://tinyurl.com/paolomissier

Editor's Notes

  1. Value Added Services (VAS) shall emerge that have both the technical capability and the business motivation to  continuously acquire and analyse data streams produced on a massive scale by millions of connected devices, with the aim to extract a variety of types of value-added knowledge from them.
  2. The native eco-system consists of devices owned by TFL (the ticket gates), which generate persons data (assumed granular and anonymous) which is available to TFL. Secondary markets for this data may include for instance a taxi company that is interested in reacting to anomalous passenger traffic flows through the tube stations, by proactively positioning its fleet outside certain stations at certain times of the day. Also, when some demographics is known about the users, advertisers may be interested in accessing the data for accurate ad positioning.
  3. Garmin (2) controls access to the data generated by its devices (1). Currently, data flows natively from device to its server where Garmin makes it accessible to third parties through a service interface (2->3). An example of (3) is Strava, a social network for fitness that does not own any of the devices. It receives data from Garmin and offers VAS to Garmin users (for instance, advanced analytics). This is a secondary market for individuals’ data, which benefits (1) the individuals, who see added value from their device, (2) Garmin, who hopes to sell more devices, and (3) Strava, who may have a separate business model out of its analytics. In this scenario, Strava may be able to resell this data, in particular about individuals’ commuting habits, further down the chain to VAS (4) such as city planners.
  4. a 2012 survey of data vendors [6], for example, includes 46 data suppliers, however the definition of data marketplace used in the paper is generic (“a platform on which anybody can upload and maintain data sets, with license-regulated access to and use of the data”) and geared towards static data, like Microsoft’s Azure Data Market.
  5. Solidity Runs on the EVM (Ethereum Virtual Machine) Executing Smart Contracts on the Ethereum network incurs a fee (in Ether) Operations consume a fixed amount of Gas Miners fees are proportional to the amount of Gas used - Every transaction specifies the Gas price it is willing to pay High price  high incentive  faster transaction execution
  6. Oraclize 3 to 11 times more expensive adapted the open-source Mosquitto MQTT broke to support message logging and cubes generation into a Cassandra NoSQL database. real producers using channels provided by the ThingSpeak platform https://thingspeak.com Using the TrackerDB, we are able to simulate the generation of unilateral cubes that can be either complete and correct, or reflect malicious behaviour, for evaluation purposes. Smart Contracts interact with the service through an Ethereum-specific mechanism
  7.  fisso un intervallo T e vario il data rate quindi il numero di messaggi totali che passano durante T. Poi decido ogni quanto fare settlement durante T.  A  sx ho un settlement solo (una invocazione del contract) ma un costo / message alto quando ho un transfer rate basso (cioè’ pochi messaggi) che scende quando aumento il rate cioè’ il numero di messaggi.  A dx scelgo di fare settlement spesso, quindi pago per diciamo 20 invocazioni, e di nuovo il costo e’ molto alto se ho pochi message (low rate) e scende quando ne ho molti (high rate).