SlideShare a Scribd company logo
Big Data +
Blockchain
in HealthIT
Building a solution with Bluemix’s Blockchain and
Big Data
Introduction
● This project started during the Blockchain in Healthcare Code-A-Thon in Washington DC for the
2017 Blockchain in Healthcare Summit.
● Overall, the goal was to develop a solution that enhances interoperability and demonstrates the
potential to seamlessly incorporate blockchain technology into existing HealthIT systems.
●
Specifically, I took the Metadata Tagging and Policy Expression track, to demonstrate the use of of
blockchain for security metadata and tagging to manage access, provide auditing and provenance
information.
●
I took my lead from the Blockchain For Health Data and Its Potential Use in Health IT and Health
Care Related Research paper,
Our proposal involves the use of a public blockchain as an access-control manager to health records that are
stored off blockchain.
….
There are currently no open standards or implementations of blockchain that utilize this approach but
research supports the feasibility of the proposed solution.
● My goal was to provide an implementation that could be used by corporate IT departments in the
Health space with an eye to Innovation Risk Management.
– Don’t change more than you absolutely must
– New technologies fail in the enterprise because of Governance standards issues rather than technical issues
– Don’t let the perfect be the enemy of the good
Current Architecture
● According to the Blockchain for HealthcareIT paper, any blockchain for
health care would
– Need to be public
– need to include technological solutions for three key elements:
● Scalability
● Access security
●
Data privacy
● According to the MIT paper on decentralizing privacy, the off-blockchain
storage needed for scalability should be on a similar data store
following the Kademlia specification for a distributed hash table (DHT).
● The overall architecture would look similar to the diagram on the
following page with services as a proxy for corporate health IT
departments and a public blockchain and, presumably, DHT.
Perspective
● Private blockchains are technically far inferior to public blockchain implementation. If it was
only a technical discussion, we would be able to move on. But technical will always take a
backseat to the perception of regulatory requirement. Best to take a page from Supply
Chain Management and EDI and make the specification, not the implementation, public.
– Prefer publicly available standards to publicly available systems.
● Try not to introduce a new technology to an IT departments with a new programming
language. R3 spent $53M to produce a framework in Kotlin. Solidity, Ethereum?
● Regulated IT departments distrust external data the way mice distrust tigers. They will
launch a POC, but it’ll stay a science project.
Proposal
● We’ve never had a data revolution and we’re not getting one now.
– When we needed more flexible data models, we added relational database
in addition to mainframes.
– When we needed to aggregate enterprise data, we added data warehouses.
– When we needed to store internet-scale data, we added Hadoop clusters.
– When we needed to react to device-scale data in real-time, we added Spark.
– Now we need to manage trust externally at scale.
● How do we make this merely additive?
TaLIa
Trust Layer Injection
The voice of the patient
emboldened by blockchain
Blockchain Model for Healthcare
● I am going to suggest a blueprint that hopefully follows the path of least resistance through your
enterprise data governance stage gates.
● I am going to make a few datacenter assumptions:
– There is an onsite Hadoop data lake
– The data lake is secured
●
Authentication – Apache Knox
●
Authorization – Apache Ranger
●
Audit – Apache Ranger
●
Data Protection – Apache Ranger KMS
– You have the tools in place for a standard Lambda architecture (Hbase for speed, HDFS for batch, Hive/Impala for
reporting). Hbase 0.98+ is really the only mandatory component, though.
– There is an ability to consume and serve public RESTful APIs, but they have to go through a some sort of
integration layer
●
We will be building the chaincode using IBM’s Hyperledger Fabric, so hopefully your organization
– allows Apache-licensed open-source apps
– primarily codes in JVM languages (Java, Scala)
– having an open mind to Google’s Go is a really, really nice to have.
Innovation Risk Management
Trust
innovation
High Level Architecure
Specification
Healthcare IT + MIT Paper
Implementation
Hadoop Datalake + Blockchain-based Authorization
Interface
Blockchain-based Contract for Inversion of Trust
Apps
RESTful API
Application Layer
● At the API level, the MIT Enigma project provides the guidance we need to build our APIs.
– Note that in all of these examples, it is assumed that the user has previously created a profile in the company’s existing identity
and access management system.
● A user installs an application to access their EHR from the provider’s system.
● When the user signs up, a new compound identity (user, company) is created and sent, with predefined
permissions, to the blockchain.
● Data sent from the user to the system is encrypted using this compound identity.
● Permissions can be set at any time by the user, limited by certain corporate rules (no one is allowed to request
that PHI be made public, for example.)
● Services that request permission to query data, for research purposes for example, also create a compound
identity.
● Requests for data return some anonymized subset of data. The principal of least exposure will be adhered to
based on the corporate governance strategy and the user’s preference. Any exposed data must be agreed to be
exposed by both the company and the user.
● Note that a compound key is created, just like the Enigma model where a public blockchain is used. While
technically this would not be necessary in a private blockchain, this is meant to be consumed by another company
with their own private blockchain, or even a consortium with a public blockchain. Regardless, we are implementing
locally but planning to distribute globally.
Interface Layer
●
Data that is sent from a device to the system is stored on Kafka.
●
A Spark job will pull the data from the Kafka topic, create a key for Hbase, and then store the
data in both Hbase and Blockchain.
●
HBase is going to rely on cell-based tags to enforce the terms of the blockchain contract by
providing role-based visibility at the datum level. This is the injection part of of trust injection.
●
When a record is first created in Hbase through this system, this record will be persisted but not
yet visible. Hbase is extremely fast while blockchain is extremely slow. However, the assumption
will be that eventual trust will be sufficient for most use cases. This is an asynchronous system
with a fairly hefty latency.
●
Periodically, through some scheduler like Oozie, the blockchain will be queried to retrieve
records that have been authenticated but not yet minimized. Those records will compare a have
of their content against a hash of the Hbase content using the rowkey as a match. When there is
a match, the cell-level visibulity will be updated in Hbase and the blockchain will just store a
pointer to the Hbase record, rather than the full dataset.
Implementation
● Hbase is a sparse, distributed, persistent multidimensional sorted map, which is indexed by a row key,
column key, and a timestamp. Row keys are unique. Cells store data as key value pairs and are stored
in a column family.
● Every cell can have zero or more tags. Every tag has a type and the actual tag byte array. Just as row
keys, column families, qualifiers and values can be encoded , tags can also be encoded as well. You can
enable or disable tag encoding at the level of the column family, and it is enabled by default.
● Hbase provides two levels of cell level access control
– Cell level ACL
● Sets RW using Role-based Access Control (RBAC)
– Cell Level Visibility
● Allows labels to be associated with the data cells. Cells across rows and columns can have visibility
labels. Users or groups can be granted authorization to the labels.
● For example, certain cells can have a visibility label called ‘private’. Only users granted
authorization to ‘private’ can access the labeled data.
● Another example, patient or customer data can be labeled ‘branch1′, branch2’ etc based on where
the patient or customer is located or registered. A doctor or administrator can only access the data
that he or she is authorized.
Implementation
● Interpreting the labels authenticated for a given get/scan request is a pluggable algorithm.
By creating an algorith, it will be possible to add metadata management to this process,
using an existing corporate metadata repository, a tool such as Cloudera navigator, or even
storing the metadata in a column family on the same Hbase row.
● The ID of the blockchain serves as a pointer to the Hbase record. Access to Hbase is not
done through the blockchain, so its necessary to reflect blockchain rules at this level.
● Apache Ranger is used to manage role-based and tag based solutions and can also store
and operate on metadata. At this time, only Hive and HDFS can have their access and
auditing needs handled by Ranger. This functionality will need to be provided in order to
enable this level of security out of the box.
Summary
● The tools to implement blockchain are, to some extent, already available
in most Health Care enterprises.
● While a network of loosely connected private blockchains
communicating though a RESTful APIs conforming to a public
specification is not ideal from a technical perspective, it is more likely to
be implemented into an existing IT infrastructure.
● The goal of having the patient being involved in their own EHR is
preserved though the implementation of IBM’s Hyperledger Fabric using
MIT’s specifications.
● Hbase can be used as the off-blockchain storage. While it is not as
seamless as storing to another distributed hash tag, it does provide the
ability to store and manage metadata and providence at the cell level.

More Related Content

What's hot

The Soft Grid 2013 Opening Presentation
The Soft Grid 2013 Opening PresentationThe Soft Grid 2013 Opening Presentation
The Soft Grid 2013 Opening Presentation
GTMevents
 
Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_
Tina Zhang
 
Michael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - ParstreamMichael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - Parstream
Business of Software Conference
 
Transforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform StrategyTransforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform Strategy
Databricks
 
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdasBig data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Prof Dr Mehmed ERDAS
 
Conduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization ToolConduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization Tool
Ruthie Senanayake
 
Whitepaper - Transforming the Energy & Utilities Industry with Smart Analytics
Whitepaper - Transforming the Energy & Utilities Industry with Smart AnalyticsWhitepaper - Transforming the Energy & Utilities Industry with Smart Analytics
Whitepaper - Transforming the Energy & Utilities Industry with Smart Analytics
eInfochips (An Arrow Company)
 
MachinePulse Products
MachinePulse ProductsMachinePulse Products
MachinePulse Products
MachinePulse
 
Momentum in Big Data, IoT and Machine Intelligence
Momentum in Big Data, IoT and Machine IntelligenceMomentum in Big Data, IoT and Machine Intelligence
Momentum in Big Data, IoT and Machine Intelligence
Shamshad Ansari
 
Real-time data integration to the cloud
Real-time data integration to the cloudReal-time data integration to the cloud
Real-time data integration to the cloud
Sankar Nagarajan
 
SmartCity StreamApp Platform: Real-time Information for Smart Cities and Tran...
SmartCity StreamApp Platform: Real-time Information for Smart Cities and Tran...SmartCity StreamApp Platform: Real-time Information for Smart Cities and Tran...
SmartCity StreamApp Platform: Real-time Information for Smart Cities and Tran...
Cubic Corporation
 
Research Problem Presentation - Research in Supply Chain Digital Twins
Research Problem Presentation - Research in Supply Chain Digital TwinsResearch Problem Presentation - Research in Supply Chain Digital Twins
Research Problem Presentation - Research in Supply Chain Digital Twins
Arwa Abougharib
 
GITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP PresentationGITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP Presentation
Pedro Pereira
 
IoT underthe hood
IoT underthe hoodIoT underthe hood
IoT underthe hood
Dave Callaghan
 
Realizing your AIOps goals with machine learning in Elastic
Realizing your AIOps goals with machine learning in ElasticRealizing your AIOps goals with machine learning in Elastic
Realizing your AIOps goals with machine learning in Elastic
Elasticsearch
 
Customer insights from telecom data using deep learning
Customer insights from telecom data using deep learning Customer insights from telecom data using deep learning
Customer insights from telecom data using deep learning
Armando Vieira
 
Indigo Capability Primer - Transformation Tools for Utilities
Indigo Capability Primer - Transformation Tools for Utilities Indigo Capability Primer - Transformation Tools for Utilities
Indigo Capability Primer - Transformation Tools for Utilities
Indigo Advisory Group
 
Data as the New Oil: Producing Value in the Oil and Gas Industry
 Data as the New Oil: Producing Value in the Oil and Gas Industry Data as the New Oil: Producing Value in the Oil and Gas Industry
Data as the New Oil: Producing Value in the Oil and Gas Industry
VMware Tanzu
 
Building Analytics: Energy Information Systems
Building Analytics: Energy Information SystemsBuilding Analytics: Energy Information Systems
Building Analytics: Energy Information SystemsE Source Companies, LLC
 
Real-time Energy Data Analytics with Storm
Real-time Energy Data Analytics with StormReal-time Energy Data Analytics with Storm
Real-time Energy Data Analytics with StormDataWorks Summit
 

What's hot (20)

The Soft Grid 2013 Opening Presentation
The Soft Grid 2013 Opening PresentationThe Soft Grid 2013 Opening Presentation
The Soft Grid 2013 Opening Presentation
 
Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_Predictive maintenance withsensors_in_utilities_
Predictive maintenance withsensors_in_utilities_
 
Michael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - ParstreamMichael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - Parstream
 
Transforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform StrategyTransforming GE Healthcare with Data Platform Strategy
Transforming GE Healthcare with Data Platform Strategy
 
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdasBig data analytics for telecom operators final use cases 0712-2014_prof_m erdas
Big data analytics for telecom operators final use cases 0712-2014_prof_m erdas
 
Conduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization ToolConduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization Tool
 
Whitepaper - Transforming the Energy & Utilities Industry with Smart Analytics
Whitepaper - Transforming the Energy & Utilities Industry with Smart AnalyticsWhitepaper - Transforming the Energy & Utilities Industry with Smart Analytics
Whitepaper - Transforming the Energy & Utilities Industry with Smart Analytics
 
MachinePulse Products
MachinePulse ProductsMachinePulse Products
MachinePulse Products
 
Momentum in Big Data, IoT and Machine Intelligence
Momentum in Big Data, IoT and Machine IntelligenceMomentum in Big Data, IoT and Machine Intelligence
Momentum in Big Data, IoT and Machine Intelligence
 
Real-time data integration to the cloud
Real-time data integration to the cloudReal-time data integration to the cloud
Real-time data integration to the cloud
 
SmartCity StreamApp Platform: Real-time Information for Smart Cities and Tran...
SmartCity StreamApp Platform: Real-time Information for Smart Cities and Tran...SmartCity StreamApp Platform: Real-time Information for Smart Cities and Tran...
SmartCity StreamApp Platform: Real-time Information for Smart Cities and Tran...
 
Research Problem Presentation - Research in Supply Chain Digital Twins
Research Problem Presentation - Research in Supply Chain Digital TwinsResearch Problem Presentation - Research in Supply Chain Digital Twins
Research Problem Presentation - Research in Supply Chain Digital Twins
 
GITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP PresentationGITEX Big Data Conference 2014 – SAP Presentation
GITEX Big Data Conference 2014 – SAP Presentation
 
IoT underthe hood
IoT underthe hoodIoT underthe hood
IoT underthe hood
 
Realizing your AIOps goals with machine learning in Elastic
Realizing your AIOps goals with machine learning in ElasticRealizing your AIOps goals with machine learning in Elastic
Realizing your AIOps goals with machine learning in Elastic
 
Customer insights from telecom data using deep learning
Customer insights from telecom data using deep learning Customer insights from telecom data using deep learning
Customer insights from telecom data using deep learning
 
Indigo Capability Primer - Transformation Tools for Utilities
Indigo Capability Primer - Transformation Tools for Utilities Indigo Capability Primer - Transformation Tools for Utilities
Indigo Capability Primer - Transformation Tools for Utilities
 
Data as the New Oil: Producing Value in the Oil and Gas Industry
 Data as the New Oil: Producing Value in the Oil and Gas Industry Data as the New Oil: Producing Value in the Oil and Gas Industry
Data as the New Oil: Producing Value in the Oil and Gas Industry
 
Building Analytics: Energy Information Systems
Building Analytics: Energy Information SystemsBuilding Analytics: Energy Information Systems
Building Analytics: Energy Information Systems
 
Real-time Energy Data Analytics with Storm
Real-time Energy Data Analytics with StormReal-time Energy Data Analytics with Storm
Real-time Energy Data Analytics with Storm
 

Viewers also liked

Adedanha matemática
Adedanha matemáticaAdedanha matemática
Adedanha matemática
Janyedja Carvalho de Andrade
 
Reproduccion sexual y asexual
Reproduccion sexual y asexualReproduccion sexual y asexual
Reproduccion sexual y asexual
Juan Carlos Garcia Roldan
 
A janelinha do kaua 1
A janelinha do kaua 1A janelinha do kaua 1
A janelinha do kaua 1
SimoneHelenDrumond
 
Evaluation
EvaluationEvaluation
Evaluation
Richie Merkz
 
Palitos divertidos de coordenação motora
Palitos divertidos de coordenação motoraPalitos divertidos de coordenação motora
Palitos divertidos de coordenação motora
SimoneHelenDrumond
 
Revista bolinia news o lúdico na construção da aprendizagem para autistas
Revista bolinia news o lúdico na construção da aprendizagem para autistasRevista bolinia news o lúdico na construção da aprendizagem para autistas
Revista bolinia news o lúdico na construção da aprendizagem para autistas
SimoneHelenDrumond
 
Divine Sight 2
Divine Sight 2Divine Sight 2
Divine Sight 2
Sam Hager
 
Pareamento números e quantidades
Pareamento números e quantidadesPareamento números e quantidades
Pareamento números e quantidades
SimoneHelenDrumond
 
Palitos quebra cabeça das cores
Palitos quebra cabeça das coresPalitos quebra cabeça das cores
Palitos quebra cabeça das cores
SimoneHelenDrumond
 
Applying Machine Learning using H2O
Applying Machine Learning using H2OApplying Machine Learning using H2O
Applying Machine Learning using H2O
Sri Ambati
 
17 01-24 ASOCIACION KID´S KITCAR - GREEN POWER IBERIA
17 01-24 ASOCIACION KID´S KITCAR - GREEN POWER IBERIA17 01-24 ASOCIACION KID´S KITCAR - GREEN POWER IBERIA
17 01-24 ASOCIACION KID´S KITCAR - GREEN POWER IBERIA
PRIMITIVO CARRANZA TORME
 
Formulas de estadistica y probabilidades
Formulas de estadistica y probabilidadesFormulas de estadistica y probabilidades
Formulas de estadistica y probabilidades
ederelreyrata
 
Technology in the early years
Technology in the early yearsTechnology in the early years
Technology in the early years
Rebecca Davies
 
Actividad
ActividadActividad
Actividad
Eva Gallardo
 
Bertha 6g 2
Bertha 6g 2Bertha 6g 2
Actividad Eva
Actividad Eva Actividad Eva
Actividad Eva
Eva Gallardo
 
Startup Series #1: What To Do During Formation
Startup Series #1: What To Do During FormationStartup Series #1: What To Do During Formation
Startup Series #1: What To Do During Formation
Erin McClarty
 
Bertha act10
Bertha act10Bertha act10
Pablo sanchez diapositivas
Pablo sanchez diapositivasPablo sanchez diapositivas
Pablo sanchez diapositivas
Pablo Sanchez
 

Viewers also liked (19)

Adedanha matemática
Adedanha matemáticaAdedanha matemática
Adedanha matemática
 
Reproduccion sexual y asexual
Reproduccion sexual y asexualReproduccion sexual y asexual
Reproduccion sexual y asexual
 
A janelinha do kaua 1
A janelinha do kaua 1A janelinha do kaua 1
A janelinha do kaua 1
 
Evaluation
EvaluationEvaluation
Evaluation
 
Palitos divertidos de coordenação motora
Palitos divertidos de coordenação motoraPalitos divertidos de coordenação motora
Palitos divertidos de coordenação motora
 
Revista bolinia news o lúdico na construção da aprendizagem para autistas
Revista bolinia news o lúdico na construção da aprendizagem para autistasRevista bolinia news o lúdico na construção da aprendizagem para autistas
Revista bolinia news o lúdico na construção da aprendizagem para autistas
 
Divine Sight 2
Divine Sight 2Divine Sight 2
Divine Sight 2
 
Pareamento números e quantidades
Pareamento números e quantidadesPareamento números e quantidades
Pareamento números e quantidades
 
Palitos quebra cabeça das cores
Palitos quebra cabeça das coresPalitos quebra cabeça das cores
Palitos quebra cabeça das cores
 
Applying Machine Learning using H2O
Applying Machine Learning using H2OApplying Machine Learning using H2O
Applying Machine Learning using H2O
 
17 01-24 ASOCIACION KID´S KITCAR - GREEN POWER IBERIA
17 01-24 ASOCIACION KID´S KITCAR - GREEN POWER IBERIA17 01-24 ASOCIACION KID´S KITCAR - GREEN POWER IBERIA
17 01-24 ASOCIACION KID´S KITCAR - GREEN POWER IBERIA
 
Formulas de estadistica y probabilidades
Formulas de estadistica y probabilidadesFormulas de estadistica y probabilidades
Formulas de estadistica y probabilidades
 
Technology in the early years
Technology in the early yearsTechnology in the early years
Technology in the early years
 
Actividad
ActividadActividad
Actividad
 
Bertha 6g 2
Bertha 6g 2Bertha 6g 2
Bertha 6g 2
 
Actividad Eva
Actividad Eva Actividad Eva
Actividad Eva
 
Startup Series #1: What To Do During Formation
Startup Series #1: What To Do During FormationStartup Series #1: What To Do During Formation
Startup Series #1: What To Do During Formation
 
Bertha act10
Bertha act10Bertha act10
Bertha act10
 
Pablo sanchez diapositivas
Pablo sanchez diapositivasPablo sanchez diapositivas
Pablo sanchez diapositivas
 

Similar to Big data and Blockchain in HealthIT

Blockchain, Hyperledger and the Oracle Blockchain Platform
Blockchain, Hyperledger and the Oracle Blockchain PlatformBlockchain, Hyperledger and the Oracle Blockchain Platform
Blockchain, Hyperledger and the Oracle Blockchain Platform
Juarez Junior
 
How to Build a Decentralized Blockchain App with the Oracle Blockchain Platform
How to Build a Decentralized BlockchainApp with the Oracle Blockchain PlatformHow to Build a Decentralized BlockchainApp with the Oracle Blockchain Platform
How to Build a Decentralized Blockchain App with the Oracle Blockchain Platform
Juarez Junior
 
77201924
7720192477201924
77201924
IJRAT
 
77201924
7720192477201924
77201924
IJRAT
 
Oracle Blockchain Experience Day
Oracle Blockchain Experience DayOracle Blockchain Experience Day
Oracle Blockchain Experience Day
Juarez Junior
 
Decentralized App Blockchain
Decentralized App BlockchainDecentralized App Blockchain
Decentralized App Blockchain
Decentralized applications development
 
Oracle Blockchain Cloud Service
Oracle Blockchain Cloud ServiceOracle Blockchain Cloud Service
Oracle Blockchain Cloud Service
Monte Kluemper
 
Blockchain & Security in Oracle by Emmanuel Abiodun
Blockchain & Security in Oracle by Emmanuel AbiodunBlockchain & Security in Oracle by Emmanuel Abiodun
Blockchain & Security in Oracle by Emmanuel Abiodun
Vishwas Manral
 
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
Cloudera, Inc.
 
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
Cloudera, Inc.
 
hyperledger-chaincode & hyperl fabric.pptx
hyperledger-chaincode & hyperl fabric.pptxhyperledger-chaincode & hyperl fabric.pptx
hyperledger-chaincode & hyperl fabric.pptx
deepaksingh160910
 
IRJET- Secured Hadoop Environment
IRJET- Secured Hadoop EnvironmentIRJET- Secured Hadoop Environment
IRJET- Secured Hadoop Environment
IRJET Journal
 
Episode 3: Better on Blockchain | Office Suite
Episode 3: Better on Blockchain | Office SuiteEpisode 3: Better on Blockchain | Office Suite
Episode 3: Better on Blockchain | Office Suite
Michael Pirovano
 
Cloudera federal summit
Cloudera federal summitCloudera federal summit
Cloudera federal summit
Matt Carroll
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overview
vhrocca
 
CouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big DataCouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big Data
Debajani Mohanty
 
Key aspects of big data storage and its architecture
Key aspects of big data storage and its architectureKey aspects of big data storage and its architecture
Key aspects of big data storage and its architecture
Rahul Chaturvedi
 
Blockchain Technology ,Architecture and its Structure
Blockchain Technology ,Architecture and its StructureBlockchain Technology ,Architecture and its Structure
Blockchain Technology ,Architecture and its Structure
kasthurimukila
 
IBM Blockchain Platform - Architectural Good Practices v1.0
IBM Blockchain Platform - Architectural Good Practices v1.0IBM Blockchain Platform - Architectural Good Practices v1.0
IBM Blockchain Platform - Architectural Good Practices v1.0
Matt Lucas
 
CHAPTER 12 Integrating Non-Blockchain Apps with Ethereum
CHAPTER 12  Integrating Non-Blockchain Apps with Ethereum     CHAPTER 12  Integrating Non-Blockchain Apps with Ethereum
CHAPTER 12 Integrating Non-Blockchain Apps with Ethereum
EstelaJeffery653
 

Similar to Big data and Blockchain in HealthIT (20)

Blockchain, Hyperledger and the Oracle Blockchain Platform
Blockchain, Hyperledger and the Oracle Blockchain PlatformBlockchain, Hyperledger and the Oracle Blockchain Platform
Blockchain, Hyperledger and the Oracle Blockchain Platform
 
How to Build a Decentralized Blockchain App with the Oracle Blockchain Platform
How to Build a Decentralized BlockchainApp with the Oracle Blockchain PlatformHow to Build a Decentralized BlockchainApp with the Oracle Blockchain Platform
How to Build a Decentralized Blockchain App with the Oracle Blockchain Platform
 
77201924
7720192477201924
77201924
 
77201924
7720192477201924
77201924
 
Oracle Blockchain Experience Day
Oracle Blockchain Experience DayOracle Blockchain Experience Day
Oracle Blockchain Experience Day
 
Decentralized App Blockchain
Decentralized App BlockchainDecentralized App Blockchain
Decentralized App Blockchain
 
Oracle Blockchain Cloud Service
Oracle Blockchain Cloud ServiceOracle Blockchain Cloud Service
Oracle Blockchain Cloud Service
 
Blockchain & Security in Oracle by Emmanuel Abiodun
Blockchain & Security in Oracle by Emmanuel AbiodunBlockchain & Security in Oracle by Emmanuel Abiodun
Blockchain & Security in Oracle by Emmanuel Abiodun
 
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
 
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
HBaseCon 2013: Using Coprocessors to Index Columns in an Elasticsearch Cluster
 
hyperledger-chaincode & hyperl fabric.pptx
hyperledger-chaincode & hyperl fabric.pptxhyperledger-chaincode & hyperl fabric.pptx
hyperledger-chaincode & hyperl fabric.pptx
 
IRJET- Secured Hadoop Environment
IRJET- Secured Hadoop EnvironmentIRJET- Secured Hadoop Environment
IRJET- Secured Hadoop Environment
 
Episode 3: Better on Blockchain | Office Suite
Episode 3: Better on Blockchain | Office SuiteEpisode 3: Better on Blockchain | Office Suite
Episode 3: Better on Blockchain | Office Suite
 
Cloudera federal summit
Cloudera federal summitCloudera federal summit
Cloudera federal summit
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overview
 
CouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big DataCouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big Data
 
Key aspects of big data storage and its architecture
Key aspects of big data storage and its architectureKey aspects of big data storage and its architecture
Key aspects of big data storage and its architecture
 
Blockchain Technology ,Architecture and its Structure
Blockchain Technology ,Architecture and its StructureBlockchain Technology ,Architecture and its Structure
Blockchain Technology ,Architecture and its Structure
 
IBM Blockchain Platform - Architectural Good Practices v1.0
IBM Blockchain Platform - Architectural Good Practices v1.0IBM Blockchain Platform - Architectural Good Practices v1.0
IBM Blockchain Platform - Architectural Good Practices v1.0
 
CHAPTER 12 Integrating Non-Blockchain Apps with Ethereum
CHAPTER 12  Integrating Non-Blockchain Apps with Ethereum     CHAPTER 12  Integrating Non-Blockchain Apps with Ethereum
CHAPTER 12 Integrating Non-Blockchain Apps with Ethereum
 

More from Dave Callaghan

Big Brother Big Sister Bluemix Architecture from #HackathonCLT
Big Brother Big Sister Bluemix Architecture from #HackathonCLTBig Brother Big Sister Bluemix Architecture from #HackathonCLT
Big Brother Big Sister Bluemix Architecture from #HackathonCLT
Dave Callaghan
 
Stormwater analytics with MongoDB and Pentaho
Stormwater analytics with MongoDB and PentahoStormwater analytics with MongoDB and Pentaho
Stormwater analytics with MongoDB and Pentaho
Dave Callaghan
 
MongoDB – Build, Adapt, Reduce, Improve
MongoDB – Build, Adapt, Reduce, ImproveMongoDB – Build, Adapt, Reduce, Improve
MongoDB – Build, Adapt, Reduce, Improve
Dave Callaghan
 
MongoDB - Build, Adapt, Reduce, Improve
MongoDB - Build, Adapt, Reduce, ImproveMongoDB - Build, Adapt, Reduce, Improve
MongoDB - Build, Adapt, Reduce, Improve
Dave Callaghan
 
Orphans in the Desert Presentation
Orphans in the Desert PresentationOrphans in the Desert Presentation
Orphans in the Desert PresentationDave Callaghan
 

More from Dave Callaghan (9)

Big Brother Big Sister Bluemix Architecture from #HackathonCLT
Big Brother Big Sister Bluemix Architecture from #HackathonCLTBig Brother Big Sister Bluemix Architecture from #HackathonCLT
Big Brother Big Sister Bluemix Architecture from #HackathonCLT
 
Stormwater analytics with MongoDB and Pentaho
Stormwater analytics with MongoDB and PentahoStormwater analytics with MongoDB and Pentaho
Stormwater analytics with MongoDB and Pentaho
 
MongoDB – Build, Adapt, Reduce, Improve
MongoDB – Build, Adapt, Reduce, ImproveMongoDB – Build, Adapt, Reduce, Improve
MongoDB – Build, Adapt, Reduce, Improve
 
MongoDB - Build, Adapt, Reduce, Improve
MongoDB - Build, Adapt, Reduce, ImproveMongoDB - Build, Adapt, Reduce, Improve
MongoDB - Build, Adapt, Reduce, Improve
 
SegmentOfOne
SegmentOfOneSegmentOfOne
SegmentOfOne
 
BigFastData
BigFastDataBigFastData
BigFastData
 
Orphans in the Desert Presentation
Orphans in the Desert PresentationOrphans in the Desert Presentation
Orphans in the Desert Presentation
 
AtlasCHUG
AtlasCHUGAtlasCHUG
AtlasCHUG
 
BigDataInTelco
BigDataInTelcoBigDataInTelco
BigDataInTelco
 

Recently uploaded

Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
alex933524
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 

Recently uploaded (20)

Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 

Big data and Blockchain in HealthIT

  • 1. Big Data + Blockchain in HealthIT Building a solution with Bluemix’s Blockchain and Big Data
  • 2. Introduction ● This project started during the Blockchain in Healthcare Code-A-Thon in Washington DC for the 2017 Blockchain in Healthcare Summit. ● Overall, the goal was to develop a solution that enhances interoperability and demonstrates the potential to seamlessly incorporate blockchain technology into existing HealthIT systems. ● Specifically, I took the Metadata Tagging and Policy Expression track, to demonstrate the use of of blockchain for security metadata and tagging to manage access, provide auditing and provenance information. ● I took my lead from the Blockchain For Health Data and Its Potential Use in Health IT and Health Care Related Research paper, Our proposal involves the use of a public blockchain as an access-control manager to health records that are stored off blockchain. …. There are currently no open standards or implementations of blockchain that utilize this approach but research supports the feasibility of the proposed solution. ● My goal was to provide an implementation that could be used by corporate IT departments in the Health space with an eye to Innovation Risk Management. – Don’t change more than you absolutely must – New technologies fail in the enterprise because of Governance standards issues rather than technical issues – Don’t let the perfect be the enemy of the good
  • 3. Current Architecture ● According to the Blockchain for HealthcareIT paper, any blockchain for health care would – Need to be public – need to include technological solutions for three key elements: ● Scalability ● Access security ● Data privacy ● According to the MIT paper on decentralizing privacy, the off-blockchain storage needed for scalability should be on a similar data store following the Kademlia specification for a distributed hash table (DHT). ● The overall architecture would look similar to the diagram on the following page with services as a proxy for corporate health IT departments and a public blockchain and, presumably, DHT.
  • 4.
  • 5. Perspective ● Private blockchains are technically far inferior to public blockchain implementation. If it was only a technical discussion, we would be able to move on. But technical will always take a backseat to the perception of regulatory requirement. Best to take a page from Supply Chain Management and EDI and make the specification, not the implementation, public. – Prefer publicly available standards to publicly available systems. ● Try not to introduce a new technology to an IT departments with a new programming language. R3 spent $53M to produce a framework in Kotlin. Solidity, Ethereum? ● Regulated IT departments distrust external data the way mice distrust tigers. They will launch a POC, but it’ll stay a science project.
  • 6. Proposal ● We’ve never had a data revolution and we’re not getting one now. – When we needed more flexible data models, we added relational database in addition to mainframes. – When we needed to aggregate enterprise data, we added data warehouses. – When we needed to store internet-scale data, we added Hadoop clusters. – When we needed to react to device-scale data in real-time, we added Spark. – Now we need to manage trust externally at scale. ● How do we make this merely additive?
  • 7. TaLIa Trust Layer Injection The voice of the patient emboldened by blockchain
  • 8. Blockchain Model for Healthcare ● I am going to suggest a blueprint that hopefully follows the path of least resistance through your enterprise data governance stage gates. ● I am going to make a few datacenter assumptions: – There is an onsite Hadoop data lake – The data lake is secured ● Authentication – Apache Knox ● Authorization – Apache Ranger ● Audit – Apache Ranger ● Data Protection – Apache Ranger KMS – You have the tools in place for a standard Lambda architecture (Hbase for speed, HDFS for batch, Hive/Impala for reporting). Hbase 0.98+ is really the only mandatory component, though. – There is an ability to consume and serve public RESTful APIs, but they have to go through a some sort of integration layer ● We will be building the chaincode using IBM’s Hyperledger Fabric, so hopefully your organization – allows Apache-licensed open-source apps – primarily codes in JVM languages (Java, Scala) – having an open mind to Google’s Go is a really, really nice to have.
  • 10. High Level Architecure Specification Healthcare IT + MIT Paper Implementation Hadoop Datalake + Blockchain-based Authorization Interface Blockchain-based Contract for Inversion of Trust Apps RESTful API
  • 11. Application Layer ● At the API level, the MIT Enigma project provides the guidance we need to build our APIs. – Note that in all of these examples, it is assumed that the user has previously created a profile in the company’s existing identity and access management system. ● A user installs an application to access their EHR from the provider’s system. ● When the user signs up, a new compound identity (user, company) is created and sent, with predefined permissions, to the blockchain. ● Data sent from the user to the system is encrypted using this compound identity. ● Permissions can be set at any time by the user, limited by certain corporate rules (no one is allowed to request that PHI be made public, for example.) ● Services that request permission to query data, for research purposes for example, also create a compound identity. ● Requests for data return some anonymized subset of data. The principal of least exposure will be adhered to based on the corporate governance strategy and the user’s preference. Any exposed data must be agreed to be exposed by both the company and the user. ● Note that a compound key is created, just like the Enigma model where a public blockchain is used. While technically this would not be necessary in a private blockchain, this is meant to be consumed by another company with their own private blockchain, or even a consortium with a public blockchain. Regardless, we are implementing locally but planning to distribute globally.
  • 12. Interface Layer ● Data that is sent from a device to the system is stored on Kafka. ● A Spark job will pull the data from the Kafka topic, create a key for Hbase, and then store the data in both Hbase and Blockchain. ● HBase is going to rely on cell-based tags to enforce the terms of the blockchain contract by providing role-based visibility at the datum level. This is the injection part of of trust injection. ● When a record is first created in Hbase through this system, this record will be persisted but not yet visible. Hbase is extremely fast while blockchain is extremely slow. However, the assumption will be that eventual trust will be sufficient for most use cases. This is an asynchronous system with a fairly hefty latency. ● Periodically, through some scheduler like Oozie, the blockchain will be queried to retrieve records that have been authenticated but not yet minimized. Those records will compare a have of their content against a hash of the Hbase content using the rowkey as a match. When there is a match, the cell-level visibulity will be updated in Hbase and the blockchain will just store a pointer to the Hbase record, rather than the full dataset.
  • 13. Implementation ● Hbase is a sparse, distributed, persistent multidimensional sorted map, which is indexed by a row key, column key, and a timestamp. Row keys are unique. Cells store data as key value pairs and are stored in a column family. ● Every cell can have zero or more tags. Every tag has a type and the actual tag byte array. Just as row keys, column families, qualifiers and values can be encoded , tags can also be encoded as well. You can enable or disable tag encoding at the level of the column family, and it is enabled by default. ● Hbase provides two levels of cell level access control – Cell level ACL ● Sets RW using Role-based Access Control (RBAC) – Cell Level Visibility ● Allows labels to be associated with the data cells. Cells across rows and columns can have visibility labels. Users or groups can be granted authorization to the labels. ● For example, certain cells can have a visibility label called ‘private’. Only users granted authorization to ‘private’ can access the labeled data. ● Another example, patient or customer data can be labeled ‘branch1′, branch2’ etc based on where the patient or customer is located or registered. A doctor or administrator can only access the data that he or she is authorized.
  • 14. Implementation ● Interpreting the labels authenticated for a given get/scan request is a pluggable algorithm. By creating an algorith, it will be possible to add metadata management to this process, using an existing corporate metadata repository, a tool such as Cloudera navigator, or even storing the metadata in a column family on the same Hbase row. ● The ID of the blockchain serves as a pointer to the Hbase record. Access to Hbase is not done through the blockchain, so its necessary to reflect blockchain rules at this level. ● Apache Ranger is used to manage role-based and tag based solutions and can also store and operate on metadata. At this time, only Hive and HDFS can have their access and auditing needs handled by Ranger. This functionality will need to be provided in order to enable this level of security out of the box.
  • 15. Summary ● The tools to implement blockchain are, to some extent, already available in most Health Care enterprises. ● While a network of loosely connected private blockchains communicating though a RESTful APIs conforming to a public specification is not ideal from a technical perspective, it is more likely to be implemented into an existing IT infrastructure. ● The goal of having the patient being involved in their own EHR is preserved though the implementation of IBM’s Hyperledger Fabric using MIT’s specifications. ● Hbase can be used as the off-blockchain storage. While it is not as seamless as storing to another distributed hash tag, it does provide the ability to store and manage metadata and providence at the cell level.

Editor's Notes

  1. 62.3.4. Implementing Your Own Visibility Label Algorithm https://hbase.apache.org/book.html#_securing_access_to_your_data