HBaseCon 2015: HBase as an IoT Stream Analytics Platform for Parkinson's Disease Research

HBaseCon
Enable breakthroughs in Parkinson
disease research through wearables and
Big Data analytics technologies
About us…
• Part of the Big Data Analytics Solutions group @Intel reporting to
the Data Center Group
• Developing products & solutions leveraging:
• Big Data edge-technologies
• Self developed machine learning & steam analytics algorithms
• Our team includes developers, data scientists and system analysts
• I am a Big Data Analytics Architect and Development Manager responsible for
leading-edge technology projects within Intel involving Big Data and stream
analytics solutions in the Internet of Things and Parkinson's disease research
2
How It All Started?
3
Big data analytics
IOT
Parkinson’s Disease
4
OVER AGE
0F 60
1/100 60,000
NEW
1M/US
5M/WORLD
NO CURE,
MEDICATION ONLY HELPS WITH
SYMPTOMSThere is
NO TEST
and no
PROGRESSION
MARKER
PARKINSON’S DISEASE IS CAUSED BY THE DEATH
OF DOPAMINE CELLS.
OF THESE CELLS ARE ALREADY LOST BY THE TIME
MOTOR SYMPTOMS APPEAR.
60 80%TO
Challenges To Address
NO
OBJECTIVE
MEASURE
3-6 MONTHS
BETWEEN
PHYSICIAN
VISITS
CHANGES ARE
SLOW
AND HARD TO
DETECT
AVERAGE
TRIAL SIZE
< 100
PATIENTS
VERY
SMALL
number of
patients
contribute
to research
COST OF
TRIALS
are in the
scales of
$M
5
HOW?
6
The Solution
Wear a watch Start an application
7
1 2
Use Cases
MANAGE
THE
DISEASE
USING
DATA
FREE DATA
FOR 1000’S
OF PATIENTS
ACCURATE
REPORT
SINCE LAST
VISIT
MEASURE
MEDICATION
EFFECT
RESEARCHER
PHARMACEUTICAL
CLINICIAN
INTEL BIG DATA CLOUDANALYSTICS
INSIGHT / VALUE
8
DEMO
9
THE APPLICATION
10
10
Medication
reporting
Medication
reminder
Report
something
PATIENT
REPORTED
OTHER
Configurable
data
collections
Contribution
score
Integrated
Login and
registration Pebble
notifications
OBJECTIVE
MEASURES
Gait
Sleep
Tremor
Activity Level
Controlled
Tests
BIG-DATA and IOT
TECHNOLOGIES
12
IoT Cloud Simplified Framework
DatacenterNetworkThing
Cloud Infrastructure
Data Platform
Analytics Platform
UI Services
Gateway
13
SERVICE LAYER
BATCH LAYER
STREAM ANALYICS LAYER
INGESTION LAYER
STORAGE LAYER
USER INTERFACE LAYER
Mosquitt
o
14
CLOUD COMPUTING SERVICES
Storage Layer
• Cloudera Enterprise Data Hub
• HBase as main scalable time series data storage layer
• Allows high writes throughput
• Random real-time access to stored data
• Highly available MySQL as metadata storage
15
• Multi-protocol pipeline built over AKKA & KAFKA
• KAKFA is a fast, scalable, durable & distributed messaging system
• AKKA is an Actor based framework allowing high concurrency, distributed
and resilient based on events / messaging
• This layer is responsible for:
• Pulling messages
• Parse & Process
• Concurrent & controlled write
HBase
Load Balancer
Device
Device
Device
Device
Mosquito
Mosquito
16
Data Ingestion Layer
• Based on AKKA actors framework
• Contains millions of concurrent actors handling different
streams and operations
• Each actor is a small peace of code performing its role
• A set of actors creates a topology which is
responsible for device’s data stream processing
17
Stream Analytics Layer
Subscriber Parser Aggregator
HBase
Writer
Analytics
Manager
Change
Detection
UnZip
Real Time
Rules
Sleep
Quality
• Based on Apache Spark over HBase
• Spark is a fast and general engine for large-scale data processing
• Algorithms & Calculations are being executed on large data sets on a daily basis
• Layer includes:
• Set of complex machine learning algorithms
• Rule engine rules baseline calculations
18
Batch Analytics Layer
• Interactive and scalable web services layer
• A set of RESTful APIs allowing:
• Registration to platform
• Row & calculated data retrieval from HBase
• Built on top of Play framework and
providing secured entry point
• Uses Apache Phoenix & native HBase client
HBase
Load Balancer
19
Service Layer
HBase
Challenges
20
Data Ingestion to HBase
• Challenge
Concurrent ingesting millions messages into HBase creates a massive load on
HBase region servers and causes disconnections
• Development Evolution
1. HBase client per topology (millions of writers)
2. Pool of HBase clients, each is using separate HTable
3. Pool of HBase clients, all are using same
HBase connection pool (HConnectionManager)
• Solution
Creating “fixed” number of connections to HBase, allowing batch writes and
load balancing
21
.
.
.
Hbase
Writer
Hbase
Writer
Hbase
Writer
.
..
Pool
Router
Table Indicators over Large Tables
• Challenge
Gathering indicators (i.e. counts) on large HBase tables results in long table scans
and performance reduction
• Solution
• Real time update new indicator columns using incrementColumnValue
• Allows atomic increment of a specific column
• Large table counts successfully implemented
• Allowed implementation of required Indicators
• Real time hourly counts
• Real time Max values (i.e. last time a user transmitted data)
22
Batch Processing Input Format
• Challenge
• Batch processing is done using Spark – InputFormat is required for scan
• TableInputFormat was used and is equivalent to a single scan
• Poor performance when data from “remote” parts of a table are required
• Solution
• Using MultiTableInputFormat
• Allows usage of multiple scans
• Successfully used with more than 100 scans per MultiTableInputFormat
23
ANALYTICS
24
Activity Level
• Measure that will continuously describe the
intensity of the patient’s activity throughout the day
and will motivate the patients to be more active.
• Motivates the patients to be more active (known to
be important for PD patients)
• Personalized measure per patient based on their
average activity of walking periods (avoid
frustration)
• Based on intensity measurement from the
accelerometer
• Filters our tremor as
25
Activity Level – An Example
26
Activity Level in Controlled Session (ON State)
Activity Level in Controlled Session (OFF State)
Tremor
• Tremor is one of the most obvious symptoms of PD
• Most PD patients experience tremor
• Tremor is detectable using signal processing
techniques
27
TRAILS AND PARTNERS
28
REAL PD
L-DOPA RESPONSE TRIAL
DATA GATHERING TRIAL
FOX INTEL APPLICATION TRIAL
1000
50
30
20
FOX INSIGHT WEAR 1000
20
20
30
20
10
29
Trial And Partners
SCRIPS TREMMOR TRIAL 1000
WHAT’S NEXT?
30
SCALE PLATFORM
• Scale to 1000’s of
patients in the US
• Scale to 1000’s of patients in the
Netherlands
• IOS support
• Support additional wearable's
• Build more value
generating capabilities
• Upgrade to HBase 1.0
• Upgrade Spark to 1.3
• Enrich Platform (i.e. Advanced
Export, Reporting)
• Enrich Parkinson Disease solution
• Analytics
• Value to patients
31
Q&A
Thank you!
33
• Strategic direction
34
R: 0
G: 112
B: 197
R: 247
G: 127
B: 0
R: 130
G: 170
B: 50
R: 0
G: 172
B: 240
1 of 34

Recommended

HBaseCon 2013: Near Real Time Indexing for eBay Search by
HBaseCon 2013: Near Real Time Indexing for eBay SearchHBaseCon 2013: Near Real Time Indexing for eBay Search
HBaseCon 2013: Near Real Time Indexing for eBay SearchCloudera, Inc.
5.9K views22 slides
Rolling Out Apache HBase for Mobile Offerings at Visa by
Rolling Out Apache HBase for Mobile Offerings at Visa Rolling Out Apache HBase for Mobile Offerings at Visa
Rolling Out Apache HBase for Mobile Offerings at Visa HBaseCon
2.6K views39 slides
Cassandra & Spark for IoT by
Cassandra & Spark for IoTCassandra & Spark for IoT
Cassandra & Spark for IoTMatthias Niehoff
2.3K views20 slides
Argus Production Monitoring at Salesforce by
Argus Production Monitoring at SalesforceArgus Production Monitoring at Salesforce
Argus Production Monitoring at SalesforceHBaseCon
3.2K views21 slides
Next Gen Big Data Analytics with Apache Apex by
Next Gen Big Data Analytics with Apache Apex Next Gen Big Data Analytics with Apache Apex
Next Gen Big Data Analytics with Apache Apex DataWorks Summit/Hadoop Summit
1.5K views27 slides
Druid @ branch by
Druid @ branch Druid @ branch
Druid @ branch Biswajit Das
1.5K views15 slides

More Related Content

What's hot

Distributed Stream Processing - Spark Summit East 2017 by
Distributed Stream Processing - Spark Summit East 2017Distributed Stream Processing - Spark Summit East 2017
Distributed Stream Processing - Spark Summit East 2017Petr Zapletal
883 views89 slides
DataStax and Esri: Geotemporal IoT Search and Analytics by
DataStax and Esri: Geotemporal IoT Search and AnalyticsDataStax and Esri: Geotemporal IoT Search and Analytics
DataStax and Esri: Geotemporal IoT Search and AnalyticsDataStax Academy
2K views45 slides
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C... by
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...Reynold Xin
18.1K views66 slides
Interactive Realtime Dashboards on Data Streams using Kafka, Druid and Superset by
Interactive Realtime Dashboards on Data Streams using Kafka, Druid and SupersetInteractive Realtime Dashboards on Data Streams using Kafka, Druid and Superset
Interactive Realtime Dashboards on Data Streams using Kafka, Druid and SupersetHortonworks
3.6K views36 slides
So you think you can stream.pptx by
So you think you can stream.pptxSo you think you can stream.pptx
So you think you can stream.pptxPrakash Chockalingam
542 views48 slides
HBaseCon 2013: ETL for Apache HBase by
HBaseCon 2013: ETL for Apache HBaseHBaseCon 2013: ETL for Apache HBase
HBaseCon 2013: ETL for Apache HBaseCloudera, Inc.
6.9K views25 slides

What's hot(20)

Distributed Stream Processing - Spark Summit East 2017 by Petr Zapletal
Distributed Stream Processing - Spark Summit East 2017Distributed Stream Processing - Spark Summit East 2017
Distributed Stream Processing - Spark Summit East 2017
Petr Zapletal883 views
DataStax and Esri: Geotemporal IoT Search and Analytics by DataStax Academy
DataStax and Esri: Geotemporal IoT Search and AnalyticsDataStax and Esri: Geotemporal IoT Search and Analytics
DataStax and Esri: Geotemporal IoT Search and Analytics
DataStax Academy2K views
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C... by Reynold Xin
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
Reynold Xin18.1K views
Interactive Realtime Dashboards on Data Streams using Kafka, Druid and Superset by Hortonworks
Interactive Realtime Dashboards on Data Streams using Kafka, Druid and SupersetInteractive Realtime Dashboards on Data Streams using Kafka, Druid and Superset
Interactive Realtime Dashboards on Data Streams using Kafka, Druid and Superset
Hortonworks3.6K views
HBaseCon 2013: ETL for Apache HBase by Cloudera, Inc.
HBaseCon 2013: ETL for Apache HBaseHBaseCon 2013: ETL for Apache HBase
HBaseCon 2013: ETL for Apache HBase
Cloudera, Inc.6.9K views
Spark Summit EU talk by Zoltan Zvara by Spark Summit
Spark Summit EU talk by Zoltan ZvaraSpark Summit EU talk by Zoltan Zvara
Spark Summit EU talk by Zoltan Zvara
Spark Summit730 views
Taboola's experience with Apache Spark (presentation @ Reversim 2014) by tsliwowicz
Taboola's experience with Apache Spark (presentation @ Reversim 2014)Taboola's experience with Apache Spark (presentation @ Reversim 2014)
Taboola's experience with Apache Spark (presentation @ Reversim 2014)
tsliwowicz8.1K views
Time-Series Apache HBase by HBaseCon
Time-Series Apache HBaseTime-Series Apache HBase
Time-Series Apache HBase
HBaseCon5.6K views
Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б... by Fwdays
Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...
Евгений Бобров "Powered by OSS. Масштабируемая потоковая обработка и анализ б...
Fwdays501 views
Amazon aws big data demystified | Introduction to streaming and messaging flu... by Omid Vahdaty
Amazon aws big data demystified | Introduction to streaming and messaging flu...Amazon aws big data demystified | Introduction to streaming and messaging flu...
Amazon aws big data demystified | Introduction to streaming and messaging flu...
Omid Vahdaty467 views
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight by HBaseCon
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsightHBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
HBaseCon 2015: Optimizing HBase for the Cloud in Microsoft Azure HDInsight
HBaseCon3.8K views
CERN’s Next Generation Data Analysis Platform with Apache Spark with Enric Te... by Databricks
CERN’s Next Generation Data Analysis Platform with Apache Spark with Enric Te...CERN’s Next Generation Data Analysis Platform with Apache Spark with Enric Te...
CERN’s Next Generation Data Analysis Platform with Apache Spark with Enric Te...
Databricks1.8K views
Foundations of streaming SQL: stream & table theory by DataWorks Summit
Foundations of streaming SQL: stream & table theoryFoundations of streaming SQL: stream & table theory
Foundations of streaming SQL: stream & table theory
DataWorks Summit598 views
Apache HBase in the Enterprise Data Hub at Cerner by HBaseCon
Apache HBase in the Enterprise Data Hub at CernerApache HBase in the Enterprise Data Hub at Cerner
Apache HBase in the Enterprise Data Hub at Cerner
HBaseCon2.1K views
Building a Versatile Analytics Pipeline on Top of Apache Spark with Mikhail C... by Databricks
Building a Versatile Analytics Pipeline on Top of Apache Spark with Mikhail C...Building a Versatile Analytics Pipeline on Top of Apache Spark with Mikhail C...
Building a Versatile Analytics Pipeline on Top of Apache Spark with Mikhail C...
Databricks6K views
Денис Резник "Моя база данных не справляется с нагрузкой. Что делать?" by Fwdays
Денис Резник "Моя база данных не справляется с нагрузкой. Что делать?"Денис Резник "Моя база данных не справляется с нагрузкой. Что делать?"
Денис Резник "Моя база данных не справляется с нагрузкой. Что делать?"
Fwdays562 views

Viewers also liked

Degenerative diseases in aging patients by
Degenerative diseases in aging patientsDegenerative diseases in aging patients
Degenerative diseases in aging patientsGustavo Duarte Viana
10K views29 slides
HBaseCon 2015: Industrial Internet Case Study using HBase and TSDB by
HBaseCon 2015: Industrial Internet Case Study using HBase and TSDBHBaseCon 2015: Industrial Internet Case Study using HBase and TSDB
HBaseCon 2015: Industrial Internet Case Study using HBase and TSDBHBaseCon
5.6K views23 slides
A Survey of HBase Application Archetypes by
A Survey of HBase Application ArchetypesA Survey of HBase Application Archetypes
A Survey of HBase Application ArchetypesHBaseCon
20K views60 slides
IoT architecture by
IoT architectureIoT architecture
IoT architectureSumit Sharma
99.4K views45 slides
Internet of Things Overview by
Internet of Things OverviewInternet of Things Overview
Internet of Things OverviewRomin Irani
2K views9 slides
Android and the Web of Things: NFC, QR, BLE, Bluetooth, EPC, Arduino by
Android and the Web of Things: NFC, QR, BLE, Bluetooth, EPC, ArduinoAndroid and the Web of Things: NFC, QR, BLE, Bluetooth, EPC, Arduino
Android and the Web of Things: NFC, QR, BLE, Bluetooth, EPC, ArduinoDominique Guinard
17.4K views67 slides

Viewers also liked(20)

HBaseCon 2015: Industrial Internet Case Study using HBase and TSDB by HBaseCon
HBaseCon 2015: Industrial Internet Case Study using HBase and TSDBHBaseCon 2015: Industrial Internet Case Study using HBase and TSDB
HBaseCon 2015: Industrial Internet Case Study using HBase and TSDB
HBaseCon5.6K views
A Survey of HBase Application Archetypes by HBaseCon
A Survey of HBase Application ArchetypesA Survey of HBase Application Archetypes
A Survey of HBase Application Archetypes
HBaseCon20K views
IoT architecture by Sumit Sharma
IoT architectureIoT architecture
IoT architecture
Sumit Sharma99.4K views
Internet of Things Overview by Romin Irani
Internet of Things OverviewInternet of Things Overview
Internet of Things Overview
Romin Irani2K views
Android and the Web of Things: NFC, QR, BLE, Bluetooth, EPC, Arduino by Dominique Guinard
Android and the Web of Things: NFC, QR, BLE, Bluetooth, EPC, ArduinoAndroid and the Web of Things: NFC, QR, BLE, Bluetooth, EPC, Arduino
Android and the Web of Things: NFC, QR, BLE, Bluetooth, EPC, Arduino
Dominique Guinard17.4K views
2010 Jeep Compass Viva Chrysler Jeep Dodge El Paso TX by Viva Jeep
2010 Jeep Compass Viva Chrysler Jeep Dodge El Paso TX2010 Jeep Compass Viva Chrysler Jeep Dodge El Paso TX
2010 Jeep Compass Viva Chrysler Jeep Dodge El Paso TX
Viva Jeep743 views
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis by Amazon Web Services
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon KinesisDay 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Day 5 - Real-time Data Processing/Internet of Things (IoT) with Amazon Kinesis
Amazon Web Services10.1K views
Azure Stream Analytics by Davide Mauri
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream Analytics
Davide Mauri893 views
HBaseCon 2013: Project Valta - A Resource Management Layer over Apache HBase by Cloudera, Inc.
HBaseCon 2013: Project Valta - A Resource Management Layer over Apache HBaseHBaseCon 2013: Project Valta - A Resource Management Layer over Apache HBase
HBaseCon 2013: Project Valta - A Resource Management Layer over Apache HBase
Cloudera, Inc.3.2K views
HBaseCon 2013: Apache HBase, Meet Ops. Ops, Meet Apache HBase. by Cloudera, Inc.
HBaseCon 2013: Apache HBase, Meet Ops. Ops, Meet Apache HBase.HBaseCon 2013: Apache HBase, Meet Ops. Ops, Meet Apache HBase.
HBaseCon 2013: Apache HBase, Meet Ops. Ops, Meet Apache HBase.
Cloudera, Inc.7.1K views
HBaseCon 2013: Apache HBase on Flash by Cloudera, Inc.
HBaseCon 2013: Apache HBase on FlashHBaseCon 2013: Apache HBase on Flash
HBaseCon 2013: Apache HBase on Flash
Cloudera, Inc.4.3K views
HBaseCon 2012 | Living Data: Applying Adaptable Schemas to HBase - Aaron Kimb... by Cloudera, Inc.
HBaseCon 2012 | Living Data: Applying Adaptable Schemas to HBase - Aaron Kimb...HBaseCon 2012 | Living Data: Applying Adaptable Schemas to HBase - Aaron Kimb...
HBaseCon 2012 | Living Data: Applying Adaptable Schemas to HBase - Aaron Kimb...
Cloudera, Inc.3.2K views
HBase Read High Availability Using Timeline-Consistent Region Replicas by HBaseCon
HBase Read High Availability Using Timeline-Consistent Region ReplicasHBase Read High Availability Using Timeline-Consistent Region Replicas
HBase Read High Availability Using Timeline-Consistent Region Replicas
HBaseCon4.1K views
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBase by HBaseCon
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBaseHBaseCon 2015: Trafodion - Integrating Operational SQL into HBase
HBaseCon 2015: Trafodion - Integrating Operational SQL into HBase
HBaseCon3.3K views
HBaseCon 2012 | Leveraging HBase for the World’s Largest Curated Genomic Data... by Cloudera, Inc.
HBaseCon 2012 | Leveraging HBase for the World’s Largest Curated Genomic Data...HBaseCon 2012 | Leveraging HBase for the World’s Largest Curated Genomic Data...
HBaseCon 2012 | Leveraging HBase for the World’s Largest Curated Genomic Data...
Cloudera, Inc.3.5K views
HBaseCon 2013: Apache Hadoop and Apache HBase for Real-Time Video Analytics by Cloudera, Inc.
HBaseCon 2013: Apache Hadoop and Apache HBase for Real-Time Video Analytics HBaseCon 2013: Apache Hadoop and Apache HBase for Real-Time Video Analytics
HBaseCon 2013: Apache Hadoop and Apache HBase for Real-Time Video Analytics
Cloudera, Inc.4.8K views
HBaseCon 2012 | Building Mobile Infrastructure with HBase by Cloudera, Inc.
HBaseCon 2012 | Building Mobile Infrastructure with HBaseHBaseCon 2012 | Building Mobile Infrastructure with HBase
HBaseCon 2012 | Building Mobile Infrastructure with HBase
Cloudera, Inc.2.6K views
HBaseCon 2012 | Content Addressable Storages for Fun and Profit - Berk Demir,... by Cloudera, Inc.
HBaseCon 2012 | Content Addressable Storages for Fun and Profit - Berk Demir,...HBaseCon 2012 | Content Addressable Storages for Fun and Profit - Berk Demir,...
HBaseCon 2012 | Content Addressable Storages for Fun and Profit - Berk Demir,...
Cloudera, Inc.3.8K views

Similar to HBaseCon 2015: HBase as an IoT Stream Analytics Platform for Parkinson's Disease Research

Enable breakthrough in Parkinson disease research- Ido Karavany- by
Enable breakthrough in Parkinson disease research- Ido Karavany-Enable breakthrough in Parkinson disease research- Ido Karavany-
Enable breakthrough in Parkinson disease research- Ido Karavany-Spark Summit
1.9K views29 slides
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex by
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache ApexHadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache ApexApache Apex
4K views27 slides
Apache Big Data 2016: Next Gen Big Data Analytics with Apache Apex by
Apache Big Data 2016: Next Gen Big Data Analytics with Apache ApexApache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
Apache Big Data 2016: Next Gen Big Data Analytics with Apache ApexApache Apex
828 views33 slides
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ... by
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Dataconomy Media
1.2K views23 slides
Big Data Berlin v8.0 Stream Processing with Apache Apex by
Big Data Berlin v8.0 Stream Processing with Apache Apex Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex Apache Apex
1.1K views23 slides
Intro to Apache Apex - Next Gen Platform for Ingest and Transform by
Intro to Apache Apex - Next Gen Platform for Ingest and TransformIntro to Apache Apex - Next Gen Platform for Ingest and Transform
Intro to Apache Apex - Next Gen Platform for Ingest and TransformApache Apex
1.2K views30 slides

Similar to HBaseCon 2015: HBase as an IoT Stream Analytics Platform for Parkinson's Disease Research(20)

Enable breakthrough in Parkinson disease research- Ido Karavany- by Spark Summit
Enable breakthrough in Parkinson disease research- Ido Karavany-Enable breakthrough in Parkinson disease research- Ido Karavany-
Enable breakthrough in Parkinson disease research- Ido Karavany-
Spark Summit1.9K views
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex by Apache Apex
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache ApexHadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
Apache Apex4K views
Apache Big Data 2016: Next Gen Big Data Analytics with Apache Apex by Apache Apex
Apache Big Data 2016: Next Gen Big Data Analytics with Apache ApexApache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
Apache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
Apache Apex828 views
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ... by Dataconomy Media
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Dataconomy Media1.2K views
Big Data Berlin v8.0 Stream Processing with Apache Apex by Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex Big Data Berlin v8.0 Stream Processing with Apache Apex
Big Data Berlin v8.0 Stream Processing with Apache Apex
Apache Apex1.1K views
Intro to Apache Apex - Next Gen Platform for Ingest and Transform by Apache Apex
Intro to Apache Apex - Next Gen Platform for Ingest and TransformIntro to Apache Apex - Next Gen Platform for Ingest and Transform
Intro to Apache Apex - Next Gen Platform for Ingest and Transform
Apache Apex1.2K views
The challenges of Analytical Data Management in R&D by Laura Berry
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&D
Laura Berry256 views
Introduction to Apache Apex by Thomas Weise by Big Data Spain
Introduction to Apache Apex by Thomas WeiseIntroduction to Apache Apex by Thomas Weise
Introduction to Apache Apex by Thomas Weise
Big Data Spain662 views
Automating the process of continuously prioritising data, updating and deploy... by Ola Spjuth
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...
Ola Spjuth253 views
adaptive_ecg_cdr_edittedforpublic.pptx by ssuser6f1a8e1
adaptive_ecg_cdr_edittedforpublic.pptxadaptive_ecg_cdr_edittedforpublic.pptx
adaptive_ecg_cdr_edittedforpublic.pptx
ssuser6f1a8e15 views
Intro to Apache Apex (next gen Hadoop) & comparison to Spark Streaming by Apache Apex
Intro to Apache Apex (next gen Hadoop) & comparison to Spark StreamingIntro to Apache Apex (next gen Hadoop) & comparison to Spark Streaming
Intro to Apache Apex (next gen Hadoop) & comparison to Spark Streaming
Apache Apex2.2K views
Application and Challenges of Streaming Analytics and Machine Learning on Mu... by Databricks
 Application and Challenges of Streaming Analytics and Machine Learning on Mu... Application and Challenges of Streaming Analytics and Machine Learning on Mu...
Application and Challenges of Streaming Analytics and Machine Learning on Mu...
Databricks550 views
Large scale Click-streaming and tranaction log mining by itstuff
Large scale Click-streaming and tranaction log miningLarge scale Click-streaming and tranaction log mining
Large scale Click-streaming and tranaction log mining
itstuff505 views
IEEE.BigData.Tutorial.2.slides by Nish Parikh
IEEE.BigData.Tutorial.2.slidesIEEE.BigData.Tutorial.2.slides
IEEE.BigData.Tutorial.2.slides
Nish Parikh213 views
Apache Apex: Stream Processing Architecture and Applications by Thomas Weise
Apache Apex: Stream Processing Architecture and ApplicationsApache Apex: Stream Processing Architecture and Applications
Apache Apex: Stream Processing Architecture and Applications
Thomas Weise1.6K views
Apache Apex: Stream Processing Architecture and Applications by Comsysto Reply GmbH
Apache Apex: Stream Processing Architecture and Applications Apache Apex: Stream Processing Architecture and Applications
Apache Apex: Stream Processing Architecture and Applications
Inside Kafka Streams—Monitoring Comcast’s Outside Plant by confluent
Inside Kafka Streams—Monitoring Comcast’s Outside Plant Inside Kafka Streams—Monitoring Comcast’s Outside Plant
Inside Kafka Streams—Monitoring Comcast’s Outside Plant
confluent1.2K views

More from HBaseCon

hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes by
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kuberneteshbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on KubernetesHBaseCon
3.9K views36 slides
hbaseconasia2017: HBase on Beam by
hbaseconasia2017: HBase on Beamhbaseconasia2017: HBase on Beam
hbaseconasia2017: HBase on BeamHBaseCon
1.3K views26 slides
hbaseconasia2017: HBase Disaster Recovery Solution at Huawei by
hbaseconasia2017: HBase Disaster Recovery Solution at Huaweihbaseconasia2017: HBase Disaster Recovery Solution at Huawei
hbaseconasia2017: HBase Disaster Recovery Solution at HuaweiHBaseCon
1.4K views21 slides
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest by
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinteresthbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
hbaseconasia2017: Removable singularity: a story of HBase upgrade in PinterestHBaseCon
936 views42 slides
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程 by
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程HBaseCon
1.1K views21 slides
hbaseconasia2017: Apache HBase at Netease by
hbaseconasia2017: Apache HBase at Neteasehbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at NeteaseHBaseCon
754 views27 slides

More from HBaseCon(20)

hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes by HBaseCon
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kuberneteshbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
HBaseCon3.9K views
hbaseconasia2017: HBase on Beam by HBaseCon
hbaseconasia2017: HBase on Beamhbaseconasia2017: HBase on Beam
hbaseconasia2017: HBase on Beam
HBaseCon1.3K views
hbaseconasia2017: HBase Disaster Recovery Solution at Huawei by HBaseCon
hbaseconasia2017: HBase Disaster Recovery Solution at Huaweihbaseconasia2017: HBase Disaster Recovery Solution at Huawei
hbaseconasia2017: HBase Disaster Recovery Solution at Huawei
HBaseCon1.4K views
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest by HBaseCon
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinteresthbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
HBaseCon936 views
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程 by HBaseCon
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
HBaseCon1.1K views
hbaseconasia2017: Apache HBase at Netease by HBaseCon
hbaseconasia2017: Apache HBase at Neteasehbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at Netease
HBaseCon754 views
hbaseconasia2017: HBase在Hulu的使用和实践 by HBaseCon
hbaseconasia2017: HBase在Hulu的使用和实践hbaseconasia2017: HBase在Hulu的使用和实践
hbaseconasia2017: HBase在Hulu的使用和实践
HBaseCon878 views
hbaseconasia2017: 基于HBase的企业级大数据平台 by HBaseCon
hbaseconasia2017: 基于HBase的企业级大数据平台hbaseconasia2017: 基于HBase的企业级大数据平台
hbaseconasia2017: 基于HBase的企业级大数据平台
HBaseCon701 views
hbaseconasia2017: HBase at JD.com by HBaseCon
hbaseconasia2017: HBase at JD.comhbaseconasia2017: HBase at JD.com
hbaseconasia2017: HBase at JD.com
HBaseCon828 views
hbaseconasia2017: Large scale data near-line loading method and architecture by HBaseCon
hbaseconasia2017: Large scale data near-line loading method and architecturehbaseconasia2017: Large scale data near-line loading method and architecture
hbaseconasia2017: Large scale data near-line loading method and architecture
HBaseCon598 views
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei by HBaseCon
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huaweihbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
HBaseCon683 views
hbaseconasia2017: HBase Practice At XiaoMi by HBaseCon
hbaseconasia2017: HBase Practice At XiaoMihbaseconasia2017: HBase Practice At XiaoMi
hbaseconasia2017: HBase Practice At XiaoMi
HBaseCon1.8K views
hbaseconasia2017: hbase-2.0.0 by HBaseCon
hbaseconasia2017: hbase-2.0.0hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0
HBaseCon1.8K views
HBaseCon2017 Democratizing HBase by HBaseCon
HBaseCon2017 Democratizing HBaseHBaseCon2017 Democratizing HBase
HBaseCon2017 Democratizing HBase
HBaseCon897 views
HBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest by HBaseCon
HBaseCon2017 Removable singularity: a story of HBase upgrade in PinterestHBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
HBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
HBaseCon646 views
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBase by HBaseCon
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBaseHBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon608 views
HBaseCon2017 Transactions in HBase by HBaseCon
HBaseCon2017 Transactions in HBaseHBaseCon2017 Transactions in HBase
HBaseCon2017 Transactions in HBase
HBaseCon1.8K views
HBaseCon2017 Highly-Available HBase by HBaseCon
HBaseCon2017 Highly-Available HBaseHBaseCon2017 Highly-Available HBase
HBaseCon2017 Highly-Available HBase
HBaseCon1.1K views
HBaseCon2017 Apache HBase at Didi by HBaseCon
HBaseCon2017 Apache HBase at DidiHBaseCon2017 Apache HBase at Didi
HBaseCon2017 Apache HBase at Didi
HBaseCon996 views
HBaseCon2017 gohbase: Pure Go HBase Client by HBaseCon
HBaseCon2017 gohbase: Pure Go HBase ClientHBaseCon2017 gohbase: Pure Go HBase Client
HBaseCon2017 gohbase: Pure Go HBase Client
HBaseCon1.7K views

Recently uploaded

Playwright Retries by
Playwright RetriesPlaywright Retries
Playwright Retriesartembondar5
5 views1 slide
Navigating container technology for enhanced security by Niklas Saari by
Navigating container technology for enhanced security by Niklas SaariNavigating container technology for enhanced security by Niklas Saari
Navigating container technology for enhanced security by Niklas SaariMetosin Oy
14 views34 slides
Keep by
KeepKeep
KeepGeniusee
78 views10 slides
FIMA 2023 Neo4j & FS - Entity Resolution.pptx by
FIMA 2023 Neo4j & FS - Entity Resolution.pptxFIMA 2023 Neo4j & FS - Entity Resolution.pptx
FIMA 2023 Neo4j & FS - Entity Resolution.pptxNeo4j
17 views26 slides
tecnologia18.docx by
tecnologia18.docxtecnologia18.docx
tecnologia18.docxnosi6702
5 views5 slides
AI and Ml presentation .pptx by
AI and Ml presentation .pptxAI and Ml presentation .pptx
AI and Ml presentation .pptxFayazAli87
14 views15 slides

Recently uploaded(20)

Navigating container technology for enhanced security by Niklas Saari by Metosin Oy
Navigating container technology for enhanced security by Niklas SaariNavigating container technology for enhanced security by Niklas Saari
Navigating container technology for enhanced security by Niklas Saari
Metosin Oy14 views
FIMA 2023 Neo4j & FS - Entity Resolution.pptx by Neo4j
FIMA 2023 Neo4j & FS - Entity Resolution.pptxFIMA 2023 Neo4j & FS - Entity Resolution.pptx
FIMA 2023 Neo4j & FS - Entity Resolution.pptx
Neo4j17 views
tecnologia18.docx by nosi6702
tecnologia18.docxtecnologia18.docx
tecnologia18.docx
nosi67025 views
AI and Ml presentation .pptx by FayazAli87
AI and Ml presentation .pptxAI and Ml presentation .pptx
AI and Ml presentation .pptx
FayazAli8714 views
Generic or specific? Making sensible software design decisions by Bert Jan Schrijver
Generic or specific? Making sensible software design decisionsGeneric or specific? Making sensible software design decisions
Generic or specific? Making sensible software design decisions
Top-5-production-devconMunich-2023.pptx by Tier1 app
Top-5-production-devconMunich-2023.pptxTop-5-production-devconMunich-2023.pptx
Top-5-production-devconMunich-2023.pptx
Tier1 app9 views
360 graden fabriek by info33492
360 graden fabriek360 graden fabriek
360 graden fabriek
info33492162 views
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P... by NimaTorabi2
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...
NimaTorabi216 views
Dev-HRE-Ops - Addressing the _Last Mile DevOps Challenge_ in Highly Regulated... by TomHalpin9
Dev-HRE-Ops - Addressing the _Last Mile DevOps Challenge_ in Highly Regulated...Dev-HRE-Ops - Addressing the _Last Mile DevOps Challenge_ in Highly Regulated...
Dev-HRE-Ops - Addressing the _Last Mile DevOps Challenge_ in Highly Regulated...
TomHalpin96 views
Gen Apps on Google Cloud PaLM2 and Codey APIs in Action by Márton Kodok
Gen Apps on Google Cloud PaLM2 and Codey APIs in ActionGen Apps on Google Cloud PaLM2 and Codey APIs in Action
Gen Apps on Google Cloud PaLM2 and Codey APIs in Action
Márton Kodok16 views
Quality Engineer: A Day in the Life by John Valentino
Quality Engineer: A Day in the LifeQuality Engineer: A Day in the Life
Quality Engineer: A Day in the Life
John Valentino7 views

HBaseCon 2015: HBase as an IoT Stream Analytics Platform for Parkinson's Disease Research

  • 1. Enable breakthroughs in Parkinson disease research through wearables and Big Data analytics technologies
  • 2. About us… • Part of the Big Data Analytics Solutions group @Intel reporting to the Data Center Group • Developing products & solutions leveraging: • Big Data edge-technologies • Self developed machine learning & steam analytics algorithms • Our team includes developers, data scientists and system analysts • I am a Big Data Analytics Architect and Development Manager responsible for leading-edge technology projects within Intel involving Big Data and stream analytics solutions in the Internet of Things and Parkinson's disease research 2
  • 3. How It All Started? 3 Big data analytics IOT
  • 4. Parkinson’s Disease 4 OVER AGE 0F 60 1/100 60,000 NEW 1M/US 5M/WORLD NO CURE, MEDICATION ONLY HELPS WITH SYMPTOMSThere is NO TEST and no PROGRESSION MARKER PARKINSON’S DISEASE IS CAUSED BY THE DEATH OF DOPAMINE CELLS. OF THESE CELLS ARE ALREADY LOST BY THE TIME MOTOR SYMPTOMS APPEAR. 60 80%TO
  • 5. Challenges To Address NO OBJECTIVE MEASURE 3-6 MONTHS BETWEEN PHYSICIAN VISITS CHANGES ARE SLOW AND HARD TO DETECT AVERAGE TRIAL SIZE < 100 PATIENTS VERY SMALL number of patients contribute to research COST OF TRIALS are in the scales of $M 5
  • 7. The Solution Wear a watch Start an application 7 1 2
  • 8. Use Cases MANAGE THE DISEASE USING DATA FREE DATA FOR 1000’S OF PATIENTS ACCURATE REPORT SINCE LAST VISIT MEASURE MEDICATION EFFECT RESEARCHER PHARMACEUTICAL CLINICIAN INTEL BIG DATA CLOUDANALYSTICS INSIGHT / VALUE 8
  • 13. IoT Cloud Simplified Framework DatacenterNetworkThing Cloud Infrastructure Data Platform Analytics Platform UI Services Gateway 13
  • 14. SERVICE LAYER BATCH LAYER STREAM ANALYICS LAYER INGESTION LAYER STORAGE LAYER USER INTERFACE LAYER Mosquitt o 14 CLOUD COMPUTING SERVICES
  • 15. Storage Layer • Cloudera Enterprise Data Hub • HBase as main scalable time series data storage layer • Allows high writes throughput • Random real-time access to stored data • Highly available MySQL as metadata storage 15
  • 16. • Multi-protocol pipeline built over AKKA & KAFKA • KAKFA is a fast, scalable, durable & distributed messaging system • AKKA is an Actor based framework allowing high concurrency, distributed and resilient based on events / messaging • This layer is responsible for: • Pulling messages • Parse & Process • Concurrent & controlled write HBase Load Balancer Device Device Device Device Mosquito Mosquito 16 Data Ingestion Layer
  • 17. • Based on AKKA actors framework • Contains millions of concurrent actors handling different streams and operations • Each actor is a small peace of code performing its role • A set of actors creates a topology which is responsible for device’s data stream processing 17 Stream Analytics Layer Subscriber Parser Aggregator HBase Writer Analytics Manager Change Detection UnZip Real Time Rules Sleep Quality
  • 18. • Based on Apache Spark over HBase • Spark is a fast and general engine for large-scale data processing • Algorithms & Calculations are being executed on large data sets on a daily basis • Layer includes: • Set of complex machine learning algorithms • Rule engine rules baseline calculations 18 Batch Analytics Layer
  • 19. • Interactive and scalable web services layer • A set of RESTful APIs allowing: • Registration to platform • Row & calculated data retrieval from HBase • Built on top of Play framework and providing secured entry point • Uses Apache Phoenix & native HBase client HBase Load Balancer 19 Service Layer
  • 21. Data Ingestion to HBase • Challenge Concurrent ingesting millions messages into HBase creates a massive load on HBase region servers and causes disconnections • Development Evolution 1. HBase client per topology (millions of writers) 2. Pool of HBase clients, each is using separate HTable 3. Pool of HBase clients, all are using same HBase connection pool (HConnectionManager) • Solution Creating “fixed” number of connections to HBase, allowing batch writes and load balancing 21 . . . Hbase Writer Hbase Writer Hbase Writer . .. Pool Router
  • 22. Table Indicators over Large Tables • Challenge Gathering indicators (i.e. counts) on large HBase tables results in long table scans and performance reduction • Solution • Real time update new indicator columns using incrementColumnValue • Allows atomic increment of a specific column • Large table counts successfully implemented • Allowed implementation of required Indicators • Real time hourly counts • Real time Max values (i.e. last time a user transmitted data) 22
  • 23. Batch Processing Input Format • Challenge • Batch processing is done using Spark – InputFormat is required for scan • TableInputFormat was used and is equivalent to a single scan • Poor performance when data from “remote” parts of a table are required • Solution • Using MultiTableInputFormat • Allows usage of multiple scans • Successfully used with more than 100 scans per MultiTableInputFormat 23
  • 25. Activity Level • Measure that will continuously describe the intensity of the patient’s activity throughout the day and will motivate the patients to be more active. • Motivates the patients to be more active (known to be important for PD patients) • Personalized measure per patient based on their average activity of walking periods (avoid frustration) • Based on intensity measurement from the accelerometer • Filters our tremor as 25
  • 26. Activity Level – An Example 26 Activity Level in Controlled Session (ON State) Activity Level in Controlled Session (OFF State)
  • 27. Tremor • Tremor is one of the most obvious symptoms of PD • Most PD patients experience tremor • Tremor is detectable using signal processing techniques 27
  • 29. REAL PD L-DOPA RESPONSE TRIAL DATA GATHERING TRIAL FOX INTEL APPLICATION TRIAL 1000 50 30 20 FOX INSIGHT WEAR 1000 20 20 30 20 10 29 Trial And Partners SCRIPS TREMMOR TRIAL 1000
  • 31. SCALE PLATFORM • Scale to 1000’s of patients in the US • Scale to 1000’s of patients in the Netherlands • IOS support • Support additional wearable's • Build more value generating capabilities • Upgrade to HBase 1.0 • Upgrade Spark to 1.3 • Enrich Platform (i.e. Advanced Export, Reporting) • Enrich Parkinson Disease solution • Analytics • Value to patients 31
  • 32. Q&A
  • 34. • Strategic direction 34 R: 0 G: 112 B: 197 R: 247 G: 127 B: 0 R: 130 G: 170 B: 50 R: 0 G: 172 B: 240

Editor's Notes

  1. On Wednesday, August 13, Intel and the Michael J. Fox Foundation (MJFF) announced a collaboration to improve research and treatment of Parkinson’s disease using wearable computing and big data analytics. The collaboration includes a multi-stage study using wearable devices to collect patient data and an Intel-built big data analytics platform to analyze the vast pools of data with the goal of developing objective measures for Parkinson’s disease progression, treatment response and drug development. Key Messages: Intel and the Michael J. Fox Foundation are joining forces to use wearable computing and big data analytics to help improve Parkinson’s disease research and treatments. The collaboration includes a multi-stage study to validate the use of wearable devices and big data analytics to track disease symptoms and develop objective measures for disease progression. The big data analytics platform combines Intel’s hardware and software technologies to provide a powerful cloud-based platform to collect, transform, store, and visualize data from sensors.
  2. The story The name of the man in the picture on the left is Andy Grove and he is one of Intel’s founders and has Parkinson (PD) The story begins when he reads an article in the NY times about Big Data and decides to start a project within Intel related to PD and Big Data He contacts Michael J fox foundation and they make a decision to start a joint effort together The idea is to elaborate Internet of things, wearable's technology and big data platforms to assist PD research
  3. Disclaimer: I’m not a neurologist, and do not intend to provide an extensive introduction of the disease Parkinson Disease (PD) Parkinson’s disease is a degenerative disorder of the central nervous system that is characterized by serious motoric disabilities, such as shaking, rigidity and slowness of movement It is also characterized also by complicated non-motoric implications, like low sleeping quality, depression and tendency to compulsive behavior There are ~6M Parkinson patients out of them about 1M in the US and about 5M in the rest of the globe 1 out 100 people over the age of 60 in the US is a Parkinson patient and in the US only, ~60 thousand new patients are diagnosed every year Parkinson patients life expectancy is usually between 10 to 15 years There is on cure for the disease and existing medications are mainly for improving patients quality of life by helping with symptoms The disease progresses slowly, meaning that changes in patients condition and disease progress can be observed only over the course of months or years, making management and research of disease difficult Parkinson test & progression are subjectively assessed by physicians being and there is no standard test or progression marker Parkinson’s is a complex disease with symptoms and treatment responses that vary widely. The disease progresses slowly, meaning that changes in clinical and molecular features can be observed only over the course of months or years, making management of and research into Parkinson’s disease difficult. Today, the diagnosis of Parkinson’s, assessment of disease progression, and clinical trials for treatments and medications have largely relied on periodic clinical assessments by a physician and on patient reports. The advent of wearable computing and big data analytics could dramatically enhance our understanding of Parkinson’s disease by enabling scientists and physicians to gather vital data continuously and unobtrusively, without putting a burden on patients, and in significantly larger populations than in traditional clinical trials.
  4. So… Why do we really need this solution? I’ll try to describe part of the challenges we’re addressing One of the main challenges is the lack of objective measures, both for patients and physicians Today, patients are monitored only during occasional clinical visits – usually every 3-6 months In those visits, mainly due to stress, patients behave differently and their daily collected reports are subjective to their opinion In addition and as I mentioned in previous slide – the disease progress is also subjective and highly depended on physicians observation during patient visits Additional challenge is related to clinical trials Today, only small amount of data is available to the research community Collecting meaningful amounts of good and reliable data is not trivial Only small amounts of data is available to the research community: One of the main reasons is the fact that the cost of clinical trials is in the scales of millions of dollars, takes extensive time and effort to arrange and complete Today, there are very small number of patients contributing to research resulting in small trial sizes – actually, average trial size is less than 100 patients In addition, collecting data in fine granularity & good quality cannot scale due to trial’s technology limitations Lack Need to handcraft medication regime Levodopa’s positive effect progressively declines, and some patients suffers from dyskinesia No biomarkers (diagnosis is hard: PD ≠ Parkinsonism)
  5. Actually, the theoretical solution very simple, we just ask patients to do 2 very simple steps… Wear a watch & start a cell phone application If we want patients to follow those two simple steps we must make sure that the value that our solution is providing is greater than the burden on patients That is our mission definition In our solution we’ll address main challenges I reviewed in previous slides by: Continuous collecting of movement data and objective measurement 24 hours a day, 7 days a week and 365 days a year And providing value to patients by providing them real time insights on their disease and condition (in the form of activity level, tremor detection and sleep quality indicators)
  6. So far, so good… we have provided value to patients and collected objective measurement – but what’s next? Who will use this? What are the main use cases? After collection of the data, advanced analytics algorithms are applied on it (I’ll elaborate on those later on) and it is being saved into Intel Big Data Cloud platform Using this data we’re providing: Researchers access to free reliable data of thousands of patients of patients (LDopa) Clinicians get accurate repot for their patients condition since their last visit (RealPD) And pharmaceutical companies the capability to measure their medication effectiveness during its test phase ()
  7. Either use real demo or show the next 2 slides
  8. As mentioned before The application is based on sensorial data
  9. The solution is based on a self developed generic Internet Of Things platform Platform allows “Things” which can practically be any type of devices with some kind of internet connectivity (can be direct or using gateways) sending data to the platform Data transmission can be done using different protocols and can be transformed during transition or after landing in cloud The Entire code stack for this platform is based on open source with Hadoop eco system at its core The platform is cloud based and offers application developers tools to develop their own application on top of it. The key tools are: Data storage which is based on top of Hadoop and HBase Analytics platform – allows both batch and stream analytics developments Built in analytics features such as near real time rule engine and change detection engines And data extraction tools such as an export service
  10. Parkinson disease solution is was developed on top of the generic IoT platform I described a moment ago I’ll quickly review the different layers and will dive into few of those later on Computing services are Batch Layer based on Spark Storage layer using Hadoop, HBase & MySQL for Metadata Powerful, scalable ingestion layer based on Akka & Kafka A dynamic stream analytics layer based on Akka actor system framework Scalable Service layer providing set of APIs for registration & data extraction out of the platform UI layer – the only layer in this diagram which is unique to PD solution – using Pebble watch and Android application to collect data and interact with patients You can note that 5 out of the presented 6 layers (excluding the UI layer) are part of the IoT platform and can be used for similar products / verticals
  11. Need to redo the whole slide
  12. Need to redo the whole slide
  13. Need to redo the whole slide
  14. This is the activity level of the same subject in his two successive visits in the clinic. In the on state and in the off state (the two sessions were recorded in different days). Although the patient repeated the same (or at least highly similar) protocol in the two visits, we can see that his activity level while in OFF is around half than the activity level while ON. We checked, and this result is also seen while comparing particular activities.
  15. Need to redo the whole slide