SlideShare a Scribd company logo
1 of 14
BIG DATA IN HEALTHCARE
MSc. Information Systems
2014 / 2015
Presented by:
Younes Hamdaoui (Std_ID 21266804)
Supervised by:
Dr Samia Oussena
Dr Wei Jie
INTRODUCTION
 Clinics and health organisations have more reasons to become data-driven.
 Patients are expecting quality and safety in treatments, fast clinical results
and doctors need electronic systems and analytical tools to make precise and
rapid judgements.
 The size of data generated in healthcare sector is rapidly growing.
“U.S. health care data alone reached 150 exabytes in 2011”. (Institute for
Health Technology Transformation. Transforming Health Care Through Big Data. (2013))
PLAN
INTRODUCTION
WHY BIG DATA IN HEALTHCARE (HC) ?
• HISTORY OF DATA USAGE IN HC
• BIG DATA IN HC TODAY
BIG DATA APPLICATION IN HEALTH SECTOR
• COLLECTING DATA
• PROCESSING AND ANALYSING DATA
FUTURE OF BIG DATA IN HC
CONCLUSION
WHY BIG DATA IN HEALTHCARE ?
HISTORY OF DATA USAGE IN HC (1)
 Paper based forms are still used in many developing countries (Knocking on doors
to collect information). The collected data needs to be moved to a computerized
system;
 The process of computerizing data affects the quality of data and takes time;
 Most of the decisions in global health are based on old data;
 HC sector lacked important data that could help improve this area, questions like :
How many people are affected by diseases and disasters ? How many children were
born or died last week? And under what circumstances ? Were impossible to answer
rapidly.
 Today health organisations use more developed tools to deal with the huge amount
of data created daily, to track their data and use it for analysis.
• Microsoft Excel monitoring spreadsheets
• Electronic Health Record (EHR)
• Relational Database Management Systems (RDBMS)
• Business Intelligence (BI) tools (SAP Business Objects, SQL Server
Integration/Analysis/Reporting Services, IBM Cognos)
 Complex, slow, very expensive and no real-time analysis.
HISTORY OF DATA USAGE IN HC (2)
“80% of the development effort in a traditional big data project goes into
data integration and only 20% percent goes toward data analysis.”(Big Data
Analytics. Extract, Transform, and Load Big Data with Apache Hadoop.
(n.d.) INTEL White Paper.)
BIG DATA IN HEALTHCARE TODAY
Real cases :
 Propeller Health: Uses data from sensors for asthma inhalers and
from mobile applications to help determine patients with asthma risks
before an attack arises. Collects weather and air quality information
to classify risk-level and risk factors by area.
 NextBio: Uses personal patients information, molecular and genomic
data to help making personalised medical decisions.
(http://profitable-practice.softwareadvice.com/what-is-big-data-in-healthcare-
0813/)
Other Examples:
 Comparative Effectiveness research: analysing clinical and financial efficiency of
interventions to enhance clinical care services in terms of quality and
performance.
 Clinical Operation Intelligence: find misuse in clinical operations in order to
improve them.
 Public Health Analysis: analysing health data sets of populations to determine the
overall effectiveness of medications.
(http://big-project.eu/blog/potential-big-data-applications-healthcare-sector)
THE ETL PROCESS
 The extract, transform and load process is a crucial component for populating
data systems.
 An ETL process recaptures data from various systems, refine and prepare it
for future investigation by using analytic and reporting tools.
ETL, Songini, M.L. (2004)
BIG DATA APPLICATION IN HEALTH SECTOR
COLLECTING DATA
% sqoop import
--connect jdbc:mysql://localhost/UWLHealth 
--table BloodPressure -m 1
Importing a
single Table
Source Channel Sink
Specifying the path
to the locations of
log files
Holding area where
events flows are defined
through interceptors and
channel selectors
-Process events only
through the channel
-Writes data to HDFS
or HBase
Agent
e
v
e
n
t
s
Processed
Logs
Hadoop Image Processing Interface
PROCESSING AND ANALYSING DATA
Refining the
extracted
data using
Hive Query
Language
CREATE TABLE UWL_BloodPressure as select *,
normalerate – stdrate as rate_diff,
IF((normalerate - stdrate) > 20, ‘LOW',
IF((normalerate – stdrate) < -20, ‘HIGH', 'NORMAL')) AS
BPressure,
IF((normalerate - stdrate) > 20, ‘NOTOK',
IF((normalerate - stdrate) < -20, ‘NOTOK', ‘OK’)) AS
BPressure_variation
from UWL_Health;
“The Apache Hive data warehouse software facilitates querying and managing large
datasets residing in distributed storage. Hive provides a mechanism to project
structure onto this data and query the data using a SQL-like language called
HiveQL.” (https://hive.apache.org/)
ODBC DRIVER
Visualization of the
UWL_BloodPressure
table
Analysis
FUTURE OF BIG DATA IN HC
SENSORS
Real-time
visualization
HEALTH
DATA
Real-time Interaction
with patients
Real-time tracking of health
Real-time detection of anomalies
Real-time recommendations
Automatic urgency detection
Doctors will
diagnose patients
the minute they
step in hospitals
Real-time
prescriptions,
advices and
response to
emergencies
Disease surveillance
Conclusion
References
 Transforming Health Care Through Big Data. (n.d.) .Institute for Health Technology
Transformation. Available at: http://ihealthtran.com/big-data-in-healthcare [Accessed 5
Mar. 2015].
 Big Data Analytics. Extract, Transform, and Load Big Data with Apache Hadoop. (n.d.)
INTEL White Paper.
 Profitable-practice.softwareadvice.com. What is “Big Data” in Healthcare, and Who’s
Already Doing It?. Available at: http://profitable-practice.softwareadvice.com/what-is-
big-data-in-healthcare-0813/ [Accessed 11 Mar. 2015].
 Big-project.eu. The Potential of Big Data Applications for the Healthcare Sector | BIG -
Big Data Public Private Forum. Available at: http://big-project.eu/blog/potential-big-
data-applications-healthcare-sector [Accessed 11 Mar. 2015].
 Songini, M.L. 2004, "ETL", Computerworld, [Online], vol. 38, no. 5, pp. 23.
 Reach1to1 Technologies, (2015). Hive, Pig and Sqoop. [online] Available at:
http://reach1to1.com/technology/hive-pig-sqoop/ [Accessed 11 Mar. 2015].
 Hortonworks, (2015). Apache Flume. [online] Available at:
http://hortonworks.com/hadoop/flume/ [Accessed 11 Mar. 2015].
 Wikipedia, (2015). Apache Hive. [online] Available at:
http://en.wikipedia.org/wiki/Apache_Hive [Accessed 11 Mar. 2015].
 Wikipedia, (2015). Pig (programming tool). [online] Available at:
http://en.wikipedia.org/wiki/Pig_%28programming_tool%29 [Accessed 11 Mar. 2015].
Big-Data in HealthCare _ Overview

More Related Content

What's hot

Artificial intelligence (a.i) copy (1)
Artificial intelligence (a.i) copy (1)Artificial intelligence (a.i) copy (1)
Artificial intelligence (a.i) copy (1)Sharda University
 
Data Mining in Healthcare: How Health Systems Can Improve Quality and Reduce...
Data Mining in Healthcare:  How Health Systems Can Improve Quality and Reduce...Data Mining in Healthcare:  How Health Systems Can Improve Quality and Reduce...
Data Mining in Healthcare: How Health Systems Can Improve Quality and Reduce...Health Catalyst
 
Predictive Analytics and Machine Learning for Healthcare - Diabetes
Predictive Analytics and Machine Learning for Healthcare - DiabetesPredictive Analytics and Machine Learning for Healthcare - Diabetes
Predictive Analytics and Machine Learning for Healthcare - DiabetesDr Purnendu Sekhar Das
 
Deploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in HealthcareDeploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in HealthcareHealth Catalyst
 
AI in Healthcare | Future of Smart Hospitals
AI in Healthcare | Future of Smart Hospitals AI in Healthcare | Future of Smart Hospitals
AI in Healthcare | Future of Smart Hospitals Renee Yao
 
Big Data applications in Health Care
Big Data applications in Health CareBig Data applications in Health Care
Big Data applications in Health CareLeo Barella
 
Analytics in healthcare
Analytics in healthcareAnalytics in healthcare
Analytics in healthcareAnushkaAlok
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Yaman Hajja, Ph.D.
 
Big Data Solutions for Healthcare
Big Data Solutions for HealthcareBig Data Solutions for Healthcare
Big Data Solutions for HealthcareOdinot Stanislas
 
Ai in healthcare
Ai in healthcareAi in healthcare
Ai in healthcaremuskannn
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesJosef Scheiber
 
Data Visualization in Health
Data Visualization in HealthData Visualization in Health
Data Visualization in HealthRamon Martinez
 
Big Data Analytics for Smart Health Care
Big Data Analytics for Smart Health CareBig Data Analytics for Smart Health Care
Big Data Analytics for Smart Health CareEshan Bhuiyan
 
Artificial Intelligence in Health Care
Artificial Intelligence in Health Care Artificial Intelligence in Health Care
Artificial Intelligence in Health Care 247 Labs Inc
 
AI in Healthcare 2017
AI in Healthcare 2017AI in Healthcare 2017
AI in Healthcare 2017Peter Morgan
 
A brief history of "big data"
A brief history of "big data"A brief history of "big data"
A brief history of "big data"Nicola Ferraro
 
BIG Data & Hadoop Applications in Healthcare
BIG Data & Hadoop Applications in HealthcareBIG Data & Hadoop Applications in Healthcare
BIG Data & Hadoop Applications in HealthcareSkillspeed
 
Big data analytics in healthcare
Big data analytics in healthcareBig data analytics in healthcare
Big data analytics in healthcareJoseph Thottungal
 

What's hot (20)

Artificial intelligence (a.i) copy (1)
Artificial intelligence (a.i) copy (1)Artificial intelligence (a.i) copy (1)
Artificial intelligence (a.i) copy (1)
 
Data Mining in Healthcare: How Health Systems Can Improve Quality and Reduce...
Data Mining in Healthcare:  How Health Systems Can Improve Quality and Reduce...Data Mining in Healthcare:  How Health Systems Can Improve Quality and Reduce...
Data Mining in Healthcare: How Health Systems Can Improve Quality and Reduce...
 
Predictive Analytics and Machine Learning for Healthcare - Diabetes
Predictive Analytics and Machine Learning for Healthcare - DiabetesPredictive Analytics and Machine Learning for Healthcare - Diabetes
Predictive Analytics and Machine Learning for Healthcare - Diabetes
 
Deploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in HealthcareDeploying Predictive Analytics in Healthcare
Deploying Predictive Analytics in Healthcare
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
 
AI in Healthcare | Future of Smart Hospitals
AI in Healthcare | Future of Smart Hospitals AI in Healthcare | Future of Smart Hospitals
AI in Healthcare | Future of Smart Hospitals
 
Big Data applications in Health Care
Big Data applications in Health CareBig Data applications in Health Care
Big Data applications in Health Care
 
ARTIFICIAL INTELLIGENCE ROLE IN HEALTH CARE Dr.T.V.Rao MD
ARTIFICIAL INTELLIGENCE ROLE IN HEALTH CARE  Dr.T.V.Rao MDARTIFICIAL INTELLIGENCE ROLE IN HEALTH CARE  Dr.T.V.Rao MD
ARTIFICIAL INTELLIGENCE ROLE IN HEALTH CARE Dr.T.V.Rao MD
 
Analytics in healthcare
Analytics in healthcareAnalytics in healthcare
Analytics in healthcare
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)
 
Big Data Solutions for Healthcare
Big Data Solutions for HealthcareBig Data Solutions for Healthcare
Big Data Solutions for Healthcare
 
Ai in healthcare
Ai in healthcareAi in healthcare
Ai in healthcare
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use Cases
 
Data Visualization in Health
Data Visualization in HealthData Visualization in Health
Data Visualization in Health
 
Big Data Analytics for Smart Health Care
Big Data Analytics for Smart Health CareBig Data Analytics for Smart Health Care
Big Data Analytics for Smart Health Care
 
Artificial Intelligence in Health Care
Artificial Intelligence in Health Care Artificial Intelligence in Health Care
Artificial Intelligence in Health Care
 
AI in Healthcare 2017
AI in Healthcare 2017AI in Healthcare 2017
AI in Healthcare 2017
 
A brief history of "big data"
A brief history of "big data"A brief history of "big data"
A brief history of "big data"
 
BIG Data & Hadoop Applications in Healthcare
BIG Data & Hadoop Applications in HealthcareBIG Data & Hadoop Applications in Healthcare
BIG Data & Hadoop Applications in Healthcare
 
Big data analytics in healthcare
Big data analytics in healthcareBig data analytics in healthcare
Big data analytics in healthcare
 

Similar to Big-Data in HealthCare _ Overview

76 s201915
76 s20191576 s201915
76 s201915IJRAT
 
Shanghai health bureau_big_data_healthcare
Shanghai health bureau_big_data_healthcareShanghai health bureau_big_data_healthcare
Shanghai health bureau_big_data_healthcarexband
 
Big data in IoT for healthcare - www.pepgra.com
Big data in IoT for healthcare - www.pepgra.comBig data in IoT for healthcare - www.pepgra.com
Big data in IoT for healthcare - www.pepgra.comPEPGRA Healthcare
 
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelMoving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelDataWorks Summit
 
Technological Forecasting & Social Change 126 (2018) 3–13C.docx
Technological Forecasting & Social Change 126 (2018) 3–13C.docxTechnological Forecasting & Social Change 126 (2018) 3–13C.docx
Technological Forecasting & Social Change 126 (2018) 3–13C.docxbradburgess22840
 
2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision MedicineMichael Atkins
 
IRJET- Predictive Analysis and Healthcare of Diabetes
IRJET- Predictive Analysis and Healthcare of DiabetesIRJET- Predictive Analysis and Healthcare of Diabetes
IRJET- Predictive Analysis and Healthcare of DiabetesIRJET Journal
 
Ehr challenges [bigdata]
Ehr challenges [bigdata]Ehr challenges [bigdata]
Ehr challenges [bigdata]Nesma Almoazamy
 
Infosession for IQED dataproviders (14-22.04.2016)
Infosession for IQED dataproviders (14-22.04.2016)Infosession for IQED dataproviders (14-22.04.2016)
Infosession for IQED dataproviders (14-22.04.2016)healthdata be
 
Presentation (2015.11.16) of healthdata.be project @ Abrumet
Presentation (2015.11.16) of healthdata.be project @ AbrumetPresentation (2015.11.16) of healthdata.be project @ Abrumet
Presentation (2015.11.16) of healthdata.be project @ Abrumethealthdata be
 
Presentation project healthdata.be for hospitals in Limburg (dd. 2015.11.30.)...
Presentation project healthdata.be for hospitals in Limburg (dd. 2015.11.30.)...Presentation project healthdata.be for hospitals in Limburg (dd. 2015.11.30.)...
Presentation project healthdata.be for hospitals in Limburg (dd. 2015.11.30.)...healthdata be
 
Data-driven Healthcare
Data-driven HealthcareData-driven Healthcare
Data-driven HealthcareLindaWatson19
 
Private Hidden Data for Health Care
Private Hidden Data for Health CarePrivate Hidden Data for Health Care
Private Hidden Data for Health CareIRJET Journal
 
Rapid Response Analytics Solution Accelerates Analytics ROI
Rapid Response Analytics Solution Accelerates Analytics ROIRapid Response Analytics Solution Accelerates Analytics ROI
Rapid Response Analytics Solution Accelerates Analytics ROIHealth Catalyst
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeDataWorks Summit
 
Intel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analyticsIntel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analyticsCarestream
 
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...healthdata be
 
Ajith M Jose_Report1.docx
Ajith M Jose_Report1.docxAjith M Jose_Report1.docx
Ajith M Jose_Report1.docxmca2206
 

Similar to Big-Data in HealthCare _ Overview (20)

76 s201915
76 s20191576 s201915
76 s201915
 
Shanghai health bureau_big_data_healthcare
Shanghai health bureau_big_data_healthcareShanghai health bureau_big_data_healthcare
Shanghai health bureau_big_data_healthcare
 
Big data in IoT for healthcare - www.pepgra.com
Big data in IoT for healthcare - www.pepgra.comBig data in IoT for healthcare - www.pepgra.com
Big data in IoT for healthcare - www.pepgra.com
 
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive ModelMoving Health Care Analytics to Hadoop to Build a Better Predictive Model
Moving Health Care Analytics to Hadoop to Build a Better Predictive Model
 
Technological Forecasting & Social Change 126 (2018) 3–13C.docx
Technological Forecasting & Social Change 126 (2018) 3–13C.docxTechnological Forecasting & Social Change 126 (2018) 3–13C.docx
Technological Forecasting & Social Change 126 (2018) 3–13C.docx
 
2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine2016.10 HPDA in Precision Medicine
2016.10 HPDA in Precision Medicine
 
Big data analystics
Big data analysticsBig data analystics
Big data analystics
 
Hadoop Enabled Healthcare
Hadoop Enabled HealthcareHadoop Enabled Healthcare
Hadoop Enabled Healthcare
 
IRJET- Predictive Analysis and Healthcare of Diabetes
IRJET- Predictive Analysis and Healthcare of DiabetesIRJET- Predictive Analysis and Healthcare of Diabetes
IRJET- Predictive Analysis and Healthcare of Diabetes
 
Ehr challenges [bigdata]
Ehr challenges [bigdata]Ehr challenges [bigdata]
Ehr challenges [bigdata]
 
Infosession for IQED dataproviders (14-22.04.2016)
Infosession for IQED dataproviders (14-22.04.2016)Infosession for IQED dataproviders (14-22.04.2016)
Infosession for IQED dataproviders (14-22.04.2016)
 
Presentation (2015.11.16) of healthdata.be project @ Abrumet
Presentation (2015.11.16) of healthdata.be project @ AbrumetPresentation (2015.11.16) of healthdata.be project @ Abrumet
Presentation (2015.11.16) of healthdata.be project @ Abrumet
 
Presentation project healthdata.be for hospitals in Limburg (dd. 2015.11.30.)...
Presentation project healthdata.be for hospitals in Limburg (dd. 2015.11.30.)...Presentation project healthdata.be for hospitals in Limburg (dd. 2015.11.30.)...
Presentation project healthdata.be for hospitals in Limburg (dd. 2015.11.30.)...
 
Data-driven Healthcare
Data-driven HealthcareData-driven Healthcare
Data-driven Healthcare
 
Private Hidden Data for Health Care
Private Hidden Data for Health CarePrivate Hidden Data for Health Care
Private Hidden Data for Health Care
 
Rapid Response Analytics Solution Accelerates Analytics ROI
Rapid Response Analytics Solution Accelerates Analytics ROIRapid Response Analytics Solution Accelerates Analytics ROI
Rapid Response Analytics Solution Accelerates Analytics ROI
 
Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
 
Intel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analyticsIntel next-generation-medical-imaging-data-and-analytics
Intel next-generation-medical-imaging-data-and-analytics
 
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...
 
Ajith M Jose_Report1.docx
Ajith M Jose_Report1.docxAjith M Jose_Report1.docx
Ajith M Jose_Report1.docx
 

Recently uploaded

What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 

Recently uploaded (20)

What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 

Big-Data in HealthCare _ Overview

  • 1. BIG DATA IN HEALTHCARE MSc. Information Systems 2014 / 2015 Presented by: Younes Hamdaoui (Std_ID 21266804) Supervised by: Dr Samia Oussena Dr Wei Jie
  • 2. INTRODUCTION  Clinics and health organisations have more reasons to become data-driven.  Patients are expecting quality and safety in treatments, fast clinical results and doctors need electronic systems and analytical tools to make precise and rapid judgements.  The size of data generated in healthcare sector is rapidly growing. “U.S. health care data alone reached 150 exabytes in 2011”. (Institute for Health Technology Transformation. Transforming Health Care Through Big Data. (2013))
  • 3. PLAN INTRODUCTION WHY BIG DATA IN HEALTHCARE (HC) ? • HISTORY OF DATA USAGE IN HC • BIG DATA IN HC TODAY BIG DATA APPLICATION IN HEALTH SECTOR • COLLECTING DATA • PROCESSING AND ANALYSING DATA FUTURE OF BIG DATA IN HC CONCLUSION
  • 4. WHY BIG DATA IN HEALTHCARE ? HISTORY OF DATA USAGE IN HC (1)  Paper based forms are still used in many developing countries (Knocking on doors to collect information). The collected data needs to be moved to a computerized system;  The process of computerizing data affects the quality of data and takes time;  Most of the decisions in global health are based on old data;  HC sector lacked important data that could help improve this area, questions like : How many people are affected by diseases and disasters ? How many children were born or died last week? And under what circumstances ? Were impossible to answer rapidly.
  • 5.  Today health organisations use more developed tools to deal with the huge amount of data created daily, to track their data and use it for analysis. • Microsoft Excel monitoring spreadsheets • Electronic Health Record (EHR) • Relational Database Management Systems (RDBMS) • Business Intelligence (BI) tools (SAP Business Objects, SQL Server Integration/Analysis/Reporting Services, IBM Cognos)  Complex, slow, very expensive and no real-time analysis. HISTORY OF DATA USAGE IN HC (2) “80% of the development effort in a traditional big data project goes into data integration and only 20% percent goes toward data analysis.”(Big Data Analytics. Extract, Transform, and Load Big Data with Apache Hadoop. (n.d.) INTEL White Paper.)
  • 6. BIG DATA IN HEALTHCARE TODAY Real cases :  Propeller Health: Uses data from sensors for asthma inhalers and from mobile applications to help determine patients with asthma risks before an attack arises. Collects weather and air quality information to classify risk-level and risk factors by area.  NextBio: Uses personal patients information, molecular and genomic data to help making personalised medical decisions. (http://profitable-practice.softwareadvice.com/what-is-big-data-in-healthcare- 0813/)
  • 7. Other Examples:  Comparative Effectiveness research: analysing clinical and financial efficiency of interventions to enhance clinical care services in terms of quality and performance.  Clinical Operation Intelligence: find misuse in clinical operations in order to improve them.  Public Health Analysis: analysing health data sets of populations to determine the overall effectiveness of medications. (http://big-project.eu/blog/potential-big-data-applications-healthcare-sector)
  • 8. THE ETL PROCESS  The extract, transform and load process is a crucial component for populating data systems.  An ETL process recaptures data from various systems, refine and prepare it for future investigation by using analytic and reporting tools. ETL, Songini, M.L. (2004)
  • 9. BIG DATA APPLICATION IN HEALTH SECTOR COLLECTING DATA % sqoop import --connect jdbc:mysql://localhost/UWLHealth --table BloodPressure -m 1 Importing a single Table Source Channel Sink Specifying the path to the locations of log files Holding area where events flows are defined through interceptors and channel selectors -Process events only through the channel -Writes data to HDFS or HBase Agent e v e n t s Processed Logs Hadoop Image Processing Interface
  • 10. PROCESSING AND ANALYSING DATA Refining the extracted data using Hive Query Language CREATE TABLE UWL_BloodPressure as select *, normalerate – stdrate as rate_diff, IF((normalerate - stdrate) > 20, ‘LOW', IF((normalerate – stdrate) < -20, ‘HIGH', 'NORMAL')) AS BPressure, IF((normalerate - stdrate) > 20, ‘NOTOK', IF((normalerate - stdrate) < -20, ‘NOTOK', ‘OK’)) AS BPressure_variation from UWL_Health; “The Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL.” (https://hive.apache.org/) ODBC DRIVER Visualization of the UWL_BloodPressure table Analysis
  • 11. FUTURE OF BIG DATA IN HC SENSORS Real-time visualization HEALTH DATA Real-time Interaction with patients Real-time tracking of health Real-time detection of anomalies Real-time recommendations Automatic urgency detection Doctors will diagnose patients the minute they step in hospitals Real-time prescriptions, advices and response to emergencies Disease surveillance
  • 13. References  Transforming Health Care Through Big Data. (n.d.) .Institute for Health Technology Transformation. Available at: http://ihealthtran.com/big-data-in-healthcare [Accessed 5 Mar. 2015].  Big Data Analytics. Extract, Transform, and Load Big Data with Apache Hadoop. (n.d.) INTEL White Paper.  Profitable-practice.softwareadvice.com. What is “Big Data” in Healthcare, and Who’s Already Doing It?. Available at: http://profitable-practice.softwareadvice.com/what-is- big-data-in-healthcare-0813/ [Accessed 11 Mar. 2015].  Big-project.eu. The Potential of Big Data Applications for the Healthcare Sector | BIG - Big Data Public Private Forum. Available at: http://big-project.eu/blog/potential-big- data-applications-healthcare-sector [Accessed 11 Mar. 2015].  Songini, M.L. 2004, "ETL", Computerworld, [Online], vol. 38, no. 5, pp. 23.  Reach1to1 Technologies, (2015). Hive, Pig and Sqoop. [online] Available at: http://reach1to1.com/technology/hive-pig-sqoop/ [Accessed 11 Mar. 2015].  Hortonworks, (2015). Apache Flume. [online] Available at: http://hortonworks.com/hadoop/flume/ [Accessed 11 Mar. 2015].  Wikipedia, (2015). Apache Hive. [online] Available at: http://en.wikipedia.org/wiki/Apache_Hive [Accessed 11 Mar. 2015].  Wikipedia, (2015). Pig (programming tool). [online] Available at: http://en.wikipedia.org/wiki/Pig_%28programming_tool%29 [Accessed 11 Mar. 2015].

Editor's Notes

  1. “Five exabytes of data would contain all the words ever spoken by human beings on earth.” (Institute for Health Technology Transformation. Transforming Health Care Through Big Data. (2013))
  2. Propeller Health : company focused on asthma management
  3. Clinical Operation Intelligence: “e.g. analysing medical procedures to find performance opportunities, such as improved clinical processes, fine-tuning and adaptation of clinical guidelines.” Public Health Analysis: “e.g. using nation-wide disease registries, databases covering secondary data related to patients with specific diagnosis or procedure.” (http://big-project.eu/blog/potential-big-data-applications-healthcare-sector)
  4. The ETL process in Big Data tools is more efficient that traditional ETL tools (BI tools): BI ETL : Expensive, difficult to implement, time consuming BIG DATA ETL : faster, huge amount of data, parallel processing, cheaper
  5. “Sqoop is a tool designed to transfer data between Hadoop and relational database management system (RDBMS). We can use Sqoop to import data from a RDBMS such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform the data in Hadoop by running MapReduce job, and then export the data back into an RDBMS.” (http://reach1to1.com/technology/hive-pig-sqoop/) “Apache™ Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of streaming data into the Hadoop Distributed File System (HDFS).” (http://hortonworks.com/hadoop/flume/)
  6. “Apache Hive is datawarehouse infrastructure built on top of hadoop for providing data summarization, query and analysis”(http://en.wikipedia.org/wiki/Apache_Hive) “Pig is a high-level platform for creating MapReduce programs used with Hadoop. The language for this platform is called Pig Latin. Pig Latin abstracts the programming from the Java MapReduce idiom into a notation which makes MapReduce programming high level, similar to that of SQL for RDBMS.” ( ) ODBC : Open Database Connectivity.