SlideShare a Scribd company logo
1 of 30
April 10-12, Chicago, IL
Ensuring Compliance of
Patient Data with Big Data
and BI
Ayad Shammout & Denny Lee
April 10-12, Chicago, IL
Please silence
cell phones
3
Agenda
A Quick Big Data Primer
Healthcare and Big Data
Compliance and Auditing
SQL Compliance Project
Compliance and Auditing with Big Data and BI
Big Data: Unstructured Volumes of Data
Analytics: PowerPivot, Power View
4
What is Big Data?
Volume
Exceeds physical limits of vertical scalability
Velocity
Decision window small compared to data
change rate
Variety
Many different formats makes integration
expensive
Variability
Many options or variable interpretations
confound analysis
5
10x
increase every
five years
85%from
new data types
Data
explosion
Volume
Velocity
Variety
Hadoop
Cloud
By 2015, organizations that
build a modern information
management system will
outperform their peers
financially by 20 percent.

 – Gartner, Mark Beyer
“Information Management in the
21st Century”
7
Big Data Business Value
140,000-190,000
1.5 million
$300 billion
15 out of 17
€250 billion 50-60%
8
Data
9
Hadoop: The most visible face of Big Data
10
HDInsight: Visit HadoopOnAzure.com
10
Healthcare
and Big Data
12
Healthcare and IT
Often the laggard in technology
Yet application of IT to healthcare can radically change what we can do
Genomic Sequencing
Proteomic sequencing
Incidence Prediction
13
Healthcare Big Data Example Scenarios
Clinical Trial Deviations
Originally Viagra was developed to lower blood pressure and treat Angina
Now its used to help newborn pulmonary hypertension and altitude sickness
Incidence Prediction
Missed 4 or more visits, twice as likely to have an asthmatic incident
Particular Cardiac monitor sine wave points to highly likelihood of heart attack
Campaigns
Social media and advertising campaigns to understand user behavior and sentiment
Patient Satisfaction
Social media and advertising campaigns to understand user behavior and sentiment
14
BIDMC Auditing Scenario
Auditing is critical component HIPAA in ensuring patient privacy
1 Billion rows+ of audit data
146 mission critical clinical applications
Comprehensive audits yield 300-500k transactions/day
HIPAA requires audit system with 20 years of data
Auditing Project
Available to community as part of Compliance SDK
Updating for SQL Server 2012, HDInsight, Power View, and MobileBI*
Creating an enterprise tool for consolidated storage, reporting and alerting of all application audit
data - that's cool!
John Halamka’s Cool Technology of the Week
(Wellsphere Top Health Blogger, Health Impact Award)
15
BIDMC Compliance Project
HDInsight
Windows
HDInsight
Azure
SQLServer
2008/2012
Audit LogsETL Logs to
HDFS
Use Excel 2013
PowerPivot and Power
View
SSAS (tabular)
16
Auditing Sensitive Information
16
Querying Audit Information
Use PowerPivot / Power View / Analysis Services to Query the data.
Security InformationPolicy Information
Process Audit Information
Use SSIS to process SQL2008 All-Actions Audit Information and other CG application
audit log data; potentially can use Management Performance DW framework.
Caregroup Environment
File Server
SQL Audit
Connect/Logic
SSIS
CG Application Data
Intersystems
Cache
SQL2005
Oracle
SQL2008 All-Actions Audit Data
SQL 2008 / 2012 R2
SSRS 2008 /
Power View
Policy Analysis
Policy Reports
Policy Best
Practices
Security Analysis
Security Reports
Compliance
Reports
Feedback Action Loop
Update systems to keep them
compliant and secure
Audit Logs
17
Storage Infrastructure
Transfer files to ASV via AzCopy,
CloudExplorer, etc.
18
Storage Infrastructure
18
Hadoop on Azure
Compute Nodes (Medium VMs)
Azure Storage Vault (ASV)
Azure Blob Storage
Azure Flat Network Storage
19
Storage Infrastructure
19
Hadoop on Azure
Compute Nodes (Medium VMs)
Azure Storage Vault (ASV)
Azure Blob Storage
Azure Flat Network Storage
Stream data
To compute
Push data
Back to Storage
map sort shuffle reduce
http://dennyglee.com/2013/03/18/why-use-blob-storage-with-hdinsight-on-azure/
2020
SSIS
Processing
2121
SSIS to SSAS
Partition
Management
22
SSAS
Tabular
of HoA
Audit
Data
23
Hadoop / Auditing: File sizes
Currently testing gz vs. raw
E.g. 12MB raw text file vs. 633Kb gz file (~20x compression)
20x smaller size, ~same query time
Approx same map / reduce task utilization
File Size is 250MB-1GB
SSIS package takes care of the size
Future testing: avro, protobuf
23
Query Duration (s)
select count(*) from sql_audit_asv_raw 56.066
select count(*) from sql_audit_asv_gz 58.994
24
Hadoop / Auditing: Formats
For ease of processing, replace carriage returns within embedded SQL
statements, e.g.
select col1, col2
from tableA
to
select col1, col2 from tableA
This allows you to create a Hive table using CR as row delimiter (i.e.
does not have things like SQL quoted identifiers)
24
25
SQOOP, HiveODBC,
Templeton, CSV, etc
BI Connectivity
27
Big Data … Excel-lerated!
2 Server, 3mo
110 GB
binary
files
SSIS extraction
1.2GB of text
120MB gz
Hadoop to
PowerPivot
6MB
28
PowerPivot workbook of HoA Audit data
29
Power View of HoA Audit Data
April 10-12, Chicago, IL
Thank you!
Diamond Sponsor

More Related Content

What's hot

How Financial Services can Save On File Storage
How Financial Services can Save On File Storage How Financial Services can Save On File Storage
How Financial Services can Save On File Storage Charly Mostert
 
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyFrom Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyCambridge Semantics
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationCambridge Semantics
 
Xanadu Based Big Data CBIR System:Automated Diseases Classification & Diagnosis
Xanadu Based Big Data CBIR System:Automated Diseases Classification & DiagnosisXanadu Based Big Data CBIR System:Automated Diseases Classification & Diagnosis
Xanadu Based Big Data CBIR System:Automated Diseases Classification & DiagnosisAlex G. Lee, Ph.D. Esq. CLP
 
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...Cambridge Semantics
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Cambridge Semantics
 
Conduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization ToolConduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization ToolRuthie Senanayake
 
Top 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & AnalyticsTop 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & AnalyticsTeqforce Solutions
 
Top 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & AnalyticsTop 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & AnalyticsTeqforce Solutions
 
Business Insight
Business InsightBusiness Insight
Business InsightMicrosoft
 
BIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsBIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsSkillspeed
 
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo DataFest 2016: The Role of Data Virtualization in IoT IntegrationDenodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo DataFest 2016: The Role of Data Virtualization in IoT IntegrationDenodo
 
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...Denodo
 
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...Mike Rossi
 
Leveraging a big data model in the IT domain
Leveraging a big data model in the IT domainLeveraging a big data model in the IT domain
Leveraging a big data model in the IT domainVSS Monitoring
 
Data in Motion vs Data at Rest
Data in Motion vs Data at RestData in Motion vs Data at Rest
Data in Motion vs Data at RestInternap
 
LendingClub RealTime BigData Platform with Oracle GoldenGate
LendingClub RealTime BigData Platform with Oracle GoldenGateLendingClub RealTime BigData Platform with Oracle GoldenGate
LendingClub RealTime BigData Platform with Oracle GoldenGateRajit Saha
 
Building trust in your data lake. A fintech case study on automated data disc...
Building trust in your data lake. A fintech case study on automated data disc...Building trust in your data lake. A fintech case study on automated data disc...
Building trust in your data lake. A fintech case study on automated data disc...DataWorks Summit
 

What's hot (20)

How Financial Services can Save On File Storage
How Financial Services can Save On File Storage How Financial Services can Save On File Storage
How Financial Services can Save On File Storage
 
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyFrom Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
 
Xanadu Based Big Data CBIR System:Automated Diseases Classification & Diagnosis
Xanadu Based Big Data CBIR System:Automated Diseases Classification & DiagnosisXanadu Based Big Data CBIR System:Automated Diseases Classification & Diagnosis
Xanadu Based Big Data CBIR System:Automated Diseases Classification & Diagnosis
 
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
Graph-driven Data Integration: Accelerating and Automating Data Delivery for ...
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
 
The Year of the Graph
The Year of the GraphThe Year of the Graph
The Year of the Graph
 
Conduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization ToolConduit - A Lightweight Data Virtualization Tool
Conduit - A Lightweight Data Virtualization Tool
 
Top 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & AnalyticsTop 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & Analytics
 
Top 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & AnalyticsTop 5 Trends in Big Data & Analytics
Top 5 Trends in Big Data & Analytics
 
Big Idea For Big Data
Big Idea For Big DataBig Idea For Big Data
Big Idea For Big Data
 
Business Insight
Business InsightBusiness Insight
Business Insight
 
BIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsBIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in Logistics
 
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo DataFest 2016: The Role of Data Virtualization in IoT IntegrationDenodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
 
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
 
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
 
Leveraging a big data model in the IT domain
Leveraging a big data model in the IT domainLeveraging a big data model in the IT domain
Leveraging a big data model in the IT domain
 
Data in Motion vs Data at Rest
Data in Motion vs Data at RestData in Motion vs Data at Rest
Data in Motion vs Data at Rest
 
LendingClub RealTime BigData Platform with Oracle GoldenGate
LendingClub RealTime BigData Platform with Oracle GoldenGateLendingClub RealTime BigData Platform with Oracle GoldenGate
LendingClub RealTime BigData Platform with Oracle GoldenGate
 
Building trust in your data lake. A fintech case study on automated data disc...
Building trust in your data lake. A fintech case study on automated data disc...Building trust in your data lake. A fintech case study on automated data disc...
Building trust in your data lake. A fintech case study on automated data disc...
 

Viewers also liked

An introduction to Tryzens
An introduction to TryzensAn introduction to Tryzens
An introduction to TryzensTryzens
 
Cyber security event
Cyber security eventCyber security event
Cyber security eventTryzens
 
Where is Nigeria on Universal Health Coverage (UHC)?
Where is Nigeria on Universal Health Coverage (UHC)?Where is Nigeria on Universal Health Coverage (UHC)?
Where is Nigeria on Universal Health Coverage (UHC)?HFG Project
 
Institutional Roles and Relationships Governing the Quality of Health Care
Institutional Roles and Relationships Governing the Quality of Health CareInstitutional Roles and Relationships Governing the Quality of Health Care
Institutional Roles and Relationships Governing the Quality of Health CareHFG Project
 
Ethiopia: Governing for Quality Improvement in the Context of UHC
Ethiopia: Governing for Quality Improvement in the Context of UHCEthiopia: Governing for Quality Improvement in the Context of UHC
Ethiopia: Governing for Quality Improvement in the Context of UHCHFG Project
 
AMI - Corp Presentation - ENG - 20140207b
AMI - Corp Presentation - ENG - 20140207bAMI - Corp Presentation - ENG - 20140207b
AMI - Corp Presentation - ENG - 20140207bGuillaume Corpart
 
Options for Developing a Collective Payment System and Co-payment Mechanism f...
Options for Developing a Collective Payment System and Co-payment Mechanism f...Options for Developing a Collective Payment System and Co-payment Mechanism f...
Options for Developing a Collective Payment System and Co-payment Mechanism f...HFG Project
 
Raising Revenue for Health: Revenue Generation
Raising Revenue for Health: Revenue GenerationRaising Revenue for Health: Revenue Generation
Raising Revenue for Health: Revenue GenerationHFG Project
 
The Devil is in the Details: Designing and Implementing UHC Policies that Rea...
The Devil is in the Details: Designing and Implementing UHC Policies that Rea...The Devil is in the Details: Designing and Implementing UHC Policies that Rea...
The Devil is in the Details: Designing and Implementing UHC Policies that Rea...HFG Project
 
Proposal concept ngày hội gia đình case kết nối yêu thương (revised 1)
Proposal concept ngày hội gia đình case kết nối yêu thương (revised 1)Proposal concept ngày hội gia đình case kết nối yêu thương (revised 1)
Proposal concept ngày hội gia đình case kết nối yêu thương (revised 1)Vietnam Event & Communication Services J.S.C
 
Expanding Health Coverage to Informal Workers in USAID Priority Countries
Expanding Health Coverage to Informal Workers in USAID Priority CountriesExpanding Health Coverage to Informal Workers in USAID Priority Countries
Expanding Health Coverage to Informal Workers in USAID Priority CountriesHFG Project
 
Twitter Training for the Medical Sector
Twitter Training for the Medical SectorTwitter Training for the Medical Sector
Twitter Training for the Medical SectorTryzens
 
MBC Twitter Training
MBC Twitter TrainingMBC Twitter Training
MBC Twitter TrainingTryzens
 

Viewers also liked (20)

Minecraft birthday concept
Minecraft birthday concept Minecraft birthday concept
Minecraft birthday concept
 
An introduction to Tryzens
An introduction to TryzensAn introduction to Tryzens
An introduction to Tryzens
 
Cyber security event
Cyber security eventCyber security event
Cyber security event
 
Proposal concept Ninjago (final)
Proposal concept  Ninjago (final)Proposal concept  Ninjago (final)
Proposal concept Ninjago (final)
 
Where is Nigeria on Universal Health Coverage (UHC)?
Where is Nigeria on Universal Health Coverage (UHC)?Where is Nigeria on Universal Health Coverage (UHC)?
Where is Nigeria on Universal Health Coverage (UHC)?
 
Institutional Roles and Relationships Governing the Quality of Health Care
Institutional Roles and Relationships Governing the Quality of Health CareInstitutional Roles and Relationships Governing the Quality of Health Care
Institutional Roles and Relationships Governing the Quality of Health Care
 
Ethiopia: Governing for Quality Improvement in the Context of UHC
Ethiopia: Governing for Quality Improvement in the Context of UHCEthiopia: Governing for Quality Improvement in the Context of UHC
Ethiopia: Governing for Quality Improvement in the Context of UHC
 
AMI - Corp Presentation - ENG - 20140207b
AMI - Corp Presentation - ENG - 20140207bAMI - Corp Presentation - ENG - 20140207b
AMI - Corp Presentation - ENG - 20140207b
 
Options for Developing a Collective Payment System and Co-payment Mechanism f...
Options for Developing a Collective Payment System and Co-payment Mechanism f...Options for Developing a Collective Payment System and Co-payment Mechanism f...
Options for Developing a Collective Payment System and Co-payment Mechanism f...
 
Raising Revenue for Health: Revenue Generation
Raising Revenue for Health: Revenue GenerationRaising Revenue for Health: Revenue Generation
Raising Revenue for Health: Revenue Generation
 
The Devil is in the Details: Designing and Implementing UHC Policies that Rea...
The Devil is in the Details: Designing and Implementing UHC Policies that Rea...The Devil is in the Details: Designing and Implementing UHC Policies that Rea...
The Devil is in the Details: Designing and Implementing UHC Policies that Rea...
 
Proposal Family Day Event_ Kimberly Clark
Proposal Family Day Event_ Kimberly ClarkProposal Family Day Event_ Kimberly Clark
Proposal Family Day Event_ Kimberly Clark
 
Proposal concept ngày hội gia đình case kết nối yêu thương (revised 1)
Proposal concept ngày hội gia đình case kết nối yêu thương (revised 1)Proposal concept ngày hội gia đình case kết nối yêu thương (revised 1)
Proposal concept ngày hội gia đình case kết nối yêu thương (revised 1)
 
Seeds of the soul party
Seeds of the soul partySeeds of the soul party
Seeds of the soul party
 
Expanding Health Coverage to Informal Workers in USAID Priority Countries
Expanding Health Coverage to Informal Workers in USAID Priority CountriesExpanding Health Coverage to Informal Workers in USAID Priority Countries
Expanding Health Coverage to Informal Workers in USAID Priority Countries
 
Twitter Training for the Medical Sector
Twitter Training for the Medical SectorTwitter Training for the Medical Sector
Twitter Training for the Medical Sector
 
Proposal concept_wedding party_ Phuong & Gavin
Proposal concept_wedding party_ Phuong & GavinProposal concept_wedding party_ Phuong & Gavin
Proposal concept_wedding party_ Phuong & Gavin
 
Bao nhu ‘s 10th birthday party(1)
Bao nhu ‘s 10th birthday party(1)Bao nhu ‘s 10th birthday party(1)
Bao nhu ‘s 10th birthday party(1)
 
MBC Twitter Training
MBC Twitter TrainingMBC Twitter Training
MBC Twitter Training
 
VECS_ Portfolio Corporate 25.07.2016
VECS_ Portfolio Corporate 25.07.2016VECS_ Portfolio Corporate 25.07.2016
VECS_ Portfolio Corporate 25.07.2016
 

Similar to Ensuring compliance of patient data with big data

Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)Denny Lee
 
Three Dimensions of Data as a Service
Three Dimensions of Data as a ServiceThree Dimensions of Data as a Service
Three Dimensions of Data as a ServiceDenodo
 
Matthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCMMatthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCMHoi Lan Leong
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Denodo
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2Joe_F
 
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of TechnologyGuest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of TechnologyNishant Gandhi
 
The future of scaling forrester research - GigaSpaces Road Show 2011
The future of scaling forrester research - GigaSpaces Road Show 2011The future of scaling forrester research - GigaSpaces Road Show 2011
The future of scaling forrester research - GigaSpaces Road Show 2011Nati Shalom
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise DataWorks Summit
 
Get Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceGet Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceIBM Cloud Data Services
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesDataWorks Summit
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIDenodo
 
2014 Big_Data_Forum_HGST
2014 Big_Data_Forum_HGST2014 Big_Data_Forum_HGST
2014 Big_Data_Forum_HGSTCOMPUTEX TAIPEI
 
SC7 Workshop 1: Big Data in Secure Societies
SC7 Workshop 1: Big Data in Secure Societies SC7 Workshop 1: Big Data in Secure Societies
SC7 Workshop 1: Big Data in Secure Societies BigData_Europe
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONBig Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONMatt Stubbs
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantageAmazon Web Services
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBigDataExpo
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overviewvhrocca
 

Similar to Ensuring compliance of patient data with big data (20)

Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
Ensuring compliance of patient data with big data and bi [bdii 301-m] - (4078)
 
Three Dimensions of Data as a Service
Three Dimensions of Data as a ServiceThree Dimensions of Data as a Service
Three Dimensions of Data as a Service
 
Matthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCMMatthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCM
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2
 
13 pv-do es-18-bigdata-v3
13 pv-do es-18-bigdata-v313 pv-do es-18-bigdata-v3
13 pv-do es-18-bigdata-v3
 
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of TechnologyGuest Lecture: Introduction to Big Data at Indian Institute of Technology
Guest Lecture: Introduction to Big Data at Indian Institute of Technology
 
The future of scaling forrester research - GigaSpaces Road Show 2011
The future of scaling forrester research - GigaSpaces Road Show 2011The future of scaling forrester research - GigaSpaces Road Show 2011
The future of scaling forrester research - GigaSpaces Road Show 2011
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise
 
Get Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceGet Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a Service
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data Lakes
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
2014 Big_Data_Forum_HGST
2014 Big_Data_Forum_HGST2014 Big_Data_Forum_HGST
2014 Big_Data_Forum_HGST
 
SC7 Workshop 1: Big Data in Secure Societies
SC7 Workshop 1: Big Data in Secure Societies SC7 Workshop 1: Big Data in Secure Societies
SC7 Workshop 1: Big Data in Secure Societies
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONBig Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantage
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of Analytics
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overview
 
Big data analysis concepts and references
Big data analysis concepts and referencesBig data analysis concepts and references
Big data analysis concepts and references
 

Recently uploaded

Bangalore call girl 👯‍♀️@ Simran Independent Call Girls in Bangalore GIUXUZ...
Bangalore call girl  👯‍♀️@ Simran Independent Call Girls in Bangalore  GIUXUZ...Bangalore call girl  👯‍♀️@ Simran Independent Call Girls in Bangalore  GIUXUZ...
Bangalore call girl 👯‍♀️@ Simran Independent Call Girls in Bangalore GIUXUZ...Gfnyt
 
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in LucknowRussian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknowgragteena
 
VIP Kolkata Call Girl New Town 👉 8250192130 Available With Room
VIP Kolkata Call Girl New Town 👉 8250192130  Available With RoomVIP Kolkata Call Girl New Town 👉 8250192130  Available With Room
VIP Kolkata Call Girl New Town 👉 8250192130 Available With Roomdivyansh0kumar0
 
Hot Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
Hot  Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In ChandigarhHot  Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
Hot Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In ChandigarhVip call girls In Chandigarh
 
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...gurkirankumar98700
 
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real MeetChandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meetpriyashah722354
 
VIP Call Girl Sector 88 Gurgaon Delhi Just Call Me 9899900591
VIP Call Girl Sector 88 Gurgaon Delhi Just Call Me 9899900591VIP Call Girl Sector 88 Gurgaon Delhi Just Call Me 9899900591
VIP Call Girl Sector 88 Gurgaon Delhi Just Call Me 9899900591adityaroy0215
 
Krishnagiri call girls Tamil aunty 7877702510
Krishnagiri call girls Tamil aunty 7877702510Krishnagiri call girls Tamil aunty 7877702510
Krishnagiri call girls Tamil aunty 7877702510Vipesco
 
indian Call Girl Panchkula ❤️🍑 9907093804 Low Rate Call Girls Ludhiana Tulsi
indian Call Girl Panchkula ❤️🍑 9907093804 Low Rate Call Girls Ludhiana Tulsiindian Call Girl Panchkula ❤️🍑 9907093804 Low Rate Call Girls Ludhiana Tulsi
indian Call Girl Panchkula ❤️🍑 9907093804 Low Rate Call Girls Ludhiana TulsiHigh Profile Call Girls Chandigarh Aarushi
 
Basics of Anatomy- Language of Anatomy.pptx
Basics of Anatomy- Language of Anatomy.pptxBasics of Anatomy- Language of Anatomy.pptx
Basics of Anatomy- Language of Anatomy.pptxAyush Gupta
 
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋Sheetaleventcompany
 
💚😋Kolkata Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Kolkata Escort Service Call Girls, ₹5000 To 25K With AC💚😋💚😋Kolkata Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Kolkata Escort Service Call Girls, ₹5000 To 25K With AC💚😋Sheetaleventcompany
 
❤️♀️@ Jaipur Call Girls ❤️♀️@ Jaispreet Call Girl Services in Jaipur QRYPCF ...
❤️♀️@ Jaipur Call Girls ❤️♀️@ Jaispreet Call Girl Services in Jaipur QRYPCF  ...❤️♀️@ Jaipur Call Girls ❤️♀️@ Jaispreet Call Girl Services in Jaipur QRYPCF  ...
❤️♀️@ Jaipur Call Girls ❤️♀️@ Jaispreet Call Girl Services in Jaipur QRYPCF ...Gfnyt.com
 
No Advance 9053900678 Chandigarh Call Girls , Indian Call Girls For Full Ni...
No Advance 9053900678 Chandigarh  Call Girls , Indian Call Girls  For Full Ni...No Advance 9053900678 Chandigarh  Call Girls , Indian Call Girls  For Full Ni...
No Advance 9053900678 Chandigarh Call Girls , Indian Call Girls For Full Ni...Vip call girls In Chandigarh
 
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...Russian Call Girls Amritsar
 
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...Call Girls Noida
 
Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Book me...
Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Book me...Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Book me...
Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Book me...gragteena
 

Recently uploaded (20)

Bangalore call girl 👯‍♀️@ Simran Independent Call Girls in Bangalore GIUXUZ...
Bangalore call girl  👯‍♀️@ Simran Independent Call Girls in Bangalore  GIUXUZ...Bangalore call girl  👯‍♀️@ Simran Independent Call Girls in Bangalore  GIUXUZ...
Bangalore call girl 👯‍♀️@ Simran Independent Call Girls in Bangalore GIUXUZ...
 
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in LucknowRussian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
Russian Escorts Aishbagh Road * 9548273370 Naughty Call Girls Service in Lucknow
 
VIP Kolkata Call Girl New Town 👉 8250192130 Available With Room
VIP Kolkata Call Girl New Town 👉 8250192130  Available With RoomVIP Kolkata Call Girl New Town 👉 8250192130  Available With Room
VIP Kolkata Call Girl New Town 👉 8250192130 Available With Room
 
Hot Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
Hot  Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In ChandigarhHot  Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
Hot Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
 
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
 
(ILA) Call Girls in Kolkata Call Now 8617697112 Kolkata Escorts
(ILA) Call Girls in Kolkata Call Now 8617697112 Kolkata Escorts(ILA) Call Girls in Kolkata Call Now 8617697112 Kolkata Escorts
(ILA) Call Girls in Kolkata Call Now 8617697112 Kolkata Escorts
 
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real MeetChandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
 
VIP Call Girl Sector 88 Gurgaon Delhi Just Call Me 9899900591
VIP Call Girl Sector 88 Gurgaon Delhi Just Call Me 9899900591VIP Call Girl Sector 88 Gurgaon Delhi Just Call Me 9899900591
VIP Call Girl Sector 88 Gurgaon Delhi Just Call Me 9899900591
 
Krishnagiri call girls Tamil aunty 7877702510
Krishnagiri call girls Tamil aunty 7877702510Krishnagiri call girls Tamil aunty 7877702510
Krishnagiri call girls Tamil aunty 7877702510
 
indian Call Girl Panchkula ❤️🍑 9907093804 Low Rate Call Girls Ludhiana Tulsi
indian Call Girl Panchkula ❤️🍑 9907093804 Low Rate Call Girls Ludhiana Tulsiindian Call Girl Panchkula ❤️🍑 9907093804 Low Rate Call Girls Ludhiana Tulsi
indian Call Girl Panchkula ❤️🍑 9907093804 Low Rate Call Girls Ludhiana Tulsi
 
Basics of Anatomy- Language of Anatomy.pptx
Basics of Anatomy- Language of Anatomy.pptxBasics of Anatomy- Language of Anatomy.pptx
Basics of Anatomy- Language of Anatomy.pptx
 
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
 
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
 
💚😋Kolkata Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Kolkata Escort Service Call Girls, ₹5000 To 25K With AC💚😋💚😋Kolkata Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Kolkata Escort Service Call Girls, ₹5000 To 25K With AC💚😋
 
#9711199012# African Student Escorts in Delhi 😘 Call Girls Delhi
#9711199012# African Student Escorts in Delhi 😘 Call Girls Delhi#9711199012# African Student Escorts in Delhi 😘 Call Girls Delhi
#9711199012# African Student Escorts in Delhi 😘 Call Girls Delhi
 
❤️♀️@ Jaipur Call Girls ❤️♀️@ Jaispreet Call Girl Services in Jaipur QRYPCF ...
❤️♀️@ Jaipur Call Girls ❤️♀️@ Jaispreet Call Girl Services in Jaipur QRYPCF  ...❤️♀️@ Jaipur Call Girls ❤️♀️@ Jaispreet Call Girl Services in Jaipur QRYPCF  ...
❤️♀️@ Jaipur Call Girls ❤️♀️@ Jaispreet Call Girl Services in Jaipur QRYPCF ...
 
No Advance 9053900678 Chandigarh Call Girls , Indian Call Girls For Full Ni...
No Advance 9053900678 Chandigarh  Call Girls , Indian Call Girls  For Full Ni...No Advance 9053900678 Chandigarh  Call Girls , Indian Call Girls  For Full Ni...
No Advance 9053900678 Chandigarh Call Girls , Indian Call Girls For Full Ni...
 
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
Local Housewife and effective ☎️ 8250192130 🍉🍓 Sexy Girls VIP Call Girls Chan...
 
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
 
Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Book me...
Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Book me...Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Book me...
Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Book me...
 

Ensuring compliance of patient data with big data

  • 1. April 10-12, Chicago, IL Ensuring Compliance of Patient Data with Big Data and BI Ayad Shammout & Denny Lee
  • 2. April 10-12, Chicago, IL Please silence cell phones
  • 3. 3 Agenda A Quick Big Data Primer Healthcare and Big Data Compliance and Auditing SQL Compliance Project Compliance and Auditing with Big Data and BI Big Data: Unstructured Volumes of Data Analytics: PowerPivot, Power View
  • 4. 4 What is Big Data? Volume Exceeds physical limits of vertical scalability Velocity Decision window small compared to data change rate Variety Many different formats makes integration expensive Variability Many options or variable interpretations confound analysis
  • 5. 5 10x increase every five years 85%from new data types Data explosion Volume Velocity Variety Hadoop Cloud By 2015, organizations that build a modern information management system will outperform their peers financially by 20 percent.   – Gartner, Mark Beyer “Information Management in the 21st Century”
  • 6.
  • 7. 7 Big Data Business Value 140,000-190,000 1.5 million $300 billion 15 out of 17 €250 billion 50-60%
  • 9. 9 Hadoop: The most visible face of Big Data
  • 12. 12 Healthcare and IT Often the laggard in technology Yet application of IT to healthcare can radically change what we can do Genomic Sequencing Proteomic sequencing Incidence Prediction
  • 13. 13 Healthcare Big Data Example Scenarios Clinical Trial Deviations Originally Viagra was developed to lower blood pressure and treat Angina Now its used to help newborn pulmonary hypertension and altitude sickness Incidence Prediction Missed 4 or more visits, twice as likely to have an asthmatic incident Particular Cardiac monitor sine wave points to highly likelihood of heart attack Campaigns Social media and advertising campaigns to understand user behavior and sentiment Patient Satisfaction Social media and advertising campaigns to understand user behavior and sentiment
  • 14. 14 BIDMC Auditing Scenario Auditing is critical component HIPAA in ensuring patient privacy 1 Billion rows+ of audit data 146 mission critical clinical applications Comprehensive audits yield 300-500k transactions/day HIPAA requires audit system with 20 years of data Auditing Project Available to community as part of Compliance SDK Updating for SQL Server 2012, HDInsight, Power View, and MobileBI* Creating an enterprise tool for consolidated storage, reporting and alerting of all application audit data - that's cool! John Halamka’s Cool Technology of the Week (Wellsphere Top Health Blogger, Health Impact Award)
  • 15. 15 BIDMC Compliance Project HDInsight Windows HDInsight Azure SQLServer 2008/2012 Audit LogsETL Logs to HDFS Use Excel 2013 PowerPivot and Power View SSAS (tabular)
  • 16. 16 Auditing Sensitive Information 16 Querying Audit Information Use PowerPivot / Power View / Analysis Services to Query the data. Security InformationPolicy Information Process Audit Information Use SSIS to process SQL2008 All-Actions Audit Information and other CG application audit log data; potentially can use Management Performance DW framework. Caregroup Environment File Server SQL Audit Connect/Logic SSIS CG Application Data Intersystems Cache SQL2005 Oracle SQL2008 All-Actions Audit Data SQL 2008 / 2012 R2 SSRS 2008 / Power View Policy Analysis Policy Reports Policy Best Practices Security Analysis Security Reports Compliance Reports Feedback Action Loop Update systems to keep them compliant and secure
  • 17. Audit Logs 17 Storage Infrastructure Transfer files to ASV via AzCopy, CloudExplorer, etc.
  • 18. 18 Storage Infrastructure 18 Hadoop on Azure Compute Nodes (Medium VMs) Azure Storage Vault (ASV) Azure Blob Storage Azure Flat Network Storage
  • 19. 19 Storage Infrastructure 19 Hadoop on Azure Compute Nodes (Medium VMs) Azure Storage Vault (ASV) Azure Blob Storage Azure Flat Network Storage Stream data To compute Push data Back to Storage map sort shuffle reduce http://dennyglee.com/2013/03/18/why-use-blob-storage-with-hdinsight-on-azure/
  • 23. 23 Hadoop / Auditing: File sizes Currently testing gz vs. raw E.g. 12MB raw text file vs. 633Kb gz file (~20x compression) 20x smaller size, ~same query time Approx same map / reduce task utilization File Size is 250MB-1GB SSIS package takes care of the size Future testing: avro, protobuf 23 Query Duration (s) select count(*) from sql_audit_asv_raw 56.066 select count(*) from sql_audit_asv_gz 58.994
  • 24. 24 Hadoop / Auditing: Formats For ease of processing, replace carriage returns within embedded SQL statements, e.g. select col1, col2 from tableA to select col1, col2 from tableA This allows you to create a Hive table using CR as row delimiter (i.e. does not have things like SQL quoted identifiers) 24
  • 25. 25
  • 26. SQOOP, HiveODBC, Templeton, CSV, etc BI Connectivity
  • 27. 27 Big Data … Excel-lerated! 2 Server, 3mo 110 GB binary files SSIS extraction 1.2GB of text 120MB gz Hadoop to PowerPivot 6MB
  • 28. 28 PowerPivot workbook of HoA Audit data
  • 29. 29 Power View of HoA Audit Data
  • 30. April 10-12, Chicago, IL Thank you! Diamond Sponsor

Editor's Notes

  1. Centralizing Logs Allows you to have one system process all audit logs from your servers Easier manageability Set files to 250MB in size (less files, but not too large to process) Optimized for Hadoop General Rule of Thumb: 250MB-1GB file sizes Can also centralize processing … and centralize reporting Compliance SDK contains the full project Organized by Server, Database, DDL, and DML actions