Big Data: Why the big fuss?
Presenter
My blog: Information Management, Life & Petrol
http://infomanagementlifeandpetrol.blogspot.com
@InfoRacer
Chris Bradley
Chief Development Officer
chris.bradley@ipl.com
+44 1225 475000
Introductions
Chris has spent 32 years in the Information Management field, working for
leading organisations in Data Management Strategy, Master Data Management,
Metadata Management, Data Warehousing and Business Intelligence.
After graduating in 1979, Chris worked for the MoD (Navy), Volvo, Thorn EMI (as Head
of Information Management), Reader's Digest Inc (as European CIO), and
Coopers and Lybrand Management Consultancy, where he established and ran
the International Data Management practice.
Chris heads IPL’s Business Consultancy practice and is advising several
Energy, Pharmaceutical, Finance and Government clients on Business Process
and Information Asset Management.
Chris is a member of the MPO, Director of DAMA UK and holds the CDMP
Master certification. He co-authored “Data Modelling For The Business – A
Handbook for aligning the business with IT using high-level data models”.
Chris is a columnist and frequent contributor to industry publications. He authors
an expert channel on the influential BeyeNETWORK, is a recognised thought
leader in Information Management and a regular key speaker at major
international Information Management conferences.
Who is IPL?
Trusted, independent consulting & solutions co
30 year track record
300 staff, £28m+ turnover
High-stakes, business & mission critical contexts
Consistently exceed expectations
Business Consulting Division
Information Management
- IM Strategy
- Information Security & Assurance
- Data Governance
- Information Exploitation
- Master Data Management
- Information Architecture
- Business Intelligence
…turning information into a strategic asset
Enterprise Architecture
Business Process Management
Programme Management
IPL Consulting Clients
Three V’s
• Big data comes in one size: large. All enterprises are
awash with data, and can easily amass terabytes and
petabytes of information.
• Can systems scale up without degrading performance
intolerably?
Volume
• Frequently time-sensitive, big data should be used as
it streams into the enterprise in order to maximise its
value to the business.
• How can you calculate mean values across a
constantly changing landscape?
Velocity
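One common way to approach the question about mean values on streaming data is to update the mean incrementally as each value arrives, rather than recomputing it over the full history. The sketch below is illustrative only and is not taken from the presentation; the class name `RunningMean` and the sample readings are hypothetical.

```python
class RunningMean:
    """Incrementally maintained mean over a stream of values (illustrative)."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0

    def update(self, value):
        # new mean = old mean + (value - old mean) / n, so no history is stored
        self.count += 1
        self.mean += (value - self.mean) / self.count
        return self.mean


# Hypothetical readings arriving one at a time
stream = [10.0, 20.0, 30.0, 40.0]
rm = RunningMean()
for reading in stream:
    rm.update(reading)
```

The mean is available after every arrival, and memory use stays constant however long the stream runs.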
• Big data extends beyond structured data to include
unstructured data of all varieties: text, audio, video,
click streams, log files and more.
• How do you apply the normal methods of analytics
and reporting with unknown structures?
Variety
Data volume keeps growing
The total amount of global data is expected to grow to 2.7
zettabytes during 2012 (up 48% from 2011)*
Equivalent of every person sending 30 tweets/hour for the
next 1200 years!
Enterprises will manage 50 times more data and files will
grow 75 times in the next decade
80% of the world’s data is unstructured
* IDC Digital Universe Study 2011
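The figures on this slide can be sanity-checked with a little arithmetic. The sketch below is an assumption-laden check, not part of the original deck: it uses the decimal definition of a zettabyte and assumes a world population of about 7 billion, which the slide does not state.

```python
ZB = 10**21  # bytes in a zettabyte (decimal definition)

total_2012 = 2.7 * ZB   # figure quoted on the slide
growth = 0.48           # "up 48% from 2011"

# Implied 2011 volume: 2.7 ZB / 1.48
total_2011 = total_2012 / (1 + growth)

# Tweet comparison. The population figure is an assumption, not from the slide.
population = 7_000_000_000
tweets = population * 30 * 24 * 365 * 1200   # 30 tweets/hour for 1200 years
bytes_per_tweet = total_2012 / tweets

print(f"Implied 2011 volume: {total_2011 / ZB:.2f} ZB")
print(f"Implied size per tweet: {bytes_per_tweet:.0f} bytes")
```

Under these assumptions the implied 2011 volume is roughly 1.8 ZB, and the tweet comparison works out to a little over 1 KB per tweet.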
Isn’t it all relative?
The 7 dimensions of data
• Users: population increase; computing demographic
• Devices: proliferation; portability
• Capacity: miniaturisation; reducing costs
• Media: more choice; temptation to fill
• Advances: file sizes; new formats
• Software: needs more space; more files
• Automation: solution fulfillment; augmentation
Then and now
Dimension: Then / Now
• Users: IT in the workplace / Anywhere
• Devices: 3270 / Green screen / Fixed and mobile
• Capacity: KBs and MBs / PBs, ZBs & YBs
• Media: Expensive floppy disks / Cheap cards and sticks
• Advances: Dedicated / Multi-purpose
• Software: Minimal/business / Complex/everything
• Automation: Business processes / What isn’t?
Big data is not a new problem…
Then Now
Users
Devices
Capacity
Media
Advances
Software
Automation
Data
It’s all about scale…
+ the combination
Back to basics
Still all about good Information and Data Management
Driver = Need to act faster
Challenge = Joining it all up … and that’s getting harder
Objective = Remains the same … Information Exploitation
The three Vs
The fourth V
What is needed? In what quantity? And by when?
What’s the point of Big Data yielding
Little Information?
Understand what it is that you need
Remember “Garbage in…”
Quality is a key factor:
Unstructured – Homeland Security may not care
Structured – poorly calibrated meters = bigger garbage
Faults in the technology and processes produce
exaggerated errors
Bad decisions get made faster
It’s all about scale…
…get the IM basics for ‘little data’ right first
More data isn’t necessarily better
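The point about poorly calibrated meters can be made concrete with a small calculation. The sketch below is illustrative only; the 2% over-read, the reading value and the function names are hypothetical and not from the presentation. It shows that a systematic calibration error does not average out as volume grows: the absolute error in the total scales with the number of readings, while the relative error stays the same.

```python
def measured_total(true_reading, bias, n_readings):
    """Sum of n readings when every reading is inflated by the same relative bias."""
    return true_reading * (1 + bias) * n_readings


def absolute_error(true_reading, bias, n_readings):
    """Difference between the measured total and the true total."""
    return measured_total(true_reading, bias, n_readings) - true_reading * n_readings


# Hypothetical meters that all over-read by 2%
small = absolute_error(100.0, 0.02, 1_000)
large = absolute_error(100.0, 0.02, 1_000_000)

# Relative error is unchanged by volume: more data does not dilute the fault
relative_small = small / (100.0 * 1_000)
relative_large = large / (100.0 * 1_000_000)
```

Collecting a thousand times more readings here makes the absolute error a thousand times larger, which is one way to read "bigger garbage".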
The fundamentals
Data Architecture
Data Governance
Master Data Management
Information Security
Data Quality
Metadata Management
Business Intelligence
Information Management Core Disciplines
Source: DAMA-I
Managing Big Data successfully
Data quality
Sort out your ‘little data’ first
Select the right technology solution(s)
Understand the analytics required:
Near real-time
Mining deeper than before
Design optimal presentation channels
Target the skills you need
Key/value Data Stores, e.g. Cassandra
Columnar/tabular NoSQL Data Stores, e.g.
Hadoop, Hypertable
MPP Appliances, e.g. Greenplum, Netezza
XML Data Stores, e.g. CuDB, MarkLogic
Conclusions
Keep it all in perspective, most of this is not new
True value comes from deep understanding of the three Vs
Remember the fourth V is the bottom line
More data does not necessarily mean better information or
wiser decisions
Apply data management fundamentals before the
technology for Big Data
Questions
My blog: Information Management, Life & Petrol
http://infomanagementlifeandpetrol.blogspot.com
@InfoRacer
Tel: +44 1225 475000
email: Chris.Bradley@ipl.com
Financial Services Opportunities
Creating actionable intelligence – credit history
Customer insight
Fraud detection
Regulatory compliance
Big Data sources
Key/value Data Stores such as Cassandra
Columnar/tabular NoSQL Data Stores such as Hadoop &
Hypertable
Massively Parallel Processing Appliances such as Greenplum
& Netezza
XML Data Stores such as CuDB & MarkLogic
Data Federation/ Data Virtualisation approaches are stepping up to meet this
challenge
Don’t forget Data Quality
Managing the quality of the data is of the utmost
importance
What’s the use of this vast resource if its quality and
trustworthiness are questionable?
Driving your data quality capability up the maturity levels is
key
Data Quality Maturity Assessment
Level 1 - Initial: Limited awareness within the enterprise of the importance of information quality. Very few, if any, processes in place to measure quality of information. Data is often not trusted by business users.
Level 2 - Repeatable: The quality of a few data sources is measured in an ad hoc manner. A number of different tools are used to measure quality. The activity is driven by projects or departments. Limited understanding of good versus bad quality. Identified issues are not consistently managed.
Level 3 - Defined: Quality measures have been defined for some key data sources. Specific tools adopted to measure quality with some standards in place. The processes for measuring quality are applied at consistent intervals. Data issues are addressed where critical.
Level 4 - Managed: Data quality is measured for all key data sources on a regular basis. Quality metrics information is published via dashboards etc. Active management of data issues through the data ownership model ensures issues are often resolved. Quality considerations baked into the SDLC.
Level 5 - Optimised: The measurement of data quality is embedded in many business processes across the enterprise. Data quality issues addressed through the data ownership model. Data quality issues fed back to be fixed at source.
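The maturity levels refer to defined quality measures without naming any. As a minimal sketch of what such a measure might look like, the example below computes two commonly used ones, completeness and validity, over a handful of records. It is not from the presentation: the function names, the sample customer records and the deliberately crude postcode rule are all hypothetical.

```python
def completeness(records, field):
    """Share of records in which `field` is present and non-empty."""
    if not records:
        return 0.0
    filled = sum(1 for r in records if r.get(field) not in (None, ""))
    return filled / len(records)


def validity(records, field, is_valid):
    """Share of populated values of `field` that pass the rule `is_valid`."""
    values = [r[field] for r in records if r.get(field) not in (None, "")]
    if not values:
        return 0.0
    return sum(1 for v in values if is_valid(v)) / len(values)


# Hypothetical sample data
customers = [
    {"id": 1, "postcode": "BA1 1AA"},
    {"id": 2, "postcode": ""},
    {"id": 3, "postcode": "unknown"},
    {"id": 4, "postcode": "BS1 4DJ"},
]

# Crude illustrative rule: a plausible postcode contains at least one digit
has_digit = lambda value: any(ch.isdigit() for ch in value)

postcode_completeness = completeness(customers, "postcode")
postcode_validity = validity(customers, "postcode", has_digit)
```

Scores like these, computed at regular intervals per data source, are the kind of figures that could be published on a dashboard at the higher maturity levels.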

More Related Content

What's hot

Data Governance by stealth v0.0.2
Data Governance by stealth v0.0.2Data Governance by stealth v0.0.2
Data Governance by stealth v0.0.2Christopher Bradley
 
The role of Data Virtualisation in your EIM strategy
The role of Data Virtualisation in your EIM strategyThe role of Data Virtualisation in your EIM strategy
The role of Data Virtualisation in your EIM strategyChristopher Bradley
 
Information Management Training Courses & Certification
Information Management Training Courses & CertificationInformation Management Training Courses & Certification
Information Management Training Courses & CertificationChristopher Bradley
 
Information is at the heart of all architecture disciplines
Information is at the heart of all architecture disciplinesInformation is at the heart of all architecture disciplines
Information is at the heart of all architecture disciplinesChristopher Bradley
 
Fate of the Chief Data Officer
Fate of the Chief Data OfficerFate of the Chief Data Officer
Fate of the Chief Data OfficerTamarah Usher
 
Data-Ed: Emerging Trends in Data Jobs
Data-Ed: Emerging Trends in Data JobsData-Ed: Emerging Trends in Data Jobs
Data-Ed: Emerging Trends in Data JobsData Blueprint
 
Incorporating ERP metadata in your data models
Incorporating ERP metadata in your data modelsIncorporating ERP metadata in your data models
Incorporating ERP metadata in your data modelsChristopher Bradley
 
CDMP preparation workshop EDW2016
CDMP preparation workshop EDW2016CDMP preparation workshop EDW2016
CDMP preparation workshop EDW2016Christopher Bradley
 
Information Management best_practice_guide
Information Management best_practice_guideInformation Management best_practice_guide
Information Management best_practice_guideChristopher Bradley
 
Metadata Strategies
Metadata StrategiesMetadata Strategies
Metadata StrategiesDATAVERSITY
 
Selecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approachSelecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approachChristopher Bradley
 
Incorporating SAP Metadata within your Information Architecture
Incorporating SAP Metadata within your Information ArchitectureIncorporating SAP Metadata within your Information Architecture
Incorporating SAP Metadata within your Information ArchitectureChristopher Bradley
 
Trends in Data Modeling
Trends in Data ModelingTrends in Data Modeling
Trends in Data ModelingDATAVERSITY
 
Data Stewardship and Governance: how to reach global adoption and systematic ...
Data Stewardship and Governance: how to reach global adoption and systematic ...Data Stewardship and Governance: how to reach global adoption and systematic ...
Data Stewardship and Governance: how to reach global adoption and systematic ...Pieter De Leenheer
 
Data Governance for Clinical Information
Data Governance for Clinical InformationData Governance for Clinical Information
Data Governance for Clinical InformationChristopher Bradley
 
Information is at the heart of all architecture disciplines & why Conceptual ...
Information is at the heart of all architecture disciplines & why Conceptual ...Information is at the heart of all architecture disciplines & why Conceptual ...
Information is at the heart of all architecture disciplines & why Conceptual ...Christopher Bradley
 

What's hot (20)

Data Governance by stealth v0.0.2
Data Governance by stealth v0.0.2Data Governance by stealth v0.0.2
Data Governance by stealth v0.0.2
 
The role of Data Virtualisation in your EIM strategy
The role of Data Virtualisation in your EIM strategyThe role of Data Virtualisation in your EIM strategy
The role of Data Virtualisation in your EIM strategy
 
Information Management Training Courses & Certification
Information Management Training Courses & CertificationInformation Management Training Courses & Certification
Information Management Training Courses & Certification
 
Information is at the heart of all architecture disciplines
Information is at the heart of all architecture disciplinesInformation is at the heart of all architecture disciplines
Information is at the heart of all architecture disciplines
 
Fate of the Chief Data Officer
Fate of the Chief Data OfficerFate of the Chief Data Officer
Fate of the Chief Data Officer
 
Data-Ed: Emerging Trends in Data Jobs
Data-Ed: Emerging Trends in Data JobsData-Ed: Emerging Trends in Data Jobs
Data-Ed: Emerging Trends in Data Jobs
 
Incorporating ERP metadata in your data models
Incorporating ERP metadata in your data modelsIncorporating ERP metadata in your data models
Incorporating ERP metadata in your data models
 
CDMP preparation workshop EDW2016
CDMP preparation workshop EDW2016CDMP preparation workshop EDW2016
CDMP preparation workshop EDW2016
 
Information Management best_practice_guide
Information Management best_practice_guideInformation Management best_practice_guide
Information Management best_practice_guide
 
Metadata Strategies
Metadata StrategiesMetadata Strategies
Metadata Strategies
 
Selecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approachSelecting Data Management Tools - A practical approach
Selecting Data Management Tools - A practical approach
 
Data modeling for the business
Data modeling for the businessData modeling for the business
Data modeling for the business
 
Incorporating SAP Metadata within your Information Architecture
Incorporating SAP Metadata within your Information ArchitectureIncorporating SAP Metadata within your Information Architecture
Incorporating SAP Metadata within your Information Architecture
 
Data modelling 101
Data modelling 101Data modelling 101
Data modelling 101
 
Trends in Data Modeling
Trends in Data ModelingTrends in Data Modeling
Trends in Data Modeling
 
Data Stewardship and Governance: how to reach global adoption and systematic ...
Data Stewardship and Governance: how to reach global adoption and systematic ...Data Stewardship and Governance: how to reach global adoption and systematic ...
Data Stewardship and Governance: how to reach global adoption and systematic ...
 
Data Governance for Clinical Information
Data Governance for Clinical InformationData Governance for Clinical Information
Data Governance for Clinical Information
 
DAMA CDMP exam cram
DAMA CDMP exam cramDAMA CDMP exam cram
DAMA CDMP exam cram
 
Big data Readiness white paper
Big data  Readiness white paperBig data  Readiness white paper
Big data Readiness white paper
 
Information is at the heart of all architecture disciplines & why Conceptual ...
Information is at the heart of all architecture disciplines & why Conceptual ...Information is at the heart of all architecture disciplines & why Conceptual ...
Information is at the heart of all architecture disciplines & why Conceptual ...
 

Similar to BDA 2012 Big data why the big fuss?

The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallTrillium Software
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedcedrinemadera
 
Understanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceUnderstanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceIBM Software India
 
Big data and the data quality imperative
Big data and the data quality imperativeBig data and the data quality imperative
Big data and the data quality imperativeTrillium Software
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentationPriyesh Patel
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AIGary Allemann
 
INTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPINTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPDr Geetha Mohan
 
Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Dell World
 
Oceans of big data: Take the plunge or wade in slowly?
Oceans of big data: Take the plunge or wade in slowly?Oceans of big data: Take the plunge or wade in slowly?
Oceans of big data: Take the plunge or wade in slowly?Deloitte Canada
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxPrabhaJoshi4
 
From Near to Maturity - Presentation to European Data Forum
From Near to Maturity - Presentation to European Data ForumFrom Near to Maturity - Presentation to European Data Forum
From Near to Maturity - Presentation to European Data ForumCastlebridge Associates
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBigDataExpo
 
Artificial Intelligence Expert Session Webinar
Artificial Intelligence Expert Session Webinar Artificial Intelligence Expert Session Webinar
Artificial Intelligence Expert Session Webinar ibi
 
What Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceAnnie Flippo
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalHarvinder Atwal
 
Big data
Big dataBig data
Big dataRiya
 

Similar to BDA 2012 Big data why the big fuss? (20)

The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They Fall
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Why data governance is the new buzz?
Why data governance is the new buzz?Why data governance is the new buzz?
Why data governance is the new buzz?
 
Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-shared
 
Understanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceUnderstanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidence
 
Big data and the data quality imperative
Big data and the data quality imperativeBig data and the data quality imperative
Big data and the data quality imperative
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentation
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AI
 
INTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPINTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOP
 
Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?Are You Prepared For The Future Of Data Technologies?
Are You Prepared For The Future Of Data Technologies?
 
Oceans of big data: Take the plunge or wade in slowly?
Oceans of big data: Take the plunge or wade in slowly?Oceans of big data: Take the plunge or wade in slowly?
Oceans of big data: Take the plunge or wade in slowly?
 
Big Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptxBig Data Analytics_Unit1.pptx
Big Data Analytics_Unit1.pptx
 
From Near to Maturity - Presentation to European Data Forum
From Near to Maturity - Presentation to European Data ForumFrom Near to Maturity - Presentation to European Data Forum
From Near to Maturity - Presentation to European Data Forum
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
 
Artificial Intelligence Expert Session Webinar
Artificial Intelligence Expert Session Webinar Artificial Intelligence Expert Session Webinar
Artificial Intelligence Expert Session Webinar
 
What Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data Science
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
 
Big data
Big dataBig data
Big data
 

More from Christopher Bradley

Data is NOT the new oil - the Data Asset IS different
Data is NOT the new oil - the Data Asset IS differentData is NOT the new oil - the Data Asset IS different
Data is NOT the new oil - the Data Asset IS differentChristopher Bradley
 
Information Management Capabilities, Competencies & Staff Maturity Assessment
Information Management Capabilities, Competencies & Staff Maturity AssessmentInformation Management Capabilities, Competencies & Staff Maturity Assessment
Information Management Capabilities, Competencies & Staff Maturity AssessmentChristopher Bradley
 
Information Management Training & Certification
Information Management Training & CertificationInformation Management Training & Certification
Information Management Training & CertificationChristopher Bradley
 
Is the Data asset really different?
Is the Data asset really different?Is the Data asset really different?
Is the Data asset really different?Christopher Bradley
 
How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...Christopher Bradley
 
BP Data Modelling as a Service (DMaaS)
BP Data Modelling as a Service (DMaaS)BP Data Modelling as a Service (DMaaS)
BP Data Modelling as a Service (DMaaS)Christopher Bradley
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...Christopher Bradley
 
Data Modelling is NOT just for RDBMS's
Data Modelling is NOT just for RDBMS'sData Modelling is NOT just for RDBMS's
Data Modelling is NOT just for RDBMS'sChristopher Bradley
 
Implementing Effective Data Governance
Implementing Effective Data GovernanceImplementing Effective Data Governance
Implementing Effective Data GovernanceChristopher Bradley
 

More from Christopher Bradley (11)

Data is NOT the new oil - the Data Asset IS different
Data is NOT the new oil - the Data Asset IS differentData is NOT the new oil - the Data Asset IS different
Data is NOT the new oil - the Data Asset IS different
 
Big Data Readiness Assessment
Big Data Readiness AssessmentBig Data Readiness Assessment
Big Data Readiness Assessment
 
Information Management Capabilities, Competencies & Staff Maturity Assessment
Information Management Capabilities, Competencies & Staff Maturity AssessmentInformation Management Capabilities, Competencies & Staff Maturity Assessment
Information Management Capabilities, Competencies & Staff Maturity Assessment
 
Information Management Training & Certification
Information Management Training & CertificationInformation Management Training & Certification
Information Management Training & Certification
 
Is the Data asset really different?
Is the Data asset really different?Is the Data asset really different?
Is the Data asset really different?
 
How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...How to identify the correct Master Data subject areas & tooling for your MDM...
How to identify the correct Master Data subject areas & tooling for your MDM...
 
BP Data Modelling as a Service (DMaaS)
BP Data Modelling as a Service (DMaaS)BP Data Modelling as a Service (DMaaS)
BP Data Modelling as a Service (DMaaS)
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
 
Data Modelling and WITSML
Data Modelling and WITSMLData Modelling and WITSML
Data Modelling and WITSML
 
Data Modelling is NOT just for RDBMS's
Data Modelling is NOT just for RDBMS'sData Modelling is NOT just for RDBMS's
Data Modelling is NOT just for RDBMS's
 
Implementing Effective Data Governance
Implementing Effective Data GovernanceImplementing Effective Data Governance
Implementing Effective Data Governance
 

Recently uploaded

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 

BDA 2012 Big data why the big fuss?

  • 1. Big Data: Why the big fuss?
  • 2. Presenter My blog: Information Management, Life & Petrol http://infomanagementlifeandpetrol.blogspot.com @InfoRacer Chris Bradley Chief Development Officer chris.bradley@ipl.com +44 1225 475000
  • 3. Introductions Chris has spent 32 years in the Information Management field, working for leading organisations in Data Management Strategy, Master Data Management, Metadata Management, Data Warehouse and Business Intelligence. Graduating in 1979, Chris worked for the MoD (Navy), Volvo, Thorn EMI (as Head of Information Management), Reader’s Digest Inc (as European CIO), and Coopers and Lybrand Management Consultancy, where he established and ran the International Data Management practice. Chris heads IPL’s Business Consultancy practice and is advising several Energy, Pharmaceutical, Finance and Government clients on Business Process and Information Asset Management. Chris is a member of the MPO, Director of DAMA UK and holds the CDMP Master certification. He co-authored “Data Modelling For The Business – A Handbook for aligning the business with IT using high-level data models”. Chris is a columnist and frequent contributor to industry publications. He authors an experts channel on the influential BeyeNETWORK, is a recognised thought-leader in Information Management and a regular key speaker at major International Information Management conferences. chris.bradley@ipl.com +44 1225 475000 Blog: Information Management, Life & Petrol http://infomanagementlifeandpetrol.blogspot.com @InfoRacer Christopher Bradley Chief Development Officer
  • 4. Who is IPL? Trusted, independent consulting & solutions co 30 year track record 300 staff, £28m+ turnover High-stakes, business & mission critical contexts Consistently exceed expectations Business Consulting Division Information Management - IM Strategy - Information Security & Assurance - Data Governance - Information Exploitation - Master Data Management - Information Architecture - Business Intelligence .......turning Information into a strategic asset Enterprise Architecture Business Process Management Programme Management IPL Consulting Clients
  • 8. Volume • Big data comes in one size: large. All enterprises are awash with data, and can easily amass terabytes and petabytes of information. • Can systems scale up without degrading performance intolerably? Velocity • Frequently time-sensitive, big data should be used as it streams into the enterprise in order to maximise its value to the business. • How can you calculate mean values across a constantly changing landscape? Variety • Big data extends beyond structured data to include unstructured data of all varieties: text, audio, video, click streams, log files and more. • How do you apply the normal methods of analytics and reporting with unknown structures?
  • 9. Data volume keeps growing The total amount of global data is expected to grow to 2.7 zettabytes during 2012 (up 48% from 2011)* Equivalent of every person sending 30 tweets/hour for the next 1200 years! Enterprises will manage 50 times more data and files will grow 75 times in the next decade 80% of the world’s data is unstructured * IDC Digital Universe Study 2011
  • 10. Isn’t it all relative?
  • 11. The 7 dimensions of data Users Devices Capacity Media Advances Software Automation
  • 12. • Population increase • Computing demographic • Proliferation • Portability • Miniaturisation • Reducing costs • More choice • Temptation to fill • File sizes • New formats • Needs more space • More files • Solution fulfillment • Augmentation
  • 13. Then and now (Dimension: Then → Now) • Users: IT in the workplace → Anywhere • Devices: 3270 / Green screen → Fixed and mobile • Capacity: KBs and MBs → PBs, ZBs & YBs • Media: Expensive floppy disks → Cheap cards and sticks • Advances: Dedicated → Multi-purpose • Software: Minimal/business → Complex/everything • Automation: Business processes → What isn’t?
  • 14. Big data is not a new problem…
  • 17. It’s all about scale …… + the combination
  • 18. Back to basics Still all about good Information and Data Management Driver = Need to act faster Challenge = Joining it all up … and that’s getting harder Objective = Remains the same … Information Exploitation
  • 20. The fourth V What is needed? In what quantity? And by when?
  • 21. What’s the point of Big Data yielding Little Information?
  • 22. Understand what it is that you need
  • 23. Remember “Garbage in…” Quality is a key factor: Unstructured – Homeland Security may not care Structured – poorly calibrated meters = bigger garbage Faults in the technology and processes produce exaggerated errors Bad decisions get made faster It’s all about scale… …get the IM basics for ‘little data’ right first
  • 24. More data isn’t necessarily better
  • 25. The fundamentals Data Architecture Data Governance Master Data Management Information Security Data Quality Metadata Management Business Intelligence Information Management Core Disciplines Source: DAMA-I
  • 26. Managing Big Data successfully Data quality Sort out your ‘little data’ first
  • 29. Managing Big Data successfully • Data quality: sort out your ‘little data’ first • Select the right technology solution(s) • Understand the analytics required: near real-time; mining deeper than before • Design optimal presentation channels • Target the skills you need. Technology options: Key/value Data Stores, e.g. Cassandra • Columnar/tabular NoSQL Data Stores, e.g. Hadoop, Hypertable • MPP Appliances, e.g. Greenplum, Netezza • XML Data Stores, e.g. CuDB, Marklogic
  • 30. Conclusions Keep it all in perspective, most of this is not new True value comes from deep understanding of the three Vs Remember the fourth V is the bottom line More data does not necessarily mean better information or wiser decisions Apply data management fundamentals before the technology for Big Data
  • 31. Questions My blog: Information Management, Life & Petrol http://infomanagementlifeandpetrol.blogspot.com @InfoRacer Tel: +44 1225 475000 email: Chris.Bradley@ipl.com
  • 33. Financial Services Opportunities Creating actionable intelligence – credit history Customer insight Fraud detection Regulatory compliance
  • 34. Big Data sources Key/value Data Stores such as Cassandra Columnar/tabular NoSQL Data Stores such as Hadoop & Hypertable Massively Parallel Processing Appliances such as Greenplum & Netezza XML Data Stores such as CuDB & Marklogic Data Federation/ Data Virtualisation approaches are stepping up to meet this challenge
  • 35. Don’t forget Data Quality Managing the quality of the data is of the utmost importance. What’s the use of this vast resource if its quality and trustworthiness are questionable? Driving your data quality capability up the maturity levels is key
  • 36. Data Quality Maturity Assessment • Level 1 - Initial: Limited awareness within the enterprise of the importance of information quality. Very few, if any, processes in place to measure quality of information. Data is often not trusted by business users. • Level 2 - Repeatable: The quality of a few data sources is measured in an ad hoc manner. A number of different tools used to measure quality. The activity is driven by projects or departments. Limited understanding of good versus bad quality. Identified issues are not consistently managed. • Level 3 - Defined: Quality measures have been defined for some key data sources. Specific tools adopted to measure quality with some standards in place. The processes for measuring quality are applied at consistent intervals. Data issues are addressed where critical. • Level 4 - Managed: Data quality is measured for all key data sources on a regular basis. Quality metrics information is published via dashboards etc. Active management of data issues through the data ownership model ensures issues are often resolved. Quality considerations baked into the SDLC. • Level 5 - Optimised: The measurement of data quality is embedded in many business processes across the enterprise. Data quality issues addressed through the data ownership model. Data quality issues fed back to be fixed at source.
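
Slide 8 asks how mean values can be calculated across a constantly changing landscape. One common answer, sketched here in Python as an illustration and not as anything taken from the deck, is to update the mean incrementally as each value arrives instead of re-reading the whole data set. The function names `running_mean` and `windowed_mean` and the sample readings are hypothetical.

```python
from collections import deque
from typing import Iterable, Iterator


def running_mean(values: Iterable[float]) -> Iterator[float]:
    """Yield the mean of everything seen so far, in O(1) work per value."""
    count = 0
    mean = 0.0
    for x in values:
        count += 1
        # Incremental update: nudge the old mean towards the new value.
        mean += (x - mean) / count
        yield mean


def windowed_mean(values: Iterable[float], window: int) -> Iterator[float]:
    """Yield the mean of the most recent `window` values only."""
    if window <= 0:
        raise ValueError("window must be a positive integer")
    recent = deque()
    total = 0.0
    for x in values:
        recent.append(x)
        total += x
        if len(recent) > window:
            # Forget the oldest value once the window is full.
            total -= recent.popleft()
        yield total / len(recent)


if __name__ == "__main__":
    readings = [2, 4, 6]  # hypothetical meter readings arriving one by one
    print(list(running_mean(readings)))
    print(list(windowed_mean([1, 2, 3, 4], window=2)))
```

The running version treats all history equally, while the windowed version tracks only recent behaviour, which may suit time-sensitive streams better. Neither copes with bad input on its own, so the deck's "garbage in" warning still applies.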

Editor's Notes

  1. Chris
  2. Chris
  3. Chris
  4. In 1859, Thomas Austin brought out 24 rabbits, 5 hares and 72 partridges and released them on Christmas Day on his property, ‘Barwon Park’, just outside Geelong in Victoria. Within 15 years, over 2 million per year were being shot or trapped without denting the population. Biological controls in the second half of the 20th century reduced the population to approx. 300M. By 1991 it was estimated at 600M, as resistance to the specific controls had built up.
  5. Churchill: V for Victory. V: “Visitors”, 1983 TV miniseries. V for Vendetta: originally a 1980s comic book, 2005 film; against a dystopian backdrop, V seeks to destroy a totalitarian government. Gibson Flying V guitar: first released 1958.