The document discusses the role of CIOs and big data collaboration. It notes that big data is growing exponentially, with 2.5 quintillion bytes of data created every day from a variety of sources. Big data offers significant value if organizations can analyze it, with potential savings in healthcare, retail, and other sectors. However, big data requires collaboration both internally within organizations and externally with partners. The document provides examples of successful big data collaborations and argues that CIOs will continue playing an important role in facilitating information management and digital transformation through big data initiatives.
EU Data Market study. Presentation at NESSI Summit 2014 IDC & Open EvidenceKasia Szkuta
The study aims to define, assess and measure the European data economy as well as build a genuine stakeholders’ ecosystem. Find us on http://datalandscape.eu and @eudatalandscape
People are sometimes intimidated by big data because it seems overwhelming and they’re much more familiar with using statistics on survey data or analyzing opinions from focus group data. But here are nine examples from companies like Netflix, Ceasars Entertainment, Walmart, eBay, and UPS, that could have conducted survey or focus group research have instead used big data to accomplish big things.
Our guest speaker, Cavan Capps, who is Big Data Lead services presented this talk as part of the Program on Information Science Brown Bag Series.
[slideshare id]
Big Data provides both challenges and opportunities for the official statistical community. The difficult issues of privacy, statistical reliability, and methodological transparency will need to be addressed in order to make full use of Big Data in the official statistical community. Improvements in statistical coverage at small geographies, new statistical measures, more timely data at perhaps lower costs are the potential opportunities. This talk will provides an overview of some of the research being done by the Census Bureau as it explores the use of “Big Data” for statistical agency purposes.
Speaker Bio: Cavan Capps is the U.S. Census Bureau’s Lead on Big Data processing. In that role he is focusing on new Big Data sources for use in official statistics, best practice private sector processing techniques and software/hardware configurations that may be used to improve statistical processes and products. Previously, Mr. Capps initiated, designed and managed a multi-enterprise, fully distributed, statistical network called the DataWeb. The 'DataWeb' is a data library of networked statistical databases from all federal statistical data domains, with sophisticated visualization, descriptive analytics, data integration and dashboard construction tools. The DataWeb is the source of official API to Census data products.
A top-down look at current industry and technology trends for Big Data, Data Analytics and Machine Learning (cognitive technologies, AI etc.). New slides added for Ark Group presentation on 1st December 2016.
EU Data Market study. Presentation at NESSI Summit 2014 IDC & Open EvidenceKasia Szkuta
The study aims to define, assess and measure the European data economy as well as build a genuine stakeholders’ ecosystem. Find us on http://datalandscape.eu and @eudatalandscape
People are sometimes intimidated by big data because it seems overwhelming and they’re much more familiar with using statistics on survey data or analyzing opinions from focus group data. But here are nine examples from companies like Netflix, Ceasars Entertainment, Walmart, eBay, and UPS, that could have conducted survey or focus group research have instead used big data to accomplish big things.
Our guest speaker, Cavan Capps, who is Big Data Lead services presented this talk as part of the Program on Information Science Brown Bag Series.
[slideshare id]
Big Data provides both challenges and opportunities for the official statistical community. The difficult issues of privacy, statistical reliability, and methodological transparency will need to be addressed in order to make full use of Big Data in the official statistical community. Improvements in statistical coverage at small geographies, new statistical measures, more timely data at perhaps lower costs are the potential opportunities. This talk will provides an overview of some of the research being done by the Census Bureau as it explores the use of “Big Data” for statistical agency purposes.
Speaker Bio: Cavan Capps is the U.S. Census Bureau’s Lead on Big Data processing. In that role he is focusing on new Big Data sources for use in official statistics, best practice private sector processing techniques and software/hardware configurations that may be used to improve statistical processes and products. Previously, Mr. Capps initiated, designed and managed a multi-enterprise, fully distributed, statistical network called the DataWeb. The 'DataWeb' is a data library of networked statistical databases from all federal statistical data domains, with sophisticated visualization, descriptive analytics, data integration and dashboard construction tools. The DataWeb is the source of official API to Census data products.
A top-down look at current industry and technology trends for Big Data, Data Analytics and Machine Learning (cognitive technologies, AI etc.). New slides added for Ark Group presentation on 1st December 2016.
Analysis on big data concepts and applicationsIJARIIT
The term, Big Data ‘ h a s been referred as a large amount of data that cannot be handled by traditional database
systems. It consists of large volumes of data which is been generated at a very fast rate, these cannot be handled and processed by
traditional data management tools, so it requires a new set of tools or frameworks to handle these types of data. Big data
works under V’s namely Volume, Velocity, and Variety. Volume refers to the size of the data whereas Velocity refers to the
speed that the data is being generated. Variety refers to different formats of data that is generated. Mostly in today’s world
thee average volumes of unstructured data like audio, video, image, sensor data etc. One can get these types of data through
social media, enterprise data, and Transactional data. Through Big data analytics, one can able to examine large data sets
containing a variety of data types. Primary goals of big data analytics are to help the organizations to take important decisions
by appointing data scientists and other analytics professionals to analyses large volumes of data. Challenges one can face
during large volume of data, especially machine-generated data, is exploding, how fast that data is growing every year, with
new sources of data that are emerging. Through the article, the authors intend to decipher the notions in an intelligible
manner embodying in text several use-cases and illustrations
Big Data is a term to describe technologies which help to collect and analyze big data amounts that cannot be easily processed. Focus of Big Data is on real-time analyses and the recognition of unknown correlations within acquired information.
By using Big Data tools, you will gather competitive advantages through, for instance, customer analyses and simulations of business scenarios, encourage innovations and grow added values of your company. With the help of Big Data, loose data collections will be restructured and supportive tools for decision-making management processes are provided.
The Pros and Cons of Big Data in an ePatient WorldPYA, P.C.
PYA Principal Dr. Kent Bottles, who is also PYA Analytics’ Chief Medical Officer, presented “The Pros and Cons of Big Data in an ePatient World” at the ePatient Connections 2013 conference.
Analysis on big data concepts and applicationsIJARIIT
The term, Big Data ‘ h a s been referred as a large amount of data that cannot be handled by traditional database
systems. It consists of large volumes of data which is been generated at a very fast rate, these cannot be handled and processed by
traditional data management tools, so it requires a new set of tools or frameworks to handle these types of data. Big data
works under V’s namely Volume, Velocity, and Variety. Volume refers to the size of the data whereas Velocity refers to the
speed that the data is being generated. Variety refers to different formats of data that is generated. Mostly in today’s world
thee average volumes of unstructured data like audio, video, image, sensor data etc. One can get these types of data through
social media, enterprise data, and Transactional data. Through Big data analytics, one can able to examine large data sets
containing a variety of data types. Primary goals of big data analytics are to help the organizations to take important decisions
by appointing data scientists and other analytics professionals to analyses large volumes of data. Challenges one can face
during large volume of data, especially machine-generated data, is exploding, how fast that data is growing every year, with
new sources of data that are emerging. Through the article, the authors intend to decipher the notions in an intelligible
manner embodying in text several use-cases and illustrations
Big Data is a term to describe technologies which help to collect and analyze big data amounts that cannot be easily processed. Focus of Big Data is on real-time analyses and the recognition of unknown correlations within acquired information.
By using Big Data tools, you will gather competitive advantages through, for instance, customer analyses and simulations of business scenarios, encourage innovations and grow added values of your company. With the help of Big Data, loose data collections will be restructured and supportive tools for decision-making management processes are provided.
The Pros and Cons of Big Data in an ePatient WorldPYA, P.C.
PYA Principal Dr. Kent Bottles, who is also PYA Analytics’ Chief Medical Officer, presented “The Pros and Cons of Big Data in an ePatient World” at the ePatient Connections 2013 conference.
Introduction to digital image processing, image processing, digital image, analog image, formation of digital image, level of digital image processing, components of a digital image processing system, advantages of digital image processing, limitations of digital image processing, fields of digital image processing, ultrasound imaging, x-ray imaging, SEM, PET, TEM
Forecast to contribute £216 billion to the UK economy via business creation, efficiency and innovation, and generate 360,000 new jobs by 2020, big data is a key area for recruiters.
In this QuickView:
- Big data in numbers
- Top 10 industries hiring big data professionals
- Top 10 qualifications sought by hirers
- Top 10 database and BI skills sought by hirers
- Getting started in big data: popular big data techniques and vendors
Streaming and Visual Data Discovery for the Internet of ThingsDatawatchCorporation
Sensor devices and their associated data streams are rapidly becoming a big source of differentiation for organizations that can effectively harness this information to drive new insights and take action. The breakthrough is enabled by new solutions for applying visual data discovery to streaming data in motion. This session will focus on industrial analytics and how best to apply new technologies that drive synergies between IT and OT.
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air FranceJedha Bootcamp
Depuis les 5 dernières années, nous avons créé plus de données que depuis les débuts de l'humanité. Nous produisons aujourd'hui tellement de données qu'il devient difficile de les gérer. C'est ce qu'on appelle le Big Data. Durant ce workshop nous parlerons des enjeux du Big Data et de ses applications concrètes dans notre société.
Lecture given at the University of Catania on December 2nd, 2014.
Start from Big Data definitions, continue with real life examples of successful Big Data Projects, go a little bit deeper with Sentiment Analysis, and conclude with a brief overview of Big Data tools and Big Data with Microsoft.
Summary:
1. What is Big Data? (includes the 5Vs of Big Data)
2. Big Data Examples (includes 6 Real Life Examples and comments on Privacy concerns)
3. How to Tackle a Big Data Problem (my 4 Universal Steps to follow)
4. Sentiment Analysis (what is sentiment analysis? Why do we care? A Technique and a plan)
5. Big Data tools (Hadoop, Hadoop Ecosystem, Hive, Pig, Sqoop, Oozie; Azure HDInsight, Excel Power Query, Power Pivot, Power View, Power Map)
Fundamentals of Big Data in 2 minutes!!Simplify360
In today’s world where information is increasing every second, BIG DATA takes up a major role in transforming any business.
Learn the fundamentals of big data in just 2 minutes!
In this issue of TOP TEN we provide the reader with a wealth of information related to current and future usages of BIG DATA. The reader will get an insight into usages in the realm of education, health, construction, management as well as marketing.
Similar to 151111 BASE ELN 151112 CIO Big Data Collaboration (20)
6. CIO Today
The ‘Perfect Storm’
• Mobile
• Broad-band
• ‘Consumerisation’
• BYOD
• Social Media
• Cloud
• IoT
• Digital Disruption/Revolution
• Big Data
8. Big Data?
• Every day create 2.5 quintillion bytes*
• 90% data today created last 2 years*
• Data from everywhere : -
• sensors gathering climate information,
• social media sites,
• digital pictures/videos,
• purchase transaction records, and
• cell phone GPS signals to name a few.
• *IBM This is Big Data.
9. Big Data— a growing torrent
$600 - disk drive store all world’s music
5 billion - mobile phones
30 billion - pieces shared on Facebook every month
40% - global data growth pa
5% - global IT spend growth pa
235 - terabytes - US Library of Congress
15 of 17 - sectors more data stored
per company than Library of Congress
10. V Big Data
Big Data =
vast data
too large and complex for
conventional processing?
11. V Big Data
6 Vs:
• Volume - Terabytes to Petabytes, Exabytes & Zettabytes
(petabyte = 500 bn pages printed text)
• Velocity - Near real time sub second delivery
• Variety - Both structured & unstructured data
• Volatility - 100s new online data sources –
new apps, web services & social networks
• Veracity - 1 in 3 business leaders don’t trust the information.
• Value - Opportunity – new competitive insights &
operating models & products based on customer insight & intelligence.
Big data: The next frontier for innovation, competition, and productivity McKinsey Global Institute
12. Data
capturing Potential Value
$300 billion pa - US health care— > 2x Spain’s health
care spend pa
€250 billion pa- Euro public sector admin > Greece GDP
$600 billion - personal location
data
+60% - retailer operating margins
+190,000 - Analysts
+1.5 million - data-savvy
managers (US)
Big data: The next frontier for innovation, competition, and productivity McKinsey Global Institute
13. • €250 billion value per year
• 0.5 percent annual productivity growth
McKinsey Global Institute analysis
Big data can generate big value …
Europe public sector administration
14. Big data can generate big value…
US Health Care
• $300 billion value per year
• 0.7 percent annual productivity growth
McKinsey Global Institute analysis
15. Big data can generate big value…
Manufacturing
• Up to 50 percent decrease in product development,
assembly costs
• Up to 7 percent reduction in working capital
McKinsey Global Institute analysis
16. Big data can generate big value …
US Retail
• 60+% increase in net margin possible
• 0.5–1.0 percent annual productivity growth
McKinsey Global Institute analysis
17. • $100 billion+ revenue for service providers
• Up to $700 billion value to end users
McKinsey Global Institute analysis
Big data can generate big value …
Global personal location data
18. How to Benefit from Big Data
Multiple Data Sources - Collaborate
Creatively source Internal and External Data
Upgrade IT for easy Data Merging
Prediction and Optimisation Models
Focus on biggest Performance Drivers
Balance Complexity and Ease of Use
Transform Company Capabilities
Develop Useable Business-relevant Analytics
Embed Analytics into Simple Tools for Front Lines
Update Processes and Develop Capabilities to Exploit
Big Data
20. Big Data Collaboration
Open Data
External
Internal
“With Big Data the sum is more valuable than its parts,
and when we recombine the sums of multiple datasets
together, that sum too is worth more than its individual
ingredients” (Mayer-Schonberger & Cukier – Big Data).
Countless tools…but technology is only part of it- it must
be embedded in organisational culture.
Photo: Andy
21. Collaboration – Open Data
TfL ‘Countdown’
Photo: Clive A Brown
Photo: Nico Hogg
Photo:James Darling
23. Matt Hancock - Minister Cabinet Office
data.gov.uk - Open to Usable
Published > 20k datasets = £200Bn Public
Spend
Performance Metrics for All 800
Citizen/Government Transactions
UK - 1st Global Open Data Index & top
WWW Foundation Barometer
Defra published LiDAR elevation data:-
Apps - e.g. flood defences, precision farming,
archaeology, urban planning, Minecraft
Land Registry released Price Paid Dataset:-
Apps – RightMove, Zoopla, Develop Valuation
SW, Improve Planning Policy, Market Trends,
Academic Research.
Photo: Derfelphotogen
24. Matt Hancock - Minister Cabinet Office
Next Steps
Modernise Data Infrastructure: –
Data Standards & Maintenance
‘Dog-fooding’ – eat your own.
Build Capability across Civil Service
Put Trust at Heart of Process - CIA
Collaborative approach to Data Policy & Governance:-
Data at Heart of Open Government Partnership Action Plan
UK Lead Steward for International Open Data Charter
Whitehall Data Leaders Network
Data Driven Government wants to engage UK’s Data Economy
Connect with Businesses, Start-ups & Innovators leading across
Data Spectrum
Network & Interlink currently dispersed and decentralised
Knowledge & Expertise.
Photo: Wesley & Dannells
25. Kaggle
Kaggle data scientists helped automatically
diagnose schizophrenia subjects from MRI
scans in MLSP's research competition ›
GE utilized multiple sources of flight and
weather data to improve runway arrival
time prediction, saving airlines millions of
dollars.
platform for predictive modelling
and analytics competitions on
which companies and researchers
post their data and statisticians
and data miners from all over the
world compete to produce the
best models.
26. Kaggle
Some Customers
Heritage Provider NetworkImproved revenue for hospital network through patient admissions.
Broader recognition of gestures for Xbox Kinect
Earlier detection of driver drowsiness
Better predictability for drug targets.
Better and broader identification of talent
Increase effectiveness of click through rates
Retailer
($10B+ revenue)
Optimized decision for store location
Better and broader identification of talent
Earlier and more accurate detection of seizures in patients with epilepsy.
More accurate airline departure and arrival times; improved hospital
operations.
Improved estimate of customer claims costs; Reduced customer churn.
More accurate imaging of dark matter.
Improved lifetime customer value.
Better predictability for drug targets.
Beverage Co.
($10B+ revenue)
Improved sales and demand forecasting
Oil & Gas Co.
($100B+ revenue)
Improved prediction of oil reserves
Regional Bank in
Northeast
Improved risk profile by identifying financial distress
27. External Big Data Collaboration
‘Big Pharma’ - 3 Different Models
• Project Data Sphere (PDS)
Objective – accelerate drug discovery & development
Companies share clinical trial data for Cancer Research:-
o Astra Zeneca, Bayer, Celgene, Janssen (J&J),Pfizer, Sloan
Kettering, Sanofi; National Cancer Inst., Amgen, Quintiles
EU Legislative Model
EMA establish publicly accessible clinical trials register -
Targets clinical trial data transparency
Strict conditions & penalties
Genentech & PatientsLikeMe - 5 Year Commercial Agreement
Patients provide their Data during current treatment
Wide range of conditions, but Cancer lead
Photo: Ynse
Photo: Go2net Vaughn
28. ‘Big Pharma’ 2015 Bio-IT World Conference
3 Big Data Collaboration Tips
Ruthlessly Select ALL Collaboration Partners
Polygamous
External and Business Partners & Suppliers
Multi-cloud solutions becoming popular
Take Coding out of Collaboration
Flexibility v Standardisation
Standardised, cloud-based GUI platform
BUT Get Own ‘Big Data’ House in Order
Internal & External
Especially Information Security, IAM
Photo: Adam
Photo: Ray Morris
29. Internal Big Data Collaboration
UK Government
Potential Rewards
Policy makers – think differently & enhance service provision
Public access ever-increasing, more easily ‘digestible information
aids informed decision making
BIS – Business Innovation & Skills
Building Data Science capability – interdisciplinary collaboration
Innovative interactive visualisation UN trade data for Global audience
DWP – Dept. Work & Pensions – Universal Credit Data
Increase effectiveness of departmental operations
Jobcentre Plus & other organisations e.g. Housing Associations
understand claimant caseload by location
Public has UC roll-out timetable by locality
Data Science Competition
Supported by Government Data Science Partnership (Cabinet Office,
GDS, Go-Science & ONS)
Using Big Data techniques & SW to answer business questions,
visualise information across wide range of depts.
30. Internal Collaboration
Data driven companies
(McAfee & Brynjolfsson – HBR)
McKinsey/MIT HBR study 330 N American Companies
Data Driven Companies better performers than Competitors:
5% more productive
6% more profitable
Reflected in Stock Market Valuations
Better predictions, surprising sources:-
Wharton Big Data House Market Prices better than National Association of Realtors
John Hopkins used Google Flu Trends to predict surges week before Centre for
Disease Control
Twitter updated & tracked Haiti Cholera spread 2 weeks ahead of Official Reports
Passur Aerospace – RightETA virtually eliminated gaps between Airline Estimated and
Actual Arrival Times. “Worth $ Millions”
Sears – Big Data Technologies & Practices (Hadoop clusters) – directly analysed cluster
data. Promotion lead times dropped from 8 weeks to 1 week & still dropping.
31. Internal
Collaboration
Companies have applied Big Data and analytics capabilities to
sales and marketing data to better understand and engage their
customers, but scant few have directed these types of
technologies ‘inward’ to improve their own organizations.
Collaboration data can reveal how an organization’s biggest
assets – its people – are working together and what the
organization is really focused on.
In a Collaboration centric culture, data sharing helps build bridges
and collapse silos to develop competitive advantage
Today collaboration data is likely the most valuable untapped
data source in the enterprise
http://www.digitalistmag.com/technologies/big-data/leveraging-big-data-
redefine-collaboration-enterprise-01243242
Photo: JP Goguen
32. Collaboration & Security
• Collaboration & Sharing may pose new risks
• Opportunity to advocate & enable secure collaboration in high-risk
environments
• Take both data- & identity-centric approach to controlling information
in collaboration environments
• 4 main steps:-
1. ‘Turn Big into Small Data’ means System keeps up to enable faster, more
accurate & business-relevant decisions
2. Determine Information context; Who communicating How. Leads to better, fine-
tuned decisions.
3. Deploy controls to securely enable on-premise email & collaboration. Then
determine how & where to deploy data controls
4. Deploy controls to securely enable cloud and mobile collaboration. Manage
areas of High Risk
• Instead of security of “No”. Empower security of “Know”
(Tyson Whitten – CA)
Photo: Paulo Valdivieso
33. CIO Role
• Lead
• Business Solutions
• Business Knowledge
• Information
• Governance
• Strategy
• Deliver – Results
• Value for Money
• Informed Purchaser
34. CIO Mission
CIO is Custodian of KEY INFORMATION & DATA ASSETS of
organisation, responsible for Governance, Management &
Security; just as CFO is responsible for financial assets.
CIO mission is to facilitate & improve, cost effectively:-
o ‘Digital Engagement’ with Customers, Clients, Citizens &
Stakeholders
o Secure management & communication of Information &
Data
o Streamlining of business processes
o Sharing of Knowledge, across the organization.
As Peter Drucker said: as Knowledge Workers, All we have to
manage is INFORMATION.
37. • Understand the Business
• Be the Business
• Hybrid
• Exploit ICT
• Bridge the GAPS
CIO or DIM?
38. 1. Let users pull - don't push I.T. (too much)
2. It's easy for the organisation to tie
itself in I.T. knots
3. Harness the potential
4. Keep your balance
5. Avoid the lynching party
CIO Rope Tricks
39. CIOs will disappear ?
• Career Is Over?
• When IT standard, organisations will no longer
need CIOs?
• Information ASSET
• Change & Innovation Originators
• Organisations still need CIOs
40. “In the End
All that we have to manage is -
Information.”
Big Data?