SlideShare a Scribd company logo
1 of 36
Download to read offline
Intelligence, Insight, and the role of Scale:
  Data stories from the business world
  Dr Paul Miller
  Cloud of Data




        @PaulMiller
        http://cloudofdata.com

Big Data, we are told, is everywhere. And transformative.
And disruptive.

But how much has actually changed?
Topics

• Data Speaks


• Size Matters


• Personal Data, Privacy, Trust, and a Right to be Forgotten
Data Speaks          But listening may not be enough


Data is cool right now. Everything is “data-driven,” from science and
journalism to decision-making and policy shaping.

But actually, we’ve always gathered data and used it to craft
hypotheses, win arguments, and support theories.
en.wikipedia.org/wiki/File:Cholera_bacteria_SEM.jpg


Insight without scale. Severe Cholera Outbreak. London, 1854.
Hundreds died. Physician John Snow did not accept prevailing theory
that cholera was caused by ‘bad air.’
If he’d had access to an electron microscope, he would clearly have
spotted that cholera is a key ingredient in marmalade.
This image is in the public domain because its copyright has expired.




 en.wikipedia.org/wiki/File:Snow-cholera-map-1.jpg - Original map made by John Snow in 1854.

But Snow didn’t have an electron microscope. He plotted 12 water
pumps around Soho. Plotted deaths. CLEAR link to one pump.

Handle removed and outbreak stopped. Tho Snow himself admitted it
may have passed its peak before he acted.
infinity-imagined.tumblr.com/image/4889213516

staying small… neurons in the retina!
inmaps.linkedinlabs.com/                                         Image © LinkedIn



1,000 connections. They join me to 11 million connections of
connections and connections of connections of connections.

All of those 11 million influence what this picture looks like.

Inadvertently Crowdsourced Art ?
Talis
                                                Escapees
                            Library
                             land


     Museum
      pieces                                     Semantically Challenged



                                                                     European
                                                                    Commission
               Jisc
               world




                                                     In the
 inmaps.linkedinlabs.com/                            Cloud                 Image © LinkedIn



All I did was provide the labels. LinkedIn identified the clusters
algorithmically.

See influencers. Zoom in, and see people (like Richard Wallis) move
from one cluster to another.

Understand my network. If I were actively seeking to extract value, I
might identify the influencers and find ways to work with them, or have
them notice me, or something.
en.wikipedia.org/wiki/File:Colorized_transmission_electron_micrograph_of_Avian_influenza_A_H5N1_viruses.jpg


more bugs and germs and viruses, and things. This time - influenza.

This is H5N1 - Avian Flu.
Image © Google Foundation




 www.google.org/flutrends/

Millions of searches on Google for flu-related terms.

Not every searcher has flu, but at Google scale, the oddities are smoothed out; spikes in flu-
related searches correlate to increases in flu cases… and run slightly ahead of formal reports
to doctors.

Flu prevalent in the north just now, where it’s winter. Low in the south where it’s summer.
That’s hardly surprising. But why are Germany and Norway so much lighter than their
neighbours this week?
Images © Google Foundation
 www.google.org/flutrends/


A little closer to home… Don’t panic, but this area of the
Netherlands is the worst right now.

Google Flu Trends ‘predictions’ correlate well to historical data
from medical authorities… but Google Flu Trends data is
updated daily. EARLY WARNING ?
Image: www.flickr.com/photos/29803258@N02/4997562292/
Size Matters   Or does it?


this Scottish Castle is just Lego...
This image is in the public domain because its copyright has expired.
 en.wikipedia.org/wiki/File:Great_Wave_off_Kanagawa2.jpg


Data Deluge. Tsunami. Flood. Emotive language, and emotive
imagery. There’s too much. We can’t cope. It’s BAD.

Big Data… Despite the name… it isn’t actually just about size.

2001 report from META Group (now Gartner) proposed 3 V’s.
1. Volume.

Implicit presumption that Bigger is better. That is not always
true. Sometimes bigger just means the value is even more
hidden than before. From needle in a haystack to needle in
Germany. Finding the needle just got harder.
107 trillion emails in 2010. 340 million tweets per day. 50
billion pages in Google’s index. 82 petabytes in a single Hadoop
cluster at Yahoo - even more at Facebook. 72 hours of video
uploaded to YouTube every minute. 15 terabytes of data added
to Facebook every day.

Moving beyond the Terabyte. Petabytes, Zettabytes, and more.
2. Velocity.

How fast does it change - and how fast must I act?
Financial Institutions… increasingly moving from models and samples to
real-time authorisation.

Analyse purchase history. Analyse similar customers’ history. Decide
whether or not to authorise… as you are actually buying. Decisions in a
second or so.

Beginning to get smarter about context. You know I bought a plane ticket
to Amsterdam, so why are you querying a restaurant payment in
Amsterdam? Not there yet...
Much slower - hours rather than seconds.

always been well known for mining loyalty cards. But also leverages big
data techniques to reduce stock wastage in 3,000 UK stores by
£30million per annum.

weather forecast updated 3 times per day… implications for 18million
items analysed 3 times per day… and orders changed accordingly.
£50million less tied up in warehouse stock than previously.
Image: www.flickr.com/photos/heydrienne/22078028/


 3. Variety.

 from neatly structured database tables to structured, semi-
 structured & unstructured
Image: www.flickr.com/photos/johnjobby/2253823110/


 the usual example.

 But...
customer support

monitoring sentiment on social networks
mining sentiment and insight from customer forums
using semantics to understand and translate customer contributions,
lowering the cost of delivering quality support in minority languages.
People also now talk about a 4th V - Value. Not just how much it’s worth in monetary terms.
How much benefit does it deliver?

Surely this is the important V?

In some contexts, massive scale will be required to deliver value.
In some contexts, rapid response will be required to deliver value.
In some contexts, lots of different data sets will be required to deliver value.

But the business value should lead. If you don’t NEED petabytes of data, why collect and store
them?
http://dilbert.com/strips/comic/2013-01-09/              Image © Scott Adams



Big Data latest cool toy. It gets on the front of the Economist.
It’s in BusinessWeek and Fortune and Forbes, and the Financial
Times and the Wall Street Journal.

It’s what all the cool execs want. Conversations on the golf
course no doubt turn regularly to a competition in which
participants vie for the largest database.

But we need to see past that, and understand when and why
there is VALUE.
Opportunity or Threat?         Can we Trust Them?


huge opportunity lies in connections.

Within massive databases, but also between different silos of
information.

Strange disconnect between growing suspicion of corporate/
government motives… and growing reliance upon the results of
their data mining.
Customers who bought this…

Restricted to a single site.

Balance snooping with recommendations for items you might
actually want.
also this… becoming more contextually aware and more
personalised all the time.

Balance fear of being observed or manipulated with the clear
value of more relevant results.
And this. Widely reported last year to have found that Mac users tend to
spend more.

Misunderstood. Doesn’t mean that it is charging Mac users more for a
given room. But DOES mean that Mac users tend to pick more expensive
rooms… so help them and make more money by SHOWING THEM the
expensive rooms first.

Is that bad? Not really. It’s good use of available data. You’re not
stopping a Mac user from scrolling until they find the cheaper places… or
reordering the results by price.
Image: www.flickr.com/photos/stigster/3761714132/


 Or this.

 US insurance companies beginning to offer discounts for drivers
 who allow their location to be tracked. Even cheaper if you drive
 on certain roads, at certain times, in a certain way.

 Good today, as it saves you money and is optional. But where
 might it lead?
Image: www.flickr.com/photos/2e14/4631577447/

 Policy makers, businesses and individuals grapple with trying to find the
 boundaries.

 EC right to be forgotten… Makes sense in principle, but how far should it
 go? A list of books I bought from Amazon? Yes. My tiny influence upon
 the recommendations YOU get in Amazon? Possibly not.

 We don’t TRUST business. So what’s the answer? Regulation? Status Quo?
 Or something that recognises value of personal data… and makes it an
 asset to be traded (or not) ?
Image: www.flickr.com/photos/archeon/582708424/

 Personal Data Locker. MY data about me and my interactions with
 government, banks, businesses and more.

 I might give Barnes & Noble or Waterstones access to my Amazon
 purchase history… in return for discounts.

 AttentionTrust all over again… Or a data contract? The data IS valuable…
 but individuals need to see some of that value… and companies need to
 be more transparent.
UK gov making a start… with Midata...
we announced in November 2012
             that we’d use the law to compel
            businesses to release consumers’
              electronic personal data if they
                  didn’t do it voluntarily.




UK gov making a start… with Midata...
Conclusions

Data is incredibly powerful, and more and more of it is becoming freely and
openly available for our use. BUT we need skills and tools.

We need to keep sight of the point… and the value we’re trying to extract. Big
Data isn’t always necessary, despite what old companies and new startups tell
you!

Personally Identifiable Information is the next big opportunity, and the next big
battleground. How do we protect individuals AND create the market conditions for
new businesses to emerge.

Over-regulation would be as bad as unfettered exploitation.
Thank You!
         Paul Miller
        Cloud of Data

email paul.miller@cloudofdata.com
   web http://cloudofdata.com
        skype cloudofdata
       twitter @PaulMiller

More Related Content

What's hot

Ark Group Social Media - June 2007
Ark Group Social Media - June 2007Ark Group Social Media - June 2007
Ark Group Social Media - June 2007Lee Hopkins
 
Social media course 2010 2011: what's going on online?
Social media course 2010   2011: what's going on online?Social media course 2010   2011: what's going on online?
Social media course 2010 2011: what's going on online?guillaume ereteo
 
Silicon Beach 2013 - presenter notes
Silicon Beach 2013 - presenter notesSilicon Beach 2013 - presenter notes
Silicon Beach 2013 - presenter notesPlan-B Studio
 
Changing contexts: museums, audiences and technology
Changing contexts: museums, audiences and technologyChanging contexts: museums, audiences and technology
Changing contexts: museums, audiences and technologyMia
 
Implications of the near and far future
Implications of the near and far futureImplications of the near and far future
Implications of the near and far futureJon McMillan
 
Internet of Things: Luxury for the Rich or Sustainable Equity for All?
Internet of Things: Luxury for the Rich or Sustainable Equity for All?Internet of Things: Luxury for the Rich or Sustainable Equity for All?
Internet of Things: Luxury for the Rich or Sustainable Equity for All?The Transformation Society
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityLora Aroyo
 
Why databases suck for messaging
Why databases suck for messagingWhy databases suck for messaging
Why databases suck for messagingRabbit MQ
 
2600 v01 n07 (july 1984)
2600 v01 n07 (july 1984)2600 v01 n07 (july 1984)
2600 v01 n07 (july 1984)Felipe Prado
 
Engage moms shaun quigley lo_sophomo
Engage moms shaun quigley lo_sophomoEngage moms shaun quigley lo_sophomo
Engage moms shaun quigley lo_sophomoBrunner
 
Should little brother not help Big Brother? - Google and the privacy issue. F...
Should little brother not help Big Brother? - Google and the privacy issue. F...Should little brother not help Big Brother? - Google and the privacy issue. F...
Should little brother not help Big Brother? - Google and the privacy issue. F...Pär Lannerö
 
Hemlock grove for slideshare
Hemlock grove for slideshareHemlock grove for slideshare
Hemlock grove for slideshareC W
 
Data-driven journalism: What is there to learn? (Stanford, June 2010) #ddj
Data-driven journalism: What is there to learn? (Stanford, June 2010) #ddjData-driven journalism: What is there to learn? (Stanford, June 2010) #ddj
Data-driven journalism: What is there to learn? (Stanford, June 2010) #ddjMirko Lorenz
 
"It's all coming together"
"It's all coming together""It's all coming together"
"It's all coming together"cmazzucc
 
Connections a life in the day of - david walker
Connections   a life in the day of - david walkerConnections   a life in the day of - david walker
Connections a life in the day of - david walkerDavid Walker
 

What's hot (19)

Close - A clever way to make new friends
Close - A clever way to make new friendsClose - A clever way to make new friends
Close - A clever way to make new friends
 
Ark Group Social Media - June 2007
Ark Group Social Media - June 2007Ark Group Social Media - June 2007
Ark Group Social Media - June 2007
 
Social media course 2010 2011: what's going on online?
Social media course 2010   2011: what's going on online?Social media course 2010   2011: what's going on online?
Social media course 2010 2011: what's going on online?
 
Silicon Beach 2013 - presenter notes
Silicon Beach 2013 - presenter notesSilicon Beach 2013 - presenter notes
Silicon Beach 2013 - presenter notes
 
Changing contexts: museums, audiences and technology
Changing contexts: museums, audiences and technologyChanging contexts: museums, audiences and technology
Changing contexts: museums, audiences and technology
 
Implications of the near and far future
Implications of the near and far futureImplications of the near and far future
Implications of the near and far future
 
Internet of Things: Luxury for the Rich or Sustainable Equity for All?
Internet of Things: Luxury for the Rich or Sustainable Equity for All?Internet of Things: Luxury for the Rich or Sustainable Equity for All?
Internet of Things: Luxury for the Rich or Sustainable Equity for All?
 
Data Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden UniversityData Science with Human in the Loop @Faculty of Science #Leiden University
Data Science with Human in the Loop @Faculty of Science #Leiden University
 
Silicon Beach 2013
Silicon Beach 2013Silicon Beach 2013
Silicon Beach 2013
 
Why databases suck for messaging
Why databases suck for messagingWhy databases suck for messaging
Why databases suck for messaging
 
2600 v01 n07 (july 1984)
2600 v01 n07 (july 1984)2600 v01 n07 (july 1984)
2600 v01 n07 (july 1984)
 
Engage moms shaun quigley lo_sophomo
Engage moms shaun quigley lo_sophomoEngage moms shaun quigley lo_sophomo
Engage moms shaun quigley lo_sophomo
 
Should little brother not help Big Brother? - Google and the privacy issue. F...
Should little brother not help Big Brother? - Google and the privacy issue. F...Should little brother not help Big Brother? - Google and the privacy issue. F...
Should little brother not help Big Brother? - Google and the privacy issue. F...
 
Hemlock grove for slideshare
Hemlock grove for slideshareHemlock grove for slideshare
Hemlock grove for slideshare
 
Digital reality
Digital realityDigital reality
Digital reality
 
Data-driven journalism: What is there to learn? (Stanford, June 2010) #ddj
Data-driven journalism: What is there to learn? (Stanford, June 2010) #ddjData-driven journalism: What is there to learn? (Stanford, June 2010) #ddj
Data-driven journalism: What is there to learn? (Stanford, June 2010) #ddj
 
"It's all coming together"
"It's all coming together""It's all coming together"
"It's all coming together"
 
Future of addiction
Future of addictionFuture of addiction
Future of addiction
 
Connections a life in the day of - david walker
Connections   a life in the day of - david walkerConnections   a life in the day of - david walker
Connections a life in the day of - david walker
 

Viewers also liked

Exploiting Data in the Cloud
Exploiting Data in the CloudExploiting Data in the Cloud
Exploiting Data in the CloudPaul Miller
 
Unlocking the Database of Intentions
Unlocking the Database of IntentionsUnlocking the Database of Intentions
Unlocking the Database of IntentionsPaul Miller
 
Surfing the Data Flow
Surfing the Data FlowSurfing the Data Flow
Surfing the Data FlowPaul Miller
 
Sociale media voor uitzendondernemingen
Sociale media voor uitzendondernemingenSociale media voor uitzendondernemingen
Sociale media voor uitzendondernemingenMarco Hendrikse
 
Towards a Web of Data?
Towards a Web of Data?Towards a Web of Data?
Towards a Web of Data?Paul Miller
 

Viewers also liked (6)

Exploiting Data in the Cloud
Exploiting Data in the CloudExploiting Data in the Cloud
Exploiting Data in the Cloud
 
Unlocking the Database of Intentions
Unlocking the Database of IntentionsUnlocking the Database of Intentions
Unlocking the Database of Intentions
 
Surfing the Data Flow
Surfing the Data FlowSurfing the Data Flow
Surfing the Data Flow
 
Sociale media voor uitzendondernemingen
Sociale media voor uitzendondernemingenSociale media voor uitzendondernemingen
Sociale media voor uitzendondernemingen
 
Assignment 01
Assignment 01Assignment 01
Assignment 01
 
Towards a Web of Data?
Towards a Web of Data?Towards a Web of Data?
Towards a Web of Data?
 

Similar to Intelligence, Insight, and the role of Scale: Data stories from the business world

Heavy, Messy, Misleading: why Big Data is a human problem, not a tech one
Heavy, Messy, Misleading: why Big Data is a human problem, not a tech oneHeavy, Messy, Misleading: why Big Data is a human problem, not a tech one
Heavy, Messy, Misleading: why Big Data is a human problem, not a tech onePulsar
 
DEEP Scott Killoh-3
DEEP Scott Killoh-3DEEP Scott Killoh-3
DEEP Scott Killoh-3jonobermeyer
 
23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunishaShivlal Mewada
 
The renaissance brand
The renaissance brandThe renaissance brand
The renaissance brandYoni Kish
 
TED Wiley Visualizing .docx
TED  Wiley Visualizing .docxTED  Wiley Visualizing .docx
TED Wiley Visualizing .docxssuserf9c51d
 
Future Dot Now, LIFT France 10, Keynote Speech
 Future Dot Now, LIFT France 10, Keynote Speech Future Dot Now, LIFT France 10, Keynote Speech
Future Dot Now, LIFT France 10, Keynote SpeechSuperflux
 
Emergent MEDIA, NEXT GEN THINKING
Emergent MEDIA, NEXT GEN THINKINGEmergent MEDIA, NEXT GEN THINKING
Emergent MEDIA, NEXT GEN THINKINGAnn DeMarle
 
Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Dorothea Salo
 
"If you love your content, set it free" ?
"If you love your content, set it free" ?"If you love your content, set it free" ?
"If you love your content, set it free" ?Mike Ellis
 
Thinking in networks: what it means for policy makers
Thinking in networks: what it means for policy makersThinking in networks: what it means for policy makers
Thinking in networks: what it means for policy makersAlberto Cottica
 
Sensory transformation
Sensory transformationSensory transformation
Sensory transformationKarlos Svoboda
 
Collective Intelligence At Itu
Collective Intelligence At ItuCollective Intelligence At Itu
Collective Intelligence At ItuMazzara1976
 
Technology in Business Law by Ammar Younas
Technology in Business Law by Ammar YounasTechnology in Business Law by Ammar Younas
Technology in Business Law by Ammar YounasAmmar Younas
 
Heavy, messy, misleading. Why Big Data is a human problem, not a technology one.
Heavy, messy, misleading. Why Big Data is a human problem, not a technology one.Heavy, messy, misleading. Why Big Data is a human problem, not a technology one.
Heavy, messy, misleading. Why Big Data is a human problem, not a technology one.Francesco D'Orazio
 
Bitcoins May 2013
Bitcoins May 2013Bitcoins May 2013
Bitcoins May 2013WesWWeber
 
Researchers, Discovery and the Internet: What Next?
Researchers, Discovery and the Internet: What Next?Researchers, Discovery and the Internet: What Next?
Researchers, Discovery and the Internet: What Next?David Smith
 

Similar to Intelligence, Insight, and the role of Scale: Data stories from the business world (20)

So what if it's a bubble?
So what if it's a bubble?So what if it's a bubble?
So what if it's a bubble?
 
Heavy, Messy, Misleading: why Big Data is a human problem, not a tech one
Heavy, Messy, Misleading: why Big Data is a human problem, not a tech oneHeavy, Messy, Misleading: why Big Data is a human problem, not a tech one
Heavy, Messy, Misleading: why Big Data is a human problem, not a tech one
 
DEEP Scott Killoh-3
DEEP Scott Killoh-3DEEP Scott Killoh-3
DEEP Scott Killoh-3
 
23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha
 
The renaissance brand
The renaissance brandThe renaissance brand
The renaissance brand
 
Big Data-Job 2
Big Data-Job 2Big Data-Job 2
Big Data-Job 2
 
TED Wiley Visualizing .docx
TED  Wiley Visualizing .docxTED  Wiley Visualizing .docx
TED Wiley Visualizing .docx
 
Future Dot Now, LIFT France 10, Keynote Speech
 Future Dot Now, LIFT France 10, Keynote Speech Future Dot Now, LIFT France 10, Keynote Speech
Future Dot Now, LIFT France 10, Keynote Speech
 
Emergent MEDIA, NEXT GEN THINKING
Emergent MEDIA, NEXT GEN THINKINGEmergent MEDIA, NEXT GEN THINKING
Emergent MEDIA, NEXT GEN THINKING
 
Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?Is this BIG DATA which I see before me?
Is this BIG DATA which I see before me?
 
Overloaded
OverloadedOverloaded
Overloaded
 
Overloaded
OverloadedOverloaded
Overloaded
 
"If you love your content, set it free" ?
"If you love your content, set it free" ?"If you love your content, set it free" ?
"If you love your content, set it free" ?
 
Thinking in networks: what it means for policy makers
Thinking in networks: what it means for policy makersThinking in networks: what it means for policy makers
Thinking in networks: what it means for policy makers
 
Sensory transformation
Sensory transformationSensory transformation
Sensory transformation
 
Collective Intelligence At Itu
Collective Intelligence At ItuCollective Intelligence At Itu
Collective Intelligence At Itu
 
Technology in Business Law by Ammar Younas
Technology in Business Law by Ammar YounasTechnology in Business Law by Ammar Younas
Technology in Business Law by Ammar Younas
 
Heavy, messy, misleading. Why Big Data is a human problem, not a technology one.
Heavy, messy, misleading. Why Big Data is a human problem, not a technology one.Heavy, messy, misleading. Why Big Data is a human problem, not a technology one.
Heavy, messy, misleading. Why Big Data is a human problem, not a technology one.
 
Bitcoins May 2013
Bitcoins May 2013Bitcoins May 2013
Bitcoins May 2013
 
Researchers, Discovery and the Internet: What Next?
Researchers, Discovery and the Internet: What Next?Researchers, Discovery and the Internet: What Next?
Researchers, Discovery and the Internet: What Next?
 

More from Paul Miller

Big Data, Linked Data
Big Data, Linked DataBig Data, Linked Data
Big Data, Linked DataPaul Miller
 
Information Security and Cloud Computing
Information Security and Cloud ComputingInformation Security and Cloud Computing
Information Security and Cloud ComputingPaul Miller
 
Why Linked Data?
Why Linked Data?Why Linked Data?
Why Linked Data?Paul Miller
 
Cutting through the hype: Cloud Computing in Context
Cutting through the hype: Cloud Computing in ContextCutting through the hype: Cloud Computing in Context
Cutting through the hype: Cloud Computing in ContextPaul Miller
 
The Cloud Computing Stack
The Cloud Computing StackThe Cloud Computing Stack
The Cloud Computing StackPaul Miller
 
Toward the Data Cloud
Toward the Data CloudToward the Data Cloud
Toward the Data CloudPaul Miller
 

More from Paul Miller (6)

Big Data, Linked Data
Big Data, Linked DataBig Data, Linked Data
Big Data, Linked Data
 
Information Security and Cloud Computing
Information Security and Cloud ComputingInformation Security and Cloud Computing
Information Security and Cloud Computing
 
Why Linked Data?
Why Linked Data?Why Linked Data?
Why Linked Data?
 
Cutting through the hype: Cloud Computing in Context
Cutting through the hype: Cloud Computing in ContextCutting through the hype: Cloud Computing in Context
Cutting through the hype: Cloud Computing in Context
 
The Cloud Computing Stack
The Cloud Computing StackThe Cloud Computing Stack
The Cloud Computing Stack
 
Toward the Data Cloud
Toward the Data CloudToward the Data Cloud
Toward the Data Cloud
 

Recently uploaded

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 

Recently uploaded (20)

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 

Intelligence, Insight, and the role of Scale: Data stories from the business world

  • 1. Intelligence, Insight, and the role of Scale: Data stories from the business world Dr Paul Miller Cloud of Data @PaulMiller http://cloudofdata.com Big Data, we are told, is everywhere. And transformative. And disruptive. But how much has actually changed?
  • 2. Topics • Data Speaks • Size Matters • Personal Data, Privacy, Trust, and a Right to be Forgotten
  • 3. Data Speaks But listening may not be enough Data is cool right now. Everything is “data-driven,” from science and journalism to decision-making and policy shaping. But actually, we’ve always gathered data and used it to craft hypotheses, win arguments, and support theories.
  • 4. en.wikipedia.org/wiki/File:Cholera_bacteria_SEM.jpg Insight without scale. Severe Cholera Outbreak. London, 1854. Hundreds died. Physician John Snow did not accept prevailing theory that cholera was caused by ‘bad air.’ If he’d had access to an electron microscope, he would clearly have spotted that cholera is a key ingredient in marmalade.
  • 5. This image is in the public domain because its copyright has expired. en.wikipedia.org/wiki/File:Snow-cholera-map-1.jpg - Original map made by John Snow in 1854. But Snow didn’t have an electron microscope. He plotted 12 water pumps around Soho. Plotted deaths. CLEAR link to one pump. Handle removed and outbreak stopped. Tho Snow himself admitted it may have passed its peak before he acted.
  • 7. inmaps.linkedinlabs.com/ Image © LinkedIn 1,000 connections. They join me to 11 million connections of connections and connections of connections of connections. All of those 11 million influence what this picture looks like. Inadvertently Crowdsourced Art ?
  • 8. Talis Escapees Library land Museum pieces Semantically Challenged European Commission Jisc world In the inmaps.linkedinlabs.com/ Cloud Image © LinkedIn All I did was provide the labels. LinkedIn identified the clusters algorithmically. See influencers. Zoom in, and see people (like Richard Wallis) move from one cluster to another. Understand my network. If I were actively seeking to extract value, I might identify the influencers and find ways to work with them, or have them notice me, or something.
  • 9. en.wikipedia.org/wiki/File:Colorized_transmission_electron_micrograph_of_Avian_influenza_A_H5N1_viruses.jpg more bugs and germs and viruses, and things. This time - influenza. This is H5N1 - Avian Flu.
  • 10. Image © Google Foundation www.google.org/flutrends/ Millions of searches on Google for flu-related terms. Not every searcher has flu, but at Google scale, the oddities are smoothed out; spikes in flu- related searches correlate to increases in flu cases… and run slightly ahead of formal reports to doctors. Flu prevalent in the north just now, where it’s winter. Low in the south where it’s summer. That’s hardly surprising. But why are Germany and Norway so much lighter than their neighbours this week?
  • 11. Images © Google Foundation www.google.org/flutrends/ A little closer to home… Don’t panic, but this area of the Netherlands is the worst right now. Google Flu Trends ‘predictions’ correlate well to historical data from medical authorities… but Google Flu Trends data is updated daily. EARLY WARNING ?
  • 13.
  • 14. Size Matters Or does it? this Scottish Castle is just Lego...
  • 15. This image is in the public domain because its copyright has expired. en.wikipedia.org/wiki/File:Great_Wave_off_Kanagawa2.jpg Data Deluge. Tsunami. Flood. Emotive language, and emotive imagery. There’s too much. We can’t cope. It’s BAD. Big Data… Despite the name… it isn’t actually just about size. 2001 report from META Group (now Gartner) proposed 3 V’s.
  • 16. 1. Volume. Implicit presumption that Bigger is better. That is not always true. Sometimes bigger just means the value is even more hidden than before. From needle in a haystack to needle in Germany. Finding the needle just got harder.
  • 17. 107 trillion emails in 2010. 340 million tweets per day. 50 billion pages in Google’s index. 82 petabytes in a single Hadoop cluster at Yahoo - even more at Facebook. 72 hours of video uploaded to YouTube every minute. 15 terabytes of data added to Facebook every day. Moving beyond the Terabyte. Petabytes, Zettabytes, and more.
  • 18. 2. Velocity. How fast does it change - and how fast must I act?
  • 19. Financial Institutions… increasingly moving from models and samples to real-time authorisation. Analyse purchase history. Analyse similar customers’ history. Decide whether or not to authorise… as you are actually buying. Decisions in a second or so. Beginning to get smarter about context. You know I bought a plane ticket to Amsterdam, so why are you querying a restaurant payment in Amsterdam? Not there yet...
  • 20. Much slower - hours rather than seconds. always been well known for mining loyalty cards. But also leverages big data techniques to reduce stock wastage in 3,000 UK stores by £30million per annum. weather forecast updated 3 times per day… implications for 18million items analysed 3 times per day… and orders changed accordingly. £50million less tied up in warehouse stock than previously.
  • 21. Image: www.flickr.com/photos/heydrienne/22078028/ 3. Variety. from neatly structured database tables to structured, semi- structured & unstructured
  • 23. customer support monitoring sentiment on social networks mining sentiment and insight from customer forums using semantics to understand and translate customer contributions, lowering the cost of delivering quality support in minority languages.
  • 24. People also now talk about a 4th V - Value. Not just how much it’s worth in monetary terms. How much benefit does it deliver? Surely this is the important V? In some contexts, massive scale will be required to deliver value. In some contexts, rapid response will be required to deliver value. In some contexts, lots of different data sets will be required to deliver value. But the business value should lead. If you don’t NEED petabytes of data, why collect and store them?
  • 25. http://dilbert.com/strips/comic/2013-01-09/ Image © Scott Adams Big Data latest cool toy. It gets on the front of the Economist. It’s in BusinessWeek and Fortune and Forbes, and the Financial Times and the Wall Street Journal. It’s what all the cool execs want. Conversations on the golf course no doubt turn regularly to a competition in which participants vie for the largest database. But we need to see past that, and understand when and why there is VALUE.
  • 26. Opportunity or Threat? Can we Trust Them? huge opportunity lies in connections. Within massive databases, but also between different silos of information. Strange disconnect between growing suspicion of corporate/ government motives… and growing reliance upon the results of their data mining.
  • 27. Customers who bought this… Restricted to a single site. Balance snooping with recommendations for items you might actually want.
  • 28. also this… becoming more contextually aware and more personalised all the time. Balance fear of being observed or manipulated with the clear value of more relevant results.
  • 29. And this. Widely reported last year to have found that Mac users tend to spend more. Misunderstood. Doesn’t mean that it is charging Mac users more for a given room. But DOES mean that Mac users tend to pick more expensive rooms… so help them and make more money by SHOWING THEM the expensive rooms first. Is that bad? Not really. It’s good use of available data. You’re not stopping a Mac user from scrolling until they find the cheaper places… or reordering the results by price.
  • 30. Image: www.flickr.com/photos/stigster/3761714132/ Or this. US insurance companies beginning to offer discounts for drivers who allow their location to be tracked. Even cheaper if you drive on certain roads, at certain times, in a certain way. Good today, as it saves you money and is optional. But where might it lead?
  • 31. Image: www.flickr.com/photos/2e14/4631577447/ Policy makers, businesses and individuals grapple with trying to find the boundaries. EC right to be forgotten… Makes sense in principle, but how far should it go? A list of books I bought from Amazon? Yes. My tiny influence upon the recommendations YOU get in Amazon? Possibly not. We don’t TRUST business. So what’s the answer? Regulation? Status Quo? Or something that recognises value of personal data… and makes it an asset to be traded (or not) ?
  • 32. Image: www.flickr.com/photos/archeon/582708424/ Personal Data Locker. MY data about me and my interactions with government, banks, businesses and more. I might give Barnes & Noble or Waterstones access to my Amazon purchase history… in return for discounts. AttentionTrust all over again… Or a data contract? The data IS valuable… but individuals need to see some of that value… and companies need to be more transparent.
  • 33. UK gov making a start… with Midata...
  • 34. we announced in November 2012 that we’d use the law to compel businesses to release consumers’ electronic personal data if they didn’t do it voluntarily. UK gov making a start… with Midata...
  • 35. Conclusions Data is incredibly powerful, and more and more of it is becoming freely and openly available for our use. BUT we need skills and tools. We need to keep sight of the point… and the value we’re trying to extract. Big Data isn’t always necessary, despite what old companies and new startups tell you! Personally Identifiable Information is the next big opportunity, and the next big battleground. How do we protect individuals AND create the market conditions for new businesses to emerge. Over-regulation would be as bad as unfettered exploitation.
  • 36. Thank You! Paul Miller Cloud of Data email paul.miller@cloudofdata.com web http://cloudofdata.com skype cloudofdata twitter @PaulMiller