SlideShare a Scribd company logo
1 of 67
BIG DATA AT HUMAN SCALE.
Matt LeMay, @mattlemay
BIG DATA
IS BIG
How BIG is it?
We have built the capacity to store
more bytes of data than the
Earth has grains of sand.
... about 315 times more.
If each bit of data we have the capacity to
store were to represent a star, then
there would be a GALAXY OF
DATA for every person on Earth.
The data Walmart generates every hour from
its customer transactions represents 167 times the
information contained in all the books in the United
States Library of Congress.
PWNED
The number of bytes
we’ve built the
capacity to store
constitutes only a
TINY FRACTION
of the number of
atoms you have in
your body.
... or the amount of
data stored in your
DNA.
In fact, the data storage capacity of the entire
world is less than one percent of the information
stored in the DNA molecules of a single person.
as we approach human scale...
...big data seems smaller.
... but it’s bigger than it’s ever been before.
=
ALL the data
created until the
year 2003
ALL the data
created every
two days
Scale of Data ~3,000 Years Ago:
Scale of Data ~300 Years Ago:
Scale of Data ~30 Years Ago:
Scale of Data ~3 Years Ago:
We’ve been writing stuff on walls for 30,000 years...
... and we’re still not entirely what it all means.
“BIG DATA” is US*,
in higher resolution.
“We’re distracted by a bunch of nonsense.”
“Ephemeral thoughts and actions, which were once
lost to time, are now recorded forever.”
That record is “BIG DATA.”
According to , 43% of all data
gathered on people comes from social media.
We overshare compulsively, but we are more
concerned than ever before about our privacy.
Privacy vs Permission
Privacy = “My data is valuable, and
others want access so that they can spy
on me or sell me stuff I don’t want.”
Permission = “My data is valuable, so
I will explicitly grant others access to it
in specific situations where it is
worthwhile for me to do so.”
Privacy is something we need to worry about
when expectations are violated around the
permissions we agree to.
Even explicit permission...
... doesn’t override expectation.
... often struggles to square permission with
expectation, at times to their own detriment.
weknowwhatyouredoing.com
We expect clicks to be private gestures,
and shares to be public gestures.
Facebook’s social reader violated those
expectations.
We share who we want to be.
We click who we fear we are.
... and it matters.
We share our
information
because we trust
that sharing will
make it more
valuable to us.
“The future has an ancient heart.”
- Carlo Levi
My data Your data
BIG DATA “MAGIC”
Me You
BIG DATA “MAGIC”
“HADOOP!”
MAGICKAL RABBITS OF INSIGHT!!11
Me You
... but “BIG DATA” is not magic.
“MAGIC BIG DATA TECHNOLOGY”
is a set of tools...
... necessitated by scale.
- Tim O’Brien, O’Reilly Strata Conference
COUNTING
is not
UNDERSTANDING
THE ALGORITHM
WON’T SAVE YOU
BIG DATA is only as
good as the questions
we ask of it.
... and many of those questions haven’t changed.
Loyalty clubs and targeted coupons are the
oldest trick in the “big data” book.
- Andrew Pole,Target
Big Data could make advertising and
marketing better.*
(Which will, in turn, hopefully pay for all those nifty services we use to generate all that data.)
Twitter Search == BIG Data.
*
... but the potential goes beyond advertising.
When done right, BIG DATA encourages
you to SHARE MORE, not less.
“BIG DATA” is all around us.
...and it doesn’t feel ZOMG WORLD-CHANGING
... because it’s in our cells.
Thank you.
Questions?
@MATTLEMAY

More Related Content

Viewers also liked

Chapter 4 scale and proportion
Chapter 4 scale and proportionChapter 4 scale and proportion
Chapter 4 scale and proportionTracie King
 
2015 Upload Campaigns Calendar - SlideShare
2015 Upload Campaigns Calendar - SlideShare2015 Upload Campaigns Calendar - SlideShare
2015 Upload Campaigns Calendar - SlideShareSlideShare
 
What to Upload to SlideShare
What to Upload to SlideShareWhat to Upload to SlideShare
What to Upload to SlideShareSlideShare
 
Getting Started With SlideShare
Getting Started With SlideShareGetting Started With SlideShare
Getting Started With SlideShareSlideShare
 

Viewers also liked (6)

HUMAN SCALE
HUMAN SCALEHUMAN SCALE
HUMAN SCALE
 
Scale & Proportion
Scale & ProportionScale & Proportion
Scale & Proportion
 
Chapter 4 scale and proportion
Chapter 4 scale and proportionChapter 4 scale and proportion
Chapter 4 scale and proportion
 
2015 Upload Campaigns Calendar - SlideShare
2015 Upload Campaigns Calendar - SlideShare2015 Upload Campaigns Calendar - SlideShare
2015 Upload Campaigns Calendar - SlideShare
 
What to Upload to SlideShare
What to Upload to SlideShareWhat to Upload to SlideShare
What to Upload to SlideShare
 
Getting Started With SlideShare
Getting Started With SlideShareGetting Started With SlideShare
Getting Started With SlideShare
 

Similar to "Big Data at Human Scale," Wharton Web Conference 2013

Big Data, Small Data, Data that Totally Rocks - SMWTO
Big Data, Small Data, Data that Totally Rocks - SMWTOBig Data, Small Data, Data that Totally Rocks - SMWTO
Big Data, Small Data, Data that Totally Rocks - SMWTORob Clark
 
Big Data in the Legal Industry
Big Data in the Legal IndustryBig Data in the Legal Industry
Big Data in the Legal IndustryEvolve Law
 
Risk Factory Big Daddy Digs Big Data
Risk Factory Big Daddy Digs Big DataRisk Factory Big Daddy Digs Big Data
Risk Factory Big Daddy Digs Big DataRisk Crew
 
Data Days 2014 - Nina Dierks
Data Days 2014 - Nina DierksData Days 2014 - Nina Dierks
Data Days 2014 - Nina Dierksdatadays
 
Thriving in the 21st century
Thriving in the 21st centuryThriving in the 21st century
Thriving in the 21st centuryGlenn Wiebe
 
Bitcoins May 2013
Bitcoins May 2013Bitcoins May 2013
Bitcoins May 2013WesWWeber
 
Family. Our Future in Cyberspace
Family. Our Future in CyberspaceFamily. Our Future in Cyberspace
Family. Our Future in Cyberspacemangoups
 
InfographicsMadeEasy.pdf
InfographicsMadeEasy.pdfInfographicsMadeEasy.pdf
InfographicsMadeEasy.pdfzdczxcxzczx1
 
SXSW 2012 - Big Data Conversation
SXSW 2012 - Big Data ConversationSXSW 2012 - Big Data Conversation
SXSW 2012 - Big Data Conversationjohn st.
 
2600 v21 n3 (autumn 2004)
2600 v21 n3 (autumn 2004)2600 v21 n3 (autumn 2004)
2600 v21 n3 (autumn 2004)Felipe Prado
 
Homeland security
Homeland securityHomeland security
Homeland securityWes Widner
 
2600 v24 n4 (winter 2007)
2600 v24 n4 (winter 2007)2600 v24 n4 (winter 2007)
2600 v24 n4 (winter 2007)Felipe Prado
 
The Intranets of Babel
The Intranets of BabelThe Intranets of Babel
The Intranets of BabelIqbal Mohammed
 
GnoTag - Semantically Barcoding Our World
GnoTag - Semantically Barcoding Our WorldGnoTag - Semantically Barcoding Our World
GnoTag - Semantically Barcoding Our WorldLee Livezey
 
SSI Meetup – interpersonal data, identity and collective minds
SSI Meetup – interpersonal data, identity and collective mindsSSI Meetup – interpersonal data, identity and collective minds
SSI Meetup – interpersonal data, identity and collective mindsPhilip Sheldrake
 
Business considerations for privacy and open data: how not to get caught out
Business considerations for privacy and open data: how not to get caught outBusiness considerations for privacy and open data: how not to get caught out
Business considerations for privacy and open data: how not to get caught outtheODI
 

Similar to "Big Data at Human Scale," Wharton Web Conference 2013 (20)

Big Data, Small Data, Data that Totally Rocks - SMWTO
Big Data, Small Data, Data that Totally Rocks - SMWTOBig Data, Small Data, Data that Totally Rocks - SMWTO
Big Data, Small Data, Data that Totally Rocks - SMWTO
 
Big Data in the Legal Industry
Big Data in the Legal IndustryBig Data in the Legal Industry
Big Data in the Legal Industry
 
Big Data! Dopey Quotes!
Big Data! Dopey Quotes!Big Data! Dopey Quotes!
Big Data! Dopey Quotes!
 
Big Data, Deep Thought
Big Data, Deep ThoughtBig Data, Deep Thought
Big Data, Deep Thought
 
Risk Factory Big Daddy Digs Big Data
Risk Factory Big Daddy Digs Big DataRisk Factory Big Daddy Digs Big Data
Risk Factory Big Daddy Digs Big Data
 
Data Days 2014 - Nina Dierks
Data Days 2014 - Nina DierksData Days 2014 - Nina Dierks
Data Days 2014 - Nina Dierks
 
Thriving in the 21st century
Thriving in the 21st centuryThriving in the 21st century
Thriving in the 21st century
 
Bitcoins May 2013
Bitcoins May 2013Bitcoins May 2013
Bitcoins May 2013
 
Family. Our Future in Cyberspace
Family. Our Future in CyberspaceFamily. Our Future in Cyberspace
Family. Our Future in Cyberspace
 
Algorithms
AlgorithmsAlgorithms
Algorithms
 
InfographicsMadeEasy.pdf
InfographicsMadeEasy.pdfInfographicsMadeEasy.pdf
InfographicsMadeEasy.pdf
 
SXSW 2012 - Big Data Conversation
SXSW 2012 - Big Data ConversationSXSW 2012 - Big Data Conversation
SXSW 2012 - Big Data Conversation
 
2600 v21 n3 (autumn 2004)
2600 v21 n3 (autumn 2004)2600 v21 n3 (autumn 2004)
2600 v21 n3 (autumn 2004)
 
Homeland security
Homeland securityHomeland security
Homeland security
 
Big Human Data
Big Human DataBig Human Data
Big Human Data
 
2600 v24 n4 (winter 2007)
2600 v24 n4 (winter 2007)2600 v24 n4 (winter 2007)
2600 v24 n4 (winter 2007)
 
The Intranets of Babel
The Intranets of BabelThe Intranets of Babel
The Intranets of Babel
 
GnoTag - Semantically Barcoding Our World
GnoTag - Semantically Barcoding Our WorldGnoTag - Semantically Barcoding Our World
GnoTag - Semantically Barcoding Our World
 
SSI Meetup – interpersonal data, identity and collective minds
SSI Meetup – interpersonal data, identity and collective mindsSSI Meetup – interpersonal data, identity and collective minds
SSI Meetup – interpersonal data, identity and collective minds
 
Business considerations for privacy and open data: how not to get caught out
Business considerations for privacy and open data: how not to get caught outBusiness considerations for privacy and open data: how not to get caught out
Business considerations for privacy and open data: how not to get caught out
 

Recently uploaded

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Recently uploaded (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

"Big Data at Human Scale," Wharton Web Conference 2013