THE DAWN OF BIG DATA

New Rules; New Structures




   Neal J. Hannon
   University of Kansas
   February 9, 2012
Data Mania
• http://www.youtube.com/watch?v=HQ_3g2hUCn4&feature=player_embedded
Definition

• Big data is a term applied to data sets whose
  size is beyond the ability of commonly used
  software tools to capture, manage, and process
  the data within a tolerable elapsed time. Big
  data sizes are a constantly moving target
  currently ranging from a few dozen terabytes to
  many petabytes of data in a single data set.
More Data Please…
• In a 2001 research report[14] and related
  conference presentations, then-META Group
  (now Gartner) analyst Doug Laney defined
  data growth challenges (and opportunities) as
  being three-dimensional, i.e. increasing volume
  (amount of data), velocity (speed of data in/out),
  and variety (range of data types, sources).
  Gartner continues to use this model for
  describing big data.[15]
Gartner
• Worldwide information volume is growing
  at a minimum rate of 59 percent
  annually, and while volume is a significant
  challenge in managing big data, business and
  IT leaders must focus on information volume,
  variety and velocity.
  • Volume
  • Variety
  • Velocity
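To put the 59 percent figure in perspective, here is a small worked calculation. It is only a sketch: it assumes the rate stays constant and compounds once a year, which the slide does not state, and the function names are illustrative. Under that assumption, volume doubles in about a year and a half and grows roughly ten-fold in five years.

```python
import math

# Gartner's stated minimum annual growth rate (from the slide above).
ANNUAL_GROWTH = 0.59


def volume_after(years: float, start: float = 1.0, rate: float = ANNUAL_GROWTH) -> float:
    """Relative data volume after `years` of constant compound growth (assumption)."""
    return start * (1.0 + rate) ** years


def doubling_time(rate: float = ANNUAL_GROWTH) -> float:
    """Years needed for volume to double at a constant compound rate."""
    return math.log(2.0) / math.log(1.0 + rate)


print(f"Doubling time: {doubling_time():.2f} years")
for years in (1, 3, 5):
    print(f"After {years} year(s): {volume_after(years):.1f}x the starting volume")
```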
Volume

• Volume: The increase in data volumes within
  enterprise systems is caused by transaction
  volumes and other traditional data types, as
  well as by new types of data. Too much volume
  is a storage issue, but too much data is also a
  massive analysis issue.
Variety
• Variety: IT leaders have always had an issue
  translating large volumes of transactional
  information into decisions — now there are
  more types of information to analyze — mainly
  coming from social media and mobile
  (context-aware). Variety includes tabular data
  (databases), hierarchical data, documents,
  e-mail, metering data, video, still images, audio,
  stock ticker data, financial transactions and
  more.
Velocity
• Velocity: This involves streams of data,
  structured record creation, and availability for
  access and delivery. Velocity means both how
  fast data is being produced and how fast the
  data must be processed to meet demand.
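The two sides of velocity (how fast data arrives and how fast it is processed) can be compared with simple arithmetic. The sketch below assumes constant rates, which real streams rarely have; the function names and the example numbers are hypothetical.

```python
def keeps_up(produced_per_sec: float, processed_per_sec: float) -> bool:
    """True when processing capacity is at least the arrival rate."""
    return processed_per_sec >= produced_per_sec


def backlog_after(seconds: float, produced_per_sec: float, processed_per_sec: float) -> float:
    """Records left waiting after `seconds`, assuming both rates are constant."""
    return max(0.0, (produced_per_sec - processed_per_sec) * seconds)


# Hypothetical stream: 1,000 records/s arrive, 800 records/s are processed.
print(keeps_up(1000, 800))
print(backlog_after(60, 1000, 800))
```

Whenever processing is slower than production, the backlog grows without bound, which is why velocity is a question of demand and not only of raw speed.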
Why now?
There were 5 exabytes of information created between the dawn of
civilization through 2003, but that much information is now created every 2
days, and the pace is increasing
                                     Eric Schmidt, Google CEO, Techonomy Conference, August 4, 2010




Data is becoming the new raw material of business: an economic input
almost on a par with capital and labour. “Every day I wake up and ask, ‘how
can I flow data better, manage data better, analyse data better?’” says Rollin
Ford, the CIO of Wal-Mart.
                                     Source: Data, Data Everywhere, The Economist, February 25, 2010
Source: Mike Driscoll, CTO Metamarkets: The Three Sexy Skills of Data Scientists (& Data Driven Startups)
How can big data create value?

• Creating transparency – enabling, for example,
  the manufacturing sector to integrate “data from
  R&D, engineering, and manufacturing units to
  enable concurrent engineering ... (to)
  significantly cut time to market and improve
  quality.” This seems much like traditional data
  warehousing.
How can Big Data create value?
• Enabling experimentation – “organizations can
  collect more accurate and detailed performance
  data ... to instrument processes and then set up
  controlled experiments … (which) can enable
  leaders to manage performance at higher
  levels.” Super-crunching equals analytics +
  experiments.
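As an illustration of "analytics + experiments", the sketch below evaluates a controlled experiment with two groups using a two-proportion z-test. The test is a common choice and not one named in the slides; the function name and the conversion counts are made up for the example.

```python
import math


def two_proportion_z(success_a: int, n_a: int, success_b: int, n_b: int):
    """Return (z, two-sided p-value) comparing the success rates of groups A and B."""
    p_a = success_a / n_a
    p_b = success_b / n_b
    # Pooled rate under the null hypothesis that both groups behave the same.
    pooled = (success_a + success_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n_a + 1.0 / n_b))
    z = (p_b - p_a) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_value


# Hypothetical data: control converts 200 of 5,000; treatment converts 260 of 5,000.
z, p = two_proportion_z(200, 5000, 260, 5000)
print(f"z = {z:.2f}, p = {p:.4f}")
```

A small p-value suggests the difference between the groups is unlikely to be chance alone, which is the kind of evidence a controlled experiment is meant to provide.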
How can Big Data create value?
• Innovating new business models – “The
  emergence of real-time location data has
  created an entirely new set of location-based
  services from navigation to pricing property and
  casualty insurance based on where, and how,
  people drive their cars.” This affirms Mike
  Loukides' assertion “that data science enables
  the creation of data products.”
How can Big Data create value?
• Supporting human decision making with
  automated algorithms – “decision making may
  never be the same; some organizations are
  already making better decisions by analyzing
  entire datasets from customers, employees, or
  even sensors embedded in products.” The
  statistical learning world continues to progress.
SAS - unstructured text

• http://www.youtube.com/user/SASsoftware?v=NHAq8jG4FX4&feature=pyv&ad=8557352196&kw=data%20analytics
Pattern Based Strategy
•   "The ability to manage extreme data will be a core competency of enterprises that
    are increasingly using new forms of information — such as text, social and context —
    to look for patterns that support business decisions in what we call Pattern-Based
    Strategy," said Yvonne Genovese, vice president and distinguished analyst at
    Gartner. "Pattern-Based Strategy, as an engine of change, utilizes all the
    dimensions in its pattern-seeking process. It then provides the basis of the modeling
    for new business solutions, which allows the business to adapt. The
    seek-model-and-adapt cycle can then be completed in various mediums, such as social
    computing analysis or context-aware computing engines."
Pattern Based Strategy

• http://www.youtube.com/watch?v=r8N0L8Cz1qg&feature=BFa&list=UUSNX50LYGXWV_e5UWZGPGbw&lf=plpp_video
EMC’s Big Data Video

• http://www.youtube.com/watch?v=ILBV391a8Ic



• O’Reilly’s Take
• http://www.youtube.com/watch?v=Rn5rVGGfzy0&feature=related
Tricks of the Trade

• New Architecture

• In Memory Analytics
In-Memory Indexing at SAP
• We have also got enterprise search. We really started doing that back in the
  2003/2004 time period, which is also when we started coming out with
  Business Warehouse Accelerator. That was when Google was just really
  starting to become Google, and we tried to do the same thing with enterprise
  data that Google does with website data as far as indexing it. So we also
  put the indexes in memory, so it sped up even further, and now,
  if you actually look at HANA, it really is kind of the next evolutionary step
  in that chain. This is in-memory processing, and this isn’t something just
  for a specialist. It really is a technology that’s matured to a level that it can
  run the entire business suite and run your entire company in-memory and
  get all those benefits for everything.

• http://docs.media.bitpipe.com/io_10x/io_102428/item_477005/The%20Next%20Chapter%20of%20In-Memory%20Computing_PT_12.22.11.pdf
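The idea of keeping a search index in memory can be shown with a toy inverted index. This is a minimal sketch and not how SAP's products are implemented: the class name, the whitespace tokenizer, and the sample documents are all assumptions made for illustration.

```python
from collections import defaultdict


class InMemoryIndex:
    """Toy inverted index held entirely in RAM: token -> set of document ids."""

    def __init__(self) -> None:
        self._postings = defaultdict(set)

    def add(self, doc_id: str, text: str) -> None:
        """Index a document by its lowercased, whitespace-separated tokens."""
        for token in text.lower().split():
            self._postings[token].add(doc_id)

    def search(self, query: str) -> set:
        """Return ids of documents that contain every token in the query."""
        tokens = query.lower().split()
        if not tokens:
            return set()
        result = None
        for token in tokens:
            ids = self._postings.get(token, set())
            result = set(ids) if result is None else result & ids
        return result


# Hypothetical enterprise records.
index = InMemoryIndex()
index.add("inv-1", "Invoice for Kansas warehouse order")
index.add("inv-2", "Invoice for Boston retail order")
print(index.search("invoice kansas"))
```

Because lookups are dictionary and set operations on data already in memory, a query never touches disk, which is the source of the speed-up described above.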
For more on HADOOP
• http://www.slideshare.net/PhilippeJulio/hadoop-architecture
Obligatory Questions slide


• Any Questions?

The dawn of big data

  • 1. THE DAWN OF BIG DATA New Rules; New Structures Neal J. Hannon University of Kansas February 9, 2012
  • 3. Definition • Big data is a term applied to data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time. Big data sizes are a constantly moving target currently ranging from a few dozen terabytes to many petabytes of data in a single data set.
  • 4.
  • 6. • In a 2001 research report[14] and related conference presentations, then META Group (now Gartner) analyst, Doug Laney, defined data growth challenges (and opportunities) as being three-dimensional, i.e. increasing volume (amount of data), velocity (speed of data in/out), and variety (range of data types, sources). Gartner continues to use this model for describing big data.[15]
  • 7. Gartner • Worldwide information volume is growing annually at a minimum rate of 59 percent annually, and while volume is a significant challenge in managing big data, business and IT leaders must focus on information volume, variety and velocity. • Volume • Variety • Velocity
  • 8. Volume • Volume: The increase in data volumes within enterprise systems is caused by transaction volumes and other traditional data types, as well as by new types of data. Too much volume is a storage issue, but too much data is also a massive analysis issue.
  • 9. Variety • Variety: IT leaders have always had an issue translating large volumes of transactional information into decisions — now there are more types of information to analyze — mainly coming from social media and mobile (context- aware). Variety includes tabular data (databases), hierarchical data, documents, e- mail, metering data, video, still images, audio, stock ticker data, financial transactions and more.
  • 10. Velocity • Velocity: This involves streams of data, structured record creation, and availability for access and delivery. Velocity means both how fast data is being produced and how fast the data must be processed to meet demand.
  • 11. Why now? There were 5 exabytes of information created between the dawn of civilization through 2003, but that much information is now created every 2 days, and the pace is increasing Eric Schmidt, Google CEO, Techonomy Conference, August 4, 2010 Data is becoming the new raw material of business: an economic input almost on a par with capital and labour. “Every day I wake up and ask, ‘how can I flow data better, manage data better, analyse data better?” says Rollin Ford, the CIO of Wal-Mart. Source: Data, Data Everywhere, The Economist, February 25, 2010
  • 12. Source: Mike Driscoll, CTO Metamarkets: The Three Sexy Skills of Data Scientists (& Data Driven Startups)
  • 13. Source: Mike Driscoll, CTO Metamarkets: The Three Sexy Skills of Data Scientists (& Data Driven Startups)
  • 14.
  • 15.
  • 16.
  • 17. How can Big Data create value? • Creating transparency – enabling, for example, the manufacturing sector to integrate “data from R&D, engineering, and manufacturing units to enable concurrent engineering ... (to) significantly cut time to market and improve quality.” This seems much like traditional data warehousing.
  • 18. How can Big Data create value? • Enabling experimentation – “organizations can collect more accurate and detailed performance data ... to instrument processes and then set up controlled experiments … (which) can enable leaders to manage performance at higher levels.” Super-crunching equals analytics + experiments.
  • 19. How can Big Data create value? • Innovating new business models – “The emergence of real-time location data has created an entirely new set of location-based services from navigation to pricing property and casualty insurance based on where, and how, people drive their cars.” This affirms Mike Loukides' assertion “that data science enables the creation of data products.”
  • 20. How can Big Data create value? • Supporting human decision making with automated algorithms – “decision making may never be the same; some organizations are already making better decisions by analyzing entire datasets from customers, employees, or even sensors embedded in products.” The statistical learning world continues to progress.
  • 21. SAS - unstructured text • http://www.youtube.com/user/SASsoftware?v=NHAq8jG4FX4&feature=pyv&ad=8557352196&kw=data%20analytics
  • 22. Pattern Based Strategy • "The ability to manage extreme data will be a core competency of enterprises that are increasingly using new forms of information — such as text, social and context — to look for patterns that support business decisions in what we call Pattern-Based Strategy," said Yvonne Genovese, vice president and distinguished analyst at Gartner. "Pattern-Based Strategy, as an engine of change, utilizes all the dimensions in its pattern-seeking process. It then provides the basis of the modeling for new business solutions, which allows the business to adapt. The seek-model-and-adapt cycle can then be completed in various mediums, such as social computing analysis or context-aware computing engines."
  • 23. Pattern Based Strategy • http://www.youtube.com/watch?v=r8N0L8Cz1qg&feature=BFa&list=UUSNX50LYGXWV_e5UWZGPGbw&lf=plpp_video
  • 24. EMC’s Big Data Video • http://www.youtube.com/watch?v=ILBV391a8Ic • O’Reilly’s Take • http://www.youtube.com/watch?v=Rn5rVGGfzy0&feature=related
  • 25. Tricks of the Trade • New Architecture • In Memory Analytics
  • 26.
  • 27.
  • 28.
  • 29.
  • 30. In-Memory Indexing at SAP • We have also got enterprise search. We really started doing that back in the 2003/2004 time period; that’s also when we started coming out with Business Warehouse Accelerator. That was when Google was just really starting to become Google, and we tried to do the same thing with enterprise data that Google does with website data as far as indexing it. So we also put the indexes in memory, so it sped up even further, and if you look at HANA now, it really is the next evolutionary step in that chain. This is in-memory processing, and this isn’t something just for a specialist. It really is a technology that’s matured to a level that it can run the entire business suite and run your entire company in-memory and get all those benefits for everything. • http://docs.media.bitpipe.com/io_10x/io_102428/item_477005/The%20Next%20Chapter%20of%20In-Memory%20Computing_PT_12.22.11.pdf
  • 31.
  • 32.
  • 33. For more on HADOOP • http://www.slideshare.net/PhilippeJulio/hadoop-architecture
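
The in-memory indexing idea from slide 30 can be illustrated with a toy example. The sketch below is not SAP's implementation; it only assumes that "indexing" means an inverted index held in RAM, and the function names and sample records are hypothetical.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercase token to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every token in the query."""
    tokens = query.lower().split()
    if not tokens:
        return set()
    # Start from the first token's postings and intersect with the rest.
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical enterprise records, kept entirely in memory.
docs = {
    1: "invoice for customer Acme",
    2: "customer complaint about invoice",
    3: "quarterly sales report",
}
index = build_index(docs)
matches = search(index, "customer invoice")
```

Because the whole index lives in memory, a lookup is a few hash-table reads and set intersections rather than a disk scan, which is the speed-up the slide describes.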

Editor's Notes

  1. Organizations everywhere now realize that there is immense insight and value locked inside their data, and that new infrastructure and approaches to data analysis allow us to unlock that value.
  2. This is what’s happened in the last four decades.
  3. These four factors also happen to be inputs for data generation processes.
  4. Sizes that were unimaginable a few years ago are now commonplace. Just storing and accessing the data can be difficult.
     SIZE :: MANAGED WITH :: STORED
     Small :: Excel, R :: fits in memory on one machine
     Medium :: indexed files, monolithic DB :: fits on disk on one machine
     Big :: Hadoop, distributed DB :: stored across many machines
     Generally: data too big to fit on a disk; ‘data-center’ scale.
  5. Data that is difficult for computers to understand. Principal examples are natural language text, images, video and more. Valuable info is locked up inside this data (e.g. Twitter).
  6. More data coming in faster. Decision windows getting smaller. Valuable to worthless in a matter of minutes. (seconds … no milliseconds)
  7. Source: Architecture for Big Data Analytics: MarkLogic white paper
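
The small / medium / big distinction in note 4 can be written as a simple rule of thumb. This is a minimal sketch, assuming a single machine with known RAM and disk capacity; the function name and the capacities are illustrative, not taken from the slides.

```python
GB = 10**9  # decimal gigabyte, used only for readable example numbers


def storage_tier(data_bytes, ram_bytes, disk_bytes):
    """Classify a data set by where it fits, following note 4's table."""
    if data_bytes <= ram_bytes:
        return "small"   # fits in memory on one machine (Excel, R)
    if data_bytes <= disk_bytes:
        return "medium"  # fits on disk on one machine (indexed files, monolithic DB)
    return "big"         # must be spread across many machines (Hadoop, distributed DB)


# Hypothetical machine: 16 GB of RAM, 2,000 GB of disk.
tiers = [storage_tier(size * GB, 16 * GB, 2000 * GB) for size in (2, 500, 50000)]
```

The thresholds move as hardware improves, which is consistent with the deck's definition of big data as a constantly moving target.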