White Paper
The ABCs of Big Data –
Analytics, Bandwidth and Content
Richard Treadway and Ingo Fuchs, NetApp
March 2012 | WP-7147
EXECUTIVE SUMMARY
Enterprises are entering a new era of scale, where the amount of data processed and stored
is breaking down every architectural construct in the storage industry. NetApp delivers
solutions that address big data scale through the “Big Data ABCs” – analytics, bandwidth and
content – enabling customers to gain insight into massive datasets, move data quickly, and
store important content for long periods of time without increasing operational complexity.
TABLE OF CONTENTS
1 ENTERING A NEW ERA OF BIG SCALE
2 SOURCES OF BIG SCALE
3 SHORTCOMINGS OF CURRENT APPROACHES
4 INFRASTRUCTURE BREAKING POINTS
5 THE ABCS OF DATA AT SCALE
6 SUMMARY
LIST OF FIGURES
Figure 1) Where is your infrastructure breaking?
Figure 2) The NetApp Big Data ABCs: Analytics, Bandwidth, and Content
1 ENTERING A NEW ERA OF BIG SCALE
In the 1990s, IT teams were focused on obtaining optimal performance from the key applications and
infrastructure of their enterprises. These siloed “systems of record” typically did a good job of keeping
track of vital information, but they were very expensive and did not offer sufficient drill-down insight into
the data to drive business advantage. In the 2000s, the IT focus shifted to efficiency – how to do more
with less. Technologies like virtualization, sharing, and consolidation of the enterprise’s existing
infrastructure became the key drivers for IT.
We are now entering a new era of big scale, where the amount of data processed and stored by
enterprises is breaking down every architectural construct in the storage industry today. As a result, IT
teams are trying to convert these existing systems of record, built back in the 1990s and 2000s, into
“systems of engagement” – systems that can efficiently deliver the necessary information, to the right
people, in real time, to help them perform more sophisticated analyses and make better business
decisions.
Evolving from Systems of Record to Systems of Engagement
Data by itself has no value. Value comes from using the data to drive business results, offer services to
customers, and increase revenue. The challenge for scalable storage is to enable these business
outcomes from dramatically larger datasets.
2 SOURCES OF BIG SCALE
This massive increase in scale is occurring for a number of reasons. Because of cost pressures, many
companies are consolidating their data centers – they can no longer afford for every business unit to have
its own IT infrastructure distributed around the globe. The move to cloud computing also contributes to
increased scale, aggregating the demand of hundreds of thousands of users onto fewer, centralized
systems.
Another source of the increase in scale is the massive growth in machine-generated and user-generated
data. Digital technologies are moving to denser media, photos have all gone digital, videos are using
higher resolution, and advanced analytics require more storage. Furthermore, machine-generated data
from sensor networks, buyer behavior tracking, and other sources contributes to much larger datasets that
need to be understood and commercialized. In short, the amount of data is increasing and the data
objects themselves are getting bigger. All of these forces together put an enormous amount of scale
pressure on existing infrastructures, especially the storage platform. This is what NetApp refers to as the
Big Data Challenge.
Where is Big Data Coming From?
Although human-generated data, such as Facebook pictures and Tweets, is getting the most attention
in the media, the biggest data growth comes from machine-generated datasets, such as consumer
behavior tracking and financial market analyses.
3 SHORTCOMINGS OF CURRENT APPROACHES
Today’s enterprises are finding it difficult to manage the exponential growth in big data. Traditional
approaches can’t scale to the level needed to be able to ingest all of the data, analyze it at the speed at
which it arrives, and store the relevant datasets efficiently for extended periods of time. The industry as a
whole has started to get a handle on how to manage the increased infrastructure complexity in a virtual
world, but handling infrastructure in a scalable world presents some very serious challenges.
Time-to-information is critical for enterprises to derive maximum value from their data. If it takes weeks or
months to run an analysis, it won’t be timely enough to detect a pattern that may affect the business in an
instant. Compliance is also a significant challenge for many enterprises. Regulated organizations may
have to keep data for very long periods of time – or forever. And they are required to find the data quickly
when needed for reporting or during industry audits.
In summary, the Big Data Challenge is all about gaining business advantage – how to obtain the most
value for the enterprise from this immense digital universe of information.
4 INFRASTRUCTURE BREAKING POINTS
Big data is breaking today’s storage infrastructure along three major axes, as illustrated in Figure 1.
Complexity. Data is no longer just about text and numbers; it's about real-time events and shared
infrastructure. The information is now linked, it is high fidelity, and it consists of multiple data types.
Applying normal algorithms for search, storage, and categorization is becoming much more complex
and inefficient.
Speed. How fast is the data coming in? High-definition video, streaming media over the Internet to
player devices, slow-motion video for surveillance – all of these have very high ingestion rates.
Businesses have to keep up with the data flow to make the information useful. They also have to
keep up with ingestion rates to drive faster business outcomes – or in the military, to save lives.
Volume. All collected data must be stored in a location that is secure and always available. With such
high volumes of data, IT teams have to make decisions about what is “too much data.” For example,
they might flush all data each week and start all over the following week. But for many applications
this is not an option, so more data must be stored longer – without increasing the operational
complexity. This can cause the infrastructure to quickly break on the axis of volume.
Figure 1) Where is your infrastructure breaking?
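To make the speed and volume axes concrete, the following is a minimal back-of-the-envelope sketch in Python. It is an illustration only, not a NetApp sizing method: the function names, the 500-stream workload, and the 8 Mbit/s bitrate are hypothetical assumptions, and the arithmetic ignores compression, deduplication, and protection overhead.

```python
# Back-of-the-envelope sizing sketch; all workload figures are hypothetical.

SECONDS_PER_DAY = 24 * 60 * 60


def sustained_mb_per_s(streams: int, mbit_per_s_per_stream: float) -> float:
    """Aggregate ingest rate in MB/s for a number of equal-bitrate streams."""
    return streams * mbit_per_s_per_stream / 8  # 8 bits per byte


def stored_terabytes(ingest_mb_per_s: float, retention_days: int) -> float:
    """Capacity in decimal TB needed to retain a constant ingest stream."""
    total_mb = ingest_mb_per_s * SECONDS_PER_DAY * retention_days
    return total_mb / 1_000_000  # 1 TB = 1,000,000 MB in decimal units


if __name__ == "__main__":
    # Assumed workload: 500 video streams at 8 Mbit/s each.
    rate = sustained_mb_per_s(streams=500, mbit_per_s_per_stream=8.0)
    for days in (7, 365):
        tb = stored_terabytes(rate, days)
        print(f"{days:>3} days at {rate:.0f} MB/s: {tb:,.1f} TB")
```

Under these assumed numbers, flushing data weekly keeps the footprint near 302 TB, while retaining a full year at the same ingest rate requires roughly 15.8 PB. The gap shows how quickly longer retention multiplies capacity.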
5 THE ABCS OF DATA AT SCALE
NetApp has divided the solution sets for managing data at scale into three main areas, called the “Big
Data ABCs” – analytics, bandwidth, and content. As shown in Figure 2, each area has its own specific
challenges and unique infrastructure requirements.
Analytics. This solution area focuses on providing efficient analytics for extremely large datasets.
Analytics is all about gaining insight, taking advantage of the digital universe, and turning data
into high-quality information, providing deeper insights about the business to enable better
decisions.
Bandwidth. This solution area focuses on obtaining better performance for very fast workloads.
High-bandwidth applications include high-performance computing: the ability to perform complex
analyses at extremely high speeds; high-performance video streaming for surveillance and
mission planning; and video editing and play-out in media and entertainment.
Content. This solution area focuses on the need to provide boundless, secure, scalable data
storage. Content solutions must enable storing virtually unlimited amounts of data, so that
enterprises can store as much data as they want, find it when they need it, and never lose it.
Figure 2) The NetApp Big Data ABCs: Analytics, Bandwidth, and Content
6 SUMMARY
The new era of scale is breaking existing storage architectures. Enterprises need to ask the following
questions: Are there any opportunities for us to take better advantage of our data? What insights can
really help our business? Where could we use our data to competitive advantage? What if we could link
the trends in buying patterns to people's physical location at a point in time to give them a better
experience? What if we could detect when fraud is about to happen? Can we identify the likely hotspots
for failure before it happens?
The list of questions is unlimited. But the answer is always the same. NetApp offers the storage solutions
that enable enterprises to take advantage of big data and transform it into greater business value. The
universe of data can be an information gold mine. NetApp helps enterprises find the value of this data and
turn it into real business advantage.
Your Big Data innovation is based on NetApp
NetApp’s Big Data offerings provide a foundation to spark innovation, make better decisions and drive
successful business outcomes at the speed of today’s business.
NetApp provides no representations or warranties regarding the accuracy, reliability or serviceability of any
information or recommendations provided in this publication, or with respect to any results that may be
obtained by the use of the information or observance of any recommendations provided herein. The
information in this document is distributed AS IS, and the use of this information or the implementation of
any recommendations or techniques herein is a customer’s responsibility and depends on the customer’s
ability to evaluate and integrate them into the customer’s operational environment. This document and
the information contained herein may be used solely in connection with the NetApp products discussed
in this document.
Go further, faster®
© 2012 NetApp, Inc. All rights reserved. No portions of this document may be reproduced without prior written consent of NetApp, Inc.
Specifications are subject to change without notice. NetApp, the NetApp logo, Go further, faster, are trademarks or registered trademarks
of NetApp, Inc. in the United States and/or other countries. All other brands or products are trademarks or registered trademarks of their
respective holders and should be treated as such. WP-7147-0312

 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 

Ab cs of big data

  • 1.
    White Paper
    The ABCs of Big Data – Analytics, Bandwidth and Content
    Richard Treadway and Ingo Fuchs, NetApp
    March 2012 | WP-7147
    EXECUTIVE SUMMARY
    Enterprises are entering a new era of scale, where the amount of data processed and stored is breaking down every architectural construct in the storage industry. NetApp delivers solutions that address big data scale through the “Big Data ABCs” – analytics, bandwidth and content – enabling customers to gain insight into massive datasets, move data quickly, and store important content for long periods of time without increasing operational complexity.
  • 2.
    TABLE OF CONTENTS
    1 ENTERING A NEW ERA OF BIG SCALE
    2 SOURCES OF BIG SCALE
    3 SHORTCOMINGS OF CURRENT APPROACHES
    4 INFRASTRUCTURE BREAKING POINTS
    5 THE ABCS OF DATA AT SCALE
    6 SUMMARY
    LIST OF FIGURES
    Figure 1) Where is your infrastructure breaking?
    Figure 2) The NetApp Big Data ABCs: Analytics, Bandwidth, and Content
  • 3.
    1 ENTERING A NEW ERA OF BIG SCALE
    In the 1990s, IT teams were focused on obtaining optimal performance from the key applications and infrastructure of their enterprises. These siloed “systems of record” typically did a good job of keeping track of vital information, but they were very expensive and did not offer sufficient drill-down insight into the data to drive business advantage.
    In the 2000s, the IT focus shifted to efficiency: how to do more with less. Technologies like virtualization, sharing, and consolidation of the enterprise’s existing infrastructure became the key drivers for IT.
    We are now entering a new era of big scale, where the amount of data processed and stored by enterprises is breaking down every architectural construct in the storage industry today. As a result, IT teams are trying to convert these existing systems of record, built back in the 1990s and 2000s, into “systems of engagement” – systems that can efficiently deliver the necessary information, to the right people, in real time, to help them perform more sophisticated analyses and make better business decisions.
    Evolving from Systems of Record to Systems of Engagement
    Data by itself has no value. Value comes from using the data to drive business results, offer services to customers, and increase revenue. The challenge for scalable storage is to enable these business outcomes from dramatically larger datasets.
    2 SOURCES OF BIG SCALE
    This massive increase in scale is occurring for a number of reasons. Because of cost pressures, many companies are consolidating their data centers; they can no longer afford for every business unit to have its own IT infrastructure distributed around the globe. The move to cloud computing also contributes to increased scale, aggregating the demand of hundreds of thousands of users onto fewer, centralized systems.
    Another source of the increase in scale is the massive growth in machine-generated and user-generated data. Digital technologies are moving to denser media, photos have all gone digital, videos are using higher resolution, and advanced analytics require more storage. Furthermore, machine-generated data from sensor networks, buyer behavior tracking, and other sources contributes to much larger datasets that need to be understood and commercialized. In short, the amount of data is increasing and the data objects themselves are getting bigger.
    All of these forces together put an enormous amount of scale pressure on existing infrastructures, especially the storage platform. This is what NetApp refers to as the Big Data Challenge.
    Where is Big Data Coming From?
    Although human-generated data, such as Facebook pictures and Tweets, is getting the most attention in the media, the biggest data growth comes from machine-generated datasets, such as consumer behavior tracking and financial market analyses.
    3 SHORTCOMINGS OF CURRENT APPROACHES
    Today’s enterprises are finding it difficult to manage the exponential growth in big data. Traditional approaches can’t scale to the level needed to be able to ingest all of the data, analyze it at the speed at
  • 4.
    which it arrives, and store the relevant datasets efficiently for extended periods of time. The industry as a whole has started to get a handle on how to manage the increased infrastructure complexity in a virtual world, but handling infrastructure in a scalable world presents some very serious challenges.
    Time-to-information is critical for enterprises to derive maximum value from their data. If it takes weeks or months to run an analysis, it won’t be timely enough to detect a pattern that may affect the business in an instant.
    Compliance is also a significant challenge for many enterprises. Regulated organizations may have to keep data for very long periods of time – or forever. And they are required to find the data quickly when needed for reporting or during industry audits.
    In summary, the Big Data Challenge is all about gaining business advantage – how to obtain the most value for the enterprise from this immense digital universe of information.
    4 INFRASTRUCTURE BREAKING POINTS
    Big data is breaking today’s storage infrastructure along three major axes, as illustrated in Figure 1.
    Complexity. Data is no longer just about text and numbers; it's about real-time events and shared infrastructure. The information is now linked, it is high fidelity, and it consists of multiple data types. Applying normal algorithms for search, storage, and categorization is becoming much more complex and inefficient.
    Speed. How fast is the data coming in? High-definition video, streaming media over the Internet to player devices, slow-motion video for surveillance – all of these have very high ingestion rates. Businesses have to keep up with the data flow to make the information useful. They also have to keep up with ingestion rates to drive faster business outcomes – or in the military, to save lives.
    Volume. All collected data must be stored in a location that is secure and always available. With such high volumes of data, IT teams have to make decisions about what is “too much data.” For example, they might flush all data each week and start all over the following week. But for many applications this is not an option, so more data must be stored longer – without increasing the operational complexity. This can cause the infrastructure to quickly break on the axis of volume.
    Figure 1) Where is your infrastructure breaking?
  • 5.
    5 THE ABCS OF DATA AT SCALE
    NetApp has divided the solution sets for managing data at scale into three main areas, called the “Big Data ABCs” – analytics, bandwidth, and content. As shown in Figure 2, each area has its own specific challenges and unique infrastructure requirements.
    Analytics. This solution area focuses on providing efficient analytics for extremely large datasets. Analytics is all about gaining insight, taking advantage of the digital universe, and turning data into high-quality information, providing deeper insights about the business to enable better decisions.
    Bandwidth. This solution area focuses on obtaining better performance for very fast workloads. High-bandwidth applications include high-performance computing (the ability to perform complex analyses at extremely high speeds); high-performance video streaming for surveillance and mission planning; and video editing and play-out in media and entertainment.
    Content. This solution area focuses on the need to provide boundless, secure, scalable data storage. Content solutions must enable storing virtually unlimited amounts of data, so that enterprises can store as much data as they want, find it when they need it, and never lose it.
    Figure 2) The NetApp Big Data ABCs: Analytics, Bandwidth, and Content
    6 SUMMARY
    The new era of scale is breaking existing storage architectures. Enterprises need to ask the following questions: Are there any opportunities for us to take better advantage of our data? What insights can really help our business? Where could we use our data to competitive advantage? What if we could link the trends in buying patterns to people's physical location at a point in time to give them a better
  • 6.
    experience? What if we could detect when fraud is about to happen? Can we identify the likely hotspots for failure before it happens?
    The list of questions is unlimited. But the answer is always the same. NetApp offers the storage solutions that enable enterprises to take advantage of big data and transform it into greater business value. The universe of data can be an information gold mine. NetApp helps enterprises find the value of this data and turn it into real business advantage.
    Your Big Data innovation is based on NetApp
    NetApp’s Big Data offerings provide a foundation to spark innovation, make better decisions and drive successful business outcomes at the speed of today’s business.
    NetApp provides no representations or warranties regarding the accuracy, reliability or serviceability of any information or recommendations provided in this publication, or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS, and the use of this information or the implementation of any recommendations or techniques herein is a customer’s responsibility and depends on the customer’s ability to evaluate and integrate them into the customer’s operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.
    Go further, faster®
    © 2012 NetApp, Inc. All rights reserved. No portions of this document may be reproduced without prior written consent of NetApp, Inc. Specifications are subject to change without notice. NetApp, the NetApp logo, and Go further, faster are trademarks or registered trademarks of NetApp, Inc. in the United States and/or other countries. All other brands or products are trademarks or registered trademarks of their respective holders and should be treated as such. WP-7147-0312