SlideShare a Scribd company logo
Guide to Big Data
Analytics
BY GAHYA PANDIAN
Big Data Analytics
What is Big Data
Data analysis is nothing new. Even before computers were used, information gained in the course
of business or other activities was reviewed with the aim of making those processes more efficient
and more profitable. These were, of course, comparatively small-scale undertakings given the
limitations posed by resources and manpower; analysis had to be manual and was slow by
modern standards, but it was still worthwhile. Opinion polling, for example, has been carried out
since early in the 19th century, almost 200 years ago. The first national survey took place in 1916
and involved the publication Literary Digest sending out millions of postcards and counting the
returns. As a result, they correctly predicted Woodrow Wilson’s election as president.
Since then, volumes of data have grown exponentially. The advent of the internet and faster
computing has meant that huge quantities of information can now be harvested and used to
optimise business processes. The problem is that conventional methods were simply not suited to
crunching through all the numbers and making sense of them. The amount of information is
phenomenal, and within that information lies insights that can be extremely beneficial. Once
patterns are identified, they can be used to adjust business practices, create targeted campaigns
and discard ones that are not effective. However, as well as large amounts of storage, it takes
specialised software to be able to make sense of all this data in a useful way.
Big Data
Big Data’ is the emerging discipline of capturing, storing, processing,
analysing and visualising these huge quantities of information. The data sets
may start at a few terabytes and run to many petabytes – far more than
traditional data analysis packages can handle. In 2012 Gartner defined it as,
‘high volume, high velocity, and/or high variety information assets that
require new forms of processing to enable enhanced decision making, insight
discovery and process optimization.’ This ‘3V’ classification has been built on
since (particularly with the addition of veracity), such that Big Data is often
described in terms of the following characteristics:
Big Data
 Volume. Terabytes or petabytes of data are analysed. An estimated 2.5 quintillion bytes
of data (2.5 trillion gigabytes) are created every day, an amount which will only rise in
the future. However, the size of the dataset is not the only variable that characterises Big
Data.
 Variety. The dataset may contain many different forms of data – not simply a large
amount of the same type. The profusion of different kinds of mobile device and the
variety of content consumed on them on a wide range of platforms, for example, means
that companies can harvest data from an enormous array of sources, each telling them a
different part of the same picture.
 Velocity. Data may change on a constant basis. For example, modern cars may have 100
or so different sensors that continually monitor different aspects of performance.
Markets change on a moment-to-moment scale. Data is highly fluid, and snapshots are
not always enough.
 Veracity. The data acquired may not all be accurate, or much of it may be uncertain or
provisional in nature. Data quality is unreliable, especially when there is so much of it.
Any system of analysis must take this into account.
Big Data
In addition to the 4V characteristics, there are also two others to deal with:
 Variability. Data capture and volume may be inconsistent, not just
inaccurate, so varying quantities and qualities of data will be acquired at
different times.
 Together, these factors mean that managing the data can be an extremely
complex process, since there are many data sources with differing types
and formats of data, but these need to be correlated and made sense of if
they are to be useful.
Conclusion
 Big data isn’t just an emerging phenomenon. It’s already here and being used by
major companies to drive their business forwards. Traditional analytics packages
simply aren’t capable of dealing with the quantity, variety and changeability of data
that can now be harvested from diverse sources – machine sensors, text documents,
structured and unstructured data, social media and more. When these are combined
and analysed as a whole, new patterns emerge. The right big data package will allow
enterprises to track these trends in real time, spotting them as they occur and
enabling businesses to leverage the insights provided.
 However, not all big data platforms and software are alike. As ever, which you decide
on will depend on a number of factors. These include not just the nature of the data
you are working with, but organisational budgets, infrastructure and the skillset of your
team, amongst other things. Some solutions are designed to be used off-the-peg,
providing powerful visualisations and connecting easily to your data stores. Others are
intended to be more flexible but should only be used by those with coding expertise.
You should also think to the future, and the long-term implications of being tied to
your platform of choice – particularly in terms of open-source vs proprietary software.
Other Guides:
 Public Cloud: Top Rated Public Cloud Computing Providers, Services,
Security & Technologies
 Cloud Backup: Guide to Cloud Backup Services, Companies, Software and
Solutions
 Hybrid Cloud: Guide to Hybrid Cloud Storage and Computing Companies
 Virtual Data Room: Virtual Data Room Providers, Services, Reviews and
Comparisons

More Related Content

What's hot

2 pc enterprise summit cronin newfinal aug 18
2 pc enterprise summit cronin newfinal aug 182 pc enterprise summit cronin newfinal aug 18
2 pc enterprise summit cronin newfinal aug 18
IntelAPAC
 
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
Precisely
 
Internet of Things and Big Data: Vision and Concrete Use Cases
Internet of Things and Big Data: Vision and Concrete Use CasesInternet of Things and Big Data: Vision and Concrete Use Cases
Internet of Things and Big Data: Vision and Concrete Use CasesMongoDB
 
Introduction to edge analytics- Intelligent IoT
Introduction to edge analytics- Intelligent IoTIntroduction to edge analytics- Intelligent IoT
Introduction to edge analytics- Intelligent IoT
Shreya Mukhopadhyay
 
Short introduction to Big Data Analytics, the Internet of Things, and their s...
Short introduction to Big Data Analytics, the Internet of Things, and their s...Short introduction to Big Data Analytics, the Internet of Things, and their s...
Short introduction to Big Data Analytics, the Internet of Things, and their s...
Andrei Khurshudov
 
Internet of Things
Internet of ThingsInternet of Things
Internet of Things
Mphasis
 
Big data: the next frontier for innovation, competition and productivity
Big data: the next frontier for innovation, competition and productivityBig data: the next frontier for innovation, competition and productivity
Big data: the next frontier for innovation, competition and productivity
Andrea Rabbaglietti
 
Big Data Analytics - A Glimpse
Big Data Analytics - A GlimpseBig Data Analytics - A Glimpse
Big Data Analytics - A Glimpse
Laguna State Polytechnic University
 
Essential Tools For Your Big Data Arsenal
Essential Tools For Your Big Data ArsenalEssential Tools For Your Big Data Arsenal
Essential Tools For Your Big Data Arsenal
MongoDB
 
Big Data
Big DataBig Data
Big Data
Seminar Links
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
NAGARAJAGIDDE
 
The Internet of Things
The Internet of ThingsThe Internet of Things
The Internet of Things
Michelle Marin
 
Harnessing the Power of IoT - Xamarin Experience 2017
Harnessing the Power of IoT - Xamarin Experience 2017 Harnessing the Power of IoT - Xamarin Experience 2017
Harnessing the Power of IoT - Xamarin Experience 2017
Xpand IT
 
Keamanan Siber di Era Big Data
Keamanan Siber di Era Big DataKeamanan Siber di Era Big Data
Keamanan Siber di Era Big Data
Heru Sutadi
 
Ppt for Application of big data
Ppt for Application of big dataPpt for Application of big data
Ppt for Application of big data
Prashant Sharma
 
Key Data Management Requirements for the IoT
Key Data Management Requirements for the IoTKey Data Management Requirements for the IoT
Key Data Management Requirements for the IoTMongoDB
 
Big data
Big dataBig data
SymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data AnalyticSymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data Analytic
PMI Indonesia Chapter
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Hritika Raj
 

What's hot (20)

2 pc enterprise summit cronin newfinal aug 18
2 pc enterprise summit cronin newfinal aug 182 pc enterprise summit cronin newfinal aug 18
2 pc enterprise summit cronin newfinal aug 18
 
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
 
Internet of Things and Big Data: Vision and Concrete Use Cases
Internet of Things and Big Data: Vision and Concrete Use CasesInternet of Things and Big Data: Vision and Concrete Use Cases
Internet of Things and Big Data: Vision and Concrete Use Cases
 
Introduction to edge analytics- Intelligent IoT
Introduction to edge analytics- Intelligent IoTIntroduction to edge analytics- Intelligent IoT
Introduction to edge analytics- Intelligent IoT
 
Big Data
Big DataBig Data
Big Data
 
Short introduction to Big Data Analytics, the Internet of Things, and their s...
Short introduction to Big Data Analytics, the Internet of Things, and their s...Short introduction to Big Data Analytics, the Internet of Things, and their s...
Short introduction to Big Data Analytics, the Internet of Things, and their s...
 
Internet of Things
Internet of ThingsInternet of Things
Internet of Things
 
Big data: the next frontier for innovation, competition and productivity
Big data: the next frontier for innovation, competition and productivityBig data: the next frontier for innovation, competition and productivity
Big data: the next frontier for innovation, competition and productivity
 
Big Data Analytics - A Glimpse
Big Data Analytics - A GlimpseBig Data Analytics - A Glimpse
Big Data Analytics - A Glimpse
 
Essential Tools For Your Big Data Arsenal
Essential Tools For Your Big Data ArsenalEssential Tools For Your Big Data Arsenal
Essential Tools For Your Big Data Arsenal
 
Big Data
Big DataBig Data
Big Data
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
 
The Internet of Things
The Internet of ThingsThe Internet of Things
The Internet of Things
 
Harnessing the Power of IoT - Xamarin Experience 2017
Harnessing the Power of IoT - Xamarin Experience 2017 Harnessing the Power of IoT - Xamarin Experience 2017
Harnessing the Power of IoT - Xamarin Experience 2017
 
Keamanan Siber di Era Big Data
Keamanan Siber di Era Big DataKeamanan Siber di Era Big Data
Keamanan Siber di Era Big Data
 
Ppt for Application of big data
Ppt for Application of big dataPpt for Application of big data
Ppt for Application of big data
 
Key Data Management Requirements for the IoT
Key Data Management Requirements for the IoTKey Data Management Requirements for the IoT
Key Data Management Requirements for the IoT
 
Big data
Big dataBig data
Big data
 
SymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data AnalyticSymEx 2015 - Agile Process for Big Data Analytic
SymEx 2015 - Agile Process for Big Data Analytic
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 

Viewers also liked

Cloud computing security
Cloud computing securityCloud computing security
Cloud computing security
Gahya Pandian
 
ATAGTR2017 Artificial Intelligence in Software Testing – Demystified
ATAGTR2017 Artificial Intelligence in Software Testing – DemystifiedATAGTR2017 Artificial Intelligence in Software Testing – Demystified
ATAGTR2017 Artificial Intelligence in Software Testing – Demystified
Agile Testing Alliance
 
Artificial Intelligence in Project Management by Dr. Khaled A. Hamdy
Artificial Intelligence in Project Management by  Dr. Khaled A. HamdyArtificial Intelligence in Project Management by  Dr. Khaled A. Hamdy
Artificial Intelligence in Project Management by Dr. Khaled A. Hamdy
Agile ME
 
When Content Meets Data, Big Things Happen - Peter Krmpotic, Adobe
When Content Meets Data, Big Things Happen - Peter Krmpotic, AdobeWhen Content Meets Data, Big Things Happen - Peter Krmpotic, Adobe
When Content Meets Data, Big Things Happen - Peter Krmpotic, Adobe
NewsCred
 
Oferta en artículos para Ortodoncia - primavera 2017 - DM Ceosa / Ortoceosa
Oferta en artículos para Ortodoncia - primavera 2017 - DM Ceosa / OrtoceosaOferta en artículos para Ortodoncia - primavera 2017 - DM Ceosa / Ortoceosa
Oferta en artículos para Ortodoncia - primavera 2017 - DM Ceosa / Ortoceosa
Ortocervera
 
Adobe Media Optimizer
Adobe Media OptimizerAdobe Media Optimizer
Adobe Media Optimizer
Pankaj Kashyap
 
Adobe Media Optimizer_What is Adobe Media Optimizer
Adobe Media Optimizer_What is Adobe Media OptimizerAdobe Media Optimizer_What is Adobe Media Optimizer
Adobe Media Optimizer_What is Adobe Media OptimizerAdam Higdon
 
Cloud Computing Presentation
Cloud Computing PresentationCloud Computing Presentation
Cloud Computing Presentation
JReifman
 
Call of Duty
Call of DutyCall of Duty
Call of Duty
Eric Sevilla
 
Ps4 vs xbox one | àlex Gómez i Arnau Marín
Ps4 vs xbox one | àlex Gómez i Arnau MarínPs4 vs xbox one | àlex Gómez i Arnau Marín
Ps4 vs xbox one | àlex Gómez i Arnau Marín
TheFirecraker
 
iPhone Cost Components
iPhone Cost ComponentsiPhone Cost Components
iPhone Cost Components
Mekko Graphics
 
Emergence of ITOA: An Evolution in IT Monitoring and Management
Emergence of ITOA: An Evolution in IT Monitoring and ManagementEmergence of ITOA: An Evolution in IT Monitoring and Management
Emergence of ITOA: An Evolution in IT Monitoring and Management
HCL Technologies
 
Civil War and American Literature (General Perspective )
Civil War and American Literature (General Perspective )Civil War and American Literature (General Perspective )
Civil War and American Literature (General Perspective )
Mecnun Genç
 
образовательный форум экспосибирь 2017
образовательный форум  экспосибирь 2017образовательный форум  экспосибирь 2017
образовательный форум экспосибирь 2017
Galina02
 
The What, Why and How of Big Data
The What, Why and How of Big DataThe What, Why and How of Big Data
The What, Why and How of Big Data
Luca Naso
 

Viewers also liked (16)

Cloud computing security
Cloud computing securityCloud computing security
Cloud computing security
 
ATAGTR2017 Artificial Intelligence in Software Testing – Demystified
ATAGTR2017 Artificial Intelligence in Software Testing – DemystifiedATAGTR2017 Artificial Intelligence in Software Testing – Demystified
ATAGTR2017 Artificial Intelligence in Software Testing – Demystified
 
Artificial Intelligence in Project Management by Dr. Khaled A. Hamdy
Artificial Intelligence in Project Management by  Dr. Khaled A. HamdyArtificial Intelligence in Project Management by  Dr. Khaled A. Hamdy
Artificial Intelligence in Project Management by Dr. Khaled A. Hamdy
 
When Content Meets Data, Big Things Happen - Peter Krmpotic, Adobe
When Content Meets Data, Big Things Happen - Peter Krmpotic, AdobeWhen Content Meets Data, Big Things Happen - Peter Krmpotic, Adobe
When Content Meets Data, Big Things Happen - Peter Krmpotic, Adobe
 
Oferta en artículos para Ortodoncia - primavera 2017 - DM Ceosa / Ortoceosa
Oferta en artículos para Ortodoncia - primavera 2017 - DM Ceosa / OrtoceosaOferta en artículos para Ortodoncia - primavera 2017 - DM Ceosa / Ortoceosa
Oferta en artículos para Ortodoncia - primavera 2017 - DM Ceosa / Ortoceosa
 
Adobe Media Optimizer
Adobe Media OptimizerAdobe Media Optimizer
Adobe Media Optimizer
 
CloudSecurity
CloudSecurityCloudSecurity
CloudSecurity
 
Adobe Media Optimizer_What is Adobe Media Optimizer
Adobe Media Optimizer_What is Adobe Media OptimizerAdobe Media Optimizer_What is Adobe Media Optimizer
Adobe Media Optimizer_What is Adobe Media Optimizer
 
Cloud Computing Presentation
Cloud Computing PresentationCloud Computing Presentation
Cloud Computing Presentation
 
Call of Duty
Call of DutyCall of Duty
Call of Duty
 
Ps4 vs xbox one | àlex Gómez i Arnau Marín
Ps4 vs xbox one | àlex Gómez i Arnau MarínPs4 vs xbox one | àlex Gómez i Arnau Marín
Ps4 vs xbox one | àlex Gómez i Arnau Marín
 
iPhone Cost Components
iPhone Cost ComponentsiPhone Cost Components
iPhone Cost Components
 
Emergence of ITOA: An Evolution in IT Monitoring and Management
Emergence of ITOA: An Evolution in IT Monitoring and ManagementEmergence of ITOA: An Evolution in IT Monitoring and Management
Emergence of ITOA: An Evolution in IT Monitoring and Management
 
Civil War and American Literature (General Perspective )
Civil War and American Literature (General Perspective )Civil War and American Literature (General Perspective )
Civil War and American Literature (General Perspective )
 
образовательный форум экспосибирь 2017
образовательный форум  экспосибирь 2017образовательный форум  экспосибирь 2017
образовательный форум экспосибирь 2017
 
The What, Why and How of Big Data
The What, Why and How of Big DataThe What, Why and How of Big Data
The What, Why and How of Big Data
 

Similar to Guide to big data analytics

BigData Analytics_1.7
BigData Analytics_1.7BigData Analytics_1.7
BigData Analytics_1.7Rohit Mittal
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
Mohit Saini
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
saranya270513
 
Snowball Group Whitepaper - Spotlight on Big Data
Snowball Group Whitepaper - Spotlight on Big DataSnowball Group Whitepaper - Spotlight on Big Data
Snowball Group Whitepaper - Spotlight on Big Data
Snowball Group
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
Experfy
 
Big data
Big dataBig data
Big data
Abhishek Palo
 
Big data
Big dataBig data
Big data
Abhishek Palo
 
Unlocking big data
Unlocking big dataUnlocking big data
Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?
Jennifer Walker
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
Aditya205306
 
new.pptx
new.pptxnew.pptx
Big data upload
Big data uploadBig data upload
Big data upload
Bhavin Tandel
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdf
Pridesys IT Ltd.
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdf
Pridesys IT Ltd.
 
Practical analytics john enoch white paper
Practical analytics john enoch white paperPractical analytics john enoch white paper
Practical analytics john enoch white paperJohn Enoch
 
Know The What, Why, and How of Big Data_.pdf
Know The What, Why, and How of Big Data_.pdfKnow The What, Why, and How of Big Data_.pdf
Know The What, Why, and How of Big Data_.pdf
Anil
 
Mejorar la toma de decisiones con Big Data
Mejorar la toma de decisiones con Big DataMejorar la toma de decisiones con Big Data
Mejorar la toma de decisiones con Big Data
Miguel Ángel Gómez
 
What's the Big Deal About Big Data?
What's the Big Deal About Big Data?What's the Big Deal About Big Data?
What's the Big Deal About Big Data?
Logi Analytics
 
Monetize Big Data
Monetize Big DataMonetize Big Data
Bidata
BidataBidata
Bidata
Tamojit Das
 

Similar to Guide to big data analytics (20)

BigData Analytics_1.7
BigData Analytics_1.7BigData Analytics_1.7
BigData Analytics_1.7
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
 
Snowball Group Whitepaper - Spotlight on Big Data
Snowball Group Whitepaper - Spotlight on Big DataSnowball Group Whitepaper - Spotlight on Big Data
Snowball Group Whitepaper - Spotlight on Big Data
 
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
new.pptx
new.pptxnew.pptx
new.pptx
 
Big data upload
Big data uploadBig data upload
Big data upload
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdf
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdf
 
Practical analytics john enoch white paper
Practical analytics john enoch white paperPractical analytics john enoch white paper
Practical analytics john enoch white paper
 
Know The What, Why, and How of Big Data_.pdf
Know The What, Why, and How of Big Data_.pdfKnow The What, Why, and How of Big Data_.pdf
Know The What, Why, and How of Big Data_.pdf
 
Mejorar la toma de decisiones con Big Data
Mejorar la toma de decisiones con Big DataMejorar la toma de decisiones con Big Data
Mejorar la toma de decisiones con Big Data
 
What's the Big Deal About Big Data?
What's the Big Deal About Big Data?What's the Big Deal About Big Data?
What's the Big Deal About Big Data?
 
Monetize Big Data
Monetize Big DataMonetize Big Data
Monetize Big Data
 
Bidata
BidataBidata
Bidata
 

Recently uploaded

How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 

Recently uploaded (20)

How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 

Guide to big data analytics

  • 1. Guide to Big Data Analytics BY GAHYA PANDIAN
  • 3. What is Big Data Data analysis is nothing new. Even before computers were used, information gained in the course of business or other activities was reviewed with the aim of making those processes more efficient and more profitable. These were, of course, comparatively small-scale undertakings given the limitations posed by resources and manpower; analysis had to be manual and was slow by modern standards, but it was still worthwhile. Opinion polling, for example, has been carried out since early in the 19th century, almost 200 years ago. The first national survey took place in 1916 and involved the publication Literary Digest sending out millions of postcards and counting the returns. As a result, they correctly predicted Woodrow Wilson’s election as president. Since then, volumes of data have grown exponentially. The advent of the internet and faster computing has meant that huge quantities of information can now be harvested and used to optimise business processes. The problem is that conventional methods were simply not suited to crunching through all the numbers and making sense of them. The amount of information is phenomenal, and within that information lies insights that can be extremely beneficial. Once patterns are identified, they can be used to adjust business practices, create targeted campaigns and discard ones that are not effective. However, as well as large amounts of storage, it takes specialised software to be able to make sense of all this data in a useful way.
  • 4. Big Data Big Data’ is the emerging discipline of capturing, storing, processing, analysing and visualising these huge quantities of information. The data sets may start at a few terabytes and run to many petabytes – far more than traditional data analysis packages can handle. In 2012 Gartner defined it as, ‘high volume, high velocity, and/or high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization.’ This ‘3V’ classification has been built on since (particularly with the addition of veracity), such that Big Data is often described in terms of the following characteristics:
  • 5. Big Data  Volume. Terabytes or petabytes of data are analysed. An estimated 2.5 quintillion bytes of data (2.5 trillion gigabytes) are created every day, an amount which will only rise in the future. However, the size of the dataset is not the only variable that characterises Big Data.  Variety. The dataset may contain many different forms of data – not simply a large amount of the same type. The profusion of different kinds of mobile device and the variety of content consumed on them on a wide range of platforms, for example, means that companies can harvest data from an enormous array of sources, each telling them a different part of the same picture.  Velocity. Data may change on a constant basis. For example, modern cars may have 100 or so different sensors that continually monitor different aspects of performance. Markets change on a moment-to-moment scale. Data is highly fluid, and snapshots are not always enough.  Veracity. The data acquired may not all be accurate, or much of it may be uncertain or provisional in nature. Data quality is unreliable, especially when there is so much of it. Any system of analysis must take this into account.
  • 6. Big Data In addition to the 4V characteristics, there are also two others to deal with:  Variability. Data capture and volume may be inconsistent, not just inaccurate, so varying quantities and qualities of data will be acquired at different times.  Together, these factors mean that managing the data can be an extremely complex process, since there are many data sources with differing types and formats of data, but these need to be correlated and made sense of if they are to be useful.
  • 7. Conclusion  Big data isn’t just an emerging phenomenon. It’s already here and being used by major companies to drive their business forwards. Traditional analytics packages simply aren’t capable of dealing with the quantity, variety and changeability of data that can now be harvested from diverse sources – machine sensors, text documents, structured and unstructured data, social media and more. When these are combined and analysed as a whole, new patterns emerge. The right big data package will allow enterprises to track these trends in real time, spotting them as they occur and enabling businesses to leverage the insights provided.  However, not all big data platforms and software are alike. As ever, which you decide on will depend on a number of factors. These include not just the nature of the data you are working with, but organisational budgets, infrastructure and the skillset of your team, amongst other things. Some solutions are designed to be used off-the-peg, providing powerful visualisations and connecting easily to your data stores. Others are intended to be more flexible but should only be used by those with coding expertise. You should also think to the future, and the long-term implications of being tied to your platform of choice – particularly in terms of open-source vs proprietary software.
  • 8. Other Guides:  Public Cloud: Top Rated Public Cloud Computing Providers, Services, Security & Technologies  Cloud Backup: Guide to Cloud Backup Services, Companies, Software and Solutions  Hybrid Cloud: Guide to Hybrid Cloud Storage and Computing Companies  Virtual Data Room: Virtual Data Room Providers, Services, Reviews and Comparisons