SlideShare a Scribd company logo
1 of 21
By ,
Abhishek Palo
Regd no:1301308234
CONTENTS
 AN INTRODUCTION TOTHE WORLD OF DATA ?
 WHAT IS BIG DATA ?
 WHO CREATED BIG DATA ?
 WHEN IT WAS CREATED ?
 WHY BIG DATA ?
 WHERE WE’RE USING IT ?
 HOW TO USE IT ?
5’W 1’H
OF BIG DATA
WHAT IS DATA.....?
facts and statistics collected together for reference or analysis.
OR
the quantities, characters, or symbols on which operations
are performed by a computer, which may be stored and
transmitted in the form of electrical signals and recorded
on magnetic, optical, or mechanical recording media.
Types of data
 Traditional
Document
Finance
Stock record
Personal files
 Modern
Photographs
Audio & Video
3D Model
Simulation
Location Data
CAN BE HANDLED BY
RDBMS
DIFFICULT TO BE HANDLED BY
RDBMS
What is BIG DATA ?
Big data generally uses that data sets which can’t be
handled by the traditional software tools to capture ,curate
, manage and process data within a tolerable elapsed time.
As we know that size is a constantly varying target
ranging from few terabytes to many petabytes of data so we
can also say big data is a set of techniques required to
uncover large hidden values from large data sets that are
diverse complex and of a massive scale.
Who created BIG DATA ?
 In his in the year 2001 paper 3D Data Management: Controlling
Data Volume, Velocity and Variety Doug Laney, analyst at Gartner,
defines three of what will come to be the commonly-accepted
characteristics of Big Data.
 Commentators announce in 2005 that we are witnessing the
birth of “Web 2.0” – the user-generated web where the majority of
content will be provided by users of services, rather than the service
providers themselves. This year also witnessed the emerging of
HADOOP an open source platform used to store and process big data.
Contd...
 In 2008 the world’s servers process 9.57 zettabytes (9.57 trillion
gigabytes) of information – equivalent to 12 gigabytes of information
per person, per day), according to the How Much Information? 2010
report. In International Production and Dissemination of Information,
it is estimated that 14.7 exabytes of new information are produced this
year.
 In 2010 Eric Schmidt, executive chairman of Google, tells a conference
that as much data is now being created every two days, as was created
from the beginning of human civilization to the year 2003.
 In 2014 The rise of the mobile machines – as for the first time, more
people are using mobile devices to access digital data, than office or
home computers. 88% of business executives surveyed by GE working
with Accenture report that big data analytics is a top priority for their
business.
Contd...
What this teaches us is that Big Data is not a new or
isolated phenomenon, but one that is part of a long
evolution of capturing and using data. Like other key
developments in data storage, data processing and the
Internet, Big Data is just a further step that will bring
change to the way we run business and society. At the same
time it will lay the foundations on which many evolutions
will be built.
The 3 V’s of big data....
Volume
(Amount of Data)
Velocity
(speed of processing)
Variety
(range and source)
Volume....
As the number of users increasing day by day the mount of
data used by them also increasing simultaneously.
Organisation Data processed(per day)
Ebay 100 pb
Google 100 pb
Baidu 10-100 pb
NSA 29 pb
Spotify 600 pb
Facebook 100 pb
Twitter 64 pb
Contd...
If we analyse these amount of data it would be easier for
the companies to know about their customers. however
traditional data processing system is not able to process
these amount of data .So we need a more reliable data
processing concept which is nothing but BIG DATA.
Velocity
 The amount of data which are uploaded or downloaded by
the users of some organisation are exceeding the capacity
of their IT systems.
 As we can see that the amount of data produced in last 5
years is the 90% of the whole data which are produced by
in last 20 years.
 And in this speed data processing can’t be done by using
traditional RDBMS concepts.
Variety
 Previously we’re dealing with few varieties of data such as
Document Finance Stock record Personal files
 But now a days we’ve to deal with many kinds f data
such as videos ,music ,photographs , simulations and
3D models.
Big data classification
Contd...
 Analysis type — Whether the data is analyzed in real time or batched
for later analysis. Give careful consideration to choosing the analysis
type, since it affects several other decisions about products, tools,
hardware, data sources, and expected data frequency. A mix of both
types may be required by the use case:
 Fraud detection; analysis must be done in real time or near real time.
 Trend analysis for strategic business decisions; analysis can be in batch
mode.
 Processing methodology — The type of technique to be applied for
processing data (e.g., predictive, analytical, ad-hoc query, and
reporting). Business requirements determine the appropriate
processing methodology. A combination of techniques can be used.
The choice of processing methodology helps identify the appropriate
tools and techniques to be used in your big data solution.
Contd...
 Content format — Format of incoming data — structured (RDMBS, for
example), unstructured (audio, video, and images, for example), or
semi-structured. Format determines how the incoming data needs to
be processed and is key to choosing tools and techniques and defining
a solution from a business perspective.
 Data type — Type of data to be processed — transactional, historical,
master data, and others. Knowing the data type helps segregate the
data in storage.
 Data frequency and size — How much data is expected and at what
frequency does it arrive. Knowing frequency and size helps determine
the storage mechanism, storage format, and the necessary pre-
processing tools. Data frequency and size depend on data sources:
 On demand, as with social media data
 Continuous feed, real-time (weather data, transactional data)
 Time series (time-based data)
Contd...
 Data source — Sources of data (where the data is generated) — web
and social media, machine-generated, human-generated, etc.
Identifying all the data sources helps determine the scope from a
business perspective. The figure shows the most widely used data
sources.
 Data consumers — A list of all of the possible consumers of the
processed data: Business processes, Business users, Enterprise
applications, Individual people in various business roles, Part of the
process flows, Other data repositories or enterprise applications.
 Hardware — The type of hardware on which the big data solution will
be implemented — commodity hardware or state of the art.
Understanding the limitations of hardware helps inform the choice of
big data solution.
Big data technology
Where we’re using....
 In medicals
 In social networking sites
 For surveys
 Science and Research
 Real estate
 Retail
 Banking
 Internet of things
 Government sectors
Conclusion
The availability of Big Data, low-cost commodity hardware, and
new information management and analytic software have produced a
unique moment in the history of data analysis. The convergence of
these trends means that we have the capabilities required to analyze
astonishing data sets quickly and cost-effectively for the first time in
history. These capabilities are neither theoretical nor trivial. They
represent a genuine leap forward and a clear opportunity to realize
enormous gains in terms of efficiency, productivity, revenue, and
profitability.
The Age of Big Data is here, and these are truly revolutionary times
if both business and technology professionals continue to work
together and deliver on the promise.
THANK YOU

More Related Content

What's hot (20)

Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Chapter 4 what is data and data types
Chapter 4  what is data and data typesChapter 4  what is data and data types
Chapter 4 what is data and data types
 
Big Data : Risks and Opportunities
Big Data : Risks and OpportunitiesBig Data : Risks and Opportunities
Big Data : Risks and Opportunities
 
Big Data for Beginners
Big Data for BeginnersBig Data for Beginners
Big Data for Beginners
 
Big data-ppt
Big data-pptBig data-ppt
Big data-ppt
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Big data-analytics-ebook
Big data-analytics-ebookBig data-analytics-ebook
Big data-analytics-ebook
 
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of thingsBig Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 
Big data-ppt-
Big data-ppt-Big data-ppt-
Big data-ppt-
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
 
Fraud and Risk in Big Data
Fraud and Risk in Big DataFraud and Risk in Big Data
Fraud and Risk in Big Data
 
A Big Data Concept
A Big Data ConceptA Big Data Concept
A Big Data Concept
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035
 
Big data
Big dataBig data
Big data
 
Big Data and Classification
Big Data and ClassificationBig Data and Classification
Big Data and Classification
 

Viewers also liked

Inteligencia emocional
Inteligencia emocionalInteligencia emocional
Inteligencia emocionalLuzsolm
 
Luz adriana guzman perez sena -simposio formacion
Luz adriana guzman perez sena -simposio formacionLuz adriana guzman perez sena -simposio formacion
Luz adriana guzman perez sena -simposio formacionAdriana Guzman Perez
 
Programación Neurolingüística
Programación NeurolingüísticaProgramación Neurolingüística
Programación Neurolingüísticaemar2015
 
Neurodidáctica aplicasda a la educaciónm. Class of clans 4 asignaturas 1 juego
Neurodidáctica aplicasda a la educaciónm. Class of clans 4 asignaturas 1 juegoNeurodidáctica aplicasda a la educaciónm. Class of clans 4 asignaturas 1 juego
Neurodidáctica aplicasda a la educaciónm. Class of clans 4 asignaturas 1 juegoPlataforma Proyecta
 
Reporte de-invest.-1-métodos-o-estrategias-de-enseñanza-y-aprendizaje-de-la-g...
Reporte de-invest.-1-métodos-o-estrategias-de-enseñanza-y-aprendizaje-de-la-g...Reporte de-invest.-1-métodos-o-estrategias-de-enseñanza-y-aprendizaje-de-la-g...
Reporte de-invest.-1-métodos-o-estrategias-de-enseñanza-y-aprendizaje-de-la-g...Kashia Ponce
 
"En el laberinto del tiempo". Bases neurodidácticas para un aprendizaje con j...
"En el laberinto del tiempo". Bases neurodidácticas para un aprendizaje con j..."En el laberinto del tiempo". Bases neurodidácticas para un aprendizaje con j...
"En el laberinto del tiempo". Bases neurodidácticas para un aprendizaje con j...Dr.Francisco Ignacio Revuelta Domínguez
 
Neurodidactica erly lopez
Neurodidactica erly lopezNeurodidactica erly lopez
Neurodidactica erly lopezerly lopez
 
Clase 3 funciones ejecutivas
Clase 3 funciones ejecutivasClase 3 funciones ejecutivas
Clase 3 funciones ejecutivasYohanna Agurto
 

Viewers also liked (16)

Estrategias de aprendizaje
Estrategias de aprendizajeEstrategias de aprendizaje
Estrategias de aprendizaje
 
Hi juguem ?
Hi juguem ?Hi juguem ?
Hi juguem ?
 
Inteligencia emocional
Inteligencia emocionalInteligencia emocional
Inteligencia emocional
 
Inteligencia emocional, resiliencia en el síndrome postpolio, Silvia Mª Sánch...
Inteligencia emocional, resiliencia en el síndrome postpolio, Silvia Mª Sánch...Inteligencia emocional, resiliencia en el síndrome postpolio, Silvia Mª Sánch...
Inteligencia emocional, resiliencia en el síndrome postpolio, Silvia Mª Sánch...
 
Luz adriana guzman perez sena -simposio formacion
Luz adriana guzman perez sena -simposio formacionLuz adriana guzman perez sena -simposio formacion
Luz adriana guzman perez sena -simposio formacion
 
Programación Neurolingüística
Programación NeurolingüísticaProgramación Neurolingüística
Programación Neurolingüística
 
Taller neurodidáctica
Taller neurodidácticaTaller neurodidáctica
Taller neurodidáctica
 
Taller estilos y metodos de aprendizaje
Taller estilos y metodos de aprendizajeTaller estilos y metodos de aprendizaje
Taller estilos y metodos de aprendizaje
 
Neurodidáctica aplicasda a la educaciónm. Class of clans 4 asignaturas 1 juego
Neurodidáctica aplicasda a la educaciónm. Class of clans 4 asignaturas 1 juegoNeurodidáctica aplicasda a la educaciónm. Class of clans 4 asignaturas 1 juego
Neurodidáctica aplicasda a la educaciónm. Class of clans 4 asignaturas 1 juego
 
Inteligencia emocional y competencias docentes
Inteligencia emocional y competencias docentesInteligencia emocional y competencias docentes
Inteligencia emocional y competencias docentes
 
Reporte de-invest.-1-métodos-o-estrategias-de-enseñanza-y-aprendizaje-de-la-g...
Reporte de-invest.-1-métodos-o-estrategias-de-enseñanza-y-aprendizaje-de-la-g...Reporte de-invest.-1-métodos-o-estrategias-de-enseñanza-y-aprendizaje-de-la-g...
Reporte de-invest.-1-métodos-o-estrategias-de-enseñanza-y-aprendizaje-de-la-g...
 
"En el laberinto del tiempo". Bases neurodidácticas para un aprendizaje con j...
"En el laberinto del tiempo". Bases neurodidácticas para un aprendizaje con j..."En el laberinto del tiempo". Bases neurodidácticas para un aprendizaje con j...
"En el laberinto del tiempo". Bases neurodidácticas para un aprendizaje con j...
 
Neurodidactica erly lopez
Neurodidactica erly lopezNeurodidactica erly lopez
Neurodidactica erly lopez
 
Clase 3 funciones ejecutivas
Clase 3 funciones ejecutivasClase 3 funciones ejecutivas
Clase 3 funciones ejecutivas
 
Neuroplasticidad
NeuroplasticidadNeuroplasticidad
Neuroplasticidad
 
Las funciones ejecutivas
Las funciones ejecutivasLas funciones ejecutivas
Las funciones ejecutivas
 

Similar to Big data

Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.saranya270513
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET Journal
 
big-datagroup6-150317090053-conversion-gate01.pdf
big-datagroup6-150317090053-conversion-gate01.pdfbig-datagroup6-150317090053-conversion-gate01.pdf
big-datagroup6-150317090053-conversion-gate01.pdfVirajSaud
 
Guide to big data analytics
Guide to big data analyticsGuide to big data analytics
Guide to big data analyticsGahya Pandian
 
Age Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAge Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAgeFriendlyEconomy
 
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...IJERDJOURNAL
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICSNAGARAJAGIDDE
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigDataValarmathi V
 
MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.stelligence
 
Big data seminor
Big data seminorBig data seminor
Big data seminorberasrujana
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfrajsharma159890
 
The book of elephant tattoo
The book of elephant tattooThe book of elephant tattoo
The book of elephant tattooMohamed Magdy
 
Analysis on big data concepts and applications
Analysis on big data concepts and applicationsAnalysis on big data concepts and applications
Analysis on big data concepts and applicationsIJARIIT
 

Similar to Big data (20)

Introduction to big data – convergences.
Introduction to big data – convergences.Introduction to big data – convergences.
Introduction to big data – convergences.
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth Enhancement
 
Big data
Big dataBig data
Big data
 
big-datagroup6-150317090053-conversion-gate01.pdf
big-datagroup6-150317090053-conversion-gate01.pdfbig-datagroup6-150317090053-conversion-gate01.pdf
big-datagroup6-150317090053-conversion-gate01.pdf
 
Guide to big data analytics
Guide to big data analyticsGuide to big data analytics
Guide to big data analytics
 
new.pptx
new.pptxnew.pptx
new.pptx
 
Age Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAge Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big Data
 
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
 
Big data.pptx
Big data.pptxBig data.pptx
Big data.pptx
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big data upload
Big data uploadBig data upload
Big data upload
 
Big data
Big dataBig data
Big data
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigData
 
MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.MBA-TU-Thailand:BigData for business startup.
MBA-TU-Thailand:BigData for business startup.
 
Sample
Sample Sample
Sample
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdf
 
The book of elephant tattoo
The book of elephant tattooThe book of elephant tattoo
The book of elephant tattoo
 
Analysis on big data concepts and applications
Analysis on big data concepts and applicationsAnalysis on big data concepts and applications
Analysis on big data concepts and applications
 

Recently uploaded

Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...mikehavy0
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?RemarkSemacio
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024patrickdtherriault
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesBoston Institute of Analytics
 
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTSDBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTSSnehalVinod
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证acoha1
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...Bertram Ludäscher
 
DAA Assignment Solution.pdf is the best1
DAA Assignment Solution.pdf is the best1DAA Assignment Solution.pdf is the best1
DAA Assignment Solution.pdf is the best1sinhaabhiyanshu
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格q6pzkpark
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxronsairoathenadugay
 
Introduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptxIntroduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptxAniqa Zai
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives23050636
 
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...LuisMiguelPaz5
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshareraiaryan448
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...ThinkInnovation
 

Recently uploaded (20)

Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get CytotecAbortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
 
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
Abortion Clinic in Kempton Park +27791653574 WhatsApp Abortion Clinic Service...
 
Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?Case Study 4 Where the cry of rebellion happen?
Case Study 4 Where the cry of rebellion happen?
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024
 
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotecAbortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
Abortion pills in Riyadh Saudi Arabia| +966572737505 | Get Cytotec, Unwanted Kit
Abortion pills in Riyadh Saudi Arabia| +966572737505 | Get Cytotec, Unwanted KitAbortion pills in Riyadh Saudi Arabia| +966572737505 | Get Cytotec, Unwanted Kit
Abortion pills in Riyadh Saudi Arabia| +966572737505 | Get Cytotec, Unwanted Kit
 
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTSDBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
DBMS UNIT 5 46 CONTAINS NOTES FOR THE STUDENTS
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
DAA Assignment Solution.pdf is the best1
DAA Assignment Solution.pdf is the best1DAA Assignment Solution.pdf is the best1
DAA Assignment Solution.pdf is the best1
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Introduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptxIntroduction to Statistics Presentation.pptx
Introduction to Statistics Presentation.pptx
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
 
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
Huawei Ransomware Protection Storage Solution Technical Overview Presentation...
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
Abortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotecAbortion pills in Jeddah |+966572737505 | get cytotec
Abortion pills in Jeddah |+966572737505 | get cytotec
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
 

Big data

  • 1. By , Abhishek Palo Regd no:1301308234
  • 2. CONTENTS  AN INTRODUCTION TOTHE WORLD OF DATA ?  WHAT IS BIG DATA ?  WHO CREATED BIG DATA ?  WHEN IT WAS CREATED ?  WHY BIG DATA ?  WHERE WE’RE USING IT ?  HOW TO USE IT ? 5’W 1’H OF BIG DATA
  • 3. WHAT IS DATA.....? facts and statistics collected together for reference or analysis. OR the quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media.
  • 4. Types of data  Traditional Document Finance Stock record Personal files  Modern Photographs Audio & Video 3D Model Simulation Location Data CAN BE HANDLED BY RDBMS DIFFICULT TO BE HANDLED BY RDBMS
  • 5. What is BIG DATA ? Big data generally uses that data sets which can’t be handled by the traditional software tools to capture ,curate , manage and process data within a tolerable elapsed time. As we know that size is a constantly varying target ranging from few terabytes to many petabytes of data so we can also say big data is a set of techniques required to uncover large hidden values from large data sets that are diverse complex and of a massive scale.
  • 6. Who created BIG DATA ?  In his in the year 2001 paper 3D Data Management: Controlling Data Volume, Velocity and Variety Doug Laney, analyst at Gartner, defines three of what will come to be the commonly-accepted characteristics of Big Data.  Commentators announce in 2005 that we are witnessing the birth of “Web 2.0” – the user-generated web where the majority of content will be provided by users of services, rather than the service providers themselves. This year also witnessed the emerging of HADOOP an open source platform used to store and process big data.
  • 7. Contd...  In 2008 the world’s servers process 9.57 zettabytes (9.57 trillion gigabytes) of information – equivalent to 12 gigabytes of information per person, per day), according to the How Much Information? 2010 report. In International Production and Dissemination of Information, it is estimated that 14.7 exabytes of new information are produced this year.  In 2010 Eric Schmidt, executive chairman of Google, tells a conference that as much data is now being created every two days, as was created from the beginning of human civilization to the year 2003.  In 2014 The rise of the mobile machines – as for the first time, more people are using mobile devices to access digital data, than office or home computers. 88% of business executives surveyed by GE working with Accenture report that big data analytics is a top priority for their business.
  • 8. Contd... What this teaches us is that Big Data is not a new or isolated phenomenon, but one that is part of a long evolution of capturing and using data. Like other key developments in data storage, data processing and the Internet, Big Data is just a further step that will bring change to the way we run business and society. At the same time it will lay the foundations on which many evolutions will be built.
  • 9. The 3 V’s of big data.... Volume (Amount of Data) Velocity (speed of processing) Variety (range and source)
  • 10. Volume.... As the number of users increasing day by day the mount of data used by them also increasing simultaneously. Organisation Data processed(per day) Ebay 100 pb Google 100 pb Baidu 10-100 pb NSA 29 pb Spotify 600 pb Facebook 100 pb Twitter 64 pb
  • 11. Contd... If we analyse these amount of data it would be easier for the companies to know about their customers. however traditional data processing system is not able to process these amount of data .So we need a more reliable data processing concept which is nothing but BIG DATA.
  • 12. Velocity  The amount of data which are uploaded or downloaded by the users of some organisation are exceeding the capacity of their IT systems.  As we can see that the amount of data produced in last 5 years is the 90% of the whole data which are produced by in last 20 years.  And in this speed data processing can’t be done by using traditional RDBMS concepts.
  • 13. Variety  Previously we’re dealing with few varieties of data such as Document Finance Stock record Personal files  But now a days we’ve to deal with many kinds f data such as videos ,music ,photographs , simulations and 3D models.
  • 15. Contd...  Analysis type — Whether the data is analyzed in real time or batched for later analysis. Give careful consideration to choosing the analysis type, since it affects several other decisions about products, tools, hardware, data sources, and expected data frequency. A mix of both types may be required by the use case:  Fraud detection; analysis must be done in real time or near real time.  Trend analysis for strategic business decisions; analysis can be in batch mode.  Processing methodology — The type of technique to be applied for processing data (e.g., predictive, analytical, ad-hoc query, and reporting). Business requirements determine the appropriate processing methodology. A combination of techniques can be used. The choice of processing methodology helps identify the appropriate tools and techniques to be used in your big data solution.
  • 16. Contd...  Content format — Format of incoming data — structured (RDMBS, for example), unstructured (audio, video, and images, for example), or semi-structured. Format determines how the incoming data needs to be processed and is key to choosing tools and techniques and defining a solution from a business perspective.  Data type — Type of data to be processed — transactional, historical, master data, and others. Knowing the data type helps segregate the data in storage.  Data frequency and size — How much data is expected and at what frequency does it arrive. Knowing frequency and size helps determine the storage mechanism, storage format, and the necessary pre- processing tools. Data frequency and size depend on data sources:  On demand, as with social media data  Continuous feed, real-time (weather data, transactional data)  Time series (time-based data)
  • 17. Contd...  Data source — Sources of data (where the data is generated) — web and social media, machine-generated, human-generated, etc. Identifying all the data sources helps determine the scope from a business perspective. The figure shows the most widely used data sources.  Data consumers — A list of all of the possible consumers of the processed data: Business processes, Business users, Enterprise applications, Individual people in various business roles, Part of the process flows, Other data repositories or enterprise applications.  Hardware — The type of hardware on which the big data solution will be implemented — commodity hardware or state of the art. Understanding the limitations of hardware helps inform the choice of big data solution.
  • 19. Where we’re using....  In medicals  In social networking sites  For surveys  Science and Research  Real estate  Retail  Banking  Internet of things  Government sectors
  • 20. Conclusion The availability of Big Data, low-cost commodity hardware, and new information management and analytic software have produced a unique moment in the history of data analysis. The convergence of these trends means that we have the capabilities required to analyze astonishing data sets quickly and cost-effectively for the first time in history. These capabilities are neither theoretical nor trivial. They represent a genuine leap forward and a clear opportunity to realize enormous gains in terms of efficiency, productivity, revenue, and profitability. The Age of Big Data is here, and these are truly revolutionary times if both business and technology professionals continue to work together and deliver on the promise.