Big Data
What is Data?
• The quantities, characters, or symbols on which operations are
performed by a computer, which may be stored and
transmitted in the form of electrical signals and recorded on
magnetic, optical, or mechanical recording media.
What is Big Data?
• Before explaining what Big Data is, it helps to say what it is not.
• The most common myth associated with it is that it is just
about the size or volume of data.
• In fact, it is not just about the "big" amounts of data being collected.
• Big Data refers to large amounts of data that pour in from various data sources and arrive in different formats.
• Huge amounts of data were stored in databases before, but because of the varied nature of this data, traditional relational database systems are incapable of handling it.
• Big data refers to the large, diverse sets of information
that grow at ever-increasing rates.
• It encompasses the volume of information, the velocity or
speed at which it is created and collected, and the variety
or scope of the data points being covered.
• Big data often comes from multiple sources and arrives in
multiple formats.
• To really understand big data, it’s helpful to have some
historical background. Here is Gartner’s definition, circa
2001 (which is still the go-to definition): Big data is data
that contains greater variety arriving in increasing volumes
and with ever-higher velocity.
• Put simply, big data is larger, more complex data sets, especially from new data sources. These data sets are so voluminous that traditional data processing software cannot manage them.
The History of Big Data
• Although the concept of big data itself is relatively new, the origins of large data sets go back to the 1960s and '70s, when the world of data was just getting started with the first data centers and the development of the relational database.
• Around 2005, people began to realize just how much data users generated through Facebook, YouTube, and other online services.
• Hadoop (an open-source framework created specifically to store and analyze big data sets) was developed that same year.
• NoSQL also began to gain popularity during this time.
• The development of open-source frameworks such as Hadoop was essential for the growth of big data, because they make big data easier to work with and cheaper to store.
• In the years since then, the volume of big data has
skyrocketed.
• Users are still generating huge amounts of data, but it is not just humans who are doing it.
• With the advent of the Internet of Things (IoT), more objects and devices are connected to the internet, gathering data on customer usage patterns and product performance.
• The emergence of machine learning has produced still more data.
• While big data has come far, its usefulness is only just beginning.
What is Big Data?
Big Data is still data, but of enormous size. The term describes a collection of data that is huge in volume and keeps growing exponentially over time.
In short, such data is so large and complex that none of the traditional data management tools can store or process it efficiently.
Examples Of Big Data
The following are some examples of Big Data:
1. The New York Stock Exchange generates about one terabyte of new trade data per day.
2. Social media: more than 500 terabytes of new data are ingested into the databases of the social media site Facebook every day. This data comes mainly from photo and video uploads, message exchanges, comments, and so on.
3. A single jet engine can generate more than 10 terabytes of data in 30 minutes of flight time. With many thousands of flights per day, the data generated reaches many petabytes.
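The jet-engine figure above can be turned into a rough back-of-the-envelope estimate. The sketch below is only illustrative: the 10 TB per 30 minutes comes from the slide, while the number of flights per day, the flight duration, and the number of engines per aircraft are hypothetical values chosen for the arithmetic, as is the function name `daily_volume_pb`. Decimal units are assumed (1 PB = 1,000 TB).

```python
# Rough estimate of daily jet-engine data volume.
# Only TB_PER_HALF_HOUR comes from the slide; everything else is an assumption.
TB_PER_HALF_HOUR = 10   # slide: "10+ terabytes of data in 30 minutes of flight time"
TB_PER_PB = 1_000       # decimal units: 1 petabyte = 1,000 terabytes


def daily_volume_pb(flights_per_day, flight_hours, engines_per_aircraft,
                    tb_per_half_hour=TB_PER_HALF_HOUR):
    """Return the estimated data volume per day, in petabytes."""
    half_hours_per_flight = flight_hours * 2
    total_tb = (flights_per_day * engines_per_aircraft
                * half_hours_per_flight * tb_per_half_hour)
    return total_tb / TB_PER_PB


if __name__ == "__main__":
    # Hypothetical inputs: 5,000 flights per day, 2 hours each, 2 engines.
    estimate = daily_volume_pb(flights_per_day=5_000,
                               flight_hours=2.0,
                               engines_per_aircraft=2)
    print(f"Estimated volume: {estimate:,.0f} PB per day")
```

Even with these modest made-up inputs the total lands in the hundreds of petabytes per day, which is the point the slide makes: the per-engine rate matters less than how many sources produce data at once.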
Characteristics of Big Data
1. Volume
• With big data, you have to process high volumes of low-density, unstructured data.
• This can be data of unknown value, such as Twitter data feeds, clickstreams on a webpage or a mobile app, or data from sensor-enabled equipment.
• For some organizations, this might be tens of terabytes of data. For others, it may be hundreds of petabytes.
2. Velocity
• Velocity is the fast rate at which data is received and (perhaps) acted on.
• Normally, the highest-velocity data streams directly into memory rather than being written to disk.
• Some internet-enabled smart products operate in real time or near real time and require real-time evaluation and action.
3. Variety
• Variety refers to the many types of data that are available.
• Traditional data types were structured and fit neatly in a relational database.
• With the rise of big data, data also arrives in unstructured and semi-structured forms. Such data, for example text, audio, and video, requires additional preprocessing to derive meaning and support metadata.
4. Value
• Data has intrinsic value, but it is of no use until that value is discovered.
5. Veracity (Truth)
• How truthful is your data, and how much can you rely on it?
• Veracity refers to the inconsistencies that data can exhibit at times, which make it harder to handle and manage the data effectively.
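The veracity question above ("how much can you rely on it?") can be made concrete with a few basic consistency checks. The following is a minimal sketch, not a standard method: the `Reading` record, the `find_problems` and `veracity_score` functions, and the temperature range are all hypothetical names and values chosen for illustration. It flags duplicate, missing, and out-of-range sensor readings and reports the fraction of readings that pass.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Reading:
    """A hypothetical sensor reading."""
    sensor_id: str
    timestamp: int                  # seconds since epoch
    temperature_c: Optional[float]  # None means the value is missing


def find_problems(readings: List[Reading],
                  low: float = -50.0,
                  high: float = 150.0) -> List[Tuple[int, str]]:
    """Return (index, reason) pairs for readings that fail a basic check."""
    problems = []
    seen = set()
    for i, r in enumerate(readings):
        key = (r.sensor_id, r.timestamp)
        if key in seen:
            # Same sensor and timestamp already seen earlier in the list.
            problems.append((i, "duplicate"))
            continue
        seen.add(key)
        if r.temperature_c is None:
            problems.append((i, "missing value"))
        elif not (low <= r.temperature_c <= high):
            problems.append((i, "out of range"))
    return problems


def veracity_score(readings: List[Reading]) -> float:
    """Fraction of readings that pass all checks (1.0 for an empty list)."""
    if not readings:
        return 1.0
    return 1 - len(find_problems(readings)) / len(readings)


if __name__ == "__main__":
    sample = [
        Reading("a", 1, 20.0),
        Reading("a", 1, 20.0),   # duplicate of the first reading
        Reading("b", 2, None),   # missing value
        Reading("c", 3, 999.0),  # outside the assumed plausible range
    ]
    print(find_problems(sample))
    print(veracity_score(sample))
```

Real pipelines would use domain-specific rules; the score here is only a simple way to express how much of a data set can be trusted before further processing.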

Big Data basics-Unit-1.pptx
