SlideShare a Scribd company logo
…Big Data...
14404- FANSUPKAR TANIYA
14418- RAIS ZOYA
Big Data is the ocean of information we swim in every day vast
sources of data flowing from our computers, mobile devices, and
machine sensors. Big Data is being generated by everything around
us at all times. Every digital process and social media exchange
produces it, while systems, sensors, and mobile devices transmit it.
New sources of data come from a variety of machines, such as
website interactions, search engine optimizations, and social
business sites by using click-stream data. These changing business
requirements demand that the right information be available at the
right time.[1]
What is Big Data?
Big Data Versus Small Data
Small Data
• Usually designed to answer a
specific question or serve a
particular goal.
• Typically, small data is
contained within one
institution, often on one
computer, sometimes in one
file.
• In many cases, the data user
prepares her own data, for her
own purposes.
Big Data
• Usually designed with a goal in
mind, but the goal is flexible
and the questions posed are
protean.
• Typically spread throughout
electronic space, typically
spread through multiple
Internet servers, located
anywhere on earth.
• The data comes from many
differ sources, and it is
prepared by many people.
Small Data
• Ordinarily contains highly
structured data. The data
domain is restricted to a
single discipline or sub
discipline.
• Typically, the data is
measured using one
experimental protocol, and
the data can be represented
using one set of standard
units.
Big Data
• Must be capable of
absorbing unstructured
data (e.g., such as free-text
documents, images, motion
pictures, sound recordings,
physical objects).
• Many different types of
data are delivered in many
different electronic formats
by different people.[2]
Let’s look at
Big Data
in a different way.
Byte
Byte : one grain of rice
Kilobyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
One ByteExabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Zettabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL! Yottabyte
HobbyistByte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
Desktop
HobbyistByte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
Desktop
Hobbyist
Internet
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
Desktop
Hobbyist
Internet
Big Data
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
Desktop
Hobbyist
The Future?[3]
Internet
Big Data
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL!
Volume...
100 terabytes of data are uploaded daily to Facebook ; Akamai
analyses 75 million events a day to target online ads; Walmart
handles 1 million customer transactions every single hour. 90%
of all data ever created was generated in the past 2 years.
Scale is certainly a part of what makes Big Data big. The
internet-mobile revolution, bringing with it a torrent of social
media updates, sensor data from devices and an explosion of e-
commerce, means that every industry is swamped with data-
which can be incredibly valuable, if you know how to use it.
3vs of Big Data
Velocity…
In 1999, Wal-Mart’s data warehouse stored 1,000 terabytes (1,000,000
gigabytes) of data. In 2012, it had access to over 2.5 petabytes (2,500,000
gigabytes) of data.
Every minute of every day, we upload 100 hours of video on Youtube, send
over 200 million emails and send 300,000 tweets. ‘Velocity’ refers to the
increasing speed at which this data is created, and the increasing speed at
which the data can be processed, stored and analysed by relational
databases. The possibilities of processing data in real-time is an area of
particular interest, which allows companies to do things like display
personalised ads on the web pages you visit, based on your recent search,
viewing and purchase history.
Variety…
Gone are the days when a company’s data could be neatly
slotted into a table and analysed. 90% of data generated is
‘unstructured’, coming in all shapes and forms- from geo-spatial
data, to tweets which can be analysed for content and
sentiment, to visual data such as photos and videos.
The ‘3 V’s’ certainly give us an insight into the almost
unimaginable scale of data, and the break-neck speeds at which
these vast datasets grow and multiply. But only ‘Variety’ really
begins to scratch the surface of the depth- and crucially, the
challenges- of Big Data.[4]
Benefits of Big Data…
High Maintenance.
Skill needed to access Data.
Difficult to Handle.
Violates the Privacy Principle.[5]
Drawbacks of Big Data...
Government.
International development
 Manufacturing
Cyber-Physical Models
Media
Technology
Private sector
Science and Research.[6]
Applications of Big Data…
[1].Book: "Big Data for Beginners" by Alonzo Williams,Stepanie Foor.
[2].Book: "Principles of Big Data: Preparing, Sharing, and Analyzing Complex
Information" by Jules J. Berman.
[4].Book: "Understanding Big Data: A Beginners Guide to Data Science & the Business
Applications" by Eileen McNulty-Holmes.
[5].http://www.oii.ox.ac.uk/research/project/?id=98.
[6].https://www.google.co.in/#q=applications+of+big+data+wikipedia.
[3]. http://www.slideshare.net/dwellman/what-is-big-data-24401517?qid=6e8e2726-
6681-486c-880b-f973f6b61e2c&v=&b=&from_search=5
Big data

More Related Content

Similar to Big data

Big data anuj
Big data anujBig data anuj
Big data anuj
Anuj Pandey
 
Whatisbigdata 130718170809-phpapp01
Whatisbigdata 130718170809-phpapp01Whatisbigdata 130718170809-phpapp01
Whatisbigdata 130718170809-phpapp01
Vera Kovaleva
 
What is big data
What is big dataWhat is big data
What is big data?
What is big data?What is big data?
What is big data?
David Wellman
 
Big Data Chapter1.pdf
Big Data Chapter1.pdfBig Data Chapter1.pdf
Big Data Chapter1.pdf
SantoshUpreti6
 
Big data
Big data Big data
Big data
lia borsha
 
Intro to big data and how it works
Intro to big data and how it worksIntro to big data and how it works
Intro to big data and how it works
Nadeem Tahir
 
Big data
Big dataBig data
Big data
Samira Riki
 
BIG DATA
BIG DATABIG DATA
BIG DATA
Abhishek Bhurke
 
Bigdata presentation
Bigdata presentationBigdata presentation
Bigdata presentation
SatishAlerts
 
Bigdata presentation
Bigdata presentationBigdata presentation
Bigdata presentation
SatishAlerts
 
Data-Ed: Demystifying Big Data
Data-Ed: Demystifying Big DataData-Ed: Demystifying Big Data
Data-Ed: Demystifying Big Data
Data Blueprint
 
Big data and Hadoop
Big data and HadoopBig data and Hadoop
Big data and Hadoop
Rahul Mahawar
 
L18 Big Data and Analytics
L18 Big Data and AnalyticsL18 Big Data and Analytics
L18 Big Data and Analytics
Ólafur Andri Ragnarsson
 
Big data overview
Big data overviewBig data overview
Big data overview
Ganesan Vetriselvan
 
Big data overview
Big data overviewBig data overview
Big data overview
Ganesan Vetriselvan
 
Big data and Internet
Big data and InternetBig data and Internet
Big data and Internet
Sanoj Kumar
 
What's the Big Deal About Big Data?.pdf
What's the Big Deal About Big Data?.pdfWhat's the Big Deal About Big Data?.pdf
What's the Big Deal About Big Data?.pdf
Steven Jong
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Hritika Raj
 
Big Data basics-Unit-1.pptx
Big Data basics-Unit-1.pptxBig Data basics-Unit-1.pptx
Big Data basics-Unit-1.pptx
varun453331
 

Similar to Big data (20)

Big data anuj
Big data anujBig data anuj
Big data anuj
 
Whatisbigdata 130718170809-phpapp01
Whatisbigdata 130718170809-phpapp01Whatisbigdata 130718170809-phpapp01
Whatisbigdata 130718170809-phpapp01
 
What is big data
What is big dataWhat is big data
What is big data
 
What is big data?
What is big data?What is big data?
What is big data?
 
Big Data Chapter1.pdf
Big Data Chapter1.pdfBig Data Chapter1.pdf
Big Data Chapter1.pdf
 
Big data
Big data Big data
Big data
 
Intro to big data and how it works
Intro to big data and how it worksIntro to big data and how it works
Intro to big data and how it works
 
Big data
Big dataBig data
Big data
 
BIG DATA
BIG DATABIG DATA
BIG DATA
 
Bigdata presentation
Bigdata presentationBigdata presentation
Bigdata presentation
 
Bigdata presentation
Bigdata presentationBigdata presentation
Bigdata presentation
 
Data-Ed: Demystifying Big Data
Data-Ed: Demystifying Big DataData-Ed: Demystifying Big Data
Data-Ed: Demystifying Big Data
 
Big data and Hadoop
Big data and HadoopBig data and Hadoop
Big data and Hadoop
 
L18 Big Data and Analytics
L18 Big Data and AnalyticsL18 Big Data and Analytics
L18 Big Data and Analytics
 
Big data overview
Big data overviewBig data overview
Big data overview
 
Big data overview
Big data overviewBig data overview
Big data overview
 
Big data and Internet
Big data and InternetBig data and Internet
Big data and Internet
 
What's the Big Deal About Big Data?.pdf
What's the Big Deal About Big Data?.pdfWhat's the Big Deal About Big Data?.pdf
What's the Big Deal About Big Data?.pdf
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
Big Data basics-Unit-1.pptx
Big Data basics-Unit-1.pptxBig Data basics-Unit-1.pptx
Big Data basics-Unit-1.pptx
 

Recently uploaded

20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 

Recently uploaded (20)

20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 

Big data

  • 1. …Big Data... 14404- FANSUPKAR TANIYA 14418- RAIS ZOYA
  • 2. Big Data is the ocean of information we swim in every day vast sources of data flowing from our computers, mobile devices, and machine sensors. Big Data is being generated by everything around us at all times. Every digital process and social media exchange produces it, while systems, sensors, and mobile devices transmit it. New sources of data come from a variety of machines, such as website interactions, search engine optimizations, and social business sites by using click-stream data. These changing business requirements demand that the right information be available at the right time.[1] What is Big Data?
  • 3. Big Data Versus Small Data Small Data • Usually designed to answer a specific question or serve a particular goal. • Typically, small data is contained within one institution, often on one computer, sometimes in one file. • In many cases, the data user prepares her own data, for her own purposes. Big Data • Usually designed with a goal in mind, but the goal is flexible and the questions posed are protean. • Typically spread throughout electronic space, typically spread through multiple Internet servers, located anywhere on earth. • The data comes from many differ sources, and it is prepared by many people.
  • 4. Small Data • Ordinarily contains highly structured data. The data domain is restricted to a single discipline or sub discipline. • Typically, the data is measured using one experimental protocol, and the data can be represented using one set of standard units. Big Data • Must be capable of absorbing unstructured data (e.g., such as free-text documents, images, motion pictures, sound recordings, physical objects). • Many different types of data are delivered in many different electronic formats by different people.[2]
  • 5. Let’s look at Big Data in a different way.
  • 6. Byte Byte : one grain of rice
  • 7. Kilobyte Byte : one grain of rice Kilobyte : cup of rice
  • 8. Megabyte Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice
  • 9. Gigabyte Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks
  • 10. Terabyte Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships
  • 11. Petabyte Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan
  • 12. One ByteExabyte Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states
  • 13. Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Zettabyte
  • 14. Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Yottabyte : A EARTH SIZE RICE BALL! Yottabyte
  • 15. HobbyistByte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Yottabyte : A EARTH SIZE RICE BALL!
  • 16. Desktop HobbyistByte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Yottabyte : A EARTH SIZE RICE BALL!
  • 17. Desktop Hobbyist Internet Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Yottabyte : A EARTH SIZE RICE BALL!
  • 18. Desktop Hobbyist Internet Big Data Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Yottabyte : A EARTH SIZE RICE BALL!
  • 19. Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Yottabyte : A EARTH SIZE RICE BALL!
  • 20. Desktop Hobbyist The Future?[3] Internet Big Data Byte : one grain of rice Kilobyte : cup of rice Megabyte : 8 bags of rice Gigabyte : 3 Semi trucks Terabyte : 2 Container Ships Petabyte : Blankets Manhattan Exabyte : Blankets west coast states Zettabyte : Fills the Pacific Ocean Yottabyte : A EARTH SIZE RICE BALL!
  • 21. Volume... 100 terabytes of data are uploaded daily to Facebook ; Akamai analyses 75 million events a day to target online ads; Walmart handles 1 million customer transactions every single hour. 90% of all data ever created was generated in the past 2 years. Scale is certainly a part of what makes Big Data big. The internet-mobile revolution, bringing with it a torrent of social media updates, sensor data from devices and an explosion of e- commerce, means that every industry is swamped with data- which can be incredibly valuable, if you know how to use it. 3vs of Big Data
  • 22. Velocity… In 1999, Wal-Mart’s data warehouse stored 1,000 terabytes (1,000,000 gigabytes) of data. In 2012, it had access to over 2.5 petabytes (2,500,000 gigabytes) of data. Every minute of every day, we upload 100 hours of video on Youtube, send over 200 million emails and send 300,000 tweets. ‘Velocity’ refers to the increasing speed at which this data is created, and the increasing speed at which the data can be processed, stored and analysed by relational databases. The possibilities of processing data in real-time is an area of particular interest, which allows companies to do things like display personalised ads on the web pages you visit, based on your recent search, viewing and purchase history.
  • 23. Variety… Gone are the days when a company’s data could be neatly slotted into a table and analysed. 90% of data generated is ‘unstructured’, coming in all shapes and forms- from geo-spatial data, to tweets which can be analysed for content and sentiment, to visual data such as photos and videos. The ‘3 V’s’ certainly give us an insight into the almost unimaginable scale of data, and the break-neck speeds at which these vast datasets grow and multiply. But only ‘Variety’ really begins to scratch the surface of the depth- and crucially, the challenges- of Big Data.[4]
  • 24.
  • 25.
  • 26. Benefits of Big Data…
  • 27. High Maintenance. Skill needed to access Data. Difficult to Handle. Violates the Privacy Principle.[5] Drawbacks of Big Data...
  • 28. Government. International development  Manufacturing Cyber-Physical Models Media Technology Private sector Science and Research.[6] Applications of Big Data…
  • 29. [1].Book: "Big Data for Beginners" by Alonzo Williams,Stepanie Foor. [2].Book: "Principles of Big Data: Preparing, Sharing, and Analyzing Complex Information" by Jules J. Berman. [4].Book: "Understanding Big Data: A Beginners Guide to Data Science & the Business Applications" by Eileen McNulty-Holmes. [5].http://www.oii.ox.ac.uk/research/project/?id=98. [6].https://www.google.co.in/#q=applications+of+big+data+wikipedia. [3]. http://www.slideshare.net/dwellman/what-is-big-data-24401517?qid=6e8e2726- 6681-486c-880b-f973f6b61e2c&v=&b=&from_search=5