SlideShare a Scribd company logo
1 of 11
BY: SHIVAM JAIN & CHAITANYA BANKANHAL
SYCM,
EKLAVYA SIKSHAN SANSTHA’S POLYTECNIC,
PUNE
WHAT IS BIG DATA?
• There is no single standard definition…
• Data sets with sizes beyond the ability of commonly used software tools to capture, curate,
manage & process data with a tolerable elapsed time.
• In 2012, Gartner updated its definition as "Big data is high volume, high velocity, and/or high
variety information assets that require new forms of processing to enable enhanced decision
making, insight discovery and process optimization.“
THE 3V’S CLASSIFICATION OF BIG DATA
• Volume
The quantity of data generated
• Velocity
The rate at which the data can be transferred
• Variety
The different types of data that have to be stored.
VOLUME
• Every day…
• More than 1.5 billion shares are traded on the NYSE
• Facebook stores more than 2.6 billion likes & comments.
• Every Minute….
• McDonalds serves 2000 customers
• A new user is registered on G-mail
• Every Second….
• Banks process more than 10,000 transactions.
VELOCITY
• Data is being generated fast and needs to be processed fast.
• Late decisions → missing opportunities
Examples
• E-Promotions:- Based on your location, your purchase history, what you like → send promotions
right now for store next to you.
• Healthcare monitoring:- sensors monitoring your activities and body → any abnormal
measurements require immediate reactions.
VARIETY
• Various formats, types and structures.
• Text, numerical, images, audio, videos, sequences, time series, social media data, multi-dim
arrays, etc…
• A single application can be generated by collecting many types of data.
Advantages Limitations
Ability to make better decisions and take
meaningful actions at the right time.
Big risks on security and privacy
Cost Reduction Difficult to learn, requires expert training
to use in an organization
Technologies such as MapReduce, hive
and impala enable to run the queries
without changing the data structures
underneath.
Making relationships, applying
algorithms is very difficult
LATEST TECHNOLOGIES AND DEVELOPMENT
• Hadoop
• MapReduce
• MongoDB
Big Data

More Related Content

What's hot

20170613 iasa architecture - Tim Willoughby presentation
20170613   iasa architecture  - Tim Willoughby presentation20170613   iasa architecture  - Tim Willoughby presentation
20170613 iasa architecture - Tim Willoughby presentationTim Willoughby
 
Big Data Analytics for Dodd-Frank
Big Data Analytics for Dodd-FrankBig Data Analytics for Dodd-Frank
Big Data Analytics for Dodd-FrankDataWorks Summit
 
20170614 Tim Willoughby - data conference
20170614   Tim Willoughby - data conference20170614   Tim Willoughby - data conference
20170614 Tim Willoughby - data conferenceTim Willoughby
 
SME Breakfast Seminar - Keynote Session - The Data Landscape
SME Breakfast Seminar - Keynote Session - The Data LandscapeSME Breakfast Seminar - Keynote Session - The Data Landscape
SME Breakfast Seminar - Keynote Session - The Data LandscapeNathean Technologies
 
Gh raisoni mba 1st year class 1
Gh raisoni mba 1st year class 1Gh raisoni mba 1st year class 1
Gh raisoni mba 1st year class 1Shishant Mahato
 
Big Data for Beginners
Big Data for BeginnersBig Data for Beginners
Big Data for BeginnersMichael Perez
 
Too Much Information? What Big Data Means for the Council
Too Much Information? What Big Data Means for the CouncilToo Much Information? What Big Data Means for the Council
Too Much Information? What Big Data Means for the CouncilJoe Chapman AMIRMS
 
Big data analytics final
Big data analytics finalBig data analytics final
Big data analytics finalAmit Kumar
 
Business Intelligence Engineering - Voices 2015
Business Intelligence Engineering - Voices 2015Business Intelligence Engineering - Voices 2015
Business Intelligence Engineering - Voices 2015Deanna Kosaraju
 
Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data DATAVERSITY
 
Documaster – The true value of documents
Documaster – The true value of documentsDocumaster – The true value of documents
Documaster – The true value of documentsNXC Switzerland
 
Tim Willoughby - Ideas and Ideals on an ICT Strategy for Local Government
Tim Willoughby - Ideas and Ideals on an ICT Strategy for Local Government Tim Willoughby - Ideas and Ideals on an ICT Strategy for Local Government
Tim Willoughby - Ideas and Ideals on an ICT Strategy for Local Government Tim Willoughby
 

What's hot (16)

20170613 iasa architecture - Tim Willoughby presentation
20170613   iasa architecture  - Tim Willoughby presentation20170613   iasa architecture  - Tim Willoughby presentation
20170613 iasa architecture - Tim Willoughby presentation
 
Big Data Analytics for Dodd-Frank
Big Data Analytics for Dodd-FrankBig Data Analytics for Dodd-Frank
Big Data Analytics for Dodd-Frank
 
20170614 Tim Willoughby - data conference
20170614   Tim Willoughby - data conference20170614   Tim Willoughby - data conference
20170614 Tim Willoughby - data conference
 
SME Breakfast Seminar - Keynote Session - The Data Landscape
SME Breakfast Seminar - Keynote Session - The Data LandscapeSME Breakfast Seminar - Keynote Session - The Data Landscape
SME Breakfast Seminar - Keynote Session - The Data Landscape
 
Gh raisoni mba 1st year class 1
Gh raisoni mba 1st year class 1Gh raisoni mba 1st year class 1
Gh raisoni mba 1st year class 1
 
Big Data for Beginners
Big Data for BeginnersBig Data for Beginners
Big Data for Beginners
 
Too Much Information? What Big Data Means for the Council
Too Much Information? What Big Data Means for the CouncilToo Much Information? What Big Data Means for the Council
Too Much Information? What Big Data Means for the Council
 
Big data analytics final
Big data analytics finalBig data analytics final
Big data analytics final
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Big Data
Big DataBig Data
Big Data
 
Business Intelligence Engineering - Voices 2015
Business Intelligence Engineering - Voices 2015Business Intelligence Engineering - Voices 2015
Business Intelligence Engineering - Voices 2015
 
Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data
 
Big data
Big dataBig data
Big data
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Documaster – The true value of documents
Documaster – The true value of documentsDocumaster – The true value of documents
Documaster – The true value of documents
 
Tim Willoughby - Ideas and Ideals on an ICT Strategy for Local Government
Tim Willoughby - Ideas and Ideals on an ICT Strategy for Local Government Tim Willoughby - Ideas and Ideals on an ICT Strategy for Local Government
Tim Willoughby - Ideas and Ideals on an ICT Strategy for Local Government
 

Viewers also liked

презентацияд. к. ушинский
презентацияд. к. ушинскийпрезентацияд. к. ушинский
презентацияд. к. ушинскийvd23
 
Pengayaan pecahan-berjenjang
Pengayaan pecahan-berjenjangPengayaan pecahan-berjenjang
Pengayaan pecahan-berjenjangignaswujon
 
Grammar presentation-by
Grammar presentation-byGrammar presentation-by
Grammar presentation-byMirian Quigla
 
Nuevo documento[1]
Nuevo documento[1]Nuevo documento[1]
Nuevo documento[1]Abril Reyna
 

Viewers also liked (7)

Apuntes
ApuntesApuntes
Apuntes
 
презентацияд. к. ушинский
презентацияд. к. ушинскийпрезентацияд. к. ушинский
презентацияд. к. ушинский
 
Apuntes física.
Apuntes física.Apuntes física.
Apuntes física.
 
Pengayaan pecahan-berjenjang
Pengayaan pecahan-berjenjangPengayaan pecahan-berjenjang
Pengayaan pecahan-berjenjang
 
Itil v3
Itil v3Itil v3
Itil v3
 
Grammar presentation-by
Grammar presentation-byGrammar presentation-by
Grammar presentation-by
 
Nuevo documento[1]
Nuevo documento[1]Nuevo documento[1]
Nuevo documento[1]
 

Similar to Big Data

Big Data Analytics.pdfbgfjgjgghfhhffhdfyf
Big Data Analytics.pdfbgfjgjgghfhhffhdfyfBig Data Analytics.pdfbgfjgjgghfhhffhdfyf
Big Data Analytics.pdfbgfjgjgghfhhffhdfyfVijayKaran7
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementTony Bain
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataUmair Shafique
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigManish Chopra
 
TOPIC.pptx
TOPIC.pptxTOPIC.pptx
TOPIC.pptxinfinix8
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - IntroductionTomy Rhymond
 
Data warehouse Vs Big Data
Data warehouse Vs Big Data Data warehouse Vs Big Data
Data warehouse Vs Big Data Lisette ZOUNON
 
Group 2 Handling and Processing of big data.pptx
Group 2 Handling and Processing of big data.pptxGroup 2 Handling and Processing of big data.pptx
Group 2 Handling and Processing of big data.pptxsalutiontechnology
 
Big Data: Big Deal or Buzzword
Big Data: Big Deal or Buzzword Big Data: Big Deal or Buzzword
Big Data: Big Deal or Buzzword Hiring Solved
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxdickonsondorris
 
Big Data Analytics Materials, Chapter: 1
Big Data Analytics Materials, Chapter: 1Big Data Analytics Materials, Chapter: 1
Big Data Analytics Materials, Chapter: 1RUHULAMINHAZARIKA
 
Special issues on big data
Special issues on big dataSpecial issues on big data
Special issues on big dataVedanand Singh
 
Big data
Big dataBig data
Big dataRiya
 

Similar to Big Data (20)

bigdatappt.pptx
bigdatappt.pptxbigdatappt.pptx
bigdatappt.pptx
 
Big Data Analytics.pdfbgfjgjgghfhhffhdfyf
Big Data Analytics.pdfbgfjgjgghfhhffhdfyfBig Data Analytics.pdfbgfjgjgghfhhffhdfyf
Big Data Analytics.pdfbgfjgjgghfhhffhdfyf
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data
Big dataBig data
Big data
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 
TOPIC.pptx
TOPIC.pptxTOPIC.pptx
TOPIC.pptx
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - Introduction
 
Data warehouse Vs Big Data
Data warehouse Vs Big Data Data warehouse Vs Big Data
Data warehouse Vs Big Data
 
Group 2 Handling and Processing of big data.pptx
Group 2 Handling and Processing of big data.pptxGroup 2 Handling and Processing of big data.pptx
Group 2 Handling and Processing of big data.pptx
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big Data: Big Deal or Buzzword
Big Data: Big Deal or Buzzword Big Data: Big Deal or Buzzword
Big Data: Big Deal or Buzzword
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docx
 
Trends in data analytics
Trends in data analyticsTrends in data analytics
Trends in data analytics
 
Big Data Analytics Materials, Chapter: 1
Big Data Analytics Materials, Chapter: 1Big Data Analytics Materials, Chapter: 1
Big Data Analytics Materials, Chapter: 1
 
Special issues on big data
Special issues on big dataSpecial issues on big data
Special issues on big data
 
BigData.pptx
BigData.pptxBigData.pptx
BigData.pptx
 
Big data
Big dataBig data
Big data
 

Recently uploaded

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 

Recently uploaded (20)

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 

Big Data

  • 1. BY: SHIVAM JAIN & CHAITANYA BANKANHAL SYCM, EKLAVYA SIKSHAN SANSTHA’S POLYTECNIC, PUNE
  • 2.
  • 3. WHAT IS BIG DATA? • There is no single standard definition… • Data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage & process data with a tolerable elapsed time. • In 2012, Gartner updated its definition as "Big data is high volume, high velocity, and/or high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization.“
  • 4. THE 3V’S CLASSIFICATION OF BIG DATA • Volume The quantity of data generated • Velocity The rate at which the data can be transferred • Variety The different types of data that have to be stored.
  • 5. VOLUME • Every day… • More than 1.5 billion shares are traded on the NYSE • Facebook stores more than 2.6 billion likes & comments. • Every Minute…. • McDonalds serves 2000 customers • A new user is registered on G-mail • Every Second…. • Banks process more than 10,000 transactions.
  • 6. VELOCITY • Data is being generated fast and needs to be processed fast. • Late decisions → missing opportunities Examples • E-Promotions:- Based on your location, your purchase history, what you like → send promotions right now for store next to you. • Healthcare monitoring:- sensors monitoring your activities and body → any abnormal measurements require immediate reactions.
  • 7. VARIETY • Various formats, types and structures. • Text, numerical, images, audio, videos, sequences, time series, social media data, multi-dim arrays, etc… • A single application can be generated by collecting many types of data.
  • 8. Advantages Limitations Ability to make better decisions and take meaningful actions at the right time. Big risks on security and privacy Cost Reduction Difficult to learn, requires expert training to use in an organization Technologies such as MapReduce, hive and impala enable to run the queries without changing the data structures underneath. Making relationships, applying algorithms is very difficult
  • 9.
  • 10. LATEST TECHNOLOGIES AND DEVELOPMENT • Hadoop • MapReduce • MongoDB