SlideShare a Scribd company logo
1 of 32
Overview of Big Data
CONTENT
 What is Data?
 What is Big Data?
 What is an example of Big Data?
 Why Is Big Data Important?
 Big Data Analytics
 Benefits of Big Data Analytics
 Types of Big Data
 Characteristics of Big Data
 Primary Source of Big Data
 Big Data Tools and Software
 Big Data Mining
WHAT IS DATA?
 A collection of raw facts and figures is called Data.
 It is collected for different purposes.
WHAT IS BIG DATA?
 Big data primarily refers to data sets that are too large or complex to be dealt with by traditional
data-processing application software
 Big data is a type of data that is extremely large in size.
WHAT IS AN EXAMPLE OF BIG DATA?
The following are some Big Data examples:
 Big Data is hugely used for cyber security engineers protect networks and data from unauthorized access.
 Spotify, on-demand music-providing platform, uses Big Data Analytics collects data from all its users
around the globe, and then uses the analyzed data to give informed music recommendations and
suggestions to every individual user.
 Amazon Prime which offers, videos, music, and Kindle books in a one-stop shop is also big on using big
data.
WHY IS BIG DATA IMPORTANT?
 In the energy industry, big data helps oil and gas companies identify potential drilling locations and
monitor pipeline operations; likewise, utilities use it to track electrical grids.
 Financial services firms use big data systems for risk management and real-time analysis of market
data.
 Manufacturers and transportation companies rely on big data to manage their supply chains and
optimize delivery routes.
 Other government uses include emergency response, crime prevention and smart city initiatives.
BIG DATA ANALYTICS
Big data analytics refers to collecting, processing, cleaning, and analyzing large datasets to help
organizations operationalize their big data.
1. Collect Data
2. Process Data
3. Clean Data
4. Analyze Data
BENEFITS OF BIG DATA ANALYTICS
Some benefits of big data analytics include:
 Cost savings. Helping organizations identify ways to do business more efficiently
 Product development. Providing a better understanding of customer needs
 Market insights. Tracking purchase behavior and market trends
TYPES OF BIG DATA
 Following are the types of Big Data:
 Structured
 Unstructured
 Semi-structured
STRUCTURED
 Structured Data is used to refer to the data which is already stored in databases, in an ordered
manner.
 There are two sources of structured data;
 Human-Generated
 Machine-Generated
STRUCTURED
UN-STRUCTURED
 Unstructured data is defined as any data with an unknown form or structure.
 A typical example of unstructured data is a heterogeneous data source containing a combination of
simple text files, images, videos etc
UN-STRUCTURED
SEMI-STRUCTURED
 Semi-structured data can contain both types of information.
 Semi-structured data appears to be structured, but it is not defined in the same way that a table
definition in a relational DBMS is.
 A data representation in an XML file is an example of semi-structured data.
SEMI-STRUCTURED
CHARACTERISTICS OF BIG DATA
VOLUME
 The name Big Data itself is related to a size which is enormous.
 Size of data plays a very crucial role in determining value out of data. Also, whether a particular
data can actually be considered as a Big Data or not, is dependent upon the volume of data.
 Hence, Volume is one characteristic which needs to be considered while dealing with Big Data
solutions.
 For example;
Organizational data
Social media data
VELOCITY
 The term ‘velocity’ refers to the speed of generation of data.
 How fast the data is generated and processed to meet the demands, determines real potential in
the data.
 Big Data Velocity deals with the speed at which data flows in from sources like business processes,
application logs, networks, and social media sites, sensors, Mobile devices, etc.
 The flow of data is massive and continuous.
VERACITY
 When we are dealing with a high volume, velocity and variety of data, it is not possible that all of
the data is going to be 100% correct, there will be dirty data.
 The quality of the data being captured can vary greatly.
 The data accuracy of analysis depends on the veracity of the source data.
VALUE
 Value is the most important aspect in the big data.
 Though, the potential value of the big data is huge.
 It is all well and good having access to big data but unless we can turn it into value it is become
useless.
 It becomes very costly to implement IT infrastructure systems to store big data, and businesses are
going to require a return on investment.
VARIETY
 Big data is not always structured data and it is not always easy to put big data into a relational
database.
 This means that the category to which Big Data belongs to is also a very essential fact that needs to
be known by the data analysis.
 Dealing with a variety of structured and unstructured data greatly increases the complexity of both
storing and analyzing Big Data.
 90% of data generated is data is in unstructured form.
PRIMARY SOURCE OF BIG DATA
 Primary sources of Big Data are;
 Social Data
 Machine Data
 Transactional Data
SOCIAL DATA
 Social data comes from the Likes, Tweets & Retweets, Comments, Video Uploads, and general
media that are uploaded and shared via the world’s favorite social media platforms.
 This kind of data provides invaluable insights into consumer behavior and sentiment and can be
enormously influential in marketing analytics.
 The public web is another good source of social data, and tools like Google Trends can be used to
good effect to increase the volume of big data.
MACHINE DATA
 Machine data is defined as information which is generated by industrial equipment, sensors that are
installed in machinery, and even web logs which track user behavior.
 This type of data is expected to grow exponentially as the internet of things grows ever more
universal and expands around the world.
 Sensors such as medical devices, smart meters, road cameras, satellites, games and the rapidly
growing Internet Of Things will deliver high velocity, value, volume and variety of data in the very
near future.
TRANSACTIONAL DATA
 Transactional data is generated from all the daily transactions that take place both online and
offline.
 Invoices, payment orders, storage records, delivery receipts – all are characterized as transactional
data yet data alone is almost meaningless, and most organizations struggle to make sense of the
data that they are generating and how it can be put to good use.
BIG DATA TOOLS AND SOFTWARE
• Hadoop
• Atlas.it
• HPCC
• Storm
• Cassandra
• Kaggle
• CouchDB
• Pentaho
BIG DATA MINING
 Big data mining is referred to the collective data mining or extraction techniques that are
performed on large sets /volume of data or the big data.
 Big data mining is primarily done to extract and retrieve desired information or pattern from
humongous quantity of data.
 Big data mining works on data searching, refinement , extraction and comparison algorithms.
Big data and Machine Learning
 Big Data and Machine Learning have become the reason behind the success of various industries. Both
these technologies are becoming popular day by day among all data scientists and professionals. Big
data is a term that is used to describe large, hard-to-manage, structured, and unstructured voluminous
data. Whereas, Machine learning is a subfield of Artificial Intelligence that enables machines to
automatically learn and improve from experience/past data.
 Both Machine learning and big data technologies are being used together by most companies because it
becomes difficult for the companies to manage, store, and process the collected data efficiently; hence in such a
case, Machine learning helps them.
Relationship between AI and big data
 By bringing together big data and AI technology, companies can improve business performance
and efficiency by:
 Anticipating and capitalizing on emerging industry and market trends.
 Analyzing consumer behavior and automating customer segmentation
 Personalizing and optimizing the performance of digital marketing campaigns
 Using intelligent decision support systems fueled by big data, AI, and predictive analytics
BIG DATA AND AI
BIG DATA is being used in AI
Learn how to get big value from big data
 Dive deeper on big data
How is AI used with big data?
 AI makes big data analytics simpler by automating and enhancing data preparation, data
visualization, predictive modeling, and other complex analytical tasks that would otherwise be
labor-intensive and time-consuming. AI helps users work with, manipulate, and surface actionable
insights faster from large, complex datasets.
Future Direction
 BIG DATA will be further
collected to improve our models
and improve our AI models.
 Big data will help us in
Our future projects as well.

More Related Content

Similar to new.pptx

Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChicago Hadoop Users Group
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfPridesys IT Ltd.
 
What is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsWhat is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsShilpaKrishna6
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to knowV2Soft
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellenceMudit Mangal
 
Guide to big data analytics
Guide to big data analyticsGuide to big data analytics
Guide to big data analyticsGahya Pandian
 
What exactly is big data? What exactly is big data? .pptx
What exactly is big data? What exactly is big data? .pptxWhat exactly is big data? What exactly is big data? .pptx
What exactly is big data? What exactly is big data? .pptxTusharSengar6
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICSNAGARAJAGIDDE
 
Data set Introduction to Big Data
Data set   Introduction to Big DataData set   Introduction to Big Data
Data set Introduction to Big DataData-Set
 
Data set module 1
Data set   module 1Data set   module 1
Data set module 1Data-Set
 
Policy paper need for focussed big data & analytics skillset building throu...
Policy  paper  need for focussed big data & analytics skillset building throu...Policy  paper  need for focussed big data & analytics skillset building throu...
Policy paper need for focussed big data & analytics skillset building throu...Ritesh Shrivastava
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big DataIRJET Journal
 
Data Mining: The Top 3 Things You Need to Know to Achieve Business Improvemen...
Data Mining: The Top 3 Things You Need to Know to Achieve Business Improvemen...Data Mining: The Top 3 Things You Need to Know to Achieve Business Improvemen...
Data Mining: The Top 3 Things You Need to Know to Achieve Business Improvemen...Dr. Cedric Alford
 

Similar to new.pptx (20)

Choosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your BusinessChoosing the Right Big Data Architecture for your Business
Choosing the Right Big Data Architecture for your Business
 
What Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdfWhat Is Big Data How Big Data Works.pdf
What Is Big Data How Big Data Works.pdf
 
Bidata
BidataBidata
Bidata
 
Difference b/w DataScience, Data Analyst
Difference b/w DataScience, Data AnalystDifference b/w DataScience, Data Analyst
Difference b/w DataScience, Data Analyst
 
What is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsWhat is big data ? | Big Data Applications
What is big data ? | Big Data Applications
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to know
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellence
 
Guide to big data analytics
Guide to big data analyticsGuide to big data analytics
Guide to big data analytics
 
What exactly is big data? What exactly is big data? .pptx
What exactly is big data? What exactly is big data? .pptxWhat exactly is big data? What exactly is big data? .pptx
What exactly is big data? What exactly is big data? .pptx
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
 
Data set Introduction to Big Data
Data set   Introduction to Big DataData set   Introduction to Big Data
Data set Introduction to Big Data
 
130214 copy
130214   copy130214   copy
130214 copy
 
Data set module 1
Data set   module 1Data set   module 1
Data set module 1
 
Policy paper need for focussed big data & analytics skillset building throu...
Policy  paper  need for focussed big data & analytics skillset building throu...Policy  paper  need for focussed big data & analytics skillset building throu...
Policy paper need for focussed big data & analytics skillset building throu...
 
Big data-ppt-
Big data-ppt-Big data-ppt-
Big data-ppt-
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
 
Big data-ppt
Big data-pptBig data-ppt
Big data-ppt
 
Data Mining: The Top 3 Things You Need to Know to Achieve Business Improvemen...
Data Mining: The Top 3 Things You Need to Know to Achieve Business Improvemen...Data Mining: The Top 3 Things You Need to Know to Achieve Business Improvemen...
Data Mining: The Top 3 Things You Need to Know to Achieve Business Improvemen...
 

More from salutiontechnology

Ch1 Cryptography network security slides.pptx
Ch1 Cryptography network security slides.pptxCh1 Cryptography network security slides.pptx
Ch1 Cryptography network security slides.pptxsalutiontechnology
 
Lecture 1 database system notes full.pptx
Lecture 1 database system notes full.pptxLecture 1 database system notes full.pptx
Lecture 1 database system notes full.pptxsalutiontechnology
 
databasesystemsconollyslide1-151102101031-lva1-app6892.pptx
databasesystemsconollyslide1-151102101031-lva1-app6892.pptxdatabasesystemsconollyslide1-151102101031-lva1-app6892.pptx
databasesystemsconollyslide1-151102101031-lva1-app6892.pptxsalutiontechnology
 
Intrusion detection system and intrusion prevention system
Intrusion detection system and intrusion prevention systemIntrusion detection system and intrusion prevention system
Intrusion detection system and intrusion prevention systemsalutiontechnology
 
smart grid, traditional power grids.pptx
smart grid, traditional power grids.pptxsmart grid, traditional power grids.pptx
smart grid, traditional power grids.pptxsalutiontechnology
 
Information security software security presentation.pptx
Information security software security presentation.pptxInformation security software security presentation.pptx
Information security software security presentation.pptxsalutiontechnology
 
Key Management, key management three tools ,
Key Management, key management three tools ,Key Management, key management three tools ,
Key Management, key management three tools ,salutiontechnology
 
imageenhancementtechniques-140316011049-phpapp01 (1).pptx
imageenhancementtechniques-140316011049-phpapp01 (1).pptximageenhancementtechniques-140316011049-phpapp01 (1).pptx
imageenhancementtechniques-140316011049-phpapp01 (1).pptxsalutiontechnology
 
Big data analytics with R tool.pptx
Big data analytics with R tool.pptxBig data analytics with R tool.pptx
Big data analytics with R tool.pptxsalutiontechnology
 
Group 2 Handling and Processing of big data.pptx
Group 2 Handling and Processing of big data.pptxGroup 2 Handling and Processing of big data.pptx
Group 2 Handling and Processing of big data.pptxsalutiontechnology
 

More from salutiontechnology (14)

Ch1 Cryptography network security slides.pptx
Ch1 Cryptography network security slides.pptxCh1 Cryptography network security slides.pptx
Ch1 Cryptography network security slides.pptx
 
Lecture 1 database system notes full.pptx
Lecture 1 database system notes full.pptxLecture 1 database system notes full.pptx
Lecture 1 database system notes full.pptx
 
databasesystemsconollyslide1-151102101031-lva1-app6892.pptx
databasesystemsconollyslide1-151102101031-lva1-app6892.pptxdatabasesystemsconollyslide1-151102101031-lva1-app6892.pptx
databasesystemsconollyslide1-151102101031-lva1-app6892.pptx
 
Intrusion detection system and intrusion prevention system
Intrusion detection system and intrusion prevention systemIntrusion detection system and intrusion prevention system
Intrusion detection system and intrusion prevention system
 
smart grid, traditional power grids.pptx
smart grid, traditional power grids.pptxsmart grid, traditional power grids.pptx
smart grid, traditional power grids.pptx
 
Information security software security presentation.pptx
Information security software security presentation.pptxInformation security software security presentation.pptx
Information security software security presentation.pptx
 
Key Management, key management three tools ,
Key Management, key management three tools ,Key Management, key management three tools ,
Key Management, key management three tools ,
 
Lec2.pptx
Lec2.pptxLec2.pptx
Lec2.pptx
 
Distributed Systems.pptx
Distributed Systems.pptxDistributed Systems.pptx
Distributed Systems.pptx
 
3.pptx
3.pptx3.pptx
3.pptx
 
imageenhancementtechniques-140316011049-phpapp01 (1).pptx
imageenhancementtechniques-140316011049-phpapp01 (1).pptximageenhancementtechniques-140316011049-phpapp01 (1).pptx
imageenhancementtechniques-140316011049-phpapp01 (1).pptx
 
aip.pptx
aip.pptxaip.pptx
aip.pptx
 
Big data analytics with R tool.pptx
Big data analytics with R tool.pptxBig data analytics with R tool.pptx
Big data analytics with R tool.pptx
 
Group 2 Handling and Processing of big data.pptx
Group 2 Handling and Processing of big data.pptxGroup 2 Handling and Processing of big data.pptx
Group 2 Handling and Processing of big data.pptx
 

Recently uploaded

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 

Recently uploaded (20)

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 

new.pptx

  • 2. CONTENT  What is Data?  What is Big Data?  What is an example of Big Data?  Why Is Big Data Important?  Big Data Analytics  Benefits of Big Data Analytics  Types of Big Data  Characteristics of Big Data  Primary Source of Big Data  Big Data Tools and Software  Big Data Mining
  • 3. WHAT IS DATA?  A collection of raw facts and figures is called Data.  It is collected for different purposes.
  • 4. WHAT IS BIG DATA?  Big data primarily refers to data sets that are too large or complex to be dealt with by traditional data-processing application software  Big data is a type of data that is extremely large in size.
  • 5. WHAT IS AN EXAMPLE OF BIG DATA? The following are some Big Data examples:  Big Data is hugely used for cyber security engineers protect networks and data from unauthorized access.  Spotify, on-demand music-providing platform, uses Big Data Analytics collects data from all its users around the globe, and then uses the analyzed data to give informed music recommendations and suggestions to every individual user.  Amazon Prime which offers, videos, music, and Kindle books in a one-stop shop is also big on using big data.
  • 6. WHY IS BIG DATA IMPORTANT?  In the energy industry, big data helps oil and gas companies identify potential drilling locations and monitor pipeline operations; likewise, utilities use it to track electrical grids.  Financial services firms use big data systems for risk management and real-time analysis of market data.  Manufacturers and transportation companies rely on big data to manage their supply chains and optimize delivery routes.  Other government uses include emergency response, crime prevention and smart city initiatives.
  • 7. BIG DATA ANALYTICS Big data analytics refers to collecting, processing, cleaning, and analyzing large datasets to help organizations operationalize their big data. 1. Collect Data 2. Process Data 3. Clean Data 4. Analyze Data
  • 8. BENEFITS OF BIG DATA ANALYTICS Some benefits of big data analytics include:  Cost savings. Helping organizations identify ways to do business more efficiently  Product development. Providing a better understanding of customer needs  Market insights. Tracking purchase behavior and market trends
  • 9. TYPES OF BIG DATA  Following are the types of Big Data:  Structured  Unstructured  Semi-structured
  • 10. STRUCTURED  Structured Data is used to refer to the data which is already stored in databases, in an ordered manner.  There are two sources of structured data;  Human-Generated  Machine-Generated
  • 12. UN-STRUCTURED  Unstructured data is defined as any data with an unknown form or structure.  A typical example of unstructured data is a heterogeneous data source containing a combination of simple text files, images, videos etc
  • 14. SEMI-STRUCTURED  Semi-structured data can contain both types of information.  Semi-structured data appears to be structured, but it is not defined in the same way that a table definition in a relational DBMS is.  A data representation in an XML file is an example of semi-structured data.
  • 17. VOLUME  The name Big Data itself is related to a size which is enormous.  Size of data plays a very crucial role in determining value out of data. Also, whether a particular data can actually be considered as a Big Data or not, is dependent upon the volume of data.  Hence, Volume is one characteristic which needs to be considered while dealing with Big Data solutions.  For example; Organizational data Social media data
  • 18. VELOCITY  The term ‘velocity’ refers to the speed of generation of data.  How fast the data is generated and processed to meet the demands, determines real potential in the data.  Big Data Velocity deals with the speed at which data flows in from sources like business processes, application logs, networks, and social media sites, sensors, Mobile devices, etc.  The flow of data is massive and continuous.
  • 19. VERACITY  When we are dealing with a high volume, velocity and variety of data, it is not possible that all of the data is going to be 100% correct, there will be dirty data.  The quality of the data being captured can vary greatly.  The data accuracy of analysis depends on the veracity of the source data.
  • 20. VALUE  Value is the most important aspect in the big data.  Though, the potential value of the big data is huge.  It is all well and good having access to big data but unless we can turn it into value it is become useless.  It becomes very costly to implement IT infrastructure systems to store big data, and businesses are going to require a return on investment.
  • 21. VARIETY  Big data is not always structured data and it is not always easy to put big data into a relational database.  This means that the category to which Big Data belongs to is also a very essential fact that needs to be known by the data analysis.  Dealing with a variety of structured and unstructured data greatly increases the complexity of both storing and analyzing Big Data.  90% of data generated is data is in unstructured form.
  • 22. PRIMARY SOURCE OF BIG DATA  Primary sources of Big Data are;  Social Data  Machine Data  Transactional Data
  • 23. SOCIAL DATA  Social data comes from the Likes, Tweets & Retweets, Comments, Video Uploads, and general media that are uploaded and shared via the world’s favorite social media platforms.  This kind of data provides invaluable insights into consumer behavior and sentiment and can be enormously influential in marketing analytics.  The public web is another good source of social data, and tools like Google Trends can be used to good effect to increase the volume of big data.
  • 24. MACHINE DATA  Machine data is defined as information which is generated by industrial equipment, sensors that are installed in machinery, and even web logs which track user behavior.  This type of data is expected to grow exponentially as the internet of things grows ever more universal and expands around the world.  Sensors such as medical devices, smart meters, road cameras, satellites, games and the rapidly growing Internet Of Things will deliver high velocity, value, volume and variety of data in the very near future.
  • 25. TRANSACTIONAL DATA  Transactional data is generated from all the daily transactions that take place both online and offline.  Invoices, payment orders, storage records, delivery receipts – all are characterized as transactional data yet data alone is almost meaningless, and most organizations struggle to make sense of the data that they are generating and how it can be put to good use.
  • 26. BIG DATA TOOLS AND SOFTWARE • Hadoop • Atlas.it • HPCC • Storm • Cassandra • Kaggle • CouchDB • Pentaho
  • 27. BIG DATA MINING  Big data mining is referred to the collective data mining or extraction techniques that are performed on large sets /volume of data or the big data.  Big data mining is primarily done to extract and retrieve desired information or pattern from humongous quantity of data.  Big data mining works on data searching, refinement , extraction and comparison algorithms.
  • 28. Big data and Machine Learning  Big Data and Machine Learning have become the reason behind the success of various industries. Both these technologies are becoming popular day by day among all data scientists and professionals. Big data is a term that is used to describe large, hard-to-manage, structured, and unstructured voluminous data. Whereas, Machine learning is a subfield of Artificial Intelligence that enables machines to automatically learn and improve from experience/past data.  Both Machine learning and big data technologies are being used together by most companies because it becomes difficult for the companies to manage, store, and process the collected data efficiently; hence in such a case, Machine learning helps them.
  • 29. Relationship between AI and big data  By bringing together big data and AI technology, companies can improve business performance and efficiency by:  Anticipating and capitalizing on emerging industry and market trends.  Analyzing consumer behavior and automating customer segmentation  Personalizing and optimizing the performance of digital marketing campaigns  Using intelligent decision support systems fueled by big data, AI, and predictive analytics
  • 30. BIG DATA AND AI BIG DATA is being used in AI Learn how to get big value from big data  Dive deeper on big data
  • 31. How is AI used with big data?  AI makes big data analytics simpler by automating and enhancing data preparation, data visualization, predictive modeling, and other complex analytical tasks that would otherwise be labor-intensive and time-consuming. AI helps users work with, manipulate, and surface actionable insights faster from large, complex datasets.
  • 32. Future Direction  BIG DATA will be further collected to improve our models and improve our AI models.  Big data will help us in Our future projects as well.