SlideShare a Scribd company logo
1 of 16
Mrs. D. Suja Mary
Assistant Professor
Nanjil Catholic College of Arts and Science,
Kaliyakkavilai
What is Data?
 The quantities, characters, or symbols on which
operations are performed by a computer, which
may be stored and transmitted in the form of
electrical signals and recorded on magnetic,
optical, or mechanical recording media.
What is Big Data?
 Big Data is a collection of data that is huge in
volume, yet growing exponentially with time. It
is a data with so large size and complexity that
none of traditional data management tools can
store it or process it efficiently. Big data is also a
data but with huge size.
What Is Data Analytics?
 The term data analytics refers to the process of
examining datasets to draw conclusions about the
information they contain. Data analytic techniques
enable you to take raw data and uncover patterns to
extract valuable insights from it.
 Data Scientists and Analysts use data analytics
techniques in their research, and businesses also use
it to inform their decisions.
 Data analysis can help companies better understand
their customers, evaluate their ad campaigns,
personalize content, create content strategies and
develop products.
 businesses can use data analytics to boost business
performance and improve their bottom line.
 Analysis techniques give businesses access to insights that can help
them to improve their performance.
 As the importance of data analytics in the business world
increases, it becomes more critical that our company understand
how to implement it. Some benefits of data analytics include:
 1. Improved Decision Making
Companies can use the insights they gain from data analytics to
inform their decisions, leading to better outcomes.
 2. More Effective Marketing
Data analytics also gives us useful insights into how our campaigns
are performing so that can fine-tune them for optimal outcomes.
 3. Better Customer Service
Data analytics provide us with more insights into our customers,
allowing us to tailor customer service to their needs, provide more
personalization and build stronger relationships with them.
 4. More Efficient Operations
Data analytics can help us streamline our processes, save money
and boost our bottom line. When we have an improved
understanding of what our audience wants, we waste less time on
creating ads and content that don’t match our audience’s
interests.
 It is an organized collection of structured data. It is a
collection of related information.
 DB stores and access data electronically.
 A database is stored as a file or a set of files on magnetic
disk or tape, optical disk, or some other secondary storage
device.
 It is an data structure that stores organized information.
 They are administrated to facilitate the storage of data,
retrieval of data, modification of data, and deletion of
data.
 It allows processing various data-processing operations.
 Databases bolster stockpiling and control of information.
 Databases make information administration simple.
 Any database developer with certain sets of syntax can
process can work on the database
 A DB is a collection of related data. There are
two types of databases – Relation Database
Management System while other is Non –
Relational Database Management System.
 If we are storing and capable of processing a
very huge volume of data in databases,
Definitely we can store and process Big Data
through relational or Non-relational Databases.
 Big data is not going to replace databases. In one
form or other we will be using SQL databases to
store and process Big Data. In this regard, Big
Data is completely separate from DB.
Given below is the difference between Big Data and Database:
 Big Data is a term applied to data sets whose size or type is beyond
the ability of traditional relational databases. A traditional
database is not able to capture, manage, and process the high
volume of data with low-latency While Database is a collection of
information that is organized so that it can be easily captured,
accessed, managed and updated.
 Big Data refers to technologies and initiatives that involve data
that is too diverse i.e. varieties, rapid-changing or massive for
skills, conventional technologies, and infrastructure to address
efficiently While Database management system (DBMS) extracts
information from the database in response to queries but it in
restricted conditions.
 There can be any varieties of data while DB can be defined through
some schema.
 It is difficult to store and process while Databases like SQL, data
can be easily stored and process.
 Raw data is the data that is collected from a
source, but in its initial state. It has not yet
been processed — or cleaned, organized, and
visually presented. Raw data can be
manually written down or typed, recorded,
or automatically input by a machine. You can
find raw data in a variety of places, including
databases, files, spreadsheets, and even on
source devices, such as a camera.
 Data analysts, software, and artificial
intelligence (AI) all work to transform raw data
into processed data.
 They start by organizing and cleaning the raw
data. One of the most important parts of this
process is removing outliers and duplicates within
the data set.
 The next step is an initial analysis that may
involve data manipulation. Especially if analysts
are analyzing raw data based on human responses
to a question, they will look closely at those
responses and determine if respondents
inaccurately replied to the question in a way that
will change the results.
 Raw data serves several purposes, particularly in businesses
where full data visibility is key to statistical and predictive
analytics.
Here are a few reasons why businesses heavily rely on raw data
sources:
 Raw data is the starting phase of all data and the initial source of
data-based decisions. You can’t make visually compelling charts
or overarching analytical statements about processed data until
we’ve worked through all of the raw data.
 We can trust the integrity of raw data. We don’t have to worry
that something has been removed or adjusted, because the
format has not yet been manipulated by humans or machines.
 AI and machine learning methods can only analyze data in a raw
format. Once the data has been processed, it is illegible to these
technologies.
 Raw data gives you a backup resource. We can check our work
and go back to the source after processing and manipulating our
data sets. It’s all there for your reference if we run into a
problem and need a new analysis.
All data inside a computer is transmitted as a
series of electrical signals that are either on
or off. Therefore, in order for a computer to
be able to process any kind of data, including
text, images and sound, they must be
converted into binary form.
 Data Representation Types of data: – Numbers – Text – Images
– Audio & Video
 Text
 When any key on a keyboard is pressed, it needs to be
converted into a binary number so that it can be processed by
the computer and the typed character can appear on the
screen.
 A code where each number represents a character can be used
to convert text into binary. One code we can use for this is
called ASCII. The ASCII code takes each character on the
keyboard and assigns it a binary number.
 Images also need to be converted into binary in
order for a computer to process them so that
they can be seen on our screen. Digital images
are made up of pixels. Each pixel in an image is
made up of binary numbers.
 If we say that 1 is black (or on) and 0 is white (or
off), then a simple black and white picture can
be created using binary.
 The terms audio and video commonly refers
to the time-based media storage format for
sound/music and moving pictures
information. Audio and video digital
recording, also referred as audio and video
codecs, can be uncompressed, lossless
compressed, or lossy compressed depending
on the desired quality and use cases.
 Connected objects are another source of raw
data, which retrieves a large amount of data
through their sensors.
 The Internet of Things (IoT) contributes to
double the size of the digital universe every 2
years, which could be 44,000 billion gigabytes in
2020, 10 times more than in 2013
 The connected object thus allows extend the
scope of internet allowing any object, machine
or living element to transmit information about
its environment and eventually be activated
remotely.
Analyzing Raw Data Sources

More Related Content

What's hot

Introduction to Real-time data processing
Introduction to Real-time data processingIntroduction to Real-time data processing
Introduction to Real-time data processingYogi Devendra Vyavahare
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consultingadivasoft
 
Client server-computing
Client server-computingClient server-computing
Client server-computingjayasreep3
 
Online analytical processing (olap) tools
Online analytical processing (olap) toolsOnline analytical processing (olap) tools
Online analytical processing (olap) toolskulkarnivaibhav
 
Memory Organization of a Computer System
Memory Organization of a Computer SystemMemory Organization of a Computer System
Memory Organization of a Computer SystemTaminul Islam
 
Big Data Open Source Technologies
Big Data Open Source TechnologiesBig Data Open Source Technologies
Big Data Open Source Technologiesneeraj rathore
 
Query mechanisms for NoSQL databases
Query mechanisms for NoSQL databasesQuery mechanisms for NoSQL databases
Query mechanisms for NoSQL databasesArangoDB Database
 
Data warehousing and data mart
Data warehousing and data martData warehousing and data mart
Data warehousing and data martAmit Sarkar
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etlAashish Rathod
 
البنية النجمية
البنية النجميةالبنية النجمية
البنية النجميةtechalsaadi
 
DeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun Kim
DeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun KimDeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun Kim
DeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun KimGruter
 
01. Introduction to Data Mining and BI
01. Introduction to Data Mining and BI01. Introduction to Data Mining and BI
01. Introduction to Data Mining and BIAchmad Solichin
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining ConceptsDung Nguyen
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
Fundamentos de DataWareHouse - FISI - UNMSM - DataWareHouse
Fundamentos de DataWareHouse - FISI - UNMSM - DataWareHouseFundamentos de DataWareHouse - FISI - UNMSM - DataWareHouse
Fundamentos de DataWareHouse - FISI - UNMSM - DataWareHouseJulio Pari
 

What's hot (20)

Big data Analytics Hadoop
Big data Analytics HadoopBig data Analytics Hadoop
Big data Analytics Hadoop
 
Introduction to Real-time data processing
Introduction to Real-time data processingIntroduction to Real-time data processing
Introduction to Real-time data processing
 
Basic Introduction of Data Warehousing from Adiva Consulting
Basic Introduction of  Data Warehousing from Adiva ConsultingBasic Introduction of  Data Warehousing from Adiva Consulting
Basic Introduction of Data Warehousing from Adiva Consulting
 
Client server-computing
Client server-computingClient server-computing
Client server-computing
 
Online analytical processing (olap) tools
Online analytical processing (olap) toolsOnline analytical processing (olap) tools
Online analytical processing (olap) tools
 
Memory Organization of a Computer System
Memory Organization of a Computer SystemMemory Organization of a Computer System
Memory Organization of a Computer System
 
Thinking big
Thinking bigThinking big
Thinking big
 
Sqoop
SqoopSqoop
Sqoop
 
Big Data Open Source Technologies
Big Data Open Source TechnologiesBig Data Open Source Technologies
Big Data Open Source Technologies
 
Query mechanisms for NoSQL databases
Query mechanisms for NoSQL databasesQuery mechanisms for NoSQL databases
Query mechanisms for NoSQL databases
 
Data warehousing and data mart
Data warehousing and data martData warehousing and data mart
Data warehousing and data mart
 
data warehousing
data warehousingdata warehousing
data warehousing
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
البنية النجمية
البنية النجميةالبنية النجمية
البنية النجمية
 
DeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun Kim
DeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun KimDeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun Kim
DeView2013 Big Data Platform Architecture with Hadoop - Hyeong-jun Kim
 
Active database
Active databaseActive database
Active database
 
01. Introduction to Data Mining and BI
01. Introduction to Data Mining and BI01. Introduction to Data Mining and BI
01. Introduction to Data Mining and BI
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Fundamentos de DataWareHouse - FISI - UNMSM - DataWareHouse
Fundamentos de DataWareHouse - FISI - UNMSM - DataWareHouseFundamentos de DataWareHouse - FISI - UNMSM - DataWareHouse
Fundamentos de DataWareHouse - FISI - UNMSM - DataWareHouse
 

Similar to Analyzing Raw Data Sources

Business Intelligence
Business IntelligenceBusiness Intelligence
Business IntelligenceSukirti Garg
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data scienceJohnson Ubah
 
Chapter 2 - Intro to Data Sciences[2].pptx
Chapter 2 - Intro to Data Sciences[2].pptxChapter 2 - Intro to Data Sciences[2].pptx
Chapter 2 - Intro to Data Sciences[2].pptxJethroDignadice2
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data Shallote Dsouza
 
C21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxC21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxAdityaDeshpande674450
 
IRJET- Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
IRJET-  	  Comparative Study of Efficacy of Big Data Analysis and Deep Learni...IRJET-  	  Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
IRJET- Comparative Study of Efficacy of Big Data Analysis and Deep Learni...IRJET Journal
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 
Dataware housing
Dataware housingDataware housing
Dataware housingwork
 
Harness the power of data
Harness the power of dataHarness the power of data
Harness the power of dataHarsha MV
 
Using Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsUsing Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsIrshadKhan682442
 
Using Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsUsing Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsWilliamJohnson288536
 
Using Data Lakes To Sail Through Your Sales Goals
Using Data Lakes To Sail Through Your Sales GoalsUsing Data Lakes To Sail Through Your Sales Goals
Using Data Lakes To Sail Through Your Sales GoalsKevinJohnson667312
 
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEMWHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEMRajaraj64
 

Similar to Analyzing Raw Data Sources (20)

Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Big Data.pptx
Big Data.pptxBig Data.pptx
Big Data.pptx
 
1 UNIT-DSP.pptx
1 UNIT-DSP.pptx1 UNIT-DSP.pptx
1 UNIT-DSP.pptx
 
Data Science
Data ScienceData Science
Data Science
 
Chapter 2 - Intro to Data Sciences[2].pptx
Chapter 2 - Intro to Data Sciences[2].pptxChapter 2 - Intro to Data Sciences[2].pptx
Chapter 2 - Intro to Data Sciences[2].pptx
 
Bigdata
Bigdata Bigdata
Bigdata
 
U - 2 Emerging.pptx
U - 2 Emerging.pptxU - 2 Emerging.pptx
U - 2 Emerging.pptx
 
365 Data Science
365 Data Science365 Data Science
365 Data Science
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
C21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxC21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptx
 
IRJET- Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
IRJET-  	  Comparative Study of Efficacy of Big Data Analysis and Deep Learni...IRJET-  	  Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
IRJET- Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Harness the power of data
Harness the power of dataHarness the power of data
Harness the power of data
 
Using Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsUsing Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales Goals
 
Using Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsUsing Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales Goals
 
Using Data Lakes To Sail Through Your Sales Goals
Using Data Lakes To Sail Through Your Sales GoalsUsing Data Lakes To Sail Through Your Sales Goals
Using Data Lakes To Sail Through Your Sales Goals
 
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEMWHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
 

Recently uploaded

RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 

Recently uploaded (20)

RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 

Analyzing Raw Data Sources

  • 1. Mrs. D. Suja Mary Assistant Professor Nanjil Catholic College of Arts and Science, Kaliyakkavilai
  • 2. What is Data?  The quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media. What is Big Data?  Big Data is a collection of data that is huge in volume, yet growing exponentially with time. It is a data with so large size and complexity that none of traditional data management tools can store it or process it efficiently. Big data is also a data but with huge size.
  • 3. What Is Data Analytics?  The term data analytics refers to the process of examining datasets to draw conclusions about the information they contain. Data analytic techniques enable you to take raw data and uncover patterns to extract valuable insights from it.  Data Scientists and Analysts use data analytics techniques in their research, and businesses also use it to inform their decisions.  Data analysis can help companies better understand their customers, evaluate their ad campaigns, personalize content, create content strategies and develop products.  businesses can use data analytics to boost business performance and improve their bottom line.
  • 4.  Analysis techniques give businesses access to insights that can help them to improve their performance.  As the importance of data analytics in the business world increases, it becomes more critical that our company understand how to implement it. Some benefits of data analytics include:  1. Improved Decision Making Companies can use the insights they gain from data analytics to inform their decisions, leading to better outcomes.  2. More Effective Marketing Data analytics also gives us useful insights into how our campaigns are performing so that can fine-tune them for optimal outcomes.  3. Better Customer Service Data analytics provide us with more insights into our customers, allowing us to tailor customer service to their needs, provide more personalization and build stronger relationships with them.  4. More Efficient Operations Data analytics can help us streamline our processes, save money and boost our bottom line. When we have an improved understanding of what our audience wants, we waste less time on creating ads and content that don’t match our audience’s interests.
  • 5.  It is an organized collection of structured data. It is a collection of related information.  DB stores and access data electronically.  A database is stored as a file or a set of files on magnetic disk or tape, optical disk, or some other secondary storage device.  It is an data structure that stores organized information.  They are administrated to facilitate the storage of data, retrieval of data, modification of data, and deletion of data.  It allows processing various data-processing operations.  Databases bolster stockpiling and control of information.  Databases make information administration simple.  Any database developer with certain sets of syntax can process can work on the database
  • 6.  A DB is a collection of related data. There are two types of databases – Relation Database Management System while other is Non – Relational Database Management System.  If we are storing and capable of processing a very huge volume of data in databases, Definitely we can store and process Big Data through relational or Non-relational Databases.  Big data is not going to replace databases. In one form or other we will be using SQL databases to store and process Big Data. In this regard, Big Data is completely separate from DB.
  • 7. Given below is the difference between Big Data and Database:  Big Data is a term applied to data sets whose size or type is beyond the ability of traditional relational databases. A traditional database is not able to capture, manage, and process the high volume of data with low-latency While Database is a collection of information that is organized so that it can be easily captured, accessed, managed and updated.  Big Data refers to technologies and initiatives that involve data that is too diverse i.e. varieties, rapid-changing or massive for skills, conventional technologies, and infrastructure to address efficiently While Database management system (DBMS) extracts information from the database in response to queries but it in restricted conditions.  There can be any varieties of data while DB can be defined through some schema.  It is difficult to store and process while Databases like SQL, data can be easily stored and process.
  • 8.  Raw data is the data that is collected from a source, but in its initial state. It has not yet been processed — or cleaned, organized, and visually presented. Raw data can be manually written down or typed, recorded, or automatically input by a machine. You can find raw data in a variety of places, including databases, files, spreadsheets, and even on source devices, such as a camera.
  • 9.  Data analysts, software, and artificial intelligence (AI) all work to transform raw data into processed data.  They start by organizing and cleaning the raw data. One of the most important parts of this process is removing outliers and duplicates within the data set.  The next step is an initial analysis that may involve data manipulation. Especially if analysts are analyzing raw data based on human responses to a question, they will look closely at those responses and determine if respondents inaccurately replied to the question in a way that will change the results.
  • 10.  Raw data serves several purposes, particularly in businesses where full data visibility is key to statistical and predictive analytics. Here are a few reasons why businesses heavily rely on raw data sources:  Raw data is the starting phase of all data and the initial source of data-based decisions. You can’t make visually compelling charts or overarching analytical statements about processed data until we’ve worked through all of the raw data.  We can trust the integrity of raw data. We don’t have to worry that something has been removed or adjusted, because the format has not yet been manipulated by humans or machines.  AI and machine learning methods can only analyze data in a raw format. Once the data has been processed, it is illegible to these technologies.  Raw data gives you a backup resource. We can check our work and go back to the source after processing and manipulating our data sets. It’s all there for your reference if we run into a problem and need a new analysis.
  • 11. All data inside a computer is transmitted as a series of electrical signals that are either on or off. Therefore, in order for a computer to be able to process any kind of data, including text, images and sound, they must be converted into binary form.
  • 12.  Data Representation Types of data: – Numbers – Text – Images – Audio & Video  Text  When any key on a keyboard is pressed, it needs to be converted into a binary number so that it can be processed by the computer and the typed character can appear on the screen.  A code where each number represents a character can be used to convert text into binary. One code we can use for this is called ASCII. The ASCII code takes each character on the keyboard and assigns it a binary number.
  • 13.  Images also need to be converted into binary in order for a computer to process them so that they can be seen on our screen. Digital images are made up of pixels. Each pixel in an image is made up of binary numbers.  If we say that 1 is black (or on) and 0 is white (or off), then a simple black and white picture can be created using binary.
  • 14.  The terms audio and video commonly refers to the time-based media storage format for sound/music and moving pictures information. Audio and video digital recording, also referred as audio and video codecs, can be uncompressed, lossless compressed, or lossy compressed depending on the desired quality and use cases.
  • 15.  Connected objects are another source of raw data, which retrieves a large amount of data through their sensors.  The Internet of Things (IoT) contributes to double the size of the digital universe every 2 years, which could be 44,000 billion gigabytes in 2020, 10 times more than in 2013  The connected object thus allows extend the scope of internet allowing any object, machine or living element to transmit information about its environment and eventually be activated remotely.