SlideShare a Scribd company logo
1 of 15
Exploratory Data
Analysis (EDA)
ZAHID RIAZ HAANS (LECTURER)
KAMYAB INSTIUTE OF MEDICAL SCIENCES KOT ADDU
E-mail. zahidriazhaans50@gmail.com
• Exploratory data analysis (EDA)
involves using graphics and visualizations
to explore and analyze a data set.
The goal is to explore, investigate
and learn, as opposed to confirming
statistical hypotheses.
• The process of using numerical summaries and visualizations
to explore your data and to identify potential relationships
between variables is called exploratory data analysis, or EDA.
• The distribution of variables in your data set. That is, what is
the shape of your data? Is the distribution skewed? Mound-
shaped? Bimodal?
• The relationships between variables.
• Whether or not your data have outliers or unusual points
that may indicate data quality issues or lead to interesting
insights.
• Whether or not your data have patterns over time.
Functions and Techniques
• Clustering and dimension reduction techniques, which help
create graphical displays of high-dimensional data containing
many variables.
• Univariate visualization of each field in the raw dataset, with
summary statistics.
• Bivariate visualizations and summary statistics that allow you
to assess the relationship between each variable in the
dataset and the target variable you’re looking at.
• Multivariate visualizations, for mapping and understanding
interactions between different fields in the data.
TYPES OF EXPLORATORY
DATA ANALYSIS:
• Univariate Non-graphical
• Multivariate Non-graphical
• Univariate graphical
• Multivariate graphical
1. Univariate Non-graphical:
• This is the simplest form of data analysis as during this we use
just one variable to research the info.
• The standard goal of univariate non-graphical EDA is to know
the underlying sample distribution/ data and make
observations about the population.
• Outlier detection is additionally part of the analysis. The
characteristics of population distribution include:
Types
• Central tendency: The central tendency or location of
distribution has got to do with typical or middle values. The
commonly useful measures of central tendency are statistics
called mean, median, and sometimes mode during
• Spread: Spread is an indicator of what proportion distant
from the middle we are to seek out the find the info values.
the quality deviation and variance are two useful measures of
spread.
• Skewness and kurtosis: Two more useful univariates
descriptors are the skewness and kurtosis of the distribution.
2. Multivariate Non-
graphical:
• Multivariate non-graphical EDA technique is usually wont to
show the connection between two or more variables within
the sort of either cross-tabulation or statistics.
• For categorical data, an extension of tabulation called cross-
tabulation is extremely useful. For 2 variables, cross-tabulation
is preferred by making a two-way table with column headings
that match the amount of one-variable and row headings that
match the amount of the opposite two variables, then filling
the counts with all subjects that share an equivalent pair of
levels.
3. Univariate graphical:
• Non-graphical methods are quantitative and objective, they
are not able to give the complete picture of the data;
therefore, graphical methods are used more as they involve a
degree of subjective analysis, also are required. Common sorts
of univariate graphics are:
• Histogram
• Stem-and-leaf plots
• Boxplots
• Quantile-normal plots
4 Multivariate graphical:
• Multivariate graphical data uses graphics to display
relationships between two or more sets of knowledge. The
sole one used commonly may be a grouped barplot with each
group representing one level of 1 of the variables
• Scatterplot:
• Run chart: It’s a line graph of data plotted over time.
• Heat map: It’s a graphical representation of data where
values are depicted by color.
• Multivariate chart: It’s a graphical representation of the
relationships between factors and response.
• Bubble chart: It’s a data visualization that displays multiple
circles (bubbles) in two-dimensional plot.
TOOLS REQUIRED FOR
EXPLORATORY DATA ANALYSIS:
• 1. R: An open-source programming language and free software
environment for statistical computing and graphics supported by the R
foundation for statistical computing. The R language is widely used
among statisticians in developing statistical observations and data
analysis.
• 2. Python: An interpreted, object-oriented programming language with
dynamic semantics. Its high level, built-in data structures, combined
with dynamic binding, make it very attractive for rapid application
development, also as to be used as a scripting or glue language to attach
existing components together. Python and EDA are often used together
to spot missing values in the data set, which is vital so you’ll decide the
way to handle missing values for machine learning.
• Apart from these functions described above, EDA can also:
• Perform k-means clustering: Perform k-means clustering: it’s an
unsupervised learning algorithm where the info points are assigned to
clusters, also referred to as k-groups, k-means clustering is usually
utilized in market segmentation, image compression, and pattern
recognition
Exploratory Data Analysis (EDA) .pptx
Exploratory Data Analysis (EDA) .pptx

More Related Content

Similar to Exploratory Data Analysis (EDA) .pptx

Making abstract data visible
Making abstract data visibleMaking abstract data visible
Making abstract data visiblePriyanshi Jain
 
Exploring Data (1).pptx
Exploring Data (1).pptxExploring Data (1).pptx
Exploring Data (1).pptxgina458018
 
Chapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index NumbersChapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index NumbersInternational advisers
 
Unit 4 editing and coding (2)
Unit 4 editing and coding (2)Unit 4 editing and coding (2)
Unit 4 editing and coding (2)kalailakshmi
 
Data mining techniques unit 2
Data mining techniques unit 2Data mining techniques unit 2
Data mining techniques unit 2malathieswaran29
 
fundamentals of data science and analytics on descriptive analysis.pptx
fundamentals of data science and analytics on descriptive analysis.pptxfundamentals of data science and analytics on descriptive analysis.pptx
fundamentals of data science and analytics on descriptive analysis.pptxkumaragurusv
 
Visual Analytics in Big Data
Visual Analytics in Big DataVisual Analytics in Big Data
Visual Analytics in Big DataSaurabh Shanbhag
 
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdfGraphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdfHimakshi7
 
CSUN 2020 Accessible Visualizations: Maps, Annotations, and Spark lines
CSUN 2020 Accessible Visualizations: Maps, Annotations, and Spark linesCSUN 2020 Accessible Visualizations: Maps, Annotations, and Spark lines
CSUN 2020 Accessible Visualizations: Maps, Annotations, and Spark linesTed Gies
 
Quality Tools & Techniques Presentation.pptx
Quality Tools & Techniques Presentation.pptxQuality Tools & Techniques Presentation.pptx
Quality Tools & Techniques Presentation.pptxSAJIDAli83655
 
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptxEXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptxrakeshreghu98
 
ch 2 Tools of Research.docx
ch 2 Tools of Research.docxch 2 Tools of Research.docx
ch 2 Tools of Research.docxssuserf200491
 

Similar to Exploratory Data Analysis (EDA) .pptx (20)

Data Science 1.pdf
Data Science 1.pdfData Science 1.pdf
Data Science 1.pdf
 
Data Visualization.pptx
Data Visualization.pptxData Visualization.pptx
Data Visualization.pptx
 
Making abstract data visible
Making abstract data visibleMaking abstract data visible
Making abstract data visible
 
Exploring Data (1).pptx
Exploring Data (1).pptxExploring Data (1).pptx
Exploring Data (1).pptx
 
Lec 3.pptx
Lec 3.pptxLec 3.pptx
Lec 3.pptx
 
Chapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index NumbersChapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index Numbers
 
Unit 4 editing and coding (2)
Unit 4 editing and coding (2)Unit 4 editing and coding (2)
Unit 4 editing and coding (2)
 
Data mining techniques unit 2
Data mining techniques unit 2Data mining techniques unit 2
Data mining techniques unit 2
 
Unit III.pptx
Unit III.pptxUnit III.pptx
Unit III.pptx
 
fundamentals of data science and analytics on descriptive analysis.pptx
fundamentals of data science and analytics on descriptive analysis.pptxfundamentals of data science and analytics on descriptive analysis.pptx
fundamentals of data science and analytics on descriptive analysis.pptx
 
Visual Analytics in Big Data
Visual Analytics in Big DataVisual Analytics in Big Data
Visual Analytics in Big Data
 
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdfGraphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
Graphicalrepresntationofdatausingstatisticaltools2019_210902_105156.pdf
 
EDA.pptx
EDA.pptxEDA.pptx
EDA.pptx
 
CSUN 2020 Accessible Visualizations: Maps, Annotations, and Spark lines
CSUN 2020 Accessible Visualizations: Maps, Annotations, and Spark linesCSUN 2020 Accessible Visualizations: Maps, Annotations, and Spark lines
CSUN 2020 Accessible Visualizations: Maps, Annotations, and Spark lines
 
Introduction to Descriptive Statistics
Introduction to Descriptive StatisticsIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics
 
Quality Tools & Techniques Presentation.pptx
Quality Tools & Techniques Presentation.pptxQuality Tools & Techniques Presentation.pptx
Quality Tools & Techniques Presentation.pptx
 
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptxEXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
EXPLORATORY DATA ANALYSIS IN STATISTICAL MODeLING.pptx
 
Quantitative research
Quantitative researchQuantitative research
Quantitative research
 
Data Visulalization
Data VisulalizationData Visulalization
Data Visulalization
 
ch 2 Tools of Research.docx
ch 2 Tools of Research.docxch 2 Tools of Research.docx
ch 2 Tools of Research.docx
 

Recently uploaded

Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 

Recently uploaded (20)

Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 

Exploratory Data Analysis (EDA) .pptx

  • 1. Exploratory Data Analysis (EDA) ZAHID RIAZ HAANS (LECTURER) KAMYAB INSTIUTE OF MEDICAL SCIENCES KOT ADDU E-mail. zahidriazhaans50@gmail.com
  • 2. • Exploratory data analysis (EDA) involves using graphics and visualizations to explore and analyze a data set. The goal is to explore, investigate and learn, as opposed to confirming statistical hypotheses. • The process of using numerical summaries and visualizations to explore your data and to identify potential relationships between variables is called exploratory data analysis, or EDA.
  • 3. • The distribution of variables in your data set. That is, what is the shape of your data? Is the distribution skewed? Mound- shaped? Bimodal? • The relationships between variables. • Whether or not your data have outliers or unusual points that may indicate data quality issues or lead to interesting insights. • Whether or not your data have patterns over time.
  • 4. Functions and Techniques • Clustering and dimension reduction techniques, which help create graphical displays of high-dimensional data containing many variables. • Univariate visualization of each field in the raw dataset, with summary statistics. • Bivariate visualizations and summary statistics that allow you to assess the relationship between each variable in the dataset and the target variable you’re looking at. • Multivariate visualizations, for mapping and understanding interactions between different fields in the data.
  • 5.
  • 6.
  • 7. TYPES OF EXPLORATORY DATA ANALYSIS: • Univariate Non-graphical • Multivariate Non-graphical • Univariate graphical • Multivariate graphical
  • 8. 1. Univariate Non-graphical: • This is the simplest form of data analysis as during this we use just one variable to research the info. • The standard goal of univariate non-graphical EDA is to know the underlying sample distribution/ data and make observations about the population. • Outlier detection is additionally part of the analysis. The characteristics of population distribution include:
  • 9. Types • Central tendency: The central tendency or location of distribution has got to do with typical or middle values. The commonly useful measures of central tendency are statistics called mean, median, and sometimes mode during • Spread: Spread is an indicator of what proportion distant from the middle we are to seek out the find the info values. the quality deviation and variance are two useful measures of spread. • Skewness and kurtosis: Two more useful univariates descriptors are the skewness and kurtosis of the distribution.
  • 10. 2. Multivariate Non- graphical: • Multivariate non-graphical EDA technique is usually wont to show the connection between two or more variables within the sort of either cross-tabulation or statistics. • For categorical data, an extension of tabulation called cross- tabulation is extremely useful. For 2 variables, cross-tabulation is preferred by making a two-way table with column headings that match the amount of one-variable and row headings that match the amount of the opposite two variables, then filling the counts with all subjects that share an equivalent pair of levels.
  • 11. 3. Univariate graphical: • Non-graphical methods are quantitative and objective, they are not able to give the complete picture of the data; therefore, graphical methods are used more as they involve a degree of subjective analysis, also are required. Common sorts of univariate graphics are: • Histogram • Stem-and-leaf plots • Boxplots • Quantile-normal plots
  • 12. 4 Multivariate graphical: • Multivariate graphical data uses graphics to display relationships between two or more sets of knowledge. The sole one used commonly may be a grouped barplot with each group representing one level of 1 of the variables • Scatterplot: • Run chart: It’s a line graph of data plotted over time. • Heat map: It’s a graphical representation of data where values are depicted by color. • Multivariate chart: It’s a graphical representation of the relationships between factors and response. • Bubble chart: It’s a data visualization that displays multiple circles (bubbles) in two-dimensional plot.
  • 13. TOOLS REQUIRED FOR EXPLORATORY DATA ANALYSIS: • 1. R: An open-source programming language and free software environment for statistical computing and graphics supported by the R foundation for statistical computing. The R language is widely used among statisticians in developing statistical observations and data analysis. • 2. Python: An interpreted, object-oriented programming language with dynamic semantics. Its high level, built-in data structures, combined with dynamic binding, make it very attractive for rapid application development, also as to be used as a scripting or glue language to attach existing components together. Python and EDA are often used together to spot missing values in the data set, which is vital so you’ll decide the way to handle missing values for machine learning. • Apart from these functions described above, EDA can also: • Perform k-means clustering: Perform k-means clustering: it’s an unsupervised learning algorithm where the info points are assigned to clusters, also referred to as k-groups, k-means clustering is usually utilized in market segmentation, image compression, and pattern recognition