SlideShare a Scribd company logo
1 of 16
Download to read offline
gabriella.casalino@uniba.it
Semi-Supervised Fuzzy C-Means
for Regression
Gabriella Casalino, Giovanna Castellano, Corrado Mencar
IJCCI 2023 - 15th International Joint Conference on Computational Intelligence
13-15 November 2023
Rome (Italy)
Supervised learning Semi-supervised learning Unsupervised learning
Text analysis E-health Learning analytics
Manufacturing Energy management Cyber security
Classi
fi
cation Regression
Semi-Supervised Fuzzy C-Means
• Semi-supervised version of fuzzy C-Means (FCM)
• Exploits partially labeled data to drive the clustering process
• Minimizes the following objective function:
• Outcomes:
Membership matrix and a set of k centroids
U = [ujk] ck =
∑
N
j=1
u2
jkxj
∑
N
j=1
u2
jk
J =
K
∑
k=1
N
∑
j=1
um
jkd2
jk + α
K
∑
k=1
N
∑
j=1
(ujk − bj fjk)
m
d2
jk
supervised component
unsupervised component
Semi-Supervised Fuzzy C-Means for Regression
• SSFCM-R Semi-Supervised Fuzzy C-Means for Regression
• Regression algorithm based on the classi
fi
cation algorithm SSFCM (Pedrycz
and Waletzky, 1997)
• Labeled prototypes
• Three main stages:
• Pre-processing: discretization and relabeling is applied to the target values
• Clustering: SSFCM
• Post-processing: matching method using the derived label prototypes
SSFCM-R Pre-processing
• Let the set of numerical labels
• The set is discretized into intervals
• For each interval the subset is
computed
• The average value is computed
• New labels:
• The number intervals is a hyperparameter
Y = {y ∈
𝒴
|(x, y) ∈ L}
Y C
[ai, bi], i = 1,2,…, C Yi = Y ∩ [ai, bi]
̂
yi
̂
L = {(x, ̂
y) ∈ ̂
D|y ≠ □ }
C
SSFCM-R Pre-processing
• Three discretization strategies:
• D1: Equal-width discretization, separating all
possible values into bins, each having the same
width;
• D2: Equal-frequency discretization, separating all
possible values into bins, each having the same
amount of observations;
• D3: The intervals are de
fi
ned on the basis of the
centroids produced by K-Means clustering
C
C
SSFCM-R Post-processing
Given a new input , the estimated value is computed according to one
out of two possible strategies:
• max: The closest prototype to is determined and corresponds to the
class label
• sum: The membership degrees of to each cluster are determined by using
SSFCM, the estimated value corresponds to the weighted average
x ∈
𝒳
y
ck x ymax
̂
yik
x
y
ysum =
K
∑
k=1
uk(x) ̂
yik
Experiments - Data
S1 S2 S3
Experimental settings
Eight labeling percentages
Three synthetic data Three bin sizes
Three discretization strategies Two post-processing methods MSE and TIME
Experiments - Results
Experiments - Results
Experiments - Results
Conclusions and future work
• SSFCM-R leverages a discretization mechanism to move from a continuous domain to
a discrete one
• The in
fl
uence of data complexity, discretization strategy, labeling percentage, and
number of bins, on the results, has been studied
• The equal width strategy has been proven to be the more e
ff
ective
• A small number of bins is preferable
• The post-processing method sum achieved lower errors than the max method
• Study di
ff
erent discretization strategies
• The e
ff
ectiveness of the proposed approach will be evaluated on real-world applications
• It will be compared with other semi-supervised regression algorithms
Thanks!
Gabriella Casalino
Computer Science Department
University of Bari, Italy
gabriella.casalino@uniba.it

More Related Content

Similar to IJCCI2023.pdf

Clustering techniques
Clustering techniquesClustering techniques
Clustering techniquestalktoharry
 
IRJET- Handwritten Decimal Image Compression using Deep Stacked Autoencoder
IRJET- Handwritten Decimal Image Compression using Deep Stacked AutoencoderIRJET- Handwritten Decimal Image Compression using Deep Stacked Autoencoder
IRJET- Handwritten Decimal Image Compression using Deep Stacked AutoencoderIRJET Journal
 
Restricting the Flow: Information Bottlenecks for Attribution
Restricting the Flow: Information Bottlenecks for AttributionRestricting the Flow: Information Bottlenecks for Attribution
Restricting the Flow: Information Bottlenecks for Attributiontaeseon ryu
 
Parallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using openclParallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using opencleSAT Publishing House
 
Parallel knn on gpu architecture using opencl
Parallel knn on gpu architecture using openclParallel knn on gpu architecture using opencl
Parallel knn on gpu architecture using opencleSAT Journals
 
IRJET- Different Data Mining Techniques for Weather Prediction
IRJET-  	  Different Data Mining Techniques for Weather PredictionIRJET-  	  Different Data Mining Techniques for Weather Prediction
IRJET- Different Data Mining Techniques for Weather PredictionIRJET Journal
 
Novel algorithms for Knowledge discovery from neural networks in Classificat...
Novel algorithms for  Knowledge discovery from neural networks in Classificat...Novel algorithms for  Knowledge discovery from neural networks in Classificat...
Novel algorithms for Knowledge discovery from neural networks in Classificat...Dr.(Mrs).Gethsiyal Augasta
 
Unsupervised Learning Clustering KMean and Hirarchical.pptx
Unsupervised Learning Clustering KMean and Hirarchical.pptxUnsupervised Learning Clustering KMean and Hirarchical.pptx
Unsupervised Learning Clustering KMean and Hirarchical.pptxFaridAliMousa1
 
Clustering on database systems rkm
Clustering on database systems rkmClustering on database systems rkm
Clustering on database systems rkmVahid Mirjalili
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETScsandit
 
Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data
Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large DataExtended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data
Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large DataAM Publications
 
Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImageryRAHUL BHOJWANI
 
New Approach of Preprocessing For Numeral Recognition
New Approach of Preprocessing For Numeral RecognitionNew Approach of Preprocessing For Numeral Recognition
New Approach of Preprocessing For Numeral RecognitionIJERA Editor
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsIJDKP
 
Clustering introduction
Clustering introductionClustering introduction
Clustering introductionYan Xu
 
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...IRJET Journal
 
A Methodology for Automatic GPU Kernel Optimization - NECSTTechTalk 4/06/2020
A Methodology for Automatic GPU Kernel Optimization - NECSTTechTalk 4/06/2020A Methodology for Automatic GPU Kernel Optimization - NECSTTechTalk 4/06/2020
A Methodology for Automatic GPU Kernel Optimization - NECSTTechTalk 4/06/2020NECST Lab @ Politecnico di Milano
 
IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...
IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...
IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...IRJET Journal
 

Similar to IJCCI2023.pdf (20)

Clustering techniques
Clustering techniquesClustering techniques
Clustering techniques
 
IRJET- Handwritten Decimal Image Compression using Deep Stacked Autoencoder
IRJET- Handwritten Decimal Image Compression using Deep Stacked AutoencoderIRJET- Handwritten Decimal Image Compression using Deep Stacked Autoencoder
IRJET- Handwritten Decimal Image Compression using Deep Stacked Autoencoder
 
Restricting the Flow: Information Bottlenecks for Attribution
Restricting the Flow: Information Bottlenecks for AttributionRestricting the Flow: Information Bottlenecks for Attribution
Restricting the Flow: Information Bottlenecks for Attribution
 
Parallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using openclParallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using opencl
 
Parallel knn on gpu architecture using opencl
Parallel knn on gpu architecture using openclParallel knn on gpu architecture using opencl
Parallel knn on gpu architecture using opencl
 
IRJET- Different Data Mining Techniques for Weather Prediction
IRJET-  	  Different Data Mining Techniques for Weather PredictionIRJET-  	  Different Data Mining Techniques for Weather Prediction
IRJET- Different Data Mining Techniques for Weather Prediction
 
Novel algorithms for Knowledge discovery from neural networks in Classificat...
Novel algorithms for  Knowledge discovery from neural networks in Classificat...Novel algorithms for  Knowledge discovery from neural networks in Classificat...
Novel algorithms for Knowledge discovery from neural networks in Classificat...
 
Unsupervised Learning Clustering KMean and Hirarchical.pptx
Unsupervised Learning Clustering KMean and Hirarchical.pptxUnsupervised Learning Clustering KMean and Hirarchical.pptx
Unsupervised Learning Clustering KMean and Hirarchical.pptx
 
Clustering on database systems rkm
Clustering on database systems rkmClustering on database systems rkm
Clustering on database systems rkm
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
 
Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data
Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large DataExtended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data
Extended Fuzzy C-Means with Random Sampling Techniques for Clustering Large Data
 
Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
 
Unsupervised learning networks
Unsupervised learning networksUnsupervised learning networks
Unsupervised learning networks
 
New Approach of Preprocessing For Numeral Recognition
New Approach of Preprocessing For Numeral RecognitionNew Approach of Preprocessing For Numeral Recognition
New Approach of Preprocessing For Numeral Recognition
 
Experimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithmsExperimental study of Data clustering using k- Means and modified algorithms
Experimental study of Data clustering using k- Means and modified algorithms
 
Clustering introduction
Clustering introductionClustering introduction
Clustering introduction
 
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
 
A Methodology for Automatic GPU Kernel Optimization - NECSTTechTalk 4/06/2020
A Methodology for Automatic GPU Kernel Optimization - NECSTTechTalk 4/06/2020A Methodology for Automatic GPU Kernel Optimization - NECSTTechTalk 4/06/2020
A Methodology for Automatic GPU Kernel Optimization - NECSTTechTalk 4/06/2020
 
IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...
IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...
IMPROVEMENT IN IMAGE DENOISING OF HANDWRITTEN DIGITS USING AUTOENCODERS IN DE...
 
Iiwas19 yamazaki slide
Iiwas19 yamazaki slideIiwas19 yamazaki slide
Iiwas19 yamazaki slide
 

More from Gabriella Casalino

On the use of the Dynamic Incremental Semi-Supervised Fuzzy Clustering Algori...
On the use of the Dynamic Incremental Semi-Supervised Fuzzy Clustering Algori...On the use of the Dynamic Incremental Semi-Supervised Fuzzy Clustering Algori...
On the use of the Dynamic Incremental Semi-Supervised Fuzzy Clustering Algori...Gabriella Casalino
 
A mHealth solution for contact-less self-monitoring of vital sign parameters
A mHealth solution for contact-less self-monitoring of vital sign parametersA mHealth solution for contact-less self-monitoring of vital sign parameters
A mHealth solution for contact-less self-monitoring of vital sign parametersGabriella Casalino
 
Text mining through Non Negative Matrix Factorizations
Text mining through Non Negative Matrix FactorizationsText mining through Non Negative Matrix Factorizations
Text mining through Non Negative Matrix FactorizationsGabriella Casalino
 
Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Epi...
Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Epi...Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Epi...
Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Epi...Gabriella Casalino
 
A mHealth solution for contact-less self-monitoring of vital signs parameters
A mHealth solution for contact-less  self-monitoring of vital signs parametersA mHealth solution for contact-less  self-monitoring of vital signs parameters
A mHealth solution for contact-less self-monitoring of vital signs parametersGabriella Casalino
 
Dynamic Incremental Semi-Supervised Fuzzy Clustering for Data Stream Classifi...
Dynamic Incremental Semi-Supervised Fuzzy Clustering for Data Stream Classifi...Dynamic Incremental Semi-Supervised Fuzzy Clustering for Data Stream Classifi...
Dynamic Incremental Semi-Supervised Fuzzy Clustering for Data Stream Classifi...Gabriella Casalino
 
Incremental and Adaptive fuzzy clustering for Virtual Learning Environments D...
Incremental and Adaptive fuzzy clustering for Virtual Learning Environments D...Incremental and Adaptive fuzzy clustering for Virtual Learning Environments D...
Incremental and Adaptive fuzzy clustering for Virtual Learning Environments D...Gabriella Casalino
 
The use of an Explainable Artificial Intelligence Tool for Decision-making Su...
The use of an Explainable Artificial Intelligence Tool for Decision-making Su...The use of an Explainable Artificial Intelligence Tool for Decision-making Su...
The use of an Explainable Artificial Intelligence Tool for Decision-making Su...Gabriella Casalino
 
Data stream classification by incremental semi-supervised fuzzy clustering
Data stream classification  by incremental  semi-supervised fuzzy clusteringData stream classification  by incremental  semi-supervised fuzzy clustering
Data stream classification by incremental semi-supervised fuzzy clusteringGabriella Casalino
 
Incremental adaptive semi-supervised fuzzy clustering for data stream classif...
Incremental adaptive semi-supervised fuzzy clustering for data stream classif...Incremental adaptive semi-supervised fuzzy clustering for data stream classif...
Incremental adaptive semi-supervised fuzzy clustering for data stream classif...Gabriella Casalino
 
Intelligent data analysis through Nonnegative matrix factorization (NMF): app...
Intelligent data analysis through Nonnegative matrix factorization (NMF): app...Intelligent data analysis through Nonnegative matrix factorization (NMF): app...
Intelligent data analysis through Nonnegative matrix factorization (NMF): app...Gabriella Casalino
 
Non-negative factorization methods for extracting semantically relevant featu...
Non-negative factorization methods for extracting semantically relevant featu...Non-negative factorization methods for extracting semantically relevant featu...
Non-negative factorization methods for extracting semantically relevant featu...Gabriella Casalino
 

More from Gabriella Casalino (15)

On the use of the Dynamic Incremental Semi-Supervised Fuzzy Clustering Algori...
On the use of the Dynamic Incremental Semi-Supervised Fuzzy Clustering Algori...On the use of the Dynamic Incremental Semi-Supervised Fuzzy Clustering Algori...
On the use of the Dynamic Incremental Semi-Supervised Fuzzy Clustering Algori...
 
A mHealth solution for contact-less self-monitoring of vital sign parameters
A mHealth solution for contact-less self-monitoring of vital sign parametersA mHealth solution for contact-less self-monitoring of vital sign parameters
A mHealth solution for contact-less self-monitoring of vital sign parameters
 
Text mining through Non Negative Matrix Factorizations
Text mining through Non Negative Matrix FactorizationsText mining through Non Negative Matrix Factorizations
Text mining through Non Negative Matrix Factorizations
 
Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Epi...
Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Epi...Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Epi...
Dynamic Incremental Semi-supervised Fuzzy Clustering for Bipolar Disorder Epi...
 
A mHealth solution for contact-less self-monitoring of vital signs parameters
A mHealth solution for contact-less  self-monitoring of vital signs parametersA mHealth solution for contact-less  self-monitoring of vital signs parameters
A mHealth solution for contact-less self-monitoring of vital signs parameters
 
Dynamic Incremental Semi-Supervised Fuzzy Clustering for Data Stream Classifi...
Dynamic Incremental Semi-Supervised Fuzzy Clustering for Data Stream Classifi...Dynamic Incremental Semi-Supervised Fuzzy Clustering for Data Stream Classifi...
Dynamic Incremental Semi-Supervised Fuzzy Clustering for Data Stream Classifi...
 
Incremental and Adaptive fuzzy clustering for Virtual Learning Environments D...
Incremental and Adaptive fuzzy clustering for Virtual Learning Environments D...Incremental and Adaptive fuzzy clustering for Virtual Learning Environments D...
Incremental and Adaptive fuzzy clustering for Virtual Learning Environments D...
 
The use of an Explainable Artificial Intelligence Tool for Decision-making Su...
The use of an Explainable Artificial Intelligence Tool for Decision-making Su...The use of an Explainable Artificial Intelligence Tool for Decision-making Su...
The use of an Explainable Artificial Intelligence Tool for Decision-making Su...
 
Data stream classification by incremental semi-supervised fuzzy clustering
Data stream classification  by incremental  semi-supervised fuzzy clusteringData stream classification  by incremental  semi-supervised fuzzy clustering
Data stream classification by incremental semi-supervised fuzzy clustering
 
Incremental adaptive semi-supervised fuzzy clustering for data stream classif...
Incremental adaptive semi-supervised fuzzy clustering for data stream classif...Incremental adaptive semi-supervised fuzzy clustering for data stream classif...
Incremental adaptive semi-supervised fuzzy clustering for data stream classif...
 
Intelligent data analysis through Nonnegative matrix factorization (NMF): app...
Intelligent data analysis through Nonnegative matrix factorization (NMF): app...Intelligent data analysis through Nonnegative matrix factorization (NMF): app...
Intelligent data analysis through Nonnegative matrix factorization (NMF): app...
 
Non-negative factorization methods for extracting semantically relevant featu...
Non-negative factorization methods for extracting semantically relevant featu...Non-negative factorization methods for extracting semantically relevant featu...
Non-negative factorization methods for extracting semantically relevant featu...
 
ICCSA2014 - slides
ICCSA2014 - slidesICCSA2014 - slides
ICCSA2014 - slides
 
Didamatica2012 - slides
Didamatica2012 - slidesDidamatica2012 - slides
Didamatica2012 - slides
 
WILF2011 - slides
WILF2011 - slidesWILF2011 - slides
WILF2011 - slides
 

Recently uploaded

Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 

Recently uploaded (20)

Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 

IJCCI2023.pdf

  • 1. gabriella.casalino@uniba.it Semi-Supervised Fuzzy C-Means for Regression Gabriella Casalino, Giovanna Castellano, Corrado Mencar IJCCI 2023 - 15th International Joint Conference on Computational Intelligence 13-15 November 2023 Rome (Italy)
  • 2. Supervised learning Semi-supervised learning Unsupervised learning
  • 3. Text analysis E-health Learning analytics Manufacturing Energy management Cyber security
  • 5. Semi-Supervised Fuzzy C-Means • Semi-supervised version of fuzzy C-Means (FCM) • Exploits partially labeled data to drive the clustering process • Minimizes the following objective function: • Outcomes: Membership matrix and a set of k centroids U = [ujk] ck = ∑ N j=1 u2 jkxj ∑ N j=1 u2 jk J = K ∑ k=1 N ∑ j=1 um jkd2 jk + α K ∑ k=1 N ∑ j=1 (ujk − bj fjk) m d2 jk supervised component unsupervised component
  • 6. Semi-Supervised Fuzzy C-Means for Regression • SSFCM-R Semi-Supervised Fuzzy C-Means for Regression • Regression algorithm based on the classi fi cation algorithm SSFCM (Pedrycz and Waletzky, 1997) • Labeled prototypes • Three main stages: • Pre-processing: discretization and relabeling is applied to the target values • Clustering: SSFCM • Post-processing: matching method using the derived label prototypes
  • 7. SSFCM-R Pre-processing • Let the set of numerical labels • The set is discretized into intervals • For each interval the subset is computed • The average value is computed • New labels: • The number intervals is a hyperparameter Y = {y ∈ 𝒴 |(x, y) ∈ L} Y C [ai, bi], i = 1,2,…, C Yi = Y ∩ [ai, bi] ̂ yi ̂ L = {(x, ̂ y) ∈ ̂ D|y ≠ □ } C
  • 8. SSFCM-R Pre-processing • Three discretization strategies: • D1: Equal-width discretization, separating all possible values into bins, each having the same width; • D2: Equal-frequency discretization, separating all possible values into bins, each having the same amount of observations; • D3: The intervals are de fi ned on the basis of the centroids produced by K-Means clustering C C
  • 9. SSFCM-R Post-processing Given a new input , the estimated value is computed according to one out of two possible strategies: • max: The closest prototype to is determined and corresponds to the class label • sum: The membership degrees of to each cluster are determined by using SSFCM, the estimated value corresponds to the weighted average x ∈ 𝒳 y ck x ymax ̂ yik x y ysum = K ∑ k=1 uk(x) ̂ yik
  • 11. Experimental settings Eight labeling percentages Three synthetic data Three bin sizes Three discretization strategies Two post-processing methods MSE and TIME
  • 15. Conclusions and future work • SSFCM-R leverages a discretization mechanism to move from a continuous domain to a discrete one • The in fl uence of data complexity, discretization strategy, labeling percentage, and number of bins, on the results, has been studied • The equal width strategy has been proven to be the more e ff ective • A small number of bins is preferable • The post-processing method sum achieved lower errors than the max method • Study di ff erent discretization strategies • The e ff ectiveness of the proposed approach will be evaluated on real-world applications • It will be compared with other semi-supervised regression algorithms
  • 16. Thanks! Gabriella Casalino Computer Science Department University of Bari, Italy gabriella.casalino@uniba.it