SlideShare a Scribd company logo
An Weighting Dissimilarity Function of CLARANS for
Clustering Spatial Data Education in Java
ICHWANUL MUSLIM KARO KARO, S.KOM
What is Spatial Data ?
What’s special about spatial data mining
The 1854 Asiatic Cholera in London
Data in Spatial data mining
 Non spatial information
 Same like data in traditional data mining
 Catagorical, Numeric, Boolean, etc
 E.g postal code, city name, number of victim natural disaster
Spatial Information
Spatial Attribute
 Neigborhood
 Location: longitude, latitude
Representation of spatial data
Raster: gridded and space
Vector: point, line, polygon
Relationships on Data in Spatial Data
Mining
Relationships on non-spatial data
• Explicit
• Arithmetic, ranking(ordering), etc.
Relationships on Spatial Data
• Many are implicit
• Relationship Categories
◦ Set-oriented: union, intersection, and membership, etc
◦ Topological: meet, within, overlap, etc
◦ Directional: North, NE, left, above, behind, etc
◦ Metric: e.g., Euclidean: distance, area, perimeter
Approach to solve spatial attribute
☼
MV-approximation
Calculating the
Exact Separation
Distance
IR Approximation
PDF
Polygon Dissimilarity Function
Given P ={𝑃1, 𝑃2, … 𝑃𝑛}, where P is set of Polygon
Non spatial attribute of polygon are all of non spatial attribute that independent
of the spatial polygon, average income, number of damaged house because
natural disaster
Spatial attribute of polygon divide into two catagories, intrinsic and extrinsic.
Attribute intrinsic of polygon describe polygon geometric characteristic, like
location, shape, area
Attribute extrinsic of polygon include various spatial object that may exist in
polygon, there are three object spatial, point, line and area
PDF between two polygon
𝐷 𝑃𝐷𝐹 = 𝑤 𝑛,𝑠, 𝑑 𝑛,𝑠 𝑃𝑖, 𝑃𝑗 + 𝑤𝑠 𝑑 𝑠 𝑃𝑖, 𝑃𝑗
𝑤 𝑛,𝑠 + 𝑤𝑠 = 1
Distance between non spatial attribute can solve by euclide or manhattan distance.
Distance between spatial attribute
𝑑 𝑠 = 𝑤𝑖𝑛𝑠 𝑑𝑖𝑛𝑠 𝑃𝑖, 𝑃𝑗 + 𝑤 𝑒𝑘𝑠 𝑑 𝑒𝑘𝑠 𝑃𝑖, 𝑃𝑗
𝑤𝑖𝑛𝑠 + 𝑤 𝑒𝑘𝑠 = 1
Design Process
Data
Start
Seperation
Attribute
Handling Spatial
Attribute
Handling non
Spatial Attribute
Similarity
Clustering
Evaluation
end
Modify Dissimilarity
𝐷 𝑎,𝑏 = 𝑤 𝑛,𝑠, 𝑑 𝑛,𝑠 𝑃𝑎, 𝑃𝑏 + 𝑤𝑠 𝑑 𝑠 𝑃𝑎, 𝑃𝑏
𝐷 𝑃𝐷𝐹 = 𝑤 𝑛,𝑠, 𝑑 𝑛,𝑠 𝑃𝑖, 𝑃𝑗 + 𝑤𝑠 𝑑 𝑠 𝑃𝑖, 𝑃𝑗
Without consider intrinsic attribute and extrinsic attribute
CLARANS
Clarans Algorithm
 Input parameters numlocal and maxneighbor. Initi- alize i to 1, and
mincost to a large number.
 Set current to an arbitrary node in Gn,k.
 Set j to 1.
 Consider a random neighbor S of current, and based on 5, calculate
the cost differential of the two nodes.
 If S has a lower cost, set current to S, and go to Step 3.
 Otherwise, increment j by 1. If j ≤ maxneighbor,go to Step 4.
 Otherwise, when j > maxneighbor, compare the cost of current with
mincost. If the former is less than mincost, set mincost to the cost of
current and set bestnode to current.
 Increment i by 1. If i > numlocal, output bestnode and halt.
Otherwise, go to Step 2.
Data
Data Ratio of student and Class in Java
Island
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Silhoutteindex
Weigth
Comparation silhouette index
K=2 K=3
0
20
40
60
80
100
120
140
160
180
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Times(s)
weigth
Comparation Runing Time
K=2 K=3
Weigth
Spatial
Silhouette
index
Running
time (s)
0,1 0.6909 128.828
0,2 0.5168 129.218
0,3 0.5940 129.028
0,4 0.5227 128.525
0,5 0.2783 130.221
0,6 0.4123 130.597
0,7 0.3740 131.458
0,8 0.4874 130.918
0,9 0.5321 135.227
0
50,000
100,000
150,000
200,000
250,000
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
runningtime(s)
number of cluster
Influent number of cluster to computational time
Conclusion
Clarans algorithm can combine with weigting dissimilarity function for solve spatial data
clustering. Weigting dissimilarity function more effeciecy than traditional dissimilarity to solve
spatial relationship. Increasing number of cluster will be followed by increasing computational
time but not efficiency of cluster. Weigthing spatial very influent to solve spatial data clustering.

More Related Content

What's hot

Shortest path problem
Shortest path problemShortest path problem
Shortest path problem
Ifra Ilyas
 
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RFinding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Revolution Analytics
 
Transfer learningforclp
Transfer learningforclpTransfer learningforclp
Transfer learningforclp
Danushka Bollegala
 
Integration
IntegrationIntegration
Integration
Chhitiz Shrestha
 
Dijkstra's Algorithm
Dijkstra's AlgorithmDijkstra's Algorithm
Dijkstra's Algorithm
ArijitDhali
 
3D Graphics : Computer Graphics Fundamentals
3D Graphics : Computer Graphics Fundamentals3D Graphics : Computer Graphics Fundamentals
3D Graphics : Computer Graphics Fundamentals
Muhammed Afsal Villan
 
d
dd
Wind/ Solar Power Forecasting
Wind/ Solar Power Forecasting  Wind/ Solar Power Forecasting
Wind/ Solar Power Forecasting
Das A. K.
 
Shortest path algorithm
Shortest  path algorithmShortest  path algorithm
Shortest path algorithm
Subrata Kumer Paul
 
Lesson 25: The Definite Integral
Lesson 25: The Definite IntegralLesson 25: The Definite Integral
Lesson 25: The Definite Integral
Matthew Leingang
 
Dijkstra & flooding ppt(Routing algorithm)
Dijkstra & flooding ppt(Routing algorithm)Dijkstra & flooding ppt(Routing algorithm)
Dijkstra & flooding ppt(Routing algorithm)
Anshul gour
 
ONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHS
ONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHSONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHS
ONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHS
Nexgen Technology
 
3 d transformation
3 d transformation3 d transformation
3 d transformation
Pooja Dixit
 
Probabilistic Graph Layout for Uncertain Network Visualization
Probabilistic Graph Layout for Uncertain Network VisualizationProbabilistic Graph Layout for Uncertain Network Visualization
Probabilistic Graph Layout for Uncertain Network Visualization
Subhashis Hazarika
 
Spatial Transformation
Spatial TransformationSpatial Transformation
Spatial Transformation
Ehsan Hamzei
 
Suft
SuftSuft
On Skyline Groups
On Skyline GroupsOn Skyline Groups
Gram-Schmidt and QR Decomposition (Factorization) of Matrices
Gram-Schmidt and QR Decomposition (Factorization) of MatricesGram-Schmidt and QR Decomposition (Factorization) of Matrices
Gram-Schmidt and QR Decomposition (Factorization) of Matrices
Isaac Yowetu
 

What's hot (18)

Shortest path problem
Shortest path problemShortest path problem
Shortest path problem
 
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RFinding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
 
Transfer learningforclp
Transfer learningforclpTransfer learningforclp
Transfer learningforclp
 
Integration
IntegrationIntegration
Integration
 
Dijkstra's Algorithm
Dijkstra's AlgorithmDijkstra's Algorithm
Dijkstra's Algorithm
 
3D Graphics : Computer Graphics Fundamentals
3D Graphics : Computer Graphics Fundamentals3D Graphics : Computer Graphics Fundamentals
3D Graphics : Computer Graphics Fundamentals
 
d
dd
d
 
Wind/ Solar Power Forecasting
Wind/ Solar Power Forecasting  Wind/ Solar Power Forecasting
Wind/ Solar Power Forecasting
 
Shortest path algorithm
Shortest  path algorithmShortest  path algorithm
Shortest path algorithm
 
Lesson 25: The Definite Integral
Lesson 25: The Definite IntegralLesson 25: The Definite Integral
Lesson 25: The Definite Integral
 
Dijkstra & flooding ppt(Routing algorithm)
Dijkstra & flooding ppt(Routing algorithm)Dijkstra & flooding ppt(Routing algorithm)
Dijkstra & flooding ppt(Routing algorithm)
 
ONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHS
ONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHSONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHS
ONLINE SUBGRAPH SKYLINE ANALYSIS OVER KNOWLEDGE GRAPHS
 
3 d transformation
3 d transformation3 d transformation
3 d transformation
 
Probabilistic Graph Layout for Uncertain Network Visualization
Probabilistic Graph Layout for Uncertain Network VisualizationProbabilistic Graph Layout for Uncertain Network Visualization
Probabilistic Graph Layout for Uncertain Network Visualization
 
Spatial Transformation
Spatial TransformationSpatial Transformation
Spatial Transformation
 
Suft
SuftSuft
Suft
 
On Skyline Groups
On Skyline GroupsOn Skyline Groups
On Skyline Groups
 
Gram-Schmidt and QR Decomposition (Factorization) of Matrices
Gram-Schmidt and QR Decomposition (Factorization) of MatricesGram-Schmidt and QR Decomposition (Factorization) of Matrices
Gram-Schmidt and QR Decomposition (Factorization) of Matrices
 

Similar to An Weighting Dissimilarity Function of CLARANS for Clustering Spatial Data Education in Java

Data Mining Lecture_5.pptx
Data Mining Lecture_5.pptxData Mining Lecture_5.pptx
Data Mining Lecture_5.pptx
Subrata Kumer Paul
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud data
Tony Fast
 
Machine Learning 1
Machine Learning 1Machine Learning 1
Machine Learning 1
cairo university
 
KNN CLASSIFIER, INTRODUCTION TO K-NEAREST NEIGHBOR ALGORITHM.pptx
KNN CLASSIFIER, INTRODUCTION TO K-NEAREST NEIGHBOR ALGORITHM.pptxKNN CLASSIFIER, INTRODUCTION TO K-NEAREST NEIGHBOR ALGORITHM.pptx
KNN CLASSIFIER, INTRODUCTION TO K-NEAREST NEIGHBOR ALGORITHM.pptx
Nishant83346
 
[PPT]
[PPT][PPT]
[PPT]
butest
 
Data For Datamining
Data For DataminingData For Datamining
Data For Datamining
Datamining Tools
 
Data For Datamining
Data For DataminingData For Datamining
Data For Datamining
DataminingTools Inc
 
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
The Statistical and Applied Mathematical Sciences Institute
 
SPATIAL POINT PATTERNS
SPATIAL POINT PATTERNSSPATIAL POINT PATTERNS
SPATIAL POINT PATTERNS
LiemNguyenDuy
 
Magellan FOSS4G Talk, Boston 2017
Magellan FOSS4G Talk, Boston 2017Magellan FOSS4G Talk, Boston 2017
Magellan FOSS4G Talk, Boston 2017
Ram Sriharsha
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
Albert Y. C. Chen
 
How Does Math Matter in Data Science
How Does Math Matter in Data ScienceHow Does Math Matter in Data Science
How Does Math Matter in Data Science
Mutia Ulfi
 
Google BigQuery is a very popular enterprise warehouse that’s built with a co...
Google BigQuery is a very popular enterprise warehouse that’s built with a co...Google BigQuery is a very popular enterprise warehouse that’s built with a co...
Google BigQuery is a very popular enterprise warehouse that’s built with a co...
Abebe Admasu
 
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
Daiki Tanaka
 
GAN(と強化学習との関係)
GAN(と強化学習との関係)GAN(と強化学習との関係)
GAN(と強化学習との関係)
Masahiro Suzuki
 
Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)
Hwa Pyung Kim
 
Clique and sting
Clique and stingClique and sting
Clique and sting
Subramanyam Natarajan
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
Jason Rodrigues
 
Raster data analysis
Raster data analysisRaster data analysis
Raster data analysis
Abdul Raziq
 
Python for data science
Python for data sciencePython for data science
Python for data science
botsplash.com
 

Similar to An Weighting Dissimilarity Function of CLARANS for Clustering Spatial Data Education in Java (20)

Data Mining Lecture_5.pptx
Data Mining Lecture_5.pptxData Mining Lecture_5.pptx
Data Mining Lecture_5.pptx
 
Spatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud dataSpatially resolved pair correlation functions for point cloud data
Spatially resolved pair correlation functions for point cloud data
 
Machine Learning 1
Machine Learning 1Machine Learning 1
Machine Learning 1
 
KNN CLASSIFIER, INTRODUCTION TO K-NEAREST NEIGHBOR ALGORITHM.pptx
KNN CLASSIFIER, INTRODUCTION TO K-NEAREST NEIGHBOR ALGORITHM.pptxKNN CLASSIFIER, INTRODUCTION TO K-NEAREST NEIGHBOR ALGORITHM.pptx
KNN CLASSIFIER, INTRODUCTION TO K-NEAREST NEIGHBOR ALGORITHM.pptx
 
[PPT]
[PPT][PPT]
[PPT]
 
Data For Datamining
Data For DataminingData For Datamining
Data For Datamining
 
Data For Datamining
Data For DataminingData For Datamining
Data For Datamining
 
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
 
SPATIAL POINT PATTERNS
SPATIAL POINT PATTERNSSPATIAL POINT PATTERNS
SPATIAL POINT PATTERNS
 
Magellan FOSS4G Talk, Boston 2017
Magellan FOSS4G Talk, Boston 2017Magellan FOSS4G Talk, Boston 2017
Magellan FOSS4G Talk, Boston 2017
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
 
How Does Math Matter in Data Science
How Does Math Matter in Data ScienceHow Does Math Matter in Data Science
How Does Math Matter in Data Science
 
Google BigQuery is a very popular enterprise warehouse that’s built with a co...
Google BigQuery is a very popular enterprise warehouse that’s built with a co...Google BigQuery is a very popular enterprise warehouse that’s built with a co...
Google BigQuery is a very popular enterprise warehouse that’s built with a co...
 
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
 
GAN(と強化学習との関係)
GAN(と強化学習との関係)GAN(と強化学習との関係)
GAN(と強化学習との関係)
 
Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)Tutorial on Object Detection (Faster R-CNN)
Tutorial on Object Detection (Faster R-CNN)
 
Clique and sting
Clique and stingClique and sting
Clique and sting
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Raster data analysis
Raster data analysisRaster data analysis
Raster data analysis
 
Python for data science
Python for data sciencePython for data science
Python for data science
 

Recently uploaded

Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
Katrina Pritchard
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
Dr. Shivangi Singh Parihar
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
TechSoup
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
taiba qazi
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
TechSoup
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
PECB
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Akanksha trivedi rama nursing college kanpur.
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
adhitya5119
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
WaniBasim
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
AyyanKhan40
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
IreneSebastianRueco1
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
Celine George
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
National Information Standards Organization (NISO)
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
Celine George
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
adhitya5119
 

Recently uploaded (20)

Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat  Leveraging AI for Diversity, Equity, and InclusionExecutive Directors Chat  Leveraging AI for Diversity, Equity, and Inclusion
Executive Directors Chat Leveraging AI for Diversity, Equity, and Inclusion
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
 
Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
 
Liberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdfLiberal Approach to the Study of Indian Politics.pdf
Liberal Approach to the Study of Indian Politics.pdf
 
PIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf IslamabadPIMS Job Advertisement 2024.pdf Islamabad
PIMS Job Advertisement 2024.pdf Islamabad
 
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
RPMS TEMPLATE FOR SCHOOL YEAR 2023-2024 FOR TEACHER 1 TO TEACHER 3
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
 

An Weighting Dissimilarity Function of CLARANS for Clustering Spatial Data Education in Java

  • 1. An Weighting Dissimilarity Function of CLARANS for Clustering Spatial Data Education in Java ICHWANUL MUSLIM KARO KARO, S.KOM
  • 3. What’s special about spatial data mining The 1854 Asiatic Cholera in London
  • 4.
  • 5. Data in Spatial data mining  Non spatial information  Same like data in traditional data mining  Catagorical, Numeric, Boolean, etc  E.g postal code, city name, number of victim natural disaster Spatial Information Spatial Attribute  Neigborhood  Location: longitude, latitude
  • 6. Representation of spatial data Raster: gridded and space Vector: point, line, polygon
  • 7. Relationships on Data in Spatial Data Mining Relationships on non-spatial data • Explicit • Arithmetic, ranking(ordering), etc. Relationships on Spatial Data • Many are implicit • Relationship Categories ◦ Set-oriented: union, intersection, and membership, etc ◦ Topological: meet, within, overlap, etc ◦ Directional: North, NE, left, above, behind, etc ◦ Metric: e.g., Euclidean: distance, area, perimeter
  • 8. Approach to solve spatial attribute ☼ MV-approximation Calculating the Exact Separation Distance IR Approximation PDF
  • 9. Polygon Dissimilarity Function Given P ={𝑃1, 𝑃2, … 𝑃𝑛}, where P is set of Polygon Non spatial attribute of polygon are all of non spatial attribute that independent of the spatial polygon, average income, number of damaged house because natural disaster Spatial attribute of polygon divide into two catagories, intrinsic and extrinsic. Attribute intrinsic of polygon describe polygon geometric characteristic, like location, shape, area Attribute extrinsic of polygon include various spatial object that may exist in polygon, there are three object spatial, point, line and area
  • 10. PDF between two polygon 𝐷 𝑃𝐷𝐹 = 𝑤 𝑛,𝑠, 𝑑 𝑛,𝑠 𝑃𝑖, 𝑃𝑗 + 𝑤𝑠 𝑑 𝑠 𝑃𝑖, 𝑃𝑗 𝑤 𝑛,𝑠 + 𝑤𝑠 = 1 Distance between non spatial attribute can solve by euclide or manhattan distance. Distance between spatial attribute 𝑑 𝑠 = 𝑤𝑖𝑛𝑠 𝑑𝑖𝑛𝑠 𝑃𝑖, 𝑃𝑗 + 𝑤 𝑒𝑘𝑠 𝑑 𝑒𝑘𝑠 𝑃𝑖, 𝑃𝑗 𝑤𝑖𝑛𝑠 + 𝑤 𝑒𝑘𝑠 = 1
  • 11. Design Process Data Start Seperation Attribute Handling Spatial Attribute Handling non Spatial Attribute Similarity Clustering Evaluation end
  • 12. Modify Dissimilarity 𝐷 𝑎,𝑏 = 𝑤 𝑛,𝑠, 𝑑 𝑛,𝑠 𝑃𝑎, 𝑃𝑏 + 𝑤𝑠 𝑑 𝑠 𝑃𝑎, 𝑃𝑏 𝐷 𝑃𝐷𝐹 = 𝑤 𝑛,𝑠, 𝑑 𝑛,𝑠 𝑃𝑖, 𝑃𝑗 + 𝑤𝑠 𝑑 𝑠 𝑃𝑖, 𝑃𝑗 Without consider intrinsic attribute and extrinsic attribute
  • 13. CLARANS Clarans Algorithm  Input parameters numlocal and maxneighbor. Initi- alize i to 1, and mincost to a large number.  Set current to an arbitrary node in Gn,k.  Set j to 1.  Consider a random neighbor S of current, and based on 5, calculate the cost differential of the two nodes.  If S has a lower cost, set current to S, and go to Step 3.  Otherwise, increment j by 1. If j ≤ maxneighbor,go to Step 4.  Otherwise, when j > maxneighbor, compare the cost of current with mincost. If the former is less than mincost, set mincost to the cost of current and set bestnode to current.  Increment i by 1. If i > numlocal, output bestnode and halt. Otherwise, go to Step 2.
  • 14. Data Data Ratio of student and Class in Java Island
  • 15. 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Silhoutteindex Weigth Comparation silhouette index K=2 K=3 0 20 40 60 80 100 120 140 160 180 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Times(s) weigth Comparation Runing Time K=2 K=3 Weigth Spatial Silhouette index Running time (s) 0,1 0.6909 128.828 0,2 0.5168 129.218 0,3 0.5940 129.028 0,4 0.5227 128.525 0,5 0.2783 130.221 0,6 0.4123 130.597 0,7 0.3740 131.458 0,8 0.4874 130.918 0,9 0.5321 135.227
  • 16. 0 50,000 100,000 150,000 200,000 250,000 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 runningtime(s) number of cluster Influent number of cluster to computational time
  • 17. Conclusion Clarans algorithm can combine with weigting dissimilarity function for solve spatial data clustering. Weigting dissimilarity function more effeciecy than traditional dissimilarity to solve spatial relationship. Increasing number of cluster will be followed by increasing computational time but not efficiency of cluster. Weigthing spatial very influent to solve spatial data clustering.