SlideShare a Scribd company logo
This course is prepared under the Erasmus+ KA-210-YOU Project titled
«Skilling Youth for the Next Generation Air Transport Management»
Machine Learning
Applications in Aviation
Clustering
Asst. Prof. Dr. Emircan Özdemir
Eskişehir Technical University
• Clustering, as a fundamental unsupervised learning technique, holds paramount
importance in discerning inherent structures within aviation datasets. It's the compass that
guides analysts through the vast sea of information, allowing them to uncover hidden
relationships, group similar entities, and derive meaningful insights.
• At its core, clustering is the art of finding natural groupings or clusters within a dataset.
These clusters represent entities that share similarities, creating a valuable framework for
understanding the underlying structure of aviation data.
Clustering 2
Introduction
• Clustering operates in the unsupervised learning realm, where the algorithm explores the
data without predefined labels. In aviation analytics, where the intricacies of flight
patterns, maintenance records, and passenger behaviors are multi-faceted, unsupervised
learning becomes the compass guiding analysts through uncharted territories. Clustering,
in particular, becomes the lens through which intricate patterns and relationships come
into focus.
• Consider clustering as the air traffic controller of data points, guiding them to form
coherent patterns and groupings. This grouping mechanism is crucial in aviation for
various applications – from categorizing aircraft maintenance profiles with shared
characteristics to segmenting passenger behaviors for targeted marketing strategies.
Clustering 3
Introduction
• K-Means clustering is a foundational algorithm in unsupervised learning, widely
employed for its simplicity and efficiency. This algorithm partitions data points into K
clusters, where each cluster is represented by its centroid. The iterative process refines
cluster assignments until convergence, making it a valuable tool data analysis.
Clustering 4
Types of Clustering Algorithms
Source (left): https://medium.com/data-folks-indonesia/step-by-step-to-understanding-k-means-clustering-and-implementation-with-sklearn-b55803f519d6
Source (right): https://www.ejable.com/tech-corner/ai-machine-learning-and-deep-learning/k-means-clustering/
• Hierarchical clustering stands out for its ability to create a hierarchical tree-like structure
of clusters. This method is particularly advantageous when the hierarchy of relationships
within the data is of interest. In data analytics, hierarchical clustering finds utility in
scenarios where data exhibits nested patterns.
Clustering 5
Types of Clustering Algorithms
Source: https://codinginfinite.com/hierarchical-clustering-applications-advantages-and-disadvantages/
• DBSCAN (Density-Based Spatial Clustering of Applications with Noise) is a density-based
clustering algorithm that excels in identifying clusters of varying shapes and sizes. Unlike
K-Means, DBSCAN doesn't require specifying the number of clusters beforehand and can
uncover outliers or noise in the data.
Clustering 6
Types of Clustering Algorithms
Source: https://towardsdatascience.com/understanding-dbscan-and-implementation-with-python-5de75a786f9f
Aircraft Maintenance Grouping
• Clustering can be used to categorize maintenance profiles of aircrafts. This application
involves grouping similar maintenance patterns based on historical data. By identifying
commonalities in maintenance needs, aviation experts can proactively plan and optimize
maintenance schedules, ensuring the fleet's operational efficiency and safety.
Clustering 7
Clustering Use Cases in Aviation
https://investinestonia.com/magnetic-mro-wants-to-conquer-the-world/
Passenger Segmentation
• In the dynamic world of aviation, understanding passenger behavior is akin to deciphering
a complex code. Clustering steps in as the linguist, segmenting passenger behaviors into
meaningful groups. Whether it's frequent flyers, leisure travelers, or business executives,
clustering enables airlines to tailor services, marketing, and experiences for distinct
passenger segments, enhancing overall customer satisfaction and loyalty.
Clustering 8
Clustering Use Cases in Aviation
Source: https://investor-relations.lufthansagroup.com/fileadmin/downloads/en/charts-speeches/capital-markets-day-2019/capital-markets-day-2019-presentations.pdf
Route Optimization
• Flight paths are the arteries of aviation operations, and clustering serves as the compass
for optimal route planning. By analyzing geographical patterns, clustering algorithms
assist in grouping destinations with similar characteristics. This allows airlines to optimize
flight routes, considering factors like weather, fuel efficiency, and airspace constraints.
The results are: streamlined operations, reduced fuel costs, and improved on-time
performance.
Clustering 9
Clustering Use Cases in Aviation
Source: http://coolinfographics.squarespace.com/blog/2016/6/3/the-global-air-transportation-network.html;jsessionid=E11686681A0ACCCF95D74B92A8C72E4E.v5-web014
Feature Selection
Feature selection stands as a crucial consideration, emphasizing the
importance of choosing the most relevant variables for effective clustering.
Like a skilled pilot choosing the essential instruments for a smooth flight,
feature selection ensures that the clustering algorithm focuses on the data
aspects most pertinent to the aviation context.
Clustering 10
Considerations and Challenges in Aviation Clustering
Scalability
• Scalability emerges as a challenge that requires careful attention. As
datasets soar in size and complexity, clustering algorithms must efficiently
handle the increasing volume of information. Like managing air traffic,
scalability in clustering ensures that algorithms remain effective and
responsive even when dealing with vast aviation datasets, contributing to
the seamless analysis of patterns and insights.
Clustering 11
Considerations and Challenges in Aviation Clustering
Data Preprocessing: Data preprocessing emerges as a critical best practice, emphasizing
the significance of cleaning and preparing data for clustering tasks. Similar to ensuring that
an aircraft is in optimal condition before takeoff, data preprocessing ensures that the input
data is refined and ready for the intricate process of pattern recognition through clustering.
Evaluation Metrics: Assessing the success of clustering models requires reliable metrics.
Incorporating evaluation metrics is essential for gauging the performance of clustering
algorithms. These metrics act as measurement tools, allowing analysts to quantitatively
understand how well the clustering process aligns with the goals of aviation analytics.
Clustering 12
Best Practices for Aviation Clustering
• In RapidMiner, using the Repository window, follow the
path Training Resources-Model-Unsupervised-
Segmentation and open the Credit Risk k-Means
Clustering solution process.
• In this example, the customers are aimed to be segmented
into groups according to their credit risks.
• There is no label attribute and several attributes related to
the credit risk of customers are taken into account to
segment the customers.
• Therefore, clustering model is choosen to reach this goal.
Clustering 13
RapidMiner Example on Clustering
• In the process window, there are data importing (ETL) operator, clustering model operator,
and cluster model visualizer operator. In ETL oeprator, there are suboperators to
prerocess the data. Z-transformation (normalization) stands here in the subprocess
window. Also model parameters (number of k, numerical measure type etc.) can be set on
the window right.
Clustering 14
RapidMiner Example on Clustering
• After you run the model, you can
find outputs in the Results view.
• You can find cluster graph,
members of each cluser, centroid
table, and plot view of clusters
here.
• Figure on the right shows the
cluster graph (tree).
Clustering 15
RapidMiner Example on Clustering
• In the results view, cluster model
visualizer operator provides further
insights.
• In the overview tab, you can see
the clusters and breakdowns
based on attributes.
Clustering 16
RapidMiner Example on Clustering
• In the Scatter Plot tab, you can
create scatter plots focusing your
each cluster.
Clustering 17
RapidMiner Example on Clustering
In the Plot tab, you can see the
attributes that:
- Clusters differ most
- Clusters not differ
- The complexiy of differences
between clusters
Clustering 18
RapidMiner Example on Clustering
• Moreover, if you want to analyze the performance of the clustering model, you can use
the performance operators under the segmentation folder in the operators window.
Clustering 19
RapidMiner Example on Clustering
• Incorporation of AI in Clustering
As aviation analytics evolves, the integration of Artificial Intelligence (AI) into clustering
methods represents a significant trend. Advanced AI techniques enhance the capabilities of
clustering algorithms, allowing them to adapt and discover intricate patterns within aviation
datasets, leading to more accurate and nuanced insights.
• Explainable Clustering
The future of aviation clustering emphasizes the importance of transparency and
interpretability. As clustering models become more sophisticated, the need for
understanding the rationale behind clustering outcomes grows. The concept of explainable
clustering ensures that results are not only accurate but also comprehensible, fostering trust
in the decision-making processes driven by clustering algorithms.
Clustering 20
Future Trends in Clustering
• In this lesson, we exlored the clustering in aviation comprehensively.
• Various clustering algrotihms were introduced and the main characteristics/differences
were explained.
• You can further explore clustering algortihms using different datasets in RapidMiner.
• Feature selection (selection of attributes) is a key point to build accurate clustering
models. So, try different combinations for your attributes in the clustering models and try
to figure out differences between outputs.
• Also keep in mind to compare performances of your clustering models.
Clustering 21
Conclusion

More Related Content

Similar to ml-06x01.pdf

A Result on Novel Approach for Load Balancing in Cloud Computing
A Result on Novel Approach for Load Balancing in Cloud ComputingA Result on Novel Approach for Load Balancing in Cloud Computing
A Result on Novel Approach for Load Balancing in Cloud Computing
ijtsrd
 
Predicting Steering Angle for Self Driving Vehicles
Predicting Steering Angle for Self Driving VehiclesPredicting Steering Angle for Self Driving Vehicles
Predicting Steering Angle for Self Driving Vehicles
IRJET Journal
 
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace DataMPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
IRJET Journal
 
Mapping the road to better data storage strategies
Mapping the road to better data storage strategiesMapping the road to better data storage strategies
Mapping the road to better data storage strategies
ClearSky Data
 
A Survey on the Clustering Algorithms in Sales Data Mining
A Survey on the Clustering Algorithms in Sales Data MiningA Survey on the Clustering Algorithms in Sales Data Mining
A Survey on the Clustering Algorithms in Sales Data Mining
Editor IJCATR
 
A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...butest
 
Integration of cost-risk within an intelligent maintenance system
Integration of cost-risk within an intelligent maintenance systemIntegration of cost-risk within an intelligent maintenance system
Integration of cost-risk within an intelligent maintenance systemLaurent Carlander
 
Machine Learning-Based Phishing Detection
Machine Learning-Based Phishing DetectionMachine Learning-Based Phishing Detection
Machine Learning-Based Phishing Detection
IRJET Journal
 
Knowledge Acquisition Based on Repertory Grid Analysis System
Knowledge Acquisition Based on Repertory Grid Analysis SystemKnowledge Acquisition Based on Repertory Grid Analysis System
Knowledge Acquisition Based on Repertory Grid Analysis System
ijtsrd
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
IRJET Journal
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Edunomica
 
How Partitioning Clustering Technique For Implementing...
How Partitioning Clustering Technique For Implementing...How Partitioning Clustering Technique For Implementing...
How Partitioning Clustering Technique For Implementing...
Nicolle Dammann
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET Journal
 
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISKMACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
IRJET Journal
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET Journal
 
ATLAS_Analytics_for_Total_Life_Cycle_for_Automotive_Systems
ATLAS_Analytics_for_Total_Life_Cycle_for_Automotive_SystemsATLAS_Analytics_for_Total_Life_Cycle_for_Automotive_Systems
ATLAS_Analytics_for_Total_Life_Cycle_for_Automotive_SystemsNarendra Ashar
 
IRJET- Smart Railway System using Trip Chaining Method
IRJET- Smart Railway System using Trip Chaining MethodIRJET- Smart Railway System using Trip Chaining Method
IRJET- Smart Railway System using Trip Chaining Method
IRJET Journal
 
Architecting a machine learning pipeline for online traffic classification in...
Architecting a machine learning pipeline for online traffic classification in...Architecting a machine learning pipeline for online traffic classification in...
Architecting a machine learning pipeline for online traffic classification in...
IAESIJAI
 
Data mining techniques using weka
Data mining techniques using wekaData mining techniques using weka
Data mining techniques using wekaPrashant Menon
 

Similar to ml-06x01.pdf (20)

A Result on Novel Approach for Load Balancing in Cloud Computing
A Result on Novel Approach for Load Balancing in Cloud ComputingA Result on Novel Approach for Load Balancing in Cloud Computing
A Result on Novel Approach for Load Balancing in Cloud Computing
 
Predicting Steering Angle for Self Driving Vehicles
Predicting Steering Angle for Self Driving VehiclesPredicting Steering Angle for Self Driving Vehicles
Predicting Steering Angle for Self Driving Vehicles
 
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace DataMPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
MPSKM Algorithm to Cluster Uneven Dimensional Time Series Subspace Data
 
Mapping the road to better data storage strategies
Mapping the road to better data storage strategiesMapping the road to better data storage strategies
Mapping the road to better data storage strategies
 
A Survey on the Clustering Algorithms in Sales Data Mining
A Survey on the Clustering Algorithms in Sales Data MiningA Survey on the Clustering Algorithms in Sales Data Mining
A Survey on the Clustering Algorithms in Sales Data Mining
 
A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...A Survey of Machine Learning Methods Applied to Computer ...
A Survey of Machine Learning Methods Applied to Computer ...
 
Integration of cost-risk within an intelligent maintenance system
Integration of cost-risk within an intelligent maintenance systemIntegration of cost-risk within an intelligent maintenance system
Integration of cost-risk within an intelligent maintenance system
 
Machine Learning-Based Phishing Detection
Machine Learning-Based Phishing DetectionMachine Learning-Based Phishing Detection
Machine Learning-Based Phishing Detection
 
Knowledge Acquisition Based on Repertory Grid Analysis System
Knowledge Acquisition Based on Repertory Grid Analysis SystemKnowledge Acquisition Based on Repertory Grid Analysis System
Knowledge Acquisition Based on Repertory Grid Analysis System
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
 
How Partitioning Clustering Technique For Implementing...
How Partitioning Clustering Technique For Implementing...How Partitioning Clustering Technique For Implementing...
How Partitioning Clustering Technique For Implementing...
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
 
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISKMACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
MACHINE LEARNING CLASSIFIERS TO ANALYZE CREDIT RISK
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
 
ATLAS_Analytics_for_Total_Life_Cycle_for_Automotive_Systems
ATLAS_Analytics_for_Total_Life_Cycle_for_Automotive_SystemsATLAS_Analytics_for_Total_Life_Cycle_for_Automotive_Systems
ATLAS_Analytics_for_Total_Life_Cycle_for_Automotive_Systems
 
IRJET- Smart Railway System using Trip Chaining Method
IRJET- Smart Railway System using Trip Chaining MethodIRJET- Smart Railway System using Trip Chaining Method
IRJET- Smart Railway System using Trip Chaining Method
 
Architecting a machine learning pipeline for online traffic classification in...
Architecting a machine learning pipeline for online traffic classification in...Architecting a machine learning pipeline for online traffic classification in...
Architecting a machine learning pipeline for online traffic classification in...
 
Data mining techniques using weka
Data mining techniques using wekaData mining techniques using weka
Data mining techniques using weka
 
E502024047
E502024047E502024047
E502024047
 

More from NextGenATM Erasmus+ Project (20)

ml-09x01.pdf
ml-09x01.pdfml-09x01.pdf
ml-09x01.pdf
 
ml-08x01.pdf
ml-08x01.pdfml-08x01.pdf
ml-08x01.pdf
 
ml-05x01.pdf
ml-05x01.pdfml-05x01.pdf
ml-05x01.pdf
 
ml-03x01.pdf
ml-03x01.pdfml-03x01.pdf
ml-03x01.pdf
 
ml-02x01.pdf
ml-02x01.pdfml-02x01.pdf
ml-02x01.pdf
 
EAVA presentation.pdf
EAVA presentation.pdfEAVA presentation.pdf
EAVA presentation.pdf
 
ESTU presentation.pdf
ESTU presentation.pdfESTU presentation.pdf
ESTU presentation.pdf
 
HSW presentation.pdf
HSW presentation.pdfHSW presentation.pdf
HSW presentation.pdf
 
ts-07x01.pdf
ts-07x01.pdfts-07x01.pdf
ts-07x01.pdf
 
ts-06x01.pdf
ts-06x01.pdfts-06x01.pdf
ts-06x01.pdf
 
ts-05x01.pdf
ts-05x01.pdfts-05x01.pdf
ts-05x01.pdf
 
ts-04x01.pdf
ts-04x01.pdfts-04x01.pdf
ts-04x01.pdf
 
ts-03x01.pdf
ts-03x01.pdfts-03x01.pdf
ts-03x01.pdf
 
ts-02x01.pdf
ts-02x01.pdfts-02x01.pdf
ts-02x01.pdf
 
ts-01x01.pdf
ts-01x01.pdfts-01x01.pdf
ts-01x01.pdf
 
sa-07x01.pdf
sa-07x01.pdfsa-07x01.pdf
sa-07x01.pdf
 
sa-06x01.pdf
sa-06x01.pdfsa-06x01.pdf
sa-06x01.pdf
 
sa-05x01.pdf
sa-05x01.pdfsa-05x01.pdf
sa-05x01.pdf
 
sa-04x02.pdf
sa-04x02.pdfsa-04x02.pdf
sa-04x02.pdf
 
sa-04x01.pdf
sa-04x01.pdfsa-04x01.pdf
sa-04x01.pdf
 

Recently uploaded

678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 

Recently uploaded (20)

678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 

ml-06x01.pdf

  • 1. This course is prepared under the Erasmus+ KA-210-YOU Project titled «Skilling Youth for the Next Generation Air Transport Management» Machine Learning Applications in Aviation Clustering Asst. Prof. Dr. Emircan Özdemir Eskişehir Technical University
  • 2. • Clustering, as a fundamental unsupervised learning technique, holds paramount importance in discerning inherent structures within aviation datasets. It's the compass that guides analysts through the vast sea of information, allowing them to uncover hidden relationships, group similar entities, and derive meaningful insights. • At its core, clustering is the art of finding natural groupings or clusters within a dataset. These clusters represent entities that share similarities, creating a valuable framework for understanding the underlying structure of aviation data. Clustering 2 Introduction
  • 3. • Clustering operates in the unsupervised learning realm, where the algorithm explores the data without predefined labels. In aviation analytics, where the intricacies of flight patterns, maintenance records, and passenger behaviors are multi-faceted, unsupervised learning becomes the compass guiding analysts through uncharted territories. Clustering, in particular, becomes the lens through which intricate patterns and relationships come into focus. • Consider clustering as the air traffic controller of data points, guiding them to form coherent patterns and groupings. This grouping mechanism is crucial in aviation for various applications – from categorizing aircraft maintenance profiles with shared characteristics to segmenting passenger behaviors for targeted marketing strategies. Clustering 3 Introduction
  • 4. • K-Means clustering is a foundational algorithm in unsupervised learning, widely employed for its simplicity and efficiency. This algorithm partitions data points into K clusters, where each cluster is represented by its centroid. The iterative process refines cluster assignments until convergence, making it a valuable tool data analysis. Clustering 4 Types of Clustering Algorithms Source (left): https://medium.com/data-folks-indonesia/step-by-step-to-understanding-k-means-clustering-and-implementation-with-sklearn-b55803f519d6 Source (right): https://www.ejable.com/tech-corner/ai-machine-learning-and-deep-learning/k-means-clustering/
  • 5. • Hierarchical clustering stands out for its ability to create a hierarchical tree-like structure of clusters. This method is particularly advantageous when the hierarchy of relationships within the data is of interest. In data analytics, hierarchical clustering finds utility in scenarios where data exhibits nested patterns. Clustering 5 Types of Clustering Algorithms Source: https://codinginfinite.com/hierarchical-clustering-applications-advantages-and-disadvantages/
  • 6. • DBSCAN (Density-Based Spatial Clustering of Applications with Noise) is a density-based clustering algorithm that excels in identifying clusters of varying shapes and sizes. Unlike K-Means, DBSCAN doesn't require specifying the number of clusters beforehand and can uncover outliers or noise in the data. Clustering 6 Types of Clustering Algorithms Source: https://towardsdatascience.com/understanding-dbscan-and-implementation-with-python-5de75a786f9f
  • 7. Aircraft Maintenance Grouping • Clustering can be used to categorize maintenance profiles of aircrafts. This application involves grouping similar maintenance patterns based on historical data. By identifying commonalities in maintenance needs, aviation experts can proactively plan and optimize maintenance schedules, ensuring the fleet's operational efficiency and safety. Clustering 7 Clustering Use Cases in Aviation https://investinestonia.com/magnetic-mro-wants-to-conquer-the-world/
  • 8. Passenger Segmentation • In the dynamic world of aviation, understanding passenger behavior is akin to deciphering a complex code. Clustering steps in as the linguist, segmenting passenger behaviors into meaningful groups. Whether it's frequent flyers, leisure travelers, or business executives, clustering enables airlines to tailor services, marketing, and experiences for distinct passenger segments, enhancing overall customer satisfaction and loyalty. Clustering 8 Clustering Use Cases in Aviation Source: https://investor-relations.lufthansagroup.com/fileadmin/downloads/en/charts-speeches/capital-markets-day-2019/capital-markets-day-2019-presentations.pdf
  • 9. Route Optimization • Flight paths are the arteries of aviation operations, and clustering serves as the compass for optimal route planning. By analyzing geographical patterns, clustering algorithms assist in grouping destinations with similar characteristics. This allows airlines to optimize flight routes, considering factors like weather, fuel efficiency, and airspace constraints. The results are: streamlined operations, reduced fuel costs, and improved on-time performance. Clustering 9 Clustering Use Cases in Aviation Source: http://coolinfographics.squarespace.com/blog/2016/6/3/the-global-air-transportation-network.html;jsessionid=E11686681A0ACCCF95D74B92A8C72E4E.v5-web014
  • 10. Feature Selection Feature selection stands as a crucial consideration, emphasizing the importance of choosing the most relevant variables for effective clustering. Like a skilled pilot choosing the essential instruments for a smooth flight, feature selection ensures that the clustering algorithm focuses on the data aspects most pertinent to the aviation context. Clustering 10 Considerations and Challenges in Aviation Clustering
  • 11. Scalability • Scalability emerges as a challenge that requires careful attention. As datasets soar in size and complexity, clustering algorithms must efficiently handle the increasing volume of information. Like managing air traffic, scalability in clustering ensures that algorithms remain effective and responsive even when dealing with vast aviation datasets, contributing to the seamless analysis of patterns and insights. Clustering 11 Considerations and Challenges in Aviation Clustering
  • 12. Data Preprocessing: Data preprocessing emerges as a critical best practice, emphasizing the significance of cleaning and preparing data for clustering tasks. Similar to ensuring that an aircraft is in optimal condition before takeoff, data preprocessing ensures that the input data is refined and ready for the intricate process of pattern recognition through clustering. Evaluation Metrics: Assessing the success of clustering models requires reliable metrics. Incorporating evaluation metrics is essential for gauging the performance of clustering algorithms. These metrics act as measurement tools, allowing analysts to quantitatively understand how well the clustering process aligns with the goals of aviation analytics. Clustering 12 Best Practices for Aviation Clustering
  • 13. • In RapidMiner, using the Repository window, follow the path Training Resources-Model-Unsupervised- Segmentation and open the Credit Risk k-Means Clustering solution process. • In this example, the customers are aimed to be segmented into groups according to their credit risks. • There is no label attribute and several attributes related to the credit risk of customers are taken into account to segment the customers. • Therefore, clustering model is choosen to reach this goal. Clustering 13 RapidMiner Example on Clustering
  • 14. • In the process window, there are data importing (ETL) operator, clustering model operator, and cluster model visualizer operator. In ETL oeprator, there are suboperators to prerocess the data. Z-transformation (normalization) stands here in the subprocess window. Also model parameters (number of k, numerical measure type etc.) can be set on the window right. Clustering 14 RapidMiner Example on Clustering
  • 15. • After you run the model, you can find outputs in the Results view. • You can find cluster graph, members of each cluser, centroid table, and plot view of clusters here. • Figure on the right shows the cluster graph (tree). Clustering 15 RapidMiner Example on Clustering
  • 16. • In the results view, cluster model visualizer operator provides further insights. • In the overview tab, you can see the clusters and breakdowns based on attributes. Clustering 16 RapidMiner Example on Clustering
  • 17. • In the Scatter Plot tab, you can create scatter plots focusing your each cluster. Clustering 17 RapidMiner Example on Clustering
  • 18. In the Plot tab, you can see the attributes that: - Clusters differ most - Clusters not differ - The complexiy of differences between clusters Clustering 18 RapidMiner Example on Clustering
  • 19. • Moreover, if you want to analyze the performance of the clustering model, you can use the performance operators under the segmentation folder in the operators window. Clustering 19 RapidMiner Example on Clustering
  • 20. • Incorporation of AI in Clustering As aviation analytics evolves, the integration of Artificial Intelligence (AI) into clustering methods represents a significant trend. Advanced AI techniques enhance the capabilities of clustering algorithms, allowing them to adapt and discover intricate patterns within aviation datasets, leading to more accurate and nuanced insights. • Explainable Clustering The future of aviation clustering emphasizes the importance of transparency and interpretability. As clustering models become more sophisticated, the need for understanding the rationale behind clustering outcomes grows. The concept of explainable clustering ensures that results are not only accurate but also comprehensible, fostering trust in the decision-making processes driven by clustering algorithms. Clustering 20 Future Trends in Clustering
  • 21. • In this lesson, we exlored the clustering in aviation comprehensively. • Various clustering algrotihms were introduced and the main characteristics/differences were explained. • You can further explore clustering algortihms using different datasets in RapidMiner. • Feature selection (selection of attributes) is a key point to build accurate clustering models. So, try different combinations for your attributes in the clustering models and try to figure out differences between outputs. • Also keep in mind to compare performances of your clustering models. Clustering 21 Conclusion