SlideShare a Scribd company logo
1 of 10
Download to read offline
Build an AI Auto
Mechanic Using
Reinforcement
Learning
All views expressed on this project are my
own and do not represent the opinions of
any entity whatsoever with which I have
been, am now, or will be affiliated.
.
Disclaimer
This presentation would not dive deep into
any of the technical methodology of
reinforcement learning. Anyone who is
interested to learn more about reinforcement
learning, please refer to “Reinforcement
Learning: An Introduction” by Richard S.
Sutton and Andrew G. Barto.
.
What is reinforcement learning ?
Imagine you are trying to build an AI Agent using
reinforcement learning(RL) to play “Space invaders”,
you give it a bunch of action button, live image and
score from the game and the agent do not have
access to any of back end system of the game.
The agent know there is a reward that it have to
maximize, but it have no idea what does each
button do and what does each image represent,
therefore, it would act like a child who is expose to
video game for the first time, by randomly pressing
action in the beginning and gradually learn to select
the correct action at the right time to achieve the
highest score.
A very simplified explanation using a example:
Case
Study
As APS is an important
function for trucks in its
everyday usages, which
generates pressurized air
that are utilized in various
functions in a truck, such as
braking and gear changes.
Hence, failure in APS would
resulted in costly repairs
and render the truck
unusable for any business
operation that would lead to
further profit loses.
This project will use the
publicized data by Scania
Trucks which consist of Air
Pressure System (APS)
Failure and Operational Data,
to simulate the environment
that is require to train the AI
agent using reinforcement
learning on real life
application.
Therefore, a functioning AI
agent that could predict the
if the truck predict APS failure
and take action can save a lot
of cost and time for the
business.
Information regarding about the case study
03
01
04
02
The training set contain about 60,000 observation
and test set contain 16,000, each observation
represent a truck.
Dataset
The attribute names of the data have been
anonymized by the provider for proprietary reasons.
Attribute
The dataset's positive class consists of component
failures for a specific component of the APS system.
The negative class consists of trucks with failures
for components not related to the APS.
Class
Each APS failure would cost $500 and $10 for any
false alarm for APS failure
Cost
PortfolioPresentation
A2CER PPO - Actor Advantage Critic with prioritized Experience
Replay Proximal Policy Optimization
• Actor Advantage Critic - A hybrid model that
implement policy based methods for the actors to
take action and value based methods for the Critic
to evaluate its action.
• Prioritized Experience Replay – a technique that
allowed rare experience to be learn more frequent
by the agent.
• Proximal Policy Optimization – avoid a single
update to be too great.
Reinforcement Learning Architecture used in this case –
A2CER PPO
Architecture High level layout
Result
01 02
Fail take action against 35 APS fail truck, cost
$17,500, 447 false alarm which cost $4,470.
Total cost $21,970, without the AI, all 375 APS
failure would cost $187,500, which save about
$165,530 in total.
Cost:
As RL are build for continuous learning, hence, as
time past the AI Agent would only keep improving
and could adjust itself across time to handle a
completely unseen environment.
Continuous Learning
* Total number of truck in the test set is 16,000
Recommendation to improve
RL performance
Reinforcement learning would perform
better if there is a stream of data for each
observation (Truck) rather than using a
single dataset at a particular time, as the
agent would be able to learn more
preciously about the chain of reaction or
early symptom of the truck failure and
take a action to alert the owner at much
earlier stage.
Real Life Application

More Related Content

Similar to Build an-ai-auto-mechanic

Role of SunTec India's Data Annotation in Smart Parking Development.pdf
Role of SunTec India's Data Annotation in Smart Parking Development.pdfRole of SunTec India's Data Annotation in Smart Parking Development.pdf
Role of SunTec India's Data Annotation in Smart Parking Development.pdfSunTec India
 
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with SplunkSplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with SplunkSplunk
 
SplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk
SplunkLive! Munich 2018: Monitoring the End-User Experience with SplunkSplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk
SplunkLive! Munich 2018: Monitoring the End-User Experience with SplunkSplunk
 
SplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk
SplunkLive! Zurich 2018: Monitoring the End User Experience with SplunkSplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk
SplunkLive! Zurich 2018: Monitoring the End User Experience with SplunkSplunk
 
IRJET- Machine Learning
IRJET- Machine LearningIRJET- Machine Learning
IRJET- Machine LearningIRJET Journal
 
IRJET- Quality Inspection of Tire using Deep Learning based Computer Vision
IRJET-  	  Quality Inspection of Tire using Deep Learning based Computer VisionIRJET-  	  Quality Inspection of Tire using Deep Learning based Computer Vision
IRJET- Quality Inspection of Tire using Deep Learning based Computer VisionIRJET Journal
 
IRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET - Airplane Crash Analysis and Prediction using Machine LearningIRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET - Airplane Crash Analysis and Prediction using Machine LearningIRJET Journal
 
5 Essential Tips for Load Testing Beginners
5 Essential Tips for Load Testing Beginners5 Essential Tips for Load Testing Beginners
5 Essential Tips for Load Testing BeginnersNeotys
 
IRJET- Parking Space Detection using Image Processing in MATLAB
IRJET- Parking Space Detection using Image Processing in MATLABIRJET- Parking Space Detection using Image Processing in MATLAB
IRJET- Parking Space Detection using Image Processing in MATLABIRJET Journal
 
IRJET - Obstacle Detection using a Stereo Vision of a Car
IRJET -  	  Obstacle Detection using a Stereo Vision of a CarIRJET -  	  Obstacle Detection using a Stereo Vision of a Car
IRJET - Obstacle Detection using a Stereo Vision of a CarIRJET Journal
 
How to find defects early and increase the reliability of software systems
How to find defects early and increase the reliability of software systemsHow to find defects early and increase the reliability of software systems
How to find defects early and increase the reliability of software systemsRAKESH RANA
 
IRJET- Number Plate Extraction from Vehicle Front View Image using Image ...
IRJET-  	  Number Plate Extraction from Vehicle Front View Image using Image ...IRJET-  	  Number Plate Extraction from Vehicle Front View Image using Image ...
IRJET- Number Plate Extraction from Vehicle Front View Image using Image ...IRJET Journal
 
Traffic Signboard Classification with Voice alert to the driver.pptx
Traffic Signboard Classification with Voice alert to the driver.pptxTraffic Signboard Classification with Voice alert to the driver.pptx
Traffic Signboard Classification with Voice alert to the driver.pptxharimaxwell0712
 
IRJET- Self-Driving Cars: Automation Testing using Udacity Simulator
IRJET- Self-Driving Cars: Automation Testing using Udacity SimulatorIRJET- Self-Driving Cars: Automation Testing using Udacity Simulator
IRJET- Self-Driving Cars: Automation Testing using Udacity SimulatorIRJET Journal
 
Ake parking guidance system
Ake  parking guidance systemAke  parking guidance system
Ake parking guidance systemakevincent
 
Identifying Parking Spots from Surveillance Cameras using CNN
Identifying Parking Spots from Surveillance Cameras using CNNIdentifying Parking Spots from Surveillance Cameras using CNN
Identifying Parking Spots from Surveillance Cameras using CNNIRJET Journal
 
IRJET- Portable Automated Test System for Automotive Industry
IRJET- Portable Automated Test System for Automotive IndustryIRJET- Portable Automated Test System for Automotive Industry
IRJET- Portable Automated Test System for Automotive IndustryIRJET Journal
 
REAL TIME DROWSINESS DETECTION
REAL TIME DROWSINESS DETECTIONREAL TIME DROWSINESS DETECTION
REAL TIME DROWSINESS DETECTIONIRJET Journal
 
IRJET- Smart Parking Assistance By Nameplate Recognition Using OCR
IRJET-  	  Smart Parking Assistance By Nameplate Recognition Using OCRIRJET-  	  Smart Parking Assistance By Nameplate Recognition Using OCR
IRJET- Smart Parking Assistance By Nameplate Recognition Using OCRIRJET Journal
 
deep-reinforcement-learning-framework.pdf
deep-reinforcement-learning-framework.pdfdeep-reinforcement-learning-framework.pdf
deep-reinforcement-learning-framework.pdfYugank Aman
 

Similar to Build an-ai-auto-mechanic (20)

Role of SunTec India's Data Annotation in Smart Parking Development.pdf
Role of SunTec India's Data Annotation in Smart Parking Development.pdfRole of SunTec India's Data Annotation in Smart Parking Development.pdf
Role of SunTec India's Data Annotation in Smart Parking Development.pdf
 
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with SplunkSplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk
SplunkLive! Frankfurt 2018 - Monitoring the End User Experience with Splunk
 
SplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk
SplunkLive! Munich 2018: Monitoring the End-User Experience with SplunkSplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk
SplunkLive! Munich 2018: Monitoring the End-User Experience with Splunk
 
SplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk
SplunkLive! Zurich 2018: Monitoring the End User Experience with SplunkSplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk
SplunkLive! Zurich 2018: Monitoring the End User Experience with Splunk
 
IRJET- Machine Learning
IRJET- Machine LearningIRJET- Machine Learning
IRJET- Machine Learning
 
IRJET- Quality Inspection of Tire using Deep Learning based Computer Vision
IRJET-  	  Quality Inspection of Tire using Deep Learning based Computer VisionIRJET-  	  Quality Inspection of Tire using Deep Learning based Computer Vision
IRJET- Quality Inspection of Tire using Deep Learning based Computer Vision
 
IRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET - Airplane Crash Analysis and Prediction using Machine LearningIRJET - Airplane Crash Analysis and Prediction using Machine Learning
IRJET - Airplane Crash Analysis and Prediction using Machine Learning
 
5 Essential Tips for Load Testing Beginners
5 Essential Tips for Load Testing Beginners5 Essential Tips for Load Testing Beginners
5 Essential Tips for Load Testing Beginners
 
IRJET- Parking Space Detection using Image Processing in MATLAB
IRJET- Parking Space Detection using Image Processing in MATLABIRJET- Parking Space Detection using Image Processing in MATLAB
IRJET- Parking Space Detection using Image Processing in MATLAB
 
IRJET - Obstacle Detection using a Stereo Vision of a Car
IRJET -  	  Obstacle Detection using a Stereo Vision of a CarIRJET -  	  Obstacle Detection using a Stereo Vision of a Car
IRJET - Obstacle Detection using a Stereo Vision of a Car
 
How to find defects early and increase the reliability of software systems
How to find defects early and increase the reliability of software systemsHow to find defects early and increase the reliability of software systems
How to find defects early and increase the reliability of software systems
 
IRJET- Number Plate Extraction from Vehicle Front View Image using Image ...
IRJET-  	  Number Plate Extraction from Vehicle Front View Image using Image ...IRJET-  	  Number Plate Extraction from Vehicle Front View Image using Image ...
IRJET- Number Plate Extraction from Vehicle Front View Image using Image ...
 
Traffic Signboard Classification with Voice alert to the driver.pptx
Traffic Signboard Classification with Voice alert to the driver.pptxTraffic Signboard Classification with Voice alert to the driver.pptx
Traffic Signboard Classification with Voice alert to the driver.pptx
 
IRJET- Self-Driving Cars: Automation Testing using Udacity Simulator
IRJET- Self-Driving Cars: Automation Testing using Udacity SimulatorIRJET- Self-Driving Cars: Automation Testing using Udacity Simulator
IRJET- Self-Driving Cars: Automation Testing using Udacity Simulator
 
Ake parking guidance system
Ake  parking guidance systemAke  parking guidance system
Ake parking guidance system
 
Identifying Parking Spots from Surveillance Cameras using CNN
Identifying Parking Spots from Surveillance Cameras using CNNIdentifying Parking Spots from Surveillance Cameras using CNN
Identifying Parking Spots from Surveillance Cameras using CNN
 
IRJET- Portable Automated Test System for Automotive Industry
IRJET- Portable Automated Test System for Automotive IndustryIRJET- Portable Automated Test System for Automotive Industry
IRJET- Portable Automated Test System for Automotive Industry
 
REAL TIME DROWSINESS DETECTION
REAL TIME DROWSINESS DETECTIONREAL TIME DROWSINESS DETECTION
REAL TIME DROWSINESS DETECTION
 
IRJET- Smart Parking Assistance By Nameplate Recognition Using OCR
IRJET-  	  Smart Parking Assistance By Nameplate Recognition Using OCRIRJET-  	  Smart Parking Assistance By Nameplate Recognition Using OCR
IRJET- Smart Parking Assistance By Nameplate Recognition Using OCR
 
deep-reinforcement-learning-framework.pdf
deep-reinforcement-learning-framework.pdfdeep-reinforcement-learning-framework.pdf
deep-reinforcement-learning-framework.pdf
 

Recently uploaded

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 

Recently uploaded (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 

Build an-ai-auto-mechanic

  • 1. Build an AI Auto Mechanic Using Reinforcement Learning
  • 2. All views expressed on this project are my own and do not represent the opinions of any entity whatsoever with which I have been, am now, or will be affiliated. . Disclaimer This presentation would not dive deep into any of the technical methodology of reinforcement learning. Anyone who is interested to learn more about reinforcement learning, please refer to “Reinforcement Learning: An Introduction” by Richard S. Sutton and Andrew G. Barto. .
  • 3. What is reinforcement learning ? Imagine you are trying to build an AI Agent using reinforcement learning(RL) to play “Space invaders”, you give it a bunch of action button, live image and score from the game and the agent do not have access to any of back end system of the game. The agent know there is a reward that it have to maximize, but it have no idea what does each button do and what does each image represent, therefore, it would act like a child who is expose to video game for the first time, by randomly pressing action in the beginning and gradually learn to select the correct action at the right time to achieve the highest score. A very simplified explanation using a example:
  • 4. Case Study As APS is an important function for trucks in its everyday usages, which generates pressurized air that are utilized in various functions in a truck, such as braking and gear changes. Hence, failure in APS would resulted in costly repairs and render the truck unusable for any business operation that would lead to further profit loses. This project will use the publicized data by Scania Trucks which consist of Air Pressure System (APS) Failure and Operational Data, to simulate the environment that is require to train the AI agent using reinforcement learning on real life application. Therefore, a functioning AI agent that could predict the if the truck predict APS failure and take action can save a lot of cost and time for the business.
  • 5. Information regarding about the case study 03 01 04 02 The training set contain about 60,000 observation and test set contain 16,000, each observation represent a truck. Dataset The attribute names of the data have been anonymized by the provider for proprietary reasons. Attribute The dataset's positive class consists of component failures for a specific component of the APS system. The negative class consists of trucks with failures for components not related to the APS. Class Each APS failure would cost $500 and $10 for any false alarm for APS failure Cost
  • 6. PortfolioPresentation A2CER PPO - Actor Advantage Critic with prioritized Experience Replay Proximal Policy Optimization • Actor Advantage Critic - A hybrid model that implement policy based methods for the actors to take action and value based methods for the Critic to evaluate its action. • Prioritized Experience Replay – a technique that allowed rare experience to be learn more frequent by the agent. • Proximal Policy Optimization – avoid a single update to be too great. Reinforcement Learning Architecture used in this case – A2CER PPO
  • 8. Result 01 02 Fail take action against 35 APS fail truck, cost $17,500, 447 false alarm which cost $4,470. Total cost $21,970, without the AI, all 375 APS failure would cost $187,500, which save about $165,530 in total. Cost: As RL are build for continuous learning, hence, as time past the AI Agent would only keep improving and could adjust itself across time to handle a completely unseen environment. Continuous Learning * Total number of truck in the test set is 16,000
  • 9. Recommendation to improve RL performance Reinforcement learning would perform better if there is a stream of data for each observation (Truck) rather than using a single dataset at a particular time, as the agent would be able to learn more preciously about the chain of reaction or early symptom of the truck failure and take a action to alert the owner at much earlier stage.