1. Build an AI Auto Mechanic Using Reinforcement Learning
2. Disclaimer
All views expressed in this project are my own and do not represent the opinions of any entity with which I have been, am now, or will be affiliated.
This presentation does not dive deep into the technical methodology of reinforcement learning. Anyone interested in learning more should refer to "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto.
3. What is reinforcement learning?
A very simplified explanation using an example: imagine you are trying to build an AI agent that uses reinforcement learning (RL) to play "Space Invaders". You give it a set of action buttons, the live image, and the score from the game; the agent has no access to any of the game's back-end systems.
The agent knows there is a reward it has to maximize, but it has no idea what each button does or what each image represents. It therefore acts like a child exposed to a video game for the first time: it presses actions at random in the beginning and gradually learns to select the correct action at the right time to achieve the highest score.
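The trial-and-error loop described above can be sketched with tabular Q-learning on a toy environment. This is a deliberately simplified illustration of the idea, not the architecture used in this project:

```python
import random

# Toy environment: a 5-state corridor. The agent starts at state 0; moving
# right (+1) toward state 4 earns a reward of 1, while stepping left off the
# corridor ends the episode with no reward. Like the child at the video game,
# the agent acts randomly and learns which action is best from reward alone.
N_STATES, GOAL = 5, 4
ACTIONS = [-1, +1]

def train(episodes=500, alpha=0.1, gamma=0.9, seed=0):
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        s = 0
        while True:
            a = rng.choice(ACTIONS)          # explore: press buttons at random
            s2 = s + a
            if s2 < 0:                       # walked off the left end: no reward
                q[(s, a)] += alpha * (0.0 - q[(s, a)])
                break
            r = 1.0 if s2 == GOAL else 0.0
            # Q-learning update: move Q(s, a) toward reward + discounted best
            # future value; this is how reward information propagates backward.
            target = r + (0.0 if s2 == GOAL else gamma * max(q[(s2, x)] for x in ACTIONS))
            q[(s, a)] += alpha * (target - q[(s, a)])
            if s2 == GOAL:
                break
            s = s2
    return q

q = train()
# The greedy policy extracted from the learned values prefers +1 in every state.
policy = [max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(GOAL)]
print(policy)
```

Even though the behavior here is purely random, Q-learning is off-policy, so the learned values still converge toward the best action in each state.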
4. Case Study
The Air Pressure System (APS) is an important component of a truck's everyday operation: it generates pressurized air that is used in various functions, such as braking and gear changes. Failure of the APS therefore results in costly repairs and renders the truck unusable for business operations, leading to further profit losses.
This project uses the publicized Scania Trucks dataset, the APS Failure and Operational Data, to simulate the environment required to train an AI agent with reinforcement learning on a real-life application.
A functioning AI agent that can predict APS failure and take action could therefore save the business a great deal of cost and time.
5. Information about the case study
Dataset – The training set contains about 60,000 observations and the test set about 16,000; each observation represents a truck.
Attribute – The attribute names of the data have been anonymized by the provider for proprietary reasons.
Class – The dataset's positive class consists of component failures for a specific component of the APS system. The negative class consists of trucks with failures for components not related to the APS.
Cost – Each missed APS failure costs $500, and each false alarm for an APS failure costs $10.
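This cost structure implies a simple evaluation metric: total cost = $500 per missed failure plus $10 per false alarm. A minimal sketch of that metric (the function name and 0/1 label encoding are assumptions for illustration, not part of the dataset):

```python
# Cost model from the case study: a missed APS failure (false negative)
# costs $500, while a false alarm (false positive) costs $10.
COST_MISSED_FAILURE = 500
COST_FALSE_ALARM = 10

def total_cost(y_true, y_pred):
    """y_true / y_pred: 1 = APS failure (positive class), 0 = not APS-related."""
    false_negatives = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    false_positives = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return COST_MISSED_FAILURE * false_negatives + COST_FALSE_ALARM * false_positives

# e.g. one missed failure and three false alarms:
print(total_cost([1, 1, 0, 0, 0, 0], [0, 1, 1, 1, 1, 0]))  # → 530
```

The 50:1 cost asymmetry is what makes the problem interesting: an agent that simply never raises an alarm is heavily penalized.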
6. Reinforcement Learning Architecture used in this case – A2CER PPO
A2CER PPO – Advantage Actor Critic with Prioritized Experience Replay and Proximal Policy Optimization.
• Advantage Actor Critic – a hybrid model that uses a policy-based method for the actor, which takes actions, and a value-based method for the critic, which evaluates those actions.
• Prioritized Experience Replay – a technique that allows rare experiences to be replayed, and therefore learned from, more frequently by the agent.
• Proximal Policy Optimization – prevents any single policy update from being too large.
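The idea of "preventing too large an update" can be made concrete with PPO's clipped surrogate objective. A minimal NumPy sketch of that objective, not the project's actual implementation (function and argument names are illustrative):

```python
import numpy as np

def ppo_clipped_objective(new_logp, old_logp, advantages, clip_eps=0.2):
    """PPO's clipped surrogate objective (to be maximized).

    ratio = pi_new(a|s) / pi_old(a|s). Clipping the ratio to
    [1 - eps, 1 + eps] removes any incentive for a single update to
    move the policy far from the one that collected the data.
    """
    ratio = np.exp(new_logp - old_logp)
    unclipped = ratio * advantages
    clipped = np.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # Taking the minimum keeps the pessimistic (lower) estimate either way.
    return np.minimum(unclipped, clipped).mean()
```

For example, if the new policy doubles an action's probability (ratio 2.0) on a positive-advantage sample, the objective only credits a ratio of 1.2, so there is no gradient pushing the policy further in that direction.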
8. Results
Cost – The agent failed to take action on 35 APS-failure trucks, costing $17,500, and raised 447 false alarms, costing $4,470, for a total cost of $21,970. Without the AI, all 375 APS failures would have cost $187,500, so the agent saves about $165,530 in total.
Continuous Learning – Because RL is built for continuous learning, the AI agent keeps improving as time passes and can adjust itself to handle a completely unseen environment.
* Total number of trucks in the test set is 16,000.
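The reported figures can be checked with simple arithmetic, using only the numbers on this slide:

```python
# Figures reported above: 35 missed APS failures, 447 false alarms,
# and 375 total APS failures in the test set.
missed, cost_per_missed = 35, 500
false_alarms, cost_per_alarm = 447, 10
total_failures = 375

with_agent = missed * cost_per_missed + false_alarms * cost_per_alarm
without_agent = total_failures * cost_per_missed
savings = without_agent - with_agent
print(with_agent, without_agent, savings)  # 21970 187500 165530
```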
9. Recommendation to improve RL performance
Reinforcement learning would perform better with a stream of data for each observation (truck), rather than a single dataset captured at one point in time. The agent could then learn more precisely about the chain of reactions or early symptoms of truck failure, and take action to alert the owner at a much earlier stage.