SlideShare a Scribd company logo
Generation of plausible
incident stories by using
recurrent neural networks
Toru Nakata
National Institute of Advanced Industrial Science
and Technology
1
LSTM
Incident
Story made
by me
Research Problem:
Can we predict future incidents?
Type of incident Recorded Data Prediction Tool
Mechanical/Electric
al Failures
Numerical. (Failure
rate, etc)
Statistics, Reliability
Engineering
Human Error
Text (Reports,
Narratives, etc) NLP?
Incidents by
complex causes
Various ?
2
Text data are crucial for major incident prevention, but we
cannot use math for them.
Natural Language Processsing (NLP) may solve it.
LSTM: Generation of Stories
•LSTM (“Long Short-Term Memory”) is the most
popular (and almost de facto standard of)
recurrent neural networks that can learn
patterns of sequences.
•Pro:
• High quality of output. Generation of sequence of
words with almost perfect grammar.
•Cons:
• Huge amount of teaching data, calculation time, cost
are required.
3
Imaginal Stories for Prediction
•LSTM can generate plausible incident reports
by learning the sequences of real incident
reports.
4
Real
Report
Real
Report
Real
Report
Real
Report
Real
Report
Real
Report LSTM
Unreal but
learned
sequence
pattern
Learn sequence
patterns of words
Generate
sequence of
words
The words are plausible in respect to
the teaching data statics. Is it a
candidate of future incident?
Teaching Data
LSTM learns patterns of sequential data
5
wi-s wi
… wi+1
LSTM
Lean
and
Guess
A given word sequence of a real incident report
wi-s+1
LSTM
• Guess the statistically most likely(*) word to follow given
phrase.
• *: Randomness of word selection can be adjusted manually.
• Repeating the guesses, LSTM generates a plausible (but may
not be real) report.
wi-s wi
…wi+1
LSTM
Lean
and
Guess
Learning in reverse direction
6
wi-s+1
LSTM
• LSTM can lean backward.
• From given ending of an incident report, LSTM guess the
beginning situation of the report.
A given word sequence of a real incident report
Teaching Data
• NASA’s Aviation Safety Reporting System (ASRS) Database
• Narrative Part Texts
• Report written by pilots or other agents involved in incidents.
Described from point of view of first-person.
7
Period of incidents dates Jan 2013 to Dec 2018
Number of reports 31,285
Number of characters 54,880,919
Number of words 11,008,384
Number of kinds of words (+ 2
auxiliary marks)
38,228 (+2)
The bigger data, the better quality of LSTM learning.
LSTM (19,762,176 param.)
Drop out (prob. = 10%)
Dense (4,926,690 param.)
Activation
Sequence of 30 Words
(vocabulary = 9,098)
Prediction of Next Word
256 dim. vector
256 dim. vector
19170 dim. vector
To choose a plausible word
depending on probability.
To calculate probabilities
of all words.
To learn sequence
patterns.
To erase data randomly
to avoid over-learning.
8
Days are required for learning
on my desktop PC.
Adopted some famous text generation
programs with Python, Tensor Flow,
and Keras posted at Github for free. No
need to code by yourself.
Generation Example
• <Mark of Start of Report> I had a pilot of the flight to zzz2. I was
flying the aircraft to fly the aircraft and that I was going to fly the
aircraft in the vicinity of the airport. I was able to get the aircraft to the
east and the aircraft was on the ILS to Runway 16 at the same time. I
requested a heading of 150 and told the controller that we were not
going to be on the arrival. I asked the controller if we were on the
approach and told me to maintain visual separation. I told him to
maintain visual separation. I then turned Aircraft X to 10,500 feet to
avoid the traffic. I then told the controller that I had to climb to 11,000
feet to 3,000 feet. I could not see the traffic at the time, I was not able
to see the aircraft. I was not sure if I had the radios and I had to do so.
I was trying to get the airplane out of the way down the runway. I was
not sure where the airplane was on and that I had a chance to do the
job. …
• Vicinity of airport  Ready to land with ILS (Instrumental Landing
System)  Negotiation with Controller  Visual Separation 
Hidden Traffic
• Randomness = 0.4
9
Generation with specific initial phrase
• I commanded LSTM start with phrase of ”And then I found a
strange …”
• {And then I found a strange} smell. I told the captain that the
battery was off the door and the lavatory was in the galley. I
was told that there was a passenger in the cockpit and that the
door was still closed. I asked the fa flight attendant to hear me
and that she was leaving the lavatory door and said she was not
injured. I told her that she was injured and she was not going to
declare an emergency. I also told her that we were returning to
[departure airport].
• Strange  Smell  Lavatory  Returning to departure airport
• LSTM generated as a typical word sequence after the initial
phrase.
10
Example: Reverse direction with specific
ending of Hard-Landing
• I notified dispatch of the weather in weather conditions, and
return to our destination. We notified ATC and directed us to
divert to ZZZ. We ran the before takeoff checklist. We
completed the overweight landing checklist. We completed the
QRH and after completing the landing checklist. We were asked
if we needed to return to ZZZ. We replied that we were at FL330,
and were given a descent to 4,000 feet. We were cleared for the
approach. We were cleared for the approach, and we were
cleared for a visual approach. However, we were cleared for the
approach. We briefed the approach and we were cleared for the
approach. On the final approach, {we made a hard landing. We
made a hard landing. We made a hard landing.}
• Before takeoff checklist  Overweight landing checklist 
Quick Reference Handbook (of Engine, QRH)  Hard Landing
11
Output Example failing consistency
• <Mark of Start of Report> After landing on the ground, we
were cleared to taxi to the ramp and cleared us to taxi to the
gate. I was concerned about the situation with the aircraft in the
gate and the aircraft was towed to the gate. The ramp agent
called the fire trucks and they informed us that the gear was still
down, but they did not have the gear to retract the gear. I then
called for flaps 20. He then called for flaps 30 and then
proceeded to complete the approach. The aircraft was cleared
for the ILS.
• Under Randomness = 0.5.
• After Landed  Taxi  Need to tell us gear down?
• Too much random to keep context persistent.
12
Example of too low randomness
• I was the pilot flying and I was flying pilot flying. I was the pilot
flying (PF) and I was flying the airplane and I was flying the
aircraft. I was the pilot flying and I was flying the aircraft. I was
flying a visual approach and landing. I was the pilot flying and I
had the first officer and I was flying the aircraft.
• Randomness = 0.3
• Repeating the loop of “I was the pilot flying”, one of the most
frequently appearing sequence.
• After days of leaning using huge text data, LSTM still becomes
silly, but sometimes it generates good outputs (if we train LSTM
more).
13
Conclusion
• LSTM can generate
• (perhaps) unreal incident stories
• with specific beginning or ending
• that might tell us possibilities of unheard-of incidents in the
future
• Adjusting randomness was difficult.
• Too low randomness results in loops of frequently used words.
• Too high randomness breaks persistency of context.
• Many know-hows are very important to improve the quality.
• The hope is that more data and more large LSTM model
will solve such problem of quality.
14

More Related Content

Recently uploaded

TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Precisely
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
marufrahmanstratejm
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Wask
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 

Recently uploaded (20)

TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframeDigital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
Digital Banking in the Cloud: How Citizens Bank Unlocked Their Mainframe
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Public CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptxPublic CyberSecurity Awareness Presentation 2024.pptx
Public CyberSecurity Awareness Presentation 2024.pptx
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Digital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying AheadDigital Marketing Trends in 2024 | Guide for Staying Ahead
Digital Marketing Trends in 2024 | Guide for Staying Ahead
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 

Featured

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
Marius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
Expeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
Pixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
marketingartwork
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
Skeleton Technologies
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
SpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
Rajiv Jayarajah, MAppComm, ACC
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Christy Abraham Joy
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
MindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
RachelPearson36
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Generation of plausible incident stories by using recurrent neural networks

  • 1. Generation of plausible incident stories by using recurrent neural networks Toru Nakata National Institute of Advanced Industrial Science and Technology 1 LSTM Incident Story made by me
  • 2. Research Problem: Can we predict future incidents? Type of incident Recorded Data Prediction Tool Mechanical/Electric al Failures Numerical. (Failure rate, etc) Statistics, Reliability Engineering Human Error Text (Reports, Narratives, etc) NLP? Incidents by complex causes Various ? 2 Text data are crucial for major incident prevention, but we cannot use math for them. Natural Language Processsing (NLP) may solve it.
  • 3. LSTM: Generation of Stories •LSTM (“Long Short-Term Memory”) is the most popular (and almost de facto standard of) recurrent neural networks that can learn patterns of sequences. •Pro: • High quality of output. Generation of sequence of words with almost perfect grammar. •Cons: • Huge amount of teaching data, calculation time, cost are required. 3
  • 4. Imaginal Stories for Prediction •LSTM can generate plausible incident reports by learning the sequences of real incident reports. 4 Real Report Real Report Real Report Real Report Real Report Real Report LSTM Unreal but learned sequence pattern Learn sequence patterns of words Generate sequence of words The words are plausible in respect to the teaching data statics. Is it a candidate of future incident? Teaching Data
  • 5. LSTM learns patterns of sequential data 5 wi-s wi … wi+1 LSTM Lean and Guess A given word sequence of a real incident report wi-s+1 LSTM • Guess the statistically most likely(*) word to follow given phrase. • *: Randomness of word selection can be adjusted manually. • Repeating the guesses, LSTM generates a plausible (but may not be real) report.
  • 6. wi-s wi …wi+1 LSTM Lean and Guess Learning in reverse direction 6 wi-s+1 LSTM • LSTM can lean backward. • From given ending of an incident report, LSTM guess the beginning situation of the report. A given word sequence of a real incident report
  • 7. Teaching Data • NASA’s Aviation Safety Reporting System (ASRS) Database • Narrative Part Texts • Report written by pilots or other agents involved in incidents. Described from point of view of first-person. 7 Period of incidents dates Jan 2013 to Dec 2018 Number of reports 31,285 Number of characters 54,880,919 Number of words 11,008,384 Number of kinds of words (+ 2 auxiliary marks) 38,228 (+2) The bigger data, the better quality of LSTM learning.
  • 8. LSTM (19,762,176 param.) Drop out (prob. = 10%) Dense (4,926,690 param.) Activation Sequence of 30 Words (vocabulary = 9,098) Prediction of Next Word 256 dim. vector 256 dim. vector 19170 dim. vector To choose a plausible word depending on probability. To calculate probabilities of all words. To learn sequence patterns. To erase data randomly to avoid over-learning. 8 Days are required for learning on my desktop PC. Adopted some famous text generation programs with Python, Tensor Flow, and Keras posted at Github for free. No need to code by yourself.
  • 9. Generation Example • <Mark of Start of Report> I had a pilot of the flight to zzz2. I was flying the aircraft to fly the aircraft and that I was going to fly the aircraft in the vicinity of the airport. I was able to get the aircraft to the east and the aircraft was on the ILS to Runway 16 at the same time. I requested a heading of 150 and told the controller that we were not going to be on the arrival. I asked the controller if we were on the approach and told me to maintain visual separation. I told him to maintain visual separation. I then turned Aircraft X to 10,500 feet to avoid the traffic. I then told the controller that I had to climb to 11,000 feet to 3,000 feet. I could not see the traffic at the time, I was not able to see the aircraft. I was not sure if I had the radios and I had to do so. I was trying to get the airplane out of the way down the runway. I was not sure where the airplane was on and that I had a chance to do the job. … • Vicinity of airport  Ready to land with ILS (Instrumental Landing System)  Negotiation with Controller  Visual Separation  Hidden Traffic • Randomness = 0.4 9
  • 10. Generation with specific initial phrase • I commanded LSTM start with phrase of ”And then I found a strange …” • {And then I found a strange} smell. I told the captain that the battery was off the door and the lavatory was in the galley. I was told that there was a passenger in the cockpit and that the door was still closed. I asked the fa flight attendant to hear me and that she was leaving the lavatory door and said she was not injured. I told her that she was injured and she was not going to declare an emergency. I also told her that we were returning to [departure airport]. • Strange  Smell  Lavatory  Returning to departure airport • LSTM generated as a typical word sequence after the initial phrase. 10
  • 11. Example: Reverse direction with specific ending of Hard-Landing • I notified dispatch of the weather in weather conditions, and return to our destination. We notified ATC and directed us to divert to ZZZ. We ran the before takeoff checklist. We completed the overweight landing checklist. We completed the QRH and after completing the landing checklist. We were asked if we needed to return to ZZZ. We replied that we were at FL330, and were given a descent to 4,000 feet. We were cleared for the approach. We were cleared for the approach, and we were cleared for a visual approach. However, we were cleared for the approach. We briefed the approach and we were cleared for the approach. On the final approach, {we made a hard landing. We made a hard landing. We made a hard landing.} • Before takeoff checklist  Overweight landing checklist  Quick Reference Handbook (of Engine, QRH)  Hard Landing 11
  • 12. Output Example failing consistency • <Mark of Start of Report> After landing on the ground, we were cleared to taxi to the ramp and cleared us to taxi to the gate. I was concerned about the situation with the aircraft in the gate and the aircraft was towed to the gate. The ramp agent called the fire trucks and they informed us that the gear was still down, but they did not have the gear to retract the gear. I then called for flaps 20. He then called for flaps 30 and then proceeded to complete the approach. The aircraft was cleared for the ILS. • Under Randomness = 0.5. • After Landed  Taxi  Need to tell us gear down? • Too much random to keep context persistent. 12
  • 13. Example of too low randomness • I was the pilot flying and I was flying pilot flying. I was the pilot flying (PF) and I was flying the airplane and I was flying the aircraft. I was the pilot flying and I was flying the aircraft. I was flying a visual approach and landing. I was the pilot flying and I had the first officer and I was flying the aircraft. • Randomness = 0.3 • Repeating the loop of “I was the pilot flying”, one of the most frequently appearing sequence. • After days of leaning using huge text data, LSTM still becomes silly, but sometimes it generates good outputs (if we train LSTM more). 13
  • 14. Conclusion • LSTM can generate • (perhaps) unreal incident stories • with specific beginning or ending • that might tell us possibilities of unheard-of incidents in the future • Adjusting randomness was difficult. • Too low randomness results in loops of frequently used words. • Too high randomness breaks persistency of context. • Many know-hows are very important to improve the quality. • The hope is that more data and more large LSTM model will solve such problem of quality. 14