SlideShare a Scribd company logo
Thirty Years of
Software Problems
in the News
Bryan Dosono, Syracuse University
Andrew J. Ko, University of Washington
Neeraja Duriseti, AT&T
Related work
• Software transforms society as it reshapes
globalization, social relationships, and other aspects of
everyday work
• Software isn’t perfect; it can fail spectacularly
• Most water, gas, and electricity failures now involve
software failures (Rahman, Beznosov & Martí, 2009)
• Security breaches are now more prevalent and more
costly than ever before (Garrison & Ncube, 2011)
• Cars recalled more frequently than ever because of
software defects (Mark, Bagdouri, Palen, Martin, Al-Ani &
Anderson, 2012)
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Research question
•How have the consequences of
software problems changed over
the past 30 years?
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Data collection
• Gathered text to build a corpus for analysis
• Three major news archives: ProQuest, LexisNexus, Factiva
• Databases included 93 out of 100 most circulated
newspapers
• Only considered English news coverage
• Searched for words commonly referring to software
problems
• i.e., bug, defect, error, fault, flaw, malfunction, glitch, mistake
• “Glitch” consistently referred to software problems (60%)
• 386,381 articles published from 1980-2012 with word “glitch”
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Termsconsideredforretrievingarticles
Term
% About
Software
# Articles
With Term
# About
Software
Dominant Topics
glitch 60% 49,537 ~30,000 software problems
bug 11% 182,150 ~20,000 insects, illness
malfunction 8% 24,154 ~2,000 mechanical problems
defect 3% 92,300 ~3,000 taxes, birth defects
flaw 3% 106,240 ~3,000 construction, decisions
error 2% 629,832 ~12,000 sports, news, medicine
mistake 1% 586,720 ~6,000 decision making
fault 1% 265,383 ~3,000 politics
INTRODUCTION
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
METHOD
Clustering articles to stories
• Developed new clustering algorithm
• Converted each article’s words into a vector of tf-idf scores
• Iterated through all articles in order of publication date
• Compared each article to all subsequent articles published
within a 7-day window
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Story measurements
Quantitative variables
• Normalized article
count
• Frequency of news
source location by
geographic region
Qualitative variables
• Categorized articles
with specific codes
among the three
authors (Table 2)
• Three rounds of
redundant coding until
reaching satisfactory
intercoder reliability
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Codesusedtoclassifyeachstory
Code Description
software system failure caused by a software defect
death one or more people lost their lives
harm one or more people were physically harmed
basic one or more people lost access to food or shelter
property one or more people permanently lost material goods, money, or data
delay one or more people had to postpone an activity due
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Top10storiesbynormalizedarticlecount
Software Problem Reported Year # Articles
The aftermath of the year 2000 millennium bug 2000 17,193
E-voting defects in the 2004 US elections 2004 2,509
Preparations for the year 2000 millennium bug 1998 1,745
Medicare drug eligibility defect 2006 1,983
Toyota Prius sudden acceleration bug 2010 3,782
Delayed e-voting tallies in the 2008 US elections 2008 1,796
Incorrect voting tallies from e-voting machines 2006 1,236
Incorrect New Mexico voting tally 2000 1,003
Atlanta Olympics results delivery to news delayed 1996 447
E-voting defects in the 2002 US mid-term elections 2002 636
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Storiesreportedpermonthbyregion
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Reportsofdeath
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Reportsofphysicalharm
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Reportsoflostaccesstofoodandshelter
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Reportsofpropertyloss
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Reportsofdelay
INTRODUCTION
METHOD
RESULTS
DISCUSSION
CONCLUSION
QUESTIONS
Discussion
• Several possible causes could underlie the
observed news reporting trends
• Software problems are becoming harder to observe
• Most of the stories in our data are based on press
releases or investigations of public organizations
• Software failures could be more common within
private organizations but are simply not reported
INTRODUCTION
METHOD
DISCUSSION
RESULTS
CONCLUSION
QUESTIONS
Discussion
• New directions for software engineering practice
• Software should be used in supplement to legacy
process systems rather than as a cure-all for
problems
• Software design practices should consider the
potential for failure in all parts of a system’s
functionality
• Adapt frameworks on universal human needs into a
software inspection checklist
INTRODUCTION
METHOD
DISCUSSION
RESULTS
CONCLUSION
QUESTIONS
Discussion
• Current software lacks
• A middle ground between working and not working
• Tools to quickly isolate and identify causes for failure
• Highly informative error messages to speed the
recovery process
• Software should be designed with the recovery
process in mind
INTRODUCTION
METHOD
DISCUSSION
RESULTS
CONCLUSION
QUESTIONS
Conclusion
• Consequences of software problems on society can
range from trivial to catastrophic
• The majority of reported software problems are
brief inconveniences
• Software engineering practices still need
improvement as we design software for the future
INTRODUCTION
METHOD
CONCLUSION
DISCUSSION
RESULTS
QUESTIONS
Any questions?
• Acknowledgements
• Thank you to Parmit Chilana for her comments on early
drafts of this work
• This material is based in part upon work supported by the
National Science Foundation under Grant Numbers CCF-
0952733 and CCF-1153625
• Any opinions, findings, and conclusions or
recommendations expressed in this material are those of
the author(s) and do not necessarily reflect the views of
the National Science Foundation
INTRODUCTION
METHOD
QUESTIONS
DISCUSSION
RESULTS
CONCLUSION

More Related Content

Similar to Thirty Years of Software Problems in the News

Systems Engineering and Requirements Management in Medical Device Product Dev...
Systems Engineering and Requirements Management in Medical Device Product Dev...Systems Engineering and Requirements Management in Medical Device Product Dev...
Systems Engineering and Requirements Management in Medical Device Product Dev...
UBMCanon
 
Direct Project HIT Standards 10.27
Direct Project HIT Standards 10.27Direct Project HIT Standards 10.27
Direct Project HIT Standards 10.27
Brian Ahier
 
Advance Software Engineering notes for ME students
Advance Software Engineering notes for ME studentsAdvance Software Engineering notes for ME students
Advance Software Engineering notes for ME students
poornank05
 
IEEE 1633 Recommended Practices for Reliable Software
IEEE 1633 Recommended Practices for Reliable SoftwareIEEE 1633 Recommended Practices for Reliable Software
IEEE 1633 Recommended Practices for Reliable Software
Ann Marie Neufelder
 
SD West 2008: Call the requirements police, you've entered design!
SD West 2008: Call the requirements police, you've entered design!SD West 2008: Call the requirements police, you've entered design!
SD West 2008: Call the requirements police, you've entered design!
Alan Bustamante
 
Understanding and Improving Software Productivity
Understanding and Improving Software ProductivityUnderstanding and Improving Software Productivity
Understanding and Improving Software Productivity
ssuser2be5eb
 
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP
 
State of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - KeynoteState of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - Keynote
Neo4j
 
Dr Abel Sanchez at Bristlecone Pulse 2017 MIT
Dr Abel Sanchez at Bristlecone Pulse 2017 MITDr Abel Sanchez at Bristlecone Pulse 2017 MIT
Dr Abel Sanchez at Bristlecone Pulse 2017 MIT
Bristlecone SCC
 
lecture 1-5.pdf
lecture 1-5.pdflecture 1-5.pdf
lecture 1-5.pdf
AkankshaJha53
 
software engineering documents.pptx
software engineering documents.pptxsoftware engineering documents.pptx
software engineering documents.pptx
Makinde Akindeji
 
Unsustainable Regaining Control of Uncontrollable Apps
Unsustainable Regaining Control of Uncontrollable AppsUnsustainable Regaining Control of Uncontrollable Apps
Unsustainable Regaining Control of Uncontrollable Apps
CAST
 
Angelbeat -Cut Your Troubleshooting_Time-In-Half
Angelbeat -Cut Your Troubleshooting_Time-In-HalfAngelbeat -Cut Your Troubleshooting_Time-In-Half
Angelbeat -Cut Your Troubleshooting_Time-In-Half
Rick Kingsley
 
Chap01
Chap01Chap01
Knowledge Matters Issue 15 - Technology at Concern
Knowledge Matters Issue 15 - Technology at ConcernKnowledge Matters Issue 15 - Technology at Concern
Knowledge Matters Issue 15 - Technology at Concern
Ellen Ward
 
Redgrave LLP New Review Strategies - eDiscovery
Redgrave LLP New Review Strategies - eDiscoveryRedgrave LLP New Review Strategies - eDiscovery
Redgrave LLP New Review Strategies - eDiscovery
Redgrave LLP
 
Redgrave LLP New Review Strategies - eDiscovery
Redgrave LLP New Review Strategies - eDiscoveryRedgrave LLP New Review Strategies - eDiscovery
Redgrave LLP New Review Strategies - eDiscovery
Redgrave LLP
 
2016 velocity santa clara state of dev ops report deck final
2016 velocity santa clara state of dev ops report deck final2016 velocity santa clara state of dev ops report deck final
2016 velocity santa clara state of dev ops report deck final
Nicole Forsgren
 
Java learn from basic part chapter_01 short notes to understand the java quic...
Java learn from basic part chapter_01 short notes to understand the java quic...Java learn from basic part chapter_01 short notes to understand the java quic...
Java learn from basic part chapter_01 short notes to understand the java quic...
GaytriMate
 
Topic 4 - impact of Information Technology.pptx.pdf
Topic 4 - impact of Information Technology.pptx.pdfTopic 4 - impact of Information Technology.pptx.pdf
Topic 4 - impact of Information Technology.pptx.pdf
lundamiko
 

Similar to Thirty Years of Software Problems in the News (20)

Systems Engineering and Requirements Management in Medical Device Product Dev...
Systems Engineering and Requirements Management in Medical Device Product Dev...Systems Engineering and Requirements Management in Medical Device Product Dev...
Systems Engineering and Requirements Management in Medical Device Product Dev...
 
Direct Project HIT Standards 10.27
Direct Project HIT Standards 10.27Direct Project HIT Standards 10.27
Direct Project HIT Standards 10.27
 
Advance Software Engineering notes for ME students
Advance Software Engineering notes for ME studentsAdvance Software Engineering notes for ME students
Advance Software Engineering notes for ME students
 
IEEE 1633 Recommended Practices for Reliable Software
IEEE 1633 Recommended Practices for Reliable SoftwareIEEE 1633 Recommended Practices for Reliable Software
IEEE 1633 Recommended Practices for Reliable Software
 
SD West 2008: Call the requirements police, you've entered design!
SD West 2008: Call the requirements police, you've entered design!SD West 2008: Call the requirements police, you've entered design!
SD West 2008: Call the requirements police, you've entered design!
 
Understanding and Improving Software Productivity
Understanding and Improving Software ProductivityUnderstanding and Improving Software Productivity
Understanding and Improving Software Productivity
 
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
Decision CAMP 2014 - Benjamin Grosof Janine Bloomfield - Explanation-based E-...
 
State of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - KeynoteState of Florida Neo4J Graph Briefing - Keynote
State of Florida Neo4J Graph Briefing - Keynote
 
Dr Abel Sanchez at Bristlecone Pulse 2017 MIT
Dr Abel Sanchez at Bristlecone Pulse 2017 MITDr Abel Sanchez at Bristlecone Pulse 2017 MIT
Dr Abel Sanchez at Bristlecone Pulse 2017 MIT
 
lecture 1-5.pdf
lecture 1-5.pdflecture 1-5.pdf
lecture 1-5.pdf
 
software engineering documents.pptx
software engineering documents.pptxsoftware engineering documents.pptx
software engineering documents.pptx
 
Unsustainable Regaining Control of Uncontrollable Apps
Unsustainable Regaining Control of Uncontrollable AppsUnsustainable Regaining Control of Uncontrollable Apps
Unsustainable Regaining Control of Uncontrollable Apps
 
Angelbeat -Cut Your Troubleshooting_Time-In-Half
Angelbeat -Cut Your Troubleshooting_Time-In-HalfAngelbeat -Cut Your Troubleshooting_Time-In-Half
Angelbeat -Cut Your Troubleshooting_Time-In-Half
 
Chap01
Chap01Chap01
Chap01
 
Knowledge Matters Issue 15 - Technology at Concern
Knowledge Matters Issue 15 - Technology at ConcernKnowledge Matters Issue 15 - Technology at Concern
Knowledge Matters Issue 15 - Technology at Concern
 
Redgrave LLP New Review Strategies - eDiscovery
Redgrave LLP New Review Strategies - eDiscoveryRedgrave LLP New Review Strategies - eDiscovery
Redgrave LLP New Review Strategies - eDiscovery
 
Redgrave LLP New Review Strategies - eDiscovery
Redgrave LLP New Review Strategies - eDiscoveryRedgrave LLP New Review Strategies - eDiscovery
Redgrave LLP New Review Strategies - eDiscovery
 
2016 velocity santa clara state of dev ops report deck final
2016 velocity santa clara state of dev ops report deck final2016 velocity santa clara state of dev ops report deck final
2016 velocity santa clara state of dev ops report deck final
 
Java learn from basic part chapter_01 short notes to understand the java quic...
Java learn from basic part chapter_01 short notes to understand the java quic...Java learn from basic part chapter_01 short notes to understand the java quic...
Java learn from basic part chapter_01 short notes to understand the java quic...
 
Topic 4 - impact of Information Technology.pptx.pdf
Topic 4 - impact of Information Technology.pptx.pdfTopic 4 - impact of Information Technology.pptx.pdf
Topic 4 - impact of Information Technology.pptx.pdf
 

Recently uploaded

一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
u86oixdj
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 

Recently uploaded (20)

一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
原版制作(swinburne毕业证书)斯威本科技大学毕业证毕业完成信一模一样
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 

Thirty Years of Software Problems in the News

  • 1. Thirty Years of Software Problems in the News Bryan Dosono, Syracuse University Andrew J. Ko, University of Washington Neeraja Duriseti, AT&T
  • 2. Related work • Software transforms society as it reshapes globalization, social relationships, and other aspects of everyday work • Software isn’t perfect; it can fail spectacularly • Most water, gas, and electricity failures now involve software failures (Rahman, Beznosov & Martí, 2009) • Security breaches are now more prevalent and more costly than ever before (Garrison & Ncube, 2011) • Cars recalled more frequently than ever because of software defects (Mark, Bagdouri, Palen, Martin, Al-Ani & Anderson, 2012) INTRODUCTION METHOD RESULTS DISCUSSION CONCLUSION QUESTIONS
  • 3. Research question •How have the consequences of software problems changed over the past 30 years? INTRODUCTION METHOD RESULTS DISCUSSION CONCLUSION QUESTIONS
  • 4. Data collection • Gathered text to build a corpus for analysis • Three major news archives: ProQuest, LexisNexus, Factiva • Databases included 93 out of 100 most circulated newspapers • Only considered English news coverage • Searched for words commonly referring to software problems • i.e., bug, defect, error, fault, flaw, malfunction, glitch, mistake • “Glitch” consistently referred to software problems (60%) • 386,381 articles published from 1980-2012 with word “glitch” INTRODUCTION METHOD RESULTS DISCUSSION CONCLUSION QUESTIONS
  • 5. Termsconsideredforretrievingarticles Term % About Software # Articles With Term # About Software Dominant Topics glitch 60% 49,537 ~30,000 software problems bug 11% 182,150 ~20,000 insects, illness malfunction 8% 24,154 ~2,000 mechanical problems defect 3% 92,300 ~3,000 taxes, birth defects flaw 3% 106,240 ~3,000 construction, decisions error 2% 629,832 ~12,000 sports, news, medicine mistake 1% 586,720 ~6,000 decision making fault 1% 265,383 ~3,000 politics INTRODUCTION RESULTS DISCUSSION CONCLUSION QUESTIONS METHOD
  • 6. Clustering articles to stories • Developed new clustering algorithm • Converted each article’s words into a vector of tf-idf scores • Iterated through all articles in order of publication date • Compared each article to all subsequent articles published within a 7-day window INTRODUCTION METHOD RESULTS DISCUSSION CONCLUSION QUESTIONS
  • 7. Story measurements Quantitative variables • Normalized article count • Frequency of news source location by geographic region Qualitative variables • Categorized articles with specific codes among the three authors (Table 2) • Three rounds of redundant coding until reaching satisfactory intercoder reliability INTRODUCTION METHOD RESULTS DISCUSSION CONCLUSION QUESTIONS
  • 8. Codesusedtoclassifyeachstory Code Description software system failure caused by a software defect death one or more people lost their lives harm one or more people were physically harmed basic one or more people lost access to food or shelter property one or more people permanently lost material goods, money, or data delay one or more people had to postpone an activity due INTRODUCTION METHOD RESULTS DISCUSSION CONCLUSION QUESTIONS
  • 9. Top10storiesbynormalizedarticlecount Software Problem Reported Year # Articles The aftermath of the year 2000 millennium bug 2000 17,193 E-voting defects in the 2004 US elections 2004 2,509 Preparations for the year 2000 millennium bug 1998 1,745 Medicare drug eligibility defect 2006 1,983 Toyota Prius sudden acceleration bug 2010 3,782 Delayed e-voting tallies in the 2008 US elections 2008 1,796 Incorrect voting tallies from e-voting machines 2006 1,236 Incorrect New Mexico voting tally 2000 1,003 Atlanta Olympics results delivery to news delayed 1996 447 E-voting defects in the 2002 US mid-term elections 2002 636 INTRODUCTION METHOD RESULTS DISCUSSION CONCLUSION QUESTIONS
  • 16. Discussion • Several possible causes could underlie the observed news reporting trends • Software problems are becoming harder to observe • Most of the stories in our data are based on press releases or investigations of public organizations • Software failures could be more common within private organizations but are simply not reported INTRODUCTION METHOD DISCUSSION RESULTS CONCLUSION QUESTIONS
  • 17. Discussion • New directions for software engineering practice • Software should be used in supplement to legacy process systems rather than as a cure-all for problems • Software design practices should consider the potential for failure in all parts of a system’s functionality • Adapt frameworks on universal human needs into a software inspection checklist INTRODUCTION METHOD DISCUSSION RESULTS CONCLUSION QUESTIONS
  • 18. Discussion • Current software lacks • A middle ground between working and not working • Tools to quickly isolate and identify causes for failure • Highly informative error messages to speed the recovery process • Software should be designed with the recovery process in mind INTRODUCTION METHOD DISCUSSION RESULTS CONCLUSION QUESTIONS
  • 19. Conclusion • Consequences of software problems on society can range from trivial to catastrophic • The majority of reported software problems are brief inconveniences • Software engineering practices still need improvement as we design software for the future INTRODUCTION METHOD CONCLUSION DISCUSSION RESULTS QUESTIONS
  • 20. Any questions? • Acknowledgements • Thank you to Parmit Chilana for her comments on early drafts of this work • This material is based in part upon work supported by the National Science Foundation under Grant Numbers CCF- 0952733 and CCF-1153625 • Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation INTRODUCTION METHOD QUESTIONS DISCUSSION RESULTS CONCLUSION

Editor's Notes

  1. Hello, everyone. My name is Bryan Dosono and I am a PhD student in Information Science and Technology at Syracuse University. This is my first time at CHASE and I am incredibly thrilled to be here! On behalf of Andrew Ko (a professor from the University of Washington Information School) and Neeraja Duriseti (of AT&T), I will be presenting the research we conducted a couple of summers ago when I was still an undergraduate at the University of Washington that investigated the past thirty years of software problems in the news.
  2. The ubiquity of software has transformed society in innumerable ways, reshaping globalization, social relationships, work, and countless other aspects of society. With these changes, however, have also come a dramatic shift in how society works: behind much of our modern infrastructure is now the automated digital logic of algorithms, often supplementing or even replacing the slow subjectivity of human decision making. Such automation naturally has tradeoffs: it can greatly increase productivity and lower costs, but it can also fail spectacularly. Most water, gas, and electricity failures now involve software failures, security breaches are now more prevalent and more costly than ever before, and cars are recalled more frequently than ever because of software defects.
  3. How have the consequences of software problems changed over the past 30 years? In this paper, we begin to understand answer this question at the macro scale, investigating the trends in software problems and their consequences by mining archives of failures. Perhaps the most extensive archive of software problems and their consequences is news archives. News is probably the largest record of the acute effects of software failure on society. Through both quantitative and qualitative methods, we investigate the problems reported, the consequences they have had on people and society, and how the consequences reported in the news have changed over the last 30 years.
  4. We retrieved news from ProQuest, LexisNexis, and Factiva, the three major news archives. These databases included full text articles for 93 of the top 100 most circulated newspapers in the United States. This data set may represent one of the most comprehensive samples of stories about software problems in history. To retrieve articles about software problems, we began by searching for words commonly referring to software problems, specifically searching for bug, defect, error, fault, flaw, malfunction, glitch and mistake.
  5. We selected a random sample of search results for each of these terms to assess their viability, randomly selecting one article per quarter, per database, per term, from 1980 to 2012, for a total of 260 articles per term (2,080 articles overall). We then analyzed each article for whether it referred to a software problem or some other type of problem. As seen in this table, the only phrase that consistently referred to software problems was glitch. Only 11% of articles with the word bug and 9% of articles with the word malfunction referred to software: across the sources we sampled, these terms typically referred to insects, illness and mechanical failures. Thus, we decided to focus only on articles with the word glitch.
  6. We developed an algorithm that converted each article’s words into a vector of tf-idf scores and then iterated through all articles in order of publication date, comparing each article to all subsequent articles published within a 7-day window. If the cosine similarity of any two article’s tf-idf term vector was .25 or greater (representing a modest similarity), then an edge was created between the two articles, creating a graph of related articles. When a similar article was found, the comparison window was then set to 7 days ahead of the date of the new relevant article. This allowed articles to be linked across weeks, months, and years, as long as at least one similar article was published within a week. After comparisons for each article were complete for a day, all disjoint graph components of the article graph whose latest dated article was more than 7 days before the current date were removed from the graph and treated as a story cluster. We chose 7 days because of the weekly news cycle. The application of our algorithm grouped the 386,381 articles into 58,577 story clusters. In the rest of this paper, an article will refer to a single news report from a single news organization, whereas a story will refer to the set of articles identified by our clustering algorithm as reporting on the same software problem. To assess the accuracy of clustering, the authors independently read all articles in a random sample of 550 story clusters spanning 6,935 articles, judging each article as either relevant to the cluster or off-topic.
  7. We measured the attention a story was given by the press by its computing normalized article count, which was the number of articles in a story cluster, excluding duplicates published in different sources, divided by the number of news sources that were available in the news databases in the year the story was published. This measured how many news organizations committed effort to reporting on the story, while controlling for the confounding factor of full text availability over time. To measure the geographical location from which a story was reported, we assigned a world region, country, and state (for US stories) to each news source that appeared more than 5 times in our data set. All other aspects of stories were qualitative measurements, which we obtained by coding a sample of story clusters and their articles using content analysis methods. Our sample included the subset of the 58,577 story clusters with 6 or more articles and a random sample of stories with fewer articles, to focus our analysis on the stories that received the most attention from the press. Reading these articles spanned about 1,000 person-hours across 2 months.
  8. This table shows the variables that we coded. To assess the reliability of each variable in this table, the authors iteratively and redundantly coded random samples of 100 stories, each time computing Krippendorff’s alpha for each variable and discussing disagreements. After three rounds of redundant coding (covering 5% of all story clusters), we reached satisfactory agreement on all variables. The authors then split this sample into three sets and coded each story cluster independently, reading each article in a cluster. For each cluster with more than 100 articles, we read a random sample of at least 10% of stories to identify the dominant story topic.
  9. While the notable stories varied greatly in their consequences, the top 100 stories receiving the most press (as measured by normalized article count of each story) were quite homogenous. The biggest story was by far the Y2K millennium defects. Another 13 stories covered fears and consequences of problems with e-voting software from 1998 to 2008: tallies were delayed and incorrect, machines froze, digital voter registration records were missing names, and votes were lost.
  10. This figure shows the number of stories reported per month, separated by region. As the plot shows, there were few problems reported in the 1980s, with a rapid rise in the number of stories approaching the year 2000. After the year 2000, however, the frequency of reporting appears to have slowly declined overall. In interpreting these trends, it is important to remember that declines in stories in North America may represent declines in US journalism budgets, as opposed to declines in software problem frequency.
  11. Death was rare. Among our coded sample, 47 stories reported one or more deaths related to software. Extrapolating this to the ~35,000 stories in our data, as shown in the top figure above, about 400 stories over 32 years likely reported deaths (about 1 story per month worldwide). A third of reported deaths were due to collisions in planes and cars, with most plane crashes accounting for most death counts above 50. On land, transit lines and railroads that used the same signaling system have repeatedly failed to detect trains traveling the wrong way. Toyota recalled its 2010 Hybrid Prius after discovering a design flaw that required a software change to fix the anti lock brake-system problem; the LA times reported in March 2010 that 102 people had died from this and other problems with sudden acceleration. In the air, plane crashes stemmed from pilots receiving faulty readings on cockpit screens, engines and thrusters being reversed mid-flight, and air traffic controllers guiding dangerous landings.
  12. Injury was also rare. In our sample, 39 stories reported physical harm. Extrapolating this rate to all 35,000 stories, the number of injured since 1980 is likely about 12,500 (or about 30 people per month). Half of the stories reported injuries sustained on boats and ships because of bumps and tilts. In 1997, autopilot problems led a double-decker tourist sightseeing boat to ram into a bridge on the Seine River in Paris, injuring 28 people. Other stories reported injuries in entertainment, health, and manufacturing. On Broadway, a Spiderman stunt actor was injured after plunging 30 feet due to a computer-controlled harness failure. In 2008, a big television screen at the Piccadilly Gardens fan zone failed to show a highly anticipated football game as planned, resulting in thousands of fans violently rioting against the police, leading to injuries in both parties.
  13. Threats to food and shelter were even less common than death and physical harm, appearing in only 28 coded stories. Extrapolating this to all 35,000 stories, the number of people reported to have lost access to food and/or shelter since 1980 is likely to be about 80 million (30 per month). A third of stories involved problems that resulted in the delay of government aid to vulnerable populations. Some involved welfare recipients that qualified for food stamps and students waiting for financial aid that was incorrectly denied or backlogged. Five stories reported on faulty natural disaster detection systems, including overlooked tornado warnings, missed wildfire notification dispatches, and 911 emergency calls that did not reach an operator. Threats to basic needs were related to region, with basic needs losses overrepresented in the US.
  14. The loss of money, data, and material goods was much more common, reported in 498 of coded stories, and therefore about 4,400 stories over the 32 years. About 20% of stories reported data loss, often e-mails, court documents, or data that NASA probes failed to transmit. Another 20% were stories about companies losing revenue because customers had problems with the company’s software. The remaining bulk of property losses included incorrect charges that were never reimbursed, regulatory fines due to software defects, stock market loss, and security breaches. While it was difficulty to quantify the loss of data and physical goods, we were able to quantify loss money, as many articles explicitly reported it. Of a random sample of 100 stories, 28 reported an amount lost due to the software problem. We converted all of these amounts to USD using the average exchange rate for the month the story was published. As seen in the top figure, the resulting range was from $625 to $7 billion, with a median of $7.7 million lost.
  15. Delay was the most common consequence, mentioned in 40.4% (14,160 stories) of articles. The most common were outages in government services, forcing people to wait days. Software delays were common and typically reported on widely publicized projects such as new government websites or utility services. Other common types of delays occurred due to outages in stock market trading, banks and ATMs, flights, voting and elections, NASA launches, phone service, public transportation, and web sites. The shortest delays concerned emergency responses to fires, injuries, and other time-critical services, where a few minutes of delay was a life and death matter.
  16. Several possible causes could underlie the observed news reporting trends. One instance may be the increased experience with software and how it fails amongst common public expectations over the last three decades, making the failures less surprising and less newsworthy. A related cause could be that most software problems are becoming harder to observe. Most of the stories in our data are based on press releases or investigations of public organizations. Software failures could be more common within private organizations but are simply not reported. It could also be that people pay more attention to the public sector due to a distrust that public organizations can provide reliable service. This interpretation aligns with our finding that developing regions of the world where governments are believed to be less reliable report a greater number of software failures in the public sector.
  17. Regardless of the underlying causes, our research suggests new directions for software engineering practice. People are highly reliant on software as the only solution. Software should be used in supplement to legacy process systems rather than as a cure-all for problems. For example, many airports had many alternative processes such as manual check-ins, transporting baggage by hand, etc. to keep operating during software failures. Software design practices should consider the potential for failure in all parts of a system’s functionality. The design of even a near perfect software system can lead to breakdowns in human-computer communication, making it hard to detect and fix errors before they escalate. One solution would adapt frameworks on universal human needs into a software inspection checklist. This checklist would focus on the data computed by the software and the decisions it makes and help review the potential for delay, loss, or other human risks.
  18. It is impossible to predict all circumstances in which software will run. Delay and property loss (most common consequences of software failures according to our results) were often caused due to the time it takes support teams to detect, diagnose, and correct an error, rather than the software failure itself. Software should be designed with the recovery process in mind. Current software lacks a middle ground between working and not working, tools to quickly isolate and identify causes for failure, or highly informative error messages to speed the recovery process. Future software should be designed taking these factors into consideration.
  19. We sought to understand the consequences of software problems on society and how they changing over time. Our results indicate a dramatic fluctuation in the seriousness of these consequences, but the majority of the consequences were brief inconveniences, such as delay or property loss. While the most severe consequences are becoming increasingly common, they are still relatively rare compared to other risks to humanity. Our results call for a clear definition of how software is correct in software engineering research and beckons practices to educate end users that software is, in fact, fallible.
  20. Thank you all for being a gracious audience. I’d now like to open the floor for any questions you may have. We would like to thank Parmit Chilana for her comments on early drafts of this work. This material is based in part upon work supported by the National Science Foundation under Grant Numbers CCF-0952733 and CCF-1153625. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.