Alan Tatourian
Intel Automotive
Highly-Dependable Automotive Software
Auto-ISAC 2018
Progress in Technology has been Astonishing
Every generation of technology has enabled remarkable outcomes
Apollo 11
2048 words RAM (16-bit word) ~4KB
36,864 words ROM
Average Smartphone
256MB – 512MB Cache
2GB – 64GB RAM
Next 10 to 20 years
???
45 years
62M x RAM
Cognitive Systems
???
???
• Design Goals
• Security Goals
• Advanced Design
• Summary
Agenda
I always talk about this to folks at Microsoft, especially to developers. What’s the most important operating system you’ll write
applications for? Ain’t Windows, or the Macintosh, or Linux. It’s Homo Sapiens Version 1.0. It shipped about a hundred thousand
years ago. There’s no upgrade in sight. But it’s the one that runs everything.
– Bill Buxton from Microsoft Research
Economic Utility
There is a concept in economics called
economic utility: it says that a feature's value
tends toward zero over time. As soon as you put
a feature (product) on a shelf, it starts to
depreciate.
The goal of any well-defined process
including SDL is ‘continuous improvement’.
Architecture Goals
1. The most obvious approach might be to imagine the future you want and build it.
Unfortunately, that doesn’t work that well because technology co-evolves with people.
It’s a two-step: technology pushes people to move forward, and then people move past
technology and it has to catch up. The way we see the future is constantly evolving, and
the path you take to get there matters. In technical terms we can call this ‘continuous
improvement.’
2. Establish modular and composable design making it possible to (1) use your system in
different (standardized) configurations and applications and (2) evolve it as the
requirements and technologies change.
3. Control (or manage) and reduce complexity!
Civilization advances by extending the number of important operations we can perform without thinking about them.
– Alfred North Whitehead
Complexity, Safety, Security . . .
• 3.22 trillion miles driven (US, 2016)
• 40,200 fatalities (US, 2016) – roughly 100 people each day
• 1 fatality per 80 million miles
• 1 in 625 lifetime chance of dying in a car crash
• The human error rate is approximately 0.000001% – this is what AI needs to improve on!
Source of the images: Stanford and Wikipedia
1993 Accident to Airbus A320-211 Aircraft in Warsaw
Question: How safe do autonomous vehicles need to be?
• As safe as human-driven cars (7 deaths per 10⁹ miles)
• As safe as buses and trains (0.1–0.4 deaths per 10⁹ miles)
• As safe as airplanes (0.07 deaths per 10⁹ miles)
I. Savage, “Comparing the fatality risks in United States transportation across modes and over time”, Research in
Transportation Economics, 2013
As the complexity of a system increases, the accuracy of any single agent's own model of
that system decreases rapidly.
Technical debt is runaway complexity. For example, if it takes you enormous effort and
money to upgrade your system, you have accumulated huge technical debt. Remember
that the value of your system is inversely proportional to its maintenance cost.
Dark debt is a form of technical debt that is invisible until it causes failures.
Dark debt is found in complex systems and the anomalies it generates are complex
system failures. Dark debt is not recognizable at the time of creation. … It arises from the
unforeseen interactions of hardware or software with other parts of the framework. …
Unlike technical debt, which can be detected and, in principle at least, corrected by
refactoring, dark debt surfaces through anomalies.
Technical & Dark Debt
Perfection is achieved, not when there is nothing more to add, but when there is nothing left to take away.
– Antoine de Saint-Exupery
New challenges brought by AI
A single bit-flip error leads to the misclassification of an image by a DNN
From research by Karthik Pattabiraman
University of British Columbia
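The hazard can be illustrated at the bit level. The sketch below is our own illustration (not code from the cited research): flipping a single bit in the exponent field of an IEEE-754 float turns an ordinary network weight into infinity, which then propagates through every downstream layer.

```c
#include <assert.h>
#include <math.h>
#include <stdint.h>
#include <string.h>

/* Flips one bit of an IEEE-754 single-precision value. */
float flip_bit(float weight, int bit)
{
    uint32_t bits;
    memcpy(&bits, &weight, sizeof bits); /* reinterpret without undefined behavior */
    bits ^= (uint32_t)1 << bit;          /* the single upset event */
    memcpy(&weight, &bits, sizeof bits);
    return weight;
}
```

Flipping bit 30 of the weight 1.0f lands in the exponent: 0x3F800000 becomes 0x7F800000, which is positive infinity.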
• Design Goals
• Security Goals
• Vehicle architectures in the future: Software Defined
• Security, Functional Safety, Reliability
• Summary
Agenda
Information Security Goals
1. Secure boot
2. Secure auditing and logging
3. Authentication and authorization
4. Session Management
5. Input validation and output encoding
6. Exception management
7. Key management, cryptography, integrity, and availability
8. Security of data at rest
9. Security of data in motion
10. Configuration management
11. Incident response and patching
Together, these formulate the end-to-end security architecture for the product and thus should be considered alongside
one another, not in isolation. Each of the categories has many sub-topics within it. For example, under
authentication and authorization there are aspects of discretionary access controls and mandatory access controls to
consider. Security policies for the product are an outcome of the implementation decisions made during development
across these eleven categories.
We already know that a “control” strategy fails
worse than a “resilience” strategy.
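As a concrete instance of goal 1 (secure boot), each boot stage can verify the next stage's image against an expected digest before handing over control. This is a toy sketch under stated assumptions: FNV-1a stands in for a real cryptographic hash, and all names are ours; a production chain would use SHA-2 anchored in a hardware root of trust.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Computes a 32-bit FNV-1a digest of a memory region (toy stand-in
 * for a cryptographic hash -- NOT collision resistant). */
static uint32_t fnv1a(const uint8_t *data, size_t len)
{
    uint32_t h = 2166136261u;
    for (size_t i = 0; i < len; i++) {
        h = (h ^ data[i]) * 16777619u;
    }
    return h;
}

/* A boot stage transfers control only if the next stage's image
 * matches the digest provisioned at manufacturing time. */
int verify_next_stage(const uint8_t *image, size_t len, uint32_t expected)
{
    return fnv1a(image, len) == expected;
}
```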
Cyberattacks to CPS Control Layers
Control layer: regulatory control vs. supervisory control
Deception attacks
• Regulatory control: spoofing, replay, measurement substitution
• Supervisory control: set-point change, controller substitution
DoS attacks
• Regulatory control: physical jamming, increase in latency
• Supervisory control: network flooding, operational disruption
Estimation of CPS risks by naively aggregating risks due to reliability and security
failures does not capture the externalities,
and can lead to grossly suboptimal responses to CPS risks.
To thwart the outcomes that follow sentient opponent actions,
diversity of mechanism is required.
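A minimal countermeasure against the replay attacks listed above is a freshness check: each sender carries a monotonic message counter, and the receiver rejects anything not strictly newer than the last accepted message. The sketch below is illustrative (names are ours); real vehicle networks such as SecOC additionally authenticate the counter.

```c
#include <assert.h>
#include <stdint.h>

typedef struct {
    uint32_t last_seen; /* highest counter accepted so far */
} replay_guard_t;

/* Returns 1 if the message is fresh, 0 if it is replayed or stale. */
int accept_message(replay_guard_t *g, uint32_t counter)
{
    if (counter <= g->last_seen) {
        return 0; /* replayed or reordered: drop and raise a diagnostic */
    }
    g->last_seen = counter;
    return 1;
}
```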
The Honeymoon Effect
Design specifications miss important security details that appear only in code.
For most programmers it's hard enough to get the code into a state where the compiler
accepts it and interprets it correctly; worrying about making code human-readable is a
luxury.
The software industry needs to change its outlook from trying to achieve code perfection
to recognizing that code will always have security bugs.
[Figure: vulnerabilities per month vs. months since release. Current software engineering
literature supports the Brooks life-cycle model. Image taken from “Post-release reliability
growth in software products”, ACM Trans. Softw. Eng. Methodol., 2008.]
Cryptography ≠ Security
Whoever thinks his problem can be solved using cryptography, doesn’t understand his problem and doesn’t understand cryptography.
– Attributed by Roger Needham and Butler Lampson to each other
Cryptography rots, just like food. Every key and every algorithm has a shelf life. Some have a very short shelf life.
• How long do you need your cryptographic keys or algorithms to be secure? – this is cryptography shelf life (x years)
• How long will it take to extract secrets out of your system? – this is the end of honeymoon (z years)
• What are your parameters to reduce the attack surface and to update keys or algorithms? – this is ξ (pronounced “xi”)
If z < x + ξ, improve your architecture and infrastructure!
Cryptographic Agility
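The slide's inequality can be written directly as a check, using the same parameter names (x = required key lifetime, z = time to extract secrets, ξ = rotation window); the function itself is our illustration.

```c
#include <assert.h>

/* Returns 1 when z < x + xi, i.e. secrets can be extracted before the
 * required lifetime plus the key/algorithm rotation window has elapsed,
 * so the architecture and infrastructure need improvement. */
int needs_improvement(double x_years, double z_years, double xi_years)
{
    return z_years < x_years + xi_years;
}
```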
Anti-Virus and other security SW
On a recent software vulnerability watch list, about one-third of the reported software
vulnerabilities were in the security software itself.
The average time it takes to identify a cybersecurity incident is 197 days.
From DARPA High-Assurance Cyber Military Systems (HACMS) Proposer’s Day Brief.
1. Restrict all code to very simple control flow constructs: do not use goto statements, setjmp or longjmp constructs, or direct or indirect recursion.
2. Give all loops a fixed upper bound. It must be trivially possible for a checking tool to prove statically that the loop cannot exceed a preset upper bound on
the number of iterations. If a tool cannot prove the loop bound statically, the rule is considered violated.
3. Do not use dynamic memory allocation after initialization.
4. No function should be longer than what can be printed on a single sheet of paper in a standard format with one line per statement and one line per
declaration. Typically, this means no more than about 60 lines of code per function.
5. The code’s assertion density should average at least two assertions per function. Assertions must be used to check for anomalous conditions that
should never happen in real-life executions. Assertions must be side effect-free and should be defined as Boolean tests. When an assertion fails, an
explicit recovery action must be taken, such as returning an error condition to the caller of the function that executes the failing assertion. Any assertion for
which a static checking tool can prove that it can never fail or never hold violates this rule.
6. Declare all data objects at the smallest possible level of scope.
7. Each calling function must check the return value of non-void functions, and each called function must check the validity of all parameters provided by the
caller.
8. The use of the preprocessor must be limited to the inclusion of header files and simple macro definitions. Token pasting, variable argument lists (ellipses),
and recursive macro calls are not allowed. All macros must expand into complete syntactic units. The use of conditional compilation directives must be
kept to a minimum.
9. The use of pointers must be restricted. Specifically, no more than one level of dereferencing should be used. Pointer dereference operations may not be
hidden in macro definitions or inside typedef declarations. Function pointers are not permitted.
10.All code must be compiled, from the first day of development, with all compiler warnings enabled at the most pedantic setting available. All code must
compile without warnings. All code must also be checked daily with at least one, but preferably more than one, strong static source code analyzer and
should pass all analyses with zero warnings.
NASA’s Ten Principles of Safety-Critical Code
Gerard J Holzmann. The power of 10: rules for developing safety-critical code. Computer, 39(6):95–99, 2006.
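A small fragment can show several rules working together. This is our own sketch, not code from the paper; the names and the nonnegative-sample assumption are illustrative.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_SAMPLES 64 /* rule 2: a preset bound a static checker can prove */

/* Sums n nonnegative samples; returns -1 on any invalid input.
 * Rule 7: the callee validates every parameter provided by the caller,
 * and the explicit error return is the recovery action of rule 5. */
int sum_samples(const int *samples, int n)
{
    if (samples == NULL || n < 0 || n > MAX_SAMPLES) {
        return -1;
    }
    int sum = 0;
    for (int i = 0; i < n; i++) { /* rule 2: statically bounded loop */
        if (samples[i] < 0) {
            return -1; /* rule 5: anomalous condition, explicit recovery */
        }
        sum += samples[i];
    }
    return sum;
}
```

Per rule 7, a caller must in turn check the -1 return rather than use the sum blindly.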
No single point of failure—this means that no component should be exclusively dependent
on the operation of another component. Service-oriented architectures and middleware
architectures often do not have a single point of failure.
Diagnosing the problems—the diagnostics of the system should be able to detect
malfunctioning of the components, so mechanisms like heartbeat synchronization should be
implemented. Layered architectures support the diagnostics functionality, as they allow us
to build two separate hierarchies—one for handling functionality and one for monitoring it.
Timeouts instead of deadlocks—when waiting for data from another component, the
component under operation should be able to abort its operation after a period of time
(timeout) and signal to the diagnostics that there was a problem in the communication.
Service-oriented architectures have built-in mechanisms for monitoring timeouts.
Reliability and Fault Tolerance
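The "timeouts instead of deadlocks" pattern can be sketched as a bounded wait that reports a fault instead of blocking forever. All names here are ours, and the attempt budget stands in for a wall-clock timeout to keep the sketch deterministic.

```c
#include <assert.h>
#include <stddef.h>

typedef int (*poll_fn)(void *ctx, int *out); /* nonzero when data is ready */

enum { RECV_OK = 0, RECV_TIMEOUT = -1 };

/* Polls a data source at most max_attempts times; on timeout the caller
 * signals diagnostics that communication failed, rather than deadlocking. */
int recv_with_timeout(poll_fn poll, void *ctx, int *out, int max_attempts)
{
    for (int i = 0; i < max_attempts; i++) { /* bounded wait */
        if (poll(ctx, out)) {
            return RECV_OK;
        }
    }
    return RECV_TIMEOUT;
}

/* Test double: delivers the value 42 on the third poll. */
int flaky_source(void *ctx, int *out)
{
    int *calls = (int *)ctx;
    if (++(*calls) >= 3) { *out = 42; return 1; }
    return 0;
}
```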
Example: Dynamic heap memory allocation shall not be used.
In practice, this rule prohibits dynamic memory allocation for variables. The rationale behind the
rule is that dynamic memory allocation can lead to memory leaks, overflow errors, and failures
that occur randomly.
Defects related to memory leaks alone can be very difficult to trace and thus very costly. If
left in the code, memory leaks can cause nondeterministic behavior and software crashes.
These crashes might require a restart of the node, which is impossible during the runtime of a
safety-critical system.
Following this rule, however, also means that there is a limit on the size of the data structures that can be
used and that the system's memory needs are predetermined at design time, thus making the use
of this software “safer”.
Programming of Safety-Critical Systems
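The usual replacement for heap allocation is a fixed-size pool sized at design time. The fragment below is an illustrative sketch (names and sizes are ours): exhaustion becomes a bounded, predictable failure rather than a random runtime crash.

```c
#include <assert.h>
#include <stddef.h>

#define POOL_SIZE 4 /* memory need is fixed at design time */

typedef struct {
    int in_use;
    char payload[32];
} message_t;

static message_t pool[POOL_SIZE]; /* statically allocated, no malloc() */

/* Returns a free slot, or NULL when the pool is exhausted. */
message_t *msg_alloc(void)
{
    for (int i = 0; i < POOL_SIZE; i++) {
        if (!pool[i].in_use) {
            pool[i].in_use = 1;
            return &pool[i];
        }
    }
    return NULL; /* predictable failure the caller must handle */
}

void msg_free(message_t *m)
{
    if (m != NULL) {
        m->in_use = 0;
    }
}
```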
• Design Goals
• Security Goals
• Advanced Design
• Summary
Agenda
Three Pillars of Autonomous Systems
Autonomous vehicles are a key example where
designers are challenged with the simultaneous
integration of three critical areas:
1. supercomputing complexity,
2. hard real-time embedded performance, and
3. functional safety.
The Four Pillars of CPS
The four key pillars driving cyber-physical systems are:
1. Connectivity,
2. Monitoring,
3. Prediction, and
4. Self-Optimization.
While the first two have experienced recent technological enablement, prediction
and optimization are expected to radically change every aspect of our society.
• Components associated with physical control of the vehicle
• Components associated with safety
• Components associated with entertainment and convenience
Ultra-Reliable Systems
Air Force F-15 flying despite the absence of one of its wings.
The image demonstrates why self-repairing flight control systems play a vital role in aircraft
control.
From The Story of Self-repairing Flight Control
Systems by James E. Tomayko
NASA photo (EC 88203-6) shows an Air Force F-15
flying despite the absence of one of the wings.
3-Dimensional Structure of Digital Security
Defense in Depth
Defense in Diversity
The 4 I's: Isolation, Inoperability, Incompatibility, Independence
But eventually everything fails. You have to make it fail in a predictable way.
Redundancy mechanisms: temporal redundancy, information redundancy, majority voting, self-healing.
Hardware Root of Trust (top to bottom):
• Software and services
• Hardware security services
• Hardware security building blocks
• Security features in the silicon
• Analog security monitoring under the CPU
A two-tier architecture is required!
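Majority voting over triple-redundant channels can be sketched as a bitwise 2-of-3 voter: each output bit takes the value reported by at least two of the three replicas, so any single faulty channel is masked. The fragment is our illustration of the standard technique.

```c
#include <assert.h>

/* Bitwise 2-of-3 majority vote: (a&b) | (a&c) | (b&c). */
unsigned majority3(unsigned a, unsigned b, unsigned c)
{
    return (a & b) | (a & c) | (b & c);
}
```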
Self-* and High Dependability
Self-healing is the ability of the system to autonomously change its structure so that its
behavior stays the same.
Self-adaptation is increasingly used in safety-critical systems, as it allows us
to change the operation of a component in the presence of errors and failures.
[Diagram: self-adaptation loop linking Self-Monitor, Self-Diagnosis, Fault Identification,
Candidate Fix Generation, Self-Testing, and Deployment, triggered by an anomalous event.]
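The self-healing stages named on this slide can be sketched as a simple state machine. The stage names come from the slide; the transition order is our assumption, with an anomalous event kicking off diagnosis and a deployed fix returning the system to monitoring with unchanged behavior.

```c
#include <assert.h>

typedef enum {
    MONITORING,
    DIAGNOSING,
    IDENTIFYING_FAULT,
    GENERATING_FIX,
    TESTING_FIX,
    DEPLOYING
} heal_state_t;

/* Advances the self-healing loop by one step. */
heal_state_t next_state(heal_state_t s, int anomaly_detected)
{
    switch (s) {
    case MONITORING:        return anomaly_detected ? DIAGNOSING : MONITORING;
    case DIAGNOSING:        return IDENTIFYING_FAULT;
    case IDENTIFYING_FAULT: return GENERATING_FIX;
    case GENERATING_FIX:    return TESTING_FIX;
    case TESTING_FIX:       return DEPLOYING;
    case DEPLOYING:         return MONITORING; /* behavior stays the same */
    }
    return MONITORING;
}
```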
• Design Goals
• Security Goals
• Advanced Design
• Summary
Agenda
Summary
1. Absolutely secure systems are impossible; with enough money and
commitment, any system can be broken
2. Assume your system is compromised and build it so that it can recover
3. Strive for continuous incremental improvement, not perfection
4. We do not know how to build 100% reliable systems; we only know how to
manage risk. Your system will fail, and your design has to ensure that it
fails in a predictable way.
Thank you.
Legal Disclaimer
This presentation contains the general insights and opinions of Intel Corporation (“Intel”). The information in this presentation is provided for
information only and is not to be relied upon for any other purpose than educational. Use at your own risk! Intel makes no representations or
warranties regarding the accuracy or completeness of the information in this presentation. Intel accepts no duty to update this presentation based on
more current information. Intel is not liable for any damages, direct or indirect, consequential or otherwise, that may arise, directly or indirectly, from
the use or misuse of the information in this presentation.
Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.
* Other names and brands may be claimed as the property of others.
© 2018 Intel Corporation.


Editor's Notes

  • #3 Every 30 years there is a new wave of things that computers do. Around 1950 they began to model events in the world (simulation), and around 1980 to connect people (communication). Since 2010 they have begun to engage with the physical world in a non-trivial way (embodiment – giving them bodies).
  • #5 https://tatourian.blog/2014/03/06/interview-with-bill-buxton-from-microsoft-research/ Your system architecture has to be adaptable and evolvable. Requirements and technologies change. You have to design your system for that change!
  • #6 If you have to kiss a lot of frogs to find a prince, find more frogs and kiss them faster and faster.
  • #7 Miles driven annually: all drivers: 10,658 miles (29.2 miles per day); rural drivers: 12,264 miles; urban drivers: 9,709 miles.
Fatalities: fatal crashes: 29,989; all fatalities: 32,675; car occupants: 12,507; SUV occupants: 8,320; pedestrians: 4,884; motorcyclists: 4,295; bicyclists: 720; large trucks: 587.
Each day 29 people in the United States die in an alcohol-impaired driving crash; that is one person every 49 minutes. On average since 1982, one-third of all traffic fatalities have been alcohol-impaired driving fatalities, with more than 10,400 people killed in 2016. Almost 40 percent of alcohol-impaired driving fatalities are victims other than the drinking driver. 214 children aged 14 years or younger were killed in alcohol-impaired driving crashes in 2016. Rural areas are disproportionately affected by alcohol-impaired driving crashes and fatalities. The total economic cost of alcohol-impaired driving crashes was $121.5 billion in 2010 (including medical costs, earnings losses, productivity losses, legal costs, and vehicle damage).
The more complex the system, the more potential anomalies hidden in the corners. While these anomalies may be rare, there are more than a billion car trips per day in the United States, greater than ten thousand times the number of daily airline flights. Google engineers speak about the “lazy driver”: the 93 percent of car accidents estimated to derive from human error. (Of course, human-factors specialists have long understood that human errors often are the result of poor system design and poor work practices.)
Example of interacting requirements: the 1993 accident to an Airbus A320-211 aircraft in Warsaw (wet runway, crosswind). The aircraft banked into the crosswind, so the left wheels touched down 9 seconds after the right. The pilot applied reverse thrust and spoilers, but they were disabled until the left gear compressed. Why?
Reverse thrust and spoilers must be disabled in the air. Landing logic requires compression of both left and right gear. Spoilers activate above 72 kts wheel speed or if both landing gear struts are compressed.
http://www.rvs.uni-bielefeld.de/publications/Incidents/DOCS/ComAndRep/Warsaw/warsaw-report.html
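The interacting requirements from the Warsaw report reduce to two predicates, which can be written out directly. This is a sketch: the function names are mine; the 72 kts threshold and the both-struts condition are from the report.

```c
#include <stdbool.h>

/* Simplified A320 braking-logic predicates from the Warsaw report. */

/* Ground spoilers activate above 72 kts wheel speed, or when BOTH
   landing-gear struts are compressed. */
bool ground_spoilers_enabled(double wheel_speed_kts,
                             bool left_gear_compressed,
                             bool right_gear_compressed) {
    return wheel_speed_kts > 72.0 ||
           (left_gear_compressed && right_gear_compressed);
}

/* Reverse thrust must be disabled in the air: the landing logic
   requires compression of both main-gear struts. */
bool reverse_thrust_enabled(bool left_gear_compressed,
                            bool right_gear_compressed) {
    return left_gear_compressed && right_gear_compressed;
}
```

During the 9 seconds when only one strut was compressed and the wheels on the wet runway stayed below 72 kts, both predicates are false, so braking was withheld exactly as each individual requirement specified; the failure emerged from their interaction.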
  • #8 https://www.johndcook.com/blog/2018/03/01/dark-debt
  • #9 Resilience and Security in Cyber-Physical Systems: Self-Driving Cars and Smart Devices Karthik Pattabiraman University of British Columbia 2017 https://youtu.be/O6NKY2oE99M This is a joint Microsoft/Nvidia research. The first half of the talk is entirely on functional safety and resilience of DNNs, the second describes invariant-based Intrusion Detection System. The future will be defined by autonomous computer systems that are tightly integrated with the environment, also known as Cyber-Physical systems (CPS). Resilience and security become extremely important in these systems, as a single error or security attack can have catastrophic consequences. In this talk, I will consider the resilience and security challenges of CPS, and how to protect them at low costs. I will give examples of two recent projects from my group, one on improving the resilience of Deep Neural Network (DNN) accelerators deployed in self-driving cars, and the other on deploying host-based intrusion detection systems on smart embedded devices such as smart meters and smart medical devices. Finally, I will discuss some of our ongoing work in this area, and the challenges and opportunities. This is joint work with my students and industry collaborators.
  • #12 In cybernetics and control theory, a setpoint (also set point, set-point) is the desired or target value for an essential variable, or process value of a system.[1] Departure of such a variable from its setpoint is one basis for error-controlled regulation using negative feedback for automatic control. [2]. The set point is usually abbreviated to SP, and the process value is usually abbreviated to PV.[3] https://en.wikipedia.org/wiki/Setpoint_(control_system)
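The setpoint/process-value relationship above can be illustrated with a minimal proportional negative-feedback regulator. This is a sketch: the struct, the gain value, and the accumulator plant in the usage note are illustrative, not from any particular control library.

```c
/* Minimal negative-feedback regulator: the control effort is
   proportional to the error between the setpoint (SP) and the
   process value (PV). */
typedef struct {
    double sp;   /* setpoint: desired value of the essential variable */
    double kp;   /* proportional gain */
} p_controller_t;

/* One regulation step: departure of PV from SP is the error, and the
   returned effort opposes it (negative feedback). */
double control_step(const p_controller_t *c, double pv) {
    double error = c->sp - pv;
    return c->kp * error;
}
```

Simulating a plant that simply accumulates the control effort (`pv += control_step(&c, pv)`) drives the process value toward the setpoint, because the correction always opposes the error and shrinks as the error shrinks.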
  • #13 Familiarity breeds contempt: the honeymoon effect and the role of legacy code in zero-day vulnerabilities https://www.semanticscholar.org/paper/Familiarity-breeds-contempt%3A-the-honeymoon-effect-Clark-Frei/1148f37a8ca0a5ca0a26178c7d85a063bd539725
  • #15 DARPA High-Assurance Cyber Military Systems (HACMS) Proposer’s Day Brief. The average time to identify a cybersecurity incident is 197 days, according to the 2018 Cost of a Data Breach Study from the Ponemon Institute, sponsored by IBM. Companies that contain a breach within 30 days have an advantage over their less-responsive peers, saving an average of $1 million in containment costs.
  • #16 Gerard J Holzmann. The power of 10: rules for developing safety-critical code. Computer, 39(6):95–99, 2006.
  • #20 Developers need to efficiently produce systems that meet safety and other key system-level requirements. This approach facilitates flexible and efficient integration of internal, 3rd party, and/or customer IP subsystems to support late design changes and potentially customer-specific technology/IP requirements.
  • #22 I sum up this model as design for security, ship, analyze, self-heal or quarantine, and treat (if required). Hackers can generally pivot faster than product-makers, so our approach must be anticipatory, flexible, and resilient. “I can see a world where we will have put hackers out of business.” – Simon Segars, CEO, Arm. From The Story of Self-Repairing Flight Control Systems.
  • #23 But eventually everything fails; you have to make it fail in a predictable way. Here there are two strong links and one weak link. In case of failure, the weak link will disintegrate before the two strong links fail and detonate the warhead. The two strong links are made using different architectures (incompatibility). We already know that a “control” strategy fails worse than a “resilience” strategy.
Temporal redundancy: read commands multiple times; use median voting.
Information redundancy: process values multiple times; store several copies in memory; use majority voting to schedule control commands.
Independence: design of subsystems to prevent common-mode and common-cause failures, such that the failure of one subsystem does not affect the failure of another subsystem.
Incompatibility: the use of energy or information that will not be duplicated inadvertently.
Isolation: the predictable separation of weapon elements from compatible energy.
Inoperability: the predictable inability of weapon elements to function.
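The temporal- and information-redundancy voting schemes above can be sketched for triple redundancy. The helper names are illustrative, and the sketch assumes at most one of the three values is corrupted.

```c
/* Temporal redundancy: read a command three times and take the median,
   so a single corrupted read cannot select a bad value. */
int median3(int a, int b, int c) {
    if ((a <= b && b <= c) || (c <= b && b <= a)) return b;
    if ((b <= a && a <= c) || (c <= a && a <= b)) return a;
    return c;
}

/* Information redundancy: keep three copies of a stored value and
   majority-vote; a single flipped copy is outvoted and scrubbed.
   (With more than one corrupted copy, no 3-copy vote can recover.) */
int majority3(int copies[3]) {
    int v = (copies[0] == copies[1] || copies[0] == copies[2])
                ? copies[0]
                : copies[1];
    copies[0] = copies[1] = copies[2] = v;   /* repair the bad copy */
    return v;
}
```

Median voting suits sampled commands where a glitch produces an outlier; majority voting suits stored values where a bit flip makes one copy disagree exactly.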
  • #24 Despite considerable work in fault tolerance and reliability, software remains notoriously buggy and crash-prone. The current approach to ensuring the security and availability of software consists of a mix of different techniques:
Proactive techniques seek to make the code as dependable as possible, through a combination of safe languages (e.g., Java [5]), libraries [6] and compilers [7, 8], code analysis tools and formal methods [9, 10, 11], and development methodologies.
Debugging techniques aim to make post-fault analysis and recovery as easy as possible for the programmer responsible for producing a fix.
Runtime protection techniques try to detect the fault using some type of fault isolation, such as StackGuard [12] and FormatGuard [13], which address specific types of faults or security vulnerabilities.
Containment techniques seek to minimize the scope of a successful exploit by isolating the process from the rest of the system, e.g., through use of virtual machine monitors such as VMware or Xen, system call sandboxes such as Systrace [14], or operating system constructs such as Unix chroot(), FreeBSD’s jail facility, and others [15, 16].
Byzantine fault-tolerance and quorum techniques rely on redundancy and diversity to create reliable systems out of unreliable components [17, 1, 18].
These approaches offer a poor tradeoff between assurance, reliability in the face of faults, and performance impact of protection mechanisms. In particular, software availability has emerged as a concern of equal importance as integrity.
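As one concrete instance of the runtime-protection category, a StackGuard-style guard word can be sketched as follows. The struct layout stands in for the compiler-placed stack canary, and the names and constant are illustrative, not StackGuard's actual mechanism.

```c
#include <stdint.h>
#include <string.h>

#define CANARY 0xDEADBEEFu

/* A guard word placed directly after the buffer, in the spirit of
   StackGuard; a real compiler places the canary next to the saved
   return address on the stack. */
struct guarded {
    char     buf[16];
    uint32_t canary;
};

/* Performs a deliberately unchecked copy into buf, then checks the
   guard word. Returns 1 if the copy stayed in bounds, 0 if it ran
   over and clobbered the canary. len must not exceed sizeof *g. */
int guarded_write(struct guarded *g, const char *src, size_t len) {
    g->canary = CANARY;
    memcpy(g, src, len);
    return g->canary == CANARY;
}
```

A real StackGuard canary is checked in the function epilogue, and a failed check aborts the process rather than returning into attacker-controlled state; the point is that the overrun is detected before the corrupted data is used.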