Abstract Interpretation meets model checking near the 1000000 LOC mark: Finding errors in the Linux Kernel Source (AVIS '06)


Slides for presentation on "Abstract Interpretation meets model checking near the 1000000 LOC mark" at 5th International Workshop on Automated Verification of Infinite-State Systems (AVIS'06), Apr 1, 2006. A preprint of the full paper is available at http://www.academia.edu/2494187/Abstract_Interpretation_meets_Model_Checking_near_the_10_6_LOC_mark .


Abstract Interpretation meets model checking near the 1000000 LOC mark - Finding errors in the Linux Kernel Source
Peter T. Breuer & Simon Pickin
Universidad Carlos III de Madrid

Goal
• Apply Formal Methods to the Linux kernel
• Methods must be
➢ post-hoc
➢ capable of application by non-experts
➢ able to handle 6.5 million lines of rapidly changing C code

Analysis Example - Sleep under Spinlock Hunt (SluSH)

What is sleep under spinlock?
• Sleep = thread scheduled out of CPU
• Spinlock = busy wait for lock release
• Two CPUs + two threads waiting on spinlocks = one dead machine
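The definition above can be illustrated with a minimal sketch, assuming one execution path has already been flattened into a list of event names. The event names and the function `find_sleep_under_spinlock` are hypothetical and are not part of SluSH.

```python
def find_sleep_under_spinlock(events):
    """Return the indices of 'sleep' events reached while a spinlock is held."""
    held = 0      # number of spinlocks currently held on this path
    alarms = []
    for i, ev in enumerate(events):
        if ev == "spin_lock":
            held += 1
        elif ev == "spin_unlock":
            held = max(0, held - 1)
        elif ev == "sleep" and held > 0:
            alarms.append(i)   # sleeping here can leave another CPU spinning forever
    return alarms


trace = ["spin_lock", "sleep", "spin_unlock", "sleep"]
print(find_sleep_under_spinlock(trace))  # [1]
```

Only the first sleep is reported, because the second one happens after the lock has been released.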

Example of bad code
• snd_sb_csp_load() in sb16_csp.c

Another piece of guilty code
• Kernel 2.6.12 sound/oss/sequencer.c midi_outc()

Other classes of problems detected
• Access (read/write) to kfreed memory
• Overflow 4096B of stack
• Spinlock under spinlock
• Call to function that expects non-NULL parameters with possibly NULL argument
• ...
– Logic is configured, so new tests can be invented

Example of kfree/access
• drivers/scsi/aic7xxx_old.c in kernel 2.6.3

Components of analysis system
• Description of statements as logic transformers
– p .... p[n-1/n]
• Trigger/action system for raising alarms!
• Combining logic NRB
• Guiding abstract interpretation s to state x
– x ∈ s ∩ p
– stops dead code evaluation, etc.

Statement Logic - NRB
• Single code statement
– maintains condition P normally
– empty statement cannot return (F)
– empty statement cannot break (F)

Sequence logic - NRB
• normal exit: traverse A then B
• return exit: return from A OR traverse A then return from B
• break exit: break from A OR traverse A then break from B
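The statement and sequence rules can be sketched as follows. This is an illustration under stated assumptions, not the analyser's own code: a predicate is modelled as a set of possible spinlock counts, the empty set plays the role of F, and the names `NRB`, `skip`, `ret`, `brk`, `lock` and `seq` are hypothetical.

```python
from typing import Callable, FrozenSet, NamedTuple

Pred = FrozenSet[int]       # assumed model: the set of possible spinlock counts
FALSE: Pred = frozenset()   # F: this exit cannot be taken


class NRB(NamedTuple):
    normal: Pred   # condition on falling through
    ret: Pred      # condition on leaving via return
    brk: Pred      # condition on leaving via break


Stmt = Callable[[Pred], NRB]


def skip(p: Pred) -> NRB:
    """Empty statement: keeps p normally, cannot return or break."""
    return NRB(p, FALSE, FALSE)


def ret(p: Pred) -> NRB:
    return NRB(FALSE, p, FALSE)


def brk(p: Pred) -> NRB:
    return NRB(FALSE, FALSE, p)


def lock(p: Pred) -> NRB:
    """Taking a spinlock adds one to every possible count."""
    return NRB(frozenset(n + 1 for n in p), FALSE, FALSE)


def seq(a: Stmt, b: Stmt) -> Stmt:
    """A; B combined exactly as in the three sequence rules."""
    def run(p: Pred) -> NRB:
        na, ra, ba = a(p)
        nb, rb, bb = b(na)                  # B only sees A's normal exit
        return NRB(nb, ra | rb, ba | bb)    # OR is set union
    return run


print(seq(lock, ret)(frozenset({0})))
```

For `lock; return` started with a count of 0, the normal and break exits are F and the return exit carries a count of 1.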

Loop logic - NRB
• break from body is only normal exit from while(1)
• relax p until it is invariant
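A minimal sketch of the loop rule, assuming predicates are single integer intervals and that "relax" means pushing any bound that moves out to infinity (a standard widening step; the slides do not say which relaxation the analyser uses). The loop body `if (x >= 10) break; x++;` and all function names are hypothetical.

```python
import math
from typing import NamedTuple, Optional, Tuple

Interval = Optional[Tuple[float, float]]   # None stands for F (unreachable)


class NRB(NamedTuple):
    normal: Interval
    ret: Interval
    brk: Interval


def join(a, b):
    if a is None:
        return b
    if b is None:
        return a
    return (min(a[0], b[0]), max(a[1], b[1]))


def meet(a, b):
    if a is None or b is None:
        return None
    lo, hi = max(a[0], b[0]), min(a[1], b[1])
    return (lo, hi) if lo <= hi else None


def widen(old, new):
    """Relax: keep a bound that is stable, push a moving bound to infinity."""
    if old is None or new is None:
        return join(old, new)
    lo = old[0] if new[0] >= old[0] else -math.inf
    hi = old[1] if new[1] <= old[1] else math.inf
    return (lo, hi)


def body(p):
    """Abstract effect of the assumed body: if (x >= 10) break; x++;"""
    b = meet(p, (10, math.inf))
    stay = meet(p, (-math.inf, 9))
    n = None if stay is None else (stay[0] + 1, stay[1] + 1)
    return NRB(n, None, b)


def while_true(body, p):
    inv = p
    while True:
        relaxed = widen(inv, join(inv, body(inv).normal))
        if relaxed == inv:      # p is now invariant under the body
            break
        inv = relaxed
    out = body(inv)
    # The body's break exit becomes the loop's only normal exit.
    return inv, NRB(out.brk, out.ret, None)


inv, result = while_true(body, (0, 0))
print(inv, result.normal)
```

Starting from x = 0, the invariant relaxes to x ≥ 0 and the loop's normal exit is x ≥ 10. That is sound but less precise than the exact answer x = 10, which is the price of relaxing.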

Programmable trigger/action engine
• Three rules handle propagation of call graph and other housekeeping.
– a sleep call while the objective function is positive causes output:
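The trigger/action idea can be sketched as a list of (trigger, action) pairs applied to each event. This rule set is illustrative only and is not the analyser's actual configuration: the event format, the `analyse` function, the use of a spinlock count as the objective function, and the callee-before-caller ordering are all assumptions.

```python
def analyse(functions, known_sleepy):
    """functions maps a name to its event list, callees listed before callers.
    Events are ("lock",), ("unlock",) or ("call", callee)."""
    sleepy = set(known_sleepy)
    alarms = []
    rules = [
        # housekeeping: keep the objective function (spinlock count) up to date
        (lambda st, ev: ev[0] == "lock",
         lambda st, ev: st.update(spin=st["spin"] + 1)),
        (lambda st, ev: ev[0] == "unlock",
         lambda st, ev: st.update(spin=st["spin"] - 1)),
        # alarm: a sleepy call while the objective function is positive
        (lambda st, ev: ev[0] == "call" and ev[1] in sleepy and st["spin"] > 0,
         lambda st, ev: alarms.append((st["fn"], ev[1]))),
        # call-graph propagation: whoever calls a sleepy function is sleepy
        (lambda st, ev: ev[0] == "call" and ev[1] in sleepy,
         lambda st, ev: sleepy.add(st["fn"])),
    ]
    for fn, events in functions.items():
        st = {"fn": fn, "spin": 0}
        for ev in events:
            for trigger, action in rules:
                if trigger(st, ev):
                    action(st, ev)
    return alarms, sleepy


functions = {
    "helper": [("call", "schedule")],
    "driver_write": [("lock",), ("call", "helper"), ("unlock",)],
    "driver_read": [("lock",), ("unlock",), ("call", "helper")],
}
alarms, sleepy = analyse(functions, {"schedule"})
print(alarms)  # [('driver_write', 'helper')]
```

`driver_write` is flagged even though it never calls `schedule` directly, because `helper` was marked sleepy by the propagation rule first.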

Using the analyser
• Call with the same arguments as given to the gcc compiler

Limitations
• Predicates are restricted to unions of n-cubes
• State is not followed well enough:
– x = 1; if (x) A else B;
● treated correctly - only A is evaluated
– if (x) A else B; if (x) C else D;
● over-abstracted - A;C | A;D | B;C | B;D
– possible solution is to push state into the predicates
((x!=0);A | (x==0);B) ; ((x!=0);C | (x==0);D)
● but we can't follow calculation well - quickly get to ⊥
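The over-abstraction in the second example can be made concrete with a small path enumeration. It is a sketch under the assumption that A and B do not modify x; the branch tables and function names are hypothetical.

```python
from itertools import product

# Each branch is (guard, label); guard True means x != 0, False means x == 0.
first = [(True, "A"), (False, "B")]
second = [(True, "C"), (False, "D")]


def paths_ignoring_state():
    """What the analysis sees when the value of x is not tracked."""
    return [a + ";" + c for (_, a), (_, c) in product(first, second)]


def paths_with_guards():
    """Guards pushed into the predicates: both tests on x must agree."""
    return [a + ";" + c
            for (g1, a), (g2, c) in product(first, second) if g1 == g2]


print(paths_ignoring_state())  # ['A;C', 'A;D', 'B;C', 'B;D']
print(paths_with_guards())     # ['A;C', 'B;D']
```

Two of the four paths are infeasible, so any alarm raised only on `A;D` or `B;C` would be a false positive.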

Implication of predicates is decidable
• Basic evaluation is C → ∪ Ci of cubes
– i.e. ∪ Ci covers C
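One way to decide such a covering, sketched under the assumption that each cube is a box of half-open integer intervals (lo, hi) with lo < hi, is to cut C along every bound of the candidate cubes and check each resulting cell. The function `covered` is hypothetical, and this is not necessarily the procedure the analyser uses.

```python
from itertools import product


def covered(c, cubes):
    """Return True if box c lies inside the union of the boxes in cubes.
    A box is a tuple of half-open integer intervals (lo, hi), one per dimension."""
    dims = len(c)
    cuts = []
    for d in range(dims):
        lo, hi = c[d]
        pts = {lo, hi}
        for box in cubes:
            for v in box[d]:
                if lo < v < hi:
                    pts.add(v)      # cut c wherever a candidate box starts or ends
        pts = sorted(pts)
        cuts.append(list(zip(pts, pts[1:])))
    # After cutting, each cell is wholly inside or wholly outside every box.
    for cell in product(*cuts):
        inside_some_box = any(
            all(b[d][0] <= cell[d][0] and cell[d][1] <= b[d][1] for d in range(dims))
            for b in cubes
        )
        if not inside_some_box:
            return False
    return True


square = ((0, 4), (0, 4))
print(covered(square, [((0, 2), (0, 4)), ((2, 4), (0, 4))]))  # True
print(covered(square, [((0, 2), (0, 4)), ((2, 4), (0, 3))]))  # False
```

The check always terminates because the number of cells is finite, though it grows exponentially with the number of dimensions.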

Summary
• A step towards analyses of 100MLoC.
– No expertise needed
– Fast
– Copes with massive amounts of code
– Soundly based
• Negatives
– Not good at tracking program state; model checking?
– Not yet easy to extend to new problem classes

1. Abstract Interpretation meets model checking near the 1000000 LOC mark - Finding errors in the Linux Kernel Source Peter T. Breuer & Simon Pickin Universidad Carlos III de Madrid

2. Goal • Apply Formal Methods to the Linux kernel • Methods must be ➢ post-hoc ➢ capable of application by non-experts ➢ able to handle 6.5 million lines of rapidly changing C code

3. Analysis Example -Sleep under Spinlock Hunt (SluSH)

4. Output from SluSH run

5. What is sleep under spinlock? • Sleep = thread scheduled out of CPU • Spinlock = busy wait for lock release • Two CPUs + two threads waiting on spinlocks = one dead machine

6. Example of bad code • snd_sb_csp_load() in sb16_csp.c

7. Another piece of guilty code • Kernel 2.6.12 sound/oss/sequencer.c midi_outc()

8. Cox owns up

9. Output summarises likelihoods

10. Other classes of problems detected • Access (read/write) to kfreed memory • Overflow 4096B of stack • Spinlock under spinlock • Call to function that expects non-NULL parameters with possibly NULL argument • ... – Logic is configured, so new tests can be invented

11. Example of kfree/access • drivers/scsi/aic7xxx_old.c in kernel 2.6.3

12. Basic technique

13. The abstract view

14. Components of analysis system • Description of statements as logic transformers – p .... p[n-1/n] • Trigger/action system for raising alarms! • Combining logic NRB • Guiding abstract interpretation s to state x x ∈s ∩ p stops dead code evaluation, etc.

15. Statement Logic - NRB • Single code statement – maintains condition P normally – empty statement cannot return (F) – empty statement cannot break (F)

16. Sequence logic -NRB • normal exit: traverse A then B • return exit: return from A OR traverse A then return from B • break exit: break from A OR traverse A then break from B

17. Loop logic -NRB • break from body is only normal exit from while(1) • relax p until it is invariant

18. Conditional logic -NRB

19. Programmable trigger/action engine • Three rules handle propagation of call graph and other housekeeping. – a sleep call while the objective function is positive causes output:

20. Using the analyser • Call with the same arguments as given to the gcc compiler

21. Limitations • Predicates are restricted to unions of n-cubes • State is not followed well enough: – x = 1; if (x) A else B; ● treated correctly - only A is evaluated – if (x) A else B; if (x) C else D; ● over-abstracted - A;C | A;D | B;C | B;D – possible solution is to push state into the predicates ((x!=0);A | (x==0);B) ; ((x!=0);C | (x==0);D) ● but we can't follow calculation well - quickly get to ⊥

22. Implication of predicates is decidable • Basic evaluation is C → ∪ Ci of cubes – i.e. ∪ Ci covers C

23. Summary • A step towards analyses of 100MLoC. – No expertise needed – Fast – Copes with massive amounts of code – Soundly based • Negatives – Not good at tracking program state; model checking? – Not yet easy to extend to new problem classes
