Supporting Concurrency Abstractions in High-level Language Virtual Machines

Supporting Concurrency Abstractions in
High-level Language Virtual Machines

Stefan Marr
Promotor: Prof. Dr. Theo D’Hondt
Copromotor: Dr. Michael Haupt

Software Languages Lab
Public PhD Defense, 2013-01-18

Applications
Need to be adapted for each Platform

Virtual Machines: Cross-Platform
“Write Once, Run Anywhere”

Virtual Machine

VMs provide modern Tools

Just-In-Time Compilation
for Performance

Automated Memory
Management

Classic C/C++
Virtual Machine 4

Used as Multi-Language VMs

F#
Just-In-Time Compilation
for Performance

Automated Memory
Management

Virtual Machine 5

CPUs don’t get Faster Anymore

GHz 3.8
4

3

1.5
2

1
0.2

0
1990 1995 2000 2005

CPUs don’t get Faster But Multiply

GHz 3.8 3.8
4
3.5
3.33

3
1 core 6 cores
1.5
2

1
0.2

0
1990 1995 2000 2005 2005 2010 2015

What developers now have to keep
track of in their heads:

How to approach complexity?

Solution:
Better tools
9

How To Support Concurrent and Parallel
Programming Abstractions in a VM?

Main
Contribution
Virtual Machine + OMOP
Virtual Machine
Context 10

Thesis Statement
There exists a
relevant and significant subset of concurrent and
parallel programming concepts
that can be realized on top of a unifying substrate.
This substrate enables the
flexible definition of language semantics
that build on the identified set of concepts, and this
substrate
lends itself to an efficient implementation.
1/21/201 11

Agenda

Survey Problems Requirements The OMOP Evaluation Implementation Evaluation
Substrate Performance

There exists a
relevant and significant subset of concurrent and parallel programming concepts
that can be realized on top of a unifying substrate.
This substrate enables the
flexible definition of language semantics
that build on the identified set of concepts, and this substrate
lends itself to an efficient implementation.

12

What do we need for a Multi-Language Virtual Machine?

2 SURVEYS: VMS AND CONCEPTS
1/21/201 13

Survey 1: Today’s VM Support
Always a mismatch with some concepts
VM Threads & Locks Communicating Communicating
Threads Isolates
CLI X X
DisVM X
Erlang X
GHC X
JavaScript X
JVM + Dalvik X
Mozart/Oz X X
Python X X
Ruby X
Self X
Squeak X
* Table is abbreviated (Diss. Tab. 3.2)

Marr, S.; Haupt, M.; and D’Hondt, T. (2009), Intermediate language design of high-level language virtual machines: Towards
comprehensive concurrency support. In VMIL’09 Workshop, pages 3:1–3:2. ACM. (extended abstract)

Surveys 14

Survey 2: Concurrent and Parallel
Programming Concepts

1/21/201 Surveys 15

Survey Goal: Understand Concepts
LIBRARY Can it be implemented as a library? Solved
Problems
PRIOR ART Is it supported by a mainstream VM?

SEMANTICS Does it require runtime support to
guarantee its semantics?

PERFORMANCE Would runtime support enable significant
performance improvements?

Marr, S. and D’Hondt, T. (2012), Identifying A Unifying Mechanism for the Implementation of Concurrency Abstractions on Multi-
Language Virtual Machines, in TOOLS’12, Springer, pp. 171–186.

1/21/201 Surveys 16

Existing and Library Solutions
• 97 concepts identified, 66 considered distinct
Prior Art Library Solutions
Asynchronous Operations Join Agents Guards
Atomic Primitives Locks
Atoms MVars
Co-routines Memory Model
Condition Variables Method Invocation Concurrent Objects Message Queue
Critical Sections Race-And-Repair Event-Loop Parallel Bulk Operations
Fences Thread Pools
Global Address Spaces Thread-local Variables
Events Reducers
Global Interpreter Lock Threads Far-References Single Blocks
Green Threads Volatiles
Futures State Reconciliation
Immutability Wrapper Objects

Surveys 17

VM Support Required for:
Performance Improvements Semantic Guarantees
APGAS Implicit Parallelism Active Objects Message sends
Actors No-Intercession
Barriers Locality
Asynchronous Persistent Data
Clocks Mirrors Invocation Structures

Data Movement One-sided Axum-Domains Replication
Communication By-Value Side-Effect Free
Channels Speculative
Data-Flow Graphs Ownership Execution

Data-Flow Variables PGAS Data Streams Transactions
Isolation Tuple Spaces
Fork/Join Vector Operations
Map/Reduce Vats

Surveys 18

Results: Two Independent Sets of
Requirements
Improving Performance Ensuring Correct Semantics
• Optimization infrastructure • Custom language behavior
• Monitoring facilities • Semantic enforcement
Exposed to language implementer

Surveys 19

Focus on Semantic Aspects
Improving Performance Ensuring Correct Semantics
• Optimization infrastructure • Custom language behavior
• Monitoring facilities • Semantic enforcement
Exposed to language implementer

1/21/201 Surveys 20

Challenges for concurrent programming concepts on today’s VM

COMMON PROBLEMS
1/21/201 21

Need to Solve Common
Language-Implementation Problems
• Isolation
– State encapsulation
– Safe message passing
• Scheduling guarantees
– Fairness, ordering,…
• Immutability
• Reflection violates
concurrency properties
[1] Karmani, R. K.; Shali, A. & Agha, G. (2009), Actor Frameworks for the JVM Platform: A Comparative Analysis, in PPPJ '09 , ACM.
[2] Marr, S.; De Wael, M.; Haupt, M. & D'Hondt, T. (2011), Which Problems Does a Multi-Language Virtual Machine Need to 22
Solve in the Multicore/Manycore Era?, in VMIL’11, ACM.

Isolation?
Broken State Encapsulation
Weak
object semaphore {
Semantics
class SemaActor() extends Actor {
are Common
def enter() {
if (num < MAX) { // Race • Scala
Condition! • Akka
num = num + 1; } } } • JCSP
• Kilim
def main() : Unit = { • Clojure
var gate = new SemaActor() • Swing UI
gate.start • …
gate ! enter // gate's thread
gate.enter // main thread
} }
Example from: Karmani, R. K.; Shali, A. & Agha, G. (2009), Actor Frameworks for the
1/21/201 JVM Platform: A Comparative Analysis, in PPPJ '09 , ACM. 23 23

Reflection?
Voids Concurrency Semantics
Client actor

// ...
customer report<-add(customer.order)
order // ...

add(order)
Report actor

void add(obj) {
report fields = obj.class.getFields()
for (field : fields) {
field.setAccessible(true)
// breaks isolation
list += field.get(obj)
}
}

Problems 24

From Analysis to Construction
25

What does the VM need to support?

DEDUCING REQUIREMENTS

26

Deduced Requirements for VMs
Based on Survey and Common Problems

Managed Managed Notion of Controlled
State Execution Ownership Enforcement

Requirements 27

Required: Managed State

• Variety of state access policies
– Actors, agents, axum domains,
by-value, immutability, isolation,
side-effect freeness, transactions, vats

• Problematic on today’s VMs
Managed State
– Isolation
– Immutability

Requirements 28

Required: Managed Execution

• Variety of execution policies
– asynchronous invocation, actors,
agents, APGAS, by-value,
execution guards

• Reflective invocations voids
Managed concurrency semantics
Execution

Requirements 29

Required: Notion of Ownership
• Policies based on Object Ownership
– Execution
• Asynchronous invocation, agents, actors,
APGAS places, CSP, …
– State access
• Isolation, vats, agents, tuple spaces,…
Notion of
Ownership
• Typically on object granularity

Requirements 30

Required: Controlled Enforcement
• Ubiquitous use of reflection
– Serialization
– Unit testing, mockups
– Frameworks, annotations
– Language restriction workarounds

Controlled • Voids all semantics, incl.
Enforcement concurrency

Requirements 31

Flexible definition of Concurrency Policies

AN OWNERSHIP-BASED
METAOBJECT PROTOCOL
1/21/201 32

All discussed Experiments are Implemented
With Smalltalk, the RoarVM, and SOM

The Ownership-based MOP 33

Concurrency Domains:
Ownership-based Metaobject Protocol
Domain
0..* 1 0..*
1 readField:of:(idx, obj) : Object
Object owned by write:toField:of:(val, idx, obj) : Object Thread

runs in
requestExecOf:on:with:(sel, obj, args): Object enforced: bool
requestThreadResume:(thread) : Thread
initialDomainForNewObjects() : Domain
primCopy:(obj) : Object
Specific

prim*(…) : Object Method
VM

readGlobal:(global) : Object unenforced: bool
write:toGlobal:(val, global) : Object
adopt:(obj) : Object
Helper

Notion of
Controlled
Managed evaluateEnforced:(block) : Object
Ownership
Enforcement
Execution
State spawnHere:(block) : Thread



Domain Definition for Immutability
ImmutableDomain = Domain (
write: val toField: idx of: obj = unenforced
(
ImmutabilityError signal
)

prim_at: idx put: val on: obj = unenforced (
ImmutabilityError signal
)
"… all mutating operations + primitives"
)

Domain enforcing Immutability
current: immDomain:
main()
Domain ImmutableDomain
new cell: Cell
set: 1
evaluateEnforced:[ adopt: cell
cell set: 2]

set: 2 reqExecOf: #set on: cell with: 2

perform: #set with: 2

cell set: 2
enforced

write: 2 toField: 1 of: cell

ImmutabilityError


OMOP vs. Common Problems: Isolation
Realized based on OWNERSHIP and MANAGED STATE

Domain
0..* 1 0..*
1 readField:of:(idx, obj) : Object
Object write:toField:of:(val, idx, obj) : Object Thread

runs in
owned by

Specific

VM



OMOP vs. Common Problems: Reflection
Realized based on MANAGED EXECUTION and
CONTROLLED ENFORCEMENT

Domain
0..* 1 1 0..*
readField:of:(idx, obj) : Object
Object write:toField:of:(val, idx, obj) : Object Thread

runs in
owned by

Specific

VM



The OMOP: Proposed Design
Key Elements
Object
Domain
readField:of: owned by
write:toField:of: Domain
primCopy: requestThreadResume:
prim* initialDomainForNewObjects
Managed Notion of runs in
readGlobal:
State Ownership
write:toGlobal:
Thread

Domain
requestExecOf:on:with:
Thread
primCopy:
Managed Controlled enforced :bool
prim*
Execution Enforcement


Semantics

EVALUATION
1/21/201 40

Case Studies: Cover complete OMOP
STM Event-Loop Actors
Clojure Agents LRSTM AmbientTalkST

Ad Hoc
No Guarantees Custom AST Transformation Wrapper Objects for
Safe Message Passing

OMOP-based
Full Guarantees Custom State Access Policies Custom Execution Policies
based on Ownership
LRSTM ported from: Renggli, L. & Nierstrasz, O. (2007), Transactional Memory for Smalltalk, in ‘Proceedings of
Evaluation: Semantics 41
the International Conference on Dynamic Languages 2007’, ACM, pp. 207-221.

Evaluation Thesis Statement Part 1
“relevant and significant subset of concurrent
and parallel programming concepts”

Relevant:
Novel set of supported
concepts
Significant:
Support all concepts X +


Novel Set of Supported Concepts
VM Threads & Locks Communicating Communicating
Threads Isolates
CLI X X
DisVM X
Erlang X
GHC X
JavaScript X
JVM + Dalvik X
Mozart/Oz X X
Python X X
Ruby X
OMOP X X X
* Table is abbreviated (Diss. Tab. 3.2)


Surveys 43

Degree of Support for Concepts
X direct support + partial support
Semantics require Support OMOP Semantics require Support OMOP
SUPPORT SUPPORT
Active Objects X Message sends +
Actors X No-Intercession X
Asynchronous Invocation X Ownership X
Axum-Domains X Persistent Data Structures +
By-Value + Replication +
Channels + Side-Effect Free X
Data Streams + Speculative Execution +
Implicit Parallelism + Transactions X
Isolation X Tuple Spaces +
Map/Reduce + Vats X


“relevant and significant subset of concurrent
and parallel programming concepts”

Relevant:
Novel set of supported
concepts ✔
Significant:
Support all concepts X + ✔



“on top of a unifying substrate”

Abstraction:
No 1-to-1 mapping of concepts ✔
Minimalism:
All elements required ✔


Implementation Size in LOC

Ad hoc Ownership-
based MOP
Agents 36* 113.
LRSTM 990. 262.
AmbientTalkST 183⁺ 83⁺
ActiveObjects - 68.
CSP+π - 53.
*) without enforcement of semantics
+) incomplete state encapsulation


“enables the
flexible definition of language semantics”

Demonstration:
✔
Case studies
Concepts X + ✔
Implementation Assessment:
Size ✔

IMPLEMENTATION STRATEGIES
1/21/201 49

Impl 1: AST Transformation Applied
cell := Cell new. "unenforced"
cell set: 1.
immDomain adopt: cell.

result := ["enforced"
"cell set: 2"
cell domain reqExecOf: #set:
with: 2
on: cell.
] enforced.

1/21/201 Implementation 50

Impl 3: Bytecodes + Primitives Adapted
void unenforced_extendedSendBytecode() { Impl 2: does a test
messageSelector = literal(fetchByte()); in every bytecode
set_argCount(fetchByte());
normalSend();
}

void enforced_extendedSendBytecode() {
int args = fetchByte();
set_argCount(args + 2);

Oop rcvr = stackValue(args);
Oop domain = rcvr.domain();
set_stackValue(args, domain);
push(literal(fetchByte()));
push(rcvr);
messageSelector = req_exec_selector(args);
normalSend();
}

1/21/201 Implementation 51

Performance

EVALUATION
1/21/201 52

Methodology 3000

• Benchmarks AmbientTalkST STM 2500

– Custom Micro + preexisting kernel

Runtime (in ms)
2000

1500

• Measurement setup 1000

– Reduce systematic measurement bias 500
– Reduce interpreter overhead
– Removed parallelism
0

RoarVM (opt)

RoarVM (std)
• Results generalizable for interpreters

Evaluation: Performance 53

Performance
Runtime, normalized to corr. Ad Hoc implementation

AST−OMOP on CogVM
RoarVM+OMOP3 (opt)
10.00

Overall slowdown:
25% (min -3%, max 3x)
3.16

Performance Sweet Spot
• Custom state access policies
• Within 5% on average for
1.00
LRSTM (min -3%, max. 12%)
Binary Trees (AT)

Fannkuch (AT)

Fasta (AT)

NBody (AT)

Binary Trees (STM)

Fannkuch (STM)

Fasta (STM)

NBody (STM)

AmbientTalkST STM

Evaluation: Performance 54

Identified Requirements
for Unifying Abstraction

Managed State

Managed Execution

Notion of Ownership

Requirements based on surveys Controlled Enforcement
and common problems.

Conclusion 56

Ownership-based MOP as Concurrency
Abstraction for VMs

Agents
AmbientTalkST
ActiveObjects
CSP+π
STM

Conclusion 57

Benefits of the OMOP
Solves Common Problems Supports Semantics of Concepts
• Isolation • Active objects, actors, async
• Immutability invocation, axum domains,
• Reflection isolation, side-effect free,
transactions, vats, …
• Improves Enforceability of
Scheduling Policies

Conclusion 58

Future Work
• VM support for parallel performance
• Evaluate OMOP overhead in JIT’ed VMs, JVM
• Widen sweet spot (actors, CSP, …): use MMU
• Limitations
– Interactions between domains: Actors + STM,…
– Interface for scheduling policies

Conclusion 59

Main Contributions
Determined Devised Solved
Requirements the OMOP Common
Problems

Main Publications
Marr, S.; De Wael, M.; Haupt, M. & D'Hondt, T. (2011), Which Problems Does a Multi-Language Virtual Machine Need to Solve in
the Multicore/Manycore Era?, in VMIL’11, ACM.

Supporting Concurrency Abstractions in High-level Language Virtual Machines

Recommended

Recommended

More Related Content

Similar to Supporting Concurrency Abstractions in High-level Language Virtual Machines

Similar to Supporting Concurrency Abstractions in High-level Language Virtual Machines (20)

More from Stefan Marr

More from Stefan Marr (19)

Recently uploaded

Recently uploaded (20)

Supporting Concurrency Abstractions in High-level Language Virtual Machines

Editor's Notes