Lecture on Rubinius for Compiler Construction at University of Twente

Slides for the guest lecture at the Compiler Construction (Vertalerbouw) course at the University of Twente.

  • Regular contributor since early 2008
  • First I want to explain two concepts. The first concept is about dynamic versus static languages.
    - Dynamic languages: type checking is done at runtime
    - Static languages: types are checked at compile time
  • The first is what happens in a dynamic language: types are not fixed and you can use different types if you want. The second is a static language: once a certain type is defined, the type can't be changed.
  • Another difference is how types are enforced. This is often confused with static and dynamic typing, seeing a dynamic language as weakly typed. That is often not true.
    - Strong typing enforces specific rules on what happens when you run certain operations on differently typed objects
    - Weak typing performs implicit type conversion when different types are used
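    To make the distinction concrete, here is a small Ruby snippet (not from the slides) showing the strong-typing behaviour: mixing an Integer and a String raises instead of silently converting.

      x = 10
      y = "20"

      begin
        x + y               # Ruby refuses to convert implicitly
      rescue TypeError => e
        puts e.message      # e.g. "String can't be coerced into Integer"
      end

      puts x + y.to_i       # => 30, but only after an explicit conversion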
  • So what is Ruby? Ruby is a dynamically, strongly typed language. So you don't specify types when writing Ruby code, but at runtime its types are not just converted automatically. It has primarily been influenced by Perl, Smalltalk and Lisp. The primary design philosophy is that a language should be productive and fun to use.
  • So how does it look? The classic Hello world! example.
  • This is what basic class and method definitions look like.
  • Block syntax. Blocks are lambda-like constructs. It's like passing a piece of code as a special argument to a method (see the sketch below).
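    As a small illustration of that idea (names invented for the example), a method can hand control to the block it receives with yield, and a block can also be captured explicitly as an object:

      # A method that hands control to the block it receives
      def twice
        yield 1
        yield 2
      end

      twice { |n| puts "call #{n}" }
      # call 1
      # call 2

      # Blocks can also be captured explicitly as objects
      printer = lambda { |n| puts "lambda got #{n}" }
      [1, 2, 3].each(&printer)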
  • Modules. They allow code to be mixed into other classes. It's a bit like multiple inheritance. This is made possible because of the dynamic nature of Ruby.
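    A minimal sketch of that "multiple inheritance" flavour, with invented module and class names:

      module Walking
        def walk
          "#{name} walks"
        end
      end

      module Talking
        def talk
          "#{name} talks"
        end
      end

      class Robot
        include Walking
        include Talking

        def name
          "R2"
        end
      end

      r = Robot.new
      puts r.walk                 # => "R2 walks"
      puts r.talk                 # => "R2 talks"
      p Robot.ancestors           # both modules end up in the method lookup chain
      # => [Robot, Talking, Walking, Object, Kernel, BasicObject]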
  • Rubinius includes everything needed to run Ruby code.
  • That means it includes a bytecode virtual machine, primarily designed for running Ruby code.
  • But it also includes all the core classes you expect in Ruby, like String, Hash and Array. In other languages it's often called the standard library, but that has a different meaning in Ruby, hence this name.
  • It was started in 2006, thought up by Evan Phoenix during his honeymoon. He and Brian Ford are paid full time to work on Rubinius.
  • So what does the virtual machine entail?
  • The initial version was written in Ruby. After this proof of concept, a first version of the virtual machine was written in C. In 2008 the choice was made to rewrite it in C++, because that is a better fit with the architecture of the VM.
  • The bytecode instruction set includes the necessary instructions for running Ruby code efficiently. This doesn't mean other languages can't be implemented on top of Rubinius.
  • So what does the bytecode look like? The comments at the end show how the variable names are mapped to slots.
  • One example is Fancy. It has a few different aspects compared to Ruby, like named parameters.
  • There's a bunch of other languages people created, mostly as experiments and not very mature.
  • So how do we make it fast? There are quite a few techniques for that, which will be discussed later in the lecture.
  • The kernel code is written in Ruby as much as possible. This means that classes like String, Hash and Array are written in Ruby itself.
  • This is what Array#each looks like. It uses a Tuple, which is a fixed-size array-like structure that Array is built upon. Hash is written in Ruby too.
  • Having more in Ruby also means that it's easier for people to help out. You don't need to know C / C++ in order to contribute to Rubinius.
  • The compiler parses the source into an AST and then outputs bytecode by walking the AST. Each node knows how to emit bytecode, such as this example for Fixnum literals. It stores the line number and the given value for the literal.
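    A heavily simplified, hypothetical sketch of that idea (the Generator and node classes below are stand-ins for illustration, not the real Rubinius API):

      class Generator
        attr_reader :instructions

        def initialize
          @instructions = []
        end

        def push(value)
          @instructions << [:push_literal, value]
        end

        def send_op(name)
          @instructions << [:send_op, name]
        end
      end

      class FixnumLiteralNode
        def initialize(value)
          @value = value
        end

        def bytecode(g)
          g.push @value
        end
      end

      class AddNode
        def initialize(left, right)
          @left  = left
          @right = right
        end

        def bytecode(g)
          @left.bytecode(g)     # push both operands first,
          @right.bytecode(g)
          g.send_op(:+)         # then send the operator, stack-machine style
        end
      end

      ast = AddNode.new(FixnumLiteralNode.new(1), FixnumLiteralNode.new(2))   # the AST for `1 + 2`
      g = Generator.new
      ast.bytecode(g)
      p g.instructions
      # => [[:push_literal, 1], [:push_literal, 2], [:send_op, :+]]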
  • People often claim that Ruby, or dynamic languages in general, are slow.
  • Add a method useful for you on a core type. It's not always the best solution, but it's a very flexible way to handle things.
  • You can also do very nasty things.
  • So in order to improve the performance of a dynamic language, there are various techniques. One of the most basic ones that gives an easy improvement is inline caching.
  • So we have this little piece of code. First, we create a Person, then we do some other stuff and then we call the method name on it. p.name here is known as a call site.
  • So where can this method be?
    - There's the simple case of it being a method defined for all instances of the class
    - It can also be in a module included into the class
    - Another option is defining a method on the metaclass of the object. This means it's only available for this specific instance of the Person.
  • So consider this extended example with complete code. If you look at the code, you see that when running this, p.name ends up in the same method every time you execute the code. So even though Ruby is very dynamic, most of the code behaves as if it were static anyway. So the result of the expensive lookup can perhaps be stored and reused so it isn't needed each time. The caching of this method dispatch result is called inline caching.
  • So, here we need to be sure to invalidate our cache, otherwise it would run the wrong method.
  • So, now the question is, is this lookup cached too? The problem is that we want to keep the caches simple. So there's a cache entry at each call site that stores, for a specific type, the method it dispatched to and the module that method is defined in. This keeps the cache entries small so there isn't a lot of memory overhead. So we don't want to store the complete lookup chain, which means that we only have OtherObject as a type stored here. It also means that we need to invalidate the cache, because we don't know whether OtherObject includes Naming or not. Therefore invalidating caches is a brute-force measure that removes all caches with the same name, in this case "name".
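    A hypothetical Ruby model of such a call-site cache and the brute-force invalidation by method name (the real thing lives in Rubinius' C++ VM; the class and method names here are invented):

      CacheEntry = Struct.new(:receiver_class, :defining_module, :meth)

      class CallSite
        ALL = []                     # every call site, so caches can be flushed globally
        attr_reader :name

        def initialize(name)
          @name  = name
          @entry = nil
          ALL << self
        end

        def call(receiver, *args)
          klass = receiver.class
          if @entry && @entry.receiver_class == klass
            return @entry.meth.bind(receiver).call(*args)   # cache hit: lookup skipped
          end
          meth   = klass.instance_method(@name)             # cache miss: full lookup
          @entry = CacheEntry.new(klass, meth.owner, meth)
          meth.bind(receiver).call(*args)
        end

        def clear
          @entry = nil
        end

        # brute force: drop every cache entry for this method name
        def self.invalidate(name)
          ALL.each { |site| site.clear if site.name == name }
        end
      end

      class Person
        def name; "me"; end
      end

      site = CallSite.new(:name)
      p site.call(Person.new)       # miss, then cached
      p site.call(Person.new)       # hit: same class, no lookup needed
      CallSite.invalidate(:name)    # e.g. after a module defining #name is included somewhere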
  • Just In Time compilation means that code is compiled to native code during execution of your program.
  • This quote shows that using runtime information, you can optimize code a lot better than with ahead-of-time compilation alone. So we need to track this runtime information so we know what we can compile into native code.
  • So here we have a bunch of code that is executed quite often. We don't know ahead of time that we don't actually need the method2 method, but we do know during runtime. So we can use this information at runtime.
  • Each VMMethod object keeps track of various things:
    - you can see how often a method is called
    - the llvm_function_ pointer points at jitted code for this method
    - there's a name for the method of course
    - and a whole bunch of other stuff removed here
  • So right now, a method is going to be compiled after it has been executed 4000 times.
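    A small hypothetical Ruby sketch of that bookkeeping (the real counters live on Rubinius' C++ VMMethod objects; ProfiledMethod and its names are invented):

      JIT_CALL_TIL_COMPILE = 4000

      class ProfiledMethod
        attr_reader :name, :call_count

        def initialize(name, &body)
          @name       = name
          @body       = body
          @call_count = 0
          @queued     = false
        end

        def call(*args)
          @call_count += 1
          request_compilation if !@queued && @call_count >= JIT_CALL_TIL_COMPILE
          @body.call(*args)            # still interpreted here
        end

        def request_compilation
          @queued = true
          puts "#{@name} is hot after #{@call_count} calls - hand it to the JIT"
        end
      end

      m = ProfiledMethod.new(:method1) { 1 + 1 }
      5000.times { m.call }            # prints the message once, at call 4000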
  • The initial version used hand-crafted assembly to output the code for it. This was cumbersome and would need a lot of work to optimize it to a decently performing level.
  • That's why the LLVM compiler infrastructure was chosen. It had already done a huge amount of legwork in generating good native code for various platforms.
  • This is what the LLVM bytecode looks like without any optimizations. This is roughly how code is translated in Rubinius as well; there is a C++ API for creating this bytecode.
  • This is what the LLVM optimizer makes of it. It can reason about the code, so in the end the value 10 can be returned directly. LLVM can run similar passes over code generated through the C++ API.
  • So how is this implemented? The code compilation actually happens in a background thread. The virtual machine requests that a certain method is jitted, which is then handed to the LLVM thread. The LLVM thread then goes to work and creates the LLVM bytecode. After this is compiled, it sets the llvm_function_ pointer seen earlier. This means that the VM just runs along nicely without being interrupted by the compilation of code.
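    A rough sketch of that hand-off in plain Ruby, using a Thread and a Queue in place of the real C++ LLVM thread (everything here is invented for illustration):

      compile_queue = Queue.new

      jit_thread = Thread.new do
        while (vmmethod = compile_queue.pop)
          sleep 0.01                                   # pretend LLVM is compiling here
          vmmethod[:llvm_function] = "native code for #{vmmethod[:name]}"
        end
      end

      hot_method = { name: :method1, llvm_function: nil }

      # The VM thread only enqueues the request and keeps interpreting;
      # once the pointer is set, later calls would use the compiled version.
      compile_queue.push(hot_method)

      sleep 0.005 until hot_method[:llvm_function]
      puts hot_method[:llvm_function]
      jit_thread.kill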
  • Nothing is faster than no overhead. We already saw that inline caches can greatly improve method dispatching, but there's certainly room for more optimizations. One of these is code inlining, which means that method dispatch overhead is completely removed.
  • Ruby has another place that can benefit greatly from code inlining, which is the usage of blocks. A block is a construct which has its own scope and can also be captured explicitly. This means that it does add overhead, and looking at removing that overhead is very interesting.
  • So we start here with a simple piece of code. What inlining does is move the body of the called method into the caller. So instead of dispatching a method, it executes the code of that method directly.
  • So inlining has some great potential, but there are quite a few caveats.
  • We take a look at this example. Here the same method is called with different arguments, which we could perhaps inline. So we make a naive attempt on the next slide.
  • Here we manually inlined the two calls to awesome() and injected the code in place. The begin / end block is used so we have the correct scope for the inlined code. But if we look at this, this code behaves differently! Let me show this by running it. What you can see here is that the control flow is different because of the return. While the return originally returned from awesome, after inlining that method is no longer present, so the return now exits the caller. So when inlining, control flow is something that needs considering.
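    The effect can be reproduced with a self-contained version of the code from the slides further down; the two variants print different results:

      def awesome(x)
        return 0 if x == 0
        x + 1
      end

      def use_awesomeness_regular
        a = awesome(0)
        b = awesome(1)
        a + b
      end

      def use_awesomeness_inlined
        a = begin
          return 0 if 0 == 0      # this return now exits the *caller*
          0 + 1
        end
        b = begin
          return 1 if 1 == 0
          1 + 1
        end
        a + b
      end

      puts use_awesomeness_regular   # => 2
      puts use_awesomeness_inlined   # => 0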
  • This is a very simple example of how block code inlining should work. The first version is a much prettier and nicer version, and that is how Ruby code should look. The second example is equivalent, but it has the block used in the first version removed. This is what you can call block inlining in Rubinius.
  • But beware of scoping issues. Who thinks they know what happens here?
  • So there are a few reasons why code isn't inlined. These properties can be determined by analyzing the bytecode for a method. If that analysis turns up any of these issues, the code isn't inlined. What can be inlined is something that is improved over time: more code can be written to support more complex structures for inlining.
  • People are really happy when others clean up their garbage. No need to worry about it anymore. Using automatic memory management has various advantages, such as not having to worry about object ownership, double-free bugs and possible memory leaks. Ruby also uses automatic memory management and needs garbage collection.
  • A simple, naive way of doing garbage collection is to start at the roots of the object graph and go from there, going through each object and marking the objects as you go along. After this marking phase, you go through all objects again. Everything that doesn't have a mark set is freed, as it is apparently not reachable anymore.
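    A toy version of those two phases over a made-up object graph (nothing Rubinius-specific, just to make the description concrete):

      HeapObject = Struct.new(:name, :children, :marked)

      def mark(obj)
        return if obj.marked
        obj.marked = true
        obj.children.each { |child| mark(child) }
      end

      def sweep(heap)
        live, dead = heap.partition(&:marked)
        dead.each { |obj| puts "freeing #{obj.name}" }
        live.each { |obj| obj.marked = false }   # reset marks for the next cycle
        live
      end

      a = HeapObject.new("a", [], false)
      b = HeapObject.new("b", [a], false)
      c = HeapObject.new("c", [], false)         # not reachable from the roots
      heap  = [a, b, c]
      roots = [b]

      roots.each { |root| mark(root) }
      heap = sweep(heap)                          # prints "freeing c"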
  • So how can we make this faster? We can look at the properties of different objects. Young objects are often only around for a very short time, so we want a mechanism that suits this properly.
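    The young generation uses a semi-space (copying) collector. A very rough, invented sketch of the underlying idea: live objects are copied to the other half of the space, and whatever is left behind is garbage.

      YoungObject = Struct.new(:name, :children, :forward)

      class SemiSpace
        attr_reader :from_space

        def initialize
          @from_space = []    # new objects are allocated here
          @to_space   = []
        end

        def allocate(obj)
          @from_space << obj
          obj
        end

        # Copy everything reachable from the roots into to-space and swap;
        # objects that were not copied disappear with the old from-space.
        def collect(roots)
          roots.map! { |root| copy(root) }
          @from_space = @to_space
          @to_space   = []
        end

        private

        def copy(obj)
          return obj.forward if obj.forward            # already copied: reuse it
          copied      = YoungObject.new(obj.name, [], nil)
          obj.forward = copied
          @to_space << copied
          copied.children = obj.children.map { |child| copy(child) }
          copied
        end
      end

      heap  = SemiSpace.new
      kept  = heap.allocate(YoungObject.new("young", [], nil))
      heap.allocate(YoungObject.new("short-lived", [], nil))
      roots = [kept]
      heap.collect(roots)
      p heap.from_space.map(&:name)   # => ["young"]  ("short-lived" is gone)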
  • Immix uses separate blocks of memory. It allocates objects inside a block, so it has contiguous allocation for objects. This means it doesn't need a free list for allocation. This technique also limits memory fragmentation.
  • Very large objects are directly allocated in a special area. This is because you don't want to copy them, since copying is an expensive operation, and they also don't fit in the Immix blocks. This special area doesn't copy objects, but just uses a very simple mark-and-sweep algorithm.
  • This is roughly what an object looks like in memory. By default there's an instance variable table, because you can add and remove instance variables on the fly in Ruby. There's no definitive way to know which instance variables we need up front. But we can make a very good educated guess at what it will look like. We use the compiler to track all the variables we see. So if we compile the class given here, we can track all instance variables we encounter.
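    A hypothetical illustration of the difference (names invented): a table-based object needs a name-to-value map per instance, while a packed object only stores the values and shares the name-to-slot mapping on the class.

      class PackedLayout
        def initialize(ivar_names)
          @slots = {}
          ivar_names.each_with_index { |name, index| @slots[name] = index }
        end

        def slot_for(name)
          @slots[name]
        end
      end

      # shared once per class, based on the instance variables the compiler saw
      ADDRESS_LAYOUT = PackedLayout.new([:@street, :@number, :@city])

      # per instance we now only need an array of three values...
      values = Array.new(3)
      values[ADDRESS_LAYOUT.slot_for(:@street)] = "Street"
      values[ADDRESS_LAYOUT.slot_for(:@city)]   = "Enschede"

      # ...instead of a full per-object table like
      # { :@street => "Street", :@number => nil, :@city => "Enschede" }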
  • Here we see the effect on memory usage. With the instance variable table it uses 160 bytes for each object of type Address, but with the packing it only uses 56 bytes!
  • So if you want to know more, please look it up here. You can of course also ask me additional questions.
  • If you think it's interesting and want to contribute: just one accepted patch means commit access.

Presentation Transcript

  • Rubinius Use Ruby
  • Dirkjan Bussink d.bussink@gmail.com
  • 2008
  • Dynamic vs. Static
  • x = 10                int x = 10
    x = "a"               string x = "a"   => ERROR
  • Strong vs. Weak
  • x = 10
    y = "20"
    x + y
    # Javascript => 1020
    # PHP        => 30
  • Ruby
  • puts "Hello world!"
  • class MyAwesomeObject
      def initialize
        puts "Running constructor"
      end

      def cool_method(arg1, arg2)
        arg1 + arg2
      end
    end
  • [1, 2, 3, 4].each do |e|
      puts e
    end
  • module Person
      def name
        puts "Every person has name"
      end
    end

    class Student
      include Person
    end

    Student.new.name
  • Rubinius
  • Virtual Machine
  • Kernel
  • 2006: Evan Phoenix, Brian Ford
  • Virtual Machine
  • C++
  • Ruby
  • i = 0
    while i < 1000 do
      i = i + 1
    end

    0000: meta_push_0
    0001: set_local 0        # i
    0003: pop
    0004: push_local 0       # i
    0006: push_literal 1000
    0008: meta_send_op_lt :<
    0010: goto_if_false 23
    0012: push_local 0       # i
    0014: meta_push_1
    0015: meta_send_op_plus :+
    0017: set_local 0        # i
    0019: pop
    0020: check_interrupts
    0021: goto 4
    0023: push_nil
    0024: pop
    0025: push_true
    0026: ret
  • Fancy

    class Person {
      read_write_slots: [name, age, city]

      def initialize: @name age: @age city: @city {
      }

      def go_to: city {
        if: (city is_a?: City) then: {
          @city = city
        }
      }

      def to_s {
        "Person: #{@name}, #{@age} years old, living in #{@city}"
      }
    }
  • JavaScript
  • JavaScript, Python
  • JavaScript, Python, Brainfuck
  • Ruby in Ruby
  • def each
      return to_enum(:each) unless block_given?
      i = @start
      total = i + @total
      tuple = @tuple
      while i < total
        yield tuple.at(i)
        i += 1
      end
      self
    end
  • Ruby for Rubyists
  • Improve everything
  • Compiler
  • class FixnumLiteral < NumberLiteral
      def initialize(line, value)
        @line = line
        @value = value
      end

      def bytecode(g)
        pos(g)
        g.push @value
      end

      def defined(g)
        g.push_literal "expression"
      end
    end
  • Ruby is slow!
  • Flexibility
  • class Array
      def sum
        inject(0) {|total, e| total + e.to_i}
      end
    end
  • class Bignum
      def +(other)
        self - other
      end
    end
  • "The edges of the sword are life and death, no one knows which is which" Ikkyu Sojun, 15th Century Zen master
  • Hard, but not impossible
  • Inline caching
  • p = Person.new
    ...
    p.name
  • class Person
      def name
        "me"
      end
    end
  • class Person
      def name
        "me"
      end
    end

    module Named
      def name
        "named"
      end
    end

    class Person
      include Named
    end
  • class Person
      def name
        "me"
      end
    end

    module Named
      def name
        "named"
      end
    end

    class Person
      include Named
    end

    def p.name
      "specific"
    end
  • class Person
      attr_accessor :name
    end

    10000.times do
      p = Person.new
      p.name
    end
  • There are only two hard problems in Computer Science: cache invalidation, naming things and off-by-one errors
  • module Naming
      def name
        "me2"
      end
    end

    class Person
      include Naming
    end

    10000.times do
      p = Person.new
      p.name
    end
  • module Naming
      def name
        "me2"
      end
    end

    class Person
      include Naming
    end

    10000.times do
      p = Person.new
      p.name
    end

    class Person
      def name
        "new_name"
      end
    end

    10000.times do
      p = Person.new
      p.name
    end
  • p = OtherObject.new
    ...
    p.name
  • JIT
  • “...we finally managed to get our Linux (...) builds to use GCC 4.5, ... and profile guided optimization enabled” Mike Hommey - on Firefox 6 performance
  • def method1
      1 + 1
    end

    def method2
      2 + 1
    end

    10000.times do
      method1
    end
  • members of rubinius::VMMethod:
      total_args = 0,
      call_count = 21,
      llvm_function_ = 0x0,
      name_ = 0x6306,
  • static const int default_jit_call_til_compile = 4000;
  • pushl %ebp
    movl %esp, %ebp
    subl $4, %esp
    movl $10, -4(%ebp)
    leal -4(%ebp), %eax
    addl $66, (%eax)
    leave
    ret
  • #include <stdio.h>

    int func() {
      int i = 0;
      i += 10;
      return i;
    }
  • ; ModuleID = <stdin>
    target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64"
    target triple = "x86_64-apple-darwin10.7"

    define i32 @func() nounwind ssp {
    entry:
      %retval = alloca i32
      %0 = alloca i32
      %i = alloca i32
      %"alloca point" = bitcast i32 0 to i32
      store i32 0, i32* %i, align 4
      %1 = load i32* %i, align 4
      %2 = add nsw i32 %1, 10
      store i32 %2, i32* %i, align 4
      %3 = load i32* %i, align 4
      store i32 %3, i32* %0, align 4
      %4 = load i32* %0, align 4
      store i32 %4, i32* %retval, align 4
      br label %return

    return:                                ; preds = %entry
      %retval1 = load i32* %retval
      ret i32 %retval1
    }
  • ; ModuleID = <stdin>
    target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64"
    target triple = "x86_64-apple-darwin10.7"

    define i32 @func() nounwind readnone ssp {
    entry:
      ret i32 10
    }
  • [Diagram: the RBX thread(s) say "Go JIT!" to the LLVM thread, which answers "Here it is!" when compilation finishes]
  • members of rubinius::VMMethod:
      total_args = 0,
      call_count = 21,
      llvm_function_ = 0x102c07b30,
      name_ = 0x6306,
  • Code inlining
  • array = [1] * 1000000
    array.each do |element|
      puts "element: #{element}"
    end
  • def method1
      1 + 1
    end

    def method2
      method1
    end

    100.times do
      method2
    end
  • def method1
      1 + 1
    end

    def method2
      method1
    end

    100.times do      # original
      method2
    end

    100.times do      # after inlining method2 into the loop
      method1
    end

    100.times do      # after also inlining method1
      1 + 1
    end
  • def awesome(x)
      return 0 if x == 0
      x + 1
    end

    def use_awesomeness_regular
      a = awesome(0)
      b = awesome(1)
      a + b
    end
  • def use_awesomeness_inlined
      a = begin
        return 0 if 0 == 0
        0 + 1
      end
      b = begin
        return 1 if 1 == 0
        1 + 1
      end
      a + b
    end
  • array = [1] * 1000000
    array.each do |element|
      puts "element: #{element}"
    end
  • array = [1] * 1000000
    array.each do |element|
      puts "element: #{element}"
    end

    ==

    array = [1] * 1000000
    i = 0
    size = array.size
    while i < size
      puts "element: #{array[i]}"
      i += 1
    end
  • array = [1] * 100
    array.each do |element|
      puts "element: #{element}"
      b = 2
    end
    puts b
  • array = [1] * 100
    array.each do |element|
      puts "element: #{element}"
      b = 2
    end
    puts b

    !=

    array = [1] * 100
    i = 0
    size = array.size
    while i < size
      puts "element: #{array[i]}"
      b = 2
      i += 1
    end
    puts b
  • Control flow issues
  • Control flow issues / Scoping issues
  • Control flow issues / Scoping issues / Too big a piece of code
  • Garbage Collection
  • Mark & Sweep
  • Generational Garbage Collection
  • Young / Mature / Large
  • Young: Semi-space collector
  • Mature: Immix
  • Large: Mark & sweep
  • Creating less garbage
  • class Address
      attr_reader :street
      attr_reader :number
      attr_reader :city
    end
  • class Address
      attr_reader :street
      attr_reader :number
      attr_reader :city
    end

    Address.instance_variable_get("@seen_ivars")
    => [:@street, :@number, :@city]
  • a = Address.new
    a.street = "Street"
    a.number = "1"
    a.city = "Enschede"

    Rubinius.memory_size(a) => 56
    VS
    Rubinius.memory_size(a) => 160
  • What else?
  • http://rubini.us/
    https://github.com/evanphx/rubinius
  • 1 patch == commit access
  • Questions?