SlideShare a Scribd company logo
1 of 279
Download to read offline
The BondMachine Toolkit
Enabling Machine Learning on FPGA
Mirko Mariotti
Department of Physics and Geology - University of Perugia
INFN Perugia
NiPS Summer School 2019
Architectures and Algorithms for Energy-Efficient IoT and HPC
Applications
3-6 September 2019 - Perugia
Introduction
The BondMachine: a comprehensive approach to computing with
FPGA.
In this presentation i will talk about:
 Technological background of the project
 The BondMachine Project: the Architecture
 The BondMachine Project: the Tools
 Use cases
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 2/117
Demo sessions
Some topic will have a demo session.
The code will be available at:
http://bondmachine.fisica.unipg.it
Requirements:
 Linux Workstation
 Vivado
 Zedboard
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 3/117
Technological Background
Technological Background
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 4/117
Technological Background
Current challenges in computing
 Von Neumann Bottleneck:
New computational problems show that current architectural
models has to be improved or changed to address future payloads.
 Energy Efficient computation:
Not wasting resources (silicon, time, energy, instructions).
Using the right resource for the specific case
 Edge/Fog/Cloud Computing:
Making the computation where it make sense
Avoiding the transfer of unnecessary data
Creating consistent interfaces for distributed systems
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 5/117
Technological Background
Current challenges in computing
 Von Neumann Bottleneck:
New computational problems show that current architectural
models has to be improved or changed to address future payloads.
 Energy Efficient computation:
Not wasting resources (silicon, time, energy, instructions).
Using the right resource for the specific case
 Edge/Fog/Cloud Computing:
Making the computation where it make sense
Avoiding the transfer of unnecessary data
Creating consistent interfaces for distributed systems
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 5/117
Technological Background
Current challenges in computing
 Von Neumann Bottleneck:
New computational problems show that current architectural
models has to be improved or changed to address future payloads.
 Energy Efficient computation:
Not wasting resources (silicon, time, energy, instructions).
Using the right resource for the specific case
 Edge/Fog/Cloud Computing:
Making the computation where it make sense
Avoiding the transfer of unnecessary data
Creating consistent interfaces for distributed systems
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 5/117
Technological Background Programmable Logic
FPGA
What is it ?
 A field-programmable gate array (FPGA) is an integrated circuit whose
logic is re-programmable. It’s used to build reconfigurable digital circuits.

FPGAs contain an array of programmable
logic blocks, and a hierarchy of reconfig-
urable interconnects that allow the blocks
to be wired together.

Logic blocks can be configured to perform
complex combinational functions.

The FPGA configuration is generally specified using a hardware
description language (HDL).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 6/117
Technological Background Programmable Logic
FPGA
What is it ?
 A field-programmable gate array (FPGA) is an integrated circuit whose
logic is re-programmable. It’s used to build reconfigurable digital circuits.

FPGAs contain an array of programmable
logic blocks, and a hierarchy of reconfig-
urable interconnects that allow the blocks
to be wired together.

Logic blocks can be configured to perform
complex combinational functions.

The FPGA configuration is generally specified using a hardware
description language (HDL).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 6/117
Technological Background Programmable Logic
FPGA
What is it ?
 A field-programmable gate array (FPGA) is an integrated circuit whose
logic is re-programmable. It’s used to build reconfigurable digital circuits.

FPGAs contain an array of programmable
logic blocks, and a hierarchy of reconfig-
urable interconnects that allow the blocks
to be wired together.

Logic blocks can be configured to perform
complex combinational functions.

The FPGA configuration is generally specified using a hardware
description language (HDL).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 6/117
Technological Background Programmable Logic
FPGA
What is it ?
 A field-programmable gate array (FPGA) is an integrated circuit whose
logic is re-programmable. It’s used to build reconfigurable digital circuits.

FPGAs contain an array of programmable
logic blocks, and a hierarchy of reconfig-
urable interconnects that allow the blocks
to be wired together.

Logic blocks can be configured to perform
complex combinational functions.

The FPGA configuration is generally specified using a hardware
description language (HDL).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 6/117
Technological Background Programmable Logic
FPGA
Use in computing
The use of FPGA in computing is growing due several reasons:
 can potentially deliver great performance via massive parallelism
 can address payloads which are not performing well on
uniprocessors (Neural Networks, Deep Learning)
 can handle efficiently non-standard data types
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 7/117
Technological Background Programmable Logic
FPGA
Use in computing
The use of FPGA in computing is growing due several reasons:
 can potentially deliver great performance via massive parallelism
 can address payloads which are not performing well on
uniprocessors (Neural Networks, Deep Learning)
 can handle efficiently non-standard data types
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 7/117
Technological Background Programmable Logic
FPGA
Use in computing
The use of FPGA in computing is growing due several reasons:
 can potentially deliver great performance via massive parallelism
 can address payloads which are not performing well on
uniprocessors (Neural Networks, Deep Learning)
 can handle efficiently non-standard data types
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 7/117
Technological Background Programmable Logic
FPGA
Challenges in computing
On the other hand the adoption on FPGA poses several challenges:
 Porting of legacy code is usually hard.
 Interoperability with standard applications is problematic.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 8/117
Technological Background Programmable Logic
FPGA
Challenges in computing
On the other hand the adoption on FPGA poses several challenges:
 Porting of legacy code is usually hard.
 Interoperability with standard applications is problematic.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 8/117
Technological Background Computer Architectures
Computer Architectures
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 9/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background Computer Architectures
Computer Architectures
Multi-core and Heterogeneous
Today’s computer architecture are:
 Multi-core,Two or more independent actual processing units
execute multiple instructions at the same time.
 The power is given by the number of cores.
 Parallelism has to be addressed.
 Heterogeneous, different types of processing units.
 Cell, GPU, Parallela, TPU.
 The power is given by the specialization.
 The units data transfer has to be addressed.
 The payloads scheduling has to be addressed.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
Technological Background The first Idea
The BondMachine
The first idea
High level sources: Go, TensorFlow, NN, ...
Building a new kind of computer architecture (multi-core and
heterogeneous both in cores types and interconnections) which
dynamically adapt to the specific computational problem rather than
be static.
BM architecture Layer
FPGA
Concurrency
and Special-
ization
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
Technological Background The first Idea
The BondMachine
The first idea
High level sources: Go, TensorFlow, NN, ...
Building a new kind of computer architecture (multi-core and
heterogeneous both in cores types and interconnections) which
dynamically adapt to the specific computational problem rather than
be static.
BM architecture Layer
FPGA
Concurrency
and Special-
ization
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
Technological Background The first Idea
The BondMachine
The first idea
High level sources: Go, TensorFlow, NN, ...
Building a new kind of computer architecture (multi-core and
heterogeneous both in cores types and interconnections) which
dynamically adapt to the specific computational problem rather than
be static.
BM architecture Layer
FPGA
Concurrency
and Special-
ization
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
Technological Background The first Idea
The BondMachine
The first idea
High level sources: Go, TensorFlow, NN, ...
Building a new kind of computer architecture (multi-core and
heterogeneous both in cores types and interconnections) which
dynamically adapt to the specific computational problem rather than
be static.
BM architecture Layer
FPGA
Concurrency
and Special-
ization
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
Technological Background The first Idea
The BondMachine
The first idea
High level sources: Go, TensorFlow, NN, ...
Building a new kind of computer architecture (multi-core and
heterogeneous both in cores types and interconnections) which
dynamically adapt to the specific computational problem rather than
be static.
BM architecture Layer
FPGA
Concurrency
and Special-
ization
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
Technological Background Abstractions
Abstractions and Interfaces
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 12/117
Technological Background Abstractions
Layer, Abstractions and Interfaces
Introduction
A Computing system is a matter of abstraction and interfaces. A lower
layer exposes its functionalities (via interfaces) to the above layer
hiding (abstraction) its inner details.
The quality of a computing system is determined by how abstractions
are simple and how interfaces are clean.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 13/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
User mode
Kernel mode
Processor
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
User mode
Kernel mode
Processor
Register Machine
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
User mode
Kernel mode
Processor
Register Machine
Opcodes
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
User mode
Kernel mode
Kernel
Processor
Register Machine
Opcodes
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
User mode
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
User mode
Standard Library
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
Compiler/Interpreter
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
Compiler/Interpreter
Language
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
Compiler/Interpreter
Language
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Hardware
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
Compiler/Interpreter
Language
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Hardware
Software
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
Compiler/Interpreter
Language
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Hardware
Specialization
Software
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
Compiler/Interpreter
Language
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Hardware
Specialization
Software
Generalization
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
Compiler/Interpreter
Language
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Hardware
Specialization
Performance
Software
Generalization
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background Abstractions
Layers, Abstractions and Interfaces
An example
Programming language
Compiler/Interpreter
Language
User mode
Standard Library Stl calls
Kernel mode
Kernel
System calls
Processor
Register Machine
Opcodes
Transistors
Hardware
Specialization
Performance
Software
Generalization
User
friendly
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
Technological Background The second idea
Layers, Abstractions and Interfaces
The second idea
Rethinking the stack
Build a computing system with a
decreased number of layers resulting in a
minor gap between HW and SW but
keeping an user friendly way of
programming it.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 15/117
BondMachine
BondMachine
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 16/117
BondMachine
Introducing the BondMachine (BM)
The BondMachine is a software ecosystem for the dynamic generation
of computer architectures that:
 Are composed by many, possibly hundreds, computing cores.
 Have very small cores and not necessarily of the same type
(different ISA and ABI).
 Have a not fixed way of interconnecting cores.
 May have some elements shared among cores (for example channels
and shared memories).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
BondMachine
Introducing the BondMachine (BM)
The BondMachine is a software ecosystem for the dynamic generation
of computer architectures that:
 Are composed by many, possibly hundreds, computing cores.
 Have very small cores and not necessarily of the same type
(different ISA and ABI).
 Have a not fixed way of interconnecting cores.
 May have some elements shared among cores (for example channels
and shared memories).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
BondMachine
Introducing the BondMachine (BM)
The BondMachine is a software ecosystem for the dynamic generation
of computer architectures that:
 Are composed by many, possibly hundreds, computing cores.
 Have very small cores and not necessarily of the same type
(different ISA and ABI).
 Have a not fixed way of interconnecting cores.
 May have some elements shared among cores (for example channels
and shared memories).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
BondMachine
Introducing the BondMachine (BM)
The BondMachine is a software ecosystem for the dynamic generation
of computer architectures that:
 Are composed by many, possibly hundreds, computing cores.
 Have very small cores and not necessarily of the same type
(different ISA and ABI).
 Have a not fixed way of interconnecting cores.
 May have some elements shared among cores (for example channels
and shared memories).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
BondMachine
Introducing the BondMachine (BM)
The BondMachine is a software ecosystem for the dynamic generation
of computer architectures that:
 Are composed by many, possibly hundreds, computing cores.
 Have very small cores and not necessarily of the same type
(different ISA and ABI).
 Have a not fixed way of interconnecting cores.
 May have some elements shared among cores (for example channels
and shared memories).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
BondMachine
The BondMachine
An example
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 18/117
BondMachine Connecting Processors
Connecting Processor (CP)
The computational unit of the BM
The atomic computational unit of a BM is the “connecting processor”
(CP) and has:
 Some general purpose registers of size Rsize.
 Some I/O dedicated registers of size Rsize.
 A set of implemented opcodes chosen among many available.
 Dedicated ROM and RAM.
 Three possible operating modes.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
BondMachine Connecting Processors
Connecting Processor (CP)
The computational unit of the BM
The atomic computational unit of a BM is the “connecting processor”
(CP) and has:
 Some general purpose registers of size Rsize.
 Some I/O dedicated registers of size Rsize.
 A set of implemented opcodes chosen among many available.
 Dedicated ROM and RAM.
 Three possible operating modes.
General purpose registers
2R registers: r0,r1,r2,r3 ... r2R
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
BondMachine Connecting Processors
Connecting Processor (CP)
The computational unit of the BM
The atomic computational unit of a BM is the “connecting processor”
(CP) and has:
 Some general purpose registers of size Rsize.
 Some I/O dedicated registers of size Rsize.
 A set of implemented opcodes chosen among many available.
 Dedicated ROM and RAM.
 Three possible operating modes.
I/O specialized registers
N input registers: i0,i1 ... iN
M output registers: o0,o1 ... oM
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
BondMachine Connecting Processors
Connecting Processor (CP)
The computational unit of the BM
The atomic computational unit of a BM is the “connecting processor”
(CP) and has:
 Some general purpose registers of size Rsize.
 Some I/O dedicated registers of size Rsize.
 A set of implemented opcodes chosen among many available.
 Dedicated ROM and RAM.
 Three possible operating modes.
Full set of possible opcodes
add,addf,addi,chc,chw,clr,cpy,dec,divf,dpc,hit,hlt,i2r,inc,j,
je,jz,m2r,mult,multf,nop,r2m,r2o,r2s,rset,sic,s2r,saj,sub,
wrd,wwr
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
BondMachine Connecting Processors
Connecting Processor (CP)
The computational unit of the BM
The atomic computational unit of a BM is the “connecting processor”
(CP) and has:
 Some general purpose registers of size Rsize.
 Some I/O dedicated registers of size Rsize.
 A set of implemented opcodes chosen among many available.
 Dedicated ROM and RAM.
 Three possible operating modes.
RAM and ROM
 2L RAM memory cells.
 2O ROM memory cells.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
BondMachine Connecting Processors
Connecting Processor (CP)
The computational unit of the BM
The atomic computational unit of a BM is the “connecting processor”
(CP) and has:
 Some general purpose registers of size Rsize.
 Some I/O dedicated registers of size Rsize.
 A set of implemented opcodes chosen among many available.
 Dedicated ROM and RAM.
 Three possible operating modes.
Operating modes
 Full Harvard mode.
 Full Von Neuman mode.
 Hybrid mode.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
BondMachine Connecting Processors
Connecting Processor (CP)
Full set of possible opcodes
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 20/117
BondMachine Shared Objects
Shared Objects (SO)
The non-computational element of the BM
Alongside CPs, BondMachines include non-computing units called
“Shared Objects” (SO).
Examples of their purposes are:
 Data storage (Memories).
 Message passing.
 CP synchronization.
A single SO can be shared among different CPs. To use it CPs have
special instructions (opcodes) oriented to the specific SO.
Four kind of SO have been developed so far: the Channel, the
Shared Memory, the Barrier and a Pseudo Random Numbers Gen-
erator.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
BondMachine Shared Objects
Shared Objects (SO)
The non-computational element of the BM
Alongside CPs, BondMachines include non-computing units called
“Shared Objects” (SO).
Examples of their purposes are:
 Data storage (Memories).
 Message passing.
 CP synchronization.
A single SO can be shared among different CPs. To use it CPs have
special instructions (opcodes) oriented to the specific SO.
Four kind of SO have been developed so far: the Channel, the
Shared Memory, the Barrier and a Pseudo Random Numbers Gen-
erator.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
BondMachine Shared Objects
Shared Objects (SO)
The non-computational element of the BM
Alongside CPs, BondMachines include non-computing units called
“Shared Objects” (SO).
Examples of their purposes are:
 Data storage (Memories).
 Message passing.
 CP synchronization.
A single SO can be shared among different CPs. To use it CPs have
special instructions (opcodes) oriented to the specific SO.
Four kind of SO have been developed so far: the Channel, the
Shared Memory, the Barrier and a Pseudo Random Numbers Gen-
erator.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
BondMachine Shared Objects
Shared Objects (SO)
The non-computational element of the BM
Alongside CPs, BondMachines include non-computing units called
“Shared Objects” (SO).
Examples of their purposes are:
 Data storage (Memories).
 Message passing.
 CP synchronization.
A single SO can be shared among different CPs. To use it CPs have
special instructions (opcodes) oriented to the specific SO.
Four kind of SO have been developed so far: the Channel, the
Shared Memory, the Barrier and a Pseudo Random Numbers Gen-
erator.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
BondMachine Shared Objects
Shared Objects (SO)
The non-computational element of the BM
Alongside CPs, BondMachines include non-computing units called
“Shared Objects” (SO).
Examples of their purposes are:
 Data storage (Memories).
 Message passing.
 CP synchronization.
A single SO can be shared among different CPs. To use it CPs have
special instructions (opcodes) oriented to the specific SO.
Four kind of SO have been developed so far: the Channel, the
Shared Memory, the Barrier and a Pseudo Random Numbers Gen-
erator.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
BondMachine Shared Objects
Shared Objects (SO)
The non-computational element of the BM
Alongside CPs, BondMachines include non-computing units called
“Shared Objects” (SO).
Examples of their purposes are:
 Data storage (Memories).
 Message passing.
 CP synchronization.
A single SO can be shared among different CPs. To use it CPs have
special instructions (opcodes) oriented to the specific SO.
Four kind of SO have been developed so far: the Channel, the
Shared Memory, the Barrier and a Pseudo Random Numbers Gen-
erator.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
BondMachine Shared Objects
Channel
The Channel SO is an hardware implementation of the CSP (communi-
cating sequential processes) channel.
It is a model for inter-core communication and synchronization via mes-
sage passing.
CPs use channels via 4 opcodes
 wrd: Want Read.
 wwr: Want Write.
 chc: Channel Check.
 chw: Channel Wait.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 22/117
BondMachine Shared Objects
Channel
The Channel SO is an hardware implementation of the CSP (communi-
cating sequential processes) channel.
It is a model for inter-core communication and synchronization via mes-
sage passing.
CPs use channels via 4 opcodes
 wrd: Want Read.
 wwr: Want Write.
 chc: Channel Check.
 chw: Channel Wait.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 22/117
BondMachine Shared Objects
Channel
The Channel SO is an hardware implementation of the CSP (communi-
cating sequential processes) channel.
It is a model for inter-core communication and synchronization via mes-
sage passing.
CPs use channels via 4 opcodes
 wrd: Want Read.
 wwr: Want Write.
 chc: Channel Check.
 chw: Channel Wait.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 22/117
BondMachine Shared Objects
Shared Memory
The Shared Memory SO is a RAM block accessible from more than one
CP.
Different Shared Memories can be used by different CP and not neces-
sarily by all of them.
CPs use shared memories via 2 opcodes
 s2r: Shared memory read.
 r2s: Shared memory write.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 23/117
BondMachine Shared Objects
Shared Memory
The Shared Memory SO is a RAM block accessible from more than one
CP.
Different Shared Memories can be used by different CP and not neces-
sarily by all of them.
CPs use shared memories via 2 opcodes
 s2r: Shared memory read.
 r2s: Shared memory write.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 23/117
BondMachine Shared Objects
Shared Memory
The Shared Memory SO is a RAM block accessible from more than one
CP.
Different Shared Memories can be used by different CP and not neces-
sarily by all of them.
CPs use shared memories via 2 opcodes
 s2r: Shared memory read.
 r2s: Shared memory write.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 23/117
BondMachine Shared Objects
Barrier
The Barrier SO is used to make CPs act synchronously.
When a CP hits a barrier, the execution stop until all the CPs that share
the same barrier hit it.
CPs use barriers via 1 opcode
 hit: Hit the barrier.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 24/117
BondMachine Shared Objects
Barrier
The Barrier SO is used to make CPs act synchronously.
When a CP hits a barrier, the execution stop until all the CPs that share
the same barrier hit it.
CPs use barriers via 1 opcode
 hit: Hit the barrier.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 24/117
BondMachine Shared Objects
Barrier
The Barrier SO is used to make CPs act synchronously.
When a CP hits a barrier, the execution stop until all the CPs that share
the same barrier hit it.
CPs use barriers via 1 opcode
 hit: Hit the barrier.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 24/117
BondMachine Shared Objects
Multicore and Heterogeneous
First idea on the BondMachine
The idea was:
Having a multi-core architecture completely heterogeneous both in cores types and interconnections.
The BondMachine may have many
cores, eventually all different, arbitrarily
interconnected and sharing non
computing elements.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 25/117
Architectures Handling
Architectures Handling
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 26/117
Architectures Handling
Handle BM computer architectures
The BM computer architecture is managed by a set of tools to:
 build a specify architecture
 modify a pre-existing architecture
 simulate or emulate the behavior
 Generate the Hardware Description Code (HDL)
Processor Builder
Selects the single pro-
cessor, assembles and
disassembles, saves on
disk as JSON, creates
the HDL code of a CP
BondMachine Builder
Connects CPs and SOs
together in custom
topologies, loads and
saves on disk as JSON,
create BM’s HDL code
Simulation Framework
Simulates the be-
haviour, emulates a
BM on a standard
Linux workstation
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 27/117
Architectures Handling
Handle BM computer architectures
The BM computer architecture is managed by a set of tools to:
 build a specify architecture
 modify a pre-existing architecture
 simulate or emulate the behavior
 Generate the Hardware Description Code (HDL)
Processor Builder
Selects the single pro-
cessor, assembles and
disassembles, saves on
disk as JSON, creates
the HDL code of a CP
BondMachine Builder
Connects CPs and SOs
together in custom
topologies, loads and
saves on disk as JSON,
create BM’s HDL code
Simulation Framework
Simulates the be-
haviour, emulates a
BM on a standard
Linux workstation
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 27/117
Architectures Handling
Handle BM computer architectures
The BM computer architecture is managed by a set of tools to:
 build a specify architecture
 modify a pre-existing architecture
 simulate or emulate the behavior
 Generate the Hardware Description Code (HDL)
Processor Builder
Selects the single pro-
cessor, assembles and
disassembles, saves on
disk as JSON, creates
the HDL code of a CP
BondMachine Builder
Connects CPs and SOs
together in custom
topologies, loads and
saves on disk as JSON,
create BM’s HDL code
Simulation Framework
Simulates the be-
haviour, emulates a
BM on a standard
Linux workstation
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 27/117
Architectures Handling
Handle BM computer architectures
The BM computer architecture is managed by a set of tools to:
 build a specify architecture
 modify a pre-existing architecture
 simulate or emulate the behavior
 Generate the Hardware Description Code (HDL)
Processor Builder
Selects the single pro-
cessor, assembles and
disassembles, saves on
disk as JSON, creates
the HDL code of a CP
BondMachine Builder
Connects CPs and SOs
together in custom
topologies, loads and
saves on disk as JSON,
create BM’s HDL code
Simulation Framework
Simulates the be-
haviour, emulates a
BM on a standard
Linux workstation
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 27/117
Architectures Handling
Processor Builder
Procbuilder is the CP manipulation tool.
CP Creation
CP Load/Save
CP Assembler/Disassembler
CP HDL
Examples
(32 bit registers counter machine)
procbuilder -register-size 32 -opcodes clr,cpy,dec,inc,je,jz
(Input and Output registers)
procbuilder -inputs 3 -outputs 2 ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 28/117
Architectures Handling
Processor Builder
Procbuilder is the CP manipulation tool.
CP Creation
CP Load/Save
CP Assembler/Disassembler
CP HDL
Examples
(Loading a CP)
procbuilder -load-machine conproc.json ...
(Saving a CP)
procbuilder -save-machine conproc.json ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 28/117
Architectures Handling
Processor Builder
Procbuilder is the CP manipulation tool.
CP Creation
CP Load/Save
CP Assembler/Disassembler
CP HDL
Examples
(Assembiling)
procbuilder -input-assembly program.asm ...
(Disassembling)
procbuilder -show-program-disassembled ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 28/117
Architectures Handling
Processor Builder
Procbuilder is the CP manipulation tool.
CP Creation
CP Load/Save
CP Assembler/Disassembler
CP HDL
Examples
(Create the CP HDL code in Verilog)
procbuilder -create-verilog ...
(Create testbench)
procbuilder -create-verilog-testbench test.v ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 28/117
Architectures Handling
Procbuilder demo
Demo N.1
It will be shown how:
 To create a simple processor
 To assemble and disassemble code for it
 To produce its HDL code
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 29/117
Architectures Handling
BondMachine Builder
Bondmachine is the tool that compose CP and SO to form
BondMachines.
BM CP insert and remove
BM SO insert and remove
BM Inputs and Outputs
BM Bonding Processors and/or IO
BM Visualizing or HDL
Examples
(Add a processor)
bondmachine -add-domains proc.json ... ; ... -add-processor 0
(Remove a processor)
bondmachine -bondmachine-file bmach.json -del-processor n
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
Architectures Handling
BondMachine Builder
Bondmachine is the tool that compose CP and SO to form
BondMachines.
BM CP insert and remove
BM SO insert and remove
BM Inputs and Outputs
BM Bonding Processors and/or IO
BM Visualizing or HDL
Examples
(Add a Shared Object)
bondmachine -add-shared-objects specs ...
(Connect an SO to a processor)
bondmachine -connect-processor-shared-object ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
Architectures Handling
BondMachine Builder
Bondmachine is the tool that compose CP and SO to form
BondMachines.
BM CP insert and remove
BM SO insert and remove
BM Inputs and Outputs
BM Bonding Processors and/or IO
BM Visualizing or HDL
Examples
(Adding inputs or outputs)
bondmachine -add-inputs ... ; bondmachine -add-outputs ...
(Removing inputs or outputs)
bondmachine -del-input ... ; bondmachine -del-output ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
Architectures Handling
BondMachine Builder
Bondmachine is the tool that compose CP and SO to form
BondMachines.
BM CP insert and remove
BM SO insert and remove
BM Inputs and Outputs
BM Bonding Processors and/or IO
BM Visualizing or HDL
Examples
(Bonding processor)
bondmachine -add-bond p0i2,p1o4 ...
(Bonding IO)
bondmachine -add-bond i2,p0i6 ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
Architectures Handling
BondMachine Builder
Bondmachine is the tool that compose CP and SO to form
BondMachines.
BM CP insert and remove
BM SO insert and remove
BM Inputs and Outputs
BM Bonding Processors and/or IO
BM Visualizing or HDL
Examples
(Visualizing)
bondmachine -emit-dot ...
(Create HDL code)
bondmachine -create-verilog ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
Architectures Handling
BondMachine demo
Demo N.2
It will be shown how:
 To create a single-core BondMachine
 To attach an external output
 To produce its HDL code
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 31/117
Architectures Handling Toolchains
Toolchains
A set of toolchains allow the build and the direct deploy to a target
device of BondMachines.
Bondgo Toolchain example
A file local.mk contains references to the source code as well all the
build necessities.
make bondmachine creates the JSON representation of the BM and
assemble its code.
make show displays a graphical representation of the BM.
make simulate start a simulation.
make videosim create a simulation video.
make flash the device into the destination target.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 32/117
Architectures Handling Toolchains
BondMachine demo
Demo N.3
It will be shown how:
 To explore the toolchain
 To flash the board with the code from the previous example
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 33/117
Architectures Handling Toolchains
BondMachine demo
Demo N.4
It will be shown how:
 To build a BondMachine with a processor and a shared object
 To flash the board
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 34/117
Architectures Handling Toolchains
BondMachine demo
Demo N.5
It will be shown how:
 To build a dual-core BondMachine
 To connect cores
 To flash the board
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 35/117
Architectures Handling Web frontend
BondMachine web front-end
Operations on BondMachines can also be performed via an under
development web framework
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 36/117
Architectures Handling Simulation
Simulation
An important feature of the tools is the possibility of simulating
BondMachine behavior.
An event input file describes how BondMachines elements has to change
during the simulation timespan and which one has to be be reported.
The simulator can produce results in the form of:
 Activity log of the BM internal.
 Graphical representation of the simulation.
 Report file with quantitative data. Useful to construct metrics
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 37/117
Architectures Handling Simulation
Simulation
An important feature of the tools is the possibility of simulating
BondMachine behavior.
An event input file describes how BondMachines elements has to change
during the simulation timespan and which one has to be be reported.
The simulator can produce results in the form of:
 Activity log of the BM internal.
 Graphical representation of the simulation.
 Report file with quantitative data. Useful to construct metrics
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 37/117
Architectures Handling Simulation
Simulation
An important feature of the tools is the possibility of simulating
BondMachine behavior.
An event input file describes how BondMachines elements has to change
during the simulation timespan and which one has to be be reported.
The simulator can produce results in the form of:
 Activity log of the BM internal.
 Graphical representation of the simulation.
 Report file with quantitative data. Useful to construct metrics
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 37/117
Architectures Handling Simulation
Simulation
examples
Activity log example:
Simulation Youtube video
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 38/117
Architectures Handling Simulation
Simulation demo
Demo N.6
It will be shown how:
 To show the simulation capabilities of the framework
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 39/117
Architectures Handling Simulation
Emulation
The same engine that simulate BondMachines can be used as emulator.
Through the emulator BondMachines can be used on Linux
workstations.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 40/117
Architectures Molding
Architectures Molding
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 41/117
Architectures Molding API and Libraries
Molding the BondMachine
As stated before BondMachines are not general purpose architectures,
and to be effective have to be shaped according the specific problem.
Several methods (apart from writing in assembly and building a Bond-
Machine from scratch) have been developed to do that:
 bondgo: A new type of compiler that create not only the CPs
assembly but also the architecture itself.
 A set of API to create BondMachine to fit a specific computational
problems.
 An Evolutionary Computation framework to “grow” BondMachines
according some fitness function via simulation.
 A set of tools to use BondMachine in Machine Learning.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
Architectures Molding API and Libraries
Molding the BondMachine
As stated before BondMachines are not general purpose architectures,
and to be effective have to be shaped according the specific problem.
Several methods (apart from writing in assembly and building a Bond-
Machine from scratch) have been developed to do that:
 bondgo: A new type of compiler that create not only the CPs
assembly but also the architecture itself.
 A set of API to create BondMachine to fit a specific computational
problems.
 An Evolutionary Computation framework to “grow” BondMachines
according some fitness function via simulation.
 A set of tools to use BondMachine in Machine Learning.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
Architectures Molding API and Libraries
Molding the BondMachine
As stated before BondMachines are not general purpose architectures,
and to be effective have to be shaped according the specific problem.
Several methods (apart from writing in assembly and building a Bond-
Machine from scratch) have been developed to do that:
 bondgo: A new type of compiler that create not only the CPs
assembly but also the architecture itself.
 A set of API to create BondMachine to fit a specific computational
problems.
 An Evolutionary Computation framework to “grow” BondMachines
according some fitness function via simulation.
 A set of tools to use BondMachine in Machine Learning.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
Architectures Molding API and Libraries
Molding the BondMachine
As stated before BondMachines are not general purpose architectures,
and to be effective have to be shaped according the specific problem.
Several methods (apart from writing in assembly and building a Bond-
Machine from scratch) have been developed to do that:
 bondgo: A new type of compiler that create not only the CPs
assembly but also the architecture itself.
 A set of API to create BondMachine to fit a specific computational
problems.
 An Evolutionary Computation framework to “grow” BondMachines
according some fitness function via simulation.
 A set of tools to use BondMachine in Machine Learning.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
Architectures Molding API and Libraries
Molding the BondMachine
As stated before BondMachines are not general purpose architectures,
and to be effective have to be shaped according the specific problem.
Several methods (apart from writing in assembly and building a Bond-
Machine from scratch) have been developed to do that:
 bondgo: A new type of compiler that create not only the CPs
assembly but also the architecture itself.
 A set of API to create BondMachine to fit a specific computational
problems.
 An Evolutionary Computation framework to “grow” BondMachines
according some fitness function via simulation.
 A set of tools to use BondMachine in Machine Learning.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
Architectures Molding API and Libraries
Molding the BondMachine
As stated before BondMachines are not general purpose architectures,
and to be effective have to be shaped according the specific problem.
Several methods (apart from writing in assembly and building a Bond-
Machine from scratch) have been developed to do that:
 bondgo: A new type of compiler that create not only the CPs
assembly but also the architecture itself.
 A set of API to create BondMachine to fit a specific computational
problems.
 An Evolutionary Computation framework to “grow” BondMachines
according some fitness function via simulation.
 A set of tools to use BondMachine in Machine Learning.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
Architectures Molding API and Libraries
Use the BM computer architecture
Mapping specific computational problems to BMs
Symbond
Map symbolic
mathematical
expressions to BM
Boolbond
Map boolean systems
to BM
Matrixwork
Basic matrix
computation
Evolutive BM
Evolutionary
computing to BM
Bondgo
The architecture
compiler
ML tools
Map computational
graphs to BM
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
Architectures Molding API and Libraries
Use the BM computer architecture
Mapping specific computational problems to BMs
Symbond
Map symbolic
mathematical
expressions to BM
Boolbond
Map boolean systems
to BM
Matrixwork
Basic matrix
computation
Evolutive BM
Evolutionary
computing to BM
Bondgo
The architecture
compiler
ML tools
Map computational
graphs to BM
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
Architectures Molding API and Libraries
Use the BM computer architecture
Mapping specific computational problems to BMs
Symbond
Map symbolic
mathematical
expressions to BM
Boolbond
Map boolean systems
to BM
Matrixwork
Basic matrix
computation
Evolutive BM
Evolutionary
computing to BM
Bondgo
The architecture
compiler
ML tools
Map computational
graphs to BM
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
Architectures Molding API and Libraries
Use the BM computer architecture
Mapping specific computational problems to BMs
Symbond
Map symbolic
mathematical
expressions to BM
Boolbond
Map boolean systems
to BM
Matrixwork
Basic matrix
computation
Evolutive BM
Evolutionary
computing to BM
Bondgo
The architecture
compiler
ML tools
Map computational
graphs to BM
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
Architectures Molding API and Libraries
Use the BM computer architecture
Mapping specific computational problems to BMs
Symbond
Map symbolic
mathematical
expressions to BM
Boolbond
Map boolean systems
to BM
Matrixwork
Basic matrix
computation
Evolutive BM
Evolutionary
computing to BM
Bondgo
The architecture
compiler
ML tools
Map computational
graphs to BM
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
Architectures Molding API and Libraries
Use the BM computer architecture
Mapping specific computational problems to BMs
Symbond
Map symbolic
mathematical
expressions to BM
Boolbond
Map boolean systems
to BM
Matrixwork
Basic matrix
computation
Evolutive BM
Evolutionary
computing to BM
Bondgo
The architecture
compiler
ML tools
Map computational
graphs to BM
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
Architectures Molding API and Libraries
Use the BM computer architecture
Mapping specific computational problems to BMs
Symbond
Map symbolic
mathematical
expressions to BM
Boolbond
Map boolean systems
to BM
Matrixwork
Basic matrix
computation
Evolutive BM
Evolutionary
computing to BM
Bondgo
The architecture
compiler
ML tools
Map computational
graphs to BM
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
Architectures Molding Bondgo
Bondgo
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 44/117
Architectures Molding Bondgo
Bondgo
The major innovation of the BondMachine Project is its compiler.
Bondgo is the name chosen for the compiler developed for the
BondMachine.
The compiler source language is Go as the name suggest.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 45/117
Architectures Molding Bondgo
Bondgo
This is the standard flow when building computer programs
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 46/117
Architectures Molding Bondgo
Bondgo
This is the standard flow when building computer programs
high level language source
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 46/117
Architectures Molding Bondgo
Bondgo
This is the standard flow when building computer programs
high level language source
assembly file
Compiling
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 46/117
Architectures Molding Bondgo
Bondgo
This is the standard flow when building computer programs
high level language source
assembly file
Compiling
machine code
Assembling
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 46/117
Architectures Molding Bondgo
Bondgo
bondgo loop example
package main
import ()
func main () {
var reg_aa uint8
var reg_ab uint8
for reg_aa = 10; reg_aa  0; reg_aa -- {
reg_ab = reg_aa
break
}
}
bondgo loop example in asm
clr aa
clr ab
rset ac 10
cpy aa ac
cpy ac aa
jz ac 11
cpy ac aa
cpy ab ac
j 11
dec aa
j 4
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 47/117
Architectures Molding Bondgo
Bondgo
Bondgo does something different from standard compilers ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
Architectures Molding Bondgo
Bondgo
Bondgo does something different from standard compilers ...
high level GO source
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
Architectures Molding Bondgo
Bondgo
Bondgo does something different from standard compilers ...
high level GO source
assembly file
Compiling
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
Architectures Molding Bondgo
Bondgo
Bondgo does something different from standard compilers ...
high level GO source
assembly file
Compiling
CP specs
Arch generating
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
Architectures Molding Bondgo
Bondgo
Bondgo does something different from standard compilers ...
high level GO source
assembly file
Compiling
CP specs
Arch generating
machine code
Assembling
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
Architectures Molding Bondgo
Bondgo
Bondgo does something different from standard compilers ...
high level GO source
assembly file
Compiling
CP specs
Arch generating
machine code
Assembling
Processor implementation
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
Architectures Molding Bondgo
Bondgo
Bondgo does something different from standard compilers ...
high level GO source
assembly file
Compiling
CP specs
Arch generating
machine code
Assembling
Processor implementation
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo
A first example
counter go source
package main
import (
bondgo
)
func main () {
var outgoing bondgo.Output
var reg_int uint8
outgoing =
bondgo.Make(bondgo.Output , 3)
reg_int = 0
for {
bondgo.IOWrite(outgoing ,
reg_int)
reg_int ++
}
}
bondgo –input-file counter.go
-mpm
BM JSON rapresentation
counter asm
clr r0
rset r1 0
cpy r0 r1
cpy r1 r0
r2o r1 o0
inc r0
j 3
bondmachine tool
BM HDL code
FPGA
procbuilder tool
Binary
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
Architectures Molding Bondgo
Bondgo demo
Demo N.7
It will be shown how:
 To create a BondMachine from a Go source file
 To build the architecture
 To build the program
 To create the firmware and flash it to the board
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 50/117
Architectures Molding Bondgo
Bondgo
... bondgo may not only create the binaries, but also the CP
architecture, and ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 51/117
Architectures Molding Bondgo
Bondgo
... it can do even much more interesting things when compiling
concurrent programs.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
Architectures Molding Bondgo
Bondgo
... it can do even much more interesting things when compiling
concurrent programs.
high level GO source
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
Architectures Molding Bondgo
Bondgo
... it can do even much more interesting things when compiling
concurrent programs.
high level GO source
assembly CP 1
assembly CP 2
assembly CP 3
assembly CP 4
assembly CP ...
assembly CP N
Compiling
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
Architectures Molding Bondgo
Bondgo
... it can do even much more interesting things when compiling
concurrent programs.
high level GO source
assembly CP 1
assembly CP 2
assembly CP 3
assembly CP 4
assembly CP ...
assembly CP N
Compiling
CP 1 specs
CP 2
CP 3
CP 4
CP ...
CP N
Archs generating
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
Architectures Molding Bondgo
Bondgo
... it can do even much more interesting things when compiling
concurrent programs.
high level GO source
assembly CP 1
assembly CP 2
assembly CP 3
assembly CP 4
assembly CP ...
assembly CP N
Compiling
CP 1 specs
CP 2
CP 3
CP 4
CP ...
CP N
Archs generating
Asm and Binaries
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
Architectures Molding Bondgo
Bondgo
... it can do even much more interesting things when compiling
concurrent programs.
high level GO source
assembly CP 1
assembly CP 2
assembly CP 3
assembly CP 4
assembly CP ...
assembly CP N
Compiling
CP 1 specs
CP 2
CP 3
CP 4
CP ...
CP N
Archs generating
Asm and Binaries
Interconnections
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
Architectures Molding Bondgo
Bondgo
... it can do even much more interesting things when compiling
concurrent programs.
high level GO source
assembly CP 1
assembly CP 2
assembly CP 3
assembly CP 4
assembly CP ...
assembly CP N
Compiling
CP 1 specs
CP 2
CP 3
CP 4
CP ...
CP N
Archs generating
Asm and Binaries
BondMachine
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
Architectures Molding Bondgo
Bondgo
A multi-core example
multi-core counter
package main
import (
bondgo
)
func pong () {
var in0 bondgo.Input
var out0 bondgo.Output
in0 = bondgo.Make(bondgo.Input , 3)
out0 = bondgo.Make(bondgo.Output , 5)
for {
bondgo.IOWrite(out0 , bondgo.IORead(in0)+1)
}
}
func main () {
var in0 bondgo.Input
var out0 bondgo.Output
in0 = bondgo.Make(bondgo.Input , 5)
out0 = bondgo.Make(bondgo.Output , 3)
device_0:
go pong ()
for {
bondgo.IOWrite(out0 , bondgo.IORead(in0))
}
}
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 53/117
Architectures Molding Bondgo
Bondgo
A multi-core example
Compiling the code with the bondgo compiler:
bondgo -input-file ds.go -mpm
The toolchain perform the following steps:
 Map the two goroutines to two hardware cores.
 Creates two types of core, each one optimized to execute the
assigned goroutine.
 Creates the two binaries.
 Connected the two core as inferred from the source code, using
special IO registers.
The result is a multicore BondMachine:
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 54/117
Architectures Molding Bondgo
Bondgo
A multi-core example
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 55/117
Architectures Molding Bondgo
Simulation demo
Demo N.8
It will be shown how:
 To use bondgo to create a chain of interconnected processors
 To flash the firmware to the board
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 56/117
Architectures Molding Bondgo
Compiling Architectures
One of the most important result
The architecture creation is a part of the compilation process.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 57/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Bondgo
Go in hardware
Bondgo implements a sort of “Go in hardware”.
High level Go source code is directly mapped to interconnected
processors without Operating Systems or runtimes.
Goroutines Cores
Channels Channel SOs
Variables (local) CP RAM segments
Variables (by val) CH messages or IO regs
Variables (by ref) Shared RAM segments
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
Architectures Molding Bondgo
Go in hardware
Second idea on the BondMachine
The idea was:
Build a computing system with a decreased number of layers resulting in a lower HW/SW gap.
This would raise the overall performances yet keeping an user friendly way of programming.
Between HW and SW there is only the
processor abstraction, no Operating
System nor runtimes. Despite that
programming is done at high level.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 59/117
Architectures Molding Bondgo
Layers, Abstractions and Interfaces
and BondMachines
Programming language
Compiler
Language
BondMachine
Register Machine
Opcodes
Transistors
Hardware
Specialization
Performance
Software
Generalization
User
friendly
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 60/117
Architectures Molding Bondgo
Bondgo
An example
bondgo stream processing example
package main
import (
bondgo
)
func streamprocessor (a *[] uint8 , b *[] uint8 ,
c *[] uint8 , gid uint8) {
(*c)[gid] = (*a)[gid] + (*b)[gid]
}
func main () {
a := make ([] uint8 , 256)
b := make ([] uint8 , 256)
c := make ([] uint8 , 256)
// ... some a and b values fill
for i := 0; i  256; i++ {
go streamprocessor (a, b, c, uint8(i))
}
}
The compilation of this example results in the creation of a 257 CPs where 256 are the stream
processors executing the code in the function called streamprocessor, and one is the coordinating
CP. Each stream processor is optimized and capable only to make additions since it is the only
operation requested by the source code. The three slices created on the main function are passed
by reference to the Goroutines then a shared RAM is created by the Bondgo compiler available
to the generated CPs.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 61/117
Architectures Molding Bondgo
Bondgo
Abstract Assembly
The Assembly language for the BM has been kept as independent as
possible from the particular CP.
Given a specific piece of assembly code Bondgo has the ability to
compute the “minimum CP” that can execute that code.
These are Building Blocks for complex BondMachines.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 62/117
Architectures Molding Builders API
Builders API
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 63/117
Architectures Molding Builders API
Builders API
With these Building Blocks
Several libraries have been developed to map specific problems on
BondMachines:
 Symbond, to handle mathematical expression.
 Boolbond, to map boolean expression.
 Matrixwork, to perform matrices operations.
 Neuralbond, to use neural networks.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
Architectures Molding Builders API
Builders API
With these Building Blocks
Several libraries have been developed to map specific problems on
BondMachines:
 Symbond, to handle mathematical expression.
 Boolbond, to map boolean expression.
 Matrixwork, to perform matrices operations.
 Neuralbond, to use neural networks.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
Architectures Molding Builders API
Builders API
With these Building Blocks
Several libraries have been developed to map specific problems on
BondMachines:
 Symbond, to handle mathematical expression.
 Boolbond, to map boolean expression.
 Matrixwork, to perform matrices operations.
 Neuralbond, to use neural networks.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
Architectures Molding Builders API
Builders API
With these Building Blocks
Several libraries have been developed to map specific problems on
BondMachines:
 Symbond, to handle mathematical expression.
 Boolbond, to map boolean expression.
 Matrixwork, to perform matrices operations.
 Neuralbond, to use neural networks.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
Architectures Molding Builders API
Builders API
With these Building Blocks
Several libraries have been developed to map specific problems on
BondMachines:
 Symbond, to handle mathematical expression.
 Boolbond, to map boolean expression.
 Matrixwork, to perform matrices operations.
 Neuralbond, to use neural networks.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
Architectures Molding Builders API
Builders API
Symbond
A mathematical expression, or a system can be converted to a
BondMachine:
sum(var(x),const(2))
Boolbond
symbond -expression sum(var(x),const(2)) -save-bondmachine
bondmachine.json
Resulting in:
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 65/117
Architectures Molding Builders API
Builders API
Symbond
A mathematical expression, or a system can be converted to a
BondMachine:
sum(var(x),const(2))
Boolbond
symbond -expression sum(var(x),const(2)) -save-bondmachine
bondmachine.json
Resulting in:
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 65/117
Architectures Molding Builders API
Builders API
Symbond
A mathematical expression, or a system can be converted to a
BondMachine:
sum(var(x),const(2))
Boolbond
symbond -expression sum(var(x),const(2)) -save-bondmachine
bondmachine.json
Resulting in:
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 65/117
Architectures Molding Builders API
Builders API
Symbond
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 66/117
Architectures Molding Builders API
Builders API
Boolbond
A system of boolean equations, input and output variables are
expressed as in the example file:
var(z)=or(var(x),not(var(y)))
var(t)=or(and(var(x),var(y)),var(z))
var(l)=and(xor(var(x),var(y)),var(t))
i:var(x)
i:var(y)
o:var(z)
o:var(t)
o:var(l)
Boolbond
boolbond -system-file expression.txt -save-bondmachine
bondmachine.json
Resulting in:
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 67/117
Architectures Molding Builders API
Builders API
Boolbond
A system of boolean equations, input and output variables are
expressed as in the example file:
var(z)=or(var(x),not(var(y)))
var(t)=or(and(var(x),var(y)),var(z))
var(l)=and(xor(var(x),var(y)),var(t))
i:var(x)
i:var(y)
o:var(z)
o:var(t)
o:var(l)
Boolbond
boolbond -system-file expression.txt -save-bondmachine
bondmachine.json
Resulting in:
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 67/117
Architectures Molding Builders API
Builders API
Boolbond
A system of boolean equations, input and output variables are
expressed as in the example file:
var(z)=or(var(x),not(var(y)))
var(t)=or(and(var(x),var(y)),var(z))
var(l)=and(xor(var(x),var(y)),var(t))
i:var(x)
i:var(y)
o:var(z)
o:var(t)
o:var(l)
Boolbond
boolbond -system-file expression.txt -save-bondmachine
bondmachine.json
Resulting in:
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 67/117
Architectures Molding Builders API
Builders API
Boolbond
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 68/117
Architectures Molding Builders API
Boolbond demo
Demo N.9
It will be shown how:
 To create complex multi-cores from boolean expressions
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 69/117
Architectures Molding Builders API
Builders API
Matrixwork
Matrix multiplication
if mymachine, ok := matrixwork.Build_M(n, t); ok == nil ...
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 70/117
Architectures Molding Evolutionary BondMachine
Evolutionary BondMachine
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 71/117
Architectures Molding Evolutionary BondMachine
Evolutionary BondMachine
Find an architecture that
solve a problem
Simulation
Framework
Metrics to check
how well a BM
solve a problem
Building Blocks
of the BM
Population of
BondMachines
Genetically Evolved
Architectures
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
Architectures Molding Evolutionary BondMachine
Evolutionary BondMachine
Find an architecture that
solve a problem
Simulation
Framework
Metrics to check
how well a BM
solve a problem
Building Blocks
of the BM
Population of
BondMachines
Genetically Evolved
Architectures
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
Architectures Molding Evolutionary BondMachine
Evolutionary BondMachine
Find an architecture that
solve a problem
Simulation
Framework
Metrics to check
how well a BM
solve a problem
Building Blocks
of the BM
Population of
BondMachines
Genetically Evolved
Architectures
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
Architectures Molding Evolutionary BondMachine
Evolutionary BondMachine
Find an architecture that
solve a problem
Simulation
Framework
Metrics to check
how well a BM
solve a problem
Building Blocks
of the BM
Population of
BondMachines
Genetically Evolved
Architectures
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
Architectures Molding Evolutionary BondMachine
Evolutionary BondMachine
Find an architecture that
solve a problem
Simulation
Framework
Metrics to check
how well a BM
solve a problem
Building Blocks
of the BM
Population of
BondMachines
Genetically Evolved
Architectures
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
Architectures Molding Evolutionary BondMachine
Evolutionary BondMachine
Find an architecture that
solve a problem
Simulation
Framework
Metrics to check
how well a BM
solve a problem
Building Blocks
of the BM
Population of
BondMachines
Genetically Evolved
Architectures
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
Architectures Molding Machine Learning with BondMachine section
Machine Learning
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 73/117
Architectures Molding Machine Learning with BondMachine section
Machine Learning with BondMachine
Architectures with multiple interconnected processors like the ones
produced by the BondMachine Toolkit are a perfect fit for Neural
Networks and Computational Graphs.
Several ways to map this structures to BondMachine has been
developed:
 A native Neural Network library
 A Tensorflow to BondMachine translator
 An NNEF based BondMachine composer
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 74/117
Architectures Molding Machine Learning with BondMachine section
Machine Learning with BondMachine
Architectures with multiple interconnected processors like the ones
produced by the BondMachine Toolkit are a perfect fit for Neural
Networks and Computational Graphs.
Several ways to map this structures to BondMachine has been
developed:
 A native Neural Network library
 A Tensorflow to BondMachine translator
 An NNEF based BondMachine composer
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 74/117
Architectures Molding Bondmachine native Neural Network library
Machine Learning with BondMachine
Native Neural Network library
The tool neuralbond allow the creation of BM-based neural chips from
an API go interface.
 Neurons are converted to BondMachine connecting processors.
 Tensors are mapped to CP connections.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 75/117
Architectures Molding TensorFlow to Bondmachine
TensorFlow™ to Bondmachine
tf2bm
TensorFlow™ is an open source software library for numerical
computation using data flow graphs.
Graphs can be converted to BondMachines with the tf2bm tool.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 76/117
Architectures Molding NNEF Composer
Machine Learning with BondMachine
NNEF Composer
Neural Network Exchange Format (NNEF) is a standard from Khronos
Group to enable the easy transfer of trained networks among
frameworks, inference engines and devices
The NNEF BM tool approach is to descent NNEF models and build
BondMachine multi-core accordingly
This approch has several advandages over the previous:
 It is not limited to a single framework
 NNEF is a textual file, so no complex operations are needed to
read models
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 77/117
Hardware
Hardware
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 78/117
Hardware
Hardware implementation
FPGA
The HDL code for the BondMachine is written in Verilog and System
Verilog, and has been tested on these devices/system:
 Digilent Basys3 - Xilinx Artix-7 - Vivado.
 Kintex7 Evaluation Board - Vivado.
 Digilent Zedboard - Xilinx Zynq 7020 - Vivado.
 Linux - Iverilog.
 Terasic De10nano - Intel Cyclone V - Quartus
Within the project other firmwares have been written or tested:
 Microchip ENC28J60 Ethernet interface controller.
 Microchip ENC424J600 10/100 Base-T Ethernet interface
controller.
 ESP8266 Wi-Fi chip.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 79/117
Prototype
The Prototype
The project has been selected for the participation at MakerFaire 2016
Rome (The Europen Edition) and a prototype has been assembled and
presented.
First run Youtube video
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 80/117
Clustering
Clustering
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 81/117
Clustering
BondMachine Clustering
So far we saw:
 An user friendly approach to create processors (single core).
 Optimizing a single device to support intricate computational
work-flows (multi-cores) over an heterogeneous layer.
Interconnected BondMachines
What if we could extend the this layer to multiple interconnected
devices ?
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 82/117
Clustering
BondMachine Clustering
So far we saw:
 An user friendly approach to create processors (single core).
 Optimizing a single device to support intricate computational
work-flows (multi-cores) over an heterogeneous layer.
Interconnected BondMachines
What if we could extend the this layer to multiple interconnected
devices ?
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 82/117
Clustering
BondMachine Clustering
So far we saw:
 An user friendly approach to create processors (single core).
 Optimizing a single device to support intricate computational
work-flows (multi-cores) over an heterogeneous layer.
Interconnected BondMachines
What if we could extend the this layer to multiple interconnected
devices ?
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 82/117
Clustering
BondMachine Clustering
So far we saw:
 An user friendly approach to create processors (single core).
 Optimizing a single device to support intricate computational
work-flows (multi-cores) over an heterogeneous layer.
Interconnected BondMachines
What if we could extend the this layer to multiple interconnected
devices ?
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 82/117
Clustering
BondMachine Clustering
The same logic existing among CP have been extended among different
BondMachines organized in clusters.
Protocols, one ethernet called etherbond and one using UDP called
udpbond have been created for the purpose.
FPGA based BondMachines, standard Linux Workstations, Emulated
BondMachines might join a cluster an contribute to a single distributed
computational problem.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 83/117
Clustering
BondMachine Clustering
The same logic existing among CP have been extended among different
BondMachines organized in clusters.
Protocols, one ethernet called etherbond and one using UDP called
udpbond have been created for the purpose.
FPGA based BondMachines, standard Linux Workstations, Emulated
BondMachines might join a cluster an contribute to a single distributed
computational problem.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 83/117
Clustering
BondMachine Clustering
The same logic existing among CP have been extended among different
BondMachines organized in clusters.
Protocols, one ethernet called etherbond and one using UDP called
udpbond have been created for the purpose.
FPGA based BondMachines, standard Linux Workstations, Emulated
BondMachines might join a cluster an contribute to a single distributed
computational problem.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 83/117
Clustering
BondMachine Clustering
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 84/117
Clustering Redeployer
BondMachine Clustering
A distributed example
distributed counter
package main
import (
bondgo
)
func pong () {
var in0 bondgo.Input
var out0 bondgo.Output
in0 = bondgo.Make(bondgo.Input , 3)
out0 = bondgo.Make(bondgo.Output , 5)
for {
bondgo.IOWrite(out0 , bondgo.IORead(in0)+1)
}
}
func main () {
var in0 bondgo.Input
var out0 bondgo.Output
in0 = bondgo.Make(bondgo.Input , 5)
out0 = bondgo.Make(bondgo.Output , 3)
device_1:
go pong ()
for {
bondgo.IOWrite(out0 , bondgo.IORead(in0))
}
}
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 85/117
Clustering Redeployer
BondMachine Clustering
A distributed example
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 86/117
Clustering Redeployer
BondMachine Clustering
A distributed example
The result is:
BondMachine Clustering Youtube video
A general result
Parts of the system can be redeployed among different devices without
changing the system behavior (only the performances).
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 87/117
Clustering Redeployer
BondMachine Clustering
Results
Results
 User can deploy an entire HW/SW cluster starting from code
written in a high level description (Go, NNEF, etc)
 Workstation with emulated BondMachines, workstation with
etherbond drivers, standalone BondMachines (FPGA) may join
these clusters.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 88/117
Clustering Redeployer
BondMachine Clustering
Results
Results
 User can deploy an entire HW/SW cluster starting from code
written in a high level description (Go, NNEF, etc)
 Workstation with emulated BondMachines, workstation with
etherbond drivers, standalone BondMachines (FPGA) may join
these clusters.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 88/117
Use cases
Use cases
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 89/117
Use cases
Use cases
Two use cases in Physics experiments are currently being developed:
 Real time pulse shape analysis in neutron detectors
 bringing the intelligence to the edge
 Test beam for space experiments (DAMPE, HERD)
 increasing testbed operations efficiency
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 90/117
Use cases Physics
Real time pulse shape analysis in neutron detectors
The operation of the new generation of high-intensity neutron sources
like SNS, JSNS and European Spallation Source (ESS, Lund, Sweden),
now under construction, are introducing a new demand for neutron
detection capabilities.
These demands yield to the need for new data collection procedures
and new technology based on solid state Si devices.
We are trying to use BondMachines to make the real time shape
analysis in this kind of detecting devices.
Courtesy of Prof. F.Sacchetti
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 91/117
Use cases Physics
Test beam for space experiments (DAMPE, HERD)
Trigger logic for test beams
Courtesy of V.Vagelli and M.Duranti
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 92/117
Use cases Physics
Test beam for space experiments (DAMPE, HERD)
Trigger logic for test beams
Courtesy of V.Vagelli and M.Duranti
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 93/117
Use cases Physics
Test beam for space experiments (DAMPE, HERD)
Trigger logic for test beams
We are trying to explore the possibility of using BondMachine to
handle efficiently this kind of operations.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 94/117
Use cases Other uses
Possible other uses
The BondMachine could be used in several types of real world
applications, some of them being:
 IoT and CyberPhysical systems.
 Computer Science educational applications.
Computing Accelerator
Our effort is now in enabling the possibility of building computing
accelerators to be used from within standard (Linux) applications.
Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 95/117
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture
NiPS Summer School 2019 Lecture

More Related Content

What's hot

Microkernels in the Era of Data-Centric Computing
Microkernels in the Era of Data-Centric ComputingMicrokernels in the Era of Data-Centric Computing
Microkernels in the Era of Data-Centric ComputingMartin Děcký
 
Design and Implementation of Quintuple Processor Architecture Using FPGA
Design and Implementation of Quintuple Processor Architecture Using FPGADesign and Implementation of Quintuple Processor Architecture Using FPGA
Design and Implementation of Quintuple Processor Architecture Using FPGAIJERA Editor
 
Smalltalk-80 : hardware and software
Smalltalk-80 : hardware and softwareSmalltalk-80 : hardware and software
Smalltalk-80 : hardware and softwareESUG
 
MPEG-21 Digital Items in Research and Practice
MPEG-21 Digital Items in Research and PracticeMPEG-21 Digital Items in Research and Practice
MPEG-21 Digital Items in Research and PracticeAlpen-Adria-Universität
 
Multimedia Content Understanding: Bringing Context to Content
Multimedia Content Understanding: Bringing Context to ContentMultimedia Content Understanding: Bringing Context to Content
Multimedia Content Understanding: Bringing Context to ContentBenoit HUET
 
A model-driven, component generation approach for the xWoT
A model-driven, component generation approach for the xWoTA model-driven, component generation approach for the xWoT
A model-driven, component generation approach for the xWoTAndreas Ruppen
 

What's hot (9)

Microkernels in the Era of Data-Centric Computing
Microkernels in the Era of Data-Centric ComputingMicrokernels in the Era of Data-Centric Computing
Microkernels in the Era of Data-Centric Computing
 
2020-04-29 SIT Insights in Technology - Bertrand Meyer
2020-04-29 SIT Insights in Technology - Bertrand Meyer2020-04-29 SIT Insights in Technology - Bertrand Meyer
2020-04-29 SIT Insights in Technology - Bertrand Meyer
 
J044084349
J044084349J044084349
J044084349
 
Design and Implementation of Quintuple Processor Architecture Using FPGA
Design and Implementation of Quintuple Processor Architecture Using FPGADesign and Implementation of Quintuple Processor Architecture Using FPGA
Design and Implementation of Quintuple Processor Architecture Using FPGA
 
Smalltalk-80 : hardware and software
Smalltalk-80 : hardware and softwareSmalltalk-80 : hardware and software
Smalltalk-80 : hardware and software
 
MPEG-21 Digital Items in Research and Practice
MPEG-21 Digital Items in Research and PracticeMPEG-21 Digital Items in Research and Practice
MPEG-21 Digital Items in Research and Practice
 
Multimedia Content Understanding: Bringing Context to Content
Multimedia Content Understanding: Bringing Context to ContentMultimedia Content Understanding: Bringing Context to Content
Multimedia Content Understanding: Bringing Context to Content
 
A model-driven, component generation approach for the xWoT
A model-driven, component generation approach for the xWoTA model-driven, component generation approach for the xWoT
A model-driven, component generation approach for the xWoT
 
Smart Built-Environments for People Living with Dementia_Crimson Publishers
Smart Built-Environments for People Living with Dementia_Crimson PublishersSmart Built-Environments for People Living with Dementia_Crimson Publishers
Smart Built-Environments for People Living with Dementia_Crimson Publishers
 

Similar to NiPS Summer School 2019 Lecture

Course: "Introductory course to HLS FPGA programming"
Course: "Introductory course to HLS FPGA programming"Course: "Introductory course to HLS FPGA programming"
Course: "Introductory course to HLS FPGA programming"Mirko Mariotti
 
INFN Advanced ML Hackaton 2022 Talk
INFN Advanced ML Hackaton 2022 TalkINFN Advanced ML Hackaton 2022 Talk
INFN Advanced ML Hackaton 2022 TalkMirko Mariotti
 
Short course on FPGA programming
Short course on FPGA programmingShort course on FPGA programming
Short course on FPGA programmingMirko Mariotti
 
INFN ML FPGA Course 2022
INFN ML FPGA Course 2022INFN ML FPGA Course 2022
INFN ML FPGA Course 2022Mirko Mariotti
 
A Primer on FPGAs - Field Programmable Gate Arrays
A Primer on FPGAs - Field Programmable Gate ArraysA Primer on FPGAs - Field Programmable Gate Arrays
A Primer on FPGAs - Field Programmable Gate ArraysTaylor Riggan
 
Firmware Develpment for hybrid (ARM and FPGA) processors
Firmware Develpment for hybrid (ARM and FPGA) processorsFirmware Develpment for hybrid (ARM and FPGA) processors
Firmware Develpment for hybrid (ARM and FPGA) processorsMirko Mariotti
 
Performance Optimization of SPH Algorithms for Multi/Many-Core Architectures
Performance Optimization of SPH Algorithms for Multi/Many-Core ArchitecturesPerformance Optimization of SPH Algorithms for Multi/Many-Core Architectures
Performance Optimization of SPH Algorithms for Multi/Many-Core ArchitecturesDr. Fabio Baruffa
 
3rd 3DDRESD: RC historical contextualization
3rd 3DDRESD: RC historical contextualization3rd 3DDRESD: RC historical contextualization
3rd 3DDRESD: RC historical contextualizationMarco Santambrogio
 
FPGA-based soft-processors: 6G nodes and post-quantum security in space
 FPGA-based soft-processors: 6G nodes and post-quantum security in space FPGA-based soft-processors: 6G nodes and post-quantum security in space
FPGA-based soft-processors: 6G nodes and post-quantum security in spaceFacultad de Informática UCM
 
⭐⭐⭐⭐⭐ CHARLA FIEC: Monitoring of system memory usage embedded in #FPGA
⭐⭐⭐⭐⭐ CHARLA FIEC: Monitoring of system memory usage embedded in #FPGA⭐⭐⭐⭐⭐ CHARLA FIEC: Monitoring of system memory usage embedded in #FPGA
⭐⭐⭐⭐⭐ CHARLA FIEC: Monitoring of system memory usage embedded in #FPGAVictor Asanza
 
Selective fitting strategy based real time placement algorithm for dynamicall...
Selective fitting strategy based real time placement algorithm for dynamicall...Selective fitting strategy based real time placement algorithm for dynamicall...
Selective fitting strategy based real time placement algorithm for dynamicall...eSAT Publishing House
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) ijceronline
 
Adaptive Computing Seminar - Suyog Potdar
Adaptive Computing Seminar - Suyog PotdarAdaptive Computing Seminar - Suyog Potdar
Adaptive Computing Seminar - Suyog PotdarSuyog Potdar
 
NGD Systems and Microsoft Keynote Presentation at IPDPS MPP in Vacouver
NGD Systems and Microsoft Keynote Presentation at IPDPS MPP in VacouverNGD Systems and Microsoft Keynote Presentation at IPDPS MPP in Vacouver
NGD Systems and Microsoft Keynote Presentation at IPDPS MPP in VacouverScott Shadley, MBA,PMC-III
 
Automatic generation of platform architectures using open cl and fpga roadmap
Automatic generation of platform architectures using open cl and fpga roadmapAutomatic generation of platform architectures using open cl and fpga roadmap
Automatic generation of platform architectures using open cl and fpga roadmapManolis Vavalis
 

Similar to NiPS Summer School 2019 Lecture (20)

Course: "Introductory course to HLS FPGA programming"
Course: "Introductory course to HLS FPGA programming"Course: "Introductory course to HLS FPGA programming"
Course: "Introductory course to HLS FPGA programming"
 
INFN Advanced ML Hackaton 2022 Talk
INFN Advanced ML Hackaton 2022 TalkINFN Advanced ML Hackaton 2022 Talk
INFN Advanced ML Hackaton 2022 Talk
 
ICTP Workshop 2022
ICTP Workshop 2022ICTP Workshop 2022
ICTP Workshop 2022
 
INFN SOSC 2022 Talk
INFN SOSC 2022 TalkINFN SOSC 2022 Talk
INFN SOSC 2022 Talk
 
Short course on FPGA programming
Short course on FPGA programmingShort course on FPGA programming
Short course on FPGA programming
 
INFN ML FPGA Course 2022
INFN ML FPGA Course 2022INFN ML FPGA Course 2022
INFN ML FPGA Course 2022
 
A Primer on FPGAs - Field Programmable Gate Arrays
A Primer on FPGAs - Field Programmable Gate ArraysA Primer on FPGAs - Field Programmable Gate Arrays
A Primer on FPGAs - Field Programmable Gate Arrays
 
91 94
91 9491 94
91 94
 
91 94
91 9491 94
91 94
 
Firmware Develpment for hybrid (ARM and FPGA) processors
Firmware Develpment for hybrid (ARM and FPGA) processorsFirmware Develpment for hybrid (ARM and FPGA) processors
Firmware Develpment for hybrid (ARM and FPGA) processors
 
FPGA In a Nutshell
FPGA In a NutshellFPGA In a Nutshell
FPGA In a Nutshell
 
Performance Optimization of SPH Algorithms for Multi/Many-Core Architectures
Performance Optimization of SPH Algorithms for Multi/Many-Core ArchitecturesPerformance Optimization of SPH Algorithms for Multi/Many-Core Architectures
Performance Optimization of SPH Algorithms for Multi/Many-Core Architectures
 
3rd 3DDRESD: RC historical contextualization
3rd 3DDRESD: RC historical contextualization3rd 3DDRESD: RC historical contextualization
3rd 3DDRESD: RC historical contextualization
 
FPGA-based soft-processors: 6G nodes and post-quantum security in space
 FPGA-based soft-processors: 6G nodes and post-quantum security in space FPGA-based soft-processors: 6G nodes and post-quantum security in space
FPGA-based soft-processors: 6G nodes and post-quantum security in space
 
⭐⭐⭐⭐⭐ CHARLA FIEC: Monitoring of system memory usage embedded in #FPGA
⭐⭐⭐⭐⭐ CHARLA FIEC: Monitoring of system memory usage embedded in #FPGA⭐⭐⭐⭐⭐ CHARLA FIEC: Monitoring of system memory usage embedded in #FPGA
⭐⭐⭐⭐⭐ CHARLA FIEC: Monitoring of system memory usage embedded in #FPGA
 
Selective fitting strategy based real time placement algorithm for dynamicall...
Selective fitting strategy based real time placement algorithm for dynamicall...Selective fitting strategy based real time placement algorithm for dynamicall...
Selective fitting strategy based real time placement algorithm for dynamicall...
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER) International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Adaptive Computing Seminar - Suyog Potdar
Adaptive Computing Seminar - Suyog PotdarAdaptive Computing Seminar - Suyog Potdar
Adaptive Computing Seminar - Suyog Potdar
 
NGD Systems and Microsoft Keynote Presentation at IPDPS MPP in Vacouver
NGD Systems and Microsoft Keynote Presentation at IPDPS MPP in VacouverNGD Systems and Microsoft Keynote Presentation at IPDPS MPP in Vacouver
NGD Systems and Microsoft Keynote Presentation at IPDPS MPP in Vacouver
 
Automatic generation of platform architectures using open cl and fpga roadmap
Automatic generation of platform architectures using open cl and fpga roadmapAutomatic generation of platform architectures using open cl and fpga roadmap
Automatic generation of platform architectures using open cl and fpga roadmap
 

Recently uploaded

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaPraksha3
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.k64182334
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Jshifa
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
The Black hole shadow in Modified Gravity
The Black hole shadow in Modified GravityThe Black hole shadow in Modified Gravity
The Black hole shadow in Modified GravitySubhadipsau21168
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 

Recently uploaded (20)

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tantaDashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
Dashanga agada a formulation of Agada tantra dealt in 3 Rd year bams agada tanta
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.Genomic DNA And Complementary DNA Libraries construction.
Genomic DNA And Complementary DNA Libraries construction.
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
The Black hole shadow in Modified Gravity
The Black hole shadow in Modified GravityThe Black hole shadow in Modified Gravity
The Black hole shadow in Modified Gravity
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 

NiPS Summer School 2019 Lecture

  • 1. The BondMachine Toolkit Enabling Machine Learning on FPGA Mirko Mariotti Department of Physics and Geology - University of Perugia INFN Perugia NiPS Summer School 2019 Architectures and Algorithms for Energy-Efficient IoT and HPC Applications 3-6 September 2019 - Perugia
  • 2. Introduction The BondMachine: a comprehensive approach to computing with FPGA. In this presentation i will talk about: Technological background of the project The BondMachine Project: the Architecture The BondMachine Project: the Tools Use cases Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 2/117
  • 3. Demo sessions Some topic will have a demo session. The code will be available at: http://bondmachine.fisica.unipg.it Requirements: Linux Workstation Vivado Zedboard Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 3/117
  • 4. Technological Background Technological Background Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 4/117
  • 5. Technological Background Current challenges in computing Von Neumann Bottleneck: New computational problems show that current architectural models has to be improved or changed to address future payloads. Energy Efficient computation: Not wasting resources (silicon, time, energy, instructions). Using the right resource for the specific case Edge/Fog/Cloud Computing: Making the computation where it make sense Avoiding the transfer of unnecessary data Creating consistent interfaces for distributed systems Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 5/117
  • 6. Technological Background Current challenges in computing Von Neumann Bottleneck: New computational problems show that current architectural models has to be improved or changed to address future payloads. Energy Efficient computation: Not wasting resources (silicon, time, energy, instructions). Using the right resource for the specific case Edge/Fog/Cloud Computing: Making the computation where it make sense Avoiding the transfer of unnecessary data Creating consistent interfaces for distributed systems Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 5/117
  • 7. Technological Background Current challenges in computing Von Neumann Bottleneck: New computational problems show that current architectural models has to be improved or changed to address future payloads. Energy Efficient computation: Not wasting resources (silicon, time, energy, instructions). Using the right resource for the specific case Edge/Fog/Cloud Computing: Making the computation where it make sense Avoiding the transfer of unnecessary data Creating consistent interfaces for distributed systems Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 5/117
  • 8. Technological Background Programmable Logic FPGA What is it ? A field-programmable gate array (FPGA) is an integrated circuit whose logic is re-programmable. It’s used to build reconfigurable digital circuits. FPGAs contain an array of programmable logic blocks, and a hierarchy of reconfig- urable interconnects that allow the blocks to be wired together. Logic blocks can be configured to perform complex combinational functions. The FPGA configuration is generally specified using a hardware description language (HDL). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 6/117
  • 9. Technological Background Programmable Logic FPGA What is it ? A field-programmable gate array (FPGA) is an integrated circuit whose logic is re-programmable. It’s used to build reconfigurable digital circuits. FPGAs contain an array of programmable logic blocks, and a hierarchy of reconfig- urable interconnects that allow the blocks to be wired together. Logic blocks can be configured to perform complex combinational functions. The FPGA configuration is generally specified using a hardware description language (HDL). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 6/117
  • 10. Technological Background Programmable Logic FPGA What is it ? A field-programmable gate array (FPGA) is an integrated circuit whose logic is re-programmable. It’s used to build reconfigurable digital circuits. FPGAs contain an array of programmable logic blocks, and a hierarchy of reconfig- urable interconnects that allow the blocks to be wired together. Logic blocks can be configured to perform complex combinational functions. The FPGA configuration is generally specified using a hardware description language (HDL). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 6/117
  • 11. Technological Background Programmable Logic FPGA What is it ? A field-programmable gate array (FPGA) is an integrated circuit whose logic is re-programmable. It’s used to build reconfigurable digital circuits. FPGAs contain an array of programmable logic blocks, and a hierarchy of reconfig- urable interconnects that allow the blocks to be wired together. Logic blocks can be configured to perform complex combinational functions. The FPGA configuration is generally specified using a hardware description language (HDL). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 6/117
  • 12. Technological Background Programmable Logic FPGA Use in computing The use of FPGA in computing is growing due several reasons: can potentially deliver great performance via massive parallelism can address payloads which are not performing well on uniprocessors (Neural Networks, Deep Learning) can handle efficiently non-standard data types Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 7/117
  • 13. Technological Background Programmable Logic FPGA Use in computing The use of FPGA in computing is growing due several reasons: can potentially deliver great performance via massive parallelism can address payloads which are not performing well on uniprocessors (Neural Networks, Deep Learning) can handle efficiently non-standard data types Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 7/117
  • 14. Technological Background Programmable Logic FPGA Use in computing The use of FPGA in computing is growing due several reasons: can potentially deliver great performance via massive parallelism can address payloads which are not performing well on uniprocessors (Neural Networks, Deep Learning) can handle efficiently non-standard data types Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 7/117
  • 15. Technological Background Programmable Logic FPGA Challenges in computing On the other hand the adoption on FPGA poses several challenges: Porting of legacy code is usually hard. Interoperability with standard applications is problematic. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 8/117
  • 16. Technological Background Programmable Logic FPGA Challenges in computing On the other hand the adoption on FPGA poses several challenges: Porting of legacy code is usually hard. Interoperability with standard applications is problematic. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 8/117
  • 17. Technological Background Computer Architectures Computer Architectures Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 9/117
  • 18. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 19. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 20. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 21. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 22. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 23. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 24. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 25. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 26. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 27. Technological Background Computer Architectures Computer Architectures Multi-core and Heterogeneous Today’s computer architecture are: Multi-core,Two or more independent actual processing units execute multiple instructions at the same time. The power is given by the number of cores. Parallelism has to be addressed. Heterogeneous, different types of processing units. Cell, GPU, Parallela, TPU. The power is given by the specialization. The units data transfer has to be addressed. The payloads scheduling has to be addressed. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 10/117
  • 28. Technological Background The first Idea The BondMachine The first idea High level sources: Go, TensorFlow, NN, ... Building a new kind of computer architecture (multi-core and heterogeneous both in cores types and interconnections) which dynamically adapt to the specific computational problem rather than be static. BM architecture Layer FPGA Concurrency and Special- ization Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
  • 29. Technological Background The first Idea The BondMachine The first idea High level sources: Go, TensorFlow, NN, ... Building a new kind of computer architecture (multi-core and heterogeneous both in cores types and interconnections) which dynamically adapt to the specific computational problem rather than be static. BM architecture Layer FPGA Concurrency and Special- ization Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
  • 30. Technological Background The first Idea The BondMachine The first idea High level sources: Go, TensorFlow, NN, ... Building a new kind of computer architecture (multi-core and heterogeneous both in cores types and interconnections) which dynamically adapt to the specific computational problem rather than be static. BM architecture Layer FPGA Concurrency and Special- ization Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
  • 31. Technological Background The first Idea The BondMachine The first idea High level sources: Go, TensorFlow, NN, ... Building a new kind of computer architecture (multi-core and heterogeneous both in cores types and interconnections) which dynamically adapt to the specific computational problem rather than be static. BM architecture Layer FPGA Concurrency and Special- ization Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
  • 32. Technological Background The first Idea The BondMachine The first idea High level sources: Go, TensorFlow, NN, ... Building a new kind of computer architecture (multi-core and heterogeneous both in cores types and interconnections) which dynamically adapt to the specific computational problem rather than be static. BM architecture Layer FPGA Concurrency and Special- ization Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 11/117
  • 33. Technological Background Abstractions Abstractions and Interfaces Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 12/117
  • 34. Technological Background Abstractions Layer, Abstractions and Interfaces Introduction A Computing system is a matter of abstraction and interfaces. A lower layer exposes its functionalities (via interfaces) to the above layer hiding (abstraction) its inner details. The quality of a computing system is determined by how abstractions are simple and how interfaces are clean. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 13/117
  • 35. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language User mode Kernel mode Processor Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 36. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language User mode Kernel mode Processor Register Machine Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 37. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language User mode Kernel mode Processor Register Machine Opcodes Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 38. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language User mode Kernel mode Kernel Processor Register Machine Opcodes Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 39. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language User mode Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 40. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language User mode Standard Library Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 41. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 42. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language Compiler/Interpreter User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 43. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language Compiler/Interpreter Language User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 44. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language Compiler/Interpreter Language User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Hardware Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 45. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language Compiler/Interpreter Language User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Hardware Software Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 46. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language Compiler/Interpreter Language User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Hardware Specialization Software Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 47. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language Compiler/Interpreter Language User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Hardware Specialization Software Generalization Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 48. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language Compiler/Interpreter Language User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Hardware Specialization Performance Software Generalization Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 49. Technological Background Abstractions Layers, Abstractions and Interfaces An example Programming language Compiler/Interpreter Language User mode Standard Library Stl calls Kernel mode Kernel System calls Processor Register Machine Opcodes Transistors Hardware Specialization Performance Software Generalization User friendly Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 14/117
  • 50. Technological Background The second idea Layers, Abstractions and Interfaces The second idea Rethinking the stack Build a computing system with a decreased number of layers resulting in a minor gap between HW and SW but keeping an user friendly way of programming it. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 15/117
  • 51. BondMachine BondMachine Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 16/117
  • 52. BondMachine Introducing the BondMachine (BM) The BondMachine is a software ecosystem for the dynamic generation of computer architectures that: Are composed by many, possibly hundreds, computing cores. Have very small cores and not necessarily of the same type (different ISA and ABI). Have a not fixed way of interconnecting cores. May have some elements shared among cores (for example channels and shared memories). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
  • 53. BondMachine Introducing the BondMachine (BM) The BondMachine is a software ecosystem for the dynamic generation of computer architectures that: Are composed by many, possibly hundreds, computing cores. Have very small cores and not necessarily of the same type (different ISA and ABI). Have a not fixed way of interconnecting cores. May have some elements shared among cores (for example channels and shared memories). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
  • 54. BondMachine Introducing the BondMachine (BM) The BondMachine is a software ecosystem for the dynamic generation of computer architectures that: Are composed by many, possibly hundreds, computing cores. Have very small cores and not necessarily of the same type (different ISA and ABI). Have a not fixed way of interconnecting cores. May have some elements shared among cores (for example channels and shared memories). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
  • 55. BondMachine Introducing the BondMachine (BM) The BondMachine is a software ecosystem for the dynamic generation of computer architectures that: Are composed by many, possibly hundreds, computing cores. Have very small cores and not necessarily of the same type (different ISA and ABI). Have a not fixed way of interconnecting cores. May have some elements shared among cores (for example channels and shared memories). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
  • 56. BondMachine Introducing the BondMachine (BM) The BondMachine is a software ecosystem for the dynamic generation of computer architectures that: Are composed by many, possibly hundreds, computing cores. Have very small cores and not necessarily of the same type (different ISA and ABI). Have a not fixed way of interconnecting cores. May have some elements shared among cores (for example channels and shared memories). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 17/117
  • 57. BondMachine The BondMachine An example Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 18/117
  • 58. BondMachine Connecting Processors Connecting Processor (CP) The computational unit of the BM The atomic computational unit of a BM is the “connecting processor” (CP) and has: Some general purpose registers of size Rsize. Some I/O dedicated registers of size Rsize. A set of implemented opcodes chosen among many available. Dedicated ROM and RAM. Three possible operating modes. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
  • 59. BondMachine Connecting Processors Connecting Processor (CP) The computational unit of the BM The atomic computational unit of a BM is the “connecting processor” (CP) and has: Some general purpose registers of size Rsize. Some I/O dedicated registers of size Rsize. A set of implemented opcodes chosen among many available. Dedicated ROM and RAM. Three possible operating modes. General purpose registers 2R registers: r0,r1,r2,r3 ... r2R Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
  • 60. BondMachine Connecting Processors Connecting Processor (CP) The computational unit of the BM The atomic computational unit of a BM is the “connecting processor” (CP) and has: Some general purpose registers of size Rsize. Some I/O dedicated registers of size Rsize. A set of implemented opcodes chosen among many available. Dedicated ROM and RAM. Three possible operating modes. I/O specialized registers N input registers: i0,i1 ... iN M output registers: o0,o1 ... oM Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
  • 61. BondMachine Connecting Processors Connecting Processor (CP) The computational unit of the BM The atomic computational unit of a BM is the “connecting processor” (CP) and has: Some general purpose registers of size Rsize. Some I/O dedicated registers of size Rsize. A set of implemented opcodes chosen among many available. Dedicated ROM and RAM. Three possible operating modes. Full set of possible opcodes add,addf,addi,chc,chw,clr,cpy,dec,divf,dpc,hit,hlt,i2r,inc,j, je,jz,m2r,mult,multf,nop,r2m,r2o,r2s,rset,sic,s2r,saj,sub, wrd,wwr Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
  • 62. BondMachine Connecting Processors Connecting Processor (CP) The computational unit of the BM The atomic computational unit of a BM is the “connecting processor” (CP) and has: Some general purpose registers of size Rsize. Some I/O dedicated registers of size Rsize. A set of implemented opcodes chosen among many available. Dedicated ROM and RAM. Three possible operating modes. RAM and ROM 2L RAM memory cells. 2O ROM memory cells. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
  • 63. BondMachine Connecting Processors Connecting Processor (CP) The computational unit of the BM The atomic computational unit of a BM is the “connecting processor” (CP) and has: Some general purpose registers of size Rsize. Some I/O dedicated registers of size Rsize. A set of implemented opcodes chosen among many available. Dedicated ROM and RAM. Three possible operating modes. Operating modes Full Harvard mode. Full Von Neuman mode. Hybrid mode. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 19/117
  • 64. BondMachine Connecting Processors Connecting Processor (CP) Full set of possible opcodes Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 20/117
  • 65. BondMachine Shared Objects Shared Objects (SO) The non-computational element of the BM Alongside CPs, BondMachines include non-computing units called “Shared Objects” (SO). Examples of their purposes are: Data storage (Memories). Message passing. CP synchronization. A single SO can be shared among different CPs. To use it CPs have special instructions (opcodes) oriented to the specific SO. Four kind of SO have been developed so far: the Channel, the Shared Memory, the Barrier and a Pseudo Random Numbers Gen- erator. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
  • 66. BondMachine Shared Objects Shared Objects (SO) The non-computational element of the BM Alongside CPs, BondMachines include non-computing units called “Shared Objects” (SO). Examples of their purposes are: Data storage (Memories). Message passing. CP synchronization. A single SO can be shared among different CPs. To use it CPs have special instructions (opcodes) oriented to the specific SO. Four kind of SO have been developed so far: the Channel, the Shared Memory, the Barrier and a Pseudo Random Numbers Gen- erator. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
  • 67. BondMachine Shared Objects Shared Objects (SO) The non-computational element of the BM Alongside CPs, BondMachines include non-computing units called “Shared Objects” (SO). Examples of their purposes are: Data storage (Memories). Message passing. CP synchronization. A single SO can be shared among different CPs. To use it CPs have special instructions (opcodes) oriented to the specific SO. Four kind of SO have been developed so far: the Channel, the Shared Memory, the Barrier and a Pseudo Random Numbers Gen- erator. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
  • 68. BondMachine Shared Objects Shared Objects (SO) The non-computational element of the BM Alongside CPs, BondMachines include non-computing units called “Shared Objects” (SO). Examples of their purposes are: Data storage (Memories). Message passing. CP synchronization. A single SO can be shared among different CPs. To use it CPs have special instructions (opcodes) oriented to the specific SO. Four kind of SO have been developed so far: the Channel, the Shared Memory, the Barrier and a Pseudo Random Numbers Gen- erator. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
  • 69. BondMachine Shared Objects Shared Objects (SO) The non-computational element of the BM Alongside CPs, BondMachines include non-computing units called “Shared Objects” (SO). Examples of their purposes are: Data storage (Memories). Message passing. CP synchronization. A single SO can be shared among different CPs. To use it CPs have special instructions (opcodes) oriented to the specific SO. Four kind of SO have been developed so far: the Channel, the Shared Memory, the Barrier and a Pseudo Random Numbers Gen- erator. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
  • 70. BondMachine Shared Objects Shared Objects (SO) The non-computational element of the BM Alongside CPs, BondMachines include non-computing units called “Shared Objects” (SO). Examples of their purposes are: Data storage (Memories). Message passing. CP synchronization. A single SO can be shared among different CPs. To use it CPs have special instructions (opcodes) oriented to the specific SO. Four kind of SO have been developed so far: the Channel, the Shared Memory, the Barrier and a Pseudo Random Numbers Gen- erator. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 21/117
  • 71. BondMachine Shared Objects Channel The Channel SO is an hardware implementation of the CSP (communi- cating sequential processes) channel. It is a model for inter-core communication and synchronization via mes- sage passing. CPs use channels via 4 opcodes wrd: Want Read. wwr: Want Write. chc: Channel Check. chw: Channel Wait. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 22/117
  • 72. BondMachine Shared Objects Channel The Channel SO is an hardware implementation of the CSP (communi- cating sequential processes) channel. It is a model for inter-core communication and synchronization via mes- sage passing. CPs use channels via 4 opcodes wrd: Want Read. wwr: Want Write. chc: Channel Check. chw: Channel Wait. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 22/117
  • 73. BondMachine Shared Objects Channel The Channel SO is an hardware implementation of the CSP (communi- cating sequential processes) channel. It is a model for inter-core communication and synchronization via mes- sage passing. CPs use channels via 4 opcodes wrd: Want Read. wwr: Want Write. chc: Channel Check. chw: Channel Wait. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 22/117
  • 74. BondMachine Shared Objects Shared Memory The Shared Memory SO is a RAM block accessible from more than one CP. Different Shared Memories can be used by different CP and not neces- sarily by all of them. CPs use shared memories via 2 opcodes s2r: Shared memory read. r2s: Shared memory write. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 23/117
  • 75. BondMachine Shared Objects Shared Memory The Shared Memory SO is a RAM block accessible from more than one CP. Different Shared Memories can be used by different CP and not neces- sarily by all of them. CPs use shared memories via 2 opcodes s2r: Shared memory read. r2s: Shared memory write. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 23/117
  • 76. BondMachine Shared Objects Shared Memory The Shared Memory SO is a RAM block accessible from more than one CP. Different Shared Memories can be used by different CP and not neces- sarily by all of them. CPs use shared memories via 2 opcodes s2r: Shared memory read. r2s: Shared memory write. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 23/117
  • 77. BondMachine Shared Objects Barrier The Barrier SO is used to make CPs act synchronously. When a CP hits a barrier, the execution stop until all the CPs that share the same barrier hit it. CPs use barriers via 1 opcode hit: Hit the barrier. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 24/117
  • 78. BondMachine Shared Objects Barrier The Barrier SO is used to make CPs act synchronously. When a CP hits a barrier, the execution stop until all the CPs that share the same barrier hit it. CPs use barriers via 1 opcode hit: Hit the barrier. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 24/117
  • 79. BondMachine Shared Objects Barrier The Barrier SO is used to make CPs act synchronously. When a CP hits a barrier, the execution stop until all the CPs that share the same barrier hit it. CPs use barriers via 1 opcode hit: Hit the barrier. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 24/117
  • 80. BondMachine Shared Objects Multicore and Heterogeneous First idea on the BondMachine The idea was: Having a multi-core architecture completely heterogeneous both in cores types and interconnections. The BondMachine may have many cores, eventually all different, arbitrarily interconnected and sharing non computing elements. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 25/117
  • 81. Architectures Handling Architectures Handling Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 26/117
  • 82. Architectures Handling Handle BM computer architectures The BM computer architecture is managed by a set of tools to: build a specify architecture modify a pre-existing architecture simulate or emulate the behavior Generate the Hardware Description Code (HDL) Processor Builder Selects the single pro- cessor, assembles and disassembles, saves on disk as JSON, creates the HDL code of a CP BondMachine Builder Connects CPs and SOs together in custom topologies, loads and saves on disk as JSON, create BM’s HDL code Simulation Framework Simulates the be- haviour, emulates a BM on a standard Linux workstation Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 27/117
  • 83. Architectures Handling Handle BM computer architectures The BM computer architecture is managed by a set of tools to: build a specify architecture modify a pre-existing architecture simulate or emulate the behavior Generate the Hardware Description Code (HDL) Processor Builder Selects the single pro- cessor, assembles and disassembles, saves on disk as JSON, creates the HDL code of a CP BondMachine Builder Connects CPs and SOs together in custom topologies, loads and saves on disk as JSON, create BM’s HDL code Simulation Framework Simulates the be- haviour, emulates a BM on a standard Linux workstation Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 27/117
  • 84. Architectures Handling Handle BM computer architectures The BM computer architecture is managed by a set of tools to: build a specify architecture modify a pre-existing architecture simulate or emulate the behavior Generate the Hardware Description Code (HDL) Processor Builder Selects the single pro- cessor, assembles and disassembles, saves on disk as JSON, creates the HDL code of a CP BondMachine Builder Connects CPs and SOs together in custom topologies, loads and saves on disk as JSON, create BM’s HDL code Simulation Framework Simulates the be- haviour, emulates a BM on a standard Linux workstation Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 27/117
  • 85. Architectures Handling Handle BM computer architectures The BM computer architecture is managed by a set of tools to: build a specify architecture modify a pre-existing architecture simulate or emulate the behavior Generate the Hardware Description Code (HDL) Processor Builder Selects the single pro- cessor, assembles and disassembles, saves on disk as JSON, creates the HDL code of a CP BondMachine Builder Connects CPs and SOs together in custom topologies, loads and saves on disk as JSON, create BM’s HDL code Simulation Framework Simulates the be- haviour, emulates a BM on a standard Linux workstation Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 27/117
  • 86. Architectures Handling Processor Builder Procbuilder is the CP manipulation tool. CP Creation CP Load/Save CP Assembler/Disassembler CP HDL Examples (32 bit registers counter machine) procbuilder -register-size 32 -opcodes clr,cpy,dec,inc,je,jz (Input and Output registers) procbuilder -inputs 3 -outputs 2 ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 28/117
  • 87. Architectures Handling Processor Builder Procbuilder is the CP manipulation tool. CP Creation CP Load/Save CP Assembler/Disassembler CP HDL Examples (Loading a CP) procbuilder -load-machine conproc.json ... (Saving a CP) procbuilder -save-machine conproc.json ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 28/117
  • 88. Architectures Handling Processor Builder Procbuilder is the CP manipulation tool. CP Creation CP Load/Save CP Assembler/Disassembler CP HDL Examples (Assembiling) procbuilder -input-assembly program.asm ... (Disassembling) procbuilder -show-program-disassembled ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 28/117
  • 89. Architectures Handling Processor Builder Procbuilder is the CP manipulation tool. CP Creation CP Load/Save CP Assembler/Disassembler CP HDL Examples (Create the CP HDL code in Verilog) procbuilder -create-verilog ... (Create testbench) procbuilder -create-verilog-testbench test.v ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 28/117
  • 90. Architectures Handling Procbuilder demo Demo N.1 It will be shown how: To create a simple processor To assemble and disassemble code for it To produce its HDL code Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 29/117
  • 91. Architectures Handling BondMachine Builder Bondmachine is the tool that compose CP and SO to form BondMachines. BM CP insert and remove BM SO insert and remove BM Inputs and Outputs BM Bonding Processors and/or IO BM Visualizing or HDL Examples (Add a processor) bondmachine -add-domains proc.json ... ; ... -add-processor 0 (Remove a processor) bondmachine -bondmachine-file bmach.json -del-processor n Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
  • 92. Architectures Handling BondMachine Builder Bondmachine is the tool that compose CP and SO to form BondMachines. BM CP insert and remove BM SO insert and remove BM Inputs and Outputs BM Bonding Processors and/or IO BM Visualizing or HDL Examples (Add a Shared Object) bondmachine -add-shared-objects specs ... (Connect an SO to a processor) bondmachine -connect-processor-shared-object ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
  • 93. Architectures Handling BondMachine Builder Bondmachine is the tool that compose CP and SO to form BondMachines. BM CP insert and remove BM SO insert and remove BM Inputs and Outputs BM Bonding Processors and/or IO BM Visualizing or HDL Examples (Adding inputs or outputs) bondmachine -add-inputs ... ; bondmachine -add-outputs ... (Removing inputs or outputs) bondmachine -del-input ... ; bondmachine -del-output ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
  • 94. Architectures Handling BondMachine Builder Bondmachine is the tool that compose CP and SO to form BondMachines. BM CP insert and remove BM SO insert and remove BM Inputs and Outputs BM Bonding Processors and/or IO BM Visualizing or HDL Examples (Bonding processor) bondmachine -add-bond p0i2,p1o4 ... (Bonding IO) bondmachine -add-bond i2,p0i6 ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
  • 95. Architectures Handling BondMachine Builder Bondmachine is the tool that compose CP and SO to form BondMachines. BM CP insert and remove BM SO insert and remove BM Inputs and Outputs BM Bonding Processors and/or IO BM Visualizing or HDL Examples (Visualizing) bondmachine -emit-dot ... (Create HDL code) bondmachine -create-verilog ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 30/117
  • 96. Architectures Handling BondMachine demo Demo N.2 It will be shown how: To create a single-core BondMachine To attach an external output To produce its HDL code Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 31/117
  • 97. Architectures Handling Toolchains Toolchains A set of toolchains allow the build and the direct deploy to a target device of BondMachines. Bondgo Toolchain example A file local.mk contains references to the source code as well all the build necessities. make bondmachine creates the JSON representation of the BM and assemble its code. make show displays a graphical representation of the BM. make simulate start a simulation. make videosim create a simulation video. make flash the device into the destination target. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 32/117
  • 98. Architectures Handling Toolchains BondMachine demo Demo N.3 It will be shown how: To explore the toolchain To flash the board with the code from the previous example Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 33/117
  • 99. Architectures Handling Toolchains BondMachine demo Demo N.4 It will be shown how: To build a BondMachine with a processor and a shared object To flash the board Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 34/117
  • 100. Architectures Handling Toolchains BondMachine demo Demo N.5 It will be shown how: To build a dual-core BondMachine To connect cores To flash the board Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 35/117
  • 101. Architectures Handling Web frontend BondMachine web front-end Operations on BondMachines can also be performed via an under development web framework Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 36/117
  • 102. Architectures Handling Simulation Simulation An important feature of the tools is the possibility of simulating BondMachine behavior. An event input file describes how BondMachines elements has to change during the simulation timespan and which one has to be be reported. The simulator can produce results in the form of: Activity log of the BM internal. Graphical representation of the simulation. Report file with quantitative data. Useful to construct metrics Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 37/117
  • 103. Architectures Handling Simulation Simulation An important feature of the tools is the possibility of simulating BondMachine behavior. An event input file describes how BondMachines elements has to change during the simulation timespan and which one has to be be reported. The simulator can produce results in the form of: Activity log of the BM internal. Graphical representation of the simulation. Report file with quantitative data. Useful to construct metrics Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 37/117
  • 104. Architectures Handling Simulation Simulation An important feature of the tools is the possibility of simulating BondMachine behavior. An event input file describes how BondMachines elements has to change during the simulation timespan and which one has to be be reported. The simulator can produce results in the form of: Activity log of the BM internal. Graphical representation of the simulation. Report file with quantitative data. Useful to construct metrics Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 37/117
  • 105. Architectures Handling Simulation Simulation examples Activity log example: Simulation Youtube video Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 38/117
  • 106. Architectures Handling Simulation Simulation demo Demo N.6 It will be shown how: To show the simulation capabilities of the framework Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 39/117
  • 107. Architectures Handling Simulation Emulation The same engine that simulate BondMachines can be used as emulator. Through the emulator BondMachines can be used on Linux workstations. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 40/117
  • 108. Architectures Molding Architectures Molding Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 41/117
  • 109. Architectures Molding API and Libraries Molding the BondMachine As stated before BondMachines are not general purpose architectures, and to be effective have to be shaped according the specific problem. Several methods (apart from writing in assembly and building a Bond- Machine from scratch) have been developed to do that: bondgo: A new type of compiler that create not only the CPs assembly but also the architecture itself. A set of API to create BondMachine to fit a specific computational problems. An Evolutionary Computation framework to “grow” BondMachines according some fitness function via simulation. A set of tools to use BondMachine in Machine Learning. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
  • 110. Architectures Molding API and Libraries Molding the BondMachine As stated before BondMachines are not general purpose architectures, and to be effective have to be shaped according the specific problem. Several methods (apart from writing in assembly and building a Bond- Machine from scratch) have been developed to do that: bondgo: A new type of compiler that create not only the CPs assembly but also the architecture itself. A set of API to create BondMachine to fit a specific computational problems. An Evolutionary Computation framework to “grow” BondMachines according some fitness function via simulation. A set of tools to use BondMachine in Machine Learning. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
  • 111. Architectures Molding API and Libraries Molding the BondMachine As stated before BondMachines are not general purpose architectures, and to be effective have to be shaped according the specific problem. Several methods (apart from writing in assembly and building a Bond- Machine from scratch) have been developed to do that: bondgo: A new type of compiler that create not only the CPs assembly but also the architecture itself. A set of API to create BondMachine to fit a specific computational problems. An Evolutionary Computation framework to “grow” BondMachines according some fitness function via simulation. A set of tools to use BondMachine in Machine Learning. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
  • 112. Architectures Molding API and Libraries Molding the BondMachine As stated before BondMachines are not general purpose architectures, and to be effective have to be shaped according the specific problem. Several methods (apart from writing in assembly and building a Bond- Machine from scratch) have been developed to do that: bondgo: A new type of compiler that create not only the CPs assembly but also the architecture itself. A set of API to create BondMachine to fit a specific computational problems. An Evolutionary Computation framework to “grow” BondMachines according some fitness function via simulation. A set of tools to use BondMachine in Machine Learning. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
  • 113. Architectures Molding API and Libraries Molding the BondMachine As stated before BondMachines are not general purpose architectures, and to be effective have to be shaped according the specific problem. Several methods (apart from writing in assembly and building a Bond- Machine from scratch) have been developed to do that: bondgo: A new type of compiler that create not only the CPs assembly but also the architecture itself. A set of API to create BondMachine to fit a specific computational problems. An Evolutionary Computation framework to “grow” BondMachines according some fitness function via simulation. A set of tools to use BondMachine in Machine Learning. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
  • 114. Architectures Molding API and Libraries Molding the BondMachine As stated before BondMachines are not general purpose architectures, and to be effective have to be shaped according the specific problem. Several methods (apart from writing in assembly and building a Bond- Machine from scratch) have been developed to do that: bondgo: A new type of compiler that create not only the CPs assembly but also the architecture itself. A set of API to create BondMachine to fit a specific computational problems. An Evolutionary Computation framework to “grow” BondMachines according some fitness function via simulation. A set of tools to use BondMachine in Machine Learning. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 42/117
  • 115. Architectures Molding API and Libraries Use the BM computer architecture Mapping specific computational problems to BMs Symbond Map symbolic mathematical expressions to BM Boolbond Map boolean systems to BM Matrixwork Basic matrix computation Evolutive BM Evolutionary computing to BM Bondgo The architecture compiler ML tools Map computational graphs to BM Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
  • 116. Architectures Molding API and Libraries Use the BM computer architecture Mapping specific computational problems to BMs Symbond Map symbolic mathematical expressions to BM Boolbond Map boolean systems to BM Matrixwork Basic matrix computation Evolutive BM Evolutionary computing to BM Bondgo The architecture compiler ML tools Map computational graphs to BM Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
  • 117. Architectures Molding API and Libraries Use the BM computer architecture Mapping specific computational problems to BMs Symbond Map symbolic mathematical expressions to BM Boolbond Map boolean systems to BM Matrixwork Basic matrix computation Evolutive BM Evolutionary computing to BM Bondgo The architecture compiler ML tools Map computational graphs to BM Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
  • 118. Architectures Molding API and Libraries Use the BM computer architecture Mapping specific computational problems to BMs Symbond Map symbolic mathematical expressions to BM Boolbond Map boolean systems to BM Matrixwork Basic matrix computation Evolutive BM Evolutionary computing to BM Bondgo The architecture compiler ML tools Map computational graphs to BM Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
  • 119. Architectures Molding API and Libraries Use the BM computer architecture Mapping specific computational problems to BMs Symbond Map symbolic mathematical expressions to BM Boolbond Map boolean systems to BM Matrixwork Basic matrix computation Evolutive BM Evolutionary computing to BM Bondgo The architecture compiler ML tools Map computational graphs to BM Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
  • 120. Architectures Molding API and Libraries Use the BM computer architecture Mapping specific computational problems to BMs Symbond Map symbolic mathematical expressions to BM Boolbond Map boolean systems to BM Matrixwork Basic matrix computation Evolutive BM Evolutionary computing to BM Bondgo The architecture compiler ML tools Map computational graphs to BM Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
  • 121. Architectures Molding API and Libraries Use the BM computer architecture Mapping specific computational problems to BMs Symbond Map symbolic mathematical expressions to BM Boolbond Map boolean systems to BM Matrixwork Basic matrix computation Evolutive BM Evolutionary computing to BM Bondgo The architecture compiler ML tools Map computational graphs to BM Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 43/117
  • 122. Architectures Molding Bondgo Bondgo Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 44/117
  • 123. Architectures Molding Bondgo Bondgo The major innovation of the BondMachine Project is its compiler. Bondgo is the name chosen for the compiler developed for the BondMachine. The compiler source language is Go as the name suggest. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 45/117
  • 124. Architectures Molding Bondgo Bondgo This is the standard flow when building computer programs Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 46/117
  • 125. Architectures Molding Bondgo Bondgo This is the standard flow when building computer programs high level language source Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 46/117
  • 126. Architectures Molding Bondgo Bondgo This is the standard flow when building computer programs high level language source assembly file Compiling Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 46/117
  • 127. Architectures Molding Bondgo Bondgo This is the standard flow when building computer programs high level language source assembly file Compiling machine code Assembling Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 46/117
  • 128. Architectures Molding Bondgo Bondgo bondgo loop example package main import () func main () { var reg_aa uint8 var reg_ab uint8 for reg_aa = 10; reg_aa 0; reg_aa -- { reg_ab = reg_aa break } } bondgo loop example in asm clr aa clr ab rset ac 10 cpy aa ac cpy ac aa jz ac 11 cpy ac aa cpy ab ac j 11 dec aa j 4 Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 47/117
  • 129. Architectures Molding Bondgo Bondgo Bondgo does something different from standard compilers ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
  • 130. Architectures Molding Bondgo Bondgo Bondgo does something different from standard compilers ... high level GO source Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
  • 131. Architectures Molding Bondgo Bondgo Bondgo does something different from standard compilers ... high level GO source assembly file Compiling Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
  • 132. Architectures Molding Bondgo Bondgo Bondgo does something different from standard compilers ... high level GO source assembly file Compiling CP specs Arch generating Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
  • 133. Architectures Molding Bondgo Bondgo Bondgo does something different from standard compilers ... high level GO source assembly file Compiling CP specs Arch generating machine code Assembling Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
  • 134. Architectures Molding Bondgo Bondgo Bondgo does something different from standard compilers ... high level GO source assembly file Compiling CP specs Arch generating machine code Assembling Processor implementation Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
  • 135. Architectures Molding Bondgo Bondgo Bondgo does something different from standard compilers ... high level GO source assembly file Compiling CP specs Arch generating machine code Assembling Processor implementation Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 48/117
  • 136. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 137. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 138. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 139. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 140. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 141. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 142. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 143. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 144. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 145. Architectures Molding Bondgo Bondgo A first example counter go source package main import ( bondgo ) func main () { var outgoing bondgo.Output var reg_int uint8 outgoing = bondgo.Make(bondgo.Output , 3) reg_int = 0 for { bondgo.IOWrite(outgoing , reg_int) reg_int ++ } } bondgo –input-file counter.go -mpm BM JSON rapresentation counter asm clr r0 rset r1 0 cpy r0 r1 cpy r1 r0 r2o r1 o0 inc r0 j 3 bondmachine tool BM HDL code FPGA procbuilder tool Binary Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 49/117
  • 146. Architectures Molding Bondgo Bondgo demo Demo N.7 It will be shown how: To create a BondMachine from a Go source file To build the architecture To build the program To create the firmware and flash it to the board Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 50/117
  • 147. Architectures Molding Bondgo Bondgo ... bondgo may not only create the binaries, but also the CP architecture, and ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 51/117
  • 148. Architectures Molding Bondgo Bondgo ... it can do even much more interesting things when compiling concurrent programs. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
  • 149. Architectures Molding Bondgo Bondgo ... it can do even much more interesting things when compiling concurrent programs. high level GO source Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
  • 150. Architectures Molding Bondgo Bondgo ... it can do even much more interesting things when compiling concurrent programs. high level GO source assembly CP 1 assembly CP 2 assembly CP 3 assembly CP 4 assembly CP ... assembly CP N Compiling Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
  • 151. Architectures Molding Bondgo Bondgo ... it can do even much more interesting things when compiling concurrent programs. high level GO source assembly CP 1 assembly CP 2 assembly CP 3 assembly CP 4 assembly CP ... assembly CP N Compiling CP 1 specs CP 2 CP 3 CP 4 CP ... CP N Archs generating Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
  • 152. Architectures Molding Bondgo Bondgo ... it can do even much more interesting things when compiling concurrent programs. high level GO source assembly CP 1 assembly CP 2 assembly CP 3 assembly CP 4 assembly CP ... assembly CP N Compiling CP 1 specs CP 2 CP 3 CP 4 CP ... CP N Archs generating Asm and Binaries Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
  • 153. Architectures Molding Bondgo Bondgo ... it can do even much more interesting things when compiling concurrent programs. high level GO source assembly CP 1 assembly CP 2 assembly CP 3 assembly CP 4 assembly CP ... assembly CP N Compiling CP 1 specs CP 2 CP 3 CP 4 CP ... CP N Archs generating Asm and Binaries Interconnections Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
  • 154. Architectures Molding Bondgo Bondgo ... it can do even much more interesting things when compiling concurrent programs. high level GO source assembly CP 1 assembly CP 2 assembly CP 3 assembly CP 4 assembly CP ... assembly CP N Compiling CP 1 specs CP 2 CP 3 CP 4 CP ... CP N Archs generating Asm and Binaries BondMachine Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 52/117
  • 155. Architectures Molding Bondgo Bondgo A multi-core example multi-core counter package main import ( bondgo ) func pong () { var in0 bondgo.Input var out0 bondgo.Output in0 = bondgo.Make(bondgo.Input , 3) out0 = bondgo.Make(bondgo.Output , 5) for { bondgo.IOWrite(out0 , bondgo.IORead(in0)+1) } } func main () { var in0 bondgo.Input var out0 bondgo.Output in0 = bondgo.Make(bondgo.Input , 5) out0 = bondgo.Make(bondgo.Output , 3) device_0: go pong () for { bondgo.IOWrite(out0 , bondgo.IORead(in0)) } } Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 53/117
  • 156. Architectures Molding Bondgo Bondgo A multi-core example Compiling the code with the bondgo compiler: bondgo -input-file ds.go -mpm The toolchain perform the following steps: Map the two goroutines to two hardware cores. Creates two types of core, each one optimized to execute the assigned goroutine. Creates the two binaries. Connected the two core as inferred from the source code, using special IO registers. The result is a multicore BondMachine: Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 54/117
  • 157. Architectures Molding Bondgo Bondgo A multi-core example Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 55/117
  • 158. Architectures Molding Bondgo Simulation demo Demo N.8 It will be shown how: To use bondgo to create a chain of interconnected processors To flash the firmware to the board Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 56/117
  • 159. Architectures Molding Bondgo Compiling Architectures One of the most important result The architecture creation is a part of the compilation process. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 57/117
  • 160. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 161. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 162. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 163. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 164. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 165. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 166. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 167. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 168. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 169. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 170. Architectures Molding Bondgo Bondgo Go in hardware Bondgo implements a sort of “Go in hardware”. High level Go source code is directly mapped to interconnected processors without Operating Systems or runtimes. Goroutines Cores Channels Channel SOs Variables (local) CP RAM segments Variables (by val) CH messages or IO regs Variables (by ref) Shared RAM segments Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 58/117
  • 171. Architectures Molding Bondgo Go in hardware Second idea on the BondMachine The idea was: Build a computing system with a decreased number of layers resulting in a lower HW/SW gap. This would raise the overall performances yet keeping an user friendly way of programming. Between HW and SW there is only the processor abstraction, no Operating System nor runtimes. Despite that programming is done at high level. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 59/117
  • 172. Architectures Molding Bondgo Layers, Abstractions and Interfaces and BondMachines Programming language Compiler Language BondMachine Register Machine Opcodes Transistors Hardware Specialization Performance Software Generalization User friendly Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 60/117
  • 173. Architectures Molding Bondgo Bondgo An example bondgo stream processing example package main import ( bondgo ) func streamprocessor (a *[] uint8 , b *[] uint8 , c *[] uint8 , gid uint8) { (*c)[gid] = (*a)[gid] + (*b)[gid] } func main () { a := make ([] uint8 , 256) b := make ([] uint8 , 256) c := make ([] uint8 , 256) // ... some a and b values fill for i := 0; i 256; i++ { go streamprocessor (a, b, c, uint8(i)) } } The compilation of this example results in the creation of a 257 CPs where 256 are the stream processors executing the code in the function called streamprocessor, and one is the coordinating CP. Each stream processor is optimized and capable only to make additions since it is the only operation requested by the source code. The three slices created on the main function are passed by reference to the Goroutines then a shared RAM is created by the Bondgo compiler available to the generated CPs. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 61/117
  • 174. Architectures Molding Bondgo Bondgo Abstract Assembly The Assembly language for the BM has been kept as independent as possible from the particular CP. Given a specific piece of assembly code Bondgo has the ability to compute the “minimum CP” that can execute that code. These are Building Blocks for complex BondMachines. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 62/117
  • 175. Architectures Molding Builders API Builders API Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 63/117
  • 176. Architectures Molding Builders API Builders API With these Building Blocks Several libraries have been developed to map specific problems on BondMachines: Symbond, to handle mathematical expression. Boolbond, to map boolean expression. Matrixwork, to perform matrices operations. Neuralbond, to use neural networks. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
  • 177. Architectures Molding Builders API Builders API With these Building Blocks Several libraries have been developed to map specific problems on BondMachines: Symbond, to handle mathematical expression. Boolbond, to map boolean expression. Matrixwork, to perform matrices operations. Neuralbond, to use neural networks. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
  • 178. Architectures Molding Builders API Builders API With these Building Blocks Several libraries have been developed to map specific problems on BondMachines: Symbond, to handle mathematical expression. Boolbond, to map boolean expression. Matrixwork, to perform matrices operations. Neuralbond, to use neural networks. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
  • 179. Architectures Molding Builders API Builders API With these Building Blocks Several libraries have been developed to map specific problems on BondMachines: Symbond, to handle mathematical expression. Boolbond, to map boolean expression. Matrixwork, to perform matrices operations. Neuralbond, to use neural networks. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
  • 180. Architectures Molding Builders API Builders API With these Building Blocks Several libraries have been developed to map specific problems on BondMachines: Symbond, to handle mathematical expression. Boolbond, to map boolean expression. Matrixwork, to perform matrices operations. Neuralbond, to use neural networks. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 64/117
  • 181. Architectures Molding Builders API Builders API Symbond A mathematical expression, or a system can be converted to a BondMachine: sum(var(x),const(2)) Boolbond symbond -expression sum(var(x),const(2)) -save-bondmachine bondmachine.json Resulting in: Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 65/117
  • 182. Architectures Molding Builders API Builders API Symbond A mathematical expression, or a system can be converted to a BondMachine: sum(var(x),const(2)) Boolbond symbond -expression sum(var(x),const(2)) -save-bondmachine bondmachine.json Resulting in: Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 65/117
  • 183. Architectures Molding Builders API Builders API Symbond A mathematical expression, or a system can be converted to a BondMachine: sum(var(x),const(2)) Boolbond symbond -expression sum(var(x),const(2)) -save-bondmachine bondmachine.json Resulting in: Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 65/117
  • 184. Architectures Molding Builders API Builders API Symbond Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 66/117
  • 185. Architectures Molding Builders API Builders API Boolbond A system of boolean equations, input and output variables are expressed as in the example file: var(z)=or(var(x),not(var(y))) var(t)=or(and(var(x),var(y)),var(z)) var(l)=and(xor(var(x),var(y)),var(t)) i:var(x) i:var(y) o:var(z) o:var(t) o:var(l) Boolbond boolbond -system-file expression.txt -save-bondmachine bondmachine.json Resulting in: Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 67/117
  • 186. Architectures Molding Builders API Builders API Boolbond A system of boolean equations, input and output variables are expressed as in the example file: var(z)=or(var(x),not(var(y))) var(t)=or(and(var(x),var(y)),var(z)) var(l)=and(xor(var(x),var(y)),var(t)) i:var(x) i:var(y) o:var(z) o:var(t) o:var(l) Boolbond boolbond -system-file expression.txt -save-bondmachine bondmachine.json Resulting in: Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 67/117
  • 187. Architectures Molding Builders API Builders API Boolbond A system of boolean equations, input and output variables are expressed as in the example file: var(z)=or(var(x),not(var(y))) var(t)=or(and(var(x),var(y)),var(z)) var(l)=and(xor(var(x),var(y)),var(t)) i:var(x) i:var(y) o:var(z) o:var(t) o:var(l) Boolbond boolbond -system-file expression.txt -save-bondmachine bondmachine.json Resulting in: Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 67/117
  • 188. Architectures Molding Builders API Builders API Boolbond Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 68/117
  • 189. Architectures Molding Builders API Boolbond demo Demo N.9 It will be shown how: To create complex multi-cores from boolean expressions Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 69/117
  • 190. Architectures Molding Builders API Builders API Matrixwork Matrix multiplication if mymachine, ok := matrixwork.Build_M(n, t); ok == nil ... Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 70/117
  • 191. Architectures Molding Evolutionary BondMachine Evolutionary BondMachine Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 71/117
  • 192. Architectures Molding Evolutionary BondMachine Evolutionary BondMachine Find an architecture that solve a problem Simulation Framework Metrics to check how well a BM solve a problem Building Blocks of the BM Population of BondMachines Genetically Evolved Architectures Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
  • 193. Architectures Molding Evolutionary BondMachine Evolutionary BondMachine Find an architecture that solve a problem Simulation Framework Metrics to check how well a BM solve a problem Building Blocks of the BM Population of BondMachines Genetically Evolved Architectures Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
  • 194. Architectures Molding Evolutionary BondMachine Evolutionary BondMachine Find an architecture that solve a problem Simulation Framework Metrics to check how well a BM solve a problem Building Blocks of the BM Population of BondMachines Genetically Evolved Architectures Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
  • 195. Architectures Molding Evolutionary BondMachine Evolutionary BondMachine Find an architecture that solve a problem Simulation Framework Metrics to check how well a BM solve a problem Building Blocks of the BM Population of BondMachines Genetically Evolved Architectures Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
  • 196. Architectures Molding Evolutionary BondMachine Evolutionary BondMachine Find an architecture that solve a problem Simulation Framework Metrics to check how well a BM solve a problem Building Blocks of the BM Population of BondMachines Genetically Evolved Architectures Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
  • 197. Architectures Molding Evolutionary BondMachine Evolutionary BondMachine Find an architecture that solve a problem Simulation Framework Metrics to check how well a BM solve a problem Building Blocks of the BM Population of BondMachines Genetically Evolved Architectures Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 72/117
  • 198. Architectures Molding Machine Learning with BondMachine section Machine Learning Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 73/117
  • 199. Architectures Molding Machine Learning with BondMachine section Machine Learning with BondMachine Architectures with multiple interconnected processors like the ones produced by the BondMachine Toolkit are a perfect fit for Neural Networks and Computational Graphs. Several ways to map this structures to BondMachine has been developed: A native Neural Network library A Tensorflow to BondMachine translator An NNEF based BondMachine composer Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 74/117
  • 200. Architectures Molding Machine Learning with BondMachine section Machine Learning with BondMachine Architectures with multiple interconnected processors like the ones produced by the BondMachine Toolkit are a perfect fit for Neural Networks and Computational Graphs. Several ways to map this structures to BondMachine has been developed: A native Neural Network library A Tensorflow to BondMachine translator An NNEF based BondMachine composer Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 74/117
  • 201. Architectures Molding Bondmachine native Neural Network library Machine Learning with BondMachine Native Neural Network library The tool neuralbond allow the creation of BM-based neural chips from an API go interface. Neurons are converted to BondMachine connecting processors. Tensors are mapped to CP connections. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 75/117
  • 202. Architectures Molding TensorFlow to Bondmachine TensorFlow™ to Bondmachine tf2bm TensorFlow™ is an open source software library for numerical computation using data flow graphs. Graphs can be converted to BondMachines with the tf2bm tool. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 76/117
  • 203. Architectures Molding NNEF Composer Machine Learning with BondMachine NNEF Composer Neural Network Exchange Format (NNEF) is a standard from Khronos Group to enable the easy transfer of trained networks among frameworks, inference engines and devices The NNEF BM tool approach is to descent NNEF models and build BondMachine multi-core accordingly This approch has several advandages over the previous: It is not limited to a single framework NNEF is a textual file, so no complex operations are needed to read models Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 77/117
  • 204. Hardware Hardware Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 78/117
  • 205. Hardware Hardware implementation FPGA The HDL code for the BondMachine is written in Verilog and System Verilog, and has been tested on these devices/system: Digilent Basys3 - Xilinx Artix-7 - Vivado. Kintex7 Evaluation Board - Vivado. Digilent Zedboard - Xilinx Zynq 7020 - Vivado. Linux - Iverilog. Terasic De10nano - Intel Cyclone V - Quartus Within the project other firmwares have been written or tested: Microchip ENC28J60 Ethernet interface controller. Microchip ENC424J600 10/100 Base-T Ethernet interface controller. ESP8266 Wi-Fi chip. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 79/117
  • 206. Prototype The Prototype The project has been selected for the participation at MakerFaire 2016 Rome (The Europen Edition) and a prototype has been assembled and presented. First run Youtube video Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 80/117
  • 207. Clustering Clustering Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 81/117
  • 208. Clustering BondMachine Clustering So far we saw: An user friendly approach to create processors (single core). Optimizing a single device to support intricate computational work-flows (multi-cores) over an heterogeneous layer. Interconnected BondMachines What if we could extend the this layer to multiple interconnected devices ? Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 82/117
  • 209. Clustering BondMachine Clustering So far we saw: An user friendly approach to create processors (single core). Optimizing a single device to support intricate computational work-flows (multi-cores) over an heterogeneous layer. Interconnected BondMachines What if we could extend the this layer to multiple interconnected devices ? Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 82/117
  • 210. Clustering BondMachine Clustering So far we saw: An user friendly approach to create processors (single core). Optimizing a single device to support intricate computational work-flows (multi-cores) over an heterogeneous layer. Interconnected BondMachines What if we could extend the this layer to multiple interconnected devices ? Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 82/117
  • 211. Clustering BondMachine Clustering So far we saw: An user friendly approach to create processors (single core). Optimizing a single device to support intricate computational work-flows (multi-cores) over an heterogeneous layer. Interconnected BondMachines What if we could extend the this layer to multiple interconnected devices ? Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 82/117
  • 212. Clustering BondMachine Clustering The same logic existing among CP have been extended among different BondMachines organized in clusters. Protocols, one ethernet called etherbond and one using UDP called udpbond have been created for the purpose. FPGA based BondMachines, standard Linux Workstations, Emulated BondMachines might join a cluster an contribute to a single distributed computational problem. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 83/117
  • 213. Clustering BondMachine Clustering The same logic existing among CP have been extended among different BondMachines organized in clusters. Protocols, one ethernet called etherbond and one using UDP called udpbond have been created for the purpose. FPGA based BondMachines, standard Linux Workstations, Emulated BondMachines might join a cluster an contribute to a single distributed computational problem. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 83/117
  • 214. Clustering BondMachine Clustering The same logic existing among CP have been extended among different BondMachines organized in clusters. Protocols, one ethernet called etherbond and one using UDP called udpbond have been created for the purpose. FPGA based BondMachines, standard Linux Workstations, Emulated BondMachines might join a cluster an contribute to a single distributed computational problem. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 83/117
  • 215. Clustering BondMachine Clustering Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 84/117
  • 216. Clustering Redeployer BondMachine Clustering A distributed example distributed counter package main import ( bondgo ) func pong () { var in0 bondgo.Input var out0 bondgo.Output in0 = bondgo.Make(bondgo.Input , 3) out0 = bondgo.Make(bondgo.Output , 5) for { bondgo.IOWrite(out0 , bondgo.IORead(in0)+1) } } func main () { var in0 bondgo.Input var out0 bondgo.Output in0 = bondgo.Make(bondgo.Input , 5) out0 = bondgo.Make(bondgo.Output , 3) device_1: go pong () for { bondgo.IOWrite(out0 , bondgo.IORead(in0)) } } Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 85/117
  • 217. Clustering Redeployer BondMachine Clustering A distributed example Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 86/117
  • 218. Clustering Redeployer BondMachine Clustering A distributed example The result is: BondMachine Clustering Youtube video A general result Parts of the system can be redeployed among different devices without changing the system behavior (only the performances). Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 87/117
  • 219. Clustering Redeployer BondMachine Clustering Results Results User can deploy an entire HW/SW cluster starting from code written in a high level description (Go, NNEF, etc) Workstation with emulated BondMachines, workstation with etherbond drivers, standalone BondMachines (FPGA) may join these clusters. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 88/117
  • 220. Clustering Redeployer BondMachine Clustering Results Results User can deploy an entire HW/SW cluster starting from code written in a high level description (Go, NNEF, etc) Workstation with emulated BondMachines, workstation with etherbond drivers, standalone BondMachines (FPGA) may join these clusters. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 88/117
  • 221. Use cases Use cases Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 89/117
  • 222. Use cases Use cases Two use cases in Physics experiments are currently being developed: Real time pulse shape analysis in neutron detectors bringing the intelligence to the edge Test beam for space experiments (DAMPE, HERD) increasing testbed operations efficiency Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 90/117
  • 223. Use cases Physics Real time pulse shape analysis in neutron detectors The operation of the new generation of high-intensity neutron sources like SNS, JSNS and European Spallation Source (ESS, Lund, Sweden), now under construction, are introducing a new demand for neutron detection capabilities. These demands yield to the need for new data collection procedures and new technology based on solid state Si devices. We are trying to use BondMachines to make the real time shape analysis in this kind of detecting devices. Courtesy of Prof. F.Sacchetti Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 91/117
  • 224. Use cases Physics Test beam for space experiments (DAMPE, HERD) Trigger logic for test beams Courtesy of V.Vagelli and M.Duranti Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 92/117
  • 225. Use cases Physics Test beam for space experiments (DAMPE, HERD) Trigger logic for test beams Courtesy of V.Vagelli and M.Duranti Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 93/117
  • 226. Use cases Physics Test beam for space experiments (DAMPE, HERD) Trigger logic for test beams We are trying to explore the possibility of using BondMachine to handle efficiently this kind of operations. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 94/117
  • 227. Use cases Other uses Possible other uses The BondMachine could be used in several types of real world applications, some of them being: IoT and CyberPhysical systems. Computer Science educational applications. Computing Accelerator Our effort is now in enabling the possibility of building computing accelerators to be used from within standard (Linux) applications. Mirko Mariotti The BondMachine Toolkit NIPS Summer School - Perugia 3-6 Sept 95/117