SlideShare a Scribd company logo
1 of 5
Download to read offline
ETI Many-Core Software Architecture for Large-Scale Systems
Executive Summary

ET International, Inc. (“ETI”) specializes in many-core, high-
performance computing. Traditional operating systems (OS) are based on
the sequential execution model developed in the 1960s. Such operating
systems cannot address new many-core parallel hardware architecture
without major overhauls. With the computing industry’s rapid progress
toward many-core chipsets and technology, the classical OS and system
software are facing unprecedented challenges.

ETI offers an innovative software platform solution designed specifically
for many-core systems. Our software works with the classical OS
seamlessly, and has been deployed on the IBM Cyclops-64 many-core
chip, comprising 160-cores on a chip and scaling over hundreds of chips.
This document provides an overview of how ETI’s software system
resolves multiple issues facing many-core hardware platforms.

Why Use ETI’s Software System?

How do you harness the power of a >10,000-core system? By using a Linux cluster, or do you need
something else?

With the advent of the many-core revolution, fundamental “technology gaps” need to be bridged before
applications can fully take advantage of the massive parallelism and high-performance features of
multi-core architectures. As noted above, traditional operating systems are based on the 1960s’
sequential execution model. These operating systems are at a disadvantage vis a vis many-core
hardware.

Tradition OS has poor scaling capability
Traditional operating systems are intended for single-core processors. Direct porting of existing single-
or dual-core-based systems and application software
solutions onto a many-core system leads to poor
performance that deteriorates further as the number of
cores increase. Without major software adaptation,
traditional operating systems cannot execute runtime
processes efficiently beyond several cores.

The traditional OS model has four major architectural
problems:

Inefficient handling of parallel processing
Traditional operating systems are architected in ways to optimize resource sharing for multitasking and
independent jobs. In high-performance parallel computing, a job is subdivided into a large pool of
smaller, related tasks.


                                For More Information: Ray Kung rkung@etinternational.com
                            100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438
                                                                                                       1
These tasks interact with each other through a run time system
(RTS). However, the OS and RTS are unaware of each other and
do not share information to optimize performance. In fact, they
often act in opposing ways. For example, a task might be waiting
on a resource that is expected to be free in the next a few
milliseconds; the OS may swap out the task and run something
else instead of letting the task continue. The constant swapping of
tasks causes unnecessary overhead. The overhead increases
proportionally with the number of tasks running.

The large number of parallel processing tasks becomes very inefficient in traditional operating
systems. Each task needs timely access to the underlying hardware at the right time and the right place,
given their mutual dependence.

Ineffective control structure among cores
Traditional operating systems have a command-and-control operating
structure wherein the OS plays the central administrative and planning role
among all threads and cores. This micro-management of local administrative
operations ensures task completion, but is highly inefficient. To manage all
resources in a global manner, the OS uses centralized control and data
structures. As the number of cores grows arithmetically, the performance
bottleneck grows geometrically, because the central OS is now inundated
with request messages.

Inadequate synchronization and communication among large numbers of parallel tasks
Traditional operating systems provide very limited synchronization and communication mechanisms
among cores. In general, processes that must
communicate and synchronize with each other do so
through the OS. This lack of coordination becomes a
real bottleneck, because two breakthroughs in many-
core chip technology are ultra-fast on-chip bandwidths
among cores, and among memory banks. Relying on the traditional OS to coordinate parallel
processing will waste vast amounts of communication bandwidth and system resources.

Abstraction of hardware resources creates unnecessary indirection
Traditional OSes hide the underlying hardware resources from the programmer. Thus, a program would
see a 4-core environment the same way as a 64-core
environment. Additionally, it would not react differently
if there were interfering external programs or not. While
such hardware abstraction enables portability and ease of
programming, it limits optimal use of the underlying
resources. For example, if a program knew that it was
operating on a 32 core environment, it would be more
efficient to spawn 32 threads such that one runs non-
preemptively on each core rather than to spawn more threads and context switch between them or
fewer threads and fail to exploit the available resources.


                                For More Information: Ray Kung rkung@etinternational.com
                            100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438
                                                                                                      2
ETI’s Solution

To resolve these shortcomings, ETI has developed an innovative parallel system software solution
addressing each traditional OS limitation. ETI’s solution relieves the operating system of micro-
management, and has adapted a fine-grained decentralized
software model to unleash the power of many-core parallel
computing.

The motivation behind this decentralized system software
model is to provide an on-chip control unit that is
adaptable and customizable to the local terrain or cores.
This is more efficient and flexible than a conventional
centralized control unit. In our implementation, ETI’s
system software relieves the operating system of core-to-
core coordination and local resource management. Instead,
ETI’s FIRST (Flexible Integrated Runtime System
Technology), an execution environment with an embedded micro-OS kernel, is tasked to manage
parallel operations on the chip.

FIRST provides a runtime execution
environment that features:
   1) Fine-grained task scheduling among
       cores,
   2) Effective       coordination    of
       communication and synchronization
       among parallel tasks, and
   3) Dynamic load balancing across the
       entire system.

In addition to creating a co-processing
layered operating system structure, another
distinguishing feature of ETI’s parallel
platform is that it creates a fine-grained
dataflow-like hierarchical thread platform and is a clear breakthrough from traditional coarse-grained
multithreading.

Fine-grained parallel system
The improvement comes in the form of
prioritizing the execution of threads based on
their “readiness” to be executed. Instead of
using sequential program counters as in a
traditional OS, ETI’s fine-grained parallel
software system executes parallel tasks
depending on the “readiness” of the input
arguments and available resources.

As an example: In a sequential counter-based
system, similar to cooking a dish, the recipe is

                                For More Information: Ray Kung rkung@etinternational.com
                            100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438
                                                                                                    3
broken down into a series of sequential steps such as 1) measuring the ingredient amounts, 2) mixing
ingredient A with ingredient B, and 3) pouring white wine to the mixture, etc,

In a dataflow execution environment, sequential steps/counters are irrelevant. Instead, inputs are
crucial. This is similar to a restaurant’s kitchen with a team of chefs and helpers. Each person works at
his or her workstation, and when ingredients enter a workspace, the person stationed begins mixing or
chopping these ingredients until done. The output then passes to another person as input. Thus, each
person is performing a minute operation independent of the sequential procedure and dependent only
on sufficient inputs. This whole process is analogous to a dataflow environment, wherein each atomic
operation is a fine-grained task.

By continuously sub-dividing tasks to finer-grained tasks, and assigning them across many cores as
non-preemptible atomic operations, ETI’s FIRST technology has provided a foundation for resolving
the scalability issue facing traditional operating systems. The granularity of the tasks is dictated by the
hardware environment in such a way to minimize run time system overhead (i.e. creating tasks,
determining the next non-preemptive task to fire, etc) while fully exploiting the parallel concurrency of
the application.

ETI Software vs. Traditional Software




                                For More Information: Ray Kung rkung@etinternational.com
                            100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438
                                                                                                         4
Overview of ETI Software Platform1

ETI understands both the benefits and the bottlenecks of the traditional OS, and our product is tailor-
made for many-core architectures. Two decades of research have led us to develop a dataflow-like
software platform for many-core computing. In developing the IBM Cyclops-64 (C64) supercomputer,
ETI has been the sole software vendor to design and implement a software infrastructure to unleash the
full power of the many-core C64 system.

Our latest version of the system software is running
on 8000+ cores, and over 50 C64 blades2. Our
software is designed to work on over 1 million
cores. This highly scalable system software includes
the following design infrastructure:

•      FIRST – Flexible Integrated Runtime System
       Technology.      Delivers a parallel runtime
       execution model that manages hardware
       resources efficiently, and exposes them to user
       applications;
•      Control and monitoring software – Boots the
       system, ensures safe operating conditions and
       detects hardware anomalies;
•      Resource manager – Schedules jobs across the entire system under a multi-user environment;
•      Software development toolkit (SDK) – Provides a Linux-based environment with which to compile,
       debug and simulate the many-core chip platforms.	
  
1
    For more technical information, please see ETI OS technical whitepaper
2
    A blade consists of a C64 chip, a memory unit, an FPGA, and Ethernet. See C64 hardware whitepaper




                                         For More Information: Ray Kung rkung@etinternational.com
                                     100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438
                                                                                                         5

More Related Content

What's hot

Distributed computing for new bloods
Distributed computing for new bloodsDistributed computing for new bloods
Distributed computing for new bloodsRaymond Tay
 
lecture 1 (Part 2) kernal and its categories
lecture 1 (Part 2) kernal and its categorieslecture 1 (Part 2) kernal and its categories
lecture 1 (Part 2) kernal and its categoriesWajeehaBaig
 
Distributed operating system
Distributed operating systemDistributed operating system
Distributed operating systemPrankit Mishra
 
Distributed operating system
Distributed operating systemDistributed operating system
Distributed operating systemudaya khanal
 
SOLUTION MANUAL OF OPERATING SYSTEM CONCEPTS BY ABRAHAM SILBERSCHATZ, PETER B...
SOLUTION MANUAL OF OPERATING SYSTEM CONCEPTS BY ABRAHAM SILBERSCHATZ, PETER B...SOLUTION MANUAL OF OPERATING SYSTEM CONCEPTS BY ABRAHAM SILBERSCHATZ, PETER B...
SOLUTION MANUAL OF OPERATING SYSTEM CONCEPTS BY ABRAHAM SILBERSCHATZ, PETER B...vtunotesbysree
 
Bce notes unit 1 be 205
Bce notes unit 1 be 205Bce notes unit 1 be 205
Bce notes unit 1 be 205NANDINI SHARMA
 
Operating Systems Part I-Basics
Operating Systems Part I-BasicsOperating Systems Part I-Basics
Operating Systems Part I-BasicsAjit Nayak
 
Operating system concepts ninth edition (2012), chapter 2 solution e1
Operating system concepts ninth edition (2012), chapter 2 solution e1Operating system concepts ninth edition (2012), chapter 2 solution e1
Operating system concepts ninth edition (2012), chapter 2 solution e1Navid Daneshvaran
 
Chap1
Chap1Chap1
Chap1adisi
 
Operating System & Utility Programme
Operating System & Utility ProgrammeOperating System & Utility Programme
Operating System & Utility Programmebbp2067
 
Operating system by aman kr kushwaha
Operating system by aman kr kushwahaOperating system by aman kr kushwaha
Operating system by aman kr kushwahaAMAN KUMAR KUSHWAHA
 
Class 1: Introduction - What is an Operating System?
Class 1: Introduction - What is an Operating System?Class 1: Introduction - What is an Operating System?
Class 1: Introduction - What is an Operating System?David Evans
 

What's hot (20)

Distributed computing for new bloods
Distributed computing for new bloodsDistributed computing for new bloods
Distributed computing for new bloods
 
lecture 1 (Part 2) kernal and its categories
lecture 1 (Part 2) kernal and its categorieslecture 1 (Part 2) kernal and its categories
lecture 1 (Part 2) kernal and its categories
 
Distributed operating system
Distributed operating systemDistributed operating system
Distributed operating system
 
Distributed operating system
Distributed operating systemDistributed operating system
Distributed operating system
 
20120140506012
2012014050601220120140506012
20120140506012
 
Operating System Structure
Operating System StructureOperating System Structure
Operating System Structure
 
SOLUTION MANUAL OF OPERATING SYSTEM CONCEPTS BY ABRAHAM SILBERSCHATZ, PETER B...
SOLUTION MANUAL OF OPERATING SYSTEM CONCEPTS BY ABRAHAM SILBERSCHATZ, PETER B...SOLUTION MANUAL OF OPERATING SYSTEM CONCEPTS BY ABRAHAM SILBERSCHATZ, PETER B...
SOLUTION MANUAL OF OPERATING SYSTEM CONCEPTS BY ABRAHAM SILBERSCHATZ, PETER B...
 
Bce notes unit 1 be 205
Bce notes unit 1 be 205Bce notes unit 1 be 205
Bce notes unit 1 be 205
 
kernels
 kernels kernels
kernels
 
Operating Systems Part I-Basics
Operating Systems Part I-BasicsOperating Systems Part I-Basics
Operating Systems Part I-Basics
 
2. microkernel new
2. microkernel new2. microkernel new
2. microkernel new
 
Kernel (computing)
Kernel (computing)Kernel (computing)
Kernel (computing)
 
Kernel (OS)
Kernel (OS)Kernel (OS)
Kernel (OS)
 
Operating system concepts ninth edition (2012), chapter 2 solution e1
Operating system concepts ninth edition (2012), chapter 2 solution e1Operating system concepts ninth edition (2012), chapter 2 solution e1
Operating system concepts ninth edition (2012), chapter 2 solution e1
 
Chap1
Chap1Chap1
Chap1
 
Operating System & Utility Programme
Operating System & Utility ProgrammeOperating System & Utility Programme
Operating System & Utility Programme
 
Operating system by aman kr kushwaha
Operating system by aman kr kushwahaOperating system by aman kr kushwaha
Operating system by aman kr kushwaha
 
Class 1: Introduction - What is an Operating System?
Class 1: Introduction - What is an Operating System?Class 1: Introduction - What is an Operating System?
Class 1: Introduction - What is an Operating System?
 
Distributed Operating System_1
Distributed Operating System_1Distributed Operating System_1
Distributed Operating System_1
 
Linux kernel
Linux kernelLinux kernel
Linux kernel
 

Similar to ETI Software System

Implementation of Kernel API
Implementation of Kernel APIImplementation of Kernel API
Implementation of Kernel APIkhushi74
 
LM1 - Computer System Overview, system calls
LM1 - Computer System Overview, system callsLM1 - Computer System Overview, system calls
LM1 - Computer System Overview, system callsmanideepakc
 
1.multicore processors
1.multicore processors1.multicore processors
1.multicore processorsHebeon1
 
MODERN OPERATING SYSTEM
MODERN OPERATING SYSTEMMODERN OPERATING SYSTEM
MODERN OPERATING SYSTEMkhanz2012
 
Michael Gschwind et al, "An Open Source Environment for Cell Broadband Engine...
Michael Gschwind et al, "An Open Source Environment for Cell Broadband Engine...Michael Gschwind et al, "An Open Source Environment for Cell Broadband Engine...
Michael Gschwind et al, "An Open Source Environment for Cell Broadband Engine...Michael Gschwind
 
Operating system basics, Types of operating systems, Tasks, Process and Thre...
Operating system basics, Types of operating  systems, Tasks, Process and Thre...Operating system basics, Types of operating  systems, Tasks, Process and Thre...
Operating system basics, Types of operating systems, Tasks, Process and Thre...SattiBabu16
 
CC LECTURE NOTES (1).pdf
CC LECTURE NOTES (1).pdfCC LECTURE NOTES (1).pdf
CC LECTURE NOTES (1).pdfHasanAfwaaz1
 
Basics of OS & RTOS.ppt
Basics of OS & RTOS.pptBasics of OS & RTOS.ppt
Basics of OS & RTOS.pptDr.YNM
 
Linux Assignment 3
Linux Assignment 3Linux Assignment 3
Linux Assignment 3Diane Allen
 
MYSQL DATABASE Operating System Part2 (1).pptx
MYSQL DATABASE Operating System Part2 (1).pptxMYSQL DATABASE Operating System Part2 (1).pptx
MYSQL DATABASE Operating System Part2 (1).pptxArjayBalberan1
 
Operating systems
Operating systemsOperating systems
Operating systemsanishgoel
 
Enea Enabling Real-Time in Linux Whitepaper
Enea Enabling Real-Time in Linux WhitepaperEnea Enabling Real-Time in Linux Whitepaper
Enea Enabling Real-Time in Linux WhitepaperEnea Software AB
 
Synergistic processing in cell's multicore architecture
Synergistic processing in cell's multicore architectureSynergistic processing in cell's multicore architecture
Synergistic processing in cell's multicore architectureMichael Gschwind
 
An operating system for multicore and clouds: mechanism and implementation
An operating system for multicore and clouds: mechanism and implementationAn operating system for multicore and clouds: mechanism and implementation
An operating system for multicore and clouds: mechanism and implementationMohanadarshan Vivekanandalingam
 

Similar to ETI Software System (20)

Implementation of Kernel API
Implementation of Kernel APIImplementation of Kernel API
Implementation of Kernel API
 
LM1 - Computer System Overview, system calls
LM1 - Computer System Overview, system callsLM1 - Computer System Overview, system calls
LM1 - Computer System Overview, system calls
 
UNIT 2.pptx
UNIT 2.pptxUNIT 2.pptx
UNIT 2.pptx
 
Distributed Computing
Distributed ComputingDistributed Computing
Distributed Computing
 
1.multicore processors
1.multicore processors1.multicore processors
1.multicore processors
 
MODERN OPERATING SYSTEM
MODERN OPERATING SYSTEMMODERN OPERATING SYSTEM
MODERN OPERATING SYSTEM
 
Michael Gschwind et al, "An Open Source Environment for Cell Broadband Engine...
Michael Gschwind et al, "An Open Source Environment for Cell Broadband Engine...Michael Gschwind et al, "An Open Source Environment for Cell Broadband Engine...
Michael Gschwind et al, "An Open Source Environment for Cell Broadband Engine...
 
HPCC Presentation
HPCC PresentationHPCC Presentation
HPCC Presentation
 
Operating system basics, Types of operating systems, Tasks, Process and Thre...
Operating system basics, Types of operating  systems, Tasks, Process and Thre...Operating system basics, Types of operating  systems, Tasks, Process and Thre...
Operating system basics, Types of operating systems, Tasks, Process and Thre...
 
CC LECTURE NOTES (1).pdf
CC LECTURE NOTES (1).pdfCC LECTURE NOTES (1).pdf
CC LECTURE NOTES (1).pdf
 
Clusters
ClustersClusters
Clusters
 
Basics of OS & RTOS.ppt
Basics of OS & RTOS.pptBasics of OS & RTOS.ppt
Basics of OS & RTOS.ppt
 
Linux Assignment 3
Linux Assignment 3Linux Assignment 3
Linux Assignment 3
 
Barrelfish OS
Barrelfish OS Barrelfish OS
Barrelfish OS
 
MYSQL DATABASE Operating System Part2 (1).pptx
MYSQL DATABASE Operating System Part2 (1).pptxMYSQL DATABASE Operating System Part2 (1).pptx
MYSQL DATABASE Operating System Part2 (1).pptx
 
Operating systems
Operating systemsOperating systems
Operating systems
 
Enea Enabling Real-Time in Linux Whitepaper
Enea Enabling Real-Time in Linux WhitepaperEnea Enabling Real-Time in Linux Whitepaper
Enea Enabling Real-Time in Linux Whitepaper
 
Synergistic processing in cell's multicore architecture
Synergistic processing in cell's multicore architectureSynergistic processing in cell's multicore architecture
Synergistic processing in cell's multicore architecture
 
OS Unit 2.pptx
OS Unit 2.pptxOS Unit 2.pptx
OS Unit 2.pptx
 
An operating system for multicore and clouds: mechanism and implementation
An operating system for multicore and clouds: mechanism and implementationAn operating system for multicore and clouds: mechanism and implementation
An operating system for multicore and clouds: mechanism and implementation
 

More from IBM India Smarter Computing

Using the IBM XIV Storage System in OpenStack Cloud Environments
Using the IBM XIV Storage System in OpenStack Cloud Environments Using the IBM XIV Storage System in OpenStack Cloud Environments
Using the IBM XIV Storage System in OpenStack Cloud Environments IBM India Smarter Computing
 
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...IBM India Smarter Computing
 
A Comparison of PowerVM and Vmware Virtualization Performance
A Comparison of PowerVM and Vmware Virtualization PerformanceA Comparison of PowerVM and Vmware Virtualization Performance
A Comparison of PowerVM and Vmware Virtualization PerformanceIBM India Smarter Computing
 
IBM pureflex system and vmware vcloud enterprise suite reference architecture
IBM pureflex system and vmware vcloud enterprise suite reference architectureIBM pureflex system and vmware vcloud enterprise suite reference architecture
IBM pureflex system and vmware vcloud enterprise suite reference architectureIBM India Smarter Computing
 

More from IBM India Smarter Computing (20)

Using the IBM XIV Storage System in OpenStack Cloud Environments
Using the IBM XIV Storage System in OpenStack Cloud Environments Using the IBM XIV Storage System in OpenStack Cloud Environments
Using the IBM XIV Storage System in OpenStack Cloud Environments
 
All-flash Needs End to End Storage Efficiency
All-flash Needs End to End Storage EfficiencyAll-flash Needs End to End Storage Efficiency
All-flash Needs End to End Storage Efficiency
 
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
TSL03104USEN Exploring VMware vSphere Storage API for Array Integration on th...
 
IBM FlashSystem 840 Product Guide
IBM FlashSystem 840 Product GuideIBM FlashSystem 840 Product Guide
IBM FlashSystem 840 Product Guide
 
IBM System x3250 M5
IBM System x3250 M5IBM System x3250 M5
IBM System x3250 M5
 
IBM NeXtScale nx360 M4
IBM NeXtScale nx360 M4IBM NeXtScale nx360 M4
IBM NeXtScale nx360 M4
 
IBM System x3650 M4 HD
IBM System x3650 M4 HDIBM System x3650 M4 HD
IBM System x3650 M4 HD
 
IBM System x3300 M4
IBM System x3300 M4IBM System x3300 M4
IBM System x3300 M4
 
IBM System x iDataPlex dx360 M4
IBM System x iDataPlex dx360 M4IBM System x iDataPlex dx360 M4
IBM System x iDataPlex dx360 M4
 
IBM System x3500 M4
IBM System x3500 M4IBM System x3500 M4
IBM System x3500 M4
 
IBM System x3550 M4
IBM System x3550 M4IBM System x3550 M4
IBM System x3550 M4
 
IBM System x3650 M4
IBM System x3650 M4IBM System x3650 M4
IBM System x3650 M4
 
IBM System x3500 M3
IBM System x3500 M3IBM System x3500 M3
IBM System x3500 M3
 
IBM System x3400 M3
IBM System x3400 M3IBM System x3400 M3
IBM System x3400 M3
 
IBM System x3250 M3
IBM System x3250 M3IBM System x3250 M3
IBM System x3250 M3
 
IBM System x3200 M3
IBM System x3200 M3IBM System x3200 M3
IBM System x3200 M3
 
IBM PowerVC Introduction and Configuration
IBM PowerVC Introduction and ConfigurationIBM PowerVC Introduction and Configuration
IBM PowerVC Introduction and Configuration
 
A Comparison of PowerVM and Vmware Virtualization Performance
A Comparison of PowerVM and Vmware Virtualization PerformanceA Comparison of PowerVM and Vmware Virtualization Performance
A Comparison of PowerVM and Vmware Virtualization Performance
 
IBM pureflex system and vmware vcloud enterprise suite reference architecture
IBM pureflex system and vmware vcloud enterprise suite reference architectureIBM pureflex system and vmware vcloud enterprise suite reference architecture
IBM pureflex system and vmware vcloud enterprise suite reference architecture
 
X6: The sixth generation of EXA Technology
X6: The sixth generation of EXA TechnologyX6: The sixth generation of EXA Technology
X6: The sixth generation of EXA Technology
 

Recently uploaded

Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...caitlingebhard1
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingWSO2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaWSO2
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseWSO2
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformWSO2
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....rightmanforbloodline
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governanceWSO2
 

Recently uploaded (20)

Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
API Governance and Monetization - The evolution of API governance
API Governance and Monetization -  The evolution of API governanceAPI Governance and Monetization -  The evolution of API governance
API Governance and Monetization - The evolution of API governance
 

ETI Software System

  • 1. ETI Many-Core Software Architecture for Large-Scale Systems Executive Summary ET International, Inc. (“ETI”) specializes in many-core, high- performance computing. Traditional operating systems (OS) are based on the sequential execution model developed in the 1960s. Such operating systems cannot address new many-core parallel hardware architecture without major overhauls. With the computing industry’s rapid progress toward many-core chipsets and technology, the classical OS and system software are facing unprecedented challenges. ETI offers an innovative software platform solution designed specifically for many-core systems. Our software works with the classical OS seamlessly, and has been deployed on the IBM Cyclops-64 many-core chip, comprising 160-cores on a chip and scaling over hundreds of chips. This document provides an overview of how ETI’s software system resolves multiple issues facing many-core hardware platforms. Why Use ETI’s Software System? How do you harness the power of a >10,000-core system? By using a Linux cluster, or do you need something else? With the advent of the many-core revolution, fundamental “technology gaps” need to be bridged before applications can fully take advantage of the massive parallelism and high-performance features of multi-core architectures. As noted above, traditional operating systems are based on the 1960s’ sequential execution model. These operating systems are at a disadvantage vis a vis many-core hardware. Tradition OS has poor scaling capability Traditional operating systems are intended for single-core processors. Direct porting of existing single- or dual-core-based systems and application software solutions onto a many-core system leads to poor performance that deteriorates further as the number of cores increase. Without major software adaptation, traditional operating systems cannot execute runtime processes efficiently beyond several cores. The traditional OS model has four major architectural problems: Inefficient handling of parallel processing Traditional operating systems are architected in ways to optimize resource sharing for multitasking and independent jobs. In high-performance parallel computing, a job is subdivided into a large pool of smaller, related tasks. For More Information: Ray Kung rkung@etinternational.com 100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438 1
  • 2. These tasks interact with each other through a run time system (RTS). However, the OS and RTS are unaware of each other and do not share information to optimize performance. In fact, they often act in opposing ways. For example, a task might be waiting on a resource that is expected to be free in the next a few milliseconds; the OS may swap out the task and run something else instead of letting the task continue. The constant swapping of tasks causes unnecessary overhead. The overhead increases proportionally with the number of tasks running. The large number of parallel processing tasks becomes very inefficient in traditional operating systems. Each task needs timely access to the underlying hardware at the right time and the right place, given their mutual dependence. Ineffective control structure among cores Traditional operating systems have a command-and-control operating structure wherein the OS plays the central administrative and planning role among all threads and cores. This micro-management of local administrative operations ensures task completion, but is highly inefficient. To manage all resources in a global manner, the OS uses centralized control and data structures. As the number of cores grows arithmetically, the performance bottleneck grows geometrically, because the central OS is now inundated with request messages. Inadequate synchronization and communication among large numbers of parallel tasks Traditional operating systems provide very limited synchronization and communication mechanisms among cores. In general, processes that must communicate and synchronize with each other do so through the OS. This lack of coordination becomes a real bottleneck, because two breakthroughs in many- core chip technology are ultra-fast on-chip bandwidths among cores, and among memory banks. Relying on the traditional OS to coordinate parallel processing will waste vast amounts of communication bandwidth and system resources. Abstraction of hardware resources creates unnecessary indirection Traditional OSes hide the underlying hardware resources from the programmer. Thus, a program would see a 4-core environment the same way as a 64-core environment. Additionally, it would not react differently if there were interfering external programs or not. While such hardware abstraction enables portability and ease of programming, it limits optimal use of the underlying resources. For example, if a program knew that it was operating on a 32 core environment, it would be more efficient to spawn 32 threads such that one runs non- preemptively on each core rather than to spawn more threads and context switch between them or fewer threads and fail to exploit the available resources. For More Information: Ray Kung rkung@etinternational.com 100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438 2
  • 3. ETI’s Solution To resolve these shortcomings, ETI has developed an innovative parallel system software solution addressing each traditional OS limitation. ETI’s solution relieves the operating system of micro- management, and has adapted a fine-grained decentralized software model to unleash the power of many-core parallel computing. The motivation behind this decentralized system software model is to provide an on-chip control unit that is adaptable and customizable to the local terrain or cores. This is more efficient and flexible than a conventional centralized control unit. In our implementation, ETI’s system software relieves the operating system of core-to- core coordination and local resource management. Instead, ETI’s FIRST (Flexible Integrated Runtime System Technology), an execution environment with an embedded micro-OS kernel, is tasked to manage parallel operations on the chip. FIRST provides a runtime execution environment that features: 1) Fine-grained task scheduling among cores, 2) Effective coordination of communication and synchronization among parallel tasks, and 3) Dynamic load balancing across the entire system. In addition to creating a co-processing layered operating system structure, another distinguishing feature of ETI’s parallel platform is that it creates a fine-grained dataflow-like hierarchical thread platform and is a clear breakthrough from traditional coarse-grained multithreading. Fine-grained parallel system The improvement comes in the form of prioritizing the execution of threads based on their “readiness” to be executed. Instead of using sequential program counters as in a traditional OS, ETI’s fine-grained parallel software system executes parallel tasks depending on the “readiness” of the input arguments and available resources. As an example: In a sequential counter-based system, similar to cooking a dish, the recipe is For More Information: Ray Kung rkung@etinternational.com 100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438 3
  • 4. broken down into a series of sequential steps such as 1) measuring the ingredient amounts, 2) mixing ingredient A with ingredient B, and 3) pouring white wine to the mixture, etc, In a dataflow execution environment, sequential steps/counters are irrelevant. Instead, inputs are crucial. This is similar to a restaurant’s kitchen with a team of chefs and helpers. Each person works at his or her workstation, and when ingredients enter a workspace, the person stationed begins mixing or chopping these ingredients until done. The output then passes to another person as input. Thus, each person is performing a minute operation independent of the sequential procedure and dependent only on sufficient inputs. This whole process is analogous to a dataflow environment, wherein each atomic operation is a fine-grained task. By continuously sub-dividing tasks to finer-grained tasks, and assigning them across many cores as non-preemptible atomic operations, ETI’s FIRST technology has provided a foundation for resolving the scalability issue facing traditional operating systems. The granularity of the tasks is dictated by the hardware environment in such a way to minimize run time system overhead (i.e. creating tasks, determining the next non-preemptive task to fire, etc) while fully exploiting the parallel concurrency of the application. ETI Software vs. Traditional Software For More Information: Ray Kung rkung@etinternational.com 100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438 4
  • 5. Overview of ETI Software Platform1 ETI understands both the benefits and the bottlenecks of the traditional OS, and our product is tailor- made for many-core architectures. Two decades of research have led us to develop a dataflow-like software platform for many-core computing. In developing the IBM Cyclops-64 (C64) supercomputer, ETI has been the sole software vendor to design and implement a software infrastructure to unleash the full power of the many-core C64 system. Our latest version of the system software is running on 8000+ cores, and over 50 C64 blades2. Our software is designed to work on over 1 million cores. This highly scalable system software includes the following design infrastructure: • FIRST – Flexible Integrated Runtime System Technology. Delivers a parallel runtime execution model that manages hardware resources efficiently, and exposes them to user applications; • Control and monitoring software – Boots the system, ensures safe operating conditions and detects hardware anomalies; • Resource manager – Schedules jobs across the entire system under a multi-user environment; • Software development toolkit (SDK) – Provides a Linux-based environment with which to compile, debug and simulate the many-core chip platforms.   1 For more technical information, please see ETI OS technical whitepaper 2 A blade consists of a C64 chip, a memory unit, an FPGA, and Ethernet. See C64 hardware whitepaper For More Information: Ray Kung rkung@etinternational.com 100 White Clay Center Drive, #103 – Newark DE 19711, 302-738-1438 5