This thesis talk studies resource management for on-demand computing services through a shared cluster. In such a context, the aim was to propose tools to enable allocating resources automatically for executing on-demand user requests, to enable sharing resources proportionally among those services, while maximizing their use. Funded by the Minalogic global business cluster through the Ciloe Project (http://ciloe.minalogic.net), this work targets on organizations such as SMB, which are not able to support the charge of purchasing and maintaining a dedicated computing infrastructure. Firstly, we have achieved a deep survey in the areas of on-demand computing and high performance computing. From this survey, we have defined a virtualized architecture to enable dynamic execution of user requests thanks to a special resource manager. Finally, we have proposed policies and algorithms which are so flexible to offer a suitable tradeoff between equity and resource use. Having worked in a context of industrial collaboration, we have developed a prototype of our proposal as a proof of concept. Based on open standards, this prototype relies on existing virtualization tools such as OpenNebula for allocating and manipulating virtual machines over the cluster's nodes. From this prototype along with various workloads, we have carried out experiments to evaluate our architecture and scheduling algorithms. Results have shown that our contributions allow to achieve the expected goals while being reliable and efficient.
Confused by cloud? Logicalis at how and why to move to an enterprise cloud platform:
What type of Cloud do I need?
Cloud value elements
What does Cloud mean to you?
Software Architecture for Cloud InfrastructureTapio Rautonen
Distributed systems are hard to build. Software architecture must be carefully crafted to suit cloud infrastructure.
Design for failure. Learn from failure. Adopt new cloud compatible design patterns and follow the guidelines during the journey of building cloud native applications.
Presentation delivered in Chicago as part of RACO Chicago, August 24, 2010. Also on NARA website at http://archives.gov/records-mgmt/presentations/ravanbakhsh.ppt
Confused by cloud? Logicalis at how and why to move to an enterprise cloud platform:
What type of Cloud do I need?
Cloud value elements
What does Cloud mean to you?
Software Architecture for Cloud InfrastructureTapio Rautonen
Distributed systems are hard to build. Software architecture must be carefully crafted to suit cloud infrastructure.
Design for failure. Learn from failure. Adopt new cloud compatible design patterns and follow the guidelines during the journey of building cloud native applications.
Presentation delivered in Chicago as part of RACO Chicago, August 24, 2010. Also on NARA website at http://archives.gov/records-mgmt/presentations/ravanbakhsh.ppt
Guide to Application Performance: Planning to Continued OptimizationMuleSoft
Supporting everything from mobile apps with thousands of concurrent users to global deployments processing millions of requests daily, Anypoint Platform has been put to test. In this session, MuleSoft experts will talk through case studies from our most demanding deployments and provide a best practice approach to designing and tuning applications for optimal performance.
The True State of the Oracle Public Cloud - Dutch Oracle Architects Platform ...Lucas Jellema
Why does Oracle want to be a cloud provider? And how is it going about it? What are the challenges facing Oracle along that path? Where does it currently stand with its Public Cloud service offerings - and what lies around the corner? How can organizations get started with the Oracle Public Cloud? What architectural considerations come into play?
Creating an Operating Model to enable a high frequency organizationTom Laszewski
Establishing an appropriate cloud operating model is critical to forming your organization’s successful adoption of cloud, and delivering greater business agility, increasing the cloud migration Return on Investment, and deliver a more secure, performant, reliable, and cost effective cloud computing environment. The impact of the cloud will be felt across your entire organization, including processes and people - not just Information technology. It will significantly affect, and be affected by, your organizational culture and Information technology delivery structures. This session will provide prescriptive guidance regarding the best approaches to evolving an operating model from projects to products, manual, process intensive governance to a ‘trust but verify’ model, long development cycles to continuous integration and deployment, silos between business and IT into a collaborative organizational structure, self-service processes, and continuous improvement. The recommendations in the presentation are based upon lesson learned, best practices, and anti-patterns from thousands of customer’s cloud transformation journeys.
Creating a Centralized Consumer Profile Management Service with WebSphere Dat...Prolifics
In this presentation will talk about how one of the world's leading Financial Institutions, leveraged WebSphere DataPower to provide a set of centralized consumer profile management services. This central service would be leveraged by internal and external applications, and would align with enterprise marketing capabilities. The solution included a complex security model which included the following products: Tivoli Directory Server, Tivoli Access Manager and Tivoli Federated Identity Manager. We will describe how to build complex orchestrations in WebSphere DataPower, and also go through some of the performance tuning options we implemented to achieve a high degree of efficiency.
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your MindAvere Systems
While cloud computing offers virtually unlimited capacity, harnessing that capacity in an efficient, cost effective fashion can be cumbersome and difficult at the workload level. At the organizational level, it can quickly become chaos.
You must make choices around cloud deployment, and these choices could have a long-lasting impact on your organization. It is important to understand your options and avoid incomplete, complicated, locked-in scenarios. Data management and placement challenges make having the ability to automate workflows and processes across multiple clouds a requirement.
In this webinar, you will:
• Learn how to leverage cloud services as part of an overall computation approach
• Understand data management in a cloud-based world
• Hear what options you have to orchestrate HPC in the cloud
• Learn how cloud orchestration works to automate and align computing with specific goals and objectives
• See an example of an orchestrated HPC workload using on-premises data
From computational research to financial back testing, and research simulations to IoT processing frameworks, decisions made now will not only impact future manageability, but also your sanity.
Interop ITX: Moving applications: From Legacy to Cloud-to-CloudSusan Wu
Cloud computing provides an array of hosting and service options to fit your overall company strategy. Sometimes a public cloud is your best option and other times your data requirements demand a private cloud. As needs converge, a hybrid solution continues to gain popularity. Developers must consider if their applications might be run on either or both.
Hear about Midokura.com's journey going from the colos to cloud servers to AWS.
The cloud is a metaphor for the internet. It signifies a shift from traditional, on-site data management to a network of powerful servers accessible from anywhere. This offers flexibility, scalability, and often cost savings compared to owning physical hardware.
AWS Summit 2013 | Singapore - Service Orchestration – Managing the Cloud Disr...Amazon Web Services
Enterprises are confronted by simultaneous and extraordinary change from two sides. On the one hand their business models are changing, with accelerating digitalisation, emerging markets, increasing competition, and the need for rapid innovation to face the competition. On the other hand, new business infrastructure services based on the cloud are emerging every day, as well as changing in functionality, features and price. This sprawl and flux of services have to be harnessed to serve the shifting business priorities, whilst keeping the current business and existing investments properly operational. The way to manage this chaos is through a robust and comprehensive Service Orchestration platform.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
More Related Content
Similar to Execution Environment for On-Demand Computing Services Based on Shared Clusters
Guide to Application Performance: Planning to Continued OptimizationMuleSoft
Supporting everything from mobile apps with thousands of concurrent users to global deployments processing millions of requests daily, Anypoint Platform has been put to test. In this session, MuleSoft experts will talk through case studies from our most demanding deployments and provide a best practice approach to designing and tuning applications for optimal performance.
The True State of the Oracle Public Cloud - Dutch Oracle Architects Platform ...Lucas Jellema
Why does Oracle want to be a cloud provider? And how is it going about it? What are the challenges facing Oracle along that path? Where does it currently stand with its Public Cloud service offerings - and what lies around the corner? How can organizations get started with the Oracle Public Cloud? What architectural considerations come into play?
Creating an Operating Model to enable a high frequency organizationTom Laszewski
Establishing an appropriate cloud operating model is critical to forming your organization’s successful adoption of cloud, and delivering greater business agility, increasing the cloud migration Return on Investment, and deliver a more secure, performant, reliable, and cost effective cloud computing environment. The impact of the cloud will be felt across your entire organization, including processes and people - not just Information technology. It will significantly affect, and be affected by, your organizational culture and Information technology delivery structures. This session will provide prescriptive guidance regarding the best approaches to evolving an operating model from projects to products, manual, process intensive governance to a ‘trust but verify’ model, long development cycles to continuous integration and deployment, silos between business and IT into a collaborative organizational structure, self-service processes, and continuous improvement. The recommendations in the presentation are based upon lesson learned, best practices, and anti-patterns from thousands of customer’s cloud transformation journeys.
Creating a Centralized Consumer Profile Management Service with WebSphere Dat...Prolifics
In this presentation will talk about how one of the world's leading Financial Institutions, leveraged WebSphere DataPower to provide a set of centralized consumer profile management services. This central service would be leveraged by internal and external applications, and would align with enterprise marketing capabilities. The solution included a complex security model which included the following products: Tivoli Directory Server, Tivoli Access Manager and Tivoli Federated Identity Manager. We will describe how to build complex orchestrations in WebSphere DataPower, and also go through some of the performance tuning options we implemented to achieve a high degree of efficiency.
Deliver Best-in-Class HPC Cloud Solutions Without Losing Your MindAvere Systems
While cloud computing offers virtually unlimited capacity, harnessing that capacity in an efficient, cost effective fashion can be cumbersome and difficult at the workload level. At the organizational level, it can quickly become chaos.
You must make choices around cloud deployment, and these choices could have a long-lasting impact on your organization. It is important to understand your options and avoid incomplete, complicated, locked-in scenarios. Data management and placement challenges make having the ability to automate workflows and processes across multiple clouds a requirement.
In this webinar, you will:
• Learn how to leverage cloud services as part of an overall computation approach
• Understand data management in a cloud-based world
• Hear what options you have to orchestrate HPC in the cloud
• Learn how cloud orchestration works to automate and align computing with specific goals and objectives
• See an example of an orchestrated HPC workload using on-premises data
From computational research to financial back testing, and research simulations to IoT processing frameworks, decisions made now will not only impact future manageability, but also your sanity.
Interop ITX: Moving applications: From Legacy to Cloud-to-CloudSusan Wu
Cloud computing provides an array of hosting and service options to fit your overall company strategy. Sometimes a public cloud is your best option and other times your data requirements demand a private cloud. As needs converge, a hybrid solution continues to gain popularity. Developers must consider if their applications might be run on either or both.
Hear about Midokura.com's journey going from the colos to cloud servers to AWS.
The cloud is a metaphor for the internet. It signifies a shift from traditional, on-site data management to a network of powerful servers accessible from anywhere. This offers flexibility, scalability, and often cost savings compared to owning physical hardware.
AWS Summit 2013 | Singapore - Service Orchestration – Managing the Cloud Disr...Amazon Web Services
Enterprises are confronted by simultaneous and extraordinary change from two sides. On the one hand their business models are changing, with accelerating digitalisation, emerging markets, increasing competition, and the need for rapid innovation to face the competition. On the other hand, new business infrastructure services based on the cloud are emerging every day, as well as changing in functionality, features and price. This sprawl and flux of services have to be harnessed to serve the shifting business priorities, whilst keeping the current business and existing investments properly operational. The way to manage this chaos is through a robust and comprehensive Service Orchestration platform.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Mind map of terminologies used in context of Generative AI
Execution Environment for On-Demand Computing Services Based on Shared Clusters
1. 1/40
Execution Environment for On-
Demand Computing Services Based
on Shared Clusters
PhD thesis, Grenoble University
By Rodrigue Chakode
(LIG/INRIA, Equipe Mescal)
Advisors: - Jean-François Méhaut
- Maurice Tchuenté
2. 2/40
Cloud Computing in a Nutshell
◉ Enables computing features as services
◉ Free or commercial services accessible over network
◉ On-demand and elastic accesses, plus a utility billing
– Customers (users of the service) only pay for what they use,
aka pay-as-you-go
– Requests for more or less features should be satisfied quickly
◉ Services setup transparently against customers
– They don't have to care about how the service is enabled
3. 3/40
Context Statement on Cloud Computing
◉Various sorts of cloud services
– Infrastructure-as-a-Service, Platform-as-a-Service, Software-
as-a-Service, Data-as-a-Service, Translation-as-a-Service...
– Almost everything could be a service (XaaS)
◉Requires to set up a suitable computing
infrastructure
– Servers, storage, network fabrics, cooling system...
◉May need significant investments
– Out of reach for many small or medium businesses (SMBs)
– Market currently dominated by biggest organizations
Introduction
4. 4/40
Challenges for HPC
◉ Numerous software require
intensive computing capabilities
– E.g. EDA Applications (Ciloe Project)
– Integrated circuits need to be simulated
before manufacturing
◉ Computing architectures are
increasingly parallel
– SMP, NUMA, GPU, Cluster... and
soon many-core architectures
◉ HPC applications run on clusters
of multicore nodes (SMP/NUMA)
◉ Also expensive
Example of a cluster. Credit : CEA
Introduction
5. 5/40
Bring HPC Services into Clouds
◉Services requiring intensive computations
◉Services enabled from a mutualized cluster
– Cluster supported by several businesses
– Each business providing its own service
– Cluster's resources shared among the services
◉Study with the context of an industrial
collaboration
– The Ciloe Project [http://ciloe.minalogic.net]
– Three SBEs editing EDA applications involved
Introduction
6. 6/40
Outline
◉ Introduction
◉ Problem statement
◉ Background
– Existing SaaS clouds and their related RM issues
– Survey on existing resource sharing techniques
◉ Contributions
– Overview : Scheduling Approach and Execution Model
– Architecture Model and Scheduling Strategy
– Prototyping
◉ Experimental evaluation
– Evaluation Protocol
– Results
◉ Conclusion & perspectives
7. 7/40
Resource Management for HPC SaaS Services
◉What is a service
–Computes customer data
with a specific application
–Input specifies an
application and the data
–Output retrieved after the
computation
–No more interactions
necessary
Problem Statement
8. 8/40
Related Research Issues
◉Data Management
◉Resilience and Fault Tolerance
◉Security and privacy
◉Resource Management
Problem Statement
9. 9/40
Scheduling Problems
◉Share the cluster's resources among the services
– according to the investments of the different businesses
◉Maximize the use of resources
– Use idle resources to run pending requests
– Run miscellaneous tasks on idle resources in a best-effort
way
◉Minimize the impact of selfish behaviors
– A business can under-invest while needing a lot of resources
Problem Statement
10. 10/40
Resource Allocation for On-demand Services
◉ Running requests in a dynamic way
– Resources should be allocated dynamically
– Allocated resources should be freed up automatically once
a request completed
– Handle Input/Output data in a transparent way
◉ Need to think of resource partitioning
– Modern computing nodes have several cores
– The amount of cores required by certain tasks can be less
than the number of cores available on a node
Problem Statement
11. 11/40
Outline
◉ Introduction
◉ Problem statement
◉ Background
– Existing SaaS clouds and their related RM issues
– Survey on existing resource sharing techniques
◉ Contributions
– Overview : Scheduling Approach and Execution Model
– Architecture Model and Scheduling Strategy
– Prototyping
◉ Experimental evaluation
– Evaluation Protocol
– Results
◉ Conclusion & perspectives
12. 12/40
Background on Existing SaaS Clouds
◉ Target office and collaborative
applications
– E.g. Google Docs, Salesforce,
Office365...
– Need of interactiveness
◉ SaaS cloud as a layer on top of a
PaaS
– PaaS can rely on an IaaS layer
– IaaS enables on-demand resource
allocation
• Virtualization plays an important role
◉ Resources belong to an unique
organization
Background on SaaS Clouds
13. 13/40
Services for Intensive Computations
◉ No need of interactiveness
◉ Requires a high dynamicity and
transparency
• Allocation of resources when
executing a task
• Release of resources once a task
completed
◉ Mutualized resources
=>Need to deal with sharing the
resources among the services
Background on SaaS Clouds
14. 14/40
Scheduling services on mutualized resources
◉ Raises conflicting objectives
– Fairness against the service suppliers
– Efficiency concerning the use of resources
◉ Prioritize an objective penalizes the other
=> Requires to make a tradeoff
Background on resource management
15. 15/40
Common resource scheduling strategies
◉ First-come, First-served (FCFS)
◉ FCFS along with Backfilling (EASY/Conservative)
+ Fair against users
– Inefficient in terms of utilization
– May be unfair against some
businesses in out context
+ Improve utilization
– May significantly delay biggest
tasks
+ Possible optimization with a
conservative backfilling
– Remains unfair in our context
Background on resource management
16. 16/40Background on resource management
How Resources are Assigned to Tasks
◉ Simple assignation strategies
– Greedy and round-robin algorithms
◉ Assignations guided by performance requirements
– Notion of match-making (affinities between resources and tasks)
◉ Prioritization
– More prioritized tasks get access to resources first
• Preemption can be introduced
=> Notion of best-effort when certain tasks only run on idle
resources
◉ Reservation and leasing
– Resources are allocated for a given time slot
17. 17/40Background on resource management
Common resource sharing strategies
◉ Static sharing (partitioning)
◉ Fair-sharing (no partitioning + dynamic priorities)
+ Fair and easy to setup
– Inefficient in terms of
utilization in our context
+ Tradeoff between the fairness and
the utilization
– May still raise unfair situations in
our context
R1
R2
R3
R4
R5
R6
R7
R1
R2
R3
R4
R5
R6
R7
Business 1
Business 2
Business 3
18. 18/40
Partitioning Individual Node
◉ Requires isolation among tasks
– A task would not access resources allocated to another task
◉ Isolation with containers (cgroups, cpusets, OpenVZ, LXC...)
+ Low level partitioning inducing a low overhead
=> good performances
– Non-flexible since not easy to handle dynamically
◉ Isolation with virtual machines (VMs)
+ High level partitioning
=> High flexibility in terms of automation
– Possible performance overhead
―Several optimizations (e.g. HVM, paravirtualization, PCI passthrough...)
Background on resource management
19. 19/40
Synthesis on Partitioning Resources
◉ Virtual Machines enable interesting features
– To partition each individual node along with a high isolation
– To allocate and free up resources dynamically
– To suspend/restart best-effort tasks
◉ Powerful and proved VM management tools
– Handle VMs on individual node
– Xen, KVM, ESXi, Hyper-V...
– Handle VMs on distributed environments
• OpenNebula, Eucalyptus, OpenStack...
―Target IaaS clouds
20. 20/40
Problems to Address With VMs
◉ Deal with performance overhead
– Generic optimizations
• HVM, PCI Passthrough
– Solution-specific optimizations
• Paravirtualization (Xen, Hyper-V)
• Virtio (KVM, Xen)
◉ Allocate custom VMs dynamically on distributed
environments
– Contextualization enables interesting features (OpenNebula)
21. 21/40
Lacks of the Existing According to Our Aims
◉ On-demand HPC services on a mutualized cluster
– Existing SaaS clouds focus on collaborative or office applications
• Resources owned by a single organization
◉ Existing resources sharing strategies don't suit our needs
=> Necessity to design new approaches
◉ Contributions
– Scheduling strategy for sharing mutualized resources
– Architecture for on-demand HPC services
– Prototyping for evaluation
Background on resource management
22. 22/40
Outline
◉ Introduction
◉ Problem statement
◉ Background
– Existing SaaS clouds and their related RM issues
– Survey on existing resource sharing techniques
◉ Contributions
– Overview : Scheduling Approach and Execution Model
– Architecture Model and Scheduling Strategy
– Prototyping
◉ Experimental evaluation
– Evaluation Protocol
– Results
◉ Conclusion & perspectives
23. 23/40
Ideas for the resource sharing strategy
◉ Combines the advantages...
– of a static sharing where the fairness is easy to hold
– and those of a fair-sharing strategy that allows to improve the
utilization
◉ Enables a elasticity in resource sharing
– A business to use more resources than its investment :
• When the task raising such a situation has a duration less than
a acceptable duration threshold noted D
• Or When the task is of best-effort type
=> Limits the impact of selfish behaviors from certain
businesses
Contributions : Overview
24. 24/40
Handling Requests Dynamically
◉ Encapsulate each task within a virtual machine (VM)
– Eases the partitioning of nodes and enables dynamicity
◉ Enable a Specific SaaS Manager
– Implements the scheduling strategy to address the resource
sharing issues
– Assumes the allocation and the destruction of VMs
◉ Exploit the Contextualization of VMs
– VM created, customized and started dynamically
• VM suitably set to launch the task once started
– VM automatically destroyed once the task is completed
25. 25/40
Architecture Model
◉ The SaaS Manager on top
of the cluster
– Relies on a virtual
infrastructure manager (VIM)
– VIM relies on hypervisors
◉ Possibility of reusing
existing tools
– Avoids rewriting existing
features
– Benefits of features from
powerful proved tools
Contributions : Architecture Model
26. 26/40
Design Driven by Openness, Performances and
Interoperability
◉ OpenNebula enables support
for handling the VMs
– Featuring the
contextualization
◉ Xen manages VMs on each
individual node
– Exploits the
paravirtualization for better
performances
◉ The different components
coupled though Open APIs
– Ensure a better interopera-
bility
Contributions : Architecture Model
27. 27/40
Resource Sharing Strategy : Case study
◉ A situation with three
businesses B1, B2 and B3
– B1 (with green tasks) invested
for 2/7 of resources (R1,
R2...R7)
– B2 (with red tasks) invested for
2/7
– B3 (with blue tasks) for 3/7
◉ On the figure, think of tasks
as the related VMs
Contributions : Resource Management Strategy
t2
t3 t5
t6
t1 t4
Queued tasks
28. 28/40
Resource Sharing Strategy : Example 1
◉ Assumes the duration of t1
and t5 <= D (the chosen
duration threshold)
– B1 and B3 are using ratios of
resources geater than their
investments
– That representing a
complementary ratio of 1/14 for
each of them
Contributions : Resource Management Strategy
Queued tasks
t5
t1
t2
t3
t6
t4
29. 29/40
Resource sharing strategy : Example 2
◉ None of tasks has a
duration <= D, but the task
t2 is of best-effort type
– B1 is using a ratio of resources
1/7 greater than its investment
– t2 can be suspended at any
time
Contributions : Resource Management Strategy
t4t1
Queued tasks
t3
t2
t5
t6
30. 30/40
About Implementation
◉ Relies on principles of resource leasing
– A lease consists in allocating a virtual machine for running a task
– The duration of a lease depends on the related task
• Its duration and its of the type (best-effort or not)
◉ Two kinds of leases handled specifically
– Non-preemptive leases
• Assigned to tasks related to the customers
―Non preemptive tasks
=> Resources only freed up at completion
– Preemptive leases
• Assigned to best-effort tasks
―VMs can be suspended to be restart later
=> No guaranty of completion
Contributions : Resource Management Strategy
31. 31/40
Prototyping and Overview on Integration
◉ SVMSched (Smart Virtual
Machine Scheduler)
– Drop-in replacement for the
OpenNebula's default scheduler
– Proper interfaces that provide the
SaaS abstraction
– Deals with allocating and freeing
up VMs dynamically
– Implements the resource sharing
strategy
– Supports contextualization data
stored on Network File Systems
Contributions : Prototyping
32. 32/40
Outline
◉ Introduction
◉ Problem statement
◉ Background
– Existing SaaS clouds and their related RM issues
– Survey on existing resource sharing techniques
◉ Contributions
– Overview : Scheduling Approach and Execution Model
– Architecture Model and Scheduling Strategy
– Prototyping
◉ Experimental evaluation
– Evaluation Protocol
– Results
◉ Conclusion & perspectives
33. 33/40
Evaluation Protocol
◉ Evaluation of the performances of an application
– Time to setup the VM
– Performance overhead induced by the virtualization
◉ Study of the scheduling strategy
– Is that behaves well regarding the fairness and the utilization ?
– If not, how it can be improved?
◉ Experimental conditions
– Nodes from Grid'5000 : each having 2x4 cores, 2.27 Ghz, 8Go of RAM
– Xen 3.4.2 and OpenNebula 1.4.2 along with VM images of 500MB
– Applications from the Parsec Benchmark (BodyTrack, Blackscholes,
Freqmine)
Evaluation
34. 34/40Evaluation
Performances of the virtualization
◉ Full VMs perform better than contextualized
ones => slight difference
◉ High overhead : applications requiring high
disk IO
◉ VMs perform better than native machines
=>concurrent tasks requiring high memory IO
◉ Contextualized VMs : require
constant and low setup time
– ~15s (<5% of the duration of a task
of 5 mins) with an image of 500 MB
◉ Full VMs : times grow linearly
35. 35/40Evaluation
Analyzing the scheduling strategy
◉ Better choice of the threshold
– Businesses can benefit from the mutualization
– Prevents the temptation for selfish behaviors
– Best-effort tasks would allows better utilization
◉ Mutualization is not relevant
– The threshold is not suitably chosen
– There is no best-effort tasks
– The strategy leads to a static sharing
36. 36/40
Outline
◉ Introduction
◉ Problem statement
◉ Background
– Existing SaaS clouds and their related RM issues
– Survey on existing resource sharing techniques
◉ Contributions
– Overview : Scheduling Approach and Execution Model
– Architecture Model and Scheduling Strategy
– Prototyping
◉ Experimental evaluation
– Evaluation Protocol
– Results
◉ Conclusion & perspectives
37. 37/40
Conclusion
◉ We studied and set up an environment for enabling HPC
SaaS services on shared computing resources
– Designing an architecture model that relies on virtualization for
executing on-demand requests
– Design resource management algorithms that allow to share in a fair
way the resources while maximizing their use
◉ A prototype has been developed to evaluate experimentally
our contributions
– Results shown the feasibility of our approach
– Prototype integrated in the deliveries of the Ciloe Project
◉ Thus we have enabled a room for addressing the problem
of costs that highly constraints SMBs needing HPC
resources for their applications
Conclusion & Perspectives
38. 38/40
Perspectives
◉ Model of predicting the duration of each task
– Envisioning an approximation model based on reinforcing
learning
◉ Economic model of billing
– What parameters the invoicing can take into account?
• Per-use costs of software licenses and computing resources +
earnings
◉ Dimensioning the platform
– To allow each business to have a suitable view of its needs in
terms of resources
Conclusion & Perspectives
39. 39/40
About this Work
◉ Awards
– 1st Prize Grid'5000 Challenge, Reims 2011
◉ Book Chapter
– Rodrigue chakode, Jean-François Méhaut, Blaise-Omer Yenke. Scheduling On-demand SaaS Services on a
Shared Virtual Cluster. In Cloud Computing and Services Science. Pages 259 – 276. ISBN 978-1-4614-2325-6,
Springer-Verlag, April 2012.
◉ International conferences
– Rodrigue chakode, Blaise-Omer Yenke, Jean-François Méhaut. Resource Management of Virtual Infrastructure
for On-demand SaaS Services. In CLOSER2011 - International conference on Cloud Computing and Service
Science. Pages 352 – 361. Netherlands, May 2011.
– Rodrigue Chakode, Jean-François Méhaut, François Charlet. High Performance Computing on Demand:
Sharing and Mutualizing Clusters. In AINA'10 - IEEE International Conference on Avanced Information
Networking and Applications. Pages 126 – 133. Australia, April 2010.
◉ National conferences
– Rodrigue chakode, Blaise-Omer Yenke. Utilisation des machines virtuelles comme support de services de
calcul à la demande. In Renpar'20: les actes des Rencontres francophones du Parallélisme, édition 2011.
Saint-Malo, France, Mai 2011.
◉ Other publications (in the cloud community)
– Rodrigue chakode. SVMSched : A tool to enable On-demand SaaS and PaaS Services on top of OpenNebula.
In OpenNebula Official Blog, http://blog.opennebula.org/?p=1646.
– Link on the OpenNebula Software Ecosystem : http://opennebula.org/software:ecosystem:svmsched