This paper details the state of the art, the design, development and deployment of the EGI Federated Cloud platform, an e-infrastructure offering scalable and flexible models of utilization to the European research community. While continuing support for the traditional High Throughput Computing model, the EGI Cloud Platform extends its reach to other models of utilization such as long-lived services and on demand computation. Following a two-year period of development, the EGI Federated Cloud platform was officially launched in May 2014 offering resources provided by trusted academic and research organisations from within the user communities and consistently with their standard funding regime. Since then, the use cases supported have significantly increased both in total number and diversity of model of service required, validating both the choice of enforcing cloud technology agnosticism and of supporting service mobility and portability by means of open standards. These design choices have also allowed for the inclusion of commercial cloud providers into an infrastructure previously supported only by academic institutions. This contributes to a wider goal of funding agencies to create economic and social impact from supported research activities.
The past decade has seen increasingly ambitious and successful methods for outsourcing computing. Approaches such as utility computing, on-demand computing, grid computing, software as a service, and cloud computing all seek to free computer applications from the limiting confines of a single computer. Software that thus runs "outside the box" can be more powerful (think Google, TeraGrid), dynamic (think Animoto, caBIG), and collaborative (think FaceBook, myExperiment). It can also be cheaper, due to economies of scale in hardware and software. The combination of new functionality and new economics inspires new applications, reduces barriers to entry for application providers, and in general disrupts the computing ecosystem. I discuss the new applications that outside-the-box computing enables, in both business and science, and the hardware and software architectures that make these new applications possible.
Towards a Cloud Native Big Data Platform using MiCADOAbdelkhalik Mosa
Demonstrates the automatic deployment and scalability of Hadoop-based big data platforms using MiCADO. Challenges in existing Hadoop deployments. Hadoop and cloud native computing. MiCADO overview. Containerizing big data tools. Creating big data ADT for MiCADO.
Cloud2Sim - An Elastic Middleware Platform for Concurrent and Distributed Cloud and MapReduce Simulations.
This is a presentation that partially describes the work-in-progress CloudSim, as presented at MASCOTS 2014 in Paris.
The past decade has seen increasingly ambitious and successful methods for outsourcing computing. Approaches such as utility computing, on-demand computing, grid computing, software as a service, and cloud computing all seek to free computer applications from the limiting confines of a single computer. Software that thus runs "outside the box" can be more powerful (think Google, TeraGrid), dynamic (think Animoto, caBIG), and collaborative (think FaceBook, myExperiment). It can also be cheaper, due to economies of scale in hardware and software. The combination of new functionality and new economics inspires new applications, reduces barriers to entry for application providers, and in general disrupts the computing ecosystem. I discuss the new applications that outside-the-box computing enables, in both business and science, and the hardware and software architectures that make these new applications possible.
Towards a Cloud Native Big Data Platform using MiCADOAbdelkhalik Mosa
Demonstrates the automatic deployment and scalability of Hadoop-based big data platforms using MiCADO. Challenges in existing Hadoop deployments. Hadoop and cloud native computing. MiCADO overview. Containerizing big data tools. Creating big data ADT for MiCADO.
Cloud2Sim - An Elastic Middleware Platform for Concurrent and Distributed Cloud and MapReduce Simulations.
This is a presentation that partially describes the work-in-progress CloudSim, as presented at MASCOTS 2014 in Paris.
Federated Cloud Computing - The OpenNebula Experience v1.0sIgnacio M. Llorente
The talk mostly focuses on private cloud computing to support Science and High Performance Computing environments, the different architectures to federate cloud infrastructures, the existing challenges for cloud interoperability, and the OpenNebula's vision for the future of existing Grid infrastructures.
A brief discussion about Cloud computing for a beginner, you can get a clear idea about cloud computing from this slides.Also, discuss cloudsim simulator.
Creating a Widely Distributed Production Federated Cloud System to Support Mu...David Wallom
The EGI federated cloud, which has been in development for the past 3 years has now been in production for over a year. Building on the tried and trusted EGI core services we added federated IaaS compute and storage services, utilising open standards to support 10 pilot communities at launch. Since then the number of supported communities has grown over seven fold. We will discuss the development of the model of federation, the different application design models of user communities we support and why cloud will be a success when compared with grid due to this inherent flexibility.
Federated Cloud Computing - The OpenNebula Experience v1.0sIgnacio M. Llorente
The talk mostly focuses on private cloud computing to support Science and High Performance Computing environments, the different architectures to federate cloud infrastructures, the existing challenges for cloud interoperability, and the OpenNebula's vision for the future of existing Grid infrastructures.
A brief discussion about Cloud computing for a beginner, you can get a clear idea about cloud computing from this slides.Also, discuss cloudsim simulator.
Creating a Widely Distributed Production Federated Cloud System to Support Mu...David Wallom
The EGI federated cloud, which has been in development for the past 3 years has now been in production for over a year. Building on the tried and trusted EGI core services we added federated IaaS compute and storage services, utilising open standards to support 10 pilot communities at launch. Since then the number of supported communities has grown over seven fold. We will discuss the development of the model of federation, the different application design models of user communities we support and why cloud will be a success when compared with grid due to this inherent flexibility.
Emerging Trends in Hybrid-Cloud & Multi-Cloud StrategiesChaitanya Atreya
As Cloud Computing rapidly evolves, newer deployment strategies such as Hybrid-Cloud, Multi-Cloud and On-Prem Cloud are emerging. More and more enterprise solution providers are offering support for a combination of these deployment targets. It is imperative that the larger organizations have a clear Hybrid-Cloud and Multi-Cloud strategy to avoid cloud lock-in and to de-risk business decisions.
What do each of these terminologies mean? What is the scope of each and overlap if any? We will discuss the emerging best-practices across these interdisciplinary trends, especially in the context of Modern Data and Analytics Platforms and Enterprise Self-Service.
OSDC 2012 | OpenNebula Open Source Toolkit for DataCenter Virtualization by C...NETWAYS
OpenNebula is the open-source industry standard for data center virtualization. The presentation will describe its innovative features for the comprehensive, complete management of virtualized data centers to enable on-premise IaaS clouds in existing infrastructure. OpenNebula interoperability makes cloud an evolution by leveraging existing IT assets, protecting your investments, and avoiding vendor lock-in.
Using a Widely Distributed Federated Cloud System to Support Multiple Dispara...inside-BigData.com
In this deck from the 2014 ISC Cloud Conference, David Wallom from the University of Oxford presents:
Using a Widely Distributed Federated Cloud System to Support Multiple Disparate User Communities.
"The EGI federated cloud, which has been in development for the past 3 years has now entered production. Building on the tried and trusted EGI core services we have added federated IaS compute and storage services, utilising open standards to support more than 10 pilot communities. We will discuss the model of federation, and the different application design models that the users use and why cloud will be a success when compared with grid due to this inherent flexibility."
Learn more: http://www.isc-events.com/cloud14/schedule.html
Watch the video presentation: http://wp.me/p3RLHQ-daY
Privacy preserving public auditing for secured cloud storagedbpublications
As the cloud computing technology develops during the last decade, outsourcing data to cloud service for storage becomes an attractive trend, which benefits in sparing efforts on heavy data maintenance and management. Nevertheless, since the outsourced cloud storage is not fully trustworthy, it raises security concerns on how to realize data deduplication in cloud while achieving integrity auditing. In this work, we study the problem of integrity auditing and secure deduplication on cloud data. Specifically, aiming at achieving both data integrity and deduplication in cloud, we propose two secure systems, namely SecCloud and SecCloud+. SecCloud introduces an auditing entity with a maintenance of a MapReduce cloud, which helps clients generate data tags before uploading as well as audit the integrity of data having been stored in cloud. Compared with previous work, the computation by user in SecCloud is greatly reduced during the file uploading and auditing phases. SecCloud+ is designed motivated by the fact that customers always want to encrypt their data before uploading, and enables integrity auditing and secure deduplication on encrypted data.
Cloud Standards in the Real World: Cloud Standards Testing for DevelopersAlan Sill
Learn about standards studied in the US National Science Foundation Cloud and Autonomic Computing Industry/University Cooperative Research Center Cloud Standards Testing Lab and how you can get involved to extend the successes from these results in your own cloud software settings. Presented at the O'Reilly OSCON 2014 Open Cloud Day.
Video available at https://www.youtube.com/watch?v=eD2h0SqC7tY
Supporting Research through "Desktop as a Service" models of e-infrastructure...David Wallom
Keynote presentation given 13/9/16 @ ESA Earth Observation Open Science workshop 2016.
"The rise in cloud computing as an e-infrastructure model is one that has the power to democratise access to computational and data resources throughout the research communities. We have seen the difference that Infrastructure as a Service (IaaS) has made for different communities and are now only beginning to understand what different models further up the stack can make. It is also becoming clear that with the increase in research data volumes, the number of sources and the possibility of utilising data from different regulatory regimes that a different model of how analysis is performed on the data is possible. Utilising a "Desktop as a Service" model, with community focused applications installed on a common and well understood virtual system image that is directly connected to community relevant data allows the researcher to no longer have to consider moving data but only the final analysed results. This massively simplifies both the user model and the data and resource owner model. We will consider the specific example of the Environmental Ecomics Synthesis Cloud and how it could easily be generalised to other areas."
e-Research & the art of linking Astrophysics to DeforestationDavid Wallom
Keynote at HPCS 2016 on e-Research, talking about the e-Research methodology linking work on Astrophysics with finally Deforestation via Smartening Energy Systems and Detecting Energy Theft
Privacy and Security policies in the cloudDavid Wallom
Presentation on the impact of EU data protection and cyber-security regulations/directives on cloud computing @ Public Policy Exchange symposium 'Rethinking Data Protection and Privacy in Europe: Shaping the European Digital Future', 6th July 2016
Working with Earth Observation Data, INFORM and the IEADavid Wallom
Presentation given to University of Oxford Space network Earth Observation workshop on project and activities within OeRC Energy & Environmental ICT group. 9/6/16
Mapping Priorities and Future Collaborations for you ProjectsDavid Wallom
Presentation on the outputs of cloud computing clustering and how the self help tool can give insight to where you sit in the cloud ecosystem and who else you should be talking to.
CloudWatch: Mapping priorities and future collaboration for your projectDavid Wallom
Presentation @ NetFutures2016 on how building effective relationships is essential through all parts of a project from formation to closure. The CloudWATCH Cloud Clustering tool allows quantitative analysis of the cloud ecosystem landscape and a project or activities position in it.
Trust and Cloud Computing, removing the need to trust your cloud providerDavid Wallom
Presentation at CloudSecurityExpo 2106 publicly describing the Porridge distributed remote attestation using multiple trusted Third Parties as a way of building a cryptographically secure cloud service. Allowing users to know the cloud they are using is in exactly the format they expect. This will be commercially available through the Antyran product. This work is supported by InnovateUK KTP in partnership between University of Oxford e-Research centre and 100PercentIT. Other partners not in the KTP in include OctaInnovations.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
GridMate - End to end testing is a critical piece to ensure quality and avoid...ThomasParaiso2
End to end testing is a critical piece to ensure quality and avoid regressions. In this session, we share our journey building an E2E testing pipeline for GridMate components (LWC and Aura) using Cypress, JSForce, FakerJS…
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Federating Infrastructure as a Service cloud computing systems to create a uniform e-infrastructure for research
1. 1
Federating Infrastructure as a Service cloud
computing systems to create a uniform e-
infrastructure for research
David C.H. Wallom1, Matteo Turilli1,2, Michel Drescher3, Diego
Scardaci3,4 and Steven Newhouse3,5
1University of Oxford, Oxford, UK
2Rutgers University, New Jersey, USA
3EGI.eu, Amsterdam, NL
4INFN Division of Catania, Catania, Italy
5EMBL-EBI, Hinxton, UK
4. Rationale
Growth of Providers
• Grid Computing
– Academic resource providers
• Cloud Computing
– Diversity of resource providers
4
Tens of 1000’s Millions
Few related use cases
Single application model
Many diverse use cases
& application models
Growth of Research Communities
5. Hardware
Hardware
Hardware
Hardware
Hardware
Cloud Management
Cloud Management
Cloud Management
Cloud Management
Cloud Management
User Communities
User Communities
User Communities
Federated
interfaces
Federated
services
• Provider agnosticism: the only condition
to federate resources is to expose the
chosen interfaces and services.
• Heterogeneous implementation: no
mandate on the cloud technology,
protecting resource provider investment
• Standards and validation: Use only
recommended and common open
standards for the interfaces and images.
• Resource integration: Cloud Computing to
be integrated into the existing production
infrastructure, protecting prior EGI
investment and increasing ROI
Principles of Federation
6. User-offer Target
• Total control over deployed applications
• Elastic resource consumption based on real needs
• Workloads processed on-demand
• Endorsed and accredited applications available
from multiple different communities shared
• Single sign-on at multiple, independent providers
• Centralised access to service information across
multiple providers
6
7. Requirements on a federated cloud
# Capability User Stories
1 Virtual Machine
management
“I want to instantiate a single existing VM image on a remote cloud.”
2 Data management “I want to instantiate a VM instance from an image that I have created and is not on the
cloud I wish to use.”
“I want to associate my running VM with a data set in the Cloud.”
“I want to take snapshots of my running VM for restart purposes.”
3 Integrated
information system
“I want to choose on which resource provider I want to start my single VM.”
“I need to know about the Virtual Machine Manager (VMM) capabilities the provider
offers.”
4 Accounting “My usage across different resource providers needs to be recorded and reported to
multiple aggregators.”
5 Availability &
Reliability
“Information relating to the availability/reliability and current status of the remote
virtualised resource needs to be available to me.”
6 VM & Resource state
change notification
“When the status of the [VM] instance I am running changes (or will change) I want to be
told about it.”
7 Integrated AAI “I want to use my existing identity, and not re-apply for new credentials to use each
component of the service.”
8 VM Image
Management
“I want to use a single VM image across multiple different infrastructure providers.”
9 Brokering “I want my VM instance to run on a resource that is suitable based on a set of policies or
requirements rather than my choosing directly which resource will run it.”
10 Contextualisation “When I deploy a VM instance on a resource I want to give it configuration information for
customisation of the default template. This can only happen when it is up and running.”
8. Cloud within the EGI Infrastructure Platforms
8
EGI Core Platform
Federated
AAI
Service
Registry
Monitoring Accounting
EGI Cloud Infrastructure Platform
Instance
Mgmt
Information
Discovery
Storage
Management
Help and
Support
Security Co-
ordination
Training and
Outreach
EGICollaborationTools
EGIApplication
DB
Image
Repository
EGICloudServiceMarketplace
Sustainable
Business
Models
User Community
Monitoring and control of utilisation
Technical Consultancy and Support
Uniform interfaces to Cloud
Compute and Storage
Cloud Management Stacks
(OpenStack, OpenNebula, Synnefo, …)
SecureendorsedApplicationand
ServiceDeployment
GSIGLUE2
Cloudinit CDMI
SAM UR
OVF
OCCI
9. Cloud technology Integration Level
Integration
Cloud
Management
Stack
Fed.AAI
Monitoring
Accounting
Img.Mgmt.
OCCI
CDMI
OpenStack Y Y Y Y Y Y
OpenNebula Y Y Y Y Y Y
Synnefo Y Y Y Y Y
WNoDeS Y Y
StratusLab* Y Y Y Y
Cloudstack Y
Emotive Y Y
10. Resource Allocation
• e-Grant supports;
– Resource request
– Resource availability
• 5,000 job slots and 170 TB of storage aside
• Preliminary Units of Allocation
Name # of vCPU Mem (GB) Storage
(GB)
Small 1 2 1 x 20
Medium 2 4 1 x 40
Large 4 8 2 x 80
Other >2 >7.5 n x >40
11. FedCloud Infrastructure
• One year of production
• Resources
– 21 providers from 14 NGIs (Countries)
• 55% Openstack, 42% Open Nebula, 3%
Syneffo
– 17 interested in joining from 7 new
NGIs
• Usage
~700K VMs
~18M CPU hours
EGI Conference 2015
12. Supported Usage Models
• Service Hosting
–Long-running services (e.g. web, database or application
servers)
• Compute and data intensive workloads
–Batch and interactive (e.g. IPython, R, MATLAB) with
scalable and customized environments not limited to the
traditional job model
• Datasets repository
–Store and manage large datasets for your applications
• Disposable and testing environments
–Host training events, test new developments and
applications without overhead
14. Use Case Discipline Classification
Usecases
- 12 @ Launch
- 60 to date
- 15
production
15. READemption
Next Generation Sequencing
• Implemented in Python 3
• Needs:
• Third-Party libraries (numpy, scipy,
matplotlib, pysam)
• Short Read mapper ‘Segemehl’
• R
• Usual runtime on a multi-core machine
several hours to some days
Implementation on EGI Federated Cloud
• VO supported by GWDG, IFCA and
CESNET sites
• VM with 24 cores and 128 GB of RAM
• Block storage up to 3TB
• To serve peak loads
Source: Konrad U. Förstner
• Scientific Discipline: Natural Science, Biological Sciences, Bioinformatics
• Status: Production (hightroughputseq.egi.eu VO)
• Sites: GoeGrid, IFCA (Nov-Dec 2014)
16. OpenModeller on the Biovel Portal
Evalua
mod
BENELUX confe
Ghent, 02. April
The Ecological Niche Modeling (ENM) Workflow takes as input a file
containing species occurrence points to create a model with the
openModeller Web Service.
• The EUBrazilOpenBio ENM service is
exposed through an extended
openModeller Web Service interface
• Multi-staging and multi-parametric oM
experiments are implemented through
COMPSs that dynamically creates the
virtual resources to execute the
operations.
• An OCCI connector is used for the VMs
management while data management
supports CDMI endpoints.
ENM Service (OMWS2)
VENUS-C Cloud Middleware
COMPSs Workflow
Orchestrator
OCCI CDMI
EGI Federated Cloud
Service available at
https://portal.biovel.eu/
17. Peachnote
Peachnote is a music score search engine and
analysis platform.
Hundreds of thousands of music scores
are being digitized by libraries all over the
world. In contrast to books, they generally
remain inaccessible for content-based
retrieval and algorithmic analysis.
There is no analogue to Google Books
for music scores, and no large corpora
exists that can empower advanced
analysis on music scores.
Peachnote want to help change that
providing visitors and researchers access
to a massive amount of symbolic music
data.
EGI Federated Cloud
OMR
Worker
PDF
Splitter
OMR
Feeder
OMR
Worker
…
• Scientific Discipline: Humanities, Arts, Musicology
• Status: Production (peachnote.com VO)
• Sites: CESNET, FZJ (Nov-Dec 2014)
18. Chipster
Chipster in the EGI FedCloud:
• ‘light’ VM (datasets removed)
• Chipster VM configured through
contextualisation
• shared block storage exported as
NFS for tools (500 GB)
• block storage for output (500 GB)
• Scientific Discipline: Natural Science, Biological Sciences, Bioinformatics
• Status: Test & Integration (fedcloud.egi.eu VO)
• Sites: PRISMA-INFN-Bari
ELIXIR Pilot Action Proposal:
Using virtual machines and clouds in bioinformatics training
User-friendly analysis software for
high-throughput data:
• NGS
• Microarray
• Proteomics
• sequence data
19. Future Work
• Inter and Intra cloud networking,
– Using standards to support creation of custom multi resource private networks for cloud
resources
• Development of multi provider and multi consumer business models for federated cloud
operation,
• Expansion in higher level tools and services building on concrete foundations.
• Further development and release of updated standards in cloud and other related areas
– In development progress of complete EGI profile which will fully document all interfaces to the
CMF
• Connect resources from other commercial cloud providers to showcase deployment of
standards
20. Federated Cloud Services
Federated IaaS and STaaS Cloud
Tier 1:
Reliable
Infrastructure Cloud
Tier 4:
Zero ICT
Infrastructures
Tier 3:
Platform as a Service
Tier 2:
General-purpose
platform services
PaaS
PaaS
DBaaS
Hadoop
aaS
VRE
Secure storage
KeyMgmt
Encryptio
n
ACL
mgmt
Virtual
eLaboratory
21. Conclusion
The EGI Federated Cloud is a federation of institutional private Clouds, offering Cloud
Services to researchers in Europe and worldwide
A single cloud system able to
• Scale to user needs – beyond test and development instantiations to production private
clouds
• Integrate multiple different providers to give resilience
• Prevent vendor lock-in
• Enable resource provision targeted towards the research community
22. Thanks!
Alison Packer, Álvaro López García, Binh Minh Nguyen, Björn
Hagemeier, Boris Parak, Boro Jakimovski, Cal Loomis, Carlos Gimeno,
Christos Loverdos, Daniele Cesini, Daniele Lezzi, David Blundell, Diego
Scardaci, Elisabetta Ronchieri, Emir Imamagic, Enol Fernandez, Esteban
Freire, Feyza Eryol, Florian Feldhaus, Gergely Sipos, Ignacio Blanquer,
Iván Díaz, Jan Meizner, Jerome Pansanel, John Gordon, Kostas
Koumantaros, Linda Cornwall, Luis Alves, Malgorzata Krakowian, Marios
Chatziangelou, Marco Verlato, Marica Antonacci , Mattieu Puel, Matteo
Turilli, Michel Jouvin, Michel Drescher, Miguel A. Díaz, Miroslav Ruda,
Mischa Salle, Nuno L. Ferreira, Owen Synge, Paul Millar, Peter Solagna,
Piotr Kasprzak , Roberto Barbera, Ruben Valles, Sándor Ács, Salvatore
Pinto, Silvio Spardi, Soonwook Hwang, Steven Newhouse, Stuart
Pullinger, Sven Gabriel, Thijs Metsch, Tomasz Szepieniec, Víctor
Mendez, Viet Tran, Vincenzo Spinozo, Zeeshan Ali Shah, Zdenek Sustr