This document discusses continuous integration and delivery of research applications. It notes that computational infrastructure is complex with many different systems and applications. Researchers struggle to get their applications running across different environments. The document proposes automating as much of the application integration process as possible through continuous integration practices. This would include defining application dependencies, environments, and automating common integration tasks. The goal is to lower the barriers for researchers to use various computing resources by handling more of the integration work automatically.
This presentation gives an oiverview of the Sci-GaIA project, in the context of the CHAIN-REDS workshop at EGI2015 (Lisbon).
Aspects covered are :
1. The Sci-GaIA project: facts, figures and bjectives
2. The legacy of other projects (ei4Africa and CHAIN-REDS
3. The Sci-GaIA work programme
RAPIDS is a suite of open source software libraries and APIs gives you the ability to execute end-to-end data science and analytics pipelines entirely on GPUs.In this workshop, we will:
1. Introduce Rapids.ai & GPUs
2. Illustrate why GPUs are critical for machine learning and AI applications
3. Demonstrate common machine learning algorithms such as Regression, KNN,SGD etc. using RAPIDS on the QuSandbox
NICIS: Stepping Stones to a Cyberinfrastructure CommonsBruce Becker
There are a lot of benefits from the development of an Open Commons for science, but what are these exactly, in the case of e-Infrastructure and e-Science ? What are the commmons in South Africa ? What are the implications for researchers, educators, operators and developers in the region ? This and more, inside.
This presentation gives an oiverview of the Sci-GaIA project, in the context of the CHAIN-REDS workshop at EGI2015 (Lisbon).
Aspects covered are :
1. The Sci-GaIA project: facts, figures and bjectives
2. The legacy of other projects (ei4Africa and CHAIN-REDS
3. The Sci-GaIA work programme
RAPIDS is a suite of open source software libraries and APIs gives you the ability to execute end-to-end data science and analytics pipelines entirely on GPUs.In this workshop, we will:
1. Introduce Rapids.ai & GPUs
2. Illustrate why GPUs are critical for machine learning and AI applications
3. Demonstrate common machine learning algorithms such as Regression, KNN,SGD etc. using RAPIDS on the QuSandbox
NICIS: Stepping Stones to a Cyberinfrastructure CommonsBruce Becker
There are a lot of benefits from the development of an Open Commons for science, but what are these exactly, in the case of e-Infrastructure and e-Science ? What are the commmons in South Africa ? What are the implications for researchers, educators, operators and developers in the region ? This and more, inside.
Boston Data Engineering: Kedro Python Framework for Data Science: Overview an...Boston Data Engineering
What is Kedro?
Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It borrows concepts from software engineering best practices and applies them to machine-learning code; applied concepts include modularity, separation of concerns and versioning.
Kedro 2-minute Intro Video: https://youtu.be/KEdmJ2ADy_M
Kedro Docs: https://kedro.readthedocs.io
Kedro GitHub repo: https://github.com/quantumblacklabs/kedro
Meetup: https://www.meetup.com/f7324858-b804-4ed8-ba45-580c262189f1/events/280986950/
Workshop 1. Architecting Innovative Graph Applications
Join this hands-on workshop for beginners led by Neo4j experts guiding you to systematically uncover contextual intelligence. Using a real-life dataset we will build step-by-step a graph solution; from building the graph data model to running queries and data visualization. The approach will be applicable across multiple use cases and industries.
Slides from a presentation given to the User Experience Design Immersive class at General Assembly. The conversation focused on the following topics:
1) should designers know how to code?
2) should coding be part of the job?
3) what should a designer know? HTML/CSS? Ruby/Python? CMS? E-commerce platforms?
4) how does a designer go about picking up these technical skills?
hello everyone!
This is an example of how to make an industrial report for your college. By getting through this report you can easily make your own report.
This will help all those who spend a lot of time in browsing or for formats on how to make reports for their industrial training.
I am sure that after watching this report you will get a brief idea on how to make your own one and make it look attractive and purposeful.
Thank You!
What Architecture Taught Me About Information Architecture (and UX)Nam-ho Park
2016-02-20 Presentation at World IA Day / Seattle
1. Layering of complex systems
2. Sequencing of spaces
3. Figuring out documentation
4. Power in the grid
5. Patterns everywhere
6. Beauty in simplicity
7. Recognizing scale
Do UX designers have a role in reducing digital waste?User Vision
UX designers are primarily concerned with ensuring the experience of end users, but should we also consider the impact on the environment?
Do the ultra-usable and convenient digital lifestyles we help create provide ease-of-use at the cost of sustainability?
We'll explore the surprisingly large impact that digital has on C02 emissions and other contributors to the climate crisis.
Then we’ll discuss what can be done by individuals and as a profession to raise awareness of the issue contribute to ways to mitigate the problem.
Taming Cloud Sprawl - XConf Europe 2023 - Kief.pdfKief Morris
Composable Environments is an architectural approach to implementing environments (using Infrastructure as Code, of course) that are better aligned to support your organization's strategy.
Avoid Snowkflakes as Code and Horizontal Environment Provisioning. Instead, use Application-Driven Infrastructure Provisioning and Just Enough Environments!
Presentation for MAE November 2012 meeting
There is often a kind of tension between Agile Concepts and Architecture concepts.
Why is that?
What can be done about it?
Trusted Configurations for e-Infrastructure DeploymentBruce Becker
This presentation shows how EGI adopts tools and methodologies paving the way to DevOps via test automation and collaboration.
We present the development of an opinionated Style Guide for Ansible, along with compliance profiles and various testing and deployment scenarios.
Now you can cite APHRC's data sets (CHAIN-REDS)Bruce Becker
A report on how the CHAIN-REDS project worked with the African Populationa nd Health Research Centre (APHRC) to improve discoverability, impact and persistence.
More Related Content
Similar to Continuous Integration and Delivery of Research Applications
Boston Data Engineering: Kedro Python Framework for Data Science: Overview an...Boston Data Engineering
What is Kedro?
Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It borrows concepts from software engineering best practices and applies them to machine-learning code; applied concepts include modularity, separation of concerns and versioning.
Kedro 2-minute Intro Video: https://youtu.be/KEdmJ2ADy_M
Kedro Docs: https://kedro.readthedocs.io
Kedro GitHub repo: https://github.com/quantumblacklabs/kedro
Meetup: https://www.meetup.com/f7324858-b804-4ed8-ba45-580c262189f1/events/280986950/
Workshop 1. Architecting Innovative Graph Applications
Join this hands-on workshop for beginners led by Neo4j experts guiding you to systematically uncover contextual intelligence. Using a real-life dataset we will build step-by-step a graph solution; from building the graph data model to running queries and data visualization. The approach will be applicable across multiple use cases and industries.
Slides from a presentation given to the User Experience Design Immersive class at General Assembly. The conversation focused on the following topics:
1) should designers know how to code?
2) should coding be part of the job?
3) what should a designer know? HTML/CSS? Ruby/Python? CMS? E-commerce platforms?
4) how does a designer go about picking up these technical skills?
hello everyone!
This is an example of how to make an industrial report for your college. By getting through this report you can easily make your own report.
This will help all those who spend a lot of time in browsing or for formats on how to make reports for their industrial training.
I am sure that after watching this report you will get a brief idea on how to make your own one and make it look attractive and purposeful.
Thank You!
What Architecture Taught Me About Information Architecture (and UX)Nam-ho Park
2016-02-20 Presentation at World IA Day / Seattle
1. Layering of complex systems
2. Sequencing of spaces
3. Figuring out documentation
4. Power in the grid
5. Patterns everywhere
6. Beauty in simplicity
7. Recognizing scale
Do UX designers have a role in reducing digital waste?User Vision
UX designers are primarily concerned with ensuring the experience of end users, but should we also consider the impact on the environment?
Do the ultra-usable and convenient digital lifestyles we help create provide ease-of-use at the cost of sustainability?
We'll explore the surprisingly large impact that digital has on C02 emissions and other contributors to the climate crisis.
Then we’ll discuss what can be done by individuals and as a profession to raise awareness of the issue contribute to ways to mitigate the problem.
Taming Cloud Sprawl - XConf Europe 2023 - Kief.pdfKief Morris
Composable Environments is an architectural approach to implementing environments (using Infrastructure as Code, of course) that are better aligned to support your organization's strategy.
Avoid Snowkflakes as Code and Horizontal Environment Provisioning. Instead, use Application-Driven Infrastructure Provisioning and Just Enough Environments!
Presentation for MAE November 2012 meeting
There is often a kind of tension between Agile Concepts and Architecture concepts.
Why is that?
What can be done about it?
Trusted Configurations for e-Infrastructure DeploymentBruce Becker
This presentation shows how EGI adopts tools and methodologies paving the way to DevOps via test automation and collaboration.
We present the development of an opinionated Style Guide for Ansible, along with compliance profiles and various testing and deployment scenarios.
Now you can cite APHRC's data sets (CHAIN-REDS)Bruce Becker
A report on how the CHAIN-REDS project worked with the African Populationa nd Health Research Centre (APHRC) to improve discoverability, impact and persistence.
this is a presentation to the RECAS summer school on Science Gateways proposing the GADGET-2 application. GADGET2 is a smoothed-particle hydrodynamics simulation for astrophysical systems.
We are proposing to develop a science gateway portlet for easy access and use of this powerful application
A peptalk to the students of the RECAS/CaSaP summer school for science gateways in Catania.
http://agenda.ct.infn.it/conferenceOtherViews.py?view=standard&confId=1056
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxMAGOTI ERNEST
Although Artemia has been known to man for centuries, its use as a food for the culture of larval organisms apparently began only in the 1930s, when several investigators found that it made an excellent food for newly hatched fish larvae (Litvinenko et al., 2023). As aquaculture developed in the 1960s and ‘70s, the use of Artemia also became more widespread, due both to its convenience and to its nutritional value for larval organisms (Arenas-Pardo et al., 2024). The fact that Artemia dormant cysts can be stored for long periods in cans, and then used as an off-the-shelf food requiring only 24 h of incubation makes them the most convenient, least labor-intensive, live food available for aquaculture (Sorgeloos & Roubach, 2021). The nutritional value of Artemia, especially for marine organisms, is not constant, but varies both geographically and temporally. During the last decade, however, both the causes of Artemia nutritional variability and methods to improve poorquality Artemia have been identified (Loufi et al., 2024).
Brine shrimp (Artemia spp.) are used in marine aquaculture worldwide. Annually, more than 2,000 metric tons of dry cysts are used for cultivation of fish, crustacean, and shellfish larva. Brine shrimp are important to aquaculture because newly hatched brine shrimp nauplii (larvae) provide a food source for many fish fry (Mozanzadeh et al., 2021). Culture and harvesting of brine shrimp eggs represents another aspect of the aquaculture industry. Nauplii and metanauplii of Artemia, commonly known as brine shrimp, play a crucial role in aquaculture due to their nutritional value and suitability as live feed for many aquatic species, particularly in larval stages (Sorgeloos & Roubach, 2021).
What is greenhouse gasses and how many gasses are there to affect the Earth.moosaasad1975
What are greenhouse gasses how they affect the earth and its environment what is the future of the environment and earth how the weather and the climate effects.
hematic appreciation test is a psychological assessment tool used to measure an individual's appreciation and understanding of specific themes or topics. This test helps to evaluate an individual's ability to connect different ideas and concepts within a given theme, as well as their overall comprehension and interpretation skills. The results of the test can provide valuable insights into an individual's cognitive abilities, creativity, and critical thinking skills
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Toxic effects of heavy metals : Lead and Arsenicsanjana502982
Heavy metals are naturally occuring metallic chemical elements that have relatively high density, and are toxic at even low concentrations. All toxic metals are termed as heavy metals irrespective of their atomic mass and density, eg. arsenic, lead, mercury, cadmium, thallium, chromium, etc.
Phenomics assisted breeding in crop improvementIshaGoswami9
As the population is increasing and will reach about 9 billion upto 2050. Also due to climate change, it is difficult to meet the food requirement of such a large population. Facing the challenges presented by resource shortages, climate
change, and increasing global population, crop yield and quality need to be improved in a sustainable way over the coming decades. Genetic improvement by breeding is the best way to increase crop productivity. With the rapid progression of functional
genomics, an increasing number of crop genomes have been sequenced and dozens of genes influencing key agronomic traits have been identified. However, current genome sequence information has not been adequately exploited for understanding
the complex characteristics of multiple gene, owing to a lack of crop phenotypic data. Efficient, automatic, and accurate technologies and platforms that can capture phenotypic data that can
be linked to genomics information for crop improvement at all growth stages have become as important as genotyping. Thus,
high-throughput phenotyping has become the major bottleneck restricting crop breeding. Plant phenomics has been defined as the high-throughput, accurate acquisition and analysis of multi-dimensional phenotypes
during crop growing stages at the organism level, including the cell, tissue, organ, individual plant, plot, and field levels. With the rapid development of novel sensors, imaging technology,
and analysis methods, numerous infrastructure platforms have been developed for phenotyping.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxRASHMI M G
Abnormal or anomalous secondary growth in plants. It defines secondary growth as an increase in plant girth due to vascular cambium or cork cambium. Anomalous secondary growth does not follow the normal pattern of a single vascular cambium producing xylem internally and phloem externally.
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
Continuous Integration and Delivery of Research Applications
1. Continuous Integration and
Delivery of
Research Applications
Bruce Becker
bbecker@csir.co.za
Coordinator, AAROC
Meraka Institute, CSIR
Fanie Riekert
RiekertSJPK@ufs.ac.za
Senior Systems Engineer
HPC Unit, UFS
Steps towards an Open e-Science Commons
2. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Outline
●
Compuational Infrastructure: It's a jungle out
there
●
How does research actually get done ? A
reminder of the scientific method in the 3rd and
4th paradigm
●
How do applications get onto an infrastructure ?
●
What are the issues ?
●
What are we doing about it ?
●
What can you do about it ?
3. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za3
Functional
Sites
Core Services
6 sites
~2300 CPU cores
~100 TB perm. Storage
~17 permanent staff
Proposed extensions
So, we built this...
5. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za5
Universite'
Gaston
Berger
(Saint-Louis)
Kwame Nkrumah
University of
Science
And Technology
(Kumasi, Ghana)
Nsukka
University
(Nsukka)
http://cds.cern.ch/record/1476
353
220 servers, 30
routers
And others are too
6. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za6
And it is part of this
blue: EGI council members and EGI-InSPIRE
partners
green: integrated resource infrastructure providers
purple: peer resource infrastructure providershttp://www.egi.eu/infrastructure/
8. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
It's (apparently) a jungle out there.
http://en.wikipedia.org/wiki/Jungle_computing
Seinstra, F. J., Maassen, J., van Nieuwpoort, R.~V., et al. 2011,
Grids, Clouds and Virtualization, 167
Jungle Computing: Distributed Supercomputing beyond Clusters, Grids, and Clouds
9. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Where does your code run ?
●
On your laptop ?
●
Congrats ! Uh oh, you just rand out of X
●
You want to do what !?
●
Local cluster ?
●
hm. different setup... <hack hack>
hey it's working ! … kinda.
●
Permissions, environment,
data staging, etc.
●
CHPC ?
●
There's more than one
●
ALL the Optimisations !!
10. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za10
Are you even talking about the
same application ?
●
Apart from performance, are you sure that you
are getting the same results on different
architectures and platforms ?
●
Is it even worth the effort ?
12. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
What users want
Amazing infrastructure
Some users want highly
varied, modular
application selection
Vertically integrated
Highly specialised
applications
Highly trained supportHighly trained support
13. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
What users get sometimes
14. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Current Methods
Are you a legit
scientist ?
Ok, sounds good
what applications
are you running ?
We can run
almost anything !
Hi, I heard about
AfricaGrid – sounds
grrrrreat ! Can I
Use it ?
I've got beakers
…
And glasses !
Awesome ! Well, first we do some
preprocessing with RaNDo, then we
run a massively parallel pipeline with
XBOT, then blah and blah then blah
blah blah and blah, we also need blah
blah and sometimes our colleagues
from FarAwayU need to access the
data with StrangeProtocolx1, and of
course blah blah, not to mention blah
…..
We'll see
what we
can do
15. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Actual workflow
●
Find the source code
●
Unpack, ./configure && make … Error 1 missing x.h
●
… hack hack hack … hm. Missing dependencies
●
Recursively* determine the dependencies**
●
*Yes, recursively means curse, curse then curse again.
●
**The actual dependencies, not the ones that were written on the
web page from 5 years ago.
●
… mutter mutter …
●
???? Why are you compiling now !?
16. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Actual communication
Hey, how's it going ?
you done with my porting yet ?
You done yet ?
You done ?
…. bloody IT people !
Have you actually seen this code ?
I just...
It looks like Cthulhu's beard....
Ok, I got it to compile,
but only with if you
type with your left toes.
AAArrrgh, I'm a scientist,
not an IT people !
17. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
How many applications are there ?
Natural Sciences
Other
Medical and Health Sciences
Support Activities
Engineering and Technology
Agricultural Sciences
Humanities
Social Sciences
http://appdb.egi.eu
18. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Way too many...
(and these are just the ones we
know about)
Biological sciences
Bioinformatics
Biochemistry and molecular biology
Structural biology
Biophysics
Genetics and heredity
Cell biology
Evolutionary biology
Physical sciences
Astronomy
Astroparticle physics
Astrophysics
High energy physics
Fusion
Plasma physics
Fluid Mechanics
Condensed matter physics
Mathematical physics
Medical physics
Optics
Physics
Acoustics
19. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
There's gotta be a better way
Let's give researchers a
”better deal”
20. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za20
• Hypothesis 1:
– It always comes down to an application
• At the end of all first-contact you get a list of applications
that users want
– Let's focus on that, shall we ?
• Hypothesis 2:
– No software is an island
• Everything has dependencies, even if it's just glibc or
libgfortran
– Let's make those explicit, shall we ?
Compuatational Infrastructure
What is it good for ?
21. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za21
• Hypothesis 3:
– Applications need an environment
• Compiler ?
• Architecture ?
• Dependencies ?
• Middleware ?
• Etc....
– Let's make those explicit shall we ?
Compuatational Infrastructure
What is it good for ?
22. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za22
• Hypothesis 4:
– there is more than one environment.
• Nothing is going to ever be really useful if it can only run in
one place.
• Nobody wants to be forced into a single environment
(grid/HPC/cloud, etc)
– Let's try to simulate as many target environments (sites)
as we can, shall we?
Compuatational Infrastructure
What is it good for ?
23. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za23
Compuatational Infrastructure
What is it good for ?
●
Hypothesis 5:
●
Solutions decay
– The solution – as well as the dead ends – are part of the
process.
– If it's not in a version-controlled repository, it's no good.
– If it's not executable (in a wiki/document, etc), then you
have to convert it back into something a computer
understands – why not just leave it in executable format ?
●
Let's recognise that solutions decay and code for the
future, shall we ?
24. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za24
Humans need not apply
(for certain tasks)
●
Hypothesis 6:
●
Most of the work of porting and supporting applications
can be automated
– Reduce the load on the user and the operator, by automating
common tasks
●
Let's:
– Integrate the various software, middleware, service and
infrastructure components as far as possible,
– Keep it Open
●
… shall we ?
25. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za25
Compuatational Infrastructure
What is it good for ?
●
Hypothesis 6 (Corollary) : You're in the way
26. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za26
Compuatational Infrastructure
What is it good for ?
●
Hypothesis 7:
●
This stuff's not hard
– Continuous Integration has been around for a long time
– If it can be done by hand, it can be scripted; if it can be
scripted, it can be automated properly.
●
Let's
– adopt widely-used tools and methodologies, and
– hey, how about we use the web to communciate
(you know, like we do with EVERYTHING else ?)
27. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Let's work on a better deal...
●
How do we lower the barrier to entry to the grid or
cloud infrastructure ?
●
How can the application expert prove to the resource
provider that the application will actually run on the
execution environment of the site ?
●
How can we manage the lifecycle of applications across
multiple versions, architectures, configurations ?
●
How can we ensure that once applications are ”certified”,
they are actually available on as many sites as possible ?
28. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Go to Bootcamp
http://software-carpentry.org/bootcamps
29. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za29
What are we actually doing ?
●
Put the ball in the developer/community/user's court
●
You tell us how to build your application
●
Provide a simple interface to porting pipeline
●
Provide focussed, goal-oriented support when needed.
●
Communication has a context and is directly linked to the issues at
hand
●
The entry point is open
●
Extensible and reproducible
●
Publish and preserve the entire toolchain
●
It's a real collaboration now – with code !
●
Can we say the big S word ? (not the 4-letter word - SUSTAINABLE)
30. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za30
Tools of the Trade (1)
Waffle.io
Project
planning
Slack.com
Team collab
32. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za32
What's the plan, Stan ?
https://waffle.io/aaroc/devops
33. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Typical workflow and roles
Testingmatrix
Defines relevant
tests in Jenkins
Writes code to
pass required tests
Dev/Stage env.Application
developer
Infrastructure
expert
Reads description
of execution environment tests
Promote a build
to CVMFS
34. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
My Github brings all the hackers to
the backyard
35. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Social coding
Identify yourself, create a branch
Jenkins will help you test your code
at every commit
Contribute back, open a pull request
Discuss contrbutions
https://guides.github.com/introduction/flow/index.html
36. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Get credit for your work
Bruce Becker, Marco Fargetta, pkoro, Pavlos Daoglou, Chris Lee, et al. (2014)
DevOps: AAROC Ansible LDAP role release.
Available:http://dx.doi.org/10.5281/zenodo.11914.
https://github.com/AAROC/DevOps/tree/v0.0.2
DOIDOI 10.5281/zenodo.1023310.5281/zenodo.10233
37. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za
Sounds great, show me the details
38. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za38
Application porting, testing, integration and deployment workflow
Follow up
with the user
Fail Test
Tests Fail
Invoke Artifact
Tests Pass
Open Issue
on Github
Stage to
Testing Repo
Field Tests
Update Repo
Version
Lock CVMFS
repo
Field Tests Pass
Open Issue
on Github
Field Tests Fail
DeployArtifact
Update Repo
Version
Tag sites
Create
Build Script
Pull Request
Create Jenkins
Job
Build
if first time
Invoke Test
if build
has passed
Fail Build
Build Fails Build Passes
Follow up
with the user
Fail Test
Tests Fail
Invoke Artifact
Tests Pass
Open Issue
on Github
Stage to
Testing Repo
Update Repo
Version
The new pipeline
48. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za48
Last Bit: Delivery
Executable Artifacts
go here
Every site can
mount
the CVMFS
repo
Even your
laptop...
49. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za49
Sound Familiar ?
Executable Artifacts
go here
This will take a lot of
work, but there's a
huge community
behind the tools.
Deploy into linux
containers
50. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za50
Do you even remember the title of
this talk ?
EVERYTHING
=
CODE
51. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za51
What if we could automate
infrastructure execution too ?
●
We can and are, but there's no time to get into that right
now.
●
It's a jungle out there – but we know most of the animals.
●
Configuration and integration are the main issues.
●
Turn it into … a zoo ? (kinda)
●
See https://github.com/AAROC/DevOps
●
Yes, we need help
52. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za52
Summary
●
Making the best use of computational infrastructure comes
down to running applications
●
Maintaining and porting them is hard and tricky.
Communication is the hardest part.
●
We've built an automated porting system, which
●
will deliver functional, tested, relevant software to your site.
●
Can ease the communication blocks in collaborative work
●
Oh and it works :)
●
Come on in and help us build it.
We want an Open e-Science Commons
53. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za53
What can you do ?
● Join the orgs on Github
● Propose applications
● Fork, branch, pull request.
● Help develop the workflow
● Tell your friends
54. Bruce Becker: Coordinator, SAGrid | bbecker@csir.co.za | http://www.sagrid.ac.za54
Thanks !
●
Join the organisations:
●
http://github.com/AAROC/
●
http://github.com/SAGridOps
●
Let's talk : http://aaroc.github.com/
●
https://ciari.slack.com/ (ask for an invite)
●
@TheSAGrid
●
@brusisceddu
●
fb.com/SAGrid
Editor's Notes
So, what does one of these ”e-Infrastructures” look like ? In South Africa, we have one of the youngest. A significant investment in computing and data resources, both centrally managed and funded by the government,as well as at universities and research laboratories. A complex ecosystem... but the vast majority of the funding so far has gone into the South African Research Network. The South African NREN connects more people and resources than was even conceivable just 5 years ago, at 10 GB/s. The impact of this single investment is difficult to estimate, since it is an enabling infrastructure and indeed some projects in science and education can now be undertaken thanks to it.
A large portion of the computing resources have been integrated into a distributed computing infrastructure based on the grid paradigm which you are all familiar with – the South African National Grid. You see at the sites the boxes and figures representing the people and services which operate as a single entity in a federation.
Now, South Africa is of course not alone in these developments. The BGI project itself has pushed such developments on both sides of the continent, not to mention those on the northern coast, supported by EUMedGrid, Support and Connect. There are strong NRENs in Kenya and Tanzania, which, together with Uganda, form good regional capacity in Eastern Africa.
West Africa is of course a bit more complex than other parts of our continent, but here we see a lot of activity. In order to cut down on time, I&apos;ve shown only a few, such as the site at UCGB in Seneal, the site at Nsukka University in Nigeria and the site under development at Kwame Nkrumah University of Science and Technology, which recently was the recipient of a large transfer of materiel from CERN.
As in the case of Eastern Africa, these form the nucleus the model I will describe in a moment. But let us go back to South Africa for a moment and take stock.
So, what have we learned in the last few years ?
A few general trends, being followed worldwide, such as the adoption of the digital paradigm of scientific research, and the role that e-Infrastructures play in it... but we&apos;ve documented and to some extent liberated the latent potential for collaboration in Africa. The digital divide still separates many, but the increase in access to bandwidt and other digital platforms has been almost revolutionary. It has connected us with a large diaspora with which to collaborate and allowed us to meaningfully contribute to international research project. Importantly, a great many neglected problems have become visible and thus piqued the interest of research communities locally, which are more optimistic of their ability to address these challenges.
However, despitethes positive tones, one must remain realistic: there is indeed vast potential, but the available energy is not all transformed into actual work. (more thermodynamic analogies later)
So, what have we learned in the last few years ?
A few general trends, being followed worldwide, such as the adoption of the digital paradigm of scientific research, and the role that e-Infrastructures play in it... but we&apos;ve documented and to some extent liberated the latent potential for collaboration in Africa. The digital divide still separates many, but the increase in access to bandwidt and other digital platforms has been almost revolutionary. It has connected us with a large diaspora with which to collaborate and allowed us to meaningfully contribute to international research project. Importantly, a great many neglected problems have become visible and thus piqued the interest of research communities locally, which are more optimistic of their ability to address these challenges.
However, despitethes positive tones, one must remain realistic: there is indeed vast potential, but the available energy is not all transformed into actual work. (more thermodynamic analogies later)
So, what have we learned in the last few years ?
A few general trends, being followed worldwide, such as the adoption of the digital paradigm of scientific research, and the role that e-Infrastructures play in it... but we&apos;ve documented and to some extent liberated the latent potential for collaboration in Africa. The digital divide still separates many, but the increase in access to bandwidt and other digital platforms has been almost revolutionary. It has connected us with a large diaspora with which to collaborate and allowed us to meaningfully contribute to international research project. Importantly, a great many neglected problems have become visible and thus piqued the interest of research communities locally, which are more optimistic of their ability to address these challenges.
However, despitethes positive tones, one must remain realistic: there is indeed vast potential, but the available energy is not all transformed into actual work. (more thermodynamic analogies later)