- Life sciences have specific computational challenges as data production from sequencing grows faster than Moore's law and there is a constant need to compare new data to existing data.
- France is developing complementary HPC and HTC infrastructures for life sciences, including the Institut Français de Bioinformatique, France Génomique, and E-Biothon - an HPC platform for research in life sciences.
- E-Biothon provides researchers with access to Blue Gene/P supercomputers and over 200TB of storage for large-scale genome assembly and comparative analyses in projects like studying synteny across microbial genomes and characterizing biodiversity.
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3Gianpaolo Coro
An e-Infrastructure is a distributed network of service nodes, residing on multiple sites and managed by one or more organizations. e-Infrastructures allow scientists residing at distant places to collaborate. They offer a multiplicity of facilities as-a-service, supporting data sharing and usage at different levels of abstraction, e.g. data transfer, data harmonization, data processing workflows etc. e-Infrastructures are gaining an important place in the field of biodiversity conservation. Their computational capabilities help scientists to reuse models, obtain results in shorter time and share these results with other colleagues. They are also used to access several and heterogeneous biodiversity catalogues.
In this course, the D4Science e-Infrastructure will be used to conduct experiments in the field of biodiversity conservation. D4Science hosts models and contributions by several international organizations involved in the biodiversity conservation field. The course will give students an overview of the models, the practices and the methods that large international organizations like FAO and UNESCO apply by means of D4Science. At the same time, the course will introduce students to the basic concepts under e-Infrastructures, Virtual Research Environments, data sharing and experiments reproducibility.
Calit2 - CSE's Living Laboratory for ApplicationsLarry Smarr
08.05.27
UCSD CSE 91 - Perspectives in Computer Science (Spring 2008)
Calit2@UCSD
Title: Calit2 - CSE's Living Laboratory for Applications
La Jolla, CA
Web Apollo: Lessons learned from community-based biocuration efforts.Monica Munoz-Torres
This presentation tries to highlight the importance and relevance of community-based curation of biological data. It describes the results of harvesting expertise from dispersed researchers assigning functions to predicted and curated peptides, as well as collaborative efforts for standardization of genes and gene product attributes across species and databases.
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 3Gianpaolo Coro
An e-Infrastructure is a distributed network of service nodes, residing on multiple sites and managed by one or more organizations. e-Infrastructures allow scientists residing at distant places to collaborate. They offer a multiplicity of facilities as-a-service, supporting data sharing and usage at different levels of abstraction, e.g. data transfer, data harmonization, data processing workflows etc. e-Infrastructures are gaining an important place in the field of biodiversity conservation. Their computational capabilities help scientists to reuse models, obtain results in shorter time and share these results with other colleagues. They are also used to access several and heterogeneous biodiversity catalogues.
In this course, the D4Science e-Infrastructure will be used to conduct experiments in the field of biodiversity conservation. D4Science hosts models and contributions by several international organizations involved in the biodiversity conservation field. The course will give students an overview of the models, the practices and the methods that large international organizations like FAO and UNESCO apply by means of D4Science. At the same time, the course will introduce students to the basic concepts under e-Infrastructures, Virtual Research Environments, data sharing and experiments reproducibility.
Calit2 - CSE's Living Laboratory for ApplicationsLarry Smarr
08.05.27
UCSD CSE 91 - Perspectives in Computer Science (Spring 2008)
Calit2@UCSD
Title: Calit2 - CSE's Living Laboratory for Applications
La Jolla, CA
Web Apollo: Lessons learned from community-based biocuration efforts.Monica Munoz-Torres
This presentation tries to highlight the importance and relevance of community-based curation of biological data. It describes the results of harvesting expertise from dispersed researchers assigning functions to predicted and curated peptides, as well as collaborative efforts for standardization of genes and gene product attributes across species and databases.
Looking for etiquette training toronto,toronto business etiquette,executive presence training,business etiquette toronto,business etiquette training,image consultant toronto,leadership training toronto.Then corporateclassinc.com is one stop for you.
Social Commerce 2.0 With CPC Strategy & AddShoppersTinuiti
In this LIVE webinar event, retailers will learn how to drive orders by integrating social into their existing channels like SEO, email marketing, and retargeting. CPC Strategy’s Stephen Kerner and AddShoppers Co-founder Chad Ledford team up to deliver advanced social commerce strategies supported by real client data.
Daisy CTO, Nathan Marke, talks digital technology and how it's affecting businesses across all industries. This is the speech Nathan gave at Daisy Communications' flagship event 'Daisy Wired? 2014'. For more info, visit www.daisygroupplc.com
IDB-Cloud Providing Bioinformatics Services on Cloudstratuslab
A presentation of IDB (Infrastructure Distributed for Biology) using StratusLab technology by Christophe Blanchet and Clément Gauthey at Lille, France, May 2013.
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 5Gianpaolo Coro
An e-Infrastructure is a distributed network of service nodes, residing on multiple sites and managed by one or more organizations. e-Infrastructures allow scientists residing at distant places to collaborate. They offer a multiplicity of facilities as-a-service, supporting data sharing and usage at different levels of abstraction, e.g. data transfer, data harmonization, data processing workflows etc. e-Infrastructures are gaining an important place in the field of biodiversity conservation. Their computational capabilities help scientists to reuse models, obtain results in shorter time and share these results with other colleagues. They are also used to access several and heterogeneous biodiversity catalogues.
In this course, the D4Science e-Infrastructure will be used to conduct experiments in the field of biodiversity conservation. D4Science hosts models and contributions by several international organizations involved in the biodiversity conservation field. The course will give students an overview of the models, the practices and the methods that large international organizations like FAO and UNESCO apply by means of D4Science. At the same time, the course will introduce students to the basic concepts under e-Infrastructures, Virtual Research Environments, data sharing and experiments reproducibility.
Looking for etiquette training toronto,toronto business etiquette,executive presence training,business etiquette toronto,business etiquette training,image consultant toronto,leadership training toronto.Then corporateclassinc.com is one stop for you.
Social Commerce 2.0 With CPC Strategy & AddShoppersTinuiti
In this LIVE webinar event, retailers will learn how to drive orders by integrating social into their existing channels like SEO, email marketing, and retargeting. CPC Strategy’s Stephen Kerner and AddShoppers Co-founder Chad Ledford team up to deliver advanced social commerce strategies supported by real client data.
Daisy CTO, Nathan Marke, talks digital technology and how it's affecting businesses across all industries. This is the speech Nathan gave at Daisy Communications' flagship event 'Daisy Wired? 2014'. For more info, visit www.daisygroupplc.com
IDB-Cloud Providing Bioinformatics Services on Cloudstratuslab
A presentation of IDB (Infrastructure Distributed for Biology) using StratusLab technology by Christophe Blanchet and Clément Gauthey at Lille, France, May 2013.
USING E-INFRASTRUCTURES FOR BIODIVERSITY CONSERVATION - Module 5Gianpaolo Coro
An e-Infrastructure is a distributed network of service nodes, residing on multiple sites and managed by one or more organizations. e-Infrastructures allow scientists residing at distant places to collaborate. They offer a multiplicity of facilities as-a-service, supporting data sharing and usage at different levels of abstraction, e.g. data transfer, data harmonization, data processing workflows etc. e-Infrastructures are gaining an important place in the field of biodiversity conservation. Their computational capabilities help scientists to reuse models, obtain results in shorter time and share these results with other colleagues. They are also used to access several and heterogeneous biodiversity catalogues.
In this course, the D4Science e-Infrastructure will be used to conduct experiments in the field of biodiversity conservation. D4Science hosts models and contributions by several international organizations involved in the biodiversity conservation field. The course will give students an overview of the models, the practices and the methods that large international organizations like FAO and UNESCO apply by means of D4Science. At the same time, the course will introduce students to the basic concepts under e-Infrastructures, Virtual Research Environments, data sharing and experiments reproducibility.
National scale research computing and beyond pearc panel 2017Gregory Newby
Panel at the PEARC 2017 event in New Orleans, July 11-13. Panelists were: Gregory Newby, Chief Technology Officer, Compute Canada; Florian Berberich, Member of the Board of Directors PRACE aisbl; Gergely Sipos, Customer and Technical Outreach Manager, EGI Foundation; and John Towns, Director of Collaborative eScience Programs, National Center for Supercomputing Applications.
Panel abstract: How might the international community of research computing users and stakeholders benefit from knowledge sharing among national- or international-scale research computing organizations and providers? It is common for large-scale investments in research computing systems, services and support to be guided and funded with government oversight and centralized planning. There are many commonalities, including stakeholder relations, outcomes reporting, long-range strategic planning, and governance. What trends exist currently, and how might information sharing and collaboration among resource providers be beneficial? Is there desire to form a partnership, or to build upon existing relationships? Participants in this panel will include personnel involved in US, Canadian and European research computing jurisdictions.
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to microbes. Overview of work underway to add applications and computational analysis pipelines to iPlant for metagenomics and microbial ecology.
Providing Bioinformatics Services on Cloudstratuslab
Improvements of experimental technologies forces biologists to face a deluge of data that require relevant tools and sufficient resources to be analyzed. The cloud helps bioinformatics experts to define virtual appliances with pre-installed tools and workflows, and helps scientists to deploy them, on demand, on national research infrastructures.
Presented by Christophe Blanchet and Clément Cauthey at the EGI Community Forum in Manchester, UK in April 2013.
The Neuroinformatics community in OpenAIRE Connect (Presentation by Sorina Po...OpenAIRE
"The Neuroinformatics community in OpenAIRE Connect"
Presentation by Sorina Pop from CNRS at the Digital Infrastructures Conference 2018, Lisbon. OpenAIRE Session: OpenAIRE services for Research Communities (Oct. 11, 2018)
The BlueBRIDGE approach to collaborative researchBlue BRIDGE
Gianpaolo Coro, ISTI-CNR, at BlueBRIDGE workshop on "Data Management services to support stock assessement", held during the Annual ICES Science conference 2016
Data are the new oil: Big data, data mining and bio - inspiring techniquesAboul Ella Hassanien
Invited talk at the national institute of astronomy and geophysics - Helwan on Wed. 21 October 2014 on Data are the new oil: Big data, data mining and bio - inspiring techniques
Data is the new oil: Big data, data mining and bio - inspiring techniquesAboul Ella Hassanien
Invited talk at the national institute of astronomy and geophysics - Helwan on Wed. 21 October 2014 on Data is the new oil: Big data, data mining and bio - inspiring techniques
Genomic Big Data Management, Integration and Mining - Emanuel WeitschekData Driven Innovation
Thanks to Next Generation Sequencing (NGS), a technology that is lowering the cost and time of reading DNA, we are faced with huge amounts of biomedical data. These data are continuously collected by research laboratories, and often organized through world-wide consortia, which are releasing many public data bases. One of the main aims of bioinformatics is to solve fundamental issues in biomedicine research (e.g., how cancer occurs) starting from big genomic data and their analysis. In this talk I will give an overview of big genomic data management, integration, and mining.
Text (personal views position statement) to accompany presentation on what research infrastructures really need for data, XLDB-Europe, 8-10th June 2011, Edinburgh
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
E biothon workshop 2014 04 15 v1
1. e-Biothon
V. Breton (breton@clermont.in2p3.fr)
LPC Clermont-Ferrand, IdGC
CNRS-IN2P3
http://france-grilles.fr
Credit: N. Bard, A. Franc, JF Gibrat
Extreme Performance Computational Science workshop
Tokyo, April 15th 2014
2. Table of content
2
• What are the computing challenges of life
sciences?
• France Grilles: a
multidisciplinarydistributede-
infrastructure for science
• E-Biothon: an HPC platform for research in
life sciences
3. Generalities on sequencing
• Genome = DNA sequence (4 nucleotids:
A, C, G, T)
– Smallest non viral genome:
Carsonellaruddii (0,16Mbp)
– Largestgenome: Polychaosdubium(670Gbp)
4. Sanger technology 500 bpsequences
454 technology 105reads of 450 to 600bp seq.
Illumina Technology 106 reads of 100 bpseq.
Currentprojects(Tara) 107reads of 100 to 400 bpseq.
Explosion of data set size
Data analysis ?
Algorithms?
Heuristics?
Tara @ http://oceans.taraexpeditions.org/
Evolution of sequencing
techniques
5. Data production isdistributed
2558 High Throughput « NextGeneration » sequencingfacilities in the world,
located in 920 centers (only 10 with more than 15 machines)
Source: omicspmaps.com
7. Sequencing scenarii
• Interest for a new genome requires assembly
– process of taking a large number of short DNA sequences and
putting them back together to create a representation of the
original
– Algorithms based on read overlapping benefit from large RAM (1
TO) -> HPC
• Working with a reference genome requires comparative
analysis
– Alignment algorithms (BLAST) findregions of local
similaritybetweensequences
– Phylogeny algorithms (PhyML) build evolutionary relationships
between genomes
– Comparative analyses are easily parallelized at data level -> HTC
8. Summary
• Life Sciences have specificcomputational challenges
– Data production growsfasterthan Moore law
– Permanent need of comparing new data to existingones
• Life sciences needscanberelevantlyaddressed on
multidisciplinary IT infrastructures (e-infrastructures)
– HPC resources best fitted for genomeassembly
– Grid/cloud HTC resourceswellfitted for comparative analysis
• Life sciences are among the main users of the French
national grid/cloud production infrastructure
9. France Grilles
• Is a ScientificInterest Group…
– Created in 2010 by 8 partners: CEA, CNRS,CPU, INRA, INRIA,
INSERM, MESR, RENATER…
– To steer up and coordinate the national strategy in the fields of
grids and clouds
• Vision:
– Build and operate a national distributedcomputing
infrastructure open to all sciences and to developing countries
9
10. France Grilles model
• France Grilles does not own the resources
– Resourcesowned by user communities
• France Grilles provides a framework
– To shareresources, expertise and know how
– To promote innovation and initiatives
– To foster collaboration at national and international
levels
– To reach out to the long tail of users
10
12. EGI de 2010 à 2013
12
2010-2013: from 14 regional to 34 operations centres in 53 countries,
from 188,000 jobs/day with 80,000 cores on 250 Resource Centres
to 1,200,000 jobs/day with 430,000 cores on 337 Resource Centres
Technologies
• Grids
• Clouds
• Desktops
Exposé S. Newhouse Madrid, Sept. 2013
France Grilles, a partner of EGI
14. Provide an open environment for fruitfuldisciplinary and
multidisciplinaryresearch
14
5 1 1
218
54
9 1 5 9 11 15 13 11
755
99 50
9
23
1
10
100
1000
Over 1500 scientific publications
june 2010 – April 2014
15. Web portal
Users
479 registered users in Nov 2013 (175 in France)
Most used robot certificate in EGI (http://go.egi.eu/wiki.robot.users)
Neuro-image analysisCancer therapy simulation
Prostate radiotherapy plan simulated
with GATE(L. Grevillot and D. Sarrut)
Image simulation
Echocardiography simulated with
FIELD-II (O. Bernard et al)
Modeling and optimization of
distributed computing systems
Acceleration yielded by non-clairvoyant
task replication (R. Ferreira da Silva et al)
Brain tissue segmentation
with Freesurfer
Scientific applications
Infrastructure
Supported by EGI Infrastructure
Uses biomed VO (most used EGI VO for life sciences in 2013)
VIP accounts for ~25% of biomed's activity
VIP consumes ~50 CPU years every month
DIRAC
France-Grilles
Application as a service
File transfer to/from grid
Virtual Imaging Platform:
http://www.creatis.insa-lyon.fr/vip
16. Collaborations withdedicated life sciences infrastructures
• Institut Français de Bioinformatique (computing
and storageresourcesatIDRIS)
• France Genomique ( computing and
storageresourcesat TGCC)
• France Life Imaging (infrastructure for
biomedicalimaging)
• E-Biothon
16
17. 17
• Telethon: everyyear, fundraising by
french media for French
MuscularDistrophy Association (AFM)
• FromTelethon to Decrypthon
– Computing infrastructure (IBM)
– Researchprojects (CNRS)
– Humanresources (AFM)
• FromDecrypthon to E-Biothon
E-Biothon: history
18. e-Biothon: an HPC platform for
research in life sciences
18
User Support
Blue Gene / p
machines
Technical supportUser Support
Blue Gene / P
operationWeb access
portal
19. E-Biothon: infrastructure
19
• 2 Blue Gene/P IBM racks
with 200 TO storage
– 2x1024 4-core nodes
– up to 28 TFlopspeak
performance
• SysFera-DS web access
to computingresources
• 2 modes:
– Standard (MPI)
– HTC (1024
independenttasks in
parallel)
20. E-Biothon vision is to offer a service to
the user communities in life sciences
• 2013-2014: first 3 projects
– Jean-François Gibrat et al, (MIGALE
platform, INRA Jouy-en-Josas)
– Olivier Gascuel, Stéphane Guindon et
Vincent Lefort (CNRS Montpellier)
– Yec’hanLaizet, Philippe
Chaumeil, Jean-Marc
Frigerio, Stéphanie Mariette, Sophie
Gerber, Alain Franc (INRA BioGeCo –
Bordeaux)
• > 2014: open call for projects (IFB)
21. Studying the synteny over a wide
range of microbialgenomes
21
• Definition: similar blocks of genes in the same relative positions in
the genome
• Interest: Study of syntenycan show how the genomeiscut and pasted
in the course of evolution
• MIGALE team at INRA designed a pipeline analysis to
computesyntenybetween 2 genomes and store it in a database
• E-Biothon impact: change in scale - capacity to
computesyntenybetween 2000 completebacterialgenomes (7
millions comparisons)
22. PhyML
Philogeneticsis the study of evolutionaryrelationshipsamong groups of
organisms
PhyMLis a software thatestimates maximum
likelihoodphylogeniesfromalignments of nucleotide or
aminoacidsequences
PhyML original publication in 2007 is the mostcited in environment and
ecology (> 6000 citations).
E-Biothon impact: change in scale in the resources made available
to PhyMLusers
25. Study of biodiversity in Guyane
16000 differenttreespecies
in amazonianforest (≈ 300
in Europe)
More biodiversity in 10000
m2 of forest in French
Guyana than in Europe
Decrypthonadded value
Change in scale (from local Mesocenter in
Bordeaux)
Millions of reads
Exact distance computation
withoutheuristics (alignement scores)
TOctets of data producedeveryweek
26. Conclusion
• Both HPC and HTC resources are increasinglyneeded to
address life sciences data and computing challenges:
– As sequencing technologies keepevolving, data production
growsfasterthan Moore law and isincreasinglydistributed
– Biological data need to beconstantlycompared to
eachother (phylogenetics, genomics comparative analysis)
• France isdevelopingcomplementary HPC and HTC
infrastructures for life sciences
– Institut Français de Bioinformatique, France Génomique
– E-Biothon: an HPC platform for research in life sciences
– France Grilles: a multidisciplinarygrid/cloud production
infrastructure
30. Are life sciences
specificw.r.tcomputing?
Whatisspecific to life sciences:
- As sequencing technologies keepevolving, data production growsfasterthan
Moore law
- Biological data need to beconstantlycompared to eachother (phylogenetics,
Genomics comparative analysis)
Whatis not specific?
- Data production isdistributed
- Multiscalemodeling