SlideShare a Scribd company logo
1 of 33
Download to read offline
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Càlcul i anàlisi de dades massiu per al disseny
d’enzims amb aplicacions biotecnològiques
Xevi Biarnés
@xevibiarnes
xevi.biarnes@iqs.edu
Departament de Bioenginyeria
IQS School of Engineering
Jornada TAC’2015
Mataró, 30 de juny de 2015
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
IQS School of Engineering
IQS School of Management
Via Augusta, 390
Barcelona
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Bioengineering Department
 Laboratory of Biochemistry
 Laboratory of Biomaterials
 Laboratory of Tissue Engineering
 Laboratory of Microbiology
 Laboratory of Bioprocesses
 Degree in Biotechnology
 Master’s of Science in Bioengineering
 PhD in Bioengineering
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Laboratory of Biochemistry
Main R&D topics:
 Protein engineering and enzymology of glycosidases and glycosyltranferases
 Therapeutic targets in infectious diseases
 Amyloidogenic proteins in neurodegenerative diseases
 Biocatalysis: enzyme redesign, directed evolution of enzymes
 Metabolic Engineering for the production of glycoglycerolipids
Headed by Prof. Antoni Planas
5 permanent staff
8 PhD students
6 MSc students
4 undergrad students
2 research assistants
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Bioinformatics and Molecular Modelling Unit
Laboratory of Biochemistry
BIOMMIQS
 Bioinformatics for comparative
analysis of genomic sequences
 Protein Structure Prediction
 In silico tools to assist in
experimental Protein Engineering
 Simulation of small and macro
molecules conformational dynamics
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
BIOMMIQS @IQS and Anella Científica
BIOMMIQS
Direct access to
consortium services:
 CBUC/CCUC
 EDUROAM
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
BIOMMIQS @IQS and Anella Científica
 Barcelona (BSC-CNS)
MareNostrum 3, MinoTauro, Altix
 Madrid (CeSViMa-UPM)
Magerit
 Islas Canarias (IAC, ITC)
LaPalma 2, Atlante
 Cantabria (UC)
Altamira 2
 Málaga (UMA)
Picasso 2
 Valéncia (UV)
Tirant 2
 Zaragoza (BIFI-UZ)
CaesarAugusta 2
BIOMMIQS
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Re-Evolution of Genomes
optimization of the genetic codes of
living organisms to adapt to their
living environments
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Burst of Genomic Data
• Growth of GeneBank database
http://www.ncbi.nlm.nih.gov/genbank/statistics
700 GBytes
of raw data
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
ACTAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCAGCGCAGGTTTCAGTGCC
ATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGA
GAACACAAACAGGAGAAAGATCTTGAATACGGGGCATGTCCTAATAACATGATGG
GCACGTTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGT
AGGATCTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGTA
CCAGAACTACCTATTCAAAGCCAACCCAAATTATGCTGGCAACTCCATTAAGCCAAC
TGGTGCCAGTCGTACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGG
GCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACTCACAATGAT
TGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATGAACCAC
CGCCCAAGGCTGTGACACGATTGCTCGTTGCGATTGCCAGACAGGGGTGTACTAC
GTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATG
TAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAG
GGTCACTCAGAACCTGGTGATTGCGGTGGTATCCTTAGATGCCAACATGGCGTCGT
CGGCATAGTGTCTACTGGTGGTAATGGGCTCGTTGGCTTTGCAGACGTTAGAGACC
TCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATCAAGGG
TCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAA
GCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATCTTGAAAAA
TCTTATTAAACTAATCTCTGCACTGGTATTGTGATCAGAAGTGATTACGACATGGTTA
Where is
the information?
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
We are at the post-genomic era
Torgeir R. Hvidsten
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Proteins are the Machinery of the Cells
genes (DNA) only keep the information
proteins (aminoacids) perform the function
The Inner Life of the Cell (youtube)
XVIVO Scientific Animation
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Proteins are 3D Objects
nanometer size
>sequence_of_aminoacids
MYCSASTCTCSTTATRHYGCKLMNDSSCRFGH
KLISPRDTEDFSGFRTCSKLIPSCSFACVIPL
PSFACEERERWQSRTNCVISCRTEDPLKISCF
GRSRACGRSTTRSGCSPLYPLREDTSWASDFR
3D structure and function of proteins is dictated
by their aminoacids sequence
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Proteins are Dynamic 3D Objects
hemoglobin:
oxygen
transporter
in blood
eppur si muove
a = F / m
x(t) = x0 + v0·t + ½·a·t2
protein motion can be simulated on the computer:
Molecular Dynamics (MD)
typical simulation:
1 billion of steps!
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Computer Simulations of Biological Processes @ BIOMMIQS
Implementation of
computational algorithms
based on Molecular Dynamics
(metadynamics)
that enhance the simulation of
biologically relevant processes
BIOMMIQS
Protein Folding
Protein Aggregation
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Computer Simulations of Biological Processes @ BIOMMIQS
Prion protein folding
Prion protein (the causative agent of spongiform encephalopathy)
is an unstable protein that can adopt different structures.
One of these structures, tends to form precipitates in the central
nervous system tissue, leading to neurodegeneration.
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Computer Simulations of Biological Processes @ BIOMMIQS
Prion protein folding
The structural determinants of prion protein
stability were identified in-silico by extensive
computer simulations.
Benetti F. and Biarnés X. et al, JMB 2014
The simulations spent
 1.000.000 of hours of total CPU time,
and generated
 1 TeraByte of data.
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Computer Simulations of Biological Processes @ BIOMMIQS
Amyloid-β peptide aggregation
Amyloid-β is an intrinsically disordered
protein. The final segment of this
protein can lead to aggregates. These
aggregates are associated to
Alzheimer’s disease.
18 molecules of the final Amyloid-β
segment were simulated, and a nascent
fibril was detected.
Baftizadeh F. et al, PRL 2013
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Technical Details of Computer Simulations
• Huge CPU-demanding
– millions of hours in CPU time  supercomputers
• Big Data Storage
– 100 GBytes per simulation (3-4 months)  cloud storage?
• Data Transfer
– 1 GByte of data generated daily
– Need to transfer locally for visualization  efficient communications
– Current download rates:
• From CESVIMA (UPM Madrid) to IQS 5.3 MBytes / s
• From BSC (UPC Barcelona) to IQS ??
• From SISSA (Trieste, Italy) to IQS 9.5 Mbytes / s
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Technical Details of Computer Simulations
• Visualization
– Renders are done off-line in local computers
• Visualization on-line?
– Remote Desktops are a solution  not nice
– “Streaming” of xyz coordinates could be a solution?
C 3.23 2.22 4.34
O 2.31 1.34 3.41
H 2.88 2.35 5.32
C 3.21 2.11 1.22
…
…
30000-50000 atoms
x 100000 frames
Minimal Atomic Coordinates File
atom x y z
3D renders are generated by specific software
based on an atomic coordinate file containing
the xyz coordinates of each atom
in the protein structure
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Proteins as Chemical Machines: ENZYMES
 Chemical transformations rule our life
6·CO2 + 6·H2O C6H12O6 + 6·O2
Enzymes decrease the activation
energy required for a chemical
transformation: this is a catalyst
PHOTOSYNTHESIS
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Proteins as “bio”catalysts (enzymes)
Protein structures are tightly
tuned to accommodate their
natural ligands.
Maximum catalytic efficiency of
enzymes is attained, in part, by the
binding forces in the active site.
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Industrial applications of enzymes
 Amylases
production of sugars from starch in
syrups production
 Glucanases
starch degradation prior to
fermentation in beer production
 Proteases
 Cellulases
cellulose degradation prior to fermentation in
bioethanol production
 Lipases
esterification of lipids in biodiesel production
 Amylases, Xylanases, Cellulases, Ligninases
starch degradation to lower viscosity, aiding
sizing and coating paper
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Industrial applications of enzymes
PRODUCTION OF NON-NATURAL ADDED-VALUE
COMPOUNDS
Pharmaceuticals Pigments Biomaterials
Complementing traditional chemical industry
···
- Processes optimization
- Green chemistry
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Enzymes to Produce Added-Value Compounds
natural
compound
novel
compound
there is room for enzyme optimization
by PROTEIN ENGINEERING
Natural enzymes are not optimized for non-natural compounds
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
The Protein Engineering Dilemma
MSDTAGSPWFSHSLKRNQDFGFYY
SDFCNARSDTPQSCWREGQNESDR
QTAVWPYRTSCNMLKCSRYTCVPM
Protein Engineering can be guided by
Computer Simulations and Genomic-Data Mining
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Protein Engineering by Data Mining @ BIOMMIQS
Setting-up of an
integrative platform
to assist in protein engineering
experiments
BIOMMIQS
The platform is based on biological data integration
from different database sources
and complemented with computer simulations
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Protein Engineering by Data Mining @ BIOMMIQS
 UNIPROT
50.000.000
protein sequences
 PDB
110.000
protein structures
 PFAM
16.000
protein functions
 CAZY
340.000
enzymes active on carbohydrates
 GENBANK
185.000.000
genomic sequences
http://www.ncbi.nlm.nih.gov/genbank/
http://uniprot.org
http://pdb.org
http://pfam.xfam.org
http://cazy.org
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Protein Engineering by Data Mining @ BIOMMIQS
Protein engineering of chitindeacetylases for the
biotechnological production of chitosan
http://nano3bio.eu
from chitin
to chitosan
……
+ + + ……
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Protein Engineering by Data Mining @ BIOMMIQS
 Natural chitindeacetylase enzymes producing different chitosans
Andrés E. et al, Angew Chem Intl Ed 2014
 Engineering of a non-natural
chitindeacetylase to produce
new-to-nature chitosans
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Technical Details of Data Mining
• Public Databases size
– GENBANK  700 GBytes
– UNIPROT  27 GBytes
– PDB  373 GBytes
– PFAM  195 GBytes
– No local copies! For general purposes, public web services are used.
• BLAST
• JMOL
• HMMSEARCH
• Public Databases are updated regularly (weekly)
– Need to update local copies  mirrors?
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Bioinformatics and Molecular Modelling Unit
Laboratory of Biochemistry
BIOMMIQS
 Bioinformatics for comparative
analysis of genomic sequences
 Protein Structure Prediction
 In silico tools to assist in
experimental Protein Engineering
 Simulation of small and macro
molecules conformational dynamics
TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes
Càlcul i anàlisi de dades massiu per al disseny
d’enzims amb aplicacions biotecnològiques
Xevi Biarnés
@xevibiarnes
xevi.biarnes@iqs.edu
Departament de Bioenginyeria
IQS School of Engineering
Jornada TAC’2015
Mataró, 30 de juny de 2015

More Related Content

Similar to Càlcul i anàlisi de dades massiu per al disseny d'enzims amb aplicacions a la indústria biotecnològica

Alagene BioFoundry: Releasing the Genie Out of the Bottle
Alagene BioFoundry: Releasing the Genie Out of the Bottle Alagene BioFoundry: Releasing the Genie Out of the Bottle
Alagene BioFoundry: Releasing the Genie Out of the Bottle
Levi Shapiro
 
Real time modeling system
Real time modeling systemReal time modeling system
Real time modeling system
GBX Summits
 
ACIB - Biotech Stories
ACIB - Biotech StoriesACIB - Biotech Stories
ACIB - Biotech Stories
Martin Trinker
 

Similar to Càlcul i anàlisi de dades massiu per al disseny d'enzims amb aplicacions a la indústria biotecnològica (20)

Bio g - ManufacturingIintelligence webinar GBX
Bio g - ManufacturingIintelligence webinar GBXBio g - ManufacturingIintelligence webinar GBX
Bio g - ManufacturingIintelligence webinar GBX
 
Biomanufacturing 2016
Biomanufacturing 2016 Biomanufacturing 2016
Biomanufacturing 2016
 
Webinar on Biomanufacturing 4.0 – A New Era in Cell and Gene Therapy Development
Webinar on Biomanufacturing 4.0 – A New Era in Cell and Gene Therapy DevelopmentWebinar on Biomanufacturing 4.0 – A New Era in Cell and Gene Therapy Development
Webinar on Biomanufacturing 4.0 – A New Era in Cell and Gene Therapy Development
 
Omprn 2018 module1_final
Omprn 2018 module1_finalOmprn 2018 module1_final
Omprn 2018 module1_final
 
A global integrative ecosystem for digital pathology: how can we get there?
A global integrative ecosystem for digital pathology: how can we get there?A global integrative ecosystem for digital pathology: how can we get there?
A global integrative ecosystem for digital pathology: how can we get there?
 
Industrial Biotechnology-a Key to Bioeconomy
Industrial Biotechnology-a Key to BioeconomyIndustrial Biotechnology-a Key to Bioeconomy
Industrial Biotechnology-a Key to Bioeconomy
 
Bio.Kitchen
Bio.KitchenBio.Kitchen
Bio.Kitchen
 
Data analytics and software sensors for single use bioprocessing
Data analytics and software sensors for single use bioprocessingData analytics and software sensors for single use bioprocessing
Data analytics and software sensors for single use bioprocessing
 
Cancer uk 2015_module1_ouellette_ver02
Cancer uk 2015_module1_ouellette_ver02Cancer uk 2015_module1_ouellette_ver02
Cancer uk 2015_module1_ouellette_ver02
 
Alagene BioFoundry: Releasing the Genie Out of the Bottle
Alagene BioFoundry: Releasing the Genie Out of the Bottle Alagene BioFoundry: Releasing the Genie Out of the Bottle
Alagene BioFoundry: Releasing the Genie Out of the Bottle
 
KJP Almeria Horticulture collaboration 2015 final
KJP Almeria Horticulture collaboration 2015 finalKJP Almeria Horticulture collaboration 2015 final
KJP Almeria Horticulture collaboration 2015 final
 
Health, Data Analytics and Decision Support
Health, Data Analytics and Decision SupportHealth, Data Analytics and Decision Support
Health, Data Analytics and Decision Support
 
automation of milk process in dairy field using plc and scada
automation of milk process in dairy field using plc and scadaautomation of milk process in dairy field using plc and scada
automation of milk process in dairy field using plc and scada
 
The role of reliable data collection systems for improved livestock genetics ...
The role of reliable data collection systems for improved livestock genetics ...The role of reliable data collection systems for improved livestock genetics ...
The role of reliable data collection systems for improved livestock genetics ...
 
Real time modeling system
Real time modeling systemReal time modeling system
Real time modeling system
 
Keeping brazil's medical industry safe with Micro Profile and JakartaEE - Jak...
Keeping brazil's medical industry safe with Micro Profile and JakartaEE - Jak...Keeping brazil's medical industry safe with Micro Profile and JakartaEE - Jak...
Keeping brazil's medical industry safe with Micro Profile and JakartaEE - Jak...
 
SMi Group's Cell & Gene Therapy 2018 conference
SMi Group's Cell & Gene Therapy 2018 conferenceSMi Group's Cell & Gene Therapy 2018 conference
SMi Group's Cell & Gene Therapy 2018 conference
 
ACIB - Biotech Stories
ACIB - Biotech StoriesACIB - Biotech Stories
ACIB - Biotech Stories
 
Tracking cmo investments in biologics
Tracking cmo investments in biologicsTracking cmo investments in biologics
Tracking cmo investments in biologics
 
agriopenlink WS@EFITA 2015
agriopenlink WS@EFITA 2015agriopenlink WS@EFITA 2015
agriopenlink WS@EFITA 2015
 

More from CSUC - Consorci de Serveis Universitaris de Catalunya

More from CSUC - Consorci de Serveis Universitaris de Catalunya (20)

Tendencias en herramientas de monitorización de redes y modelo de madurez en ...
Tendencias en herramientas de monitorización de redes y modelo de madurez en ...Tendencias en herramientas de monitorización de redes y modelo de madurez en ...
Tendencias en herramientas de monitorización de redes y modelo de madurez en ...
 
Quantum Computing Master Class 2024 (Quantum Day)
Quantum Computing Master Class 2024 (Quantum Day)Quantum Computing Master Class 2024 (Quantum Day)
Quantum Computing Master Class 2024 (Quantum Day)
 
Publicar dades de recerca amb el Repositori de Dades de Recerca
Publicar dades de recerca amb el Repositori de Dades de RecercaPublicar dades de recerca amb el Repositori de Dades de Recerca
Publicar dades de recerca amb el Repositori de Dades de Recerca
 
In sharing we trust. Taking advantage of a diverse consortium to build a tran...
In sharing we trust. Taking advantage of a diverse consortium to build a tran...In sharing we trust. Taking advantage of a diverse consortium to build a tran...
In sharing we trust. Taking advantage of a diverse consortium to build a tran...
 
Formació RDM: com fer un pla de gestió de dades amb l’eiNa DMP?
Formació RDM: com fer un pla de gestió de dades amb l’eiNa DMP?Formació RDM: com fer un pla de gestió de dades amb l’eiNa DMP?
Formació RDM: com fer un pla de gestió de dades amb l’eiNa DMP?
 
Com pot ajudar la gestió de les dades de recerca a posar en pràctica la ciènc...
Com pot ajudar la gestió de les dades de recerca a posar en pràctica la ciènc...Com pot ajudar la gestió de les dades de recerca a posar en pràctica la ciènc...
Com pot ajudar la gestió de les dades de recerca a posar en pràctica la ciènc...
 
Security Human Factor Sustainable Outputs: The Network eAcademy
Security Human Factor Sustainable Outputs: The Network eAcademySecurity Human Factor Sustainable Outputs: The Network eAcademy
Security Human Factor Sustainable Outputs: The Network eAcademy
 
The Research Portal of Catalonia: Growing more (information) & more (services)
The Research Portal of Catalonia: Growing more (information) & more (services)The Research Portal of Catalonia: Growing more (information) & more (services)
The Research Portal of Catalonia: Growing more (information) & more (services)
 
Facilitar la gestión, visibilidad y reutilización de los datos de investigaci...
Facilitar la gestión, visibilidad y reutilización de los datos de investigaci...Facilitar la gestión, visibilidad y reutilización de los datos de investigaci...
Facilitar la gestión, visibilidad y reutilización de los datos de investigaci...
 
La gestión de datos de investigación en las bibliotecas universitarias españolas
La gestión de datos de investigación en las bibliotecas universitarias españolasLa gestión de datos de investigación en las bibliotecas universitarias españolas
La gestión de datos de investigación en las bibliotecas universitarias españolas
 
Disposes de recursos il·limitats? Prioritza estratègicament els teus projecte...
Disposes de recursos il·limitats? Prioritza estratègicament els teus projecte...Disposes de recursos il·limitats? Prioritza estratègicament els teus projecte...
Disposes de recursos il·limitats? Prioritza estratègicament els teus projecte...
 
Les persones i les seves capacitats en el nucli de la transformació digital. ...
Les persones i les seves capacitats en el nucli de la transformació digital. ...Les persones i les seves capacitats en el nucli de la transformació digital. ...
Les persones i les seves capacitats en el nucli de la transformació digital. ...
 
Enginyeria Informàtica: una cursa de fons
Enginyeria Informàtica: una cursa de fonsEnginyeria Informàtica: una cursa de fons
Enginyeria Informàtica: una cursa de fons
 
Transformació de rols i habilitats en un món ple d'IA
Transformació de rols i habilitats en un món ple d'IATransformació de rols i habilitats en un món ple d'IA
Transformació de rols i habilitats en un món ple d'IA
 
Difusió del coneixement a l'Il·lustre Col·legi de l'Advocacia de Barcelona
Difusió del coneixement a l'Il·lustre Col·legi de l'Advocacia de BarcelonaDifusió del coneixement a l'Il·lustre Col·legi de l'Advocacia de Barcelona
Difusió del coneixement a l'Il·lustre Col·legi de l'Advocacia de Barcelona
 
Fons de discos perforats de cartró
Fons de discos perforats de cartróFons de discos perforats de cartró
Fons de discos perforats de cartró
 
Biblioteca Digital Gencat
Biblioteca Digital GencatBiblioteca Digital Gencat
Biblioteca Digital Gencat
 
El fons Enrique Tierno Galván: recepció, tractament i difusió
El fons Enrique Tierno Galván: recepció, tractament i difusióEl fons Enrique Tierno Galván: recepció, tractament i difusió
El fons Enrique Tierno Galván: recepció, tractament i difusió
 
El CIDMA: més enllà dels espais físics
El CIDMA: més enllà dels espais físicsEl CIDMA: més enllà dels espais físics
El CIDMA: més enllà dels espais físics
 
Els serveis del CSUC per a la comunitat CCUC
Els serveis del CSUC per a la comunitat CCUCEls serveis del CSUC per a la comunitat CCUC
Els serveis del CSUC per a la comunitat CCUC
 

Recently uploaded

Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 

Recently uploaded (20)

ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024Extensible Python: Robustness through Addition - PyCon 2024
Extensible Python: Robustness through Addition - PyCon 2024
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
Event-Driven Architecture Masterclass: Integrating Distributed Data Stores Ac...
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 

Càlcul i anàlisi de dades massiu per al disseny d'enzims amb aplicacions a la indústria biotecnològica

  • 1. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Càlcul i anàlisi de dades massiu per al disseny d’enzims amb aplicacions biotecnològiques Xevi Biarnés @xevibiarnes xevi.biarnes@iqs.edu Departament de Bioenginyeria IQS School of Engineering Jornada TAC’2015 Mataró, 30 de juny de 2015
  • 2. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes IQS School of Engineering IQS School of Management Via Augusta, 390 Barcelona
  • 3. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Bioengineering Department  Laboratory of Biochemistry  Laboratory of Biomaterials  Laboratory of Tissue Engineering  Laboratory of Microbiology  Laboratory of Bioprocesses  Degree in Biotechnology  Master’s of Science in Bioengineering  PhD in Bioengineering
  • 4. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Laboratory of Biochemistry Main R&D topics:  Protein engineering and enzymology of glycosidases and glycosyltranferases  Therapeutic targets in infectious diseases  Amyloidogenic proteins in neurodegenerative diseases  Biocatalysis: enzyme redesign, directed evolution of enzymes  Metabolic Engineering for the production of glycoglycerolipids Headed by Prof. Antoni Planas 5 permanent staff 8 PhD students 6 MSc students 4 undergrad students 2 research assistants
  • 5. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Bioinformatics and Molecular Modelling Unit Laboratory of Biochemistry BIOMMIQS  Bioinformatics for comparative analysis of genomic sequences  Protein Structure Prediction  In silico tools to assist in experimental Protein Engineering  Simulation of small and macro molecules conformational dynamics
  • 6. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes BIOMMIQS @IQS and Anella Científica BIOMMIQS Direct access to consortium services:  CBUC/CCUC  EDUROAM
  • 7. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes BIOMMIQS @IQS and Anella Científica  Barcelona (BSC-CNS) MareNostrum 3, MinoTauro, Altix  Madrid (CeSViMa-UPM) Magerit  Islas Canarias (IAC, ITC) LaPalma 2, Atlante  Cantabria (UC) Altamira 2  Málaga (UMA) Picasso 2  Valéncia (UV) Tirant 2  Zaragoza (BIFI-UZ) CaesarAugusta 2 BIOMMIQS
  • 8. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Re-Evolution of Genomes optimization of the genetic codes of living organisms to adapt to their living environments
  • 9. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Burst of Genomic Data • Growth of GeneBank database http://www.ncbi.nlm.nih.gov/genbank/statistics 700 GBytes of raw data
  • 10. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes ACTAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCAGCGCAGGTTTCAGTGCC ATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGA GAACACAAACAGGAGAAAGATCTTGAATACGGGGCATGTCCTAATAACATGATGG GCACGTTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGT AGGATCTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGTA CCAGAACTACCTATTCAAAGCCAACCCAAATTATGCTGGCAACTCCATTAAGCCAAC TGGTGCCAGTCGTACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGG GCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACTCACAATGAT TGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATGAACCAC CGCCCAAGGCTGTGACACGATTGCTCGTTGCGATTGCCAGACAGGGGTGTACTAC GTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATG TAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAG GGTCACTCAGAACCTGGTGATTGCGGTGGTATCCTTAGATGCCAACATGGCGTCGT CGGCATAGTGTCTACTGGTGGTAATGGGCTCGTTGGCTTTGCAGACGTTAGAGACC TCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATCAAGGG TCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAA GCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATCTTGAAAAA TCTTATTAAACTAATCTCTGCACTGGTATTGTGATCAGAAGTGATTACGACATGGTTA Where is the information?
  • 11. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes We are at the post-genomic era Torgeir R. Hvidsten
  • 12. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Proteins are the Machinery of the Cells genes (DNA) only keep the information proteins (aminoacids) perform the function The Inner Life of the Cell (youtube) XVIVO Scientific Animation
  • 13. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Proteins are 3D Objects nanometer size >sequence_of_aminoacids MYCSASTCTCSTTATRHYGCKLMNDSSCRFGH KLISPRDTEDFSGFRTCSKLIPSCSFACVIPL PSFACEERERWQSRTNCVISCRTEDPLKISCF GRSRACGRSTTRSGCSPLYPLREDTSWASDFR 3D structure and function of proteins is dictated by their aminoacids sequence
  • 14. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Proteins are Dynamic 3D Objects hemoglobin: oxygen transporter in blood eppur si muove a = F / m x(t) = x0 + v0·t + ½·a·t2 protein motion can be simulated on the computer: Molecular Dynamics (MD) typical simulation: 1 billion of steps!
  • 15. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Computer Simulations of Biological Processes @ BIOMMIQS Implementation of computational algorithms based on Molecular Dynamics (metadynamics) that enhance the simulation of biologically relevant processes BIOMMIQS Protein Folding Protein Aggregation
  • 16. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Computer Simulations of Biological Processes @ BIOMMIQS Prion protein folding Prion protein (the causative agent of spongiform encephalopathy) is an unstable protein that can adopt different structures. One of these structures, tends to form precipitates in the central nervous system tissue, leading to neurodegeneration.
  • 17. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Computer Simulations of Biological Processes @ BIOMMIQS Prion protein folding The structural determinants of prion protein stability were identified in-silico by extensive computer simulations. Benetti F. and Biarnés X. et al, JMB 2014 The simulations spent  1.000.000 of hours of total CPU time, and generated  1 TeraByte of data.
  • 18. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Computer Simulations of Biological Processes @ BIOMMIQS Amyloid-β peptide aggregation Amyloid-β is an intrinsically disordered protein. The final segment of this protein can lead to aggregates. These aggregates are associated to Alzheimer’s disease. 18 molecules of the final Amyloid-β segment were simulated, and a nascent fibril was detected. Baftizadeh F. et al, PRL 2013
  • 19. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Technical Details of Computer Simulations • Huge CPU-demanding – millions of hours in CPU time  supercomputers • Big Data Storage – 100 GBytes per simulation (3-4 months)  cloud storage? • Data Transfer – 1 GByte of data generated daily – Need to transfer locally for visualization  efficient communications – Current download rates: • From CESVIMA (UPM Madrid) to IQS 5.3 MBytes / s • From BSC (UPC Barcelona) to IQS ?? • From SISSA (Trieste, Italy) to IQS 9.5 Mbytes / s
  • 20. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Technical Details of Computer Simulations • Visualization – Renders are done off-line in local computers • Visualization on-line? – Remote Desktops are a solution  not nice – “Streaming” of xyz coordinates could be a solution? C 3.23 2.22 4.34 O 2.31 1.34 3.41 H 2.88 2.35 5.32 C 3.21 2.11 1.22 … … 30000-50000 atoms x 100000 frames Minimal Atomic Coordinates File atom x y z 3D renders are generated by specific software based on an atomic coordinate file containing the xyz coordinates of each atom in the protein structure
  • 21. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Proteins as Chemical Machines: ENZYMES  Chemical transformations rule our life 6·CO2 + 6·H2O C6H12O6 + 6·O2 Enzymes decrease the activation energy required for a chemical transformation: this is a catalyst PHOTOSYNTHESIS
  • 22. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Proteins as “bio”catalysts (enzymes) Protein structures are tightly tuned to accommodate their natural ligands. Maximum catalytic efficiency of enzymes is attained, in part, by the binding forces in the active site.
  • 23. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Industrial applications of enzymes  Amylases production of sugars from starch in syrups production  Glucanases starch degradation prior to fermentation in beer production  Proteases  Cellulases cellulose degradation prior to fermentation in bioethanol production  Lipases esterification of lipids in biodiesel production  Amylases, Xylanases, Cellulases, Ligninases starch degradation to lower viscosity, aiding sizing and coating paper
  • 24. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Industrial applications of enzymes PRODUCTION OF NON-NATURAL ADDED-VALUE COMPOUNDS Pharmaceuticals Pigments Biomaterials Complementing traditional chemical industry ··· - Processes optimization - Green chemistry
  • 25. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Enzymes to Produce Added-Value Compounds natural compound novel compound there is room for enzyme optimization by PROTEIN ENGINEERING Natural enzymes are not optimized for non-natural compounds
  • 26. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes The Protein Engineering Dilemma MSDTAGSPWFSHSLKRNQDFGFYY SDFCNARSDTPQSCWREGQNESDR QTAVWPYRTSCNMLKCSRYTCVPM Protein Engineering can be guided by Computer Simulations and Genomic-Data Mining
  • 27. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Protein Engineering by Data Mining @ BIOMMIQS Setting-up of an integrative platform to assist in protein engineering experiments BIOMMIQS The platform is based on biological data integration from different database sources and complemented with computer simulations
  • 28. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Protein Engineering by Data Mining @ BIOMMIQS  UNIPROT 50.000.000 protein sequences  PDB 110.000 protein structures  PFAM 16.000 protein functions  CAZY 340.000 enzymes active on carbohydrates  GENBANK 185.000.000 genomic sequences http://www.ncbi.nlm.nih.gov/genbank/ http://uniprot.org http://pdb.org http://pfam.xfam.org http://cazy.org
  • 29. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Protein Engineering by Data Mining @ BIOMMIQS Protein engineering of chitindeacetylases for the biotechnological production of chitosan http://nano3bio.eu from chitin to chitosan …… + + + ……
  • 30. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Protein Engineering by Data Mining @ BIOMMIQS  Natural chitindeacetylase enzymes producing different chitosans Andrés E. et al, Angew Chem Intl Ed 2014  Engineering of a non-natural chitindeacetylase to produce new-to-nature chitosans
  • 31. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Technical Details of Data Mining • Public Databases size – GENBANK  700 GBytes – UNIPROT  27 GBytes – PDB  373 GBytes – PFAM  195 GBytes – No local copies! For general purposes, public web services are used. • BLAST • JMOL • HMMSEARCH • Public Databases are updated regularly (weekly) – Need to update local copies  mirrors?
  • 32. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Bioinformatics and Molecular Modelling Unit Laboratory of Biochemistry BIOMMIQS  Bioinformatics for comparative analysis of genomic sequences  Protein Structure Prediction  In silico tools to assist in experimental Protein Engineering  Simulation of small and macro molecules conformational dynamics
  • 33. TAC’2015 30/juny/2015 Xevi Biarnés @xevibiarnes Càlcul i anàlisi de dades massiu per al disseny d’enzims amb aplicacions biotecnològiques Xevi Biarnés @xevibiarnes xevi.biarnes@iqs.edu Departament de Bioenginyeria IQS School of Engineering Jornada TAC’2015 Mataró, 30 de juny de 2015