SlideShare a Scribd company logo
1 of 10
Cloud Applications
Protein Structure Predication
and
gene expression data analysis
protein structure prediction
• Proteins are chains of amino acids joined
together by peptide bonds.
• Many conformations of this chain are possible
due to the rotation of the chain about each atom.
• Protein structure is these conformational changes
that are responsible for differences in the three
dimensional structure of proteins.
Why we are using cloud computing
• It require high computing capabilities and often
operate on large data- sets that cause extensive
I/O operations.
• Protein structure prediction is a computationally
intensive task that is fundamental to different
types of research in the life sciences
Benefits of protein structure
• Manually 3D structure determination is difficult, slow and
expensive

• Structure helps in the design of new drugs for the
treatment of diseases.
• The geometric structure of a protein cannot be
directly inferred from the sequence of genes that
compose its structure, but it is the result of
complex computations aimed at identifying the
structure that minimizes the required energy.
• In the above figure the web portal enables
scientist not to worry about predictions task, all
work is done by cloud service.
Machines divides the pattern recognition problem
into three phases:
• initialization,
• classification,
• and a final phase.
these phases executes in parallel to reduce the
computational time of the prediction.
The prediction algorithm is then translated into a
task graph that is submitted to Aneka. Once the
task is completed, the middleware makes the
results available for visualization through the
portal.
Gene expression data analysis
• Gene expression profiling is the measurement of
the expression levels of thousands of genes at
once, Consequently, it is widely used for cancer
prediction.
• It is also used in medical diagnosis and drug
design.
Cancer
• Cancer is a disease characterized by uncontrolled
cell growth and proliferation. This behavior occurs
because genes regulating the cell growth mutate.
This means that all the cancerous cells contain
mutated genes.
• These uncontrolled growth develops different
types of tumors, In this context, gene expression
profiling is utilized to provide a more accurate
classification of tumors.
• The dimensionality of typical gene expression
datasets ranges from several thousands to over
tens of thousands of genes
• For these large classification is solved by
eXtended Classifier System(XCS) which has
been successfully utilized for classifying large
datasets.
• Cloud-CoXCS, is a machine learning
classification system for gene expression
datasets on the Cloud infrastructure. It extends
the XCS model by introducing a coevolutionary
approach.
• CoXCS divides the entire search space into sub
domains and employs the standard XCS
algorithm in each of these sub domains.
Working of CoXCS

More Related Content

What's hot

Distributed computing
Distributed computingDistributed computing
Distributed computing
shivli0769
 
OIT552 Cloud Computing - Question Bank
OIT552 Cloud Computing - Question BankOIT552 Cloud Computing - Question Bank
OIT552 Cloud Computing - Question Bank
pkaviya
 
Data security in cloud computing
Data security in cloud computingData security in cloud computing
Data security in cloud computing
Prince Chandu
 

What's hot (20)

software project management Artifact set(spm)
software project management Artifact set(spm)software project management Artifact set(spm)
software project management Artifact set(spm)
 
Model Based Software Architectures
Model Based Software ArchitecturesModel Based Software Architectures
Model Based Software Architectures
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
 
Virtual Machine provisioning and migration services
Virtual Machine provisioning and migration servicesVirtual Machine provisioning and migration services
Virtual Machine provisioning and migration services
 
Face detection and recognition
Face detection and recognitionFace detection and recognition
Face detection and recognition
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Behavioural modelling
Behavioural modellingBehavioural modelling
Behavioural modelling
 
Distributed computing
Distributed computingDistributed computing
Distributed computing
 
Overfitting & Underfitting
Overfitting & UnderfittingOverfitting & Underfitting
Overfitting & Underfitting
 
Testing in multiplatform environment
Testing in multiplatform environmentTesting in multiplatform environment
Testing in multiplatform environment
 
OIT552 Cloud Computing - Question Bank
OIT552 Cloud Computing - Question BankOIT552 Cloud Computing - Question Bank
OIT552 Cloud Computing - Question Bank
 
Cellular wireless network security
Cellular wireless network securityCellular wireless network security
Cellular wireless network security
 
HANDWRITTEN DIGIT RECOGNITIONppt1.pptx
HANDWRITTEN DIGIT RECOGNITIONppt1.pptxHANDWRITTEN DIGIT RECOGNITIONppt1.pptx
HANDWRITTEN DIGIT RECOGNITIONppt1.pptx
 
Project Planning in Software Engineering
Project Planning in Software EngineeringProject Planning in Software Engineering
Project Planning in Software Engineering
 
Collaborating Using Cloud Services
Collaborating Using Cloud ServicesCollaborating Using Cloud Services
Collaborating Using Cloud Services
 
Quality and productivity factors
Quality and productivity factorsQuality and productivity factors
Quality and productivity factors
 
Cloud sim
Cloud simCloud sim
Cloud sim
 
Hit and-miss transform
Hit and-miss transformHit and-miss transform
Hit and-miss transform
 
Reusibility vs Extensibility in OOAD
Reusibility vs Extensibility in OOADReusibility vs Extensibility in OOAD
Reusibility vs Extensibility in OOAD
 
Data security in cloud computing
Data security in cloud computingData security in cloud computing
Data security in cloud computing
 

Viewers also liked

Market oriented Cloud Computing
Market oriented Cloud ComputingMarket oriented Cloud Computing
Market oriented Cloud Computing
Jithin Parakka
 

Viewers also liked (6)

Federation of OpenStack clouds
Federation of OpenStack cloudsFederation of OpenStack clouds
Federation of OpenStack clouds
 
Market oriented Cloud Computing
Market oriented Cloud ComputingMarket oriented Cloud Computing
Market oriented Cloud Computing
 
Satellite image processing
Satellite image processingSatellite image processing
Satellite image processing
 
Social Media, Cloud Computing and architecture
Social Media, Cloud Computing and architectureSocial Media, Cloud Computing and architecture
Social Media, Cloud Computing and architecture
 
Task programming
Task programmingTask programming
Task programming
 
Social Cloud: Cloud Computing in Social Networks
Social Cloud: Cloud Computing in Social NetworksSocial Cloud: Cloud Computing in Social Networks
Social Cloud: Cloud Computing in Social Networks
 

Similar to Cloud applications - Protein Structure Predication and gene expression data analysis

Biology protein structure
Biology protein structureBiology protein structure
Biology protein structure
gaurav jain
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
DataScienceConferenc1
 
Nitant_Choksi_CAP6545_Presentation_Slides.pptx
Nitant_Choksi_CAP6545_Presentation_Slides.pptxNitant_Choksi_CAP6545_Presentation_Slides.pptx
Nitant_Choksi_CAP6545_Presentation_Slides.pptx
NitantChoksi1
 

Similar to Cloud applications - Protein Structure Predication and gene expression data analysis (20)

Biology protein structure
Biology protein structureBiology protein structure
Biology protein structure
 
DREAM Challenge
DREAM ChallengeDREAM Challenge
DREAM Challenge
 
Genomics types
Genomics typesGenomics types
Genomics types
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Bioinformatics-R program의 실례
Bioinformatics-R program의 실례Bioinformatics-R program의 실례
Bioinformatics-R program의 실례
 
De novo str_prediction
De novo str_predictionDe novo str_prediction
De novo str_prediction
 
AI approaches in healthcare - targeting precise and personalized medicine
AI approaches in healthcare - targeting precise and personalized medicine AI approaches in healthcare - targeting precise and personalized medicine
AI approaches in healthcare - targeting precise and personalized medicine
 
Yeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction StudiesYeast two hybrid system for Protein Protein Interaction Studies
Yeast two hybrid system for Protein Protein Interaction Studies
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Cancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision SystemCancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision System
 
Microarray data Analysis.pptx
Microarray data Analysis.pptxMicroarray data Analysis.pptx
Microarray data Analysis.pptx
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
UNMSymposium2014
UNMSymposium2014UNMSymposium2014
UNMSymposium2014
 
protein Modeling Abi.pptx
protein Modeling Abi.pptxprotein Modeling Abi.pptx
protein Modeling Abi.pptx
 
Nitant_Choksi_CAP6545_Presentation_Slides.pptx
Nitant_Choksi_CAP6545_Presentation_Slides.pptxNitant_Choksi_CAP6545_Presentation_Slides.pptx
Nitant_Choksi_CAP6545_Presentation_Slides.pptx
 
DataMining Techniques in BreastCancer.pptx
DataMining Techniques in BreastCancer.pptxDataMining Techniques in BreastCancer.pptx
DataMining Techniques in BreastCancer.pptx
 
Open Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of CancerOpen Source Networking Solving Molecular Analysis of Cancer
Open Source Networking Solving Molecular Analysis of Cancer
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentation
 
protein design, principles and examples.pptx
protein design, principles and examples.pptxprotein design, principles and examples.pptx
protein design, principles and examples.pptx
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

Cloud applications - Protein Structure Predication and gene expression data analysis

  • 1. Cloud Applications Protein Structure Predication and gene expression data analysis
  • 2. protein structure prediction • Proteins are chains of amino acids joined together by peptide bonds. • Many conformations of this chain are possible due to the rotation of the chain about each atom. • Protein structure is these conformational changes that are responsible for differences in the three dimensional structure of proteins.
  • 3. Why we are using cloud computing • It require high computing capabilities and often operate on large data- sets that cause extensive I/O operations. • Protein structure prediction is a computationally intensive task that is fundamental to different types of research in the life sciences
  • 4. Benefits of protein structure • Manually 3D structure determination is difficult, slow and expensive • Structure helps in the design of new drugs for the treatment of diseases. • The geometric structure of a protein cannot be directly inferred from the sequence of genes that compose its structure, but it is the result of complex computations aimed at identifying the structure that minimizes the required energy.
  • 5.
  • 6. • In the above figure the web portal enables scientist not to worry about predictions task, all work is done by cloud service. Machines divides the pattern recognition problem into three phases: • initialization, • classification, • and a final phase. these phases executes in parallel to reduce the computational time of the prediction. The prediction algorithm is then translated into a task graph that is submitted to Aneka. Once the task is completed, the middleware makes the results available for visualization through the portal.
  • 7. Gene expression data analysis • Gene expression profiling is the measurement of the expression levels of thousands of genes at once, Consequently, it is widely used for cancer prediction. • It is also used in medical diagnosis and drug design.
  • 8. Cancer • Cancer is a disease characterized by uncontrolled cell growth and proliferation. This behavior occurs because genes regulating the cell growth mutate. This means that all the cancerous cells contain mutated genes. • These uncontrolled growth develops different types of tumors, In this context, gene expression profiling is utilized to provide a more accurate classification of tumors. • The dimensionality of typical gene expression datasets ranges from several thousands to over tens of thousands of genes
  • 9. • For these large classification is solved by eXtended Classifier System(XCS) which has been successfully utilized for classifying large datasets. • Cloud-CoXCS, is a machine learning classification system for gene expression datasets on the Cloud infrastructure. It extends the XCS model by introducing a coevolutionary approach. • CoXCS divides the entire search space into sub domains and employs the standard XCS algorithm in each of these sub domains.