SlideShare a Scribd company logo
1 of 15
Algorithmic approach to Computational
Biology using Graphs
Submitted by
S P Sajjan
Research Guide
Dr. Ishwar BaidariMCA,Ph. D.
Dept. of Computer Science
Karnatak University, Dharwad.
What is Computational Biology?
"Computational biology is not a "field", but an "approach" involving
the use of computers to study biological processes and hence it is an area as
diverse as biology itself."
• Biological data
Biological data are data or measurements collected from
biological sources,
which are often stored or exchanged in a digital form.
Biological data are commonly stored in files or databases.
Ex : DNA sequences, and population data used in ecology.
• Functional molecules
In organic chemistry, functional groups are specific groups
of atoms or bonds within molecules that are responsible for the
characteristic chemical reactions of those molecules.
• Mining in molecular biology
Text-mining in molecular biology is defined as the
automatic extraction of information about genes, proteins and
their functional relationships from text documents.
Ex: Information science, Bioinformatics and Computational
linguistics.
• Defining Metabolism
The term, 'Metabolism' refers to biochemical processes
that happen within a person or living organism.
Metabolism is something that consists of both,’
Catabolism,' and, 'Anabolism;' which are the buildup and
breakdown of substances.
Cellular networks
• Interacting molecular sets
within cells.
• It includes mainly p-p
interactions, metabolism, gene
transcriptional regulatory
networks and signal
transduction pathways.
• All of them are different subsets
of a single large-scale cellular
network, since they are
eventually cross-linked.
Purpose of Computational Biology
• Computational Biology can be summarized as the field
utilizing high throughput technology and computation to study
complex organizational patterns of biological systems and
how they contribute to the normal physiology and disease.
• Experimental systems biology uses various
genomics/proteomics.
• Large number of genes or proteins at a genome scale, which
naturally yields a large volume of data to be interpreted and
put within the context of real biology.
• There are several nation-wide large projects aiming at
characterizing the genome and proteome of different (e.g
cancer) cells.
• Billions of dollars are spending into this research that spans
many of the top institutions across the nation.
• Classical molecular biology has mainly focused on gene or
molecular centric research,
• 30-40 years of this research led to our realization of the
incredible complexity of biological systems.
• we need more global experimental approaches and equally as
importantly.
Relevance of the study and present status
Issues Related to Computational Biology
• ~22,000 noted Human genes in Sequence
• ~60,000 known protein-protein interactions in human
• Millions of indirect relationships between genes
• Typical genomic experiment: millions of data points
Statement of Research Problem
• The theory of complex networks plays an important role in a
wide variety of disciplines, ranging from communication to
molecular and population biology.
• The focus of this Research is on graph theory methods for
computational biology.
• We will survey methods and approaches in graph theory,
along with current applications in biomedical informatics.
• Within the fields of Biology and Medicine, potential
applications of network analysis by using graph theory
including identifying drug targets, determining the role of
proteins or genes of unknown function.
• There are several biological domains where graph theory
techniques are applied for knowledge extraction from data.
We have classified these problems as follows.
• Modeling methods of bio-molecular networks such as protein
interaction networks, metabolic networks, as well as
transcriptional regulatory networks.
• Measurement of centrality and importance in bio-molecular
networks. To identify the most important nodes in a large
complex network is of fundamental importance in
computational biology.
• We will introduce several researches that applied centrality
measures to identify structurally important genes or proteins
identified in this way.
• Mining new pathways from bio-molecular networks.
• Experimental validation of identification of the pathway in
different organisms is requires huge amounts of time and effort.
• Thus, there is a need for Graph theory tools help scientists predict
pathways in bio-molecular networks.
• Our primary goal in the present Research is to provide as broad a
survey as possible of the major advances made in this field.
Moreover, we also highlight what has been achieved as well as
some of the most significant open issues that need to be addressed.
• Finally, we hope that this Research will serve as a useful
introduction to the field for those unfamiliar with the literature.
The concept of Graph theory
• Graph: A graph G consists of a set of vertices V(G) and set of
edges E(G).
• Simple Graph: In simple graph, two of the vertices in G are
linked if there exits an edge (𝑉𝑖, 𝑉𝑗) ∈E(G). connecting the
vertices and in graph G such that 𝑉𝑖 ∈V(G) and 𝑉𝑗 ∈V(G).
• Undirected Graph : An undirected graph is graph, i.e., a set of
objects (called vertices or nodes) that are connected together,
where all the edges are bidirectional. An undirected graph is
sometimes called an undirected network.
• Directed Graph: A directed graph is graph, i.e., a set of objects
(called vertices or nodes) that are connected together, where all
the edges are directed from one vertex to another. A directed
graph is sometimes called a digraph or a directed network.
Modeling of Bio-molecular networks in
Graph
• In Biology, Transcriptional regulatory networks and metabolic
networks would usually be modeled as directed graphs.
• For instance, in a Transcriptional regulatory network, nodes
represent genes with edges denoting the Transcriptional
relationship between them.
• In recent years, attentions have been focused on the protein-
protein interaction networks of various simple organisms. These
networks describe the direct physical interaction between the
proteins in an organism’s proteome and there is no direction
associated with the interactions in such networks.
• Hence, PPI networks are typically modeled as undirected
graphs, in which nodes represent protein and edges represent
interaction.
Computational Limitations
• The challenges of computational biology are enormous, and may exceed
the expected increases in computing capability. Several years ago the
computational power of “state-of-the-art parallel supercomputers”
allowed highly predictive calculations treating only hundreds of atoms for
time scales of picoseconds, while molecular dynamics calculations of tens
of thousands of atoms for nanoseconds were becoming common, although
they were some what less predictive.
• A straightforward application of Moore’s Law would predict an increase
of about three – four doublings in capability in the intervening five or six
years.
• Using current methodologies, achieving the desired level of computation
would represent an increase of greater than ~109 times in computing
power.
• It must be noted that even an increase of ~109 in computing power would
only provide the ability to simulate certain cellular systems, and may not
provide a means to predictively model whole cells, organs or organisms.
Algorithmic approach to computational biology using graphs

More Related Content

What's hot

Biological databases
Biological databasesBiological databases
Biological databasesAfra Fathima
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsElena Sügis
 
Animal cell culture in Biopharmaceutical Industry in the Production of Therap...
Animal cell culture in Biopharmaceutical Industry in the Production of Therap...Animal cell culture in Biopharmaceutical Industry in the Production of Therap...
Animal cell culture in Biopharmaceutical Industry in the Production of Therap...Shubham Chinchulkar
 
Databases short nucletide polymorphism
Databases short nucletide polymorphismDatabases short nucletide polymorphism
Databases short nucletide polymorphismIram Wains
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesMelanie Courtot
 
PAM : Point Accepted Mutation
PAM : Point Accepted MutationPAM : Point Accepted Mutation
PAM : Point Accepted MutationAmit Kyada
 
Overview of methods for variant calling from next-generation sequence data
Overview of methods for variant calling from next-generation sequence dataOverview of methods for variant calling from next-generation sequence data
Overview of methods for variant calling from next-generation sequence dataThomas Keane
 
Structure analysis of protein
Structure analysis of proteinStructure analysis of protein
Structure analysis of proteinKAUSHAL SAHU
 

What's hot (20)

(Expasy)
(Expasy)(Expasy)
(Expasy)
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Structure alignment methods
Structure alignment methodsStructure alignment methods
Structure alignment methods
 
Ppi
PpiPpi
Ppi
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 
Animal cell culture in Biopharmaceutical Industry in the Production of Therap...
Animal cell culture in Biopharmaceutical Industry in the Production of Therap...Animal cell culture in Biopharmaceutical Industry in the Production of Therap...
Animal cell culture in Biopharmaceutical Industry in the Production of Therap...
 
Protein purification
Protein purification Protein purification
Protein purification
 
Databases short nucletide polymorphism
Databases short nucletide polymorphismDatabases short nucletide polymorphism
Databases short nucletide polymorphism
 
Molecular modeling database
Molecular modeling database Molecular modeling database
Molecular modeling database
 
Histidine Operon
Histidine OperonHistidine Operon
Histidine Operon
 
Proteomics
ProteomicsProteomics
Proteomics
 
The Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resourcesThe Gene Ontology & Gene Ontology Annotation resources
The Gene Ontology & Gene Ontology Annotation resources
 
Metabolomics
MetabolomicsMetabolomics
Metabolomics
 
Metabolomics
MetabolomicsMetabolomics
Metabolomics
 
PAM : Point Accepted Mutation
PAM : Point Accepted MutationPAM : Point Accepted Mutation
PAM : Point Accepted Mutation
 
Overview of methods for variant calling from next-generation sequence data
Overview of methods for variant calling from next-generation sequence dataOverview of methods for variant calling from next-generation sequence data
Overview of methods for variant calling from next-generation sequence data
 
dot plot analysis
dot plot analysisdot plot analysis
dot plot analysis
 
Structure analysis of protein
Structure analysis of proteinStructure analysis of protein
Structure analysis of protein
 
GENOMICS
GENOMICSGENOMICS
GENOMICS
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 

Similar to Algorithmic approach to computational biology using graphs

Introduction to graph databases: Neo4j and Cypher
Introduction to graph databases: Neo4j and CypherIntroduction to graph databases: Neo4j and Cypher
Introduction to graph databases: Neo4j and CypherAnjani Dhrangadhariya
 
BASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxBASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxDevaprasadPanda
 
System Biology and Pathway Network.pptx
System Biology and Pathway Network.pptxSystem Biology and Pathway Network.pptx
System Biology and Pathway Network.pptxssuserecbdb6
 
Introduction to Biology for Engineers.pptx
Introduction to Biology for Engineers.pptxIntroduction to Biology for Engineers.pptx
Introduction to Biology for Engineers.pptxDr. G Shanmugavel
 
Basics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxBasics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxMohdkaifkhan18
 
Bioinformatics—an introduction for computer scientists
Bioinformatics—an introduction for computer scientistsBioinformatics—an introduction for computer scientists
Bioinformatics—an introduction for computer scientistsunyil96
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSMSCW Mysore
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsAmna Jalil
 
System biology and its tools
System biology and its toolsSystem biology and its tools
System biology and its toolsGaurav Diwakar
 
Session ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcSession ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcUSD Bioinformatics
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptxbreenaawan
 
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
COMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICSCOMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICS
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICSnaazmohd2
 
Genome data management
Genome data managementGenome data management
Genome data managementShareb Ismaeel
 
introduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxintroduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxAbelPhilipJoseph
 
Network Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systemsNetwork Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systemsGanesh Bagler
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Alexander Pico
 

Similar to Algorithmic approach to computational biology using graphs (20)

Introduction to graph databases: Neo4j and Cypher
Introduction to graph databases: Neo4j and CypherIntroduction to graph databases: Neo4j and Cypher
Introduction to graph databases: Neo4j and Cypher
 
BASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxBASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptx
 
System Biology and Pathway Network.pptx
System Biology and Pathway Network.pptxSystem Biology and Pathway Network.pptx
System Biology and Pathway Network.pptx
 
Introduction to Biology for Engineers.pptx
Introduction to Biology for Engineers.pptxIntroduction to Biology for Engineers.pptx
Introduction to Biology for Engineers.pptx
 
Basics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxBasics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptx
 
Bioinformatics—an introduction for computer scientists
Bioinformatics—an introduction for computer scientistsBioinformatics—an introduction for computer scientists
Bioinformatics—an introduction for computer scientists
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
BioInformatics Software
BioInformatics SoftwareBioInformatics Software
BioInformatics Software
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
System biology and its tools
System biology and its toolsSystem biology and its tools
System biology and its tools
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Session ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcSession ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmc
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
 
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
COMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICSCOMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICS
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
 
Genomics types
Genomics typesGenomics types
Genomics types
 
Genome data management
Genome data managementGenome data management
Genome data management
 
introduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxintroduction to bioinfromatics.pptx
introduction to bioinfromatics.pptx
 
Network Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systemsNetwork Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systems
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020
 

Recently uploaded

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 

Recently uploaded (20)

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 

Algorithmic approach to computational biology using graphs

  • 1. Algorithmic approach to Computational Biology using Graphs Submitted by S P Sajjan Research Guide Dr. Ishwar BaidariMCA,Ph. D. Dept. of Computer Science Karnatak University, Dharwad.
  • 2. What is Computational Biology? "Computational biology is not a "field", but an "approach" involving the use of computers to study biological processes and hence it is an area as diverse as biology itself."
  • 3. • Biological data Biological data are data or measurements collected from biological sources, which are often stored or exchanged in a digital form. Biological data are commonly stored in files or databases. Ex : DNA sequences, and population data used in ecology. • Functional molecules In organic chemistry, functional groups are specific groups of atoms or bonds within molecules that are responsible for the characteristic chemical reactions of those molecules.
  • 4. • Mining in molecular biology Text-mining in molecular biology is defined as the automatic extraction of information about genes, proteins and their functional relationships from text documents. Ex: Information science, Bioinformatics and Computational linguistics. • Defining Metabolism The term, 'Metabolism' refers to biochemical processes that happen within a person or living organism. Metabolism is something that consists of both,’ Catabolism,' and, 'Anabolism;' which are the buildup and breakdown of substances.
  • 5. Cellular networks • Interacting molecular sets within cells. • It includes mainly p-p interactions, metabolism, gene transcriptional regulatory networks and signal transduction pathways. • All of them are different subsets of a single large-scale cellular network, since they are eventually cross-linked.
  • 6. Purpose of Computational Biology • Computational Biology can be summarized as the field utilizing high throughput technology and computation to study complex organizational patterns of biological systems and how they contribute to the normal physiology and disease. • Experimental systems biology uses various genomics/proteomics. • Large number of genes or proteins at a genome scale, which naturally yields a large volume of data to be interpreted and put within the context of real biology.
  • 7. • There are several nation-wide large projects aiming at characterizing the genome and proteome of different (e.g cancer) cells. • Billions of dollars are spending into this research that spans many of the top institutions across the nation. • Classical molecular biology has mainly focused on gene or molecular centric research, • 30-40 years of this research led to our realization of the incredible complexity of biological systems. • we need more global experimental approaches and equally as importantly. Relevance of the study and present status
  • 8. Issues Related to Computational Biology • ~22,000 noted Human genes in Sequence • ~60,000 known protein-protein interactions in human • Millions of indirect relationships between genes • Typical genomic experiment: millions of data points
  • 9. Statement of Research Problem • The theory of complex networks plays an important role in a wide variety of disciplines, ranging from communication to molecular and population biology. • The focus of this Research is on graph theory methods for computational biology. • We will survey methods and approaches in graph theory, along with current applications in biomedical informatics. • Within the fields of Biology and Medicine, potential applications of network analysis by using graph theory including identifying drug targets, determining the role of proteins or genes of unknown function.
  • 10. • There are several biological domains where graph theory techniques are applied for knowledge extraction from data. We have classified these problems as follows. • Modeling methods of bio-molecular networks such as protein interaction networks, metabolic networks, as well as transcriptional regulatory networks. • Measurement of centrality and importance in bio-molecular networks. To identify the most important nodes in a large complex network is of fundamental importance in computational biology. • We will introduce several researches that applied centrality measures to identify structurally important genes or proteins identified in this way.
  • 11. • Mining new pathways from bio-molecular networks. • Experimental validation of identification of the pathway in different organisms is requires huge amounts of time and effort. • Thus, there is a need for Graph theory tools help scientists predict pathways in bio-molecular networks. • Our primary goal in the present Research is to provide as broad a survey as possible of the major advances made in this field. Moreover, we also highlight what has been achieved as well as some of the most significant open issues that need to be addressed. • Finally, we hope that this Research will serve as a useful introduction to the field for those unfamiliar with the literature.
  • 12. The concept of Graph theory • Graph: A graph G consists of a set of vertices V(G) and set of edges E(G). • Simple Graph: In simple graph, two of the vertices in G are linked if there exits an edge (𝑉𝑖, 𝑉𝑗) ∈E(G). connecting the vertices and in graph G such that 𝑉𝑖 ∈V(G) and 𝑉𝑗 ∈V(G). • Undirected Graph : An undirected graph is graph, i.e., a set of objects (called vertices or nodes) that are connected together, where all the edges are bidirectional. An undirected graph is sometimes called an undirected network. • Directed Graph: A directed graph is graph, i.e., a set of objects (called vertices or nodes) that are connected together, where all the edges are directed from one vertex to another. A directed graph is sometimes called a digraph or a directed network.
  • 13. Modeling of Bio-molecular networks in Graph • In Biology, Transcriptional regulatory networks and metabolic networks would usually be modeled as directed graphs. • For instance, in a Transcriptional regulatory network, nodes represent genes with edges denoting the Transcriptional relationship between them. • In recent years, attentions have been focused on the protein- protein interaction networks of various simple organisms. These networks describe the direct physical interaction between the proteins in an organism’s proteome and there is no direction associated with the interactions in such networks. • Hence, PPI networks are typically modeled as undirected graphs, in which nodes represent protein and edges represent interaction.
  • 14. Computational Limitations • The challenges of computational biology are enormous, and may exceed the expected increases in computing capability. Several years ago the computational power of “state-of-the-art parallel supercomputers” allowed highly predictive calculations treating only hundreds of atoms for time scales of picoseconds, while molecular dynamics calculations of tens of thousands of atoms for nanoseconds were becoming common, although they were some what less predictive. • A straightforward application of Moore’s Law would predict an increase of about three – four doublings in capability in the intervening five or six years. • Using current methodologies, achieving the desired level of computation would represent an increase of greater than ~109 times in computing power. • It must be noted that even an increase of ~109 in computing power would only provide the ability to simulate certain cellular systems, and may not provide a means to predictively model whole cells, organs or organisms.