SlideShare a Scribd company logo
Algorithmic approach to Computational
Biology using Graphs
Submitted by
S P Sajjan
Research Guide
Dr. Ishwar BaidariMCA,Ph. D.
Dept. of Computer Science
Karnatak University, Dharwad.
What is Computational Biology?
"Computational biology is not a "field", but an "approach" involving
the use of computers to study biological processes and hence it is an area as
diverse as biology itself."
• Biological data
Biological data are data or measurements collected from
biological sources,
which are often stored or exchanged in a digital form.
Biological data are commonly stored in files or databases.
Ex : DNA sequences, and population data used in ecology.
• Functional molecules
In organic chemistry, functional groups are specific groups
of atoms or bonds within molecules that are responsible for the
characteristic chemical reactions of those molecules.
• Mining in molecular biology
Text-mining in molecular biology is defined as the
automatic extraction of information about genes, proteins and
their functional relationships from text documents.
Ex: Information science, Bioinformatics and Computational
linguistics.
• Defining Metabolism
The term, 'Metabolism' refers to biochemical processes
that happen within a person or living organism.
Metabolism is something that consists of both,’
Catabolism,' and, 'Anabolism;' which are the buildup and
breakdown of substances.
Cellular networks
• Interacting molecular sets
within cells.
• It includes mainly p-p
interactions, metabolism, gene
transcriptional regulatory
networks and signal
transduction pathways.
• All of them are different subsets
of a single large-scale cellular
network, since they are
eventually cross-linked.
Purpose of Computational Biology
• Computational Biology can be summarized as the field
utilizing high throughput technology and computation to study
complex organizational patterns of biological systems and
how they contribute to the normal physiology and disease.
• Experimental systems biology uses various
genomics/proteomics.
• Large number of genes or proteins at a genome scale, which
naturally yields a large volume of data to be interpreted and
put within the context of real biology.
• There are several nation-wide large projects aiming at
characterizing the genome and proteome of different (e.g
cancer) cells.
• Billions of dollars are spending into this research that spans
many of the top institutions across the nation.
• Classical molecular biology has mainly focused on gene or
molecular centric research,
• 30-40 years of this research led to our realization of the
incredible complexity of biological systems.
• we need more global experimental approaches and equally as
importantly.
Relevance of the study and present status
Issues Related to Computational Biology
• ~22,000 noted Human genes in Sequence
• ~60,000 known protein-protein interactions in human
• Millions of indirect relationships between genes
• Typical genomic experiment: millions of data points
Statement of Research Problem
• The theory of complex networks plays an important role in a
wide variety of disciplines, ranging from communication to
molecular and population biology.
• The focus of this Research is on graph theory methods for
computational biology.
• We will survey methods and approaches in graph theory,
along with current applications in biomedical informatics.
• Within the fields of Biology and Medicine, potential
applications of network analysis by using graph theory
including identifying drug targets, determining the role of
proteins or genes of unknown function.
• There are several biological domains where graph theory
techniques are applied for knowledge extraction from data.
We have classified these problems as follows.
• Modeling methods of bio-molecular networks such as protein
interaction networks, metabolic networks, as well as
transcriptional regulatory networks.
• Measurement of centrality and importance in bio-molecular
networks. To identify the most important nodes in a large
complex network is of fundamental importance in
computational biology.
• We will introduce several researches that applied centrality
measures to identify structurally important genes or proteins
identified in this way.
• Mining new pathways from bio-molecular networks.
• Experimental validation of identification of the pathway in
different organisms is requires huge amounts of time and effort.
• Thus, there is a need for Graph theory tools help scientists predict
pathways in bio-molecular networks.
• Our primary goal in the present Research is to provide as broad a
survey as possible of the major advances made in this field.
Moreover, we also highlight what has been achieved as well as
some of the most significant open issues that need to be addressed.
• Finally, we hope that this Research will serve as a useful
introduction to the field for those unfamiliar with the literature.
The concept of Graph theory
• Graph: A graph G consists of a set of vertices V(G) and set of
edges E(G).
• Simple Graph: In simple graph, two of the vertices in G are
linked if there exits an edge (𝑉𝑖, 𝑉𝑗) ∈E(G). connecting the
vertices and in graph G such that 𝑉𝑖 ∈V(G) and 𝑉𝑗 ∈V(G).
• Undirected Graph : An undirected graph is graph, i.e., a set of
objects (called vertices or nodes) that are connected together,
where all the edges are bidirectional. An undirected graph is
sometimes called an undirected network.
• Directed Graph: A directed graph is graph, i.e., a set of objects
(called vertices or nodes) that are connected together, where all
the edges are directed from one vertex to another. A directed
graph is sometimes called a digraph or a directed network.
Modeling of Bio-molecular networks in
Graph
• In Biology, Transcriptional regulatory networks and metabolic
networks would usually be modeled as directed graphs.
• For instance, in a Transcriptional regulatory network, nodes
represent genes with edges denoting the Transcriptional
relationship between them.
• In recent years, attentions have been focused on the protein-
protein interaction networks of various simple organisms. These
networks describe the direct physical interaction between the
proteins in an organism’s proteome and there is no direction
associated with the interactions in such networks.
• Hence, PPI networks are typically modeled as undirected
graphs, in which nodes represent protein and edges represent
interaction.
Computational Limitations
• The challenges of computational biology are enormous, and may exceed
the expected increases in computing capability. Several years ago the
computational power of “state-of-the-art parallel supercomputers”
allowed highly predictive calculations treating only hundreds of atoms for
time scales of picoseconds, while molecular dynamics calculations of tens
of thousands of atoms for nanoseconds were becoming common, although
they were some what less predictive.
• A straightforward application of Moore’s Law would predict an increase
of about three – four doublings in capability in the intervening five or six
years.
• Using current methodologies, achieving the desired level of computation
would represent an increase of greater than ~109 times in computing
power.
• It must be noted that even an increase of ~109 in computing power would
only provide the ability to simulate certain cellular systems, and may not
provide a means to predictively model whole cells, organs or organisms.
Algorithmic approach to computational biology using graphs

More Related Content

What's hot

BLAST
BLASTBLAST
Protein Databases
Protein DatabasesProtein Databases
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
KAUSHAL SAHU
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformaticsnadeem akhter
 
Fasta
FastaFasta
Chou fasman algorithm for protein structure prediction
Chou fasman algorithm for protein structure predictionChou fasman algorithm for protein structure prediction
Chou fasman algorithm for protein structure prediction
Roshan Karunarathna
 
European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)
Hafiz Muhammad Zeeshan Raza
 
Cath
CathCath
Cath
Ramya S
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological database
KAUSHAL SAHU
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
Mariya Raju
 
Data retrieval
Data retrievalData retrieval
Database Searching
Database SearchingDatabase Searching
Database Searching
Meghaj Mallick
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
Zeeshan Hanjra
 
Sequence file formats
Sequence file formatsSequence file formats
Sequence file formats
Alphonsa Joseph
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
Asad Afridi
 
Genome Database Systems
Genome Database Systems Genome Database Systems
Genome Database Systems
Harindu Chathuranga Korala
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
Saramita De Chakravarti
 
BLAST
BLASTBLAST
BLAST
Rabia W.
 
Swiss pdb viewer
Swiss pdb viewerSwiss pdb viewer
Swiss pdb viewer
Vidya Kalaivani Rajkumar
 

What's hot (20)

BLAST
BLASTBLAST
BLAST
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
 
Fasta
FastaFasta
Fasta
 
Chou fasman algorithm for protein structure prediction
Chou fasman algorithm for protein structure predictionChou fasman algorithm for protein structure prediction
Chou fasman algorithm for protein structure prediction
 
European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)
 
Cath
CathCath
Cath
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological database
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Data retrieval
Data retrievalData retrieval
Data retrieval
 
Database Searching
Database SearchingDatabase Searching
Database Searching
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Sequence file formats
Sequence file formatsSequence file formats
Sequence file formats
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Genome Database Systems
Genome Database Systems Genome Database Systems
Genome Database Systems
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Data Retrieval Systems
Data Retrieval SystemsData Retrieval Systems
Data Retrieval Systems
 
BLAST
BLASTBLAST
BLAST
 
Swiss pdb viewer
Swiss pdb viewerSwiss pdb viewer
Swiss pdb viewer
 

Similar to Algorithmic approach to computational biology using graphs

Introduction to graph databases: Neo4j and Cypher
Introduction to graph databases: Neo4j and CypherIntroduction to graph databases: Neo4j and Cypher
Introduction to graph databases: Neo4j and Cypher
Anjani Dhrangadhariya
 
BASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxBASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptx
DevaprasadPanda
 
System Biology and Pathway Network.pptx
System Biology and Pathway Network.pptxSystem Biology and Pathway Network.pptx
System Biology and Pathway Network.pptx
ssuserecbdb6
 
Introduction to Biology for Engineers.pptx
Introduction to Biology for Engineers.pptxIntroduction to Biology for Engineers.pptx
Introduction to Biology for Engineers.pptx
Dr. G Shanmugavel
 
Basics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxBasics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptx
Mohdkaifkhan18
 
Bioinformatics—an introduction for computer scientists
Bioinformatics—an introduction for computer scientistsBioinformatics—an introduction for computer scientists
Bioinformatics—an introduction for computer scientistsunyil96
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
MSCW Mysore
 
BioInformatics Software
BioInformatics SoftwareBioInformatics Software
BioInformatics Software
university of education,Lahore
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Amna Jalil
 
System biology and its tools
System biology and its toolsSystem biology and its tools
System biology and its tools
Gaurav Diwakar
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Vidya Kalaivani Rajkumar
 
Session ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcSession ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcUSD Bioinformatics
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
breenaawan
 
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
COMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICSCOMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICS
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
naazmohd2
 
Genomics types
Genomics typesGenomics types
Genome data management
Genome data managementGenome data management
Genome data management
Shareb Ismaeel
 
introduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxintroduction to bioinfromatics.pptx
introduction to bioinfromatics.pptx
AbelPhilipJoseph
 
Network Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systemsNetwork Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systems
Ganesh Bagler
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020
Alexander Pico
 

Similar to Algorithmic approach to computational biology using graphs (20)

Introduction to graph databases: Neo4j and Cypher
Introduction to graph databases: Neo4j and CypherIntroduction to graph databases: Neo4j and Cypher
Introduction to graph databases: Neo4j and Cypher
 
BASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptxBASIC OF BIOINFORMATICS.pptx
BASIC OF BIOINFORMATICS.pptx
 
System Biology and Pathway Network.pptx
System Biology and Pathway Network.pptxSystem Biology and Pathway Network.pptx
System Biology and Pathway Network.pptx
 
Introduction to Biology for Engineers.pptx
Introduction to Biology for Engineers.pptxIntroduction to Biology for Engineers.pptx
Introduction to Biology for Engineers.pptx
 
Basics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptxBasics Of Bioinformatics .pptx
Basics Of Bioinformatics .pptx
 
Bioinformatics—an introduction for computer scientists
Bioinformatics—an introduction for computer scientistsBioinformatics—an introduction for computer scientists
Bioinformatics—an introduction for computer scientists
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
BioInformatics Software
BioInformatics SoftwareBioInformatics Software
BioInformatics Software
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
System biology and its tools
System biology and its toolsSystem biology and its tools
System biology and its tools
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Session ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcSession ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmc
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
 
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
COMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICSCOMPUTER SIMULATIONS INPHARMACOKINETICS ANDPHARMACODYNAMICS
COMPUTER SIMULATIONS IN PHARMACOKINETICS AND PHARMACODYNAMICS
 
Genomics types
Genomics typesGenomics types
Genomics types
 
Genome data management
Genome data managementGenome data management
Genome data management
 
introduction to bioinfromatics.pptx
introduction to bioinfromatics.pptxintroduction to bioinfromatics.pptx
introduction to bioinfromatics.pptx
 
Network Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systemsNetwork Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systems
 
Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020Overall Vision for NRNB: 2015-2020
Overall Vision for NRNB: 2015-2020
 

Recently uploaded

GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 

Recently uploaded (20)

GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 

Algorithmic approach to computational biology using graphs

  • 1. Algorithmic approach to Computational Biology using Graphs Submitted by S P Sajjan Research Guide Dr. Ishwar BaidariMCA,Ph. D. Dept. of Computer Science Karnatak University, Dharwad.
  • 2. What is Computational Biology? "Computational biology is not a "field", but an "approach" involving the use of computers to study biological processes and hence it is an area as diverse as biology itself."
  • 3. • Biological data Biological data are data or measurements collected from biological sources, which are often stored or exchanged in a digital form. Biological data are commonly stored in files or databases. Ex : DNA sequences, and population data used in ecology. • Functional molecules In organic chemistry, functional groups are specific groups of atoms or bonds within molecules that are responsible for the characteristic chemical reactions of those molecules.
  • 4. • Mining in molecular biology Text-mining in molecular biology is defined as the automatic extraction of information about genes, proteins and their functional relationships from text documents. Ex: Information science, Bioinformatics and Computational linguistics. • Defining Metabolism The term, 'Metabolism' refers to biochemical processes that happen within a person or living organism. Metabolism is something that consists of both,’ Catabolism,' and, 'Anabolism;' which are the buildup and breakdown of substances.
  • 5. Cellular networks • Interacting molecular sets within cells. • It includes mainly p-p interactions, metabolism, gene transcriptional regulatory networks and signal transduction pathways. • All of them are different subsets of a single large-scale cellular network, since they are eventually cross-linked.
  • 6. Purpose of Computational Biology • Computational Biology can be summarized as the field utilizing high throughput technology and computation to study complex organizational patterns of biological systems and how they contribute to the normal physiology and disease. • Experimental systems biology uses various genomics/proteomics. • Large number of genes or proteins at a genome scale, which naturally yields a large volume of data to be interpreted and put within the context of real biology.
  • 7. • There are several nation-wide large projects aiming at characterizing the genome and proteome of different (e.g cancer) cells. • Billions of dollars are spending into this research that spans many of the top institutions across the nation. • Classical molecular biology has mainly focused on gene or molecular centric research, • 30-40 years of this research led to our realization of the incredible complexity of biological systems. • we need more global experimental approaches and equally as importantly. Relevance of the study and present status
  • 8. Issues Related to Computational Biology • ~22,000 noted Human genes in Sequence • ~60,000 known protein-protein interactions in human • Millions of indirect relationships between genes • Typical genomic experiment: millions of data points
  • 9. Statement of Research Problem • The theory of complex networks plays an important role in a wide variety of disciplines, ranging from communication to molecular and population biology. • The focus of this Research is on graph theory methods for computational biology. • We will survey methods and approaches in graph theory, along with current applications in biomedical informatics. • Within the fields of Biology and Medicine, potential applications of network analysis by using graph theory including identifying drug targets, determining the role of proteins or genes of unknown function.
  • 10. • There are several biological domains where graph theory techniques are applied for knowledge extraction from data. We have classified these problems as follows. • Modeling methods of bio-molecular networks such as protein interaction networks, metabolic networks, as well as transcriptional regulatory networks. • Measurement of centrality and importance in bio-molecular networks. To identify the most important nodes in a large complex network is of fundamental importance in computational biology. • We will introduce several researches that applied centrality measures to identify structurally important genes or proteins identified in this way.
  • 11. • Mining new pathways from bio-molecular networks. • Experimental validation of identification of the pathway in different organisms is requires huge amounts of time and effort. • Thus, there is a need for Graph theory tools help scientists predict pathways in bio-molecular networks. • Our primary goal in the present Research is to provide as broad a survey as possible of the major advances made in this field. Moreover, we also highlight what has been achieved as well as some of the most significant open issues that need to be addressed. • Finally, we hope that this Research will serve as a useful introduction to the field for those unfamiliar with the literature.
  • 12. The concept of Graph theory • Graph: A graph G consists of a set of vertices V(G) and set of edges E(G). • Simple Graph: In simple graph, two of the vertices in G are linked if there exits an edge (𝑉𝑖, 𝑉𝑗) ∈E(G). connecting the vertices and in graph G such that 𝑉𝑖 ∈V(G) and 𝑉𝑗 ∈V(G). • Undirected Graph : An undirected graph is graph, i.e., a set of objects (called vertices or nodes) that are connected together, where all the edges are bidirectional. An undirected graph is sometimes called an undirected network. • Directed Graph: A directed graph is graph, i.e., a set of objects (called vertices or nodes) that are connected together, where all the edges are directed from one vertex to another. A directed graph is sometimes called a digraph or a directed network.
  • 13. Modeling of Bio-molecular networks in Graph • In Biology, Transcriptional regulatory networks and metabolic networks would usually be modeled as directed graphs. • For instance, in a Transcriptional regulatory network, nodes represent genes with edges denoting the Transcriptional relationship between them. • In recent years, attentions have been focused on the protein- protein interaction networks of various simple organisms. These networks describe the direct physical interaction between the proteins in an organism’s proteome and there is no direction associated with the interactions in such networks. • Hence, PPI networks are typically modeled as undirected graphs, in which nodes represent protein and edges represent interaction.
  • 14. Computational Limitations • The challenges of computational biology are enormous, and may exceed the expected increases in computing capability. Several years ago the computational power of “state-of-the-art parallel supercomputers” allowed highly predictive calculations treating only hundreds of atoms for time scales of picoseconds, while molecular dynamics calculations of tens of thousands of atoms for nanoseconds were becoming common, although they were some what less predictive. • A straightforward application of Moore’s Law would predict an increase of about three – four doublings in capability in the intervening five or six years. • Using current methodologies, achieving the desired level of computation would represent an increase of greater than ~109 times in computing power. • It must be noted that even an increase of ~109 in computing power would only provide the ability to simulate certain cellular systems, and may not provide a means to predictively model whole cells, organs or organisms.