INSTITUTE OF INFORMATION TECHNOLOGY (IIT), UNIVERSITY OF DHAKA
           A High-Throughput
        Bioinformatics Distributed
           Computing Platform



19-09-2012                                                             1

                   Presented by-


                     Md. Habibur Rahman
                            BIT 0216
             Institute of Information Technology
                      University of Dhaka
                           Bangladesh


The contributors of the paper




 Thomas M. Keane, Andrew J. Page, James O. McInerney,
              and Thomas J. Naughton

       Bioinformatics and Pharmacogenomics Laboratory,
      National University of Ireland, Maynooth, Co. Kildare,
                               Ireland

   Department of Computer Science, National University of
          Ireland, Maynooth, Co. Kildare, Ireland


              Homepage: http://www.cs.nuim.ie/distibuted

             Publications

18th IEEE Symposium on Computer-Based Medical Systems (CBMS’05)




Suitability of Bioinformatics to Distributed Computing

             - Many bioinformatics problems exhibit a class of algorithmic
               parallelism referred to as coarse-grained parallelism.
             - They have a high compute-to-data ratio.
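To make the idea of coarse-grained parallelism concrete, here is a minimal sketch in Python (used only for brevity; the platform described in these slides is written in Java). The function names, the toy "search" and the sample data are all assumptions made for illustration, not taken from the paper. Threads stand in for donor machines: each task works on its own chunk and never talks to another task, so only the small per-chunk results need to be combined.

```python
from concurrent.futures import ThreadPoolExecutor


def score_chunk(chunk, query):
    """Toy stand-in for an expensive search: count sequences containing the query."""
    return sum(1 for seq in chunk if query in seq)


def coarse_grained_count(chunks, query, workers=4):
    """Run one independent task per chunk and combine the partial results."""
    # No task needs data produced by another task, which is what makes
    # the parallelism coarse-grained: there is no communication while running.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial = list(pool.map(lambda c: score_chunk(c, query), chunks))
    return sum(partial)


# Hypothetical data: three chunks of short DNA strings.
chunks = [["ACGT", "TTGA"], ["GGAC", "ACGA"], ["CCCC"]]
total = coarse_grained_count(chunks, "AC")
```

In a real deployment the work done per chunk would be large compared with the data shipped to it, which is what a high compute-to-data ratio refers to.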




Topic and Problem Overview




            - Demand for high-performance computing has increased
              dramatically in bioinformatics due to the rapid
              increase in the size of genomic databases.
            - Traditional database search algorithms cannot perform a
              full search of a large database in a reasonable time.
            - Heuristic algorithms are feasible, but they reduce the
              sensitivity of the search.
            - In evolutionary biology, phylogenetic tree building
              likewise relies on greedy heuristic algorithms.




                Proposed solution
o    According to the authors of the paper:

     “We present a general-purpose programmable distributed computing
     platform suitable for deployment in a typical university environment
     where many semi-idle desktop PC’s are connected via a network.”

o    The system is fully cross-platform.
o    Two distributed bioinformatics applications are built on it:
                 i) DSEARCH
                 ii) DPRml
Proposed solution (cont.)




       o     Java distributed computing platform

             - Client-server model

             - The server controls the resources (database, algorithm
               or computer hardware)

             - The platform is divided into three separate pieces of
               software: server, client and remote interface.
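The division of labour between the three pieces can be sketched as follows. This is an in-process illustration in Python under stated assumptions, not the platform's actual Java API: the class and method names (`Server`, `Client`, `submit`, `next_unit`, `report`) are hypothetical, and the remote interface is assumed here to be the piece through which a user submits a problem and reads back results.

```python
from collections import deque
from itertools import count


class Server:
    """Holds the problem and hands out independent work units (hypothetical API)."""

    def __init__(self):
        self.pending = deque()
        self.results = {}
        self._ids = count()

    def submit(self, payloads):
        # Assumed entry point for the remote interface: queue the work units.
        for payload in payloads:
            self.pending.append((next(self._ids), payload))

    def next_unit(self):
        # Called by a client; returns None when no work is left.
        return self.pending.popleft() if self.pending else None

    def report(self, unit_id, result):
        self.results[unit_id] = result


class Client:
    """Donor machine: pulls a unit, computes on it, sends the result back."""

    def __init__(self, server, compute):
        self.server = server
        self.compute = compute

    def run(self):
        done = 0
        while (unit := self.server.next_unit()) is not None:
            unit_id, payload = unit
            self.server.report(unit_id, self.compute(payload))
            done += 1
        return done


server = Server()
server.submit([[1, 2], [3, 4], [5]])  # toy work units: lists to be summed
clients = [Client(server, sum), Client(server, sum)]
processed = [c.run() for c in clients]
```

Because clients pull work rather than having it pushed to them, a slow or busy donor machine simply takes fewer units. A real server would also need to reissue units whose client never reports back, which this sketch leaves out.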




Proposed solution (cont.)




                  Fig: Diagram of the complete system
Proposed solution (cont.)




    Installation and Deployment
-    The platform consists of three executable JAR files corresponding
     to the server, client and remote interface.
-    The client runs as a low-priority background service.
-    Hardware requirement: at least a Pentium IV processor.
-    OS compatibility: Windows, Sun Solaris, Mac OS X and Linux.




                                   DPRml
             - Distributed Phylogeny Reconstruction by maximum likelihood




                                Previous situation:
 - Maximum likelihood (ML) is one of the most accurate techniques
   for reconstructing phylogenies.
 - Parallel ML programs had been developed for reconstructing large and
   accurate phylogenetic trees.
 - These programs were implemented in platform-specific languages.



                             DPRml (cont.)
              - Distributed Phylogeny Reconstruction by maximum likelihood



             After the development of the distributed computing platform:
 - DPRml is one of the most general and powerful likelihood-based
   phylogenetic tree building programs.
 - It uses a proven tree building algorithm and the Phylogenetic
   Analysis Library.
 - Multiple phylogenetic computations can run at the same time.
 - It is a platform-independent ML program.


DPRml (cont.)




                - Distributed Phylogeny Reconstruction by maximum likelihood


                                    Speedup testing:

             Fig. Speedup achieved by running 6 simultaneous DPRml problems
                         using between 1 and 40 semi-idle processors.
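For readers unfamiliar with the term, speedup is conventionally the run time on one processor divided by the run time on p processors, and parallel efficiency is that speedup divided by p. The short Python sketch below shows the arithmetic; the timings in it are made-up numbers chosen for illustration and are not measurements from the paper.

```python
def speedup(t_serial, t_parallel):
    """Speedup S = T(1) / T(p)."""
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, processors):
    """Parallel efficiency E = S / p; 1.0 would be ideal linear scaling."""
    return speedup(t_serial, t_parallel) / processors


# Hypothetical timings in minutes, NOT results from the paper.
s = speedup(800.0, 20.0)
e = efficiency(800.0, 20.0, 80)
```

With semi-idle donor machines the efficiency is expected to stay below 1.0, since each machine contributes only its spare cycles.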
DSEARCH




      - A fully cross-platform parallel database search program.
      - Operates in a master-slave environment.
      - Splits the database into fixed-size units that are subsequently
        searched on the donor machines.
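The splitting step can be sketched as below. This is an illustrative Python sketch, not DSEARCH's own code (which is Java): the function names are hypothetical, and "fixed-size" is interpreted here as a fixed number of sequences per unit, since the slide does not say whether size is counted in sequences or bytes.

```python
def read_fasta(text):
    """Parse FASTA text into (header, sequence) pairs."""
    records, header, parts = [], None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(parts)))
            header, parts = line[1:], []
        else:
            parts.append(line)
    if header is not None:
        records.append((header, "".join(parts)))
    return records


def split_into_units(records, unit_size):
    """Group records into units of at most unit_size sequences each."""
    if unit_size < 1:
        raise ValueError("unit_size must be at least 1")
    return [records[i:i + unit_size] for i in range(0, len(records), unit_size)]


# Tiny made-up database with three sequences.
db = ">s1\nACGT\nAC\n>s2\nGGTT\n>s3\nTTAA\n"
units = split_into_units(read_fasta(db), unit_size=2)
```

Each unit can then be sent to a donor machine together with the query sequence, and the master merges the per-unit hits once they come back.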




DSEARCH (cont.)




               Speedup testing, using:

               -    A FASTA database file
               -    A FASTA query sequence file
               -    A searching scheme
               -    A configuration file

    Fig. Speedup achieved by DSEARCH running on
           between 1 and 80 semi-idle processors.


My criticism and future work




     - The paper gives no detailed description of how the applications
       work on the distributed computing platform.
     - If the semi-idle PCs have few spare clock cycles, the system
       will not deliver its best performance.
     - Failure of the network connecting the desktop PCs will reduce
       performance.
     - Future work: improve and expand the range of bioinformatics
       applications for the system.


Conclusion




 “Research work should not have a conclusion. It is a
 continual process, and it will continue for the
 betterment of human beings.”




             ANY QUESTIONS?



Chapter 3 - Islamic Banking Products and Services.pptx
 
JEE1_This_section_contains_FOUR_ questions
JEE1_This_section_contains_FOUR_ questionsJEE1_This_section_contains_FOUR_ questions
JEE1_This_section_contains_FOUR_ questions
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 

A High Throughput Bioinformatics Distributed Computing Platform

  • 1. A High-Throughput Bioinformatics Distributed Computing Platform. Institute of Information Technology (IIT), University of Dhaka, 19-09-2012.
  • 2. Presented by: Md. Habibur Rahman (BIT 0216), Institute of Information Technology, University of Dhaka, Bangladesh.
  • 3. The contributors of the paper
      - Thomas M. Keane, Andrew J. Page, James O. McInerney, and Thomas J. Naughton
      - Bioinformatics and Pharmacogenomics Laboratory, National University of Ireland, Maynooth, Co. Kildare, Ireland
      - Department of Computer Science, National University of Ireland, Maynooth, Co. Kildare, Ireland
      - Homepage: http://www.cs.nuim.ie/distibuted
  • 4. Publication: 18th IEEE Symposium on Computer-Based Medical Systems (CBMS’05).
  • 5. Suitability of Bioinformatics to Distributed Computing
      - A class of algorithmic parallelism referred to as coarse-grained parallelism.
      - High compute-to-data ratio.
  • 6. Topic and Problem Overview
      - Demand for high-performance computing in bioinformatics has increased dramatically because of the rapid growth in the size of genomic databases.
      - Traditional database search algorithms cannot perform a full search of a large database in a reasonable time.
      - Heuristic algorithms are feasible, but they reduce the sensitivity of the search.
      - Evolutionary biology: phylogenetic trees and greedy heuristic algorithms.
  • 7. Proposed solution
      - According to the authors of the paper: “We present a general-purpose programmable distributed computing platform suitable for deployment in a typical university environment where many semi-idle desktop PC’s are connected via a network”
      - The system is fully cross-platform.
      - Two distributed bioinformatics applications: (i) DSEARCH, (ii) DPRml.
  • 8. Proposed solution (cont.): Java distributed computing platform
      - Client-server model.
      - The server controls the resources (database, algorithm, or computer hardware).
      - The model is divided into three separate pieces of software: server, client, and remote interface.
  • 9. Proposed solution (cont.). Fig.: Diagram of the complete system.
  • 10. Proposed solution (cont.): Installation and deployment
      - The platform consists of three executable JAR files corresponding to the server, client, and remote interface.
      - The client runs as a low-priority background service.
      - Hardware specification: at least a Pentium IV processor.
      - OS compatibility: Windows, Sun Solaris, Mac OS X, and Linux.
  • 11. DPRml: Distributed Phylogeny Reconstruction by maximum likelihood. Previous situation:
      - Maximum likelihood (ML) is one of the most accurate techniques for reconstructing phylogenies.
      - Parallel ML programs had been developed for reconstructing large and accurate phylogenetic trees.
      - They were implemented in platform-specific languages.
  • 12. DPRml (cont.): Distributed Phylogeny Reconstruction by maximum likelihood. After the development of the distributed computing platform:
      - One of the most general and powerful likelihood-based phylogenetic tree-building programs.
      - Uses a proven tree-building algorithm and the Phylogenetic Analysis Library.
      - Possibility of multiple phylogenetic computations.
      - Platform-independent ML program.
  • 13. DPRml (cont.): Speedup testing. Fig.: Speedup achieved by running 6 simultaneous DPRml problems using between 1 and 40 semi-idle processors.
  • 14. DSEARCH
      - Fully cross-platform parallel database search program.
      - Operates in a master-slave environment.
      - Splits the database into fixed-size units that are subsequently searched on the donor machines.
  • 15. DSEARCH (cont.): Speedup testing, using:
      - A FASTA database file.
      - A FASTA query sequence file.
      - A searching scheme.
      - A configuration file.
      - Fig.: Speedup achieved by DSEARCH running on between 1 and 80 semi-idle processors.
  • 16. My criticism and future work
      - The paper gives no detailed description of how the applications work on the distributed computing platform.
      - If spare clock cycles are not available on the semi-idle PCs, the system will not give the best result.
      - Failure of the network connecting the desktop PCs will reduce performance.
      - Future work: improve and expand the range of bioinformatics applications for the system.
  • 17. Conclusion: “Research work should not have any conclusion. It is a continual process, and it will continue for the betterment of human beings.”
  • 18. Any questions?
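
The slides describe DSEARCH as splitting the database into fixed-size units that are searched on donor machines, with the client running at low priority, but they do not show any implementation. The following is only a minimal single-machine sketch of that idea, not the authors' code. It assumes that low-priority threads stand in for donor machines and that a plain substring match stands in for the real sequence search. The class name `ChunkedSearchSketch` and the methods `splitIntoUnits`, `searchUnit`, and `distributedSearch` are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ChunkedSearchSketch {

    /** Splits the records into units of at most unitSize records each. */
    static List<List<String>> splitIntoUnits(List<String> records, int unitSize) {
        if (unitSize <= 0) {
            throw new IllegalArgumentException("unitSize must be positive");
        }
        List<List<String>> units = new ArrayList<>();
        for (int start = 0; start < records.size(); start += unitSize) {
            int end = Math.min(start + unitSize, records.size());
            units.add(new ArrayList<>(records.subList(start, end)));
        }
        return units;
    }

    /** Assumption: a substring match stands in for the real search algorithm. */
    static List<String> searchUnit(List<String> unit, String query) {
        List<String> hits = new ArrayList<>();
        for (String record : unit) {
            if (record.contains(query)) {
                hits.add(record);
            }
        }
        return hits;
    }

    /** Master side: hands one unit to each task and merges hits in unit order. */
    static List<String> distributedSearch(List<String> records, String query,
                                          int unitSize, int workers) throws Exception {
        // Assumption: minimum-priority daemon threads mimic semi-idle donor clients.
        ExecutorService pool = Executors.newFixedThreadPool(workers, runnable -> {
            Thread t = new Thread(runnable);
            t.setPriority(Thread.MIN_PRIORITY);
            t.setDaemon(true);
            return t;
        });
        try {
            List<Future<List<String>>> pending = new ArrayList<>();
            for (List<String> unit : splitIntoUnits(records, unitSize)) {
                pending.add(pool.submit(() -> searchUnit(unit, query)));
            }
            List<String> merged = new ArrayList<>();
            for (Future<List<String>> result : pending) {
                merged.addAll(result.get()); // blocks until that unit is finished
            }
            return merged;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        List<String> db = List.of("ACGTAC", "TTGACA", "GGGCCC", "ACGTTT", "CATGCA");
        System.out.println(distributedSearch(db, "ACGT", 2, 3));
    }
}
```

Because each unit is searched independently and only the hits travel back to the master, the work is coarse-grained in the sense used on the suitability slide: there is a lot of computation per unit of data exchanged. A real deployment would also need to handle donor machines that disappear mid-task, which this sketch ignores.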