SlideShare a Scribd company logo
1 of 10
BIO MAJ 
WORKFLOW ENGINE DEDICATED TO BIO-DATA 
SYNCHRONIZATION AND 
PROCESSING. 
Sana Anam Roll # 3003 
Bs(Hons) Botany 3rd semester Eve 
Submitted to Inam ul Haq 
University of Education
CONTENT 
 INTRODUCTION 
 BACKGROUND OF BIOMAJ 
 APPLICATION 
 BIOMAJ PROVIDE 
 CONCLUSION 
 REFRENCES 
University of Education
INTRODUCTION 
 In biocomputing, 
 analyses are almost systematically reliant on 
databanks. 
 Any biocomputing site therefore needs to manage 
these invaluable databanks that hold a huge 
amount of information usually several terabytes, 
spread over various international sites and in a 
consistent format (there are still several different 
standards currently). 
University of Education
BACKGROUND OF BIOMAJ 
 The BioMAJ project came out of the work of three teams in 2005: INRIA 
Rennes and INRA 
 Toulouse and JouyenJosas. 
 At the time, no free applications met users’ requirements. The closest 
 application was citrina, developed by Josh Goodman (from Washington 
University’s gmod project). 
 This was a promising prototype – nonetheless quite far from the 
application required – and it had 
 not been updated since 2004. 
 In 2006, these teams (INRIA and INRA) developed a new engine called 
BioMAJ1. Based on 
 citrina 0.51, nearly all the code was rewritten and the application’s 
architecture and functions were 
 completely rethought and considerably extended. 
 During 2007, the application was tested on the three sites involved in the 
project to make it 
 more robust and suitable 
University of Education
APPLICATION 
 Synchronization : 
Multiple remote protocols (ftp, sftp, http, rsync, 
local copy) 
Data transfers integrity check 
Release versioning using a incremental 
approach 
Multi threading 
Data extraction (gzip, tar, bzip) 
Data tree directory normalization 
University of Education
 Pre &Post processing : 
Advanced workflow description (D.A.G) using 
Easy normalized syntax language 
Post-process indexation for various 
bioinformatics software (blast, srs, fastacmd, 
readseq, etc…) 
Easy integration of personal scripts for bank 
post-processing automation 
University of Education
 Supervision : 
Administration web interface 
Repository statistics 
Mail alerts for the update cycle supervision 
University of Education
BIOMAJ PROVIDE 
 A reliable workflow engine that can download 
remote data automatically and intelligently 
 (error correction, synchronization of local and 
remote data), apply formatting to this data and 
 put it into production (make the data available for all 
users and/or applications). 
 A group of predefined workflows for the main 
biological banks. 
 An indexing scripts library (formatting for 
biological data) 
University of Education
CONCLUSION 
 BioMAJ provides flexibility in managing banks of 
sequences on a site while allowing for rapid 
implementation of new workflows by simply creating 
a bank description file. 
University of Education
REFERENCES 
 Website: http://biomaj.genouest.org/ 
University of Education 
Authors: David Allouche, Olivier Filangi , Romaric Sabas, 
Olivier Sallou 
(olivier.sallou@irisa.fr)

More Related Content

What's hot

bio data
bio databio data
bio data
007dcp
 
American Society for Mass Spectrometry Conference 2013
American Society for Mass Spectrometry Conference 2013American Society for Mass Spectrometry Conference 2013
American Society for Mass Spectrometry Conference 2013
Dmitry Grapov
 
eXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic ExperimentseXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic Experiments
Tim Clark
 

What's hot (20)

Biological Database
Biological DatabaseBiological Database
Biological Database
 
Features of biological databases
Features of biological databasesFeatures of biological databases
Features of biological databases
 
bio data
bio databio data
bio data
 
The Chemtools LaBLog
The Chemtools LaBLogThe Chemtools LaBLog
The Chemtools LaBLog
 
Open babel
Open babelOpen babel
Open babel
 
Quick Intro to InterMine within AIP and MTGD - JCVI Research Works-in-Progres...
Quick Intro to InterMine within AIP and MTGD - JCVI Research Works-in-Progres...Quick Intro to InterMine within AIP and MTGD - JCVI Research Works-in-Progres...
Quick Intro to InterMine within AIP and MTGD - JCVI Research Works-in-Progres...
 
Proteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASyProteomics resources at the EBI & ExPASy
Proteomics resources at the EBI & ExPASy
 
Genome Database Systems
Genome Database Systems Genome Database Systems
Genome Database Systems
 
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka IntegrationACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
 
American Society for Mass Spectrometry Conference 2013
American Society for Mass Spectrometry Conference 2013American Society for Mass Spectrometry Conference 2013
American Society for Mass Spectrometry Conference 2013
 
FAIR data management in biomedicine
FAIR data management  in biomedicineFAIR data management  in biomedicine
FAIR data management in biomedicine
 
exFrame: a Semantic Web Platform for Genomics Experiments
exFrame: a Semantic Web Platform for Genomics ExperimentsexFrame: a Semantic Web Platform for Genomics Experiments
exFrame: a Semantic Web Platform for Genomics Experiments
 
eXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic ExperimentseXframe: A Semantic Web Platform for Genomic Experiments
eXframe: A Semantic Web Platform for Genomic Experiments
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 
Fairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology viewsFairport domain specific metadata using w3 c dcat & skos w ontology views
Fairport domain specific metadata using w3 c dcat & skos w ontology views
 
Powerpoint BinoM
Powerpoint BinoMPowerpoint BinoM
Powerpoint BinoM
 
Model repositories and standard formats for model reusability
Model repositories and standard formats for model reusabilityModel repositories and standard formats for model reusability
Model repositories and standard formats for model reusability
 
FuGE Update
FuGE UpdateFuGE Update
FuGE Update
 
Reference Management Tools
Reference Management ToolsReference Management Tools
Reference Management Tools
 
Ngsp
NgspNgsp
Ngsp
 

Similar to BIOMAJ

FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
smarru
 
Enabling Large Scale Sequencing Studies through Science as a Service
Enabling Large Scale Sequencing Studies through Science as a ServiceEnabling Large Scale Sequencing Studies through Science as a Service
Enabling Large Scale Sequencing Studies through Science as a Service
Justin Johnson
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
Ian Foster
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011
Ian Foster
 
An Ad-hoc Smart Gateway Platform for the Web of Things (IEEE iThings 2013 Bes...
An Ad-hoc Smart Gateway Platform for the Web of Things (IEEE iThings 2013 Bes...An Ad-hoc Smart Gateway Platform for the Web of Things (IEEE iThings 2013 Bes...
An Ad-hoc Smart Gateway Platform for the Web of Things (IEEE iThings 2013 Bes...
Darren Carlson
 
Book of abstract volume 8 no 9 ijcsis december 2010
Book of abstract volume 8 no 9 ijcsis december 2010Book of abstract volume 8 no 9 ijcsis december 2010
Book of abstract volume 8 no 9 ijcsis december 2010
Oladokun Sulaiman
 
Developing a Digital Repository for International Symposium on Information Ma...
Developing a Digital Repository for International Symposium on Information Ma...Developing a Digital Repository for International Symposium on Information Ma...
Developing a Digital Repository for International Symposium on Information Ma...
Müge Akbulut
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 

Similar to BIOMAJ (20)

FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Ogce Workflow Suite
Ogce Workflow SuiteOgce Workflow Suite
Ogce Workflow Suite
 
Software Pipelines: The Good, The Bad and The Ugly
Software Pipelines: The Good, The Bad and The UglySoftware Pipelines: The Good, The Bad and The Ugly
Software Pipelines: The Good, The Bad and The Ugly
 
grid mining
grid mininggrid mining
grid mining
 
Data Ingestion At Scale (CNECCS 2017)
Data Ingestion At Scale (CNECCS 2017)Data Ingestion At Scale (CNECCS 2017)
Data Ingestion At Scale (CNECCS 2017)
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
Enabling Large Scale Sequencing Studies through Science as a Service
Enabling Large Scale Sequencing Studies through Science as a ServiceEnabling Large Scale Sequencing Studies through Science as a Service
Enabling Large Scale Sequencing Studies through Science as a Service
 
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...
A Web Services Based Framework For Uniform Integration Of Command-Line Bioinf...
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
 
Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)Web services for sharing germplasm data sets, at FAO in Rome (2006)
Web services for sharing germplasm data sets, at FAO in Rome (2006)
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011
 
An Ad-hoc Smart Gateway Platform for the Web of Things (IEEE iThings 2013 Bes...
An Ad-hoc Smart Gateway Platform for the Web of Things (IEEE iThings 2013 Bes...An Ad-hoc Smart Gateway Platform for the Web of Things (IEEE iThings 2013 Bes...
An Ad-hoc Smart Gateway Platform for the Web of Things (IEEE iThings 2013 Bes...
 
Book of abstract volume 8 no 9 ijcsis december 2010
Book of abstract volume 8 no 9 ijcsis december 2010Book of abstract volume 8 no 9 ijcsis december 2010
Book of abstract volume 8 no 9 ijcsis december 2010
 
Overview of the Data Processing Error Analysis System (DPEAS)
Overview of the Data Processing Error Analysis System (DPEAS)Overview of the Data Processing Error Analysis System (DPEAS)
Overview of the Data Processing Error Analysis System (DPEAS)
 
A consistent and efficient graphical User Interface Design and Querying Organ...
A consistent and efficient graphical User Interface Design and Querying Organ...A consistent and efficient graphical User Interface Design and Querying Organ...
A consistent and efficient graphical User Interface Design and Querying Organ...
 
Developing a Digital Repository for International Symposium on Information Ma...
Developing a Digital Repository for International Symposium on Information Ma...Developing a Digital Repository for International Symposium on Information Ma...
Developing a Digital Repository for International Symposium on Information Ma...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Closing the Gap in Time: From Raw Data to Real Science
Closing the Gap in Time: From Raw Data to Real ScienceClosing the Gap in Time: From Raw Data to Real Science
Closing the Gap in Time: From Raw Data to Real Science
 
WSO2 Big Data Platform and Applications
WSO2 Big Data Platform and ApplicationsWSO2 Big Data Platform and Applications
WSO2 Big Data Platform and Applications
 

More from university of education,Lahore

More from university of education,Lahore (20)

Activites and Time Planning
 Activites and Time Planning Activites and Time Planning
Activites and Time Planning
 
Steganography
SteganographySteganography
Steganography
 
Classical Encryption Techniques
Classical Encryption TechniquesClassical Encryption Techniques
Classical Encryption Techniques
 
Activites and Time Planning
Activites and Time PlanningActivites and Time Planning
Activites and Time Planning
 
OSI Security Architecture
OSI Security ArchitectureOSI Security Architecture
OSI Security Architecture
 
Network Security Terminologies
Network Security TerminologiesNetwork Security Terminologies
Network Security Terminologies
 
Project Scheduling, Planning and Risk Management
Project Scheduling, Planning and Risk ManagementProject Scheduling, Planning and Risk Management
Project Scheduling, Planning and Risk Management
 
Software Testing and Debugging
Software Testing and DebuggingSoftware Testing and Debugging
Software Testing and Debugging
 
ePayment Methods
ePayment MethodsePayment Methods
ePayment Methods
 
SEO
SEOSEO
SEO
 
A Star Search
A Star SearchA Star Search
A Star Search
 
Enterprise Application Integration
Enterprise Application IntegrationEnterprise Application Integration
Enterprise Application Integration
 
Uml Diagrams
Uml DiagramsUml Diagrams
Uml Diagrams
 
eDras Max
eDras MaxeDras Max
eDras Max
 
RAD Model
RAD ModelRAD Model
RAD Model
 
Microsoft Project
Microsoft ProjectMicrosoft Project
Microsoft Project
 
Itertaive Process Development
Itertaive Process DevelopmentItertaive Process Development
Itertaive Process Development
 
Computer Aided Software Engineering Nayab Awan
Computer Aided Software Engineering Nayab AwanComputer Aided Software Engineering Nayab Awan
Computer Aided Software Engineering Nayab Awan
 
Lect 2 assessing the technology landscape
Lect 2 assessing the technology landscapeLect 2 assessing the technology landscape
Lect 2 assessing the technology landscape
 
system level requirements gathering and analysis
system level requirements gathering and analysissystem level requirements gathering and analysis
system level requirements gathering and analysis
 

Recently uploaded

Recently uploaded (20)

latest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answerslatest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answers
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Basic Intentional Injuries Health Education
Basic Intentional Injuries Health EducationBasic Intentional Injuries Health Education
Basic Intentional Injuries Health Education
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Tatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf artsTatlong Kwento ni Lola basyang-1.pdf arts
Tatlong Kwento ni Lola basyang-1.pdf arts
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 

BIOMAJ

  • 1. BIO MAJ WORKFLOW ENGINE DEDICATED TO BIO-DATA SYNCHRONIZATION AND PROCESSING. Sana Anam Roll # 3003 Bs(Hons) Botany 3rd semester Eve Submitted to Inam ul Haq University of Education
  • 2. CONTENT  INTRODUCTION  BACKGROUND OF BIOMAJ  APPLICATION  BIOMAJ PROVIDE  CONCLUSION  REFRENCES University of Education
  • 3. INTRODUCTION  In biocomputing,  analyses are almost systematically reliant on databanks.  Any biocomputing site therefore needs to manage these invaluable databanks that hold a huge amount of information usually several terabytes, spread over various international sites and in a consistent format (there are still several different standards currently). University of Education
  • 4. BACKGROUND OF BIOMAJ  The BioMAJ project came out of the work of three teams in 2005: INRIA Rennes and INRA  Toulouse and JouyenJosas.  At the time, no free applications met users’ requirements. The closest  application was citrina, developed by Josh Goodman (from Washington University’s gmod project).  This was a promising prototype – nonetheless quite far from the application required – and it had  not been updated since 2004.  In 2006, these teams (INRIA and INRA) developed a new engine called BioMAJ1. Based on  citrina 0.51, nearly all the code was rewritten and the application’s architecture and functions were  completely rethought and considerably extended.  During 2007, the application was tested on the three sites involved in the project to make it  more robust and suitable University of Education
  • 5. APPLICATION  Synchronization : Multiple remote protocols (ftp, sftp, http, rsync, local copy) Data transfers integrity check Release versioning using a incremental approach Multi threading Data extraction (gzip, tar, bzip) Data tree directory normalization University of Education
  • 6.  Pre &Post processing : Advanced workflow description (D.A.G) using Easy normalized syntax language Post-process indexation for various bioinformatics software (blast, srs, fastacmd, readseq, etc…) Easy integration of personal scripts for bank post-processing automation University of Education
  • 7.  Supervision : Administration web interface Repository statistics Mail alerts for the update cycle supervision University of Education
  • 8. BIOMAJ PROVIDE  A reliable workflow engine that can download remote data automatically and intelligently  (error correction, synchronization of local and remote data), apply formatting to this data and  put it into production (make the data available for all users and/or applications).  A group of predefined workflows for the main biological banks.  An indexing scripts library (formatting for biological data) University of Education
  • 9. CONCLUSION  BioMAJ provides flexibility in managing banks of sequences on a site while allowing for rapid implementation of new workflows by simply creating a bank description file. University of Education
  • 10. REFERENCES  Website: http://biomaj.genouest.org/ University of Education Authors: David Allouche, Olivier Filangi , Romaric Sabas, Olivier Sallou (olivier.sallou@irisa.fr)