SlideShare a Scribd company logo
1 of 14
SageCite workflow citation demonstrator Peter Li
Workflows Two workflows have been developed with Brig Mecham from Sage Bionetworks
MetaGEO project The 2 workflows have been developed in the context ofBrig’s MetaGEO project which normalises gene expression data sets in the GEO database The normalised data sets enable meta-analyses, e.g. identification of disease signatures Difference between MetaGEO and other similar projects is that all research objects in MetaGEO is open access Data, results, intermediate results, data analysis and integration procedures, etc Enhances the trust of MetaGEO data by researchers For more information on MetaGEO, see Brig’s slides on SageCite wiki
metaGEO: Current Users/Contributors Lilyana Margaretha,Stem Cell Biology Pete Nelson, Prostate Cancer Bin Zhang,AML Joyoti Dey,Medulloblastoma Mette Peters, Alzheimers Peter Li, Workflows Anders Rosengren, Diabetes & Perturbations Ji Zhang, AML Brig Mecham, Sage Bionetworks Roel Verhaak, Updated GSE6891
metaGEO: Automated Workflows (1) Acquire Data (2) Curation (4) Inference (3) QC Brig Mecham, Sage Bionetworks
Workflow 1 This workflow produces an annotation library that is used to map gene probes on Affymetrix chips to a specific gene for an organism The library is used as part of the curation step for gene expression data sets in GEO
Workflow 2 This workflow performs normalisation and inference analysis on GEO data Produces normalised data and statistics of gene expression
Workflow citation demonstrator Developed a Taverna plugin for registering workflow results with a DOI using DataCite service
Workflow citation demonstrator Plugin provides an operation in Taverna’s service palette that can be incorporated into workflows to register a data set with a DOI via DataCite
Registration of data To register data, the plugin provides it with a DOI For example: 10.5520/SAGECITE-1
SageCite demo repository web site The plugin stores data in a local sqlite database and creates a web page on the SageCite demo repository web site to display data
Registration of data using DataCite The plugin uses the DOI to register the data on DataCite using its Web API
Registration of data using DataCite Clicking on the DOI link takes you to the web page for the data on the SageCite demo repository site
To do and issues Need to register metadata for workflow results using DataCite API Large size of data generated from Brig’s pipelines sometimes breaks plugin

More Related Content

What's hot

Recording and Reasoning Over Data Provenance in Web and Grid Services
Recording and Reasoning Over Data Provenance in Web and Grid ServicesRecording and Reasoning Over Data Provenance in Web and Grid Services
Recording and Reasoning Over Data Provenance in Web and Grid Services
Martin Szomszor
 
IC-SDV 2019: OntoChem
IC-SDV 2019: OntoChemIC-SDV 2019: OntoChem
IC-SDV 2019: OntoChem
Dr. Haxel Consult
 
NFAIS Altmetrics Webinar 2014
NFAIS Altmetrics Webinar 2014NFAIS Altmetrics Webinar 2014
NFAIS Altmetrics Webinar 2014
William Gunn
 
U Maryland Connect: How Mendeley Illuminates a Broader Definition of Impact
U Maryland Connect: How Mendeley Illuminates a Broader Definition of ImpactU Maryland Connect: How Mendeley Illuminates a Broader Definition of Impact
U Maryland Connect: How Mendeley Illuminates a Broader Definition of Impact
William Gunn
 
香港六合彩
香港六合彩香港六合彩
香港六合彩
shujia
 

What's hot (20)

A Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical ResearchA Data Biosphere for Biomedical Research
A Data Biosphere for Biomedical Research
 
Recording and Reasoning Over Data Provenance in Web and Grid Services
Recording and Reasoning Over Data Provenance in Web and Grid ServicesRecording and Reasoning Over Data Provenance in Web and Grid Services
Recording and Reasoning Over Data Provenance in Web and Grid Services
 
Role of PIDs in connecting scholarly works
Role of PIDs in connecting scholarly worksRole of PIDs in connecting scholarly works
Role of PIDs in connecting scholarly works
 
What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?What is Data Commons and How Can Your Organization Build One?
What is Data Commons and How Can Your Organization Build One?
 
20170621_System requirements of data journal platform
20170621_System requirements of data journal platform20170621_System requirements of data journal platform
20170621_System requirements of data journal platform
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
Molecular interactions. PSICQUIC and IntAct.
Molecular interactions. PSICQUIC and IntAct.Molecular interactions. PSICQUIC and IntAct.
Molecular interactions. PSICQUIC and IntAct.
 
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...
How OpenAIRE uses persistent identifiers for discovery, enrichment, and linki...
 
IC-SDV 2019: OntoChem
IC-SDV 2019: OntoChemIC-SDV 2019: OntoChem
IC-SDV 2019: OntoChem
 
New PID developments
New PID developmentsNew PID developments
New PID developments
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
NFAIS Altmetrics Webinar 2014
NFAIS Altmetrics Webinar 2014NFAIS Altmetrics Webinar 2014
NFAIS Altmetrics Webinar 2014
 
U Maryland Connect: How Mendeley Illuminates a Broader Definition of Impact
U Maryland Connect: How Mendeley Illuminates a Broader Definition of ImpactU Maryland Connect: How Mendeley Illuminates a Broader Definition of Impact
U Maryland Connect: How Mendeley Illuminates a Broader Definition of Impact
 
香港六合彩
香港六合彩香港六合彩
香港六合彩
 
How DataCite and Crossref Support Research Data Sharing - Crossref LIVE Hannover
How DataCite and Crossref Support Research Data Sharing - Crossref LIVE HannoverHow DataCite and Crossref Support Research Data Sharing - Crossref LIVE Hannover
How DataCite and Crossref Support Research Data Sharing - Crossref LIVE Hannover
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openly
 
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 
Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark
Pistoia Alliance conference April 2016: Big Data: Mathew WoodwarkPistoia Alliance conference April 2016: Big Data: Mathew Woodwark
Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark
 

Viewers also liked

The Needleman Wunsch algorithm
The Needleman Wunsch algorithmThe Needleman Wunsch algorithm
The Needleman Wunsch algorithm
avrilcoghlan
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
hemantbreeder
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
biinoida
 
Classification and properties of protein
Classification and properties of proteinClassification and properties of protein
Classification and properties of protein
Mark Philip Besana
 

Viewers also liked (13)

Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
Genome Browsing, Genomic Data Mining and Genome Data Visualization with Ensem...
 
Vivian unlocking-vista
Vivian unlocking-vistaVivian unlocking-vista
Vivian unlocking-vista
 
Visualization Tools
Visualization ToolsVisualization Tools
Visualization Tools
 
Next-generation sequencing data format and visualization with ngs.plot 2015
Next-generation sequencing data format and visualization with ngs.plot 2015Next-generation sequencing data format and visualization with ngs.plot 2015
Next-generation sequencing data format and visualization with ngs.plot 2015
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in Bioinformatics
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
The Needleman Wunsch algorithm
The Needleman Wunsch algorithmThe Needleman Wunsch algorithm
The Needleman Wunsch algorithm
 
Comparative Genomics and Visualisation BS32010
Comparative Genomics and Visualisation BS32010Comparative Genomics and Visualisation BS32010
Comparative Genomics and Visualisation BS32010
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Protein structure classification
Protein structure classificationProtein structure classification
Protein structure classification
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Classification and properties of protein
Classification and properties of proteinClassification and properties of protein
Classification and properties of protein
 

Similar to SageCite demonstrator overview

Go pathway-interaction-integration
Go pathway-interaction-integrationGo pathway-interaction-integration
Go pathway-interaction-integration
Chris Mungall
 
Pharma Research Automation by Connecting Researchers with Robots and Systems ...
Pharma Research Automation by Connecting Researchers with Robots and Systems ...Pharma Research Automation by Connecting Researchers with Robots and Systems ...
Pharma Research Automation by Connecting Researchers with Robots and Systems ...
camunda services GmbH
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
 
Geant4_Web_Application_Update_and_Pion_Cross_Section_Simulation
Geant4_Web_Application_Update_and_Pion_Cross_Section_SimulationGeant4_Web_Application_Update_and_Pion_Cross_Section_Simulation
Geant4_Web_Application_Update_and_Pion_Cross_Section_Simulation
Rasheed Auguste
 

Similar to SageCite demonstrator overview (20)

R meetup talk scaling data science with dgit
R meetup talk   scaling data science with dgitR meetup talk   scaling data science with dgit
R meetup talk scaling data science with dgit
 
Comparing and analyzing various method of data integration in big data
Comparing and analyzing various method of data integration in big dataComparing and analyzing various method of data integration in big data
Comparing and analyzing various method of data integration in big data
 
Go pathway-interaction-integration
Go pathway-interaction-integrationGo pathway-interaction-integration
Go pathway-interaction-integration
 
Process management seminar
Process management seminarProcess management seminar
Process management seminar
 
Is there a way that we can build our Azure Data Factory all with parameters b...
Is there a way that we can build our Azure Data Factory all with parameters b...Is there a way that we can build our Azure Data Factory all with parameters b...
Is there a way that we can build our Azure Data Factory all with parameters b...
 
AWS HCLS Virtual Symposium 2021_Maze-Nichols.pptx
AWS HCLS Virtual Symposium 2021_Maze-Nichols.pptxAWS HCLS Virtual Symposium 2021_Maze-Nichols.pptx
AWS HCLS Virtual Symposium 2021_Maze-Nichols.pptx
 
Pharma Research Automation by Connecting Researchers with Robots and Systems ...
Pharma Research Automation by Connecting Researchers with Robots and Systems ...Pharma Research Automation by Connecting Researchers with Robots and Systems ...
Pharma Research Automation by Connecting Researchers with Robots and Systems ...
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...
Project Focused Activity And Knowledge Tracker A Unified Data Analysis Collab...
 
Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4...
Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4...Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4...
Workshop: Introduction to Cytoscape at UT-KBRIN Bioinformatics Summit 2014 (4...
 
S02 hybrid app_and_gae_restful_architecture_v2.0
S02 hybrid app_and_gae_restful_architecture_v2.0S02 hybrid app_and_gae_restful_architecture_v2.0
S02 hybrid app_and_gae_restful_architecture_v2.0
 
Batch Process Analytics
Batch Process Analytics Batch Process Analytics
Batch Process Analytics
 
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonComparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & Python
 
Data access
Data accessData access
Data access
 
Change data capture the journey to real time bi
Change data capture the journey to real time biChange data capture the journey to real time bi
Change data capture the journey to real time bi
 
An Overview of Data Lake
An Overview of Data LakeAn Overview of Data Lake
An Overview of Data Lake
 
Geant4_Web_Application_Update_and_Pion_Cross_Section_Simulation
Geant4_Web_Application_Update_and_Pion_Cross_Section_SimulationGeant4_Web_Application_Update_and_Pion_Cross_Section_Simulation
Geant4_Web_Application_Update_and_Pion_Cross_Section_Simulation
 
Data Governance
Data GovernanceData Governance
Data Governance
 
What's New in Pentaho 7.0?
What's New in Pentaho 7.0?What's New in Pentaho 7.0?
What's New in Pentaho 7.0?
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

SageCite demonstrator overview

  • 1. SageCite workflow citation demonstrator Peter Li
  • 2. Workflows Two workflows have been developed with Brig Mecham from Sage Bionetworks
  • 3. MetaGEO project The 2 workflows have been developed in the context ofBrig’s MetaGEO project which normalises gene expression data sets in the GEO database The normalised data sets enable meta-analyses, e.g. identification of disease signatures Difference between MetaGEO and other similar projects is that all research objects in MetaGEO is open access Data, results, intermediate results, data analysis and integration procedures, etc Enhances the trust of MetaGEO data by researchers For more information on MetaGEO, see Brig’s slides on SageCite wiki
  • 4. metaGEO: Current Users/Contributors Lilyana Margaretha,Stem Cell Biology Pete Nelson, Prostate Cancer Bin Zhang,AML Joyoti Dey,Medulloblastoma Mette Peters, Alzheimers Peter Li, Workflows Anders Rosengren, Diabetes & Perturbations Ji Zhang, AML Brig Mecham, Sage Bionetworks Roel Verhaak, Updated GSE6891
  • 5. metaGEO: Automated Workflows (1) Acquire Data (2) Curation (4) Inference (3) QC Brig Mecham, Sage Bionetworks
  • 6. Workflow 1 This workflow produces an annotation library that is used to map gene probes on Affymetrix chips to a specific gene for an organism The library is used as part of the curation step for gene expression data sets in GEO
  • 7. Workflow 2 This workflow performs normalisation and inference analysis on GEO data Produces normalised data and statistics of gene expression
  • 8. Workflow citation demonstrator Developed a Taverna plugin for registering workflow results with a DOI using DataCite service
  • 9. Workflow citation demonstrator Plugin provides an operation in Taverna’s service palette that can be incorporated into workflows to register a data set with a DOI via DataCite
  • 10. Registration of data To register data, the plugin provides it with a DOI For example: 10.5520/SAGECITE-1
  • 11. SageCite demo repository web site The plugin stores data in a local sqlite database and creates a web page on the SageCite demo repository web site to display data
  • 12. Registration of data using DataCite The plugin uses the DOI to register the data on DataCite using its Web API
  • 13. Registration of data using DataCite Clicking on the DOI link takes you to the web page for the data on the SageCite demo repository site
  • 14. To do and issues Need to register metadata for workflow results using DataCite API Large size of data generated from Brig’s pipelines sometimes breaks plugin