SlideShare a Scribd company logo
1 of 34
Download to read offline
MyTardis
Steve Androulakis
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Protein Data Bank ID: 4BOF
4.2mb
Presented on the Synchrotron’s Control PCs in a slideshow.
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Describe
Download
Browse
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Professionally-run Storage and Servers
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Store.Synchrotron.org.au
● Store.Synchrotron is a service that captures all macromolecular beamline data, available
online to all non-commercial Australian Synchrotron users. It was developed by Monash
University in a strategic, ongoing partnership.
● Data is immediately shareable by the researcher on the web and able to be published.
● The service operates on the Australian NeCTAR Research compute cloud in a scalable
setup able to withstand load and large fully redundant RDSI (VicNode) storage.
● We’re actively opening access to raw data behind high-impact research publications
under CC BY licenses. Six institutions have opened data so far.
● Built on MyTardis – an open source, Australian made data management.
● Visit store.synchrotron.org.au for access
github.com/mytardis API docs at mytardis.readthedocs.org
MyTardis
(store.synchrotron.org.au)
Organised Data And Metadata
MyTardis
(store.synchrotron.org.au)
Organised Data And Metadata
Instrument
Control
Systems
Proposal and
Scheduling
Systems
Data
Autoprocessing
Real-time instrument data capture
Capture began June 2013. As of July 2014, it has captured over 31
terabytes of data in over 2.4 million raw diffraction images.
Source: http://bdp-aaf-dev.dyndns.org/graphtime.html
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Research Data Management is Painful
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Case Study: Monash Micro Imaging
Users organise data into username/dataset folders on the instrument
control computers and data is automatically retrieved by MyTardis.
Over 15 Microscopes at Monash Micro Imaging are integrated.
This is being done for gene sequencers (UoM), nanofabrication (RMIT), MRI
(UQ) and more.
Case Study: RMIT Cloud HPC Provider
Processing On
NeCTAR Cloud
HRMC Web App
Results and
Analysis in
MyTardis
● Complex high performance
computing both on cloud
infrastructure and queue-based
systems
● Simple web interface to start
complex MapReduce runs with
parameter sweeps (Hybrid
Reverse Monte Carlo
simulations)
● Results come back with
analysis: graphs auto-generated
Manual Curation
Upload data from anywhere..
Button and Drag’n’Drop Code
(github.com/steveandroulakis/mytardis-uploader)
dx.doi.org/10.5072/03/529D3551F0117
Research Data Management is possible
● Time-consuming to store, describe and organise data
● Manual effort required to store and describe data
● Poor storage media
● Costly storage
● Hard to share data with collaborators (especially external to institution)
● Hard to access data
● Hard to open data to the public and cite it in a journal
Coming Soon..
● Deploy a scalable MyTardis on a cloud (like OpenStack or Amazon EC2) in a few
commands (using SaltStack).
● Mount MyTardis on the file system to browse and access your data.
● An instrument integration app is being produced with a double-click installer for simple
Instrument data to MyTardis (works on Windows!).
● Go to mytardis.org for more news and information.
● Thanks!
● Contact Me: Steve Androulakis (Bioinformatics Manager), steve.androulakis@monash.
edu

More Related Content

What's hot

Data Archiving and Processing
Data Archiving and ProcessingData Archiving and Processing
Data Archiving and Processing
CRRC-Armenia
 

What's hot (20)

Datasets with bioschemas
Datasets with bioschemasDatasets with bioschemas
Datasets with bioschemas
 
Data Archiving and Processing
Data Archiving and ProcessingData Archiving and Processing
Data Archiving and Processing
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Yosemite part-4 webinar-final
Yosemite part-4 webinar-finalYosemite part-4 webinar-final
Yosemite part-4 webinar-final
 
Data mining
Data miningData mining
Data mining
 
EDI Training Module 11: Publishing Data in the EDI Repository
EDI Training Module 11:  Publishing Data in the EDI RepositoryEDI Training Module 11:  Publishing Data in the EDI Repository
EDI Training Module 11: Publishing Data in the EDI Repository
 
Publishing Linked Data using Schema.org
Publishing Linked Data using Schema.orgPublishing Linked Data using Schema.org
Publishing Linked Data using Schema.org
 
Introduction to using REDCap for multi-site longitudinal research in medicine
Introduction to using REDCap for multi-site longitudinal research in medicineIntroduction to using REDCap for multi-site longitudinal research in medicine
Introduction to using REDCap for multi-site longitudinal research in medicine
 
Metadata ppt
Metadata ppt Metadata ppt
Metadata ppt
 
Getting to Grips with Research Data Management
Getting to Grips with Research Data Management Getting to Grips with Research Data Management
Getting to Grips with Research Data Management
 
Enhancing Interoperability: The Implementation of OpenAIRE Guidelines and COA...
Enhancing Interoperability: The Implementation of OpenAIRE Guidelines and COA...Enhancing Interoperability: The Implementation of OpenAIRE Guidelines and COA...
Enhancing Interoperability: The Implementation of OpenAIRE Guidelines and COA...
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data Portals
 
How Does Erasure Coding Protect Data?
How Does Erasure Coding Protect Data?How Does Erasure Coding Protect Data?
How Does Erasure Coding Protect Data?
 
THOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingTHOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier Linking
 
itx presentation: FHIR and the New Zealand EHR
itx presentation: FHIR and the New Zealand EHRitx presentation: FHIR and the New Zealand EHR
itx presentation: FHIR and the New Zealand EHR
 
Linked (Open) Data: A quick introduction
Linked (Open) Data: A quick introductionLinked (Open) Data: A quick introduction
Linked (Open) Data: A quick introduction
 
Enterprise knowledge graphs
Enterprise knowledge graphsEnterprise knowledge graphs
Enterprise knowledge graphs
 
Linked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcareLinked Data efforts for data standards in biopharma and healthcare
Linked Data efforts for data standards in biopharma and healthcare
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
 
Connections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystifiedConnections that work: Linked Open Data demystified
Connections that work: Linked Open Data demystified
 

Viewers also liked

Ultima version planificador de proyectos
Ultima version planificador de proyectosUltima version planificador de proyectos
Ultima version planificador de proyectos
fabyjulian
 
El docente y los programas escolares
El docente y los programas escolaresEl docente y los programas escolares
El docente y los programas escolares
Ednithaa Vazquez
 
TESNPPT2015General
TESNPPT2015GeneralTESNPPT2015General
TESNPPT2015General
TESN FREIGHT
 

Viewers also liked (20)

Two models for eresearch practice and service - NZ eResearch Symposium 2011
Two models for eresearch practice and service - NZ eResearch Symposium 2011Two models for eresearch practice and service - NZ eResearch Symposium 2011
Two models for eresearch practice and service - NZ eResearch Symposium 2011
 
Is eResearch about the technology? ...space, platforms, hubs and social chang...
Is eResearch about the technology? ...space, platforms, hubs and social chang...Is eResearch about the technology? ...space, platforms, hubs and social chang...
Is eResearch about the technology? ...space, platforms, hubs and social chang...
 
eResearch activities brochure
eResearch activities brochureeResearch activities brochure
eResearch activities brochure
 
Computational Data-Science Slideshow
Computational Data-Science SlideshowComputational Data-Science Slideshow
Computational Data-Science Slideshow
 
A Social Cloud for Public eResearch
A Social Cloud for Public eResearchA Social Cloud for Public eResearch
A Social Cloud for Public eResearch
 
Collaborative eResearch in a Social Cloud
Collaborative eResearch in a Social CloudCollaborative eResearch in a Social Cloud
Collaborative eResearch in a Social Cloud
 
Social Cloud talk at KSRI Service Summit 2012
Social Cloud talk at KSRI Service Summit 2012Social Cloud talk at KSRI Service Summit 2012
Social Cloud talk at KSRI Service Summit 2012
 
Podcast en la enseñanza del Idioma Inglés
Podcast en la enseñanza del Idioma InglésPodcast en la enseñanza del Idioma Inglés
Podcast en la enseñanza del Idioma Inglés
 
Ultima version planificador de proyectos
Ultima version planificador de proyectosUltima version planificador de proyectos
Ultima version planificador de proyectos
 
El docente y los programas escolares
El docente y los programas escolaresEl docente y los programas escolares
El docente y los programas escolares
 
Planificador de proyectos
Planificador de proyectosPlanificador de proyectos
Planificador de proyectos
 
Gestionar el conocimiento
Gestionar el conocimientoGestionar el conocimiento
Gestionar el conocimiento
 
TAREA DE INFORMATICA
TAREA DE INFORMATICA TAREA DE INFORMATICA
TAREA DE INFORMATICA
 
Sms insurance
Sms insuranceSms insurance
Sms insurance
 
HSBA's Litigation and Insurance Coverage Litigation sections - Nautilus Ins C...
HSBA's Litigation and Insurance Coverage Litigation sections - Nautilus Ins C...HSBA's Litigation and Insurance Coverage Litigation sections - Nautilus Ins C...
HSBA's Litigation and Insurance Coverage Litigation sections - Nautilus Ins C...
 
Sistem operasi part 2
Sistem operasi part 2Sistem operasi part 2
Sistem operasi part 2
 
Aplicación fox sports colombia.
Aplicación fox sports  colombia.Aplicación fox sports  colombia.
Aplicación fox sports colombia.
 
TESNPPT2015General
TESNPPT2015GeneralTESNPPT2015General
TESNPPT2015General
 
diferensial vektor
diferensial vektordiferensial vektor
diferensial vektor
 
pressure
pressurepressure
pressure
 

Similar to MyTardis for Uni of Cape Town 2014

Similar to MyTardis for Uni of Cape Town 2014 (20)

Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
University of Hertfordshire researcher development - research data management
University of Hertfordshire researcher development - research data management University of Hertfordshire researcher development - research data management
University of Hertfordshire researcher development - research data management
 
Enabling Your Data Science Team with Modern Data Engineering
Enabling Your Data Science Team with Modern Data EngineeringEnabling Your Data Science Team with Modern Data Engineering
Enabling Your Data Science Team with Modern Data Engineering
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
Converged IT and Data Commons
Converged IT and Data CommonsConverged IT and Data Commons
Converged IT and Data Commons
 
Dive deep into your Data Pools
Dive deep into your Data PoolsDive deep into your Data Pools
Dive deep into your Data Pools
 
EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu | EUDAT Research Data Management | www.eudat.eu |
EUDAT Research Data Management | www.eudat.eu |
 
2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop
 
A Year in Review - Building a Comprehensive Data Management Program
A Year in Review - Building a Comprehensive Data Management ProgramA Year in Review - Building a Comprehensive Data Management Program
A Year in Review - Building a Comprehensive Data Management Program
 
OU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research dataOU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research data
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Planning for Research Data Managment
Planning for Research Data ManagmentPlanning for Research Data Managment
Planning for Research Data Managment
 
Providing support and services for researchers in good data governance
Providing support and services for researchers in good data governanceProviding support and services for researchers in good data governance
Providing support and services for researchers in good data governance
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
MohamedFarag457087
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
Silpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
Scintica Instrumentation
 

Recently uploaded (20)

COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 

MyTardis for Uni of Cape Town 2014

  • 2. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 3. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 4.
  • 5. Protein Data Bank ID: 4BOF 4.2mb
  • 6.
  • 7. Presented on the Synchrotron’s Control PCs in a slideshow.
  • 8. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 9. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 11.
  • 12. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 13. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 14.
  • 15. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 16. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 18. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 19. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 20. Store.Synchrotron.org.au ● Store.Synchrotron is a service that captures all macromolecular beamline data, available online to all non-commercial Australian Synchrotron users. It was developed by Monash University in a strategic, ongoing partnership. ● Data is immediately shareable by the researcher on the web and able to be published. ● The service operates on the Australian NeCTAR Research compute cloud in a scalable setup able to withstand load and large fully redundant RDSI (VicNode) storage. ● We’re actively opening access to raw data behind high-impact research publications under CC BY licenses. Six institutions have opened data so far. ● Built on MyTardis – an open source, Australian made data management. ● Visit store.synchrotron.org.au for access
  • 21. github.com/mytardis API docs at mytardis.readthedocs.org
  • 23. MyTardis (store.synchrotron.org.au) Organised Data And Metadata Instrument Control Systems Proposal and Scheduling Systems Data Autoprocessing
  • 24. Real-time instrument data capture Capture began June 2013. As of July 2014, it has captured over 31 terabytes of data in over 2.4 million raw diffraction images. Source: http://bdp-aaf-dev.dyndns.org/graphtime.html
  • 25.
  • 26.
  • 27. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 28. Research Data Management is Painful ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 29. Case Study: Monash Micro Imaging Users organise data into username/dataset folders on the instrument control computers and data is automatically retrieved by MyTardis. Over 15 Microscopes at Monash Micro Imaging are integrated. This is being done for gene sequencers (UoM), nanofabrication (RMIT), MRI (UQ) and more.
  • 30. Case Study: RMIT Cloud HPC Provider Processing On NeCTAR Cloud HRMC Web App Results and Analysis in MyTardis ● Complex high performance computing both on cloud infrastructure and queue-based systems ● Simple web interface to start complex MapReduce runs with parameter sweeps (Hybrid Reverse Monte Carlo simulations) ● Results come back with analysis: graphs auto-generated
  • 31. Manual Curation Upload data from anywhere.. Button and Drag’n’Drop Code (github.com/steveandroulakis/mytardis-uploader)
  • 33. Research Data Management is possible ● Time-consuming to store, describe and organise data ● Manual effort required to store and describe data ● Poor storage media ● Costly storage ● Hard to share data with collaborators (especially external to institution) ● Hard to access data ● Hard to open data to the public and cite it in a journal
  • 34. Coming Soon.. ● Deploy a scalable MyTardis on a cloud (like OpenStack or Amazon EC2) in a few commands (using SaltStack). ● Mount MyTardis on the file system to browse and access your data. ● An instrument integration app is being produced with a double-click installer for simple Instrument data to MyTardis (works on Windows!). ● Go to mytardis.org for more news and information. ● Thanks! ● Contact Me: Steve Androulakis (Bioinformatics Manager), steve.androulakis@monash. edu