SlideShare a Scribd company logo
CCAMBIO and the mARS
project
Anton Van de Putte
CCAMBIO Annual Meeting
12 may 2014
Microbial Antarctic
Resources System
An information system dedicated to facilitate the
discovery, access and analysis of geo-
referenced,
molecular microbial diversity (meta)data generated
by Antarctic researchers, in an Open fashion.
What’s happened so far
• mARS Workshop hosted at the Belgian Science Policy
Office (BELSPO, Brussels) in May 2012
• mARS Workshop held during the SCAR Open
Science Conference (Portland, OR) in July 2012
• Technical mARS Workshop hosted at the Université
Libre de Bruxelles in December 2013
• Initiate the development of the database and
webplatform
Near future planning
• mARS Workshop held during the SCAR Open
Science Conference (Auckland, NZ) on 27 august
2014
• Present a proof of concept of the dataindrastructure to
be used for mARS
The Vision
4 incremental steps
Step 1
: Data description and
discovery
Integrated Publishing Toolkit
(IPT)
Step 2: Habitat and
Microbial Sequence
Metadata Entry
MiMARKS and mARS Sequence Set
Te m p l a t e
Step 3: Georeferenced-
molecular sequence database
integration
Step 4: Processing batch
sequence data
Circum-Antarctic microbial
diversity
Standard Operating
Procedure
How to get started
Getting Data into mARS
• Requires that
• Data is accessible in a public a public repository
(Genbank, IMG-M or other web accessible)
• 2 additional metadata files
• MiMARKS
• Microbial Sequence spreadsheet
0. Before you start
• 1. Clearly Identify your needs
• You have a project that you would like to register
with mARS
• no sequence data or environmental data at this
point: skip Steps 1, 2, 4 and 7
• environmental data, but no publicly available
sequences yet, follow all Steps below, but do not
enter Sequence IDs in the forms.
• environmental data, and publicly available
sequences. Follow all Steps below.
0. Before you start
• Send an email to request a username and password
from the IPT administrator
0. Before you start
• Send an email to request a username and password
from the IPT administrator
• Make a copy of the MiMarks Googlesheet from the
RDP MiMarks Googlesheet (click on “Make copy” from
the “File” menu).
0. Before you start
• Send an email to request a username and password
from the IPT administrator
• Make a copy of the MiMarks Googlesheet from the
RDP MiMarks Googlesheet (click on “Make copy” from
the “File” menu).
• Make a copy of the Microbial Sequence Set from the
mARS Googlesheet (click on “Make copy” from the
“File” menu).
1. Prepare your MiMarks
spreadsheet• In the MiMarks Googlesheet you’ve created in step 0,
fill in your environmental metadata details using the
“Google Documents” interface, following the
instructions available from the MiMarks Googlesheet
documentation at RDP. Example files are available
from the mARS website.
• In the header for each column that will hold your
sequence set data, list the unique identifier of your
sequence set.
• Once you are finished, download your spreadsheet as
a CSV (Comma-separated Values) file on your
computer.
2. Prepare your Microbial
Sequence Set spreadsheet
• In the Microbial Sequence Set Googlesheet you’ve
created in step 0, fill all the fields (replace the
examples available from the Googlesheet)
• Once you are finished, download your spreadsheet as
a CSV (Comma-separated Values) file on your
computer.
3. Describe your data in the
IPT
• Login the IPT using your credentials:
• Use the form at the bottom of the “Manage Resource”
page to create a new resource. Provide a unique
"shortname" for your dataset.
• Click the “Create” button. You will arrive on the
Resource Management page.
• Click on the “Edit” button in the Metadata section on
the left and fill in the details for the different metadata
sections. A detailed instructions are available from IPT
quick reference guide. Hint: mention your grant
number in the “Project Data” section, to allow us to link
your resource to relevant projects in the GCMD/AMD.
4. Upload your MiMarks and
Microbial Sequence Set
• 1. In your IPT session, from your Resource
Management page, click on the “Choose file” button in
the “Source data” section on the left of the page.
• 2. Point to your completed MiMarks CSV, and click on
“Choose”
• 3. Click on the “add” button in the “Source data”
section on the left of the page then click on the “Save”
button on the bottom. Your MiMarks CSV file is now
uploaded on the IPT.
4. Upload your MiMarks and
Microbial Sequence Set
• 5. From your Resource Management page, click on
the “Choose file” button in the “Source data” section on
the left of the page.
• 6. Point to your completed Microbial Sequence Set
CSV, and click on “Choose”
• 7. Click on the “add” button in the Source data section
on the left of the page, then click on the “Save” button.
Your Microbial Sequence Set CSV file is now uploaded
on the IPT.
5. Publish and register your
data
• From your Resource Management page, click on the
“Publish” button in the “Published release” section on
the left of the page. Do not worry when you see a
warning message “Source data or Darwin Core
mappings missing. No data archive generated
• By default, your resource’s visibility is set to “Private”.
To allow your resource to become visible on the IPT
for all users, click on the “Public” button in the
“Visibility” section.
• Request one of the administrators to “Register” your
dataset.

More Related Content

Similar to Ccambio annual meeting 2014

Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
Sarah Anna Stewart
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Big Data Value Association
 
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
DataBench
 
Database Security – Issues and Best PracticesOutline
Database Security – Issues and Best PracticesOutlineDatabase Security – Issues and Best PracticesOutline
Database Security – Issues and Best PracticesOutline
OllieShoresna
 
Citrination tutorial
Citrination tutorialCitrination tutorial
Citrination tutorial
Joshua Tappan
 
Doing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics EnvironmentDoing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics Environment
Tasktop
 
Dr Di Liu - BOLD Mirror Setup
Dr Di Liu - BOLD Mirror SetupDr Di Liu - BOLD Mirror Setup
Dr Di Liu - BOLD Mirror Setup
Consortium for the Barcode of Life (CBOL)
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014ALTER WAY
 
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
Denodo
 
IR-GUIDE
IR-GUIDEIR-GUIDE
IR-GUIDE
Hilton Gibson
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
CSUC - Consorci de Serveis Universitaris de Catalunya
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
ECHOES (Empowering Communities with a Heritage Open Ecosystem)
 
DP-203T00 Microsoft Azure Data Engineering-08.pptx
DP-203T00 Microsoft Azure Data Engineering-08.pptxDP-203T00 Microsoft Azure Data Engineering-08.pptx
DP-203T00 Microsoft Azure Data Engineering-08.pptx
ssuser45b0e7
 
WireCloud, WStore and WMarket
WireCloud, WStore and WMarketWireCloud, WStore and WMarket
WireCloud, WStore and WMarket
Aitor Magán García
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGENeeraj Goswami
 
BioCASE web services for germplasm data sets, at FAO, Rome (2006)
BioCASE web services for germplasm data sets, at FAO, Rome (2006)BioCASE web services for germplasm data sets, at FAO, Rome (2006)
BioCASE web services for germplasm data sets, at FAO, Rome (2006)
Dag Endresen
 
Stream Processing in SmartNews #jawsdays
Stream Processing in SmartNews #jawsdaysStream Processing in SmartNews #jawsdays
Stream Processing in SmartNews #jawsdays
SmartNews, Inc.
 
MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
Maurício Aniche
 
Bioschemas Workshop
Bioschemas WorkshopBioschemas Workshop
Bioschemas Workshop
Niall Beard
 

Similar to Ccambio annual meeting 2014 (20)

Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
 
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
 
Database Security – Issues and Best PracticesOutline
Database Security – Issues and Best PracticesOutlineDatabase Security – Issues and Best PracticesOutline
Database Security – Issues and Best PracticesOutline
 
Citrination tutorial
Citrination tutorialCitrination tutorial
Citrination tutorial
 
Doing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics EnvironmentDoing Analytics Right - Building the Analytics Environment
Doing Analytics Right - Building the Analytics Environment
 
Dr Di Liu - BOLD Mirror Setup
Dr Di Liu - BOLD Mirror SetupDr Di Liu - BOLD Mirror Setup
Dr Di Liu - BOLD Mirror Setup
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
 
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
Myth Busters: I’m Building a Data Lake, So I Don’t Need Data Virtualization (...
 
IR-GUIDE
IR-GUIDEIR-GUIDE
IR-GUIDE
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
 
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
Technical Challenges and Approaches to Build an Open Ecosystem of Heterogeneo...
 
DP-203T00 Microsoft Azure Data Engineering-08.pptx
DP-203T00 Microsoft Azure Data Engineering-08.pptxDP-203T00 Microsoft Azure Data Engineering-08.pptx
DP-203T00 Microsoft Azure Data Engineering-08.pptx
 
WireCloud, WStore and WMarket
WireCloud, WStore and WMarketWireCloud, WStore and WMarket
WireCloud, WStore and WMarket
 
DATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGEDATA MINING TOOL- ORANGE
DATA MINING TOOL- ORANGE
 
BioCASE web services for germplasm data sets, at FAO, Rome (2006)
BioCASE web services for germplasm data sets, at FAO, Rome (2006)BioCASE web services for germplasm data sets, at FAO, Rome (2006)
BioCASE web services for germplasm data sets, at FAO, Rome (2006)
 
Stream Processing in SmartNews #jawsdays
Stream Processing in SmartNews #jawsdaysStream Processing in SmartNews #jawsdays
Stream Processing in SmartNews #jawsdays
 
J0212065068
J0212065068J0212065068
J0212065068
 
MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
MetricMiner: Supporting Researchers in Mining Software Repositories - SCAM 2013
 
Bioschemas Workshop
Bioschemas WorkshopBioschemas Workshop
Bioschemas Workshop
 

More from Anton Van de Putte

The data behind the Biogeographic Atlas of the Southern Ocean
The data behind the Biogeographic Atlas of the Southern OceanThe data behind the Biogeographic Atlas of the Southern Ocean
The data behind the Biogeographic Atlas of the Southern Ocean
Anton Van de Putte
 
Antarctic Biodiversity Portal Virtual Research Environment
Antarctic Biodiversity Portal Virtual Research EnvironmentAntarctic Biodiversity Portal Virtual Research Environment
Antarctic Biodiversity Portal Virtual Research Environment
Anton Van de Putte
 
Energetic Value of Zooplankton and Nekton of the Southern Ocean: A Review
Energetic Value of Zooplankton and Nekton of the Southern Ocean: A ReviewEnergetic Value of Zooplankton and Nekton of the Southern Ocean: A Review
Energetic Value of Zooplankton and Nekton of the Southern Ocean: A Review
Anton Van de Putte
 
Antarctic Observatory System
Antarctic Observatory SystemAntarctic Observatory System
Antarctic Observatory System
Anton Van de Putte
 
The Antarctic Master Directory, sharing Antarctic (meta)data from multiple di...
The Antarctic Master Directory, sharing Antarctic (meta)data from multiple di...The Antarctic Master Directory, sharing Antarctic (meta)data from multiple di...
The Antarctic Master Directory, sharing Antarctic (meta)data from multiple di...
Anton Van de Putte
 
Biodiversity informatics for Polar Regions - how to transform data into knowl...
Biodiversity informatics for Polar Regions - how to transform data into knowl...Biodiversity informatics for Polar Regions - how to transform data into knowl...
Biodiversity informatics for Polar Regions - how to transform data into knowl...
Anton Van de Putte
 
Marine Lifewatch meeting 3-5 June 2014
Marine Lifewatch meeting 3-5 June 2014Marine Lifewatch meeting 3-5 June 2014
Marine Lifewatch meeting 3-5 June 2014
Anton Van de Putte
 
SCAR Data Management and Policy
SCAR Data Management and PolicySCAR Data Management and Policy
SCAR Data Management and Policy
Anton Van de Putte
 
Van de Putte & Danis
Van de Putte & DanisVan de Putte & Danis
Van de Putte & Danis
Anton Van de Putte
 

More from Anton Van de Putte (9)

The data behind the Biogeographic Atlas of the Southern Ocean
The data behind the Biogeographic Atlas of the Southern OceanThe data behind the Biogeographic Atlas of the Southern Ocean
The data behind the Biogeographic Atlas of the Southern Ocean
 
Antarctic Biodiversity Portal Virtual Research Environment
Antarctic Biodiversity Portal Virtual Research EnvironmentAntarctic Biodiversity Portal Virtual Research Environment
Antarctic Biodiversity Portal Virtual Research Environment
 
Energetic Value of Zooplankton and Nekton of the Southern Ocean: A Review
Energetic Value of Zooplankton and Nekton of the Southern Ocean: A ReviewEnergetic Value of Zooplankton and Nekton of the Southern Ocean: A Review
Energetic Value of Zooplankton and Nekton of the Southern Ocean: A Review
 
Antarctic Observatory System
Antarctic Observatory SystemAntarctic Observatory System
Antarctic Observatory System
 
The Antarctic Master Directory, sharing Antarctic (meta)data from multiple di...
The Antarctic Master Directory, sharing Antarctic (meta)data from multiple di...The Antarctic Master Directory, sharing Antarctic (meta)data from multiple di...
The Antarctic Master Directory, sharing Antarctic (meta)data from multiple di...
 
Biodiversity informatics for Polar Regions - how to transform data into knowl...
Biodiversity informatics for Polar Regions - how to transform data into knowl...Biodiversity informatics for Polar Regions - how to transform data into knowl...
Biodiversity informatics for Polar Regions - how to transform data into knowl...
 
Marine Lifewatch meeting 3-5 June 2014
Marine Lifewatch meeting 3-5 June 2014Marine Lifewatch meeting 3-5 June 2014
Marine Lifewatch meeting 3-5 June 2014
 
SCAR Data Management and Policy
SCAR Data Management and PolicySCAR Data Management and Policy
SCAR Data Management and Policy
 
Van de Putte & Danis
Van de Putte & DanisVan de Putte & Danis
Van de Putte & Danis
 

Recently uploaded

Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 

Recently uploaded (20)

Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 

Ccambio annual meeting 2014

  • 1. CCAMBIO and the mARS project Anton Van de Putte CCAMBIO Annual Meeting 12 may 2014
  • 2. Microbial Antarctic Resources System An information system dedicated to facilitate the discovery, access and analysis of geo- referenced, molecular microbial diversity (meta)data generated by Antarctic researchers, in an Open fashion.
  • 3. What’s happened so far • mARS Workshop hosted at the Belgian Science Policy Office (BELSPO, Brussels) in May 2012 • mARS Workshop held during the SCAR Open Science Conference (Portland, OR) in July 2012 • Technical mARS Workshop hosted at the Université Libre de Bruxelles in December 2013 • Initiate the development of the database and webplatform
  • 4. Near future planning • mARS Workshop held during the SCAR Open Science Conference (Auckland, NZ) on 27 august 2014 • Present a proof of concept of the dataindrastructure to be used for mARS
  • 6. Step 1 : Data description and discovery Integrated Publishing Toolkit (IPT)
  • 7. Step 2: Habitat and Microbial Sequence Metadata Entry MiMARKS and mARS Sequence Set Te m p l a t e
  • 8. Step 3: Georeferenced- molecular sequence database integration
  • 9. Step 4: Processing batch sequence data Circum-Antarctic microbial diversity
  • 11. Getting Data into mARS • Requires that • Data is accessible in a public a public repository (Genbank, IMG-M or other web accessible) • 2 additional metadata files • MiMARKS • Microbial Sequence spreadsheet
  • 12. 0. Before you start • 1. Clearly Identify your needs • You have a project that you would like to register with mARS • no sequence data or environmental data at this point: skip Steps 1, 2, 4 and 7 • environmental data, but no publicly available sequences yet, follow all Steps below, but do not enter Sequence IDs in the forms. • environmental data, and publicly available sequences. Follow all Steps below.
  • 13. 0. Before you start • Send an email to request a username and password from the IPT administrator
  • 14.
  • 15. 0. Before you start • Send an email to request a username and password from the IPT administrator • Make a copy of the MiMarks Googlesheet from the RDP MiMarks Googlesheet (click on “Make copy” from the “File” menu).
  • 16.
  • 17. 0. Before you start • Send an email to request a username and password from the IPT administrator • Make a copy of the MiMarks Googlesheet from the RDP MiMarks Googlesheet (click on “Make copy” from the “File” menu). • Make a copy of the Microbial Sequence Set from the mARS Googlesheet (click on “Make copy” from the “File” menu).
  • 18.
  • 19. 1. Prepare your MiMarks spreadsheet• In the MiMarks Googlesheet you’ve created in step 0, fill in your environmental metadata details using the “Google Documents” interface, following the instructions available from the MiMarks Googlesheet documentation at RDP. Example files are available from the mARS website. • In the header for each column that will hold your sequence set data, list the unique identifier of your sequence set. • Once you are finished, download your spreadsheet as a CSV (Comma-separated Values) file on your computer.
  • 20.
  • 21. 2. Prepare your Microbial Sequence Set spreadsheet • In the Microbial Sequence Set Googlesheet you’ve created in step 0, fill all the fields (replace the examples available from the Googlesheet) • Once you are finished, download your spreadsheet as a CSV (Comma-separated Values) file on your computer.
  • 22.
  • 23. 3. Describe your data in the IPT • Login the IPT using your credentials: • Use the form at the bottom of the “Manage Resource” page to create a new resource. Provide a unique "shortname" for your dataset. • Click the “Create” button. You will arrive on the Resource Management page. • Click on the “Edit” button in the Metadata section on the left and fill in the details for the different metadata sections. A detailed instructions are available from IPT quick reference guide. Hint: mention your grant number in the “Project Data” section, to allow us to link your resource to relevant projects in the GCMD/AMD.
  • 24. 4. Upload your MiMarks and Microbial Sequence Set • 1. In your IPT session, from your Resource Management page, click on the “Choose file” button in the “Source data” section on the left of the page. • 2. Point to your completed MiMarks CSV, and click on “Choose” • 3. Click on the “add” button in the “Source data” section on the left of the page then click on the “Save” button on the bottom. Your MiMarks CSV file is now uploaded on the IPT.
  • 25. 4. Upload your MiMarks and Microbial Sequence Set • 5. From your Resource Management page, click on the “Choose file” button in the “Source data” section on the left of the page. • 6. Point to your completed Microbial Sequence Set CSV, and click on “Choose” • 7. Click on the “add” button in the Source data section on the left of the page, then click on the “Save” button. Your Microbial Sequence Set CSV file is now uploaded on the IPT.
  • 26. 5. Publish and register your data • From your Resource Management page, click on the “Publish” button in the “Published release” section on the left of the page. Do not worry when you see a warning message “Source data or Darwin Core mappings missing. No data archive generated • By default, your resource’s visibility is set to “Private”. To allow your resource to become visible on the IPT for all users, click on the “Public” button in the “Visibility” section. • Request one of the administrators to “Register” your dataset.

Editor's Notes

  1. This step will capture information about molecular microbial diversity research efforts that are being or have been conducted by the Antarctic research community. The results of step 1 will facilitate communication and collaboration, augment comparative biodiversity studies, and provide a legacy- discoverable resource to advance science, conservation awareness and management. The scope of the information that can be entered in the IPT encompasses present, past, or future studies involving marker gene surveys (e.g.16S or 18S rRNA, functional genes), or meta “omic” projects from natural samples in Antarctic habitats, enrichment or pure culture efforts
  2. Secondly, users will be invited to upload habitat and molecular methods-specific (meta)data pertaining to the samples and the related sequencing data (including accession numbers) using standardized accessible on the mARS website. These templates can readily be shared with your collaborators and it works with GenBank submission tools (Sequin and WebIN). Used together, and uploaded with the corresponding IPT metadata entry (as described in Step 1), these templates will describe geo-referenced physiochemical information that relates to Antarctic microbial diversity studies as well as the matching sequencing information.
  3. In this step, sequence data files produced by different technologies (e.g. Sanger sequencing, 454, Illumina, Ion Torrent) will be linked back to the relevant entries as described in steps 1 and 2. mARSwill provide indexed searching capabilities and geo-server links to DNA sequence data from Antarctic studies that have been deposited in public repositories, providing rapid access to this information through the biodiversity.aqdata portal. There is currently no exhaustive resource that provides this level of information from a geo-referenced perspective. The Antarctic scientific community is actively engaged in molecular icrobial diversity and genomic surveys in both terrestrial and marine realms. mARSprovides a unique resource to harness this information.
  4. As the primary mandate of biodiversity.aqis to provide the scientific community access to Antarctic diversity information, biodiversity.aq staff will process the microbial diversity information referenced in mARS for selected, highly used regions of marker genes (for each domain of life) generated through both Sanger sequencing studies and NGS efforts in order to provide the users with a window into the microbial diversity present in Antarctica.
  5. This SOP details how you can upload (meta)data to mARS. To ensure this procedure only has to been carried out once, the mARS team has devoted special care to following widely-used standards for biodiversity data, as promoted by the Global Biodiversity Information Facility and the Genomics Standards Consortium. In this particular case, this SOP is built around two main types of standards, namely DarwinCore and MiMarks,ensuring maximal interoperability with internationally-recognized data and metadata repositories.