Data accessibility and challenges
Jyoti Khadake
24th October 2016
EMBL-ABR workshop
  
	
  
Data life cycle
The life cycle of data depends on project aims and purpose.
Planning / project design
Finding/creating the data
Extracting, Transforming and Loading
Processing
Analyzing data – information – publication
Data associated with study can be reused
  
[Diagram labels: Planning, Generating, Reliability, Ownership, Metadata, Versioning, Standardisation, Quality, Publishing]
  
Data access and data sharing
• What do we expect when we access data?
• What do we expect when we share data?
• These are two sides of the same coin
  
Open access data policy
• Data created from research are valuable resources that can be used and reused for future scientific and educational purposes. Sharing data facilitates new scientific inquiry, avoids duplicate data collection and provides real-life resources for education and training.
OR
• Publicly funded research data should be, as far as possible, openly available to the scientific community.
  
What does this achieve?
• Encourages scientific enquiry and debate
• Promotes innovation and potential new data uses
• New collaborations between users and creators of data
• Maximises transparency and accountability
• Enables scrutiny of research findings
• Encourages improvement and validation of research findings
• Reduces cost of duplicating data collection
• Increases visibility of research
• Provides direct credit to the researcher
• Research outcomes for education and training
  
Encouraged by
• Research funders: under guidance from the OECD, funders have developed data sharing policies that allow researchers time for exclusive use of data for a limited period, with a mandate to publish at the end of the agreed period. This can be done via repositories or data centers. The funders also require a data management and sharing plan.
• Journals: data that forms the basis of a publication needs to be shared or deposited within an accessible database or repository.
• Initiatives like the DataCite registry assign unique digital object identifiers (DOIs) to research data, helping scientists make data discoverable, citable and traceable, so that research data as well as publications based on those data form part of the scientific output.
• Use of metadata-dependent URIs to identify and share data
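As an illustration of how a DOI makes a dataset citable, the sketch below turns a DOI into a resolvable URL through the https://doi.org/ resolver and formats a simple data citation. The dataset, its DOI (10.1234/example.5678), the `Dataset` class and the helper names are all hypothetical, and the "Creator (Year). Title. Publisher. Identifier" layout is only one common pattern, not a format prescribed by these slides.

```python
from dataclasses import dataclass
from typing import List

DOI_RESOLVER = "https://doi.org/"


@dataclass
class Dataset:
    """Minimal descriptive fields needed to cite a dataset (illustrative only)."""
    creators: List[str]
    year: int
    title: str
    publisher: str
    doi: str


def doi_url(doi: str) -> str:
    """Return a resolvable URL for a DOI, tolerating a leading 'doi:' prefix."""
    doi = doi.strip()
    if doi.lower().startswith("doi:"):
        doi = doi[4:].strip()
    return DOI_RESOLVER + doi


def cite(ds: Dataset) -> str:
    """Format a citation as: Creator (Year). Title. Publisher. Identifier."""
    authors = "; ".join(ds.creators)
    return f"{authors} ({ds.year}). {ds.title}. {ds.publisher}. {doi_url(ds.doi)}"


# Hypothetical dataset; the DOI below is a made-up example, not a real record.
example = Dataset(
    creators=["Doe, J.", "Roe, R."],
    year=2016,
    title="Example protein interaction dataset",
    publisher="Example Repository",
    doi="doi:10.1234/example.5678",
)
print(cite(example))
```

Because the identifier, not the hosting location, is what gets cited, the same citation stays valid if the repository later moves the files.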
  
How to share / access data
• Specialist data centers, archives or data banks
• Journal, to support publication
• Institutional repository
• Online via project or institutional website
• Informally between researchers on a peer-to-peer basis
  
	
  
URI identifies data
  
Advantages of depositing data with a data center or repository
• Assurance that data meets set standards
• Long-term preservation in a standardised, accessible data format, with format conversion when software is upgraded
• Safe keeping, with attribution, in a secure environment
• Regular data backup
• Online resource discovery through catalogues
• Access in popular formats
• Licensing arrangement to acknowledge data rights
• Standardised citation mechanism to acknowledge data ownership
• Promotion of data to many users
• Monitoring secondary usage of data
• Management of access to data and user queries on behalf of the data owner

So we need to share data
&
Shared data is available to us
  
What affects Sharing/Accessing data
Size of data and compute
Community developed data standards
Existing repositories or storage facilities
Nature of data
Appropriate data tracking and governance
Key management points
Metadata
  
	
  
Size of data
Decides what kind of storage/archival is used
Cloud storage
OK for data that does not go into terabytes or does not have restrictions
Cost implications
Available as DaaS, SaaS, PaaS, IaaS
Static storage: Cluster based computing/storage
  Geographical restrictions
  Provides compute for analysis since big data does not move.
  Good access control?
  
Compute for analysis
• Once there is data access, a decision needs to be made on how much compute is required for analysis.
• Cloud based solutions are available for small scale data
• Data centers like Aimes allow for compute on clusters
• Institute/repository may provide HPC as well as software for analysis
  
Community developed data standards
An active collaborative community is essential for development of community standards

The standards are required for
  format/s for data storage/exchange
  vocabulary for data representation

Absence of Community standards?

Catalogues can be found at:
  http://www.ebi.ac.uk/ols/index
  http://bioportal.bioontology.org/
  
Existing data repositories/storage
• Topic specific repositories will give maximum exposure to the data / access to relevant data
• Issue with multiple repositories – collaborative approaches to repositories, e.g. RCSB for structure data
• Absence of repositories ??
• http://datacite.org/repolist
• http://databib.org
  
Nature of data
• This decides whether the data can be open access or controlled access.
• There may be further geographical restrictions on the data.
• If controlled access is required, there is a need for development of Data Access Agreements & Application Forms.
• Management of the access control
  
Approaches to secure access
• DAC controlled access, with or without monitoring
• Highly controlled access where only analysis results can be taken away - DataSHIELD
  
Roles and responsibilities
Particularly important where sensitive data, personal data or patent data are involved.
Appropriate consents and ethics need to be in place.
Sometimes only processed, anonymized data can be used.
• Requires the establishment of DAC and MC
– Manages applications
– Approves applications
– Manages access
– Manages destruction of data if required
  
Data governance
  
Data management planning
• Plan ahead to create high-quality and sustainable data that can be shared
• This will need checking periodically to see that the plan still meets requirements
Available resources:
https://dmponline.dcc.ac.uk
http://www.mrc.ac.uk/documents/doc/data-management-plan-template/
  
Data cycle
  
Metadata
• What is metadata?
– Documentation and description associated with data
– Required to make sense of the data, e.g. description of variables, classification scheme, dates and project
There are metadata standards
E.g. Dublin Core, Darwin Core, OECD minimal data set, AGROVOC
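To make the idea of a metadata standard concrete, here is a minimal sketch of describing a dataset with a few Dublin Core elements and serialising them as XML. The element names (title, creator, date, description, format, identifier, rights) and the namespace URI come from the Dublin Core element set; the dataset values, the `metadata` wrapper element and the `dublin_core_record` helper are hypothetical and chosen only for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set, version 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def dublin_core_record(fields: dict) -> str:
    """Serialise a mapping of Dublin Core element names to values as XML.

    A value may be a single string or a list of strings (repeated element).
    """
    root = ET.Element("metadata")  # wrapper element name is an assumption
    for name, value in fields.items():
        values = value if isinstance(value, list) else [value]
        for item in values:
            element = ET.SubElement(root, f"{{{DC_NS}}}{name}")
            element.text = str(item)
    return ET.tostring(root, encoding="unicode")


# Hypothetical dataset description; every value here is made up.
record = dublin_core_record({
    "title": "Example growth measurements",
    "creator": ["Doe, J.", "Roe, R."],
    "date": "2016-10-24",
    "description": "Variables, units and classification scheme are described in the README.",
    "format": "text/csv",
    "identifier": "https://doi.org/10.1234/example.5678",
    "rights": "CC BY 4.0",
})
print(record)
```

Keeping the description in a standard vocabulary like this is what lets catalogues harvest and index records from many repositories in the same way.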
  
Formatting your data
• Different formats are good for different purposes
• Open formats adopted by the community are more sustainable, e.g. rtf, tif, wav, xml, csv
• Proprietary and/or compressed formats that have widespread use, e.g. doc, jpg, mp3, gzip
• Organising files and folders
• Quality assurance
• Version control and authenticity, transcription
Available resources
  
Storing your data
• Keep your digital data safe, secure and recoverable
• Make at least 2 backups
• Institutional back-up policies
• Manage backups: snapshots, integrity, recoverability
• Data storage strategy
• Data security
• Security of personal data
• Data destruction / disposal
• Data transmission and encryption
• File sharing and collaborative environment
  - email, Dropbox, ftp, encrypted media, file store, VREs ..
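One way to check the integrity and recoverability of a backup, mentioned in the list above, is to compare checksums of the original files against the copy. The following is a minimal sketch under that assumption: it uses SHA-256 from the Python standard library, and the directory layout, file names and function names are hypothetical.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict:
    """Map each file's path (relative to root) to its checksum."""
    return {
        str(p.relative_to(root)): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def verify_backup(original: Path, backup: Path) -> list:
    """Return relative paths that are missing from the backup or differ."""
    expected = build_manifest(original)
    actual = build_manifest(backup)
    return [name for name, digest in expected.items() if actual.get(name) != digest]


# Demonstration on throwaway directories (all names are made up).
with tempfile.TemporaryDirectory() as tmp:
    source = Path(tmp) / "data"
    copy = Path(tmp) / "backup"
    source.mkdir()
    (source / "results.csv").write_text("sample,value\nA,1\n")
    (source / "README.txt").write_text("Hypothetical study files\n")
    shutil.copytree(source, copy)

    clean_report = verify_backup(source, copy)        # nothing differs yet
    (copy / "results.csv").write_text("sample,value\nA,9\n")  # simulate corruption
    damaged_report = verify_backup(source, copy)      # the altered file is reported

print(clean_report, damaged_report)
```

Storing the manifest alongside each backup would also allow a later check that the backup itself has not changed over time.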
  
Institutional backup/storage
Institutes are required to provide storage of data.
Make sure you allocate funds for this when you write the proposal.
  
[Diagram labels: Planning, Generating, Reliability, Ownership, Metadata, Versioning, Standardisation, Quality, Publishing, Archiving, Destroy]
  
Resources for archiving data
• Dryad: an international repository of data underlying peer-reviewed articles in the basic and applied biosciences.
• The Dataverse Network: an open source application to publish, share, reference, extract and analyze research data. (Harvard)
  
Destroy data
• Physical destruction
• Overwriting
• Demagnetising the storage
• Disc destruction
• Purging the printers and other devices
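The overwriting option above can be sketched as follows: replace a file's contents with random bytes, force the write to disk, then delete the file. This is only an illustration with a hypothetical helper name. Overwriting in place does not guarantee erasure on SSDs, copy-on-write or journaling file systems, or where snapshots and backups still hold the data, so institutional disposal policy should take precedence.

```python
import os
import tempfile


def overwrite_and_delete(path: str, passes: int = 1, chunk_size: int = 1 << 20) -> None:
    """Overwrite a file with random bytes, flush it to disk, then remove it."""
    size = os.path.getsize(path)
    with open(path, "r+b") as handle:
        for _ in range(passes):
            handle.seek(0)
            remaining = size
            while remaining > 0:
                count = min(chunk_size, remaining)
                handle.write(os.urandom(count))
                remaining -= count
            handle.flush()
            os.fsync(handle.fileno())  # ask the OS to push the data to the device
    os.remove(path)


# Demonstration on a throwaway temporary file.
fd, demo_path = tempfile.mkstemp()
os.write(fd, b"sensitive example content")
os.close(fd)
overwrite_and_delete(demo_path)
removed = not os.path.exists(demo_path)
print(removed)
```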
  
Best Practices
• Make a DMP
• Use standard vocabulary
• Standardised format
• Check institutional policy for data storage and exchange
• Check funders' policy for data exchange
• Check legal constraints and requirements.
• Make data available under a DAA
• Written policy for retention and disposal of data
• Safe and secure sharing of data
  
Strategies for centers
• Provide management framework for researchers
Some sources are:
UK Data Archive
Boston University
Melbourne
Data Curation Center
  
Improve Data Access
  

More Related Content

What's hot

Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
ERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarFAIRDOM
 
Data Management Planning for Engineers
Data Management Planning for EngineersData Management Planning for Engineers
Data Management Planning for EngineersSherry Lake
 
Guidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access PlansGuidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access PlansICPSR
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciencesSarah Jones
 
Research support-challenges
Research support-challengesResearch support-challenges
Research support-challengesSarah Jones
 
EPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEDINA, University of Edinburgh
 
H2020 Open Research Data pilot
H2020 Open Research Data pilotH2020 Open Research Data pilot
H2020 Open Research Data pilotSarah Jones
 
Horizon 2020 and the open research data pilot
Horizon 2020 and the open research data pilotHorizon 2020 and the open research data pilot
Horizon 2020 and the open research data pilotSarah Jones
 
Developing metadata curation processes for data that can’t be shared openly
Developing metadata curation processes for data that can’t be shared openlyDeveloping metadata curation processes for data that can’t be shared openly
Developing metadata curation processes for data that can’t be shared openlyRebecca Grant
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesLouise Corti
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolkfear
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
H2020 Open Data Pilot
H2020 Open Data PilotH2020 Open Data Pilot
H2020 Open Data PilotSarah Jones
 
Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Sarah Jones
 
Digital curation for postgraduate students
Digital curation for postgraduate studentsDigital curation for postgraduate students
Digital curation for postgraduate studentsSarah Jones
 

What's hot (20)

Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
ERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management Webinar
 
Data Management Planning for Engineers
Data Management Planning for EngineersData Management Planning for Engineers
Data Management Planning for Engineers
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
Guidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access PlansGuidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access Plans
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
 
Research support-challenges
Research support-challengesResearch support-challenges
Research support-challenges
 
EPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasets
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
 
H2020 Open Research Data pilot
H2020 Open Research Data pilotH2020 Open Research Data pilot
H2020 Open Research Data pilot
 
Supporting-DMPs
Supporting-DMPsSupporting-DMPs
Supporting-DMPs
 
Digital Curation 101 - Taster
Digital Curation 101 - TasterDigital Curation 101 - Taster
Digital Curation 101 - Taster
 
Horizon 2020 and the open research data pilot
Horizon 2020 and the open research data pilotHorizon 2020 and the open research data pilot
Horizon 2020 and the open research data pilot
 
Developing metadata curation processes for data that can’t be shared openly
Developing metadata curation processes for data that can’t be shared openlyDeveloping metadata curation processes for data that can’t be shared openly
Developing metadata curation processes for data that can’t be shared openly
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
H2020 Open Data Pilot
H2020 Open Data PilotH2020 Open Data Pilot
H2020 Open Data Pilot
 
Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Dc101 oxford sj_16062010
Dc101 oxford sj_16062010
 
Digital curation for postgraduate students
Digital curation for postgraduate studentsDigital curation for postgraduate students
Digital curation for postgraduate students
 

Viewers also liked

Case study for agile software development:
Case study for agile software development: Case study for agile software development:
Case study for agile software development: Joe Crespo
 
El puente avanzapormas-com
El puente avanzapormas-comEl puente avanzapormas-com
El puente avanzapormas-com"LesCarin"
 
Practicar la misericordia en el cuidado de la casa común
Practicar la misericordia en el cuidado de la casa comúnPracticar la misericordia en el cuidado de la casa común
Practicar la misericordia en el cuidado de la casa comúnfranfrater
 
New media careers
New media careersNew media careers
New media careersputlocker66
 
Grup novalians] tasca 2
Grup novalians] tasca 2Grup novalians] tasca 2
Grup novalians] tasca 2novalians
 
Reprise de Chambord Prestige
Reprise de Chambord PrestigeReprise de Chambord Prestige
Reprise de Chambord PrestigeEric Blondeau
 
port for metu-4
port for metu-4port for metu-4
port for metu-4Duygu Ocal
 
Photo oct 15, 10 09 51 am
Photo oct 15, 10 09 51 amPhoto oct 15, 10 09 51 am
Photo oct 15, 10 09 51 amTahira Sands
 
Discurso an[1]
Discurso an[1]Discurso an[1]
Discurso an[1]DjMaoPozo
 
Installar un paquete_rpm_linux
Installar un paquete_rpm_linuxInstallar un paquete_rpm_linux
Installar un paquete_rpm_linuxJames Jara
 
Horari abril juliol 14 públic
Horari abril juliol 14 públicHorari abril juliol 14 públic
Horari abril juliol 14 públicÒmnia Blanes
 
Natural slightly wavy hair, deep colors, Silky and soft human hair
Natural slightly wavy hair, deep colors, Silky and soft human hairNatural slightly wavy hair, deep colors, Silky and soft human hair
Natural slightly wavy hair, deep colors, Silky and soft human hairEastern Hair
 
Grup lul·lianes] tasca 2
Grup lul·lianes] tasca 2Grup lul·lianes] tasca 2
Grup lul·lianes] tasca 222lulianes
 

Viewers also liked (20)

Yahoo
YahooYahoo
Yahoo
 
resumen
resumen resumen
resumen
 
Case study for agile software development:
Case study for agile software development: Case study for agile software development:
Case study for agile software development:
 
El puente avanzapormas-com
El puente avanzapormas-comEl puente avanzapormas-com
El puente avanzapormas-com
 
Silencio
SilencioSilencio
Silencio
 
Practicar la misericordia en el cuidado de la casa común
Practicar la misericordia en el cuidado de la casa comúnPracticar la misericordia en el cuidado de la casa común
Practicar la misericordia en el cuidado de la casa común
 
New media careers
New media careersNew media careers
New media careers
 
Grup novalians] tasca 2
Grup novalians] tasca 2Grup novalians] tasca 2
Grup novalians] tasca 2
 
Reprise de Chambord Prestige
Reprise de Chambord PrestigeReprise de Chambord Prestige
Reprise de Chambord Prestige
 
port for metu-4
port for metu-4port for metu-4
port for metu-4
 
slzkq.pdf
slzkq.pdfslzkq.pdf
slzkq.pdf
 
BCS - Excellent Cust Serv
BCS - Excellent Cust ServBCS - Excellent Cust Serv
BCS - Excellent Cust Serv
 
Photo oct 15, 10 09 51 am
Photo oct 15, 10 09 51 amPhoto oct 15, 10 09 51 am
Photo oct 15, 10 09 51 am
 
ztabv.pdf
ztabv.pdfztabv.pdf
ztabv.pdf
 
Discurso an[1]
Discurso an[1]Discurso an[1]
Discurso an[1]
 
Installar un paquete_rpm_linux
Installar un paquete_rpm_linuxInstallar un paquete_rpm_linux
Installar un paquete_rpm_linux
 
Horari abril juliol 14 públic
Horari abril juliol 14 públicHorari abril juliol 14 públic
Horari abril juliol 14 públic
 
Natural slightly wavy hair, deep colors, Silky and soft human hair
Natural slightly wavy hair, deep colors, Silky and soft human hairNatural slightly wavy hair, deep colors, Silky and soft human hair
Natural slightly wavy hair, deep colors, Silky and soft human hair
 
Tugas 2
Tugas 2Tugas 2
Tugas 2
 
Grup lul·lianes] tasca 2
Grup lul·lianes] tasca 2Grup lul·lianes] tasca 2
Grup lul·lianes] tasca 2
 

Similar to Data accessibilityandchallenges

Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATOpenAIRE
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Leeds
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationHistoric Environment Scotland
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationEDINA, University of Edinburgh
 
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...dkNET
 
FAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR WebinarFAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR WebinarPeter McQuilton
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Managementdancrane_open
 
RDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRobin Rice
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareHistoric Environment Scotland
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing dataWorld Agroforestry (ICRAF)
 
Creating a Data Management Plan for your Research
Creating a Data Management Plan for your ResearchCreating a Data Management Plan for your Research
Creating a Data Management Plan for your ResearchRobin Rice
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...BigData_Europe
 

Similar to Data accessibilityandchallenges (20)

Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
Shareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your ResearchShareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your Research
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
 
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023? New NIH Data Management and Sha...
 
FAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR WebinarFAIRsharing - ENVRI-FAIR Webinar
FAIRsharing - ENVRI-FAIR Webinar
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
RDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the DataRDM Roadmap to the Future, or: Lords and Ladies of the Data
RDM Roadmap to the Future, or: Lords and Ladies of the Data
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
 
Research methods group accelarating impact by sharing data
Research methods group  accelarating impact by sharing dataResearch methods group  accelarating impact by sharing data
Research methods group accelarating impact by sharing data
 
Creating a Data Management Plan for your Research
Creating a Data Management Plan for your ResearchCreating a Data Management Plan for your Research
Creating a Data Management Plan for your Research
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
 

Data accessibility and challenges

  • 1. Data accessibility and challenges
    Jyoti Khadake
    24th October 2016
    EMBL-ABR workshop
  • 2. Data life cycle
    The life cycle of data depends on project aims and purpose.
    Planning / project design
    Finding / creating the data
    Extracting, transforming and loading
    Processing
    Analyzing data – information – publication
    Data associated with a study can be reused
  • 3. Planning
    Generating/
    Reliability
    Ownership
    Metadata
    Versioning
    Standardisation
    Quality
    Publishing
  • 4. Data access and data sharing
    • What do we expect when we access data?
    • What do we expect when we share data?
    • These are two sides of the same coin
  • 5. Open access data policy
    • Data created from research are valuable resources that can be used and reused for future scientific and educational purposes. Sharing data facilitates new scientific inquiry, avoids duplicate data collection and provides real-life resources for education and training.
    OR
    • Publicly funded research data should be, as far as possible, openly available to the scientific community.
  • 6. What does this achieve
    • Encourages scientific enquiry and debate
    • Promotes innovation and potential new data uses
    • New collaborations between users and creators of data
    • Maximises transparency and accountability
    • Enables scrutiny of research findings
    • Encourages improvement and validation of research findings
    • Reduces cost of duplicating data collection
    • Increases visibility of research
    • Provides direct credit to the researcher
    • Research outcomes for education and training
  • 7. Encouraged by
    • Research funders: under guidance from the OECD, funders have developed data sharing policies that allow researchers exclusive use of data for a limited time, with a mandate to publish at the end of the agreed period. This can be done via repositories or data centers. Funders also require a data management and sharing plan.
    • Journals: data that form the basis of a publication need to be shared or deposited in an accessible database or repository.
    • Initiatives like the DataCite registry assign unique digital object identifiers (DOIs) to research data, helping scientists make data discoverable, citable and traceable, so that research data as well as publications based on those data form part of the scientific output.
    • Use of metadata-dependent URIs to identify and share data
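As a rough illustration of how a DOI makes a dataset resolvable and citable, the sketch here normalises a DOI, builds its resolver URL and formats a citation. The function names, the example DOI `10.1234/example-dataset` and all field values are hypothetical. The only external facts assumed are that DOIs resolve through https://doi.org/ and that a "Creator (Year). Title. Publisher. Identifier" layout is a common pattern for data citation; a repository may prescribe a different one.

```python
import re

# Loose syntactic check only: "10.", a registrant code, a slash, then a suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")

# Common ways a DOI is written; all are reduced to the bare identifier.
_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalise_doi(raw: str) -> str:
    """Return the bare DOI (e.g. '10.1234/abc') or raise ValueError."""
    doi = raw.strip()
    for prefix in _PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    if not DOI_PATTERN.match(doi):
        raise ValueError(f"not a plausible DOI: {raw!r}")
    return doi


def doi_url(raw: str) -> str:
    """Build the resolver URL that a reader can follow to the dataset."""
    return "https://doi.org/" + normalise_doi(raw)


def cite_dataset(creator: str, year: int, title: str, publisher: str, doi: str) -> str:
    """Format a simple data citation (layout is an assumption, see lead-in)."""
    return f"{creator} ({year}). {title}. {publisher}. {doi_url(doi)}"


if __name__ == "__main__":
    # Hypothetical dataset, for illustration only.
    print(cite_dataset("Doe, J.", 2016, "Example dataset",
                       "Example Repository", "doi:10.1234/example-dataset"))
```

Keeping the identifier separate from the resolver URL means the citation stays valid even if the repository that hosts the data moves.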
  • 8. How to share / access data
    • Specialist data centers, archives or data banks
    • Journal to support publication
    • Institutional repository
    • Online via project or institutional website
    • Informally between researchers on a peer-to-peer basis
    URI identifies data
  • 9. Advantages of depositing data with a data center or repository
    • Assurance that data meets set standards
    • Long-term preservation in a standardised, accessible data format, with format conversion when software is upgraded
    • Safekeeping with attribution in a secure environment
    • Regular data backup
    • Online resource discovery through catalogues
    • Access in popular formats
    • Licensing arrangement to acknowledge data rights
    • Standardised citation mechanism to acknowledge data ownership
    • Promotion of data to many users
    • Monitoring secondary usage of data
    • Management of access to data and user queries on behalf of the data owner
  • 10. So we need to share data
    &
    Shared data is available to us
  • 11. What affects sharing / accessing data
    Size of data and compute
    Community-developed data standards
    Existing repositories or storage facilities
    Nature of data
    Appropriate data tracking and governance
    Key management points
    Metadata
  • 12. Size of data
    Decides what kind of storage / archival is used.
    Cloud storage
    • OK for data that does not go into terabytes or does not have restrictions
    • Cost implications
    • Available as DaaS, SaaS, PaaS, IaaS
    Static storage: cluster-based computing / storage
    • Geographical restrictions
    • Provides compute for analysis, since big data does not move
    • Good access control?
  • 13. Compute for analysis
    • Once there is data access, a decision needs to be made on how much compute is required for analysis.
    • Cloud-based solutions are available for small-scale data
    • Data centers like Aimes allow for compute on clusters
    • The institute / repository may provide HPC as well as software for analysis
  • 14. Community-developed data standards
    An active collaborative community is essential for the development of community standards.
    The standards are required for:
    • format(s) for data storage / exchange
    • vocabulary for data representation
    Absence of community standards?
    Catalogues can be found at:
    http://www.ebi.ac.uk/ols/index
    http://bioportal.bioontology.org/
  • 15. Existing data repositories / storage
    • Topic-specific repositories will give maximum exposure to the data / access to relevant data
    • Issue with multiple repositories: collaborative approaches to repositories, e.g. RCSB for structure data
    • Absence of repositories?
    • http://datacite.org/repolist
    • http://databib.org
  • 16. Nature of data
    • This decides whether the data can be open access or controlled access.
    • There may be further geographical restrictions on the data.
    • If controlled access is required, Data Access Agreements and Application Forms need to be developed.
    • Management of the access control
  • 17. Approaches to secure access
    • DAC (Data Access Committee) controlled access, with or without monitoring
    • Highly controlled access where only analysis results can be taken away: DataSHIELD
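The idea of "only analysis results can be taken away" can be sketched in a few lines. This is not DataSHIELD's own interface (DataSHIELD is an R-based system); it is a minimal Python illustration under stated assumptions: the function name `safe_mean`, the `DisclosureError` exception and the minimum group size of 5 are all hypothetical choices made for this example.

```python
from statistics import mean

# Assumed disclosure threshold: refuse to summarise very small groups,
# because a summary of one or two records can reveal individual values.
MIN_GROUP_SIZE = 5


class DisclosureError(Exception):
    """Raised when a request would expose too few records."""


def safe_mean(records, field, min_n=MIN_GROUP_SIZE):
    """Return only an aggregate for `field`; row-level values never leave."""
    values = [r[field] for r in records if r.get(field) is not None]
    if len(values) < min_n:
        raise DisclosureError(
            f"only {len(values)} usable records for {field!r}; need at least {min_n}"
        )
    return {"field": field, "n": len(values), "mean": mean(values)}


if __name__ == "__main__":
    # Hypothetical records held inside the secure environment.
    held = [{"age": a} for a in (30, 40, 50, 60, 70)]
    print(safe_mean(held, "age"))      # aggregate only
    try:
        safe_mean(held[:2], "age")     # too few records: refused
    except DisclosureError as err:
        print("refused:", err)
```

The analyst receives a count and a mean, never the underlying rows, and small groups are refused outright.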
  • 18. Roles and responsibilities
    Particularly important where sensitive data, personal data or patent data are involved.
    Appropriate consents and ethics approval need to be in place.
    Sometimes only processed, anonymized data can be used.
    • Requires the establishment of a DAC and MC, which:
      – Manages applications
      – Approves applications
      – Manages access
      – Manages destruction of data if required
  • 20. Data management planning
    • Plan ahead to create high-quality and sustainable data that can be shared
    • The plan will need checking periodically to see that it still meets requirements
    Available resources:
    https://dmponline.dcc.ac.uk
    http://www.mrc.ac.uk/documents/doc/data-management-plan-template/
  • 22. Metadata
    • What is metadata?
      – Documentation and description associated with data
      – Required to make sense of the data, e.g. description of variables, classification scheme, dates and project
    There are metadata standards, e.g. Dublin Core, Darwin Core, OECD minimal data set, AGROVOC
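To make the Dublin Core mention concrete, here is a minimal sketch that writes a small metadata record as XML using the 15 elements of the Dublin Core element set and its standard namespace. The wrapping `metadata` root element, the function name and every value in the example record are assumptions for illustration; real repositories usually define their own envelope and required fields.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set, version 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)

# The 15 elements defined by the Dublin Core element set.
DC_ELEMENTS = {
    "title", "creator", "subject", "description", "publisher",
    "contributor", "date", "type", "format", "identifier",
    "source", "language", "relation", "coverage", "rights",
}


def dublin_core_xml(record: dict) -> str:
    """Serialise a dict of Dublin Core fields; list values repeat the element."""
    root = ET.Element("metadata")  # hypothetical wrapper element
    for element, value in record.items():
        if element not in DC_ELEMENTS:
            raise ValueError(f"not a Dublin Core element: {element!r}")
        values = value if isinstance(value, (list, tuple)) else [value]
        for item in values:
            child = ET.SubElement(root, f"{{{DC_NS}}}{element}")
            child.text = str(item)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical record describing a dataset.
    print(dublin_core_xml({
        "title": "Example survey dataset",
        "creator": ["Doe, J.", "Roe, R."],
        "date": "2016-10-24",
        "format": "text/csv",
        "description": "Variables: age (years), site (code list v2).",
    }))
```

Rejecting unknown element names early is a deliberate choice: a misspelled field is easier to fix at creation time than after deposit.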
  • 23.
  • 24. Formatting your data
    • Different formats are good for different purposes
    • Open formats adopted by the community are more sustainable, e.g. rtf, tif, wav, xml, csv
    • Proprietary and/or compressed formats that have widespread use, e.g. doc, jpg, mp3, gzip
    • Organising files and folders
    • Quality assurance
    • Version control and authenticity, transcription
    Available resources
  • 25.
  • 26. Storing your data
    • Keep your digital data safe, secure and recoverable
    • Make at least two backups
    • Institutional back-up policies
    • Manage backups: snapshots, integrity, recoverability
    • Data storage strategy
    • Data security
    • Security of personal data
    • Data destruction / disposal
    • Data transmission and encryption
    • File sharing and collaborative environments: email, Dropbox, FTP, encrypted media, file store, VREs ...
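One way to manage backup integrity and recoverability is to record a checksum for every file and re-check each backup copy against that record. The sketch here does this with SHA-256 from the Python standard library; the function names and the manifest layout (relative path mapped to hex digest) are assumptions for this example, not a prescribed tool.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large data files do not fill memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root):
    """Map each file's path (relative to root) to its SHA-256 digest."""
    root = Path(root)
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def verify_backup(manifest, backup_root):
    """Return a list of (relative_path, problem) pairs; empty means intact."""
    backup_root = Path(backup_root)
    problems = []
    for rel, expected in manifest.items():
        candidate = backup_root / rel
        if not candidate.is_file():
            problems.append((rel, "missing"))
        elif sha256_of(candidate) != expected:
            problems.append((rel, "checksum mismatch"))
    return problems


if __name__ == "__main__":
    # Throwaway directories standing in for the working copy and a backup.
    with tempfile.TemporaryDirectory() as src, tempfile.TemporaryDirectory() as bak:
        (Path(src) / "results.csv").write_text("x,y\n1,2\n")
        (Path(bak) / "results.csv").write_text("x,y\n1,2\n")
        manifest = build_manifest(src)
        print(verify_backup(manifest, bak))  # no problems reported
```

Running the verification on a schedule, against each of the backup copies, turns "we have backups" into "we have backups we know we can restore".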
  • 27. Institutional backup / storage
    Institutes are required to provide storage for data.
    Make sure you allocate funds for this when you write a proposal.
  • 28. Planning
    Generating/
    Reliability
    Ownership
    Metadata
    Versioning
    Standardisation
    Quality
    Publishing
    Archiving
    Destroy
  • 29. Resources for archiving data
    • Dryad: an international repository of data underlying peer-reviewed articles in the basic and applied biosciences.
    • The Dataverse Network: an open source application to publish, share, reference, extract and analyze research data. (Harvard)
  • 30. Destroy data
    • Physical destruction
    • Overwriting
    • Demagnetising the storage
    • Disc destruction
    • Purging the printers and other devices
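The "overwriting" option can be sketched as follows: replace the file's bytes with random data, force the write to disk, then delete the file. The function name and the single-pass default are assumptions for this example. Note the limitation: on SSDs, on journaling or copy-on-write file systems, and wherever snapshots or backups exist, overwriting in place does not guarantee that earlier copies of the data are gone, which is why the physical methods in the list remain relevant.

```python
import os
import tempfile


def overwrite_and_delete(path, passes=1, chunk_size=1 << 20):
    """Overwrite a file with random bytes `passes` times, then remove it."""
    size = os.path.getsize(path)
    with open(path, "r+b") as handle:
        for _ in range(passes):
            handle.seek(0)
            remaining = size
            while remaining > 0:
                step = min(chunk_size, remaining)
                handle.write(os.urandom(step))
                remaining -= step
            handle.flush()
            os.fsync(handle.fileno())  # ask the OS to push the bytes to disk
    os.remove(path)


if __name__ == "__main__":
    # Throwaway file standing in for data scheduled for disposal.
    fd, target = tempfile.mkstemp()
    os.write(fd, b"sensitive,values\n" * 100)
    os.close(fd)
    overwrite_and_delete(target)
    print(os.path.exists(target))
```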
  • 31. Best practices
    • Make a data management plan (DMP)
    • Use standard vocabulary
    • Use a standardised format
    • Check institutional policy for data storage and exchange
    • Check the funder's policy for data exchange
    • Check legal constraints and requirements
    • Make data available under a Data Access Agreement (DAA)
    • Have a written policy for retention and disposal of data
    • Share data safely and securely
  • 32. Strategies for centers
    • Provide a management framework for researchers
    Some sources are:
    UK Data Archive
    Boston University
    Melbourne
    Data Curation Center