More Related Content

Jisc's new shared data centre

  1. 10/09/2014 Jisc’s new shared data centre Professor Paul Layzell, Principal of Royal Holloway University of London For UUK annual conference 2014
  2. 10/09/2014 Jisc’s new shared data centre Professor Martyn Harrow, chief executive officer, Jisc For UUK annual conference 2014
  3. www.luscombelab.org Nicholas Luscombe Professor of Computational Biology, UCL Senior Group Leader, Francis Crick Institute
  4. Our research Applying computational methods to genomic data to understand how genes are switched on and off.
  5. Human genome encodes ˜23,000 genes
  6. Right gene at right time and place
  7. Right gene at right time and place
  8. Switching is controlled by regulatory genes
  9. • We discovered 1,300 regulators in humans • 80% have no known target genes • Foundation for many downstream studies 33 human tissues What are the regulators? [Vaquerizas (2019) Nat Rev Genet]
  10. What do they control? predicted expression • eve gene is expressed in fly embryos • Regulator-target relationships unknown • Modelling identifies relationships [Ilsley (2013) eLife] actual expression even-skipped
  11. No hnRNP C Nonsensical transcripts hnRNP C binds pre-mRNA Alus suppressed And if it goes wrong…? • hnRNP C loss causes 1000s of nonsense transcripts • Mutations in binding sites can cause genetic disorders [Zarnack (2013) Cell]
  12. The data challenge
  13. Human genome is huge This is really hard Relationship between regulators and genes are not straightforward We still don’t know what 90% of regulators control
  14. Human genome is huge This is really hard Relationship between regulators and genes are not straightforward We still don’t know what 90% of regulators control 3 billion letters = 119 volumes!
  15. GenBank growth since 1989 # nucleotides 1989 time Datasets keep growing 1999
  16. 1999 GenBank growth since 1989 # nucleotides 1989 time Datasets keep growing 1999 2010
  17. 1999 GenBank growth since 1989 # nucleotides 1989 time Datasets keep growing 1999 2010 2010 2014
  18. How do you transport them? data transfer
  19. How do you transport them? data transfer
  20. Discovery without boundaries
  21. Francis Crick Institute • Opening 2015 • 6 partners (MRC, Cancer Research UK, Wellcome Trust, UCL, KCL, ICL) • World-class biomedical institute with interdisciplinary research • Aiming for >100 “dry" scientists
  22. Central London location
  23. A solution
  24. Shared offsite data centre Crick computing Collaborator project
  25. eMedLab Data driven discovery for Personalised Medicine
  26. eMedLab £8.9M MRC award for medical bioinformatics eMedLab infrastructure Capacity building
  27. eMedLab £8.9M MRC award for medical bioinformatics eMedLab infrastructure Capacity building Secure storage, coordination, analysis
  28. eMedLab £8.9M MRC award for medical bioinformatics eMedLab infrastructure Capacity building Secure storage, coordination, analysis Research output Clinical outcomes
  29. Partners provide unique expertise's • Interface between bioinformatics and clinic • Novel bioinformatics methods and interface with wet lab • Genomics of health and disease • Public data access and chemoinformatics
  30. eMedLab infrastructure Crick
  31. eMedLab infrastructure Crick Sanger UCL Partners EBI Secure collaborative space
  32. eMedLab infrastructure Crick Sanger UCL Partners EBI Secure collaborative space
  33. eMedLab infrastructure Crick Sanger UCL Partners EBI Secure collaborative space
  34. Infrastructure enables science Research can’t be achieved without reliable infrastructure
  35. 10/09/2014 Jisc’s new shared data centre Dr Phil Richards, Chief Innovation Officer, Jisc For UUK annual conference 2014
  36. Outline »Background »Benefits of scale »The human barriers »Partners »Thanks
  37. Background »Government focus on shared services 2011 »HEFCE Universities Modernisation Fund (UMF) › Feasibility work around shared data centres › Technical proofs of concept via Janet network »Then a pause…
  38. Benefits of scale
  39. Benefits of large scale “… construction of extremely large-scale, commodity-computer data centres at low-cost locations… uncovered the factors of 5 to 7 decrease in cost of electricity, bandwidth, operations, software and hardware at these very large economies of scale.” Armburst, Armando Fox et al., Above the Clouds, Berkeley
  40. Example industrial-scale data centres Owner Location Square feet Apple Maiden, North Carolina 500,000 Facebook Forest City, North Carolina 300,000 Amazon Dublin, Ireland 240,000 Google Hamina, Finland 300,000 HP Winyard, UK 305,000 Source: Greenpeace report ‘How dirty is your data?’, April 2011
  41. The human barriers
  42. The human barriers Distributed Data-centres Under Desks (DDUDs)
  43. The Janet network – our ‘national grid’ for big data and computing Industrial-scale data centre Industrial-scale data centre Universities and research institutions
  44. Partners
  45. PR’s candidate big themes and possible projects • Lifting the student number cap • Break replacement cycle for Student Record Systems • Open source SRS modelling student lifecycle • Backdoor to HESA for easier data entry and benchmarking • Exorcising the ‘ghost of the MAC initiative’? • MOOCs for the masses • National platform to complement FutureLearn • FutureLearn platform lite or EdX instance? • Scalable approaches to Research Data and Equipment • National site licences for commercial big data • National Kit Catalogue • Joining the big data to the meta-data • Going beyond short-term compliance • Will policy be diluted as true costs emerge?
  46. Partners
  47. Extra slides
  48. Further opportunity
  49. Further opportunity »Collaborative research data sharing »Consolidation of research computation support »Scale to £100Ms PA saving for the sector › Why does any HEI need its own data centre? › Can we all start benefitting from large scale? »Through the Janet network, we can!
  50. Thanks
  51. Thanks • Colleagues • Tim Marshall • Bob Day • Jeremy Sharpe • Dan Perry • Organisations • Hefce • Infinity
  52. Find out more… Dr Phil Richards Chief Innovation Officer p.richards@jisc.ac.uk One Castlepark Tower Hill Bristol BS2 0JA T 020 3697 5800 info@jisc.ac.uk jisc.ac.uk Except where otherwise noted, this work is licensed under CC-BY-NC-ND
  53. Panel discussion