SlideShare a Scribd company logo
1 of 22
Advancing Research at
London’s Global University
Clare Gryce

Daniel Hanlon

Research IT Services, UCL
London’s Global University
Consistently ranked in word’s top 10 Universities
Alumnae include 20 Nobel prize winners
Founded 1826; first to admit without regard to race or
religion, women on equal terms as men
8000 staff, 25% from 84 countries outside UK
~25,000 students (over 1/3 postgrads)
Annual turnover > £900 million
> £15m dependent on University HPC
Highly multi-disciplinary
Delivering a Culture of Wisdom
“UCL is London’s research powerhouse, with a commitment to enhancing the
lives of people in the capital, the UK and around the world. Our academics
have breadth and depth of expertise across the entire range of academic
disciplines. Individually, they expand our understanding of the world;
collectively and collaboratively, they deliver analysis that addresses the major
challenges facing humanity” Professor David Price, UCL Vice-Provost
(Research)
•  UCL Grand Challenges
– Impact
•  UCL Research Frontiers
– Enquiry and ‘curiosity’
Underpinning Research
•  Research
IT Services
•  Established
June 2012
•  Support and
enablement
across research
lifecycle
IRIS
HPC
Collaboration
tools
Research DataRPS
UCL Discovery
Research Data
Research IT Services
•  Investment in people and expertise
–  Research software development initiative
–  Comprehensive training programme
•  Collaboration and Innovation
–  e-Infrastructure South
–  UK Research Data community
–  Vendor partnership
Innovation and Skills
UCL’s case for Research Data Initiative
•  Many departments and research groups with long
history of excellence
–  Lost opportunity for new research building on old
•  UCL is a highly multi-disciplinary institution
–  Lost opportunity for cross-disciplinary re-use
•  Unmanaged datasets are lost datasets
–  Increasing burden to researchers
–  Backup, Failing USB HDDs, AuthN problems
All carrots, no sticks
•  Research Data is an offering
•  Projects can and will opt out
•  Remove the burden of managing storage
•  Resiliency is better than backup
•  Remove burden of compliance with
Research Council requirements
All carrots, no (hardly any) sticks
•  Research Data is an offering
•  Projects can and will opt out
•  Remove the burden of managing storage
•  Resiliency is better than backup
•  Remove burden of compliance with
Research Council requirements
Requirements capture
•  Researchers – We want…
–  Everything! but more! faster! and shinier!
–  NFS, CIFS, scp, GridFTP, Cloud (Dro***x)
•  Institution – Solution must have…
–  Low admin overhead
–  Cheap (cost per TB)
–  High density
Architecture choices
•  Simple
–  start with tried and trusted
–  challenge the Big Data hysteria
•  Strong abstractions
–  avoiding lock-in to proprietary technology
•  Hedge bets
–  Build in migration between storage solutions
•  Project-based
Separate Live and Archive
•  Live
–  Mutable
–  Address current requirement
–  Private
•  Archive
–  Immutable
–  Exploitable
–  Transfer of responsibility
Metadata
•  Registration
–  Principle Investigator
–  Project name
–  Members
–  Dates
–  Funder
•  Enrichment for Archive
–  Domain specific
iRODS
http://www.irods.org
•  Metadata store
•  Bridge between different areas and storage types
–  Conventional storage
–  Object storage
–  Tape
•  Metadata store
–  Possibility for enrichment during live phase
WOS
•  Simple Object Store
–  REST API
–  Policies
–  Geographical redundancy
•  Highly scalable… >10 PB deployments
•  Low admin overhead
•  Native iRODS connector
WOS Access
•  NAS presentation
–  expose WOS as CIFS or NFS mount points
–  HA and backed up in WOS
•  Open architecture based on Open Source projects
–  easy to build upon
GPFS – IBM’s General Parallel FileSystem
•  The conventional choice
–  POSIX filesystem
–  High performance, parallel transfers
•  Connect with UCL’s existing HPC resources
–  Multi-cluster around UCL
•  Many options for exports
–  Native GPFS, samba, scp, cNFS
–  Non-trivial to manage interactions/locking issues
Cloud – “The storage that dare not speak its name”
•  Very widely used in academia (unofficially)
–  need to satisfy a very real requirement
•  Rapidly evolving
–  standards needed (not S3!)
•  Oxygen Cloud
–  Local storage
–  Local authN
–  Local files can be stubs or fully synchronised
Current state of play
•  Number of projects
–  11
•  Number of users
–  22
•  Volume of data
–  35TB
•  Continuing to deploy and slowly
expand users
Project
Registration
Manual
Process
Smaller initial
deployment but to
scale with phase II
Storage
connector
Cloud
infrastructure
AD/Moonshot
infrastructure
AuthN
connector
Client applications
mirror local files in
to RD live storage
Create
project
group
E-mail
with
cloud
access
details
DDN SFA12K
iCAT
Metadata
store
E-mail
with
campus
access
details
Legion
CS cluster
etc...
GPFS/NFS/
Samba/ssh/
iRODS via PAM
Store high level
project metadata
Mounted access via
GPFS/CIFS/NFS/
WebDAV
sshd
(gridftpd)
Storage resiliency
>10Gb network
Asynchronous
access via
scp, iRODS
Policy driven
replication,
encryption and
backup as required
with project-level
granularity
DDN WOS
Challenges
•  Authentication
–  Multiple access mechanisms with single AuthN
–  Cross-University collaboration (Moonshot)
•  Networking
–  central storage vs local NAS device
–  poor connectivity to some departments
•  Multiple access mechanisms
–  Multiple views…
Clare Gryce c.gryce@ucl.ac.uk
Daniel Hanlon d.hanlon@ucl.ac.uk

More Related Content

What's hot

The Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from SwedenThe Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from Sweden
Marcus Smith
 
LOCKSS UK, with a focus on reporting experience
LOCKSS UK, with a focus on reporting experienceLOCKSS UK, with a focus on reporting experience
LOCKSS UK, with a focus on reporting experience
Chris Rusbridge
 

What's hot (20)

Ariadne overview
Ariadne overviewAriadne overview
Ariadne overview
 
The Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from SwedenThe Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from Sweden
 
How Worthy is DSpace for Digital Libraries
How Worthy is DSpace for Digital LibrariesHow Worthy is DSpace for Digital Libraries
How Worthy is DSpace for Digital Libraries
 
UKSG Conference 2015 - Digital preservation: we know what it means today, but...
UKSG Conference 2015 - Digital preservation: we know what it means today, but...UKSG Conference 2015 - Digital preservation: we know what it means today, but...
UKSG Conference 2015 - Digital preservation: we know what it means today, but...
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?
 
Report: Archivematica hosting in the cloud
Report: Archivematica hosting in the cloudReport: Archivematica hosting in the cloud
Report: Archivematica hosting in the cloud
 
Linked Data
Linked DataLinked Data
Linked Data
 
RLG Partnership Update Webinar Slides
RLG Partnership Update Webinar SlidesRLG Partnership Update Webinar Slides
RLG Partnership Update Webinar Slides
 
Wikimedia 재단과 MediaWiki 위키 소프트웨어 조사
Wikimedia 재단과 MediaWiki 위키 소프트웨어 조사Wikimedia 재단과 MediaWiki 위키 소프트웨어 조사
Wikimedia 재단과 MediaWiki 위키 소프트웨어 조사
 
SAICSIT 2011 Postgraduate Symposium Presentation
SAICSIT 2011 Postgraduate Symposium PresentationSAICSIT 2011 Postgraduate Symposium Presentation
SAICSIT 2011 Postgraduate Symposium Presentation
 
The Islandora Preservation Framework
The Islandora Preservation FrameworkThe Islandora Preservation Framework
The Islandora Preservation Framework
 
Open Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked DataOpen Science Days 2014 - Becker - Repositories and Linked Data
Open Science Days 2014 - Becker - Repositories and Linked Data
 
Introducing SUL
Introducing SULIntroducing SUL
Introducing SUL
 
VanDyck Long-Term Preservation of Digital Scholarly Literature
VanDyck Long-Term Preservation of Digital Scholarly LiteratureVanDyck Long-Term Preservation of Digital Scholarly Literature
VanDyck Long-Term Preservation of Digital Scholarly Literature
 
Advocating Open Access: Before, during and after HEFCE
Advocating Open Access: Before, during and after HEFCEAdvocating Open Access: Before, during and after HEFCE
Advocating Open Access: Before, during and after HEFCE
 
LOCKSS UK, with a focus on reporting experience
LOCKSS UK, with a focus on reporting experienceLOCKSS UK, with a focus on reporting experience
LOCKSS UK, with a focus on reporting experience
 
OCLC Linked Data Progress
OCLC Linked Data ProgressOCLC Linked Data Progress
OCLC Linked Data Progress
 
Using CERIF-based CRIS to support the academic and research community: emergi...
Using CERIF-based CRIS to support the academic and research community: emergi...Using CERIF-based CRIS to support the academic and research community: emergi...
Using CERIF-based CRIS to support the academic and research community: emergi...
 
Jabes 2010 - Session plénière "Les bibliothèques sur un nuage"
Jabes 2010 - Session plénière "Les bibliothèques sur un nuage"Jabes 2010 - Session plénière "Les bibliothèques sur un nuage"
Jabes 2010 - Session plénière "Les bibliothèques sur un nuage"
 
BeOpen odoo partner
BeOpen odoo partnerBeOpen odoo partner
BeOpen odoo partner
 

Viewers also liked

Viewers also liked (6)

Beyond Cross-­Device: Latest innovations in cross­-device, micro­-segmentatio...
Beyond Cross-­Device: Latest innovations in cross­-device, micro­-segmentatio...Beyond Cross-­Device: Latest innovations in cross­-device, micro­-segmentatio...
Beyond Cross-­Device: Latest innovations in cross­-device, micro­-segmentatio...
 
いったいなんぼなら.Pdf
いったいなんぼなら.Pdfいったいなんぼなら.Pdf
いったいなんぼなら.Pdf
 
SGI HPC Update for June 2013
SGI HPC Update for June 2013SGI HPC Update for June 2013
SGI HPC Update for June 2013
 
El futuro del targeting digital
El futuro del targeting digitalEl futuro del targeting digital
El futuro del targeting digital
 
Sgi Hpc Day Kiev 2009 10 Uv
Sgi Hpc Day Kiev 2009 10 UvSgi Hpc Day Kiev 2009 10 Uv
Sgi Hpc Day Kiev 2009 10 Uv
 
De Datos a Insight: Descubriendo Historias “Data-Driven”
De Datos a Insight: Descubriendo Historias “Data-Driven”De Datos a Insight: Descubriendo Historias “Data-Driven”
De Datos a Insight: Descubriendo Historias “Data-Driven”
 

Similar to Advancing Research at London's Global University

Cro presentation for library jan13v2
Cro presentation for library jan13v2Cro presentation for library jan13v2
Cro presentation for library jan13v2
NeilStewartCity
 
Desktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDesktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omics
David Wallom
 

Similar to Advancing Research at London's Global University (20)

Virtualization for HPC at NCI
Virtualization for HPC at NCIVirtualization for HPC at NCI
Virtualization for HPC at NCI
 
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12Research Cyberinfrastructure at UCSD - David Minor - RDAP12
Research Cyberinfrastructure at UCSD - David Minor - RDAP12
 
Cro presentation for library jan13v2
Cro presentation for library jan13v2Cro presentation for library jan13v2
Cro presentation for library jan13v2
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
 
Desktop as a Service supporting Environmental 'Omics
Desktop as a Service supporting Environmental 'OmicsDesktop as a Service supporting Environmental 'Omics
Desktop as a Service supporting Environmental 'Omics
 
การประยุกต์ใช้ DSpace Open Source ในการจัดการความรู้ขององค์กร
การประยุกต์ใช้ DSpace Open Source ในการจัดการความรู้ขององค์กรการประยุกต์ใช้ DSpace Open Source ในการจัดการความรู้ขององค์กร
การประยุกต์ใช้ DSpace Open Source ในการจัดการความรู้ขององค์กร
 
Supporting Research through "Desktop as a Service" models of e-infrastructure...
Supporting Research through "Desktop as a Service" models of e-infrastructure...Supporting Research through "Desktop as a Service" models of e-infrastructure...
Supporting Research through "Desktop as a Service" models of e-infrastructure...
 
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah Bard
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah BardUsing Containers and HPC to Solve the Mysteries of the Universe by Deborah Bard
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah Bard
 
Desktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDesktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omics
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...
Research data zone: veilige en geoptimaliseerde netwerkomgeving voor onderzoe...
 
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS ProgramLots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
Lots of LOCKSS Keeping Stuff Safe: The Future of the LOCKSS Program
 
Oct 15 NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan ...
Oct 15 NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan ...Oct 15 NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan ...
Oct 15 NISO Webinar: 21st Century Resource Sharing: Which Inter-Library Loan ...
 
Impact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and EducationImpact of Covid-19 on Learning and Education
Impact of Covid-19 on Learning and Education
 
e-infrastructural needs to support informatics
e-infrastructural needs to support informaticse-infrastructural needs to support informatics
e-infrastructural needs to support informatics
 
Hadoop ecosystem for health/life sciences
Hadoop ecosystem for health/life sciencesHadoop ecosystem for health/life sciences
Hadoop ecosystem for health/life sciences
 
Leaving the Ivory Tower: Research in the Real World
Leaving the Ivory Tower: Research in the Real WorldLeaving the Ivory Tower: Research in the Real World
Leaving the Ivory Tower: Research in the Real World
 
A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...A collaborative approach to "filling the digital preservation gap" for Resear...
A collaborative approach to "filling the digital preservation gap" for Resear...
 
CLIMB talk in the Virtual Laboratories session at the RCUK Cloud Working Grou...
CLIMB talk in the Virtual Laboratories session at the RCUK Cloud Working Grou...CLIMB talk in the Virtual Laboratories session at the RCUK Cloud Working Grou...
CLIMB talk in the Virtual Laboratories session at the RCUK Cloud Working Grou...
 

More from inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Advancing Research at London's Global University

  • 1. Advancing Research at London’s Global University Clare Gryce
 Daniel Hanlon
 Research IT Services, UCL
  • 2. London’s Global University Consistently ranked in word’s top 10 Universities Alumnae include 20 Nobel prize winners Founded 1826; first to admit without regard to race or religion, women on equal terms as men 8000 staff, 25% from 84 countries outside UK ~25,000 students (over 1/3 postgrads) Annual turnover > £900 million > £15m dependent on University HPC Highly multi-disciplinary
  • 3. Delivering a Culture of Wisdom “UCL is London’s research powerhouse, with a commitment to enhancing the lives of people in the capital, the UK and around the world. Our academics have breadth and depth of expertise across the entire range of academic disciplines. Individually, they expand our understanding of the world; collectively and collaboratively, they deliver analysis that addresses the major challenges facing humanity” Professor David Price, UCL Vice-Provost (Research) •  UCL Grand Challenges – Impact •  UCL Research Frontiers – Enquiry and ‘curiosity’
  • 4. Underpinning Research •  Research IT Services •  Established June 2012 •  Support and enablement across research lifecycle IRIS HPC Collaboration tools Research DataRPS UCL Discovery Research Data
  • 5. Research IT Services •  Investment in people and expertise –  Research software development initiative –  Comprehensive training programme •  Collaboration and Innovation –  e-Infrastructure South –  UK Research Data community –  Vendor partnership Innovation and Skills
  • 6. UCL’s case for Research Data Initiative •  Many departments and research groups with long history of excellence –  Lost opportunity for new research building on old •  UCL is a highly multi-disciplinary institution –  Lost opportunity for cross-disciplinary re-use •  Unmanaged datasets are lost datasets –  Increasing burden to researchers –  Backup, Failing USB HDDs, AuthN problems
  • 7. All carrots, no sticks •  Research Data is an offering •  Projects can and will opt out •  Remove the burden of managing storage •  Resiliency is better than backup •  Remove burden of compliance with Research Council requirements
  • 8. All carrots, no (hardly any) sticks •  Research Data is an offering •  Projects can and will opt out •  Remove the burden of managing storage •  Resiliency is better than backup •  Remove burden of compliance with Research Council requirements
  • 9. Requirements capture •  Researchers – We want… –  Everything! but more! faster! and shinier! –  NFS, CIFS, scp, GridFTP, Cloud (Dro***x) •  Institution – Solution must have… –  Low admin overhead –  Cheap (cost per TB) –  High density
  • 10. Architecture choices •  Simple –  start with tried and trusted –  challenge the Big Data hysteria •  Strong abstractions –  avoiding lock-in to proprietary technology •  Hedge bets –  Build in migration between storage solutions •  Project-based
  • 11. Separate Live and Archive •  Live –  Mutable –  Address current requirement –  Private •  Archive –  Immutable –  Exploitable –  Transfer of responsibility
  • 12. Metadata •  Registration –  Principle Investigator –  Project name –  Members –  Dates –  Funder •  Enrichment for Archive –  Domain specific
  • 13. iRODS http://www.irods.org •  Metadata store •  Bridge between different areas and storage types –  Conventional storage –  Object storage –  Tape •  Metadata store –  Possibility for enrichment during live phase
  • 14. WOS •  Simple Object Store –  REST API –  Policies –  Geographical redundancy •  Highly scalable… >10 PB deployments •  Low admin overhead •  Native iRODS connector
  • 15. WOS Access •  NAS presentation –  expose WOS as CIFS or NFS mount points –  HA and backed up in WOS •  Open architecture based on Open Source projects –  easy to build upon
  • 16. GPFS – IBM’s General Parallel FileSystem •  The conventional choice –  POSIX filesystem –  High performance, parallel transfers •  Connect with UCL’s existing HPC resources –  Multi-cluster around UCL •  Many options for exports –  Native GPFS, samba, scp, cNFS –  Non-trivial to manage interactions/locking issues
  • 17. Cloud – “The storage that dare not speak its name” •  Very widely used in academia (unofficially) –  need to satisfy a very real requirement •  Rapidly evolving –  standards needed (not S3!) •  Oxygen Cloud –  Local storage –  Local authN –  Local files can be stubs or fully synchronised
  • 18. Current state of play •  Number of projects –  11 •  Number of users –  22 •  Volume of data –  35TB •  Continuing to deploy and slowly expand users
  • 19. Project Registration Manual Process Smaller initial deployment but to scale with phase II Storage connector Cloud infrastructure AD/Moonshot infrastructure AuthN connector Client applications mirror local files in to RD live storage Create project group E-mail with cloud access details DDN SFA12K iCAT Metadata store E-mail with campus access details Legion CS cluster etc... GPFS/NFS/ Samba/ssh/ iRODS via PAM Store high level project metadata Mounted access via GPFS/CIFS/NFS/ WebDAV sshd (gridftpd) Storage resiliency >10Gb network Asynchronous access via scp, iRODS Policy driven replication, encryption and backup as required with project-level granularity DDN WOS
  • 20. Challenges •  Authentication –  Multiple access mechanisms with single AuthN –  Cross-University collaboration (Moonshot) •  Networking –  central storage vs local NAS device –  poor connectivity to some departments •  Multiple access mechanisms –  Multiple views…
  • 21.
  • 22. Clare Gryce c.gryce@ucl.ac.uk Daniel Hanlon d.hanlon@ucl.ac.uk