Research data and the ANDS agenda in AustraliaAndrew Treloar
This document discusses research data and the agenda of the Australian National Data Service (ANDS) in Australia. ANDS was established in 2009 to enable Australian researchers to more easily publish, discover, access and reuse research data. It provides several national services and has funded over 200 projects. The document also outlines relevant national policies and ANDS's involvement in international organizations like the Research Data Alliance.
This document discusses how data is driving decisions in research. It notes that the amount of data being generated is growing exponentially and researchers are now in the data business. It outlines four transformations needed - from unmanaged to managed data, disconnected to connected data, invisible to findable data, and single-use to reusable data. National strategies in Australia are aiming to support these transformations through initiatives like the Australian National Data Service which provides resources and expertise to help researchers manage, connect, and enable reuse of research data.
Use and reuse: research data locally & globally #esipfedKevin Ashley
The document discusses the importance of research data reuse and the growing demands by funders for data management and sharing. It notes that properly managing and sharing research data can improve research quality, speed, and cost effectiveness. However, many researchers remain reluctant to share data due to various excuses. The document advocates for national research data infrastructure and services to support universities in meeting funder requirements and overcoming barriers to data sharing.
Slides from my panel session at Science & Innovation 2015 with STFC DiRAC, HPC Midlands, Francis Crick Institute and UCL. As we move into the expected post-election comprehensive spending review, it is a good time to take stock of some of the innovations that have helped the UK’s institutions and industry to work together to accelerate innovation whilst achieving operating efficiencies over the last few years.
In this session we hear about trend setting initiatives such as Jisc’s shared data centre and equipment sharing initiative, which makes over £200m of capital equipment available for sharing between institutions and with industry, and industrial connectivity to the UK’s Janet network.
The document discusses various ways of querying and accessing information from the web including through open data sources like Freebase, crowdsourcing platforms, public sector data reforms, and semantic web technologies like microformats and the Astoria framework. It also covers challenges around ownership and monetization of online information.
My data, your data, our data - increasing data value through reuse (Eurocris2...Kevin Ashley
My keynote talk for Eurocris2014, Rome. I make the case for reuse of research data, discuss the barriers and look at ways we are trying to overcome them.
Save money and consolidate data in one safe environment - Jisc Digital Festiv...Jisc
Making the right decision about how and where to manage your data is key to an organisation’s IT strategy. The new Jisc shared data centre has been procured to provide a cost effective environment to co-locate systems and services in one safe environment.
So whether you are supporting enterprise activities or high end research, the Jisc shared data centre can provide significant benefits to your organisation.
UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015Martin Hamilton
The document summarizes the UK's investments in e-infrastructure for research from 2011-2015. It discusses the major investments made in high performance computing (HPC), networking infrastructure, and big data projects. The investments totaled £160 million in 2011-2012, £189 million in 2012-2013, and £257 million in 2014-2015. It also summarizes the results of a survey of the UK's e-infrastructure, including details on the largest HPC systems and datasets. Finally, it mentions that the Research Councils UK (RCUK) developed a roadmap for the UK's e-infrastructure with a vision for an integrated infrastructure to support researchers.
Research data and the ANDS agenda in AustraliaAndrew Treloar
This document discusses research data and the agenda of the Australian National Data Service (ANDS) in Australia. ANDS was established in 2009 to enable Australian researchers to more easily publish, discover, access and reuse research data. It provides several national services and has funded over 200 projects. The document also outlines relevant national policies and ANDS's involvement in international organizations like the Research Data Alliance.
This document discusses how data is driving decisions in research. It notes that the amount of data being generated is growing exponentially and researchers are now in the data business. It outlines four transformations needed - from unmanaged to managed data, disconnected to connected data, invisible to findable data, and single-use to reusable data. National strategies in Australia are aiming to support these transformations through initiatives like the Australian National Data Service which provides resources and expertise to help researchers manage, connect, and enable reuse of research data.
Use and reuse: research data locally & globally #esipfedKevin Ashley
The document discusses the importance of research data reuse and the growing demands by funders for data management and sharing. It notes that properly managing and sharing research data can improve research quality, speed, and cost effectiveness. However, many researchers remain reluctant to share data due to various excuses. The document advocates for national research data infrastructure and services to support universities in meeting funder requirements and overcoming barriers to data sharing.
Slides from my panel session at Science & Innovation 2015 with STFC DiRAC, HPC Midlands, Francis Crick Institute and UCL. As we move into the expected post-election comprehensive spending review, it is a good time to take stock of some of the innovations that have helped the UK’s institutions and industry to work together to accelerate innovation whilst achieving operating efficiencies over the last few years.
In this session we hear about trend setting initiatives such as Jisc’s shared data centre and equipment sharing initiative, which makes over £200m of capital equipment available for sharing between institutions and with industry, and industrial connectivity to the UK’s Janet network.
The document discusses various ways of querying and accessing information from the web including through open data sources like Freebase, crowdsourcing platforms, public sector data reforms, and semantic web technologies like microformats and the Astoria framework. It also covers challenges around ownership and monetization of online information.
My data, your data, our data - increasing data value through reuse (Eurocris2...Kevin Ashley
My keynote talk for Eurocris2014, Rome. I make the case for reuse of research data, discuss the barriers and look at ways we are trying to overcome them.
Save money and consolidate data in one safe environment - Jisc Digital Festiv...Jisc
Making the right decision about how and where to manage your data is key to an organisation’s IT strategy. The new Jisc shared data centre has been procured to provide a cost effective environment to co-locate systems and services in one safe environment.
So whether you are supporting enterprise activities or high end research, the Jisc shared data centre can provide significant benefits to your organisation.
UK e-Infrastructure for Research - UK/USA HPC Workshop, Oxford, July 2015Martin Hamilton
The document summarizes the UK's investments in e-infrastructure for research from 2011-2015. It discusses the major investments made in high performance computing (HPC), networking infrastructure, and big data projects. The investments totaled £160 million in 2011-2012, £189 million in 2012-2013, and £257 million in 2014-2015. It also summarizes the results of a survey of the UK's e-infrastructure, including details on the largest HPC systems and datasets. Finally, it mentions that the Research Councils UK (RCUK) developed a roadmap for the UK's e-infrastructure with a vision for an integrated infrastructure to support researchers.
Big data and the dark arts - Jisc Digital Media 2015Jisc
There still remains a certain misunderstanding by the very definition of "big data" and the perceived hype around the term. This workshop clarified the concepts and give examples of relevant big data projects.
The document discusses the BioFresh project's efforts to mobilize freshwater biodiversity data. It provides the following key points:
1) BioFresh has been actively reaching out to data holders and has integrated 162 metadata entries and supported 26 data mobilization projects through its contingency fund. However, convincing data holders to contribute has been difficult due to issues like lack of funding, incompatible data formats, and scientists' time constraints.
2) Barriers to data sharing include a lack of incentives and rewards in science, commercial imperatives around data ownership, and the fact that early career researchers want to publish from their data first before sharing.
3) Mobilizing data is an interdisciplinary challenge that requires overcoming not
The slides for my talk on "HPC as a service" at the 25th anniversary Machine Evaluation Workshop in December 2014. I cover Jisc's HPC brokerage and related initiatives including our shared data centre, industry connectivity to Janet, our VAT cost sharing group, and our pilot of the Kit-Catalogue equipment sharing database.
The continued development of 3D technologies has enabled more affordable and accessible use in a wide range of teaching and research disciplines.
This workshop gave delegates a better understanding of how using 3D technologies can benefit education and research.
Jisc - Rebooting a National Innovation Agency (EUNIS 2014)Martin Hamilton
This is my presentation on "Rebooting" Jisc, from the EUNIS 2014 Congress at Umeå, Sweden. I begin by introducing Jisc, for anyone not already familiar with who we are and what we do. I highlight a few of our success stories that the EUNIS audience might not be familiar with, talk about some current projects - and how our focus and structure has changed following the Wilson Review. I close with our mission statement and vision for 2020.
The future of cloud computing - Jisc Digifest 2016Jisc
In Jisc's future of cloud computing horizon scan report, we identified three strategic areas where Jisc could support universities and colleges in moving to the cloud – cloud as a utility, app as a service, and working to build capability in cloud technologies.
Come along to this session to hear more about this work from Jisc futurist Martin Hamilton, and find out how you can get involved.
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...European Data Forum
Selected Talk by Nikolaos Loutas, Manager at PwC Belgium at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Business Models for Linked Government Data: What lies beneath?
Highlights of what is coming - presentation from Paul FeldmanJisc
Jisc plans to upgrade the Janet network to have 600gbps core capacity and use 400gbps optical technology. It will also rearchitect the regional network and explore network virtualization. Jisc will launch several new cybersecurity services including a security portal, DDoS mitigation, and a security conference. It will also offer cloud migration consulting, improve the geospatial service through a partnership, and expand the National Bibliographic Knowledgebase. Additionally, Jisc will provide various digital capability and learning analytics services.
Total cost of ownership: reducing the cost of gold open access - Jisc Digital...Jisc
Learn how Jisc Collections is addressing the cost UK higher education institutions face in maintaining subscriptions and also paying for article processing charges to the same publishers for the same journals.
The Helix Nebula initiative was presented at EGU 2013 and has continued to expand with more research organisations, providers and services. The hybrid cloud model deployed by Helix Nebula has grown to become a viable approach for provisioning ICT services for research communities from both public and commercial service providers (http://dx.doi.org/10.5281/zenodo.16001).
The relevance of this approach for all those communities facing societal challenges in explained in a recent EIROforum publication https://zenodo.org/record/34264#.V2ub4jXFn9g. This presentation, given at EGU2016, summarizes the progress of Helix Nebula and explores the essential characteristics of a European Open Science Cloud.
The document discusses how to cost data curation and research data management. It defines data curation as the active management of digital items over the long term to ensure they remain secure, discoverable and accessible. Research data management involves storing, accessing and preserving data produced from an investigation over its entire lifecycle. The document recommends using activity-based costing to identify all direct and indirect activities associated with data curation and research data management. It also emphasizes the importance of defining what is being costed and choosing an appropriate costing methodology.
Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...Kevin Ashley
This document summarizes a presentation on research data management and reuse. It discusses:
1. The Digital Curation Centre's (DCC) mission to increase research data services capabilities in UK institutions and how this is an international issue.
2. How data reuse is already occurring but could be expanded, providing benefits for research quality, speed, and costs. Proper data management can also help ensure research integrity.
3. Barriers to increased data reuse including lack of infrastructure and services in some domains, and variability in data management practices between fields. Overcoming these issues requires attention from senior researchers, librarians, and policymakers.
Research data for repository managers Kevin Ashley
A presentation given at ULCC's Institutional Repository Manager's workshop 2012 on 2012-06-15. Aimed at getting traditional repository managers to think about their role in research data management.
What can the DCC do for you? Sheffield RoadshowKevin Ashley
A description of the ways in which the Digital Curation can work with institutions to improve research data management at institutional level. Delivered at the 2nd DCC roadshow, Sheffield, 2011-03-01
JISC repositories and preservation programme: Plenary presentation 2009Kevin Ashley
The document summarizes the Repositories and Preservation Programme that was conducted by JISC, looking back at what was asked of participants and what was accomplished, and looking forward to the future direction. Specifically:
1) JISC asked participants to create more repositories, enhance existing ones, and provide services to help and exploit repository content through specific targeted projects.
2) Participants established more repositories, built on existing successes, and created services to help with discovery, deposit, and application profiles.
3) Looking ahead, the document suggests moving away from individual projects and toward more joined-up international activities, exposing and sharing content across repositories to better support research, teaching, and learning.
Audit and outsourcing: their role in creating interoperable repository infras...Kevin Ashley
A brief presentation for the REPRISE workshop before IDCC09 (2009-12-02) in London. I look at the role that audit and outsourcing play in helping deliver interoperable preservation in repositories.
Missing links closing talk - with notesKevin Ashley
A closing talk I gave at the JISC/DPC 'Missing Links' conference on web archiving in July 2009. The talks were on the DPC site but ironically the link is now broken.
Supporting open research - how to help your researchers - Vitae15Kevin Ashley
A talk given at a Vitae event in Leeds, 2015-12-01, on how universities and other research organisations can help their researchers practice open research, with a special focus on the training resources provided by FOSTER.
Opening up data: a UK perspective – Jisc and CNI conference 10 July 2014Jisc
This document summarizes Kevin Ashley's presentation on opening up research data from a UK perspective. The presentation discusses the policy background around open data in the UK, developments in infrastructure to support open data, and costs associated with making data openly available. It also notes that fully realizing the benefits of open data will require international cooperation across organizations like the Digital Curation Centre.
Big data and the dark arts - Jisc Digital Media 2015Jisc
There still remains a certain misunderstanding by the very definition of "big data" and the perceived hype around the term. This workshop clarified the concepts and give examples of relevant big data projects.
The document discusses the BioFresh project's efforts to mobilize freshwater biodiversity data. It provides the following key points:
1) BioFresh has been actively reaching out to data holders and has integrated 162 metadata entries and supported 26 data mobilization projects through its contingency fund. However, convincing data holders to contribute has been difficult due to issues like lack of funding, incompatible data formats, and scientists' time constraints.
2) Barriers to data sharing include a lack of incentives and rewards in science, commercial imperatives around data ownership, and the fact that early career researchers want to publish from their data first before sharing.
3) Mobilizing data is an interdisciplinary challenge that requires overcoming not
The slides for my talk on "HPC as a service" at the 25th anniversary Machine Evaluation Workshop in December 2014. I cover Jisc's HPC brokerage and related initiatives including our shared data centre, industry connectivity to Janet, our VAT cost sharing group, and our pilot of the Kit-Catalogue equipment sharing database.
The continued development of 3D technologies has enabled more affordable and accessible use in a wide range of teaching and research disciplines.
This workshop gave delegates a better understanding of how using 3D technologies can benefit education and research.
Jisc - Rebooting a National Innovation Agency (EUNIS 2014)Martin Hamilton
This is my presentation on "Rebooting" Jisc, from the EUNIS 2014 Congress at Umeå, Sweden. I begin by introducing Jisc, for anyone not already familiar with who we are and what we do. I highlight a few of our success stories that the EUNIS audience might not be familiar with, talk about some current projects - and how our focus and structure has changed following the Wilson Review. I close with our mission statement and vision for 2020.
The future of cloud computing - Jisc Digifest 2016Jisc
In Jisc's future of cloud computing horizon scan report, we identified three strategic areas where Jisc could support universities and colleges in moving to the cloud – cloud as a utility, app as a service, and working to build capability in cloud technologies.
Come along to this session to hear more about this work from Jisc futurist Martin Hamilton, and find out how you can get involved.
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...European Data Forum
Selected Talk by Nikolaos Loutas, Manager at PwC Belgium at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Business Models for Linked Government Data: What lies beneath?
Highlights of what is coming - presentation from Paul FeldmanJisc
Jisc plans to upgrade the Janet network to have 600gbps core capacity and use 400gbps optical technology. It will also rearchitect the regional network and explore network virtualization. Jisc will launch several new cybersecurity services including a security portal, DDoS mitigation, and a security conference. It will also offer cloud migration consulting, improve the geospatial service through a partnership, and expand the National Bibliographic Knowledgebase. Additionally, Jisc will provide various digital capability and learning analytics services.
Total cost of ownership: reducing the cost of gold open access - Jisc Digital...Jisc
Learn how Jisc Collections is addressing the cost UK higher education institutions face in maintaining subscriptions and also paying for article processing charges to the same publishers for the same journals.
The Helix Nebula initiative was presented at EGU 2013 and has continued to expand with more research organisations, providers and services. The hybrid cloud model deployed by Helix Nebula has grown to become a viable approach for provisioning ICT services for research communities from both public and commercial service providers (http://dx.doi.org/10.5281/zenodo.16001).
The relevance of this approach for all those communities facing societal challenges in explained in a recent EIROforum publication https://zenodo.org/record/34264#.V2ub4jXFn9g. This presentation, given at EGU2016, summarizes the progress of Helix Nebula and explores the essential characteristics of a European Open Science Cloud.
The document discusses how to cost data curation and research data management. It defines data curation as the active management of digital items over the long term to ensure they remain secure, discoverable and accessible. Research data management involves storing, accessing and preserving data produced from an investigation over its entire lifecycle. The document recommends using activity-based costing to identify all direct and indirect activities associated with data curation and research data management. It also emphasizes the importance of defining what is being costed and choosing an appropriate costing methodology.
Inverting the data pyramid: maximising the value of data reuse (IMCW2014/ICKM...Kevin Ashley
This document summarizes a presentation on research data management and reuse. It discusses:
1. The Digital Curation Centre's (DCC) mission to increase research data services capabilities in UK institutions and how this is an international issue.
2. How data reuse is already occurring but could be expanded, providing benefits for research quality, speed, and costs. Proper data management can also help ensure research integrity.
3. Barriers to increased data reuse including lack of infrastructure and services in some domains, and variability in data management practices between fields. Overcoming these issues requires attention from senior researchers, librarians, and policymakers.
Research data for repository managers Kevin Ashley
A presentation given at ULCC's Institutional Repository Manager's workshop 2012 on 2012-06-15. Aimed at getting traditional repository managers to think about their role in research data management.
What can the DCC do for you? Sheffield RoadshowKevin Ashley
A description of the ways in which the Digital Curation can work with institutions to improve research data management at institutional level. Delivered at the 2nd DCC roadshow, Sheffield, 2011-03-01
JISC repositories and preservation programme: Plenary presentation 2009Kevin Ashley
The document summarizes the Repositories and Preservation Programme that was conducted by JISC, looking back at what was asked of participants and what was accomplished, and looking forward to the future direction. Specifically:
1) JISC asked participants to create more repositories, enhance existing ones, and provide services to help and exploit repository content through specific targeted projects.
2) Participants established more repositories, built on existing successes, and created services to help with discovery, deposit, and application profiles.
3) Looking ahead, the document suggests moving away from individual projects and toward more joined-up international activities, exposing and sharing content across repositories to better support research, teaching, and learning.
Audit and outsourcing: their role in creating interoperable repository infras...Kevin Ashley
A brief presentation for the REPRISE workshop before IDCC09 (2009-12-02) in London. I look at the role that audit and outsourcing play in helping deliver interoperable preservation in repositories.
Missing links closing talk - with notesKevin Ashley
A closing talk I gave at the JISC/DPC 'Missing Links' conference on web archiving in July 2009. The talks were on the DPC site but ironically the link is now broken.
Supporting open research - how to help your researchers - Vitae15Kevin Ashley
A talk given at a Vitae event in Leeds, 2015-12-01, on how universities and other research organisations can help their researchers practice open research, with a special focus on the training resources provided by FOSTER.
Opening up data: a UK perspective – Jisc and CNI conference 10 July 2014Jisc
This document summarizes Kevin Ashley's presentation on opening up research data from a UK perspective. The presentation discusses the policy background around open data in the UK, developments in infrastructure to support open data, and costs associated with making data openly available. It also notes that fully realizing the benefits of open data will require international cooperation across organizations like the Digital Curation Centre.
This document discusses open data and open scholarship. It provides examples of how data from different disciplines can be reused in new contexts, highlighting the benefits of open data for research quality, speed, and cost effectiveness. It also outlines funder policies requiring data to be shared and challenges for universities in supporting open data. Overall it advocates for making research data discoverable and reusable to further open scholarship.
Martin Tully, the Information Architecture Lead for the Office of the Chief Information Officer, presented on developing an Integrated Information Services Platform for healthcare in Ireland. The presentation outlined the journey from an initial data dictionary proof of concept in December 2015 to developing a business case for a single source of truth for healthcare data. The goal is to improve patient care through an integrated platform that enables clinical care and business intelligence using standardized healthcare data.
This document summarizes a presentation about data management and curation. It discusses different types of data repositories like data libraries, data warehouses, and data archives. It also discusses open data practices and tools to help with data management planning. The presentation outlines the UK Research Data Service plan to provide cloud infrastructure and services to support research data curation and sharing. The overall messages are around institutions knowing what data they have, where it is located, who can access it, and having plans for long-term data management and curation.
This document provides information about open data resources for public health. It discusses how freely available facts and figures are essential for improving public services. Open data initiatives aim to increase transparency and accountability in governance. The Central Statistics Office (CSO) of Ireland disseminates national data through its Statbank and supports other agencies in publishing open data through the Public Sector Statistics Network. International data is also available through sources like Eurostat and the OECD.
British Library document supply: A changing service for a changing landscapesconul
SCONUL Conference 20-21 June 2013
SCONUL Fringe session - British Library document supply: A changing service for a changing landscape, with Andy Appleyard, Head of Document Supply and Customer Services, The British Library and Richard Ebdon, Business Development Manager, The British Library.
The universe of identifiers and how ANDS is using themAndrew Treloar
Presentation on identifiers in general, and ANDS' approach to identifiers for objects and people in particular. Given at ODIP 3rd Workshop on August 7, 2014.
This document discusses the rapid growth of the Internet of Things (IoT) and its impact across various industries. Some key points:
- The number of devices connected to the IoT is expected to grow from 12.5 billion in 2010 to over 50 billion by 2020.
- IoT applications span many sectors including retail, manufacturing, energy, transportation, healthcare and more.
- Challenges to IoT adoption include developing business models, addressing security and privacy concerns, and establishing standards.
- IoT is expected to contribute significantly to revenue growth for service providers and suppliers from 2014 to 2017.
Kevin Ashley_Sharing research data: benefits for the researcher, benefits for...Platforma Otwartej Nauki
“Open Research Data: Implications for Science and Society”, Warsaw, Poland, May 28–29, 2015, conference organized by the Open Science Platform — an initiative of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw. pon.edu.pl @OpenSciPlatform #ORD2015
The document discusses open educational resources (OER) and copyright issues in education. It notes that OER are freely available teaching and learning materials that can be reused and modified. The document outlines some of the benefits of OER, including lower costs, and discusses how copyright compliance is challenging for education given new technologies. It also provides examples of OER initiatives in different countries and organizations that support OER.
This document discusses opportunities for big and small companies from big data. It begins by setting the stage, noting that large corporations like infrastructure and applications and like connecting the unconnected. The agenda covers big data as a buzzword, drivers and use cases from IoT/IoE, and opportunities and next steps. Several sections explore big data opportunities across different industries like healthcare, retail, and smart cities. Big data is transforming many industries by providing insights from diverse data sources to improve decision making, risk analysis, fraud detection, personalized recommendations and more.
Brian D. Voss - Kuali Foundation Applications and You!Kuali Days UK
Presented by Brian D. Voss, Vice President and Chief Information Officer at the University of Maryland at the Kuali Days UK conference, 30 October 2013.
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECAProject
We live in an era of cloud computing. Many of the services in the life sciences are keenly planning cloud transformations, seeking to create globally distributed ecosystems of harmonised data based on standards from organisations like GA4GH. CINECA faces similar challenges, gathering cohort datasets from all over the globe, many of which are pinned in place, due to their size, legal restrictions, or other considerations. But is “bringing compute to the data” always the right choice? In this webinar, based on experiences from the Human Cell Atlas Data Coordination Platform and other projects from EMBL-EBI, we will explore the concept of “data gravity”: The idea that whilst there are forces that may hold data in one place, there are others that require it to be mobile. We’ll consider how effectively planning a cloud strategy requires consideration of the gravity of datasets, and the impact it may have on team skills required, incentives for good practice, and storage and compute costs.
The CINECA webinar series aims to discuss ways to address common challenges and share best practices in the field of cohort data analysis, as well as distribute CINECA project results. All CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions. Please note that all webinars are recorded and available for posterior viewing. CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions.
This webinar took place on 12th November 2020 and is part of the CINECA webinar series.
For previous and upcoming CINECA webinars see:
https://www.cineca-project.eu/webinars
This document provides information about Cisco's operations and partnerships in Russia, including:
- Cisco has been established in Russia since 1995 and employs 500 people across 12 offices, providing $1.6 billion in financing.
- It partners with 3000 organizations and has established 760 NetAcademies and entrepreneur institutes reaching over 25,000 students.
- Cisco's Chairman & CEO John Chambers sponsors its country operations and partnerships that support Russia's innovation and modernization agenda.
RISE - the DCC's Research Infrastructure Self-Evaluation FrameworkKevin Ashley
The document introduces RISE, a research infrastructure self-evaluation framework developed by the Digital Curation Centre. RISE aims to help research institutions assess the maturity of their research data services, identify gaps, prioritize improvements, and benchmark against peers. It comprises 22 capabilities across different levels of achievement. The framework was created based on the DCC's experience and incorporates standards from existing models. It is freely available online and several institutions have conducted self-assessments using RISE.
An analysis of open data and open science policies in Europe - a SPARCEurope ...Kevin Ashley
A short presentation given at the SPARCEurope members meeting on July 5th in Patras, Greece. It summarises the findings of a recent joint report by the DCC and SPARCEurope on European national open data and open science policy.
This document summarizes a presentation on the benefits of research data management. It discusses how data management can benefit researchers through increased citations and compliance with funder requirements. It also benefits society by enabling data sharing, reuse and discovery. However, many researchers do not practice good data management due to a lack of skills, resources or incentives. The presentation provides information on data management best practices and their importance for research excellence.
Data Quality and Data Curation - a personal viewKevin Ashley
- The document discusses data quality and curation from the perspective of Kevin Ashley, director of the Digital Curation Centre.
- It notes that different stakeholders have varying definitions of data quality, as some aspects of quality, like accuracy, may conflict with others like timeliness or completeness.
- It suggests that current curation practices often only cater to single consumer groups and domains, and that taking a more generic approach could increase data mobility and reuse across different domains.
The document discusses the importance of good research data management. It notes that good data is needed for good research and outlines funder requirements for data management plans and long-term data preservation. The Digital Curation Centre (DCC) provides tools, services, and support to help research institutions develop their research data management capabilities and policies.
Research Data Management: the UK national change programme (Nordbib)Kevin Ashley
The UK National Change Program aims to realize the maximum value of research data through a 5-year program of national services and support coordinated by the Digital Curation Centre. The program focuses on building capacity and capability for research data management within research institutions. Proper research data management is important because data is expensive to create, facilitates reuse and reproducibility, and is increasingly subject to legal and regulatory requirements from research funders.
This document reviews challenges in digital preservation research by examining past reports that identified key research areas. It discusses work that has been done, is currently being done, and remains to be done. Some areas explored include format migration, repository models, metadata standards, and preserving newer digital formats and software. The document emphasizes the need for both pragmatic and theoretical research that can inform practice and help define problems more specifically to guide future work.
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...Alex Pruden
Folding is a recent technique for building efficient recursive SNARKs. Several elegant folding protocols have been proposed, such as Nova, Supernova, Hypernova, Protostar, and others. However, all of them rely on an additively homomorphic commitment scheme based on discrete log, and are therefore not post-quantum secure. In this work we present LatticeFold, the first lattice-based folding protocol based on the Module SIS problem. This folding protocol naturally leads to an efficient recursive lattice-based SNARK and an efficient PCD scheme. LatticeFold supports folding low-degree relations, such as R1CS, as well as high-degree relations, such as CCS. The key challenge is to construct a secure folding protocol that works with the Ajtai commitment scheme. The difficulty, is ensuring that extracted witnesses are low norm through many rounds of folding. We present a novel technique using the sumcheck protocol to ensure that extracted witnesses are always low norm no matter how many rounds of folding are used. Our evaluation of the final proof system suggests that it is as performant as Hypernova, while providing post-quantum security.
Paper Link: https://eprint.iacr.org/2024/257
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsDianaGray10
Join us to learn how UiPath Apps can directly and easily interact with prebuilt connectors via Integration Service--including Salesforce, ServiceNow, Open GenAI, and more.
The best part is you can achieve this without building a custom workflow! Say goodbye to the hassle of using separate automations to call APIs. By seamlessly integrating within App Studio, you can now easily streamline your workflow, while gaining direct access to our Connector Catalog of popular applications.
We’ll discuss and demo the benefits of UiPath Apps and connectors including:
Creating a compelling user experience for any software, without the limitations of APIs.
Accelerating the app creation process, saving time and effort
Enjoying high-performance CRUD (create, read, update, delete) operations, for
seamless data management.
Speakers:
Russell Alfeche, Technology Leader, RPA at qBotic and UiPath MVP
Charlie Greenberg, host
Monitoring and Managing Anomaly Detection on OpenShift.pdfTosin Akinosho
Monitoring and Managing Anomaly Detection on OpenShift
Overview
Dive into the world of anomaly detection on edge devices with our comprehensive hands-on tutorial. This SlideShare presentation will guide you through the entire process, from data collection and model training to edge deployment and real-time monitoring. Perfect for those looking to implement robust anomaly detection systems on resource-constrained IoT/edge devices.
Key Topics Covered
1. Introduction to Anomaly Detection
- Understand the fundamentals of anomaly detection and its importance in identifying unusual behavior or failures in systems.
2. Understanding Edge (IoT)
- Learn about edge computing and IoT, and how they enable real-time data processing and decision-making at the source.
3. What is ArgoCD?
- Discover ArgoCD, a declarative, GitOps continuous delivery tool for Kubernetes, and its role in deploying applications on edge devices.
4. Deployment Using ArgoCD for Edge Devices
- Step-by-step guide on deploying anomaly detection models on edge devices using ArgoCD.
5. Introduction to Apache Kafka and S3
- Explore Apache Kafka for real-time data streaming and Amazon S3 for scalable storage solutions.
6. Viewing Kafka Messages in the Data Lake
- Learn how to view and analyze Kafka messages stored in a data lake for better insights.
7. What is Prometheus?
- Get to know Prometheus, an open-source monitoring and alerting toolkit, and its application in monitoring edge devices.
8. Monitoring Application Metrics with Prometheus
- Detailed instructions on setting up Prometheus to monitor the performance and health of your anomaly detection system.
9. What is Camel K?
- Introduction to Camel K, a lightweight integration framework built on Apache Camel, designed for Kubernetes.
10. Configuring Camel K Integrations for Data Pipelines
- Learn how to configure Camel K for seamless data pipeline integrations in your anomaly detection workflow.
11. What is a Jupyter Notebook?
- Overview of Jupyter Notebooks, an open-source web application for creating and sharing documents with live code, equations, visualizations, and narrative text.
12. Jupyter Notebooks with Code Examples
- Hands-on examples and code snippets in Jupyter Notebooks to help you implement and test anomaly detection models.
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyScyllaDB
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
Driving Business Innovation: Latest Generative AI Advancements & Success StorySafe Software
Are you ready to revolutionize how you handle data? Join us for a webinar where we’ll bring you up to speed with the latest advancements in Generative AI technology and discover how leveraging FME with tools from giants like Google Gemini, Amazon, and Microsoft OpenAI can supercharge your workflow efficiency.
During the hour, we’ll take you through:
Guest Speaker Segment with Hannah Barrington: Dive into the world of dynamic real estate marketing with Hannah, the Marketing Manager at Workspace Group. Hear firsthand how their team generates engaging descriptions for thousands of office units by integrating diverse data sources—from PDF floorplans to web pages—using FME transformers, like OpenAIVisionConnector and AnthropicVisionConnector. This use case will show you how GenAI can streamline content creation for marketing across the board.
Ollama Use Case: Learn how Scenario Specialist Dmitri Bagh has utilized Ollama within FME to input data, create custom models, and enhance security protocols. This segment will include demos to illustrate the full capabilities of FME in AI-driven processes.
Custom AI Models: Discover how to leverage FME to build personalized AI models using your data. Whether it’s populating a model with local data for added security or integrating public AI tools, find out how FME facilitates a versatile and secure approach to AI.
We’ll wrap up with a live Q&A session where you can engage with our experts on your specific use cases, and learn more about optimizing your data workflows with AI.
This webinar is ideal for professionals seeking to harness the power of AI within their data management systems while ensuring high levels of customization and security. Whether you're a novice or an expert, gain actionable insights and strategies to elevate your data processes. Join us to see how FME and AI can revolutionize how you work with data!
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor IvaniukFwdays
At this talk we will discuss DDoS protection tools and best practices, discuss network architectures and what AWS has to offer. Also, we will look into one of the largest DDoS attacks on Ukrainian infrastructure that happened in February 2022. We'll see, what techniques helped to keep the web resources available for Ukrainians and how AWS improved DDoS protection for all customers based on Ukraine experience
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfChart Kalyan
A Mix Chart displays historical data of numbers in a graphical or tabular form. The Kalyan Rajdhani Mix Chart specifically shows the results of a sequence of numbers over different periods.
Fueling AI with Great Data with Airbyte WebinarZilliz
This talk will focus on how to collect data from a variety of sources, leveraging this data for RAG and other GenAI use cases, and finally charting your course to productionalization.
Digital Banking in the Cloud: How Citizens Bank Unlocked Their MainframePrecisely
Inconsistent user experience and siloed data, high costs, and changing customer expectations – Citizens Bank was experiencing these challenges while it was attempting to deliver a superior digital banking experience for its clients. Its core banking applications run on the mainframe and Citizens was using legacy utilities to get the critical mainframe data to feed customer-facing channels, like call centers, web, and mobile. Ultimately, this led to higher operating costs (MIPS), delayed response times, and longer time to market.
Ever-changing customer expectations demand more modern digital experiences, and the bank needed to find a solution that could provide real-time data to its customer channels with low latency and operating costs. Join this session to learn how Citizens is leveraging Precisely to replicate mainframe data to its customer channels and deliver on their “modern digital bank” experiences.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/how-axelera-ai-uses-digital-compute-in-memory-to-deliver-fast-and-energy-efficient-computer-vision-a-presentation-from-axelera-ai/
Bram Verhoef, Head of Machine Learning at Axelera AI, presents the “How Axelera AI Uses Digital Compute-in-memory to Deliver Fast and Energy-efficient Computer Vision” tutorial at the May 2024 Embedded Vision Summit.
As artificial intelligence inference transitions from cloud environments to edge locations, computer vision applications achieve heightened responsiveness, reliability and privacy. This migration, however, introduces the challenge of operating within the stringent confines of resource constraints typical at the edge, including small form factors, low energy budgets and diminished memory and computational capacities. Axelera AI addresses these challenges through an innovative approach of performing digital computations within memory itself. This technique facilitates the realization of high-performance, energy-efficient and cost-effective computer vision capabilities at the thin and thick edge, extending the frontier of what is achievable with current technologies.
In this presentation, Verhoef unveils his company’s pioneering chip technology and demonstrates its capacity to deliver exceptional frames-per-second performance across a range of standard computer vision networks typical of applications in security, surveillance and the industrial sector. This shows that advanced computer vision can be accessible and efficient, even at the very edge of our technological ecosystem.
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-und-domino-lizenzkostenreduzierung-in-der-welt-von-dlau/
DLAU und die Lizenzen nach dem CCB- und CCX-Modell sind für viele in der HCL-Community seit letztem Jahr ein heißes Thema. Als Notes- oder Domino-Kunde haben Sie vielleicht mit unerwartet hohen Benutzerzahlen und Lizenzgebühren zu kämpfen. Sie fragen sich vielleicht, wie diese neue Art der Lizenzierung funktioniert und welchen Nutzen sie Ihnen bringt. Vor allem wollen Sie sicherlich Ihr Budget einhalten und Kosten sparen, wo immer möglich. Das verstehen wir und wir möchten Ihnen dabei helfen!
Wir erklären Ihnen, wie Sie häufige Konfigurationsprobleme lösen können, die dazu führen können, dass mehr Benutzer gezählt werden als nötig, und wie Sie überflüssige oder ungenutzte Konten identifizieren und entfernen können, um Geld zu sparen. Es gibt auch einige Ansätze, die zu unnötigen Ausgaben führen können, z. B. wenn ein Personendokument anstelle eines Mail-Ins für geteilte Mailboxen verwendet wird. Wir zeigen Ihnen solche Fälle und deren Lösungen. Und natürlich erklären wir Ihnen das neue Lizenzmodell.
Nehmen Sie an diesem Webinar teil, bei dem HCL-Ambassador Marc Thomas und Gastredner Franz Walder Ihnen diese neue Welt näherbringen. Es vermittelt Ihnen die Tools und das Know-how, um den Überblick zu bewahren. Sie werden in der Lage sein, Ihre Kosten durch eine optimierte Domino-Konfiguration zu reduzieren und auch in Zukunft gering zu halten.
Diese Themen werden behandelt
- Reduzierung der Lizenzkosten durch Auffinden und Beheben von Fehlkonfigurationen und überflüssigen Konten
- Wie funktionieren CCB- und CCX-Lizenzen wirklich?
- Verstehen des DLAU-Tools und wie man es am besten nutzt
- Tipps für häufige Problembereiche, wie z. B. Team-Postfächer, Funktions-/Testbenutzer usw.
- Praxisbeispiele und Best Practices zum sofortigen Umsetzen
Taking AI to the Next Level in Manufacturing.pdfssuserfac0301
Read Taking AI to the Next Level in Manufacturing to gain insights on AI adoption in the manufacturing industry, such as:
1. How quickly AI is being implemented in manufacturing.
2. Which barriers stand in the way of AI adoption.
3. How data quality and governance form the backbone of AI.
4. Organizational processes and structures that may inhibit effective AI adoption.
6. Ideas and approaches to help build your organization's AI strategy.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
Programming Foundation Models with DSPy - Meetup Slides
Research data: burden or treasure? (Talk from #fote13)
1. Research data: burden or treasure?
Kevin Ashley
Digital Curation Centre
www.dcc.ac.uk
@kevingashley
Kevin.ashley@ed.ac.uk
Reusable with attribution: CC-BY
The DCC is supported by Jisc & FP7
2. 164 universities in UK*
*2011 HESA data
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 2
71 (43%) > 5% research income
115 (70%) > £1m income from research
4. Funders are making demands
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 4
5. 2013-10-11
Kevin Ashley – FOTiE 2013 - CC-
BY
5
http://www.epsrc.ac.uk/about/standards/researchdata/Pages/expectations.aspx
EPSRC expects all those institutions it funds
to develop a roadmap that aligns … with
EPSRC’s expectations by 1st May 2012;
to be fully compliant … by 1st May 2015.
6. 2012-06-15
Kevin Ashley, DCC; IRWM12,
ULCC; CC-BY
6
• Awareness of regulatory environment
• Data access statement
• Policies and processes
• Data storage
• Structured metadata descriptions
• DOIs for data
• Securely preserved for a minimum of 10 years
from last use
7. How much data do we have?
• Edinburgh – provision for 5 Petabytes
• Oxford – guessing 3Pb/year
• For comparison – LHC @ CERN – 15 Pb/year
• £2m investment in storage not unusual
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 7
8. The Data Deluge is upon us
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 8
Sensor’s ability
to produce data
outstrips IT’s
ability to
process it
9. Research Data Centres – the solution!
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 9
MANY AREAS OF
RESEARCH HAVE NO
DATA CENTRE TO
SERVE THEM
10. Cloud – sorted!
• Sorry, but it isn’t.
• See David Rosenthal’s analysis of the
economics of Amazon for preservation
“Distributed digital preservation in the cloud”
IJDC 8(1), 2013 doi:10.2218/ijdc.v8i1.248
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 10
13. That looks like a problem
• Funder requirements exist for a reason:
– That data is valuable
• Value to funder, society from reuse
• Value to the institution is there also
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 13
BIS business case: £1.5m investment in research data
services pays back 2.5 times after 5 years
14. Integrity
• Not everyone publishes
here
• Almost all fraud
connected to
unavailable data
• People suffer & die due
to research fraud
• When your research is
reproducible – it gets
cited
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 14
15. Citability
• Making data available increases citations
• Everyone – academic, funder, institution –
loves citations
• Want evidence?
– Alter, Pienta, Lyle – 240%, social sciences *
– Piwowar, Vision – 9% (microarray data)†
– Henneken, Accomazzi – 20% (astronomy) #
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 15
† Piwowar H, Vision TJ. (2013) Data reuse & the open data citation advantage. PeerJ PrePrints 1:e1v1
http://dx.doi.org/10.7287/peerj.preprints.1v1
* Amy Pienta, George Alter, Jared Lyle, (2010) The Enduring Value of Social Science Research: The Use and Reuse of Primary Research Data.
http://hdl.handle.net/2027.42/78307
# Edwin Henneken, Alberto Accomazzi, (2011) Linking to Data - Effect on Citation Rates in Astronomy. http://arxiv.org/abs/1111.3618
16. Value in the institution
• New research depends on the old – well
managed data resources like well-equipped
labs
• Teaching more effective when real data from
research is used
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 16
17. Wherever it is, it has valueWant a 400% -> 1200%
return on your
investment?
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 17
Try BADC!
http://www.jisc.ac.uk/whatwedo/programmes/di_directions/strategicdirections/badc.aspx
19. Can we find it?
• Data must be discoverable to be reused
• Alone, or in conjunction with publication
• Institutional catalogues, national data
registries – JISC is piloting through DCC
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 19
24. 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 24
Idea
Develop
Fund
Plan
Record
Process
Publish
Read
25. 2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 25
Idea
Develop
Fund
Plan
Record
Process
Publish
Read
Idea
Develop
Fund
Plan
Record
Process
Publish
Read
27. Data reuse stories
• The palaeontologist who saved years of work
with archaeological data
• The ‘noise’ from research radar that mapped
dust from Eyjafjallajökull
• The 19th-century logs and photographs that
help us model climate change
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 27
Often your data tells
stories that your
publications do not
29. Thanks for your attention
kevin.ashley@ed.ac.uk
www.dcc.ac.uk
@kevingashley
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 29
30. DCC ‘institutional engagement’
Assess
needs
Make the case
Develop
support and
services
RDM policy
development
Customised Data
Management Plans
DAF & CARDIO
assessments
Guidance and
training
Workflow
assessment
DCC
support
team
Advocacy with senior
management
Institutional
data catalogues
Pilot RDM
tools
…and support policy implementation
2013-10-11 Kevin Ashley – FOTiE 2013 - CC-BY 30
Editor's Notes
Those external pressures include those from funders such as EPSRC. Looming deadlines this year and in 2015 got the attention of senior university management.
The expectations that universities need to sign up are listed here – their roadmaps need to demonstrate how they are going to deliver on these expectations by 2015. They include a commitment to keep data for 10 years after its last use – note, not just after the project ends. Some worry that this means they need to keep data for 100 years. I say that if your data is still being used (and cited) 100 years later you should break out the champagne, not worry about paying for it.