The document discusses research data management services provided by the University of Western Australia (UWA) Library. It notes that funders like the Australian Research Council (ARC) and National Health and Medical Research Council (NHMRC) require research data to be managed and shared. UWA policies also require research data related to publications to be available through the UWA Research Repository. The document provides guidance on creating data management plans, using appropriate licenses, and securely storing data long-term using the Institutional Research Data Storage (IRDS) system rather than third-party cloud services like Dropbox.
2017 05 03 Implementing Pure at UWA - ANDS Webinar SeriesKatina Toufexis
This document summarizes the University of Western Australia's efforts to consolidate its research data systems. It discusses the migration of datasets from its existing DSpace repository to its new Pure repository to have a single system for publications, theses, and datasets. The migration project timeline and functional requirements are outlined, along with issues encountered with the previous DSpace and Vivo systems. Finally, future plans are mentioned, such as enabling dataset submissions directly in Pure and linking publications, theses, and datasets from the same grants/instruments.
This document provides information on research data management services at UWA. It discusses creating data management plans, funder and publisher requirements for data sharing, using the Research Data Online repository, data storage options like IRDS and UniDrive, and contacts for further assistance. Managing research data properly ensures compliance, reproducibility, and legacy of research outputs.
This document provides an overview of a webinar on digital curation and research data management for universities. The webinar covers an introduction to digital curation, the benefits and drivers for research data management, current initiatives in UK universities, and the role of libraries in supporting research data management. Libraries are increasingly involved in developing institutional policies, providing training, and advising researchers on writing data management plans and sharing data. The webinar highlights training opportunities for librarians to develop skills in research data management and digital curation.
This document discusses licensing research data for reuse. It begins by providing a scenario where a user has downloaded a dataset but is unsure what they can do with the data due to licensing. It then discusses that licensing is critical to enabling data reuse and citation. It provides information on AusGOAL, the Australian open access and licensing framework, and notes it is recommended for data publishing by ANDS partners. It also includes links to licensing guides and FAQs. In summary, the document emphasizes the importance of data licensing for enabling reuse and outlines Australia's recommended licensing system.
Building a collaborative RDM community, research data networkJisc RDM
This document summarizes Dr. Marta Teperek's presentation on building a collaborative research data management (RDM) community. The presentation covered how not to start RDM services by mandating data sharing, and instead focusing on the benefits of sharing. It discussed Cambridge University's democratic approach to developing RDM services by empowering researchers, and the positive feedback received. Collaboration, open communication, and shaping services and policies with researchers were emphasized as key to success.
This document discusses research data management (RDM). It defines research data and describes the RDM lifecycle. Key aspects of RDM include creating data management plans, documenting and organizing data, and ensuring long-term preservation and sharing of data. The document outlines best practices for RDM, such as using appropriate file formats and metadata standards. It also discusses challenges around sensitive data and guidelines for data sharing and citation. The roles libraries can play in supporting RDM are identified, such as developing RDM policies, training researchers, and setting up data repositories.
2017 05 03 Implementing Pure at UWA - ANDS Webinar SeriesKatina Toufexis
This document summarizes the University of Western Australia's efforts to consolidate its research data systems. It discusses the migration of datasets from its existing DSpace repository to its new Pure repository to have a single system for publications, theses, and datasets. The migration project timeline and functional requirements are outlined, along with issues encountered with the previous DSpace and Vivo systems. Finally, future plans are mentioned, such as enabling dataset submissions directly in Pure and linking publications, theses, and datasets from the same grants/instruments.
This document provides information on research data management services at UWA. It discusses creating data management plans, funder and publisher requirements for data sharing, using the Research Data Online repository, data storage options like IRDS and UniDrive, and contacts for further assistance. Managing research data properly ensures compliance, reproducibility, and legacy of research outputs.
This document provides an overview of a webinar on digital curation and research data management for universities. The webinar covers an introduction to digital curation, the benefits and drivers for research data management, current initiatives in UK universities, and the role of libraries in supporting research data management. Libraries are increasingly involved in developing institutional policies, providing training, and advising researchers on writing data management plans and sharing data. The webinar highlights training opportunities for librarians to develop skills in research data management and digital curation.
This document discusses licensing research data for reuse. It begins by providing a scenario where a user has downloaded a dataset but is unsure what they can do with the data due to licensing. It then discusses that licensing is critical to enabling data reuse and citation. It provides information on AusGOAL, the Australian open access and licensing framework, and notes it is recommended for data publishing by ANDS partners. It also includes links to licensing guides and FAQs. In summary, the document emphasizes the importance of data licensing for enabling reuse and outlines Australia's recommended licensing system.
Building a collaborative RDM community, research data networkJisc RDM
This document summarizes Dr. Marta Teperek's presentation on building a collaborative research data management (RDM) community. The presentation covered how not to start RDM services by mandating data sharing, and instead focusing on the benefits of sharing. It discussed Cambridge University's democratic approach to developing RDM services by empowering researchers, and the positive feedback received. Collaboration, open communication, and shaping services and policies with researchers were emphasized as key to success.
This document discusses research data management (RDM). It defines research data and describes the RDM lifecycle. Key aspects of RDM include creating data management plans, documenting and organizing data, and ensuring long-term preservation and sharing of data. The document outlines best practices for RDM, such as using appropriate file formats and metadata standards. It also discusses challenges around sensitive data and guidelines for data sharing and citation. The roles libraries can play in supporting RDM are identified, such as developing RDM policies, training researchers, and setting up data repositories.
The document summarizes an agenda for a workshop on practicing open science. The workshop covers topics such as why practice open science, understanding open access publishing, managing and sharing research data, data management planning, and tools. It provides an overview of each topic and exercises for participants. The Digital Repository of Ireland is introduced as a national infrastructure that can help with archiving, preserving and sharing research data according to open science principles.
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
The document discusses research data management and provides guidance on best practices. It defines research data management as the active management of data over its lifecycle. It recommends writing a data management plan to document how data will be created, stored, shared, and preserved. It also provides tips for making data accessible and reusable through use of metadata standards, documentation, open licensing, and depositing data in repositories with persistent identifiers. The goal is to help researchers manage and share their data effectively to increase access and reuse.
Managing and sharing confidential data in Australian social scienceARDC
The “problem” of “sensitive data” - the 5 Safes model
The “problem” of open and transparent research – the FAIR principles
From problems to solutions – Access to sensitive data in Australia – ADA as a model for journal data access system
The document outlines a 23 Things program for research data management training, which releases weekly activities and has monthly webinars, and provides a calendar of events and list of coordinators for the program at UWA.
This slideshow was used in a Preparing Your Research Material for the Future course for the Humanities Division, University of Oxford, on 2018-06-08. It provides an overview of some key issues, focusing on the long-term management of data and other research material, including sharing and curation.
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATTony Ross-Hellauer
OpenAIRE and EUDAT co-present this webinar which aims to introduce researchers and others to the concept of research data management (RDM). As well as presenting the benefits of taking an active approach to research data management – including increased speed and ease of access, efficiency (fund once, reuse many times), and improved quality and transparency of research – the webinar will advise on strategies for successful RDM, resources to help manage data effectively, choosing where to store and deposit data, the EC H2020 Open Data Pilot and the basics of data management, stewardship and archiving.
Webinar recording available: http://www.instantpresenter.com/eifl/EB57D6888147
This document provides an introduction to research data management for geoscience PhD students. It defines research data and different data types. It discusses the importance of managing data throughout its lifecycle for efficient and valid research. It outlines funder requirements, university policies, and activities involved in good research data management like data planning, documentation, storage, sharing and preservation.
Role of libraries in research and scholarly communicationNikesh Narayanan
Libraries play an important role in supporting research through facilitating literature searches, providing information literacy and reference services, and guiding researchers in publishing and managing their research profiles. Libraries can help researchers efficiently search across disjointed information sources through federated search software or web-scale discovery tools which provide a single search interface. Libraries also help connect researchers to open access resources and guide them on where and how to publish their research findings.
Management of research data specifically for Engineering and Physical Science. Delivered by Stuart Macdonald at the "Support for Enhancing Research Impact" meeting at the University of Edinburgh on 22 June 2016.
Writing a successful data management plan with the DMPToolkfear
This document provides an overview of how to write an effective Data Management Plan (DMP) using the DMPTool. It discusses the key components of a DMP including data products, standards, access and sharing, preservation, and documentation. The goals are to help researchers generate a DMP, understand the basic elements, and recognize how good data management leads to a strong plan. Writing a thorough DMP is now required by many funders and helps ensure data is organized, accessible, and preserved for future use.
This document summarizes a workshop on authority files. It discusses how authority files can transform from library silos to a web of linked data by uniquely identifying entities like people, publications, organizations, and connecting them using identifiers. Four use cases are presented: developing a repository authority file, enhancing a journal authority file to track open access evolution, integrating existing authority files to make cultural data web compliant, and using authority files to enable new analyses and business intelligence from research information systems. The benefits of authority files for discovery, reliability, accountability, and efficiency are outlined. An example of crosswalking different authority files is also provided. The document concludes with an opinion poll on authority file topics.
DataShare - Pauline Ward to University of Edinburgh School of Chemistry - 3 f...University of Edinburgh
Talk targeted at researchers at the University of Edinburgh, explaining how they can use DataShare to publish their research results, and some of the benefits of doing so.
S. Venkataraman (DCC) talks about the basics of Research Data Management and how to apply this when creating or reviewing a Data Management Plan (DMP). He discusses data formats and metadata standards, persistent identifiers, licensing, controlled vocabularies and data repositories.
link to : dcc.ac.uk/resources
The document summarizes a workshop on planning for research data management. It discusses what research data management is, including definitions and lifecycle models. It emphasizes the importance of planning for RDM from the beginning of a research project, including developing a data management plan that addresses data collection, documentation, storage, sharing, and long-term preservation. The workshop also covered naming conventions, file formats, metadata, and tools and resources available to support RDM.
Keynote on software sustainability given at the 2nd Annual Netherlands eScience Symposium, November 2014.
Based on the article
Carole Goble ,
Better Software, Better Research
Issue No.05 - Sept.-Oct. (2014 vol.18)
pp: 4-8
IEEE Computer Society
http://www.computer.org/csdl/mags/ic/2014/05/mic2014050004.pdf
http://doi.ieeecomputersociety.org/10.1109/MIC.2014.88
http://www.software.ac.uk/resources/publications/better-software-better-research
This document discusses research data management services at the University of Western Australia (UWA). It provides information on the Institutional Research Data Store (IRDS), a no-cost research data storage option for UWA researchers that provides 25GB of secure storage. It also discusses requirements for research data management and sharing from funding bodies like the Australian Research Council, and options for making data available through UWA's Research Data Online platform. Contact information is provided for the Research Data Coordinator for any questions.
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017ARDC
The Australian National Data Service (ANDS) aims to make Australian research data more valuable by partnering with research organizations and funding data projects. In 2015, ANDS conducted over 100 workshops and events with over 4,000 participants and developed online resources. ANDS provides guides on topics like data management and the FAIR data principles. ANDS also advocates for practices like data citation and publishing to ensure research data is preserved and reusable over time. The presentation outlines ANDS' role in supporting good research data management practices and sharing to ensure the integrity and impact of research evidence.
The document summarizes an agenda for a workshop on practicing open science. The workshop covers topics such as why practice open science, understanding open access publishing, managing and sharing research data, data management planning, and tools. It provides an overview of each topic and exercises for participants. The Digital Repository of Ireland is introduced as a national infrastructure that can help with archiving, preserving and sharing research data according to open science principles.
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
The document discusses research data management and provides guidance on best practices. It defines research data management as the active management of data over its lifecycle. It recommends writing a data management plan to document how data will be created, stored, shared, and preserved. It also provides tips for making data accessible and reusable through use of metadata standards, documentation, open licensing, and depositing data in repositories with persistent identifiers. The goal is to help researchers manage and share their data effectively to increase access and reuse.
Managing and sharing confidential data in Australian social scienceARDC
The “problem” of “sensitive data” - the 5 Safes model
The “problem” of open and transparent research – the FAIR principles
From problems to solutions – Access to sensitive data in Australia – ADA as a model for journal data access system
The document outlines a 23 Things program for research data management training, which releases weekly activities and has monthly webinars, and provides a calendar of events and list of coordinators for the program at UWA.
This slideshow was used in a Preparing Your Research Material for the Future course for the Humanities Division, University of Oxford, on 2018-06-08. It provides an overview of some key issues, focusing on the long-term management of data and other research material, including sharing and curation.
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATTony Ross-Hellauer
OpenAIRE and EUDAT co-present this webinar which aims to introduce researchers and others to the concept of research data management (RDM). As well as presenting the benefits of taking an active approach to research data management – including increased speed and ease of access, efficiency (fund once, reuse many times), and improved quality and transparency of research – the webinar will advise on strategies for successful RDM, resources to help manage data effectively, choosing where to store and deposit data, the EC H2020 Open Data Pilot and the basics of data management, stewardship and archiving.
Webinar recording available: http://www.instantpresenter.com/eifl/EB57D6888147
This document provides an introduction to research data management for geoscience PhD students. It defines research data and different data types. It discusses the importance of managing data throughout its lifecycle for efficient and valid research. It outlines funder requirements, university policies, and activities involved in good research data management like data planning, documentation, storage, sharing and preservation.
Role of libraries in research and scholarly communicationNikesh Narayanan
Libraries play an important role in supporting research through facilitating literature searches, providing information literacy and reference services, and guiding researchers in publishing and managing their research profiles. Libraries can help researchers efficiently search across disjointed information sources through federated search software or web-scale discovery tools which provide a single search interface. Libraries also help connect researchers to open access resources and guide them on where and how to publish their research findings.
Management of research data specifically for Engineering and Physical Science. Delivered by Stuart Macdonald at the "Support for Enhancing Research Impact" meeting at the University of Edinburgh on 22 June 2016.
Writing a successful data management plan with the DMPToolkfear
This document provides an overview of how to write an effective Data Management Plan (DMP) using the DMPTool. It discusses the key components of a DMP including data products, standards, access and sharing, preservation, and documentation. The goals are to help researchers generate a DMP, understand the basic elements, and recognize how good data management leads to a strong plan. Writing a thorough DMP is now required by many funders and helps ensure data is organized, accessible, and preserved for future use.
This document summarizes a workshop on authority files. It discusses how authority files can transform from library silos to a web of linked data by uniquely identifying entities like people, publications, organizations, and connecting them using identifiers. Four use cases are presented: developing a repository authority file, enhancing a journal authority file to track open access evolution, integrating existing authority files to make cultural data web compliant, and using authority files to enable new analyses and business intelligence from research information systems. The benefits of authority files for discovery, reliability, accountability, and efficiency are outlined. An example of crosswalking different authority files is also provided. The document concludes with an opinion poll on authority file topics.
DataShare - Pauline Ward to University of Edinburgh School of Chemistry - 3 f...University of Edinburgh
Talk targeted at researchers at the University of Edinburgh, explaining how they can use DataShare to publish their research results, and some of the benefits of doing so.
S. Venkataraman (DCC) talks about the basics of Research Data Management and how to apply this when creating or reviewing a Data Management Plan (DMP). He discusses data formats and metadata standards, persistent identifiers, licensing, controlled vocabularies and data repositories.
link to : dcc.ac.uk/resources
The document summarizes a workshop on planning for research data management. It discusses what research data management is, including definitions and lifecycle models. It emphasizes the importance of planning for RDM from the beginning of a research project, including developing a data management plan that addresses data collection, documentation, storage, sharing, and long-term preservation. The workshop also covered naming conventions, file formats, metadata, and tools and resources available to support RDM.
Keynote on software sustainability given at the 2nd Annual Netherlands eScience Symposium, November 2014.
Based on the article
Carole Goble ,
Better Software, Better Research
Issue No.05 - Sept.-Oct. (2014 vol.18)
pp: 4-8
IEEE Computer Society
http://www.computer.org/csdl/mags/ic/2014/05/mic2014050004.pdf
http://doi.ieeecomputersociety.org/10.1109/MIC.2014.88
http://www.software.ac.uk/resources/publications/better-software-better-research
This document discusses research data management services at the University of Western Australia (UWA). It provides information on the Institutional Research Data Store (IRDS), a no-cost research data storage option for UWA researchers that provides 25GB of secure storage. It also discusses requirements for research data management and sharing from funding bodies like the Australian Research Council, and options for making data available through UWA's Research Data Online platform. Contact information is provided for the Research Data Coordinator for any questions.
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017ARDC
The Australian National Data Service (ANDS) aims to make Australian research data more valuable by partnering with research organizations and funding data projects. In 2015, ANDS conducted over 100 workshops and events with over 4,000 participants and developed online resources. ANDS provides guides on topics like data management and the FAIR data principles. ANDS also advocates for practices like data citation and publishing to ensure research data is preserved and reusable over time. The presentation outlines ANDS' role in supporting good research data management practices and sharing to ensure the integrity and impact of research evidence.
This slideshow was used in a Preparing Your Research Data for the Future course taught in the Medical Sciences Division, University of Oxford, on 2015-06-08. It provides an overview of some key issues, focusing on long-term data management, sharing, and curation.
Getting to grips with research data management Wendy Mears
This document provides an overview of research data management. It defines research data management and discusses its importance. It also outlines the data lifecycle model and provides guidance on sharing data, working with data, planning for data management, and useful resources for research data management. The document aims to help researchers effectively manage the data created throughout the research process.
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...Sarah Anna Stewart
Presentation given at the M25 Consortium of Academic Libraries, CPD25 Event on 'The Role of the Library in Supporting Research'. Provides an introduction to data, software and PIDs and a brief look at how libraries can enable researchers to gain impact and credit for their research data and software.
This slideshow was used in Preparing Your Research Data for the Future course taught in the Social Sciences Division, University of Oxford, on 2014-02-17. It provides an overview of some key issues, focusing on long-term data management, sharing, and curation
The document discusses research data sharing and data availability statements. It defines a data availability statement as a statement describing how research data can be accessed and under what conditions. There are three main ways to share research data: data repositories, supplementary files, and upon request. Writing a good data availability statement involves answering questions about the raw data, where it can be accessed, any conditions for access, and restrictions. Sharing data openly increases transparency, collaboration, citations and reproducibility of research.
Data sharing is the practice of making research data openly available to others. It has many benefits including enabling innovation, improving transparency and research integrity, and increasing citations and impact. Major funders now require data sharing as a condition of funding. To share data, it must be prepared by documenting it with metadata and supporting files. This allows others to understand and use the data. Researchers are encouraged to share data in open repositories to maximize access and reuse. Proper preparation of data for sharing helps ensure data is FAIR - Findable, Accessible, Interoperable and Reusable.
This document provides an overview of preparing research data for long-term preservation and sharing. It discusses defining data, following data management policies, documenting data with metadata, securely storing data through backups and appropriate file formats and storage media. It also addresses sharing data by depositing in repositories, making data publicly available through services like Figshare, and using licenses. The document emphasizes planning for data management from the start of a research project through drafting a data management plan and seeking advice from university support services.
This presentation was delivered at IT Services, University of Oxford on 2014-05-28, as part of the 'Things To Do With Data' series of lunchtime talks. It offers an overview of resources available for management and support staff whose responsibilities include planning and implementing data management strategies.
This document discusses the importance of managing research data and provides best practices and resources for doing so. It notes that data is a valuable product of research that should be stored securely and potentially shared. Guidelines recommend developing a data management plan, organizing and documenting data, storing data securely in multiple locations, considering ethics and copyright, and potentially sharing data. The document provides links to Bond University's research data management toolkit and other resources to help researchers manage their data responsibly.
Aim:- To show how research data management can contribute to the success of your PhD.
*What is research data and why it is important?
*The Research Data lifecycle
* Research Data – more than just your results
* FAIR data and Open Research
* DMP online tool
This slideshow was used in a Preparing Your Research Data for the Future course taught in the Social Sciences Division, University of Oxford, on 2015-03-02. It provides an overview of some key issues, focusing on long-term data management, sharing, and curation.
Research Data Management in GLAM: Managing Data for Cultural HeritageSarah Anna Stewart
Presentation given at the 'Open Science Infrastructures for Big Cultural Data' - Advanced International Masterclass in Plovdiv, Bulgaria. Dec. 13-15, 2018
Research Data Management at the University of SalfordDavid Clay
The document summarizes the University of Salford's research data management project. It describes the drivers for the project including funder policies requiring open data. It outlines the requirements gathering and policy development process. It then details the proposed solution architecture including online storage, a data repository, source code management, and support services. Finally it discusses the pilot infrastructure launched in 2015 using Figshare and describes next steps to evaluate scaling up the RDM service.
The document discusses sharing research data through open data platforms. It describes the CGIAR as uniquely positioned to collect agricultural data worldwide and argues that most CGIAR data should be archived and shared to increase its value. However, data archiving across CGIAR centers is currently poor. The document then discusses using the Dataverse platform to improve data sharing. Dataverse allows researchers to publish, share, cite, and analyze data. It also facilitates making data available while giving credit to data authors and institutions.
Similar to Data Access & Storage @ UWA - UWA Research Week September 2017 (20)
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Aggregage
This webinar will explore cutting-edge, less familiar but powerful experimentation methodologies which address well-known limitations of standard A/B Testing. Designed for data and product leaders, this session aims to inspire the embrace of innovative approaches and provide insights into the frontiers of experimentation!
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
State of Artificial intelligence Report 2023kuntobimo2016
Artificial intelligence (AI) is a multidisciplinary field of science and engineering whose goal is to create intelligent machines.
We believe that AI will be a force multiplier on technological progress in our increasingly digital, data-driven world. This is because everything around us today, ranging from culture to consumer products, is a product of intelligence.
The State of AI Report is now in its sixth year. Consider this report as a compilation of the most interesting things we’ve seen with a goal of triggering an informed conversation about the state of AI and its implication for the future.
We consider the following key dimensions in our report:
Research: Technology breakthroughs and their capabilities.
Industry: Areas of commercial application for AI and its business impact.
Politics: Regulation of AI, its economic implications and the evolving geopolitics of AI.
Safety: Identifying and mitigating catastrophic risks that highly-capable future AI systems could pose to us.
Predictions: What we believe will happen in the next 12 months and a 2022 performance review to keep us honest.
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...sameer shah
"Join us for STATATHON, a dynamic 2-day event dedicated to exploring statistical knowledge and its real-world applications. From theory to practice, participants engage in intensive learning sessions, workshops, and challenges, fostering a deeper understanding of statistical methodologies and their significance in various fields."
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataKiwi Creative
Harness the power of AI-backed reports, benchmarking and data analysis to predict trends and detect anomalies in your marketing efforts.
Peter Caputa, CEO at Databox, reveals how you can discover the strategies and tools to increase your growth rate (and margins!).
From metrics to track to data habits to pick up, enhance your reporting for powerful insights to improve your B2B tech company's marketing.
- - -
This is the webinar recording from the June 2024 HubSpot User Group (HUG) for B2B Technology USA.
Watch the video recording at https://youtu.be/5vjwGfPN9lw
Sign up for future HUG events at https://events.hubspot.com/b2b-technology-usa/
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
11. The Data Management Plan is:
• an online form
• uses the Qualtrics survey platform
• Asks relevant questions pertaining to your research data
• Sends you an electronic plan directly to your email after
completion
Research Data Management Planning
15. The ARC and the NHMRC care about your
research data.
Funders
16. Funders
The ARC and the NHMRC care about your
research data.
Australian Code for the Responsible Conduct of Research
(NHMRC, ARC and Universities Australia 2007)
17. Funders
The ARC and the NHMRC care about your
research data.
Discovery Projects Instructions to Applicants in 2018
requires a data management statement
18. Funders
The ARC and the NHMRC care about your
research data.
ARC 2018 Funding Rules:
Follow the Australian Code for the Responsible Conduct of Research (2007).
STRONGLY ENCOURAGES the depositing of data arising from a Project in an
appropriate publically accessible subject and/or institutional repository.
19. Funders
The ARC and the NHMRC care about your
research data.
2017 Data Sharing Statement:
“NHMRC encourages data sharing and
providing access to data and other
research outputs (metadata, analysis
code, study protocols, study materials
and other collected data) arising from
NHMRC supported research”
20. Publishers
Many publishers are requiring that the
data behind your published findings are
PUBLICLY ACCESSIBLE in an institutional
data repository.
21. Publishers
Many publishers are requiring that the
data behind your published findings are
PUBLICLY ACCESSIBLE in an institutional
data repository.
PLOS, BMJ and about 100 more
22. Publishers
Many publishers are requiring that the
data behind your published findings are
PUBLICLY ACCESSIBLE in an institutional
data repository.
Via data availability statements
23. Publishers
Many publishers are requiring that the
data behind your published findings are
PUBLICLY ACCESSIBLE in an institutional
data repository.
Tables, raw data, images etc.
24. Publishers
Many publishers are requiring that the
data behind your published findings are
PUBLICLY ACCESSIBLE in an institutional
data repository.
You must provide a link
– preferably a DOI –
to the reviewers
Anyone should be able to access
that dataset at any time, without
restriction
26. DATA Journals
Data journals allow researchers to formally
publish, and gain acknowledgement for,
their research data outputs.
27. DATA Journals
Data journals allow researchers to formally
publish, and gain acknowledgement for,
their research data outputs.
Wiley’s
Geoscience
Data
Journal
Nature’s
Scientific
Data
Ubiquity’s
Journal of Open
Archaeology Data
28. DATA Journals
Data journals allow researchers to formally
publish, and gain acknowledgement for,
their research data outputs.
citation metrics for research data
outputs
32. UWA - Code
Code of Conduct for the Responsible
Practice of Research
3.7 Research data related to publications must be
available for discussion with other researchers. The
availability of such data must be recorded and managed
through the UWA Research Repository. The six Creative
Commons version 4.0 licenses recommended in AusGOAL
are the preferred licenses to be used for Open Access. CC-
BY is used wherever possible for sharing research data.
33. UWA - Code
Code of Conduct for the Responsible
Practice of Research
3.7 Research data related to publications must be
available for discussion with other researchers. The
availability of such data must be recorded and managed
through the UWA Research Repository. The six Creative
Commons version 4.0 licenses recommended in
AusGOAL are the preferred licenses to be used for Open
Access. CC-BY is used wherever possible for sharing
research data.
34. UWA - Code
Code of Conduct for the Responsible
Practice of Research
3.7 Research data related to publications must be
available for discussion with other researchers. The
availability of such data must be recorded and managed
through the UWA Research Repository. The six Creative
Commons version 4.0 licenses recommended in
AusGOAL are the preferred licenses to be used for Open
Access. CC-BY is used wherever possible for sharing
research data.
72. Case Study
Dropbox?
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
Technically, yes, but should
you?
• Contracts/Agreements with data
providers?
• Seek advice from Risk and Legal
• No charge for < 2GB ; but
>2GB=$$
• Allows access to collaborators.
• Confidential data may not be
safe. The data is being stored
overseas.
• Not managed, maintained or
stored by UWA.
73. Case Study
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
Make informed decisions
• UWA’s Records Management
Services provide guidance on
using public cloud storage.
– http://www.igs.uwa.edu.au/p
olicies/guides/auth/cloud-
storage
Dropbox?
74. Case Study
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
Make informed decisions
• University Policy on: Institutional Data Centre
• University Policy on: Records Management
• Computer and Software Use Regulations
• University Policy on: Records Management
• UWA Code of Conduct for the Responsible
Practice of Research
• UWA Recordkeeping Plan
• Western Australian University Sector Disposal
Authority
• Australian Code for the Responsible Conduct of
Research
• University’s Policy on Privacy of Electronic
Material
Dropbox?
75. Case Study
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
IRDS would be an excellent
choice
• Data is stored locally.
• The IRDS maintained and
supported by UWA (Service
Desk Support 24/7)
• Encouraged for long-term
storage and can be used to
comply with WAUSDA.
– “Research records must be
retained for a minimum of 7
years after the date of
publication or project
completion, whichever is later.”
Dropbox?
76. Case Study
IRDS vs Dropbox
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
IRDS would be an excellent
choice
• Allows for external collaborator
access
– Pheme authentication
provided to external
collaborators via
http://www.hr.uwa.edu.au/__
data/assets/pdf_file/0006/21
72606/Commencement_of_
Non-university_Staff.pdf
77. Case Study
IRDS vs Dropbox
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
IRDS would be an excellent
choice
78. Case Study
UniDrive vs Dropbox
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
UniDrive Client
1. Windows laptops
2. Off-campus desktops
3. Staff and students will
access H; drive and IRDS
shares
4. Staff will also access
UNIWA S: drive
79. Case Study
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
UniDrive Client
5. Similar functionality to
DropBox without the risk
6. Persistent connection
7. Offline Connections
8. Will appear in Windows
Explorer
UniDrive vs Dropbox
80. Case Study
PAWSEY vs Dropbox
HDR student query
Can I use Dropbox to transfer
confidential data from
international companies for
my research at UWA?
• Confidentiality?
• Is encryption ok?
• Other cloud storage options?
Our response
Pawsey Supercomputing
could be an option
• Can cope with large (‘Big Data’)
datasets and file transfer.
• Allows access to collaborators.
• Not encouraged for long-term
storage (dependent on funding).
• Not managed or maintained by
UWA.
81.
82. Data storage
• Online application process (data >5TB*) [*some exceptions]
• Designed for collaboration, not for ‘primary’ copy of data
• Access is governed by the Data Storage and Management
Policy (DSMP)
• Access to Pawsey data stores is provided by the LiveARC
storage management framework (also known as Mediaflux)
• Command line and web-interface access
92. Contacts
Questions?
Katina Toufexis
Research Data Coordinator
katina.toufexis@uwa.edu.au
6488 5319
Faculty Librarian:
Arts, Business, Law & Education Librarians
ablelibrarians-lib@uwa.edu.au
Health & Medical Sciences
hmslibrarians-lib@uwa.edu.au
Engineering & Mathematical Sciences
emslibrarians-lib@uwa.edu.au
Science
scilibrarians-lib@uwa.edu.au
Editor's Notes
The Northern Party at the South Magnetic Pole. Photographer Douglas Mawson 1909. Courtesy Mawson Collection South Australian Museum
“It is not possible to apply a uniform definition of research data across all disciplines. Research data may be numerical, textual, audio-visual, digital or physical, depending on the discipline and the nature of the research.”
You should care because if you’re organised, you can them comply with Funder, publisher, institutional policies and follow recent government announcements and initiatives.
If you’re working in a team this is imperative for reasons such as consistency and efficiency
Of course this applies to solo researchers….
If you’re working solo, having a data management plan keeps you organised
You need to think about how what you’re going to do with your data.