Presentation at the Department of Health and Human Services October 17, 2014 to introduce other agencies outside of NIH the development of the Commons concept.
#2 NCI data services - Fair data webinar 6 Sept 2017ARDC
Fair data webinar 2 - A is for Accessible
Jingbo Wang Data Collections Manager at NCI presented on how they make data accessible through services over the data so they can be interrogated and used by humans and machines.
Full webinar recording on YouTube: https://youtu.be/me27whU8GG8
SafeNet is a 2-year Jisc funded project that aims to improve assured continuity of access to e-journal content paid for by UK higher education institutions. It has two main components: an entitlement registry to hold subscription data for institutions, and a private LOCKSS network to provide assurance that paid-for content will remain available. The presentation provides details on the core service components and governance structure. It also outlines the project's goal of data gathering - to empower libraries by minimizing duplication of work and focusing on what content libraries believe they are entitled to access. Assistance is being sought from several universities on exploring the range of subscriptions, challenges, and data available to make assertions for an entitlement registry.
3rd International Conference on Big Data and Blockchain (BDAB 2022)ijbesjournal
3rd International Conference on Big Data and Blockchain (BDAB 2022) will act as a major forum for the presentation of innovative ideas, approaches, developments, and research projects in the areas of Big Data and Blockchain. It will also serve to facilitate the exchange of information between researchers and industry professionals to discuss the latest issues and advancement in the area of Big Data and Blockchain.
An Overview of Data Citation Principles Synthesis ActivityMicah Altman
The document discusses an ongoing effort to synthesize recommendations for data citation principles from various groups. It notes there is agreement on key principles like the need for citation, use of persistent identifiers, and support for access and verification. The synthesis group has identified consistent principles across recommendations and seeks to formulate a draft synthesis highlighting principles, examples, and references. The group plans to disseminate and promote adoption of an agreed upon declaration of principles.
This presentation, given by Bob Jones, CERN & HNSciCloud Coordinator, at the ESA-ESPI Workshop on “Space Data & Cloud Computing Infrastructures: Policies and Regulations”, describes what are the challenges and needs of the cloud users and explains how an hybrid cloud model can support them.
The document discusses the need for a "Data Commons" platform to foster sharing and analysis of biomedical big data. It describes how such a platform could treat research data, software and methods as digital assets within a shared virtual space. This would allow different parties to find, deposit, manage, share and reuse digital assets. The platform would form the basis of a digital ecosystem enabling interactions between producers and consumers of research outputs. It provides an overview of an NIH pilot project to test initial implementation of a Data Commons through co-locating data and tools on storage and computing infrastructure.
The document introduces COBWEB, a research project that develops a crowdsourcing infrastructure for collecting and analyzing environmental data provided by citizens. The project aims to address data quality issues and support policy decisions. It has several pilot sites and partners, including UNESCO biosphere reserves. The framework includes mobile apps, QA processes, and a portal to view and analyze citizen-submitted data. It uses open standards and aims to be customizable for different use cases involving topics like biological monitoring and flooding.
#2 NCI data services - Fair data webinar 6 Sept 2017ARDC
Fair data webinar 2 - A is for Accessible
Jingbo Wang Data Collections Manager at NCI presented on how they make data accessible through services over the data so they can be interrogated and used by humans and machines.
Full webinar recording on YouTube: https://youtu.be/me27whU8GG8
SafeNet is a 2-year Jisc funded project that aims to improve assured continuity of access to e-journal content paid for by UK higher education institutions. It has two main components: an entitlement registry to hold subscription data for institutions, and a private LOCKSS network to provide assurance that paid-for content will remain available. The presentation provides details on the core service components and governance structure. It also outlines the project's goal of data gathering - to empower libraries by minimizing duplication of work and focusing on what content libraries believe they are entitled to access. Assistance is being sought from several universities on exploring the range of subscriptions, challenges, and data available to make assertions for an entitlement registry.
3rd International Conference on Big Data and Blockchain (BDAB 2022)ijbesjournal
3rd International Conference on Big Data and Blockchain (BDAB 2022) will act as a major forum for the presentation of innovative ideas, approaches, developments, and research projects in the areas of Big Data and Blockchain. It will also serve to facilitate the exchange of information between researchers and industry professionals to discuss the latest issues and advancement in the area of Big Data and Blockchain.
An Overview of Data Citation Principles Synthesis ActivityMicah Altman
The document discusses an ongoing effort to synthesize recommendations for data citation principles from various groups. It notes there is agreement on key principles like the need for citation, use of persistent identifiers, and support for access and verification. The synthesis group has identified consistent principles across recommendations and seeks to formulate a draft synthesis highlighting principles, examples, and references. The group plans to disseminate and promote adoption of an agreed upon declaration of principles.
This presentation, given by Bob Jones, CERN & HNSciCloud Coordinator, at the ESA-ESPI Workshop on “Space Data & Cloud Computing Infrastructures: Policies and Regulations”, describes what are the challenges and needs of the cloud users and explains how an hybrid cloud model can support them.
The document discusses the need for a "Data Commons" platform to foster sharing and analysis of biomedical big data. It describes how such a platform could treat research data, software and methods as digital assets within a shared virtual space. This would allow different parties to find, deposit, manage, share and reuse digital assets. The platform would form the basis of a digital ecosystem enabling interactions between producers and consumers of research outputs. It provides an overview of an NIH pilot project to test initial implementation of a Data Commons through co-locating data and tools on storage and computing infrastructure.
The document introduces COBWEB, a research project that develops a crowdsourcing infrastructure for collecting and analyzing environmental data provided by citizens. The project aims to address data quality issues and support policy decisions. It has several pilot sites and partners, including UNESCO biosphere reserves. The framework includes mobile apps, QA processes, and a portal to view and analyze citizen-submitted data. It uses open standards and aims to be customizable for different use cases involving topics like biological monitoring and flooding.
DIRISA aims to build a national data infrastructure in South Africa with tiered data services. It will maintain Tier 1 data nodes with 8PB of storage and provide research data management services. DIRISA also focuses on developing skills through training programs. Its goals are to support open data and open science by enabling data preservation, sharing and processing in a reliable and controlled manner.
Presentation on INSPIRE and Higher Education (1 of 2)JISC GECO
Presentation designed to explain the relationship between academic data and the EU INSPIRE Directive. Produced by staff from EDINA and the Digital Curation Centre.
Bob Jones, CERN & HNSciCloud Coordinator gives an update on the HNSciCloud Pre-Commercial Procurement which is now in its Solution Prototyping phase. The presentation includes also an overview of the prototypes under development.
LIFE3: Predicting Long Term Preservation Costs, by Brian HoleJISC KeepIt project
This presentation describes LIFE, a tool for evaluating and predicting the costs of managing a digital object over its full lifecycle. It was given as part of module 2 of a 5-module course on digital preservation tools for repository managers, presented by the JISC KeepIt project. For more on this and other presentations in this course look for the tag 'KeepIt course' in the project blog http://blogs.ecs.soton.ac.uk/keepit/
LIFE3: Predicting Long Term Preservation Costs, Brian Holeguest128c9a
This presentation describes LIFE, a tool for evaluating and predicting the costs of managing a digital object over its full lifecycle. It was given as part of module 2 of a 5-module course on digital preservation tools for repository managers, presented by the JISC KeepIt project. For more on this and other presentations in this course look for the tag 'KeepIt course' in the project blog http://blogs.ecs.soton.ac.uk/keepit/
8th International Conference on Bioscience & Engineering (BIOE 2022)ijbesjournal
8th International Conference on Bioscience & Engineering (BIOE 2022) will provide an excellent international forum for sharing knowledge and results, in theory, methodology and applications impacts and challenges of Bioscience and Engineering. The conference documents practical and theoretical results which make a fundamental contribution to the development of Bioscience and Engineering. The aim of the conference is to provide a platform to the researchers and practitioners from both academia as well as industry to meet and share cutting-edge development in the field.
The XDC project aims to develop scalable data management technologies for distributed computing environments. It will improve existing federated data management services by adding new functionalities requested by research communities. These include intelligent dataset distribution, policy-driven data orchestration, data preprocessing, smart caching, and enhanced metadata management. The project is co-funded by Horizon2020 and involves 8 partners representing 7 European countries and 7 research communities.
The document discusses BONSAI, an open access database and open source software for industrial ecology. Industrial ecology studies material and energy flows through industrial systems. BONSAI aims to evaluate the environmental impacts of industrial networks by providing interoperable unit process data. The author notes improvements that could be made to BONSAI summer schools, including distributing more reading material in advance and focusing more on hands-on activities. BONSAI also wishes to address current bottlenecks in industrial ecology by allowing crowd-based data review, harmonizing ontologies and metadata, and organizing user communities on the European Open Science Cloud.
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Sky Bristol
Presentation on one of the strategic themes being considered for a U.S. Government Big Data R&D strategy - https://www.nitrd.gov/bigdata/rfi/02102014.aspx.
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17ARDC
Margie Smith
Full Webinar: https://youtu.be/EDhJTCm9RN8
Transcript: https://www.slideshare.net/AustralianNationalDataService/transcript-4-fair-r-for-reusable
Other webinars in the series: http://www.ands.org.au/news-and-events/events/fair-webinar-series
The document summarizes the Ticer Summer School held on August 24th, 2006. It discusses topics related to digital libraries, grids, e-science, and data management. Examples are provided of different e-science projects that utilize grids for data-intensive applications in domains such as climate modeling, biomedicine, and high-energy physics. Requirements for users and owners of data resources on grids are also outlined.
The MIMAS workshop discussed the RepositoryNet infrastructure and components including aggregation, text mining, search, benchmarking and statistics, registries, deposit tools, and metadata quality. It provided updates on components outside RepositoryNet like IRS Search and NAMES 2. A demonstration of IRUS showed its current functionality for benchmarking and statistics and future plans for funding, APIs, international scope, and business models. Developing service level agreements for RepNet services was also discussed.
Full Webinar: https://youtu.be/EDhJTCm9RN8
Transcript: https://www.slideshare.net/AustralianNationalDataService/transcript-4-fair-r-for-reusable
Other webinars in the series: http://www.ands.org.au/news-and-events/events/fair-webinar-series
Citizen Observatory Framework with Access Management Federation in GEOSS - Ba...COBWEB Project
Presentation given by Bart De Lathouwer (Interoperability Program, OGC and COBWEB Project) on Thursday 10th October, at the ENVIP'2013 Workshop, part of ISESS (International Symposium on Environmental Software Systems) 2013 in Neusiedl am See, Austria.
Find out more about the COBWEB Project at:
http://cobwebproject.eu/dissemination/
The Helix Nebula Science Cloud (HNSciCloud) project is a pre-commercial procurement action funded by the EU Horizon 2020 programme to develop an innovative hybrid cloud platform for the European research community. A group of procuring organizations, including CERN, have committed procurement funds, resources for testing, use cases, and in-house IT infrastructure. The multi-phase tender process will result in the development of integrated cloud services accessible to end-users. Challenges include providing compute, storage, networking, and payment models to support scientific workloads using datasets in the petabyte range across the procurers' resources and public e-infrastructure.
Demonstrating a Framework for KOS-based Recommendations SystemsGESIS
This document describes a framework for knowledge organization system (KOS)-based recommendation systems. It discusses two funded projects (IRM I and IRM II) that aimed to implement value-added information retrieval services for digital libraries based on applying scholarly models. These services include term suggestion, query suggestion, and bibliometric analysis. The document outlines the Information Retrieval Service Assessment (IRSA) component, which calculates search term suggestions using co-occurrence analysis of controlled vocabularies harvested via OAI-PMH. It demonstrates the IRSA system and discusses limitations and references.
1. The document discusses some early observations from the Associate Director for Data Science at the National Institutes of Health regarding data at NIH.
2. It notes that NIH does not fully understand how existing data is used, has focused more on why data should be shared rather than how to share it, and lacks plans for long-term sustainability of data.
3. Potential solutions discussed include developing a biomedical commons, modifying the review process, improving education in data science, and expanding the Big Data to Knowledge initiative. The goal is to create a digital research enterprise that better connects all aspects of the research lifecycle.
One Scientist’s Wish List for Scientific PublishersPhilip Bourne
1. The document summarizes the speaker's wish list and vision for improving scientific publishing and communication. The speaker advocates for more open access to literature and data, better integration of literature and data, and ensuring reproducibility through sharing of workflows, source code, and data.
2. The speaker discusses experiments with rich media formats like video to enhance scholarly communication. The goal is to leverage new technologies and better link literature, data, and methods.
3. The current reward system in academia does not adequately incentivize open and reproducible science. New models are needed that reward things like maintaining databases, curating data, and developing community resources.
DIRISA aims to build a national data infrastructure in South Africa with tiered data services. It will maintain Tier 1 data nodes with 8PB of storage and provide research data management services. DIRISA also focuses on developing skills through training programs. Its goals are to support open data and open science by enabling data preservation, sharing and processing in a reliable and controlled manner.
Presentation on INSPIRE and Higher Education (1 of 2)JISC GECO
Presentation designed to explain the relationship between academic data and the EU INSPIRE Directive. Produced by staff from EDINA and the Digital Curation Centre.
Bob Jones, CERN & HNSciCloud Coordinator gives an update on the HNSciCloud Pre-Commercial Procurement which is now in its Solution Prototyping phase. The presentation includes also an overview of the prototypes under development.
LIFE3: Predicting Long Term Preservation Costs, by Brian HoleJISC KeepIt project
This presentation describes LIFE, a tool for evaluating and predicting the costs of managing a digital object over its full lifecycle. It was given as part of module 2 of a 5-module course on digital preservation tools for repository managers, presented by the JISC KeepIt project. For more on this and other presentations in this course look for the tag 'KeepIt course' in the project blog http://blogs.ecs.soton.ac.uk/keepit/
LIFE3: Predicting Long Term Preservation Costs, Brian Holeguest128c9a
This presentation describes LIFE, a tool for evaluating and predicting the costs of managing a digital object over its full lifecycle. It was given as part of module 2 of a 5-module course on digital preservation tools for repository managers, presented by the JISC KeepIt project. For more on this and other presentations in this course look for the tag 'KeepIt course' in the project blog http://blogs.ecs.soton.ac.uk/keepit/
8th International Conference on Bioscience & Engineering (BIOE 2022)ijbesjournal
8th International Conference on Bioscience & Engineering (BIOE 2022) will provide an excellent international forum for sharing knowledge and results, in theory, methodology and applications impacts and challenges of Bioscience and Engineering. The conference documents practical and theoretical results which make a fundamental contribution to the development of Bioscience and Engineering. The aim of the conference is to provide a platform to the researchers and practitioners from both academia as well as industry to meet and share cutting-edge development in the field.
The XDC project aims to develop scalable data management technologies for distributed computing environments. It will improve existing federated data management services by adding new functionalities requested by research communities. These include intelligent dataset distribution, policy-driven data orchestration, data preprocessing, smart caching, and enhanced metadata management. The project is co-funded by Horizon2020 and involves 8 partners representing 7 European countries and 7 research communities.
The document discusses BONSAI, an open access database and open source software for industrial ecology. Industrial ecology studies material and energy flows through industrial systems. BONSAI aims to evaluate the environmental impacts of industrial networks by providing interoperable unit process data. The author notes improvements that could be made to BONSAI summer schools, including distributing more reading material in advance and focusing more on hands-on activities. BONSAI also wishes to address current bottlenecks in industrial ecology by allowing crowd-based data review, harmonizing ontologies and metadata, and organizing user communities on the European Open Science Cloud.
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Sky Bristol
Presentation on one of the strategic themes being considered for a U.S. Government Big Data R&D strategy - https://www.nitrd.gov/bigdata/rfi/02102014.aspx.
#4 FAIR - Provenance as an element of FAIR data principles - 20-09-17ARDC
Margie Smith
Full Webinar: https://youtu.be/EDhJTCm9RN8
Transcript: https://www.slideshare.net/AustralianNationalDataService/transcript-4-fair-r-for-reusable
Other webinars in the series: http://www.ands.org.au/news-and-events/events/fair-webinar-series
The document summarizes the Ticer Summer School held on August 24th, 2006. It discusses topics related to digital libraries, grids, e-science, and data management. Examples are provided of different e-science projects that utilize grids for data-intensive applications in domains such as climate modeling, biomedicine, and high-energy physics. Requirements for users and owners of data resources on grids are also outlined.
The MIMAS workshop discussed the RepositoryNet infrastructure and components including aggregation, text mining, search, benchmarking and statistics, registries, deposit tools, and metadata quality. It provided updates on components outside RepositoryNet like IRS Search and NAMES 2. A demonstration of IRUS showed its current functionality for benchmarking and statistics and future plans for funding, APIs, international scope, and business models. Developing service level agreements for RepNet services was also discussed.
Full Webinar: https://youtu.be/EDhJTCm9RN8
Transcript: https://www.slideshare.net/AustralianNationalDataService/transcript-4-fair-r-for-reusable
Other webinars in the series: http://www.ands.org.au/news-and-events/events/fair-webinar-series
Citizen Observatory Framework with Access Management Federation in GEOSS - Ba...COBWEB Project
Presentation given by Bart De Lathouwer (Interoperability Program, OGC and COBWEB Project) on Thursday 10th October, at the ENVIP'2013 Workshop, part of ISESS (International Symposium on Environmental Software Systems) 2013 in Neusiedl am See, Austria.
Find out more about the COBWEB Project at:
http://cobwebproject.eu/dissemination/
The Helix Nebula Science Cloud (HNSciCloud) project is a pre-commercial procurement action funded by the EU Horizon 2020 programme to develop an innovative hybrid cloud platform for the European research community. A group of procuring organizations, including CERN, have committed procurement funds, resources for testing, use cases, and in-house IT infrastructure. The multi-phase tender process will result in the development of integrated cloud services accessible to end-users. Challenges include providing compute, storage, networking, and payment models to support scientific workloads using datasets in the petabyte range across the procurers' resources and public e-infrastructure.
Demonstrating a Framework for KOS-based Recommendations SystemsGESIS
This document describes a framework for knowledge organization system (KOS)-based recommendation systems. It discusses two funded projects (IRM I and IRM II) that aimed to implement value-added information retrieval services for digital libraries based on applying scholarly models. These services include term suggestion, query suggestion, and bibliometric analysis. The document outlines the Information Retrieval Service Assessment (IRSA) component, which calculates search term suggestions using co-occurrence analysis of controlled vocabularies harvested via OAI-PMH. It demonstrates the IRSA system and discusses limitations and references.
1. The document discusses some early observations from the Associate Director for Data Science at the National Institutes of Health regarding data at NIH.
2. It notes that NIH does not fully understand how existing data is used, has focused more on why data should be shared rather than how to share it, and lacks plans for long-term sustainability of data.
3. Potential solutions discussed include developing a biomedical commons, modifying the review process, improving education in data science, and expanding the Big Data to Knowledge initiative. The goal is to create a digital research enterprise that better connects all aspects of the research lifecycle.
One Scientist’s Wish List for Scientific PublishersPhilip Bourne
1. The document summarizes the speaker's wish list and vision for improving scientific publishing and communication. The speaker advocates for more open access to literature and data, better integration of literature and data, and ensuring reproducibility through sharing of workflows, source code, and data.
2. The speaker discusses experiments with rich media formats like video to enhance scholarly communication. The goal is to leverage new technologies and better link literature, data, and methods.
3. The current reward system in academia does not adequately incentivize open and reproducible science. New models are needed that reward things like maintaining databases, curating data, and developing community resources.
The document summarizes NIH's approach to data science and the ADDS mission. It discusses establishing a data ecosystem through community, policy, and infrastructure. The goals are to foster sustainability, efficiency, collaboration, reproducibility, and accessibility. NIH plans to seed the ecosystem through existing resources and funding. Example initiatives include establishing a data commons, standards, and training programs to develop a diverse data science workforce. The overall aim is to support a "digital enterprise" that enhances biomedical research and health outcomes.
Training Quantitative Scientists for Biomedical Science Through the BD2K Init...Philip Bourne
The NIH remains committed to training the next generation of biomedical scientists reflecting the scientific needs of the funded research. The NIH agrees that the training of quantitative scientists is critical to continuing the advancement of biomedical science, especially in the era of big data and complex genomic data. We will describe efforts in this direction through the Big Data to Knowledge BD2K Initiative.
Presented at the AAAS Meeting, San Jose, CA, February 16, 2015.
Regional Student Group NBIC Career Presentation April 18, 2011Philip Bourne
Talk given to the Regional Student Group (RSG) of the Netherlands BioInformatics Center annual meeting on April 18, 2011. It is one scientists career path in bioinformatics.
This document outlines Philip Bourne's vision for data science at NIH through 2020. It discusses the commitment to continuing the Big Data to Knowledge (BD2K) program, and views BD2K as part of a broader data science strategy. This includes a vibrant research program, developing a sustainable data ecosystem around the FAIR principles and commons, increased workforce training, and a changing governance model. The goal is new innovations from large data, evidence of real applications, broad commons adoption improving sharing and reuse, and policies supporting an effective balance between data spending and gains.
Biomedical Research as an Open Digital EnterprisePhilip Bourne
The document discusses the challenges and opportunities facing biomedical research as it transitions to becoming a fully digital open enterprise. It notes issues around reproducibility, limited funding, and the need to better connect different elements of the research lifecycle like data capture, analysis, and publication. The author proposes the "Commons" as a conceptual framework to help address these issues by providing shared resources like cloud-based storage and computing, tools to discover and access data and software, and standards to improve reproducibility. The goal is to foster an ecosystem that maximizes the benefits of digital technologies for biomedical research.
Moving Forward with Open Data Science - SWOT AnalysisPhilip Bourne
This document summarizes the strengths, weaknesses, opportunities, and threats of open data science based on a symposium discussion. The strengths included government support for open data and science, contributions from young people, and pre-competitive engagement from the commercial sector. Weaknesses consisted of needs for better search capabilities, overcoming cultural gaps, issues with existing business models and academic evaluation. Opportunities involved openness enabling more advancement, potential for public health impacts, and emerging open platforms and partnerships. Threats centered around incentive and reward structures, potential rollbacks of initiatives, and risks of recentralization.
The document proposes the creation of a federated cloud computing platform called "The Commons" to support biomedical data sharing and analysis across multiple cloud providers. Key points:
- The Commons would index metadata and digital objects across conformant public and private cloud providers.
- It would be funded by providing credits to investigators for storage and computing, creating competition among providers to offer better services at lower costs.
- A phased implementation is outlined to initially involve experienced users and later expand to all NIH grantees.
NIH Data Initiatives: Harnessing Big (and small) Data to Improve Health
Presentation at the internet2 Global Forum, April 28, 2015
Session NIH Perspectives
The Commons: Leveraging the Power of the Cloud for Big DataPhilip Bourne
The document discusses the need for a Commons framework to leverage cloud computing for big data in biomedicine. It describes key principles of the Commons, including supporting a digital ecosystem, treating research outputs as digital objects, and ensuring objects are FAIR (findable, accessible, interoperable, and reusable). The Commons framework exploits cloud technologies to provide access to data and tools through APIs and containers. Current pilots applying this framework include the Cloud Credits Model, BD2K Centers, model organism databases, the Human Microbiome Project, and NCI cancer genomics data. The goal is to make large biomedical datasets and associated tools broadly available for research in a standardized, interoperable manner.
STI 2022 - Generating large-scale network analyses of scientific landscapes i...Michele Pasin
The growth of large, programatically accessible bibliometrics databases presents new opportunities for complex analyses of publication metadata. In addition to providing a wealth of information about authors and institutions, databases such as those provided by Dimensions also provide conceptual information and links to entities such as grants, funders and patents. However, data is not the only challenge in evaluating patterns in scholarly work: These large datasets can be challenging to integrate, particularly for those unfamiliar with the complex schemas necessary for accommodating such heterogeneous information, and those most comfortable with data mining may not be as experienced in data visualisation. Here, we present an open-source Python library that streamlines the process accessing and diagramming subsets of the Dimensions on Google BigQuery database and demonstrate its use on the freely available Dimensions COVID-19 dataset. We are optimistic that this tool will expand access to this valuable information by streamlining what would otherwise be multiple complex technical tasks, enabling more researchers to examine patterns in research focus and collaboration over time.
- The document discusses challenges related to biomedical data including that data is growing rapidly, stored across silos, and expensive to maintain while demands for sharing are increasing. It also notes a lack of data science skills.
- Solutions explored include developing the NIH Commons, which would integrate disparate cloud initiatives using BD2K standards to make data findable, accessible, interoperable and reusable. This could enable new insights from aggregate analysis across datasets.
- A 3-year BD2K-sponsored pilot of the Commons is underway to address questions around discoveries, productivity, reproducibility and cost-effectiveness compared to current approaches. The pilot involves moving model organism databases to the Commons as a test case.
The document summarizes a training event on trusted digital repositories that took place in Vilnius, Lithuania from October 1-5, 2007. It discusses key initiatives and standards for digital preservation and certification of trusted repositories, including the nestor criteria catalogue. The nestor network aims to create a framework of trustworthy long-term digital archives. The document outlines criteria for organizational framework, object management, and infrastructure/security that repositories should address.
The NIH Data Commons - BD2K All Hands Meeting 2015Vivien Bonazzi
Presentation given at the BD2K All Hands meeting in Bethesda, MD, USA in November 2015
https://datascience.nih.gov/bd2k/events/NOV2015-AllHands
Video cast of this presentation:
http://videocast.nih.gov/summary.asp?Live=17480&bhcp=1
talk starts at 2hrs 40min (its about 55mins long) - includes video!
Document describing the Commons : https://datascience.nih.gov/commons
NDS Relevant Update from the NIH Data Science (ADDS) OfficePhilip Bourne
This document summarizes a presentation given by Dr. Phil Bourne on the National Data Science (NDS) initiative and the National Institutes of Health (NIH) All of Us Data and Science (ADDS) office. The presentation discusses how NDS can succeed by defining clear problems, starting with pilots, and developing sustainable applications. It then outlines ADDS's mission to accelerate biomedical research through an open data ecosystem. ADDS's strategy focuses on discovery, workforce development, policy, leadership, and sustainability through developing a shared "Commons" of digital research objects in the cloud. Pilot projects are evaluating this Commons framework and populating it with datasets and tools.
Big Data as a Catalyst for Collaboration & InnovationPhilip Bourne
Big data is disrupting biomedical research through digitization of data sources. The National Institutes of Health (NIH) launched the Big Data to Knowledge (BD2K) initiative to support this disruption. BD2K funds various programs including data sharing policies, data science training, and the development of shared infrastructure and standards. This infrastructure includes the "Commons" which would provide discoverable, accessible, interoperable and reusable research objects to catalyze collaboration using open APIs and computing platforms. SRP could interact with BD2K through initiatives like open science competitions, data standards development, and leadership in trans-NIH big data efforts.
The document discusses the need for an NIH Data Commons to address challenges with data sharing and storage. It describes how factors like increasing data volumes, availability of cloud technologies, and emphasis on FAIR data principles are driving the need for a centralized data platform. The proposed NIH Data Commons would provide findable, accessible, interoperable and reusable data through cloud-based services and tools. It would enable data-driven science by facilitating discovery, access and analysis of biomedical data across different sources. Plans are outlined to develop and test an initial Data Commons pilot using existing genomic and other biomedical datasets.
This document summarizes an update on the Big Data to Knowledge (BD2K) initiative at the National Institutes of Health (NIH). It discusses progress made in the first year of BD2K funding in three key areas: advancing data science research through centers and targeted awards; sharing data and software through the development of indexing tools and standards; and expanding training programs. It outlines funding amounts and recipient numbers for fiscal year 2015. Future plans are outlined through 2021 with the goals of further developing tools and applications, expanding the data sharing commons, and increasing training and sustainability efforts.
The document provides an overview of the development of the NIH Data Commons. It discusses factors driving the need for a data commons, including large amounts of data being generated and increased support for data sharing. It outlines the goals of making data findable, accessible, interoperable and reusable. Several pilots are exploring the feasibility of the commons framework, including placing large datasets in the cloud and developing indexing methods. Considerations in fully realizing the commons are also discussed, such as standards, discoverability, policies and incentives.
This document discusses four key lessons learned from the Digging Into Data Challenge, which funded computational research projects in the humanities and social sciences.
Lesson 1 is that these projects require open sharing of resources like hardware, software, data, and communication tools. Information professionals can facilitate partnerships and resource sharing agreements. Lesson 2 is that these projects rely on diverse expertise in domains, analytics, data management, and project management that information professionals can provide. Lesson 3 is that computational tools need to be adapted to evolving research questions through close and distant readings. Lesson 4 is that humanities and social science research now deals with "big data" and produces large datasets that information professionals can help curate and preserve long-
Research data management: from policy to practice with DMP OnlineMartin Donnelly
The document summarizes a presentation given by Martin Donnelly and Sarah Jones of the Digital Curation Centre (DCC) on research data management. It discusses the DCC's role in developing tools like DMP Online to help researchers create data management plans. DMP Online allows users to create and update plans that meet funder requirements, and receive guidance on best practices. The presentation highlights the DCC's collaborations with funders and institutions to develop templates and provide support for putting data management policies into practice.
The document summarizes a presentation given by Martin Donnelly and Sarah Jones of the Digital Curation Centre (DCC) on research data management. It discusses the DCC's role in developing tools like DMP Online to help researchers create data management plans. DMP Online allows users to create and update plans that meet funder requirements, and receive guidance on best practices. The presentation highlights the DCC's collaborations with funders and institutions to develop templates and provide support for putting data management policies into practice.
Philip Bourne presented on the NIH's Big Data to Knowledge (BD2K) initiative and the Associate Director for Data Science (ADDS) office. The goals of BD2K are to use data science to accelerate biomedical research and enhance health outcomes. BD2K supports various centers, projects, and training programs related to data discovery, standards, cloud computing, sustainability, and workforce development. The ADDS office oversees BD2K and aims to establish a sustainable data science ecosystem and well-trained workforce to enable major scientific discoveries through data-driven research.
Data Science and AI in Biomedicine: The World has ChangedPhilip Bourne
This document discusses the changing landscape of data science and AI in biomedicine. Some key points:
- We are at a tipping point where data science is becoming a driver of biomedical research rather than just a tool. Biomedical researchers need to become data scientists.
- Data science is interdisciplinary and touches every field due to the rise of digital data. It requires openness, translation of findings, and consideration of responsibilities like algorithmic bias.
- Advances like AlphaFold2 show the power of large collaborative efforts combining data, computing resources, engineering, and domain expertise. This points to the need for public-private partnerships and new models of open data sharing.
- The definition of
AI in Medical Education A Meta View to Start a ConversationPhilip Bourne
- AI has the potential to significantly impact medical education and healthcare.
- Chatbots and large language models can provide a rich training ground for students to learn, while augmented reality may change the student-patient dynamic.
- AI tools like predictive analytics and imaging analysis can assist in research, diagnosis, and personalized treatment, but models are still limited and education of implications is needed.
- If developed responsibly with oversight, AI could help democratize healthcare and create new industries, but history shows technology disruptions can also lead to deception if misused. The impacts and timeline of AI in medicine remain uncertain.
AI+ Now and Then How Did We Get Here And Where Are We GoingPhilip Bourne
The document discusses the past, present, and future of artificial intelligence (AI). It describes how AI has advanced due to increases in data and improvements in algorithms and computing technology. An example of AI, ChatGPT, is discussed as using large language models, pre-training, and transformers to generate language. The future of AI is uncertain but could involve neural networks that mimic the brain more closely. AI may disrupt many industries like education and research in the coming years or decades through forces of digitization, disruption, and other factors. The impacts and timeline of AI progress are difficult to predict precisely.
Thoughts on Biological Data SustainabilityPhilip Bourne
This document discusses approaches to improving biological data sustainability. It proposes moving from the current BDS 1.0 model to a BDS 2.0 model. BDS 1.0 is characterized by increasing data and costs but decreasing funds for innovation. BDS 2.0 would recognize the monetary value of data and embrace public-private partnerships and a data economy. It suggests a "data credits" system where data curation is a service with monetary value. The document provides examples of how this could work for the Protein Data Bank (PDB) and more globally. It argues BDS 2.0 could encourage competition, globalization, and private sector engagement to better foster sustainable and FAIR biological data.
The document discusses FAIR data and its importance. FAIR stands for Findable, Accessible, Interoperable, and Reusable. The author argues that data science is becoming a major driver in many fields due to the large amounts of digital data being created. For data and data science to reach their full potential, data needs to be FAIR so it can be easily discovered, accessed, integrated and reused. An example is given of a researcher combining health and vehicle crash data using techniques from data science to improve emergency care. Making data FAIR enables greater collaboration, public-private partnerships and opportunities for translation.
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
Data science is driving major changes in biomedical research by enabling new types of integrative, multi-scale analyses. However, biomedical research may no longer lead data science due to a lack of comprehensive data infrastructure and cultural barriers. Responsible data science that balances openness, ethics, and benefiting patients could help establish biomedicine's continued leadership role. Major challenges include limited resources, attracting diverse talent, and prioritizing strategic initiatives over conforming to traditional models of research.
Presented online as part of the NASM series in Advancing Drug Discovery see https://www.nationalacademies.org/event/40883_09-2023_advancing-drug-discovery-data-science-meets-drug-discovery
Biomedical Data Science: We Are Not AlonePhilip Bourne
This document discusses biomedical data science and the opportunities and challenges presented by new developments in data science. Some key points:
- We are at a tipping point where biomedical research is no longer the sole leader in data science due to advances in many other fields. Biomedical researchers need to become data scientists to stay relevant.
- Data science is being driven by the massive growth of digital data and requires an interdisciplinary approach. It is touching every field and attracting many students.
- Developing effective data systems and infrastructure is a major challenge to enable open sharing and analysis of data. Initiatives are underway but more collaboration is needed across sectors.
- Advances in machine learning, like Alpha
BIMS7100-2023. Social Responsibility in ResearchPhilip Bourne
Social responsibility in research refers to conducting studies that benefit society while avoiding harm. It involves considering risks and benefits to human and animal subjects, ensuring transparency and integrity, and engaging stakeholders. Socially responsible research also addresses equity, diversity and inclusion. Data sharing is an important aspect of social responsibility, as it enables reproducibility and collaborative research. However, data must be shared in a FAIR manner and maintained over time to realize its full benefits. Researchers should consider social responsibility throughout the entire research lifecycle.
What Data Science Will Mean to You - One Person's ViewPhilip Bourne
This document provides an overview of data science from the perspective of Philip Bourne. Some key points:
- Data science is disruptive to higher education and all disciplines are being impacted by large amounts of digital data.
- Data science can be defined using a 4+1 model focusing on value, design, systems, analytics, and practice.
- Principles of excellence, inclusivity, openness, and fairness should guide data science work.
- Lessons from advances in computational biology and AlphaFold2 show the importance of open data, collaboration, and engineering challenges.
- A data science school should focus on responsible data practices while balancing open research that benefits patients.
The document provides an overview of the School of Data Science at the University of Virginia and its approach to collaborating with Novo Nordisk on diabetes research. It discusses that the School of Data Science aims to catalyze discovery through interdisciplinary research, educate a diverse workforce, and serve the community by applying data science. It also provides examples of using artificial intelligence to recognize patterns related to diabetes and details potential areas of collaboration between the School and Novo Nordisk, including student projects, visiting fellows, faculty partnerships, and PhD mentorship.
Towards a US Open research Commons (ORC)Philip Bourne
On August 2nd, 2021, US scientists and officials met to discuss establishing a US Open Research Commons (ORC) to make research data and computing resources more accessible and interoperable across public and private sectors. Currently, US resources are siloed and limited in discoverability. Other countries have established similar initiatives that the US is not formally represented in. An ORC could pool resources to benefit a more diverse group of researchers in addressing societal challenges, but establishing one requires overcoming cultural and institutional barriers between agencies through policy leadership. Immediate action is needed for the US to remain competitive in open science.
This document discusses opportunities for precision education arising from the move to digital education during the COVID pandemic. It notes that for the first time, essentially all educational materials were digital, creating opportunities to make content findable, accessible, interoperable, and reusable. This could improve content quality through transparency and ratings similar to academic publishing. It also enables aggregated views of content and student performance, improved content and syllabi, and recommender systems. Challenges include issues around content ownership, sharing rules, bias, privacy, security, and adoption of next generation learning management systems.
The document discusses the rise of data science and its disruptive impact on higher education. It analyzes precedents like bioinformatics that were enabled by new digital data sources and technologies. The author advocates that universities should embrace data science by establishing interdisciplinary collaborations, investing in data infrastructure, and ensuring research has societal value and responsibility.
Philip Bourne presented on how data can advance sustainability. He discussed how high throughput DNA digital data changed biomedicine and spawned the new field of data science. Data science now touches all domains, including helping achieve UN Goal 10 of reducing inequalities through projects like using data to understand the history of Native American displacement. While data presents endless opportunities, it also has weaknesses like being messy and non-conclusive, and threats like bias and lack of training. Bourne advocates for building trust through evidence and creating an open data environment to realize data's potential, while acknowledging that sustaining open data faces challenges around proprietary concerns, security, and driving social change.
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
3 basic points when establishing a new biomedical initiative. Presented at Frontiers of Computing in Health and Society, George Mason University, September 21, 2021.
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...PECB
Denis is a dynamic and results-driven Chief Information Officer (CIO) with a distinguished career spanning information systems analysis and technical project management. With a proven track record of spearheading the design and delivery of cutting-edge Information Management solutions, he has consistently elevated business operations, streamlined reporting functions, and maximized process efficiency.
Certified as an ISO/IEC 27001: Information Security Management Systems (ISMS) Lead Implementer, Data Protection Officer, and Cyber Risks Analyst, Denis brings a heightened focus on data security, privacy, and cyber resilience to every endeavor.
His expertise extends across a diverse spectrum of reporting, database, and web development applications, underpinned by an exceptional grasp of data storage and virtualization technologies. His proficiency in application testing, database administration, and data cleansing ensures seamless execution of complex projects.
What sets Denis apart is his comprehensive understanding of Business and Systems Analysis technologies, honed through involvement in all phases of the Software Development Lifecycle (SDLC). From meticulous requirements gathering to precise analysis, innovative design, rigorous development, thorough testing, and successful implementation, he has consistently delivered exceptional results.
Throughout his career, he has taken on multifaceted roles, from leading technical project management teams to owning solutions that drive operational excellence. His conscientious and proactive approach is unwavering, whether he is working independently or collaboratively within a team. His ability to connect with colleagues on a personal level underscores his commitment to fostering a harmonious and productive workplace environment.
Date: May 29, 2024
Tags: Information Security, ISO/IEC 27001, ISO/IEC 42001, Artificial Intelligence, GDPR
-------------------------------------------------------------------------------
Find out more about ISO training and certification services
Training: ISO/IEC 27001 Information Security Management System - EN | PECB
ISO/IEC 42001 Artificial Intelligence Management System - EN | PECB
General Data Protection Regulation (GDPR) - Training Courses - EN | PECB
Webinars: https://pecb.com/webinars
Article: https://pecb.com/article
-------------------------------------------------------------------------------
For more information about PECB:
Website: https://pecb.com/
LinkedIn: https://www.linkedin.com/company/pecb/
Facebook: https://www.facebook.com/PECBInternational/
Slideshare: http://www.slideshare.net/PECBCERTIFICATION
हिंदी वर्णमाला पीपीटी, hindi alphabet PPT presentation, hindi varnamala PPT, Hindi Varnamala pdf, हिंदी स्वर, हिंदी व्यंजन, sikhiye hindi varnmala, dr. mulla adam ali, hindi language and literature, hindi alphabet with drawing, hindi alphabet pdf, hindi varnamala for childrens, hindi language, hindi varnamala practice for kids, https://www.drmullaadamali.com
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxEduSkills OECD
Iván Bornacelly, Policy Analyst at the OECD Centre for Skills, OECD, presents at the webinar 'Tackling job market gaps with a skills-first approach' on 12 June 2024
Temple of Asclepius in Thrace. Excavation resultsKrassimira Luka
The temple and the sanctuary around were dedicated to Asklepios Zmidrenus. This name has been known since 1875 when an inscription dedicated to him was discovered in Rome. The inscription is dated in 227 AD and was left by soldiers originating from the city of Philippopolis (modern Plovdiv).
Leveraging Generative AI to Drive Nonprofit InnovationTechSoup
In this webinar, participants learned how to utilize Generative AI to streamline operations and elevate member engagement. Amazon Web Service experts provided a customer specific use cases and dived into low/no-code tools that are quick and easy to deploy through Amazon Web Service (AWS.)
This document provides an overview of wound healing, its functions, stages, mechanisms, factors affecting it, and complications.
A wound is a break in the integrity of the skin or tissues, which may be associated with disruption of the structure and function.
Healing is the body’s response to injury in an attempt to restore normal structure and functions.
Healing can occur in two ways: Regeneration and Repair
There are 4 phases of wound healing: hemostasis, inflammation, proliferation, and remodeling. This document also describes the mechanism of wound healing. Factors that affect healing include infection, uncontrolled diabetes, poor nutrition, age, anemia, the presence of foreign bodies, etc.
Complications of wound healing like infection, hyperpigmentation of scar, contractures, and keloid formation.
Communicating effectively and consistently with students can help them feel at ease during their learning experience and provide the instructor with a communication trail to track the course's progress. This workshop will take you through constructing an engaging course container to facilitate effective communication.
3. What Is The Commons
Nothing fundamentally new
Defined differently by different folks
A concept that will be stood up in conjunction with the
$32m associated with BD2K
A collection of physical compute and storage
resources – public/private/hybrid clouds, NPC,
institutional etc.
Agreement to abide by the principles of the
Commons
An agile environment that will be evaluated as small
steps are made
An environment subject to additional BD2K resources
(drawn from a total $650M) through 2020
4. What are the Commons Principles
Defined by the community!
Phase I (1-2 years)
– ACL’s but an agreement to share (clinical and basic data)
– Usage by NIH intramural and BD2K participants
– First applications defined
Phase II (2-4 years)
– Research object identifier scheme defined
– First phase of findability implemented
– Broader user base, API’s appearing
Phase III (3+ years)
– Value metrics in place
– Broader adoption
6. Immediate
Cloud providers and academia agreed to provide
resources to stand up the Commons
Accessible starting Nov 3, 2014 at the time of the first
BD2K Workshop
First shared content by end of 2014
7. Metrics of Success
Measurable cost savings
Evidence of discovery
Measurable evidence that the rate of that discovery
has accelerated
9. One View of the Commons
Data
The Long Tail
Core Facilities/HS Centers
Clinical /Patient
The Why:
Data Sharing Plans
The How:
NIH Knowledge
The
Government
Commons
Data
Discovery
Index
The End Game:
Scientific
Discovery
Usability
Quality
Security/
Privacy
Sustainable
Storage
Awardees
Private
Sector Metrics/
Standards
Rest of
Academia
Software Standards
Index
BD2K
Centers
Cloud, Research Objects,
Business Models
10. One View of the Commons
Data
The Long Tail
Core Facilities/HS Centers
Clinical /Patient
The Why:
Data Sharing Plans
The How:
NIH Knowledge
The
Government
Commons
Data
Discovery
Index
The End Game:
Scientific
Discovery
Usability
Quality
Security/
Privacy
Sustainable
Storage
Awardees
Private
Sector Metrics/
Standards
Rest of
Academia
Software Standards
Index
BD2K
Centers
Cloud, Research Objects,
Business Models