This document discusses content standards for datasets including formats, terminologies, and guidelines. It notes that content standards provide essential descriptors for interpreting, verifying, reproducing, and reusing datasets. Content standards fall into three main categories: minimum reporting requirements and checklists, controlled vocabularies and ontologies, and conceptual data models and exchange formats. There are many community-driven and organization-driven initiatives that have established various content standards, with over 1000 working groups, 220 terminologies, and 115 guidelines. Tracking how these standards develop and relate to each other and to databases, tools, and data policies is important.
Role of BioSharing in domain disciplinary ‘research data management protocols’ (RDMP): life science use case.
RDA IG Domain Repositories IG session on "Community-driven Research Data Management: Towards Domain Protocols for Research Data Management" https://www.rd-alliance.org/ig-domain-repositories-rda-9th-plenary-meeting
FAIRsharing - connecting standards, repositories and data policies across agr...Peter McQuilton
A 10 minute presentation on FAIRsharing, highlighting the manually curated metadata we provide on agri-related standards (ontologies, reporting guidelines, identifier schema, models and formats), databases (both knowledgebases and repositories) and data policies from funders and journal publishers. Presented at the RDA P14 meeting in Helsinki, Finland (October 2019).
This presentation was provided by Scott Warren and Anne Rauh of Syracuse University during the NISO virtual conference, Research Information Systems: The Connections Enabling Collaboration, held on August 16, 2017.
This presentation was provided by Jan Fransen of the University of Minnesota - Twin Cities during the NISO virtual conference, Research Information Systems: The Connections Enabling Collaboration, held on August 16, 2017.
This presentation was provided by Muhammad Javed of Cornell University during the NISO virtual conference, Research Information Systems: The Connections Enabling Collaboration, held on August 16, 2017.
This presentation was provided by Peggy Layne, Andi Ogier, and Ginny Pannabecker of Virginia Tech during the NISO virtual conference, Research Information Systems: The Connections Enabling Collaboration, held on August 16, 2017.
Role of BioSharing in domain disciplinary ‘research data management protocols’ (RDMP): life science use case.
RDA IG Domain Repositories IG session on "Community-driven Research Data Management: Towards Domain Protocols for Research Data Management" https://www.rd-alliance.org/ig-domain-repositories-rda-9th-plenary-meeting
FAIRsharing - connecting standards, repositories and data policies across agr...Peter McQuilton
A 10 minute presentation on FAIRsharing, highlighting the manually curated metadata we provide on agri-related standards (ontologies, reporting guidelines, identifier schema, models and formats), databases (both knowledgebases and repositories) and data policies from funders and journal publishers. Presented at the RDA P14 meeting in Helsinki, Finland (October 2019).
This presentation was provided by Scott Warren and Anne Rauh of Syracuse University during the NISO virtual conference, Research Information Systems: The Connections Enabling Collaboration, held on August 16, 2017.
This presentation was provided by Jan Fransen of the University of Minnesota - Twin Cities during the NISO virtual conference, Research Information Systems: The Connections Enabling Collaboration, held on August 16, 2017.
This presentation was provided by Muhammad Javed of Cornell University during the NISO virtual conference, Research Information Systems: The Connections Enabling Collaboration, held on August 16, 2017.
This presentation was provided by Peggy Layne, Andi Ogier, and Ginny Pannabecker of Virginia Tech during the NISO virtual conference, Research Information Systems: The Connections Enabling Collaboration, held on August 16, 2017.
A 10 minute presentation given at the RDA UK meeting in London (Jan 2019). This presentation covers FAIRsharing work as part of the RDA/Force11 FAIRsharing WG.
Transforming liaison roles for academic librarians is critical, as universities are moving to position themselves to meet the demands of a more competitive national research environment. At La Trobe University, librarians are repackaging current research support services to streamline and incorporate these more efficiently into the researcher’s life cycle, in order to support the University’s research initiatives
The Emergence of Research Information Management (RIM) within US LibrariesOCLC
Presented by Rebecca Bryant, Maliaca Oxnam, and Paolo Mangiafico, at the CNI Spring 2017 Membership Meeting, 3 April 2017, Albuquerque, New Mexico (USA).
Presentation at
CODESRIA-UNESCO –CLACSO Panel: Strengthening Scholarly Community Led open access publishing in the Global South
CODESRIA Conference on Electronic Publishing and Dissemination
CODESRIA-Council for the Development of Social Science Research in Africa
Dakar, Senegal, March 31st., 2016
Presentation at COAR-SPARC conference “Connecting research, bridging communities, opening scholarship. University of Porto, Portugal, April 15-16, 2015
https://www.coar-repositories.org/news-media/coar-sparc-conference-2015-connecting-research-results-bridging-communities-opening-scholarship/
Presentation at COAR-SPARC Conference “Connecting research, bridging communities, opening scholarship". University of Porto, Portugal, April 15-16, 2015
sparc.arl.org/events/joint-coar-sparc-conference
Research 3.0: Libraries, Scholarly Communications, and Research Services
Presented at Coalition for Networked Information (CNI)
April 4, 2016, San Antonio, Texas
Rebecca Bryant
Visiting Project Manager, Researcher Information Systems
University of Illinois at Urbana-Champaign
Beth Namachchivaya
Associate University Librarian
University of Illinois at Urbana-Champaign
The landscape of academic research has changed rapidly in the past decade, with access to high-performance networks, and the focus on data-intensive and interdisciplinary scholarship. Research libraries in North America are developing new services and programs aimed at meeting scholars’ needs for data-intensive, and interdisciplinary research support. Examples of some emerging programs include:
• Supporting digital research (graphical information systems, digital humanities, survey research methodologies, working with large datasets)
• Educating users about copyright and author rights
• Supporting content-creation and publishing activities in numerous ways: institutional repository to store and host works, establishing maker spaces, and developing infrastructure and workflows for more formal library-located publishing efforts
• Collaboration with research offices to educate researchers about federal mandates for open access publications and datasets
• Establishment of data management and archival resources
• Partnering with third-party vendors and with consortia to achieve scale-efficiencies and facilitate impact
• Development of researcher information management systems to support collaboration, discovery, and reporting
We present a case study of the development of a suite of new tools and services at the University of Illinois Library at Urbana-Champaign within its newly established Office of Research to support digital scholarship and to provide sustained and broad access to research. We will also discuss the significant challenges and opportunities of library/campus partnerships for cyberinfrastructure and research support.
My presentation at the http://neuroinformatics2017.org (Kuala Lumpur, Malaysia) on FAIR and FAIRsharing (previously BioSharing); metadata standards and their implementation by databases/repositories and adoption by journals' and funders' data policies.
A 10 minute presentation given at the RDA UK meeting in London (Jan 2019). This presentation covers FAIRsharing work as part of the RDA/Force11 FAIRsharing WG.
Transforming liaison roles for academic librarians is critical, as universities are moving to position themselves to meet the demands of a more competitive national research environment. At La Trobe University, librarians are repackaging current research support services to streamline and incorporate these more efficiently into the researcher’s life cycle, in order to support the University’s research initiatives
The Emergence of Research Information Management (RIM) within US LibrariesOCLC
Presented by Rebecca Bryant, Maliaca Oxnam, and Paolo Mangiafico, at the CNI Spring 2017 Membership Meeting, 3 April 2017, Albuquerque, New Mexico (USA).
Presentation at
CODESRIA-UNESCO –CLACSO Panel: Strengthening Scholarly Community Led open access publishing in the Global South
CODESRIA Conference on Electronic Publishing and Dissemination
CODESRIA-Council for the Development of Social Science Research in Africa
Dakar, Senegal, March 31st., 2016
Presentation at COAR-SPARC conference “Connecting research, bridging communities, opening scholarship. University of Porto, Portugal, April 15-16, 2015
https://www.coar-repositories.org/news-media/coar-sparc-conference-2015-connecting-research-results-bridging-communities-opening-scholarship/
Presentation at COAR-SPARC Conference “Connecting research, bridging communities, opening scholarship". University of Porto, Portugal, April 15-16, 2015
sparc.arl.org/events/joint-coar-sparc-conference
Research 3.0: Libraries, Scholarly Communications, and Research Services
Presented at Coalition for Networked Information (CNI)
April 4, 2016, San Antonio, Texas
Rebecca Bryant
Visiting Project Manager, Researcher Information Systems
University of Illinois at Urbana-Champaign
Beth Namachchivaya
Associate University Librarian
University of Illinois at Urbana-Champaign
The landscape of academic research has changed rapidly in the past decade, with access to high-performance networks, and the focus on data-intensive and interdisciplinary scholarship. Research libraries in North America are developing new services and programs aimed at meeting scholars’ needs for data-intensive, and interdisciplinary research support. Examples of some emerging programs include:
• Supporting digital research (graphical information systems, digital humanities, survey research methodologies, working with large datasets)
• Educating users about copyright and author rights
• Supporting content-creation and publishing activities in numerous ways: institutional repository to store and host works, establishing maker spaces, and developing infrastructure and workflows for more formal library-located publishing efforts
• Collaboration with research offices to educate researchers about federal mandates for open access publications and datasets
• Establishment of data management and archival resources
• Partnering with third-party vendors and with consortia to achieve scale-efficiencies and facilitate impact
• Development of researcher information management systems to support collaboration, discovery, and reporting
We present a case study of the development of a suite of new tools and services at the University of Illinois Library at Urbana-Champaign within its newly established Office of Research to support digital scholarship and to provide sustained and broad access to research. We will also discuss the significant challenges and opportunities of library/campus partnerships for cyberinfrastructure and research support.
My presentation at the http://neuroinformatics2017.org (Kuala Lumpur, Malaysia) on FAIR and FAIRsharing (previously BioSharing); metadata standards and their implementation by databases/repositories and adoption by journals' and funders' data policies.
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...Peter McQuilton
A 10 minute presentation given in Denver (CO) on the 15th September as part of the IG Elixir Bridging Force, WG Biosharing Registry,WG Data Type Registries,WG Metadata Standards Catalog joint session of the Research Data Alliance 8th Plenary (part of International Data Week).
This presentation covers the proliferation of data, databases, and data standards in biomedicine, and how BioSharing can help inform and educate users on this landscape and relationships between data, databases and data standards.
FAIRsharing presentation at the Japan Science and Technology AgencyPeter McQuilton
A 30 minute seminar presented at the National Bioscience Database Center, part of the Japanese Science and Technology Agency, based in Tokyo, Japan. This presentation covers the FAIR Principles, the aims, methodology and use of FAIRsharing, related projects such as Bioschemas, and international initiatives such as ELIXIR and EOSC.
FAIRsharing Keynote - International Workshop on Sharing, Citation and Publica...Peter McQuilton
A 30 minute presentation on FAIRsharing given at the International Workshop on Sharing, Citation and Publication of
Scientific Data across Disciplines in Tachikawa, Tokyo, Japan on Tuesday 5th December, 2017
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Susanna-Assunta Sansone
What to know when planning for your data management strategy and preparing a data management statement for a research proposal for BBSRC DTP first year students
Rafael Jimenez presents the FAIRSharing on behalf of Peter McQuilton, Susanna-Assunta Sansone & the FAIRsharing team | OSFair2017 Workshop
Workshop title: How FAIR friendly is your data catalogue?
Workshop overview:
This workshop will build upon the work planned by the EOSCpilot data interoperability task and the BlueBridge workshop held on April 3 at the RDA meeting. We will investigate common mechanisms for interoperation of data catalogues that preserve established community standards, norms and resources, while simplifying the process of being/becoming FAIR. Can we have a simple interoperability architecture based on a common set of metadata types? What are the minimum metadata requirements to expose FAIR data to EOSC services and EOSC users?
DAY 3 - PARALLEL SESSION 6 & 7
"Standards landscape" NIF Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013Susanna-Assunta Sansone
Overview of the landscape of standards in life sciences for the NIH BD2K
"Frameworks for Community-Based Standards Efforts" workshop
September 25, 2013 - September 26, 2013
Co-Chairs: Susanna Sansone, PhD and David Kennedy PhD.
The overall goal of this workshop is to learn what has worked and what has not worked in community-based standards efforts. Participants will have experience in leading specific community based standards initiatives. Prior to the workshop, participants will be asked to address in writing answers to specific questions regarding formulating, conducting, and maintaining such efforts. This information will be used to facilitate focused and actionable discussion at the workshop. Issuance of a Request for Information soliciting comment from the broader community on some of the key issues addressed in the workshop is currently envisioned.
Contact: BD2Kworkshops@mail.nih.gov
Agenda: Frameworks for Community-Based Standards Efforts (PDF 40.7KB)
Participant List: Roster of Invited Participants (PDF 32KB)
Forum (Join the discussion): http://frameworks.prophpbb.com
Watch Live: http://videocast.nih.gov/summary.asp?live=13088 - See more at: http://bd2k.nih.gov/workshops.html#cbse
INSERM Workshop 246 - Management and reuse of health data: methodological issues: https://ateliersinserm.dakini.fr/en/workshop.246.management.and.reuse.of.health.data.methodological.issues-66-22.php
Overview of metadata standards, and how FAIRsharing and the FAIR Cookbook help selecting and using them. Presentation to the What is metadata? Common standards and properties. EHP Workshop, November 9, 2022: https://ephconference.eu/pre-conference-programme-441
FAIR, community standards and data FAIRification: components and recipesSusanna-Assunta Sansone
Overview of FAIR, FAIRsharing and the FAIR Cookbook at the ATI event on Knowledge Graphs: https://github.com/turing-knowledge-graphs/meet-ups/blob/main/symposium-2022.md
Presentation to the EOSC workshop on policies (https://www.google.com/url?q=https://eoscfuture.eu/eventsfuture/monitoring-eosc-readiness-fair-data-policies) on what FAIRsharing does for policies, including providing registration, discovery, flexible and clearer descriptions, relationships, machine readability and comparability.
The role of FAIRsharing in assessing FAIRness of digital objects: we assist, not assess. The workshop brought together a number of FAIR evaluation tools to discuss and design common FAIR tests to ensure tools deliver consistet results. Our presentation illustrates how FAIRsharing's content helps and how FAIRsharing's service contributes. The work will contribute to the work of the EOSC FAIR Metrics Task Force.
Presentation to the EC Workshop on Maximizing investments in health research: FAIR data for a coordinate COVID-19 response. Workshop III, November 8, 2021.
Presentation to the EC Workshop on Maximizing investments in health research: FAIR data for a coordinate COVID-19 response. Workshop I, October 11, 2021.
The FAIR Cookbook poster, as presented at the ELIXIR-UK Node and the UK Conference of Bioinformatics and Computational Biology 2021: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-21
The FAIR Cookbook poster, as presented at the UK Conference of Bioinformatics and Computational Biology 2021: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-21
Breif overview of the FAIR Cookbook for the UK Conference of Bioinformatics and Computational Biology 2021: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-21
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...pchutichetpong
M Capital Group (“MCG”) expects to see demand and the changing evolution of supply, facilitated through institutional investment rotation out of offices and into work from home (“WFH”), while the ever-expanding need for data storage as global internet usage expands, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
The affect of service quality and online reviews on customer loyalty in the E...
BioSharing - EUDAT semantic workshop
1. n
connecting standards, databases and data policies
Susanna-Assunta Sansone
Associate Director
Oxford e-Research Centre, University of Oxford
2. • Domain-level descriptors that are essential for interpretation, verification,
reproducibility and reusability of datasets
• The depth and breadth of descriptors vary according to the domain broadly
covering the what, who, when, how and why
Content standards
4. Minimum information reporting
requirements, checklists
o Report the same core, essential
information
o e.g. MIAME guidelines
Controlled vocabularies, taxonomies, thesauri, ontologies etc.
o Unambiguous identification and definition of concepts
o e.g. Gene Ontology
Conceptual model, schema,
exchange formats etc
o Define the structure and
interrelation of information,
and the transmission format
o e.g. FASTA Formats Terminologies Guidelines
Content standards: three categories
6. 883 -> ~1000
220+
115+
548
source source
source
Content standards in numbers
Formats Terminologies Guidelines
MIAME
MIRIAM
MIQASMIX
MIGEN
ARRIVE
MIAPE
MIASE
MIQE
MISFISHIE….
REMARK
CONSORT
SRAxml
SOFT FASTA
DICOM
MzML
SBRML
SEDML…
GELML
ISA
CML
MITAB
AAO
CHEBIOBI
PATO ENVO
MOD
BTO
IDO…
TEDDY
PRO
XAO
DO
VO
MIAPPE
Sample-Tab
7. Content standards
Data policies by
funders, journals and
other organizations
Databases, tools
and services
Formats Terminologies Guidelines
Mapping this evolving landscape
8. Content standards
Data policies by
funders, journals and
other organizations
Databases, tools
and services
Formats Terminologies Guidelines
a resource of the ELIXIR Interoperability Platform
• A web-based, curated and searchable portal that monitors their
development and evolution to inform and educate
9.
10.
11.
12. Not just quantity but quality:
rich, curated and community
vetted descriptions
13. Indicators to describe the status of standards and databases
Ready for use, implementation, or recommendation
In development
Status uncertain
Deprecated as subsumed or superseded
Manually curated and verified
by the community behind each resource
16. …to inform and educate on
existing and new resources
Data Policy
17. Working with/for the community and our ‘adopters’, e.g.:
Standard developing groups:Journal, publishers:
Cross-links, data exchange:
Societies and organisations: Institutional RDM services:
Projects, programmes:
533
responders
18. Progressively cross-linking with other ELIXIR resources
Cross-links, data exchange:
Societies and organisations:
Standard developing groups:Journal, publishers:
Institutional RDM services:
Projects, programmes:
19. • Increase discoverability (e.g. by search engines), aggregation (e.g. by indices)
and analysis of content in different websites and services
• use of schema.org structured semantic markup (for web pages’ content) by Google, Bing,
Yahoo, Yandex
• coordinate its extension, where needed, in the life science area
Gaining traction and
support by: