Overview of FAIRsharing in 5min; CODATA event on metadata, pre-RDA Plenary, Philadelphia 2019
https://conference.codata.org/Drexel_CODATA_2019/programme/
FAIRsharing and DataCite: Data Repository Selection - Criteria That Matter | Susanna-Assunta Sansone
Through a collaboration with DataCite, FAIRsharing is working with a number of journal publishers (PLOS, Springer Nature, F1000, Wiley, Taylor and Francis, Elsevier, EMBO Press, eLife, GigaScience and Cambridge University Press) to identify a common set of criteria for selecting and recommending data repositories (and associated standards) that will be implemented in FAIRsharing. Details of this work and its participants are at https://osf.io/m2bce
BioSharing, an ELIXIR Interoperability Platform resource | Peter McQuilton
A 20 minute presentation given at the 9th RDA Plenary in Barcelona as part of the BioSharing WG - ELIXIR Bridging Force IG session. This presentation covers the basics of what BioSharing is, who it's for, and how it captures and connects information on data standards, databases and data policies from the life, biomedical and environmental sciences.
RDA Webinar - BioSharing - mapping the landscape of data standards, repositor... | Peter McQuilton
A 30 minute webinar presented on behalf of the RDA/Force11 BioSharing WG, covering our work to map data standards, databases, and data policies in the life, biomedical and environmental sciences.
A 10 minute presentation given at the RDA UK meeting in London (Jan 2019). This presentation covers FAIRsharing work as part of the RDA/Force11 FAIRsharing WG.
RDA Plenary 9 BioSharing WG output/recommendation | Peter McQuilton
A 10 minute talk given at the RDA Plenary 9 meeting in Barcelona, April 2017. This talk covers the work performed over the past 18 months in the joint RDA/Force11 WG. This WG has two main outputs, a set of guidelines for how one can link data policies, databases and data standards (in the life sciences); and the BioSharing registry (building upon the prototype).
RDA Data Innovation Forum: FAIRsharing.org, an output of the joint RDA/Force ... | Peter McQuilton
A 15 minute presentation at the RDA Data Innovation Forum in Brussels on the 20th January. This presentation covers the RDA/Force11 WG and FAIRsharing, mapping the landscape of data standards, databases and data policies.
A 30 minute presentation given as a webinar as part of the ELIXIR series (https://www.elixir-europe.org/events/webinars/previous). This presentation covers the history of BioSharing, what it covers (data standards, databases and data policies), and our community collaborations and data sharing.
RDA BioSharing WG/ELIXIR Session Montreal 2017 | Peter McQuilton
A 15 minute presentation giving an introduction to FAIRsharing, an ELIXIR Interoperability Platform resource of curated and linked information on standards, databases and policies.
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli... | Peter McQuilton
A 10 minute presentation given in Denver (CO) on the 15th September as part of the IG Elixir Bridging Force, WG Biosharing Registry, WG Data Type Registries, WG Metadata Standards Catalog joint session of the Research Data Alliance 8th Plenary (part of International Data Week).
This presentation covers the proliferation of data, databases, and data standards in biomedicine, and how BioSharing can help inform and educate users on this landscape and relationships between data, databases and data standards.
FAIRsharing - Mapping the Landscape of Databases, Repositories, Standards and... | Peter McQuilton
A 15 minute slide set presented at two workshops at #biocuration2019; the first on ontologies and FAIRification, the second to map the landscape of biocuration.
2021 04 Introduction to FAIRsharing - cineca | Allyson Lister
Part of the “How FAIR are you” webinar series and hackathon, which aim to increase and facilitate the uptake of FAIR approaches in software, training materials and cohort data, in support of responsible and ethical data and resource sharing and the implementation of federated applications for data analysis.
More information at
* the webinar page: https://www.cineca-project.eu/news-events-all/how-fair-are-you-hackathon
* the recording of the talk: https://www.youtube.com/watch?v=UdGZOynyuGo
A 2-page leaflet detailing the life science database, standard, and policy registries in BioSharing, and the ability to make a Collection of these resources.
"Standards landscape" NIH Big Data 2 Knowledge (BD2K) Initiative, Sep, 2013 | Susanna-Assunta Sansone
Overview of the landscape of standards in life sciences for the NIH BD2K
"Frameworks for Community-Based Standards Efforts" workshop
September 25, 2013 - September 26, 2013
Co-Chairs: Susanna Sansone, PhD and David Kennedy, PhD.
The overall goal of this workshop is to learn what has worked and what has not worked in community-based standards efforts. Participants will have experience in leading specific community-based standards initiatives. Prior to the workshop, participants will be asked to answer in writing specific questions about formulating, conducting, and maintaining such efforts. This information will be used to facilitate focused and actionable discussion at the workshop. Issuance of a Request for Information soliciting comment from the broader community on some of the key issues addressed in the workshop is currently envisioned.
Contact: BD2Kworkshops@mail.nih.gov
Agenda: Frameworks for Community-Based Standards Efforts (PDF 40.7KB)
Participant List: Roster of Invited Participants (PDF 32KB)
Forum (Join the discussion): http://frameworks.prophpbb.com
Watch Live: http://videocast.nih.gov/summary.asp?live=13088 - See more at: http://bd2k.nih.gov/workshops.html#cbse
David Van Enckevort - FAIR sample and data access DataSciSIG
David van Enckevort from the University of Groningen describes FAIR Sample and Data Access in Biobanking and Biorepositories.
This talk was sponsored by the NIH Data Science Special Interest Group and part of a webinar panel on June 23, 2017 on Global Biobanking and Access to Specimens.
Presented at http://mcbios-maqc.org. The FAIR Principles have propelled the global debate, in all disciplines, about better RDM and transparent, reproducible data. FAIR has de facto become a global norm for good RDM and a prerequisite for data science since the principles were endorsed by global and intergovernmental leaders. Funding bodies are consolidating FAIR into their funding agreements; publishers have united behind FAIR as a way to remain at the forefront of open research; and in the private sector FAIR is adopted and enshrined in policy in major biopharmas, libraries, and unions. FAIR is changing the culture of data science, but work is needed to turn the principles into reality. I will use the work of the FAIRplus project as an exemplar to illustrate challenges and progress.
The FAIR Cookbook poster, as presented at the ELIXIR-UK Node and the UK Conference of Bioinformatics and Computational Biology 2021: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-21
Presentation to the EC Workshop on Maximizing investments in health research: FAIR data for a coordinated COVID-19 response. Workshop I, October 11, 2021.
Presentation to the EOSC workshop on policies (https://eoscfuture.eu/eventsfuture/monitoring-eosc-readiness-fair-data-policies) on what FAIRsharing does for policies, including providing registration, discovery, flexible and clearer descriptions, relationships, machine readability and comparability.
A presentation on FAIR, FAIRsharing and the FAIR ecosystem for the ENVRI-FAIR community on the 13th December 2019. This presentation covers the basics of what FAIR is, how FAIRsharing can help 'FAIRify' standards, repositories, knowledgebases and data policies, and then the connections FAIRsharing has with other initiatives, such as the FAIR Evaluator, Data Stewardship Wizard, our RDA WG, GO-FAIR and EOSC-Life.
Presented by Rodrigo Sara, from the CGIAR System Management Office at the 5th Webinar held by the CGIAR Gender and Agriculture Research Network on September 29, 2016.
Overview of FAIR and the IMI FAIRplus project at the UK Conference of Bioinformatics and Computational Biology 2020: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-2020
Cross-linked metadata standards, repositories and the data policies - The Bio... | Peter McQuilton
A 20 minute presentation given in Denver (CO) on the 17th September as part of the Biosharing Registry WG, Metadata Standards Catalog WG, and Publishing Data Workflows WG joint session at the Research Data Alliance 8th Plenary (part of International Data Week).
This presentation covers the explosion of metadata standards and databases in the life, biomedical and environmental sciences and how BioSharing is helping to understand this landscape, both in terms of the relationship between standards and other standards and databases, and the life cycle and evolution of each resource. BioSharing also links these resources to the data policies that recommend them (for example, from funding agencies or journal publishers), enabling an understanding of the entire data cycle, from conception to publishing and storage.
A 15-minute presentation to the SCDS IUPAC Workshop in Amsterdam on 16-17 July 2018. This presentation also introduces the current state of chemistry-related standards, databases and data policies in FAIRsharing (all included in a Collection in FAIRsharing), and an outline of the workshop conducted at the meeting.
FAIRsharing consists of three registries: data standards, databases and data policies. This short talk focuses on the FAIRsharing data policy registry, and how including your institutional, funder, publisher, journal, society or project policy in FAIRsharing can improve its findability and machine readability.
This 15-minute presentation covers work from the FAIRsharing WG, including FAIRsharing.org, one of our RDA-endorsed outputs, and our work with journal publishers and DataCite to define Repository Selection Criteria for journal and journal publisher data policies.
LITA’s Altmetrics and Digital Analytics Interest Group is proud to present Heather Coates, Richard Naples, and Lauren Collister in our second free webinar of the season. Heather will introduce the concept of altmetrics with a quick "Altmetrics 101," Richard will discuss the Smithsonian's implementation of Altmetric, and Lauren will share the University of Pittsburgh's experience with Plum Analytics.
A 10 minute presentation for the virtual ELIXIR All Hands Meeting 2020 - FAIRification mini symposium. In this presentation I talk about some of the community work we do in FAIRsharing, from sharing our metadata with other resources to research on data policy repository criteria.
FAIRsharing: curation and governance of an ecosystem of research standards an... | Allyson Lister
FAIRsharing is an informative and educational resource on interlinked standards (including terminologies), databases and policies, three key elements of the FAIR ecosystem. FAIRsharing is adopted by funders, publishers and communities across all research disciplines. It promotes the existence and value of these resources to aid data sharing and consequently requires a high standard of curation to ensure accurate and timely information is provided across all of our stakeholder groups. Here I discuss the methods employed and challenges faced during curation and maintenance of existing content, as well as the introduction of new features. I will cover how we store machine- and human-accessible metadata, including governance information, and the methods we use to determine what common metadata we should describe. I also will discuss the benefits of both in-house curation and community-driven curation by our stakeholder groups.
Using community-defined metadata standards in the FAIR principles: how BioSha... | Peter McQuilton
A 10 minute presentation given in Denver (CO) on the 16th September as part of the IG Elixir Bridging Force and Biosharing Registry WG joint session at the Research Data Alliance 8th Plenary (part of International Data Week).
This presentation covers the use of community-defined metadata standards in the life sciences, making these standards FAIR, and how BioSharing can help.
Brief summary for the INCF Neuroscience Assembly (https://neuroinformatics.incf.org/2021/program-week-2) of the two sessions run at the 17th RDA Plenary, to which the FAIRsharing WG has contributed.
Building data networks: exploring trust and interoperability between authoris... | Repository Fringe
Building data networks: exploring trust and interoperability between authors, repositories and journals. Varsha Khodiyar, Scientific Data; Neil Chue Hong, Journal of Open Research Software; Rachael Kotarski, DataCite; Peter McQuilton, BioSharing; Reza Salek, MetaboLights. At Repository Fringe 2015.
Overview of metadata standards, and how FAIRsharing and the FAIR Cookbook help in selecting and using them. Presentation to the "What is metadata? Common standards and properties" EPH Workshop, November 9, 2022: https://ephconference.eu/pre-conference-programme-441
FAIR, community standards and data FAIRification: components and recipes | Susanna-Assunta Sansone
Overview of FAIR, FAIRsharing and the FAIR Cookbook at the ATI event on Knowledge Graphs: https://github.com/turing-knowledge-graphs/meet-ups/blob/main/symposium-2022.md
The role of FAIRsharing in assessing FAIRness of digital objects: we assist, not assess. The workshop brought together a number of FAIR evaluation tools to discuss and design common FAIR tests to ensure tools deliver consistent results. Our presentation illustrates how FAIRsharing's content helps and how FAIRsharing's service contributes. The work will contribute to the work of the EOSC FAIR Metrics Task Force.
Presentation to the EC Workshop on Maximizing investments in health research: FAIR data for a coordinated COVID-19 response. Workshop III, November 8, 2021.
The FAIR Cookbook poster, as presented at the UK Conference of Bioinformatics and Computational Biology 2021: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-21
Brief overview of the FAIR Cookbook for the UK Conference of Bioinformatics and Computational Biology 2021: https://www.earlham.ac.uk/uk-conference-bioinformatics-and-computational-biology-21
Brief introduction to FAIRsharing work with industry (publishers, pharmas) and the FAIR Cookbook (for the Life Sciences): https://www.opensciencefair.eu/2021/workshops/applying-fair-principles-to-open-science-and-industry-to-drive-innovation-challenges-and-opportunities
Overview of the role of FAIRsharing and a dedicated Collection of data resources (platforms and registries that collect, harmonize, and share participant-level clinical-epidemiological, OMICs, and/or imaging data) for the COVID-19 Clinical Research Coalition and The Tropical Disease Research initiatives: https://coronavirus.tghn.org/research-resources/data-sharing-covid-19
2. Community paper with 69 authors out April 2nd in Nature Biotech as OA CC-BY: doi.org/10.1101/245183
• A registry of inter-linked (meta)data standards, repositories and policies
• A set of tools and services to discover and visualize these resources
• A collaboratory with activities on FAIR metrics, maturity models and guidance
FAIRsharing is one of the few recommended resources in these funder-driven policies and reports
3. Our mission is to increase:
• guidance to consumers of data standards, repositories, and policies, to accelerate the discovery, selection and use of these resources;
• producers' satisfaction in terms of resource visibility, adoption and citation
4. Databases and data repositories
Community standards, focusing on metadata and identifier schemas
Formats, Terminologies, Guidelines
Data policies by funders, journals and other organizations
Identifiers
Ready for use, implementation, or recommendation
In development
Status uncertain
Deprecated as subsumed or superseded
All records are manually curated in-house, verified and claimed by the community behind each resource
Describe and interlink standards, repositories and data policies