In this presentation I present the plan to make rare disease data resources findable, accessible, interoperable, and reusable for humans and computers (FAIR). The presentation was made for the IRDiRC conference 2017 in Paris.
In this tutorial we explain the basics of a 'Linked Data and Ontology' approach for combining data, in particular for the study of rare diseases. The approach is motivated by a case study provided by health care researcher Ulrike Braisch. The main take home lesson is that with this approach the effort for data integration can be substantially lowered, i.e. lead to a shorter path to new treatments for (rare) diseases.
The presentation is based on a tutorial given at the RD-Connect/Neuromics/Euronomics plenary meeting in Heidelberg, Germany, February 26, 2014. It was made possible by RD-Connect, a European project to support Rare Disease research (http://www.rd-connect.eu).
Interpreting Complex Real World Data for Pharmaceutical ResearchPaul Agapow
This document discusses using real world data (RWD) for pharmaceutical research and development. It notes that while RWD is attractive due to its scale and realism, it is also complex and difficult to interpret. The document proposes several approaches for analyzing RWD, including using machine learning on graphical representations of patient data, analyzing temporal trajectories, integrating multiple 'omics data sources, and generating hypotheses rather than attempting to definitively model patient populations. It concludes that more work is needed to build larger, more diverse real world datasets and address challenges around privacy, methods validation, and scaling analysis techniques.
This document discusses patient summaries in emergency departments. It describes the high cognitive demands placed on emergency department staff and high diagnostic error rates. International and European guidelines for patient summaries are outlined, defining them as a minimum set of clinical data for healthcare coordination. The document argues that patient summaries could help navigate health systems and unlock health data, but challenges include data sharing between different health record systems and ensuring privacy and trust.
CINECA webinar slides: Open science through fair health data networks dream o...CINECAProject
Since the FAIR data principles were published in 2016, many organizations including science funders and governments have adopted these principles to promote and foster true open science collaborations. However, to define a vision and create a video of a Personal Health Train that leverages worldwide FAIR health data in a federated manner is one step. To actually make this happen at scale and be able to show new scientific and medical insights for it is quite another!
In this webinar, we will dive into the basics of FAIR health data, but also take stock of the current situation in health data networks: after a year of frantic research and collaborations and many open datasets and hackathons on COVID-19, has the situation actually improved? Are we sharing health data on a global scale to improve medical practice, or is quality medical data still only accessible to researchers with the right credentials and deep pockets?
This webinar is part of the “How FAIR are you” webinar series and hackathon, which aim at increasing and facilitating the uptake of FAIR approaches into software, training materials and cohort data, to facilitate responsible and ethical data and resource sharing and implementation of federated applications for data analysis.
The CINECA webinar series aims to discuss ways to address common challenges and share best practices in the field of cohort data analysis, as well as distribute CINECA project results. All CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions. Please note that all webinars are recorded and available for posterior viewing. CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions.
This webinar took place on 21st January 2021 and is part of the CINECA webinar series.
For previous and upcoming CINECA webinars see:
https://www.cineca-project.eu/webinars
Why collect and use health data? Professor Peter Bradley, Director of Knowl...NHS England
Professor Bradley outlines the importance of population based studies, the development of data science and what is needed for the efficient use of data.
The document discusses healthcare informatics and big data in healthcare. It provides an introduction to healthcare informatics, the advantages and disciplines involved. It then discusses big data in healthcare, including the sources and types of healthcare data, challenges in big data analytics, and conceptual architectures. Tools for big data analytics are also outlined, including Hadoop, Pig, Hive and others. Finally, it provides an example case study of a systematic review on the effectiveness of mobile health technology interventions.
This document summarizes a big data event held in Leeds, UK called Leeds Data Thing. It provides information on the organizers, speakers, and goals of the event which were to explore open data, learn from other industries and each other, and highlight good work. The event attracted people from various backgrounds interested in working with data. Speakers discussed topics like open data mapping and analyzing demographic data of Leeds. Feedback was gathered on how future events could be improved, such as providing experts to help teams with data analysis. The organizers aim to continue holding Leeds Data Thing events regularly.
Open science and medical evidence generation - Kees van Bochove - The HyveKees van Bochove
Presentation about open science, the FAIR principles, and medical evidence generation with the OHDSI COVID-19 study-a-thon as an example. I've used variations on this deck in a couple of classroom and online courses for PhD and master students early 2020.
In this tutorial we explain the basics of a 'Linked Data and Ontology' approach for combining data, in particular for the study of rare diseases. The approach is motivated by a case study provided by health care researcher Ulrike Braisch. The main take home lesson is that with this approach the effort for data integration can be substantially lowered, i.e. lead to a shorter path to new treatments for (rare) diseases.
The presentation is based on a tutorial given at the RD-Connect/Neuromics/Euronomics plenary meeting in Heidelberg, Germany, February 26, 2014. It was made possible by RD-Connect, a European project to support Rare Disease research (http://www.rd-connect.eu).
Interpreting Complex Real World Data for Pharmaceutical ResearchPaul Agapow
This document discusses using real world data (RWD) for pharmaceutical research and development. It notes that while RWD is attractive due to its scale and realism, it is also complex and difficult to interpret. The document proposes several approaches for analyzing RWD, including using machine learning on graphical representations of patient data, analyzing temporal trajectories, integrating multiple 'omics data sources, and generating hypotheses rather than attempting to definitively model patient populations. It concludes that more work is needed to build larger, more diverse real world datasets and address challenges around privacy, methods validation, and scaling analysis techniques.
This document discusses patient summaries in emergency departments. It describes the high cognitive demands placed on emergency department staff and high diagnostic error rates. International and European guidelines for patient summaries are outlined, defining them as a minimum set of clinical data for healthcare coordination. The document argues that patient summaries could help navigate health systems and unlock health data, but challenges include data sharing between different health record systems and ensuring privacy and trust.
CINECA webinar slides: Open science through fair health data networks dream o...CINECAProject
Since the FAIR data principles were published in 2016, many organizations including science funders and governments have adopted these principles to promote and foster true open science collaborations. However, to define a vision and create a video of a Personal Health Train that leverages worldwide FAIR health data in a federated manner is one step. To actually make this happen at scale and be able to show new scientific and medical insights for it is quite another!
In this webinar, we will dive into the basics of FAIR health data, but also take stock of the current situation in health data networks: after a year of frantic research and collaborations and many open datasets and hackathons on COVID-19, has the situation actually improved? Are we sharing health data on a global scale to improve medical practice, or is quality medical data still only accessible to researchers with the right credentials and deep pockets?
This webinar is part of the “How FAIR are you” webinar series and hackathon, which aim at increasing and facilitating the uptake of FAIR approaches into software, training materials and cohort data, to facilitate responsible and ethical data and resource sharing and implementation of federated applications for data analysis.
The CINECA webinar series aims to discuss ways to address common challenges and share best practices in the field of cohort data analysis, as well as distribute CINECA project results. All CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions. Please note that all webinars are recorded and available for posterior viewing. CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions.
This webinar took place on 21st January 2021 and is part of the CINECA webinar series.
For previous and upcoming CINECA webinars see:
https://www.cineca-project.eu/webinars
Why collect and use health data? Professor Peter Bradley, Director of Knowl...NHS England
Professor Bradley outlines the importance of population based studies, the development of data science and what is needed for the efficient use of data.
The document discusses healthcare informatics and big data in healthcare. It provides an introduction to healthcare informatics, the advantages and disciplines involved. It then discusses big data in healthcare, including the sources and types of healthcare data, challenges in big data analytics, and conceptual architectures. Tools for big data analytics are also outlined, including Hadoop, Pig, Hive and others. Finally, it provides an example case study of a systematic review on the effectiveness of mobile health technology interventions.
This document summarizes a big data event held in Leeds, UK called Leeds Data Thing. It provides information on the organizers, speakers, and goals of the event which were to explore open data, learn from other industries and each other, and highlight good work. The event attracted people from various backgrounds interested in working with data. Speakers discussed topics like open data mapping and analyzing demographic data of Leeds. Feedback was gathered on how future events could be improved, such as providing experts to help teams with data analysis. The organizers aim to continue holding Leeds Data Thing events regularly.
Open science and medical evidence generation - Kees van Bochove - The HyveKees van Bochove
Presentation about open science, the FAIR principles, and medical evidence generation with the OHDSI COVID-19 study-a-thon as an example. I've used variations on this deck in a couple of classroom and online courses for PhD and master students early 2020.
IMS Health Workshop World Orphan Drug CongressIMSHealthRWES
This document provides an agenda for a workshop titled "Using real-world data to find undiagnosed patients with rare diseases" hosted by IMS Health at the World Orphan Drug Congress from April 20-22 in Washington D.C. The workshop will explore how real-world data can help address underdiagnosis of rare diseases. Speakers will discuss using large-scale real-world data to transform understanding of rare diseases, and present case studies of companies leveraging real-world data. The workshop will also cover using predictive modeling of real-world data to identify undiagnosed rare disease patients and new engagement models with healthcare providers and payers to increase rare disease treatments.
Data Literacy -- Necessity and challengesSrdjan Verbić
Technology can't help us in understanding data. Who needs the most data literacy competences: policy makers, journalists, doctors, patients or civil sector activists?
Federated Learning (FL) is a learning paradigm that enables collaborative learning without centralizing datasets. In this webinar, NVIDIA present the concept of FL and discuss how it can help overcome some of the barriers seen in the development of AI-based solutions for pharma, genomics and healthcare. Following the presentation, the panel debate on other elements that could drive the adoption of digital approaches more widely and help answer currently intractable science and business questions.
The document discusses Illumina's role in advancing precision medicine through next-generation sequencing and data analytics. It notes that while sequencing costs have decreased dramatically, challenges remain in interpreting, integrating, and analyzing the large volumes of genomic and other healthcare data. Illumina aims to develop comprehensive, patient-centric analytics platforms and knowledgebases to help address these challenges and enable more effective prevention, diagnosis, and treatment based on a patient's genetics, environment, and lifestyle. The success of these efforts will be measured by improvements in patient outcomes, healthcare costs and efficiencies, and changes in clinical practice guided by integrated genomic and clinical data analysis.
Presentation of the PICASO Project at WHINN Conference, October 2016PicasoProject
Presentation of PICASO in the session on integrating health and social care by Jesper Thestrup from partners In-JeT ApS
WHINN: Week of Health and INNovation, October 2016
Mental Health Informatics - What we can learn from the past and where we can beHimanshu Tyagi
This document discusses how mental health informatics has evolved over the past 20 years and the barriers to further advancement. It notes that while technology has accelerated, healthcare has lagged in adopting new technologies. Barriers include unrealistic expectations of technology's capabilities, challenges around customizing solutions at scale, and data explosion outpacing ability to analyze information. The author advocates learning from other industries that develop customizable solutions and integrate diverse data sources. Clinical leadership is needed to help overcome barriers and advance mental health informatics.
Digital Enlightment Forum: Towards a European ecosystem for health care data
Presentation of eStandards/Trillium II at the workshop of the Digital Enlightment Forum
Trusted! Quest for data-driven and fair health solutions Sitra / Hyvinvointi
An inspiring online event on 3 February 2021. We are discussing the future of data-driven health solutions that focus on fairness for all stakeholders: people, business and the public sector. We are asking questions such as: What is fairness in health? What role does trust play in data-driven health services? What needs to change and who needs to act? Most of all, we are launching “The Fair Health Data Challenge“.
Event speakers:
- Jaana Sinipuro, Project Director, IHAN – Human-driven data economy, Sitra
- Dipak Kalra, President, The European Institute for Innovation through Health Data (i~HD)
- Pekka Kahri, Technology Officer, HUS Helsinki University Hospital
- Markus Kalliola, Project Director, Health data 2030, Sitra
- Tiina Härkönen, Leading Specialist, Sitra
20191203 DOE Data Driven Healthcare- Expert EventDayOne
This document summarizes a presentation on data-driven healthcare given at the DayOne Experts conference in Basel on December 3rd, 2019. The presentation discusses whether the healthcare system is ready for data-driven approaches and what is needed to enable new data-driven health solutions. It notes that while digitalization in Swiss medical practices is increasing, coordination between different parts of the treatment chain remains a challenge. The Swiss Personalized Health Network aims to connect different health data sources and establish national infrastructure to enable interoperable exchange of health data for research purposes. University hospitals are working to structure, standardize and integrate different types of clinical and research data according to SPHN requirements.
This presentation outlines a mechanism for using the power of "Big Data", social networking and technology infrastructure to speed the process of curing a horrible disease.
The Dementia Intelligence Network (DIN) provides data tools and reports to help commissioners understand dementia prevalence and care in their local areas. The DIN's online Dementia Profile tool includes indicators across the dementia care pathway to assess needs, service usage, and outcomes. Recent updates include additional risk factor data and future plans involve expanding metrics on prevention, comorbidities, and health economics. The DIN aims to help local decision-makers improve dementia commissioning and care.
The IMI EHDEN project: large-scale analysis of observation data in Europe - C...Maxim Moinat
This document summarizes presentations from the CDISC 2020 Europe Interchange conference on April 1-2, 2020. It introduces Maxim Moinat who works on converting healthcare data to the OMOP Common Data Model. It also introduces Nigel Hughes who is the project lead for the IMI2 European Health Data & Evidence Network (EHDEN) and was previously involved in other real-world data projects. The document provides an agenda for the conference sessions which will discuss EHDEN, OHDSI, a EHDEN study-a-thon, using real-world data in clinical trials and for regulatory purposes.
ICG-11 - genomic data projects around the world - nov 5 2016Fiona Nielsen
How to find data for your research
Presented by Fiona Nielsen at the International Conference of Genomics 2016 www.icg-11.org in the session Data Sharing and Analysis chaired by Laurie Goodman, editor-in-chief, GigaScience
New Health Data Deluges Require Secure Information Flow Enablement Via Standa...Dana Gardner
Transcript of a BriefingsDirect podcast on how new devices and practices have the potential to expand the information available to healthcare providers and facilities.
This document discusses the need to talk about how health data is used. It notes that people currently have little understanding of how their data is used by various organizations like the NHS, academics, and commercial groups. The document outlines resources and strategies to help improve public understanding of issues like how data is kept safe, whether it is identifiable, what choices people have, and the benefits of data use. It emphasizes the importance of language and transparency to build public trust and confidence in the important uses of health data for individual care, research, and improving health services.
A look at the key trends and challenges in applying Big Data to transform healthcare by supporting research, self care, providers and building ecosystems. Purchase the report here: https://gumroad.com/l/PlXP
The document discusses patient summaries in emergency departments. It describes how patient summaries can provide key medical information to help emergency department clinicians make timely treatment decisions. However, emergency clinicians currently face barriers to accessing complete and trusted patient summary data. Standards development aims to address issues of interoperability and data sharing so that patient summaries can better support clinical decision-making in emergency care situations.
The document discusses patient summaries in emergency departments and their potential to improve care. It describes how patient summaries can provide key medical information to clinicians in emergency situations. However, effective use of patient summaries faces challenges around standards, access to data across organizations, and clinician trust in external medical records. Overcoming these challenges could help unlock the power of health data to support clinical decision making in emergency care.
Presentation done at CBEB'16 the 17th October by Vicente Traver introducing EIP-AHA and H2020 policies, the LINK project and different opportunities to cooperate together between Brazil and EU about telehealth and personal health
Slides as given for the i2b2 conference in October 2019. https://transmartfoundation.org/tubingen-symposium-2019-agenda/
Observational Health Data Sciences and Informatics (OHDSI) is a multi-stakeholder,
interdisciplinary, international collaborative with a mission to improve health by empowering
a community to collaboratively generate the evidence that promotes better health decisions
and better care. With 200 researchers from 25 countries and half a billion unique patients,
OHDSI carries out federated studies at sufficient scale to answer questions about diagnosis
and treatment. At the heart of the OHDSI platform is the OMOP Common Data Model,
currently at v6, around which a toolset is built for carrying out reusable, repeatable and
reproducible observational clinical research on a large scale.
This document discusses different levels of semantics that can be used when making assertions in nanopublications. Weaker semantics include minted URIs which are machine readable but not machine interpretable. Stronger semantics involve linking concepts to existing ontologies to make assertions more machine interpretable. The document outlines approaches ranging from weakest to strongest semantics, noting tradeoffs between interpretability and difficulty.
Slides for the Technology Track of ISMB/ECCB 2013 in Berlin on digital publishing, highlighting the Research Object model, Nanopublications, and ISA as a means to capture methods and results when research is carried out digitally. This work was supported by the EU workflow forever project (http://wf4ever-project.org).
More Related Content
Similar to Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation
IMS Health Workshop World Orphan Drug CongressIMSHealthRWES
This document provides an agenda for a workshop titled "Using real-world data to find undiagnosed patients with rare diseases" hosted by IMS Health at the World Orphan Drug Congress from April 20-22 in Washington D.C. The workshop will explore how real-world data can help address underdiagnosis of rare diseases. Speakers will discuss using large-scale real-world data to transform understanding of rare diseases, and present case studies of companies leveraging real-world data. The workshop will also cover using predictive modeling of real-world data to identify undiagnosed rare disease patients and new engagement models with healthcare providers and payers to increase rare disease treatments.
Data Literacy -- Necessity and challengesSrdjan Verbić
Technology can't help us in understanding data. Who needs the most data literacy competences: policy makers, journalists, doctors, patients or civil sector activists?
Federated Learning (FL) is a learning paradigm that enables collaborative learning without centralizing datasets. In this webinar, NVIDIA present the concept of FL and discuss how it can help overcome some of the barriers seen in the development of AI-based solutions for pharma, genomics and healthcare. Following the presentation, the panel debate on other elements that could drive the adoption of digital approaches more widely and help answer currently intractable science and business questions.
The document discusses Illumina's role in advancing precision medicine through next-generation sequencing and data analytics. It notes that while sequencing costs have decreased dramatically, challenges remain in interpreting, integrating, and analyzing the large volumes of genomic and other healthcare data. Illumina aims to develop comprehensive, patient-centric analytics platforms and knowledgebases to help address these challenges and enable more effective prevention, diagnosis, and treatment based on a patient's genetics, environment, and lifestyle. The success of these efforts will be measured by improvements in patient outcomes, healthcare costs and efficiencies, and changes in clinical practice guided by integrated genomic and clinical data analysis.
Presentation of the PICASO Project at WHINN Conference, October 2016PicasoProject
Presentation of PICASO in the session on integrating health and social care by Jesper Thestrup from partners In-JeT ApS
WHINN: Week of Health and INNovation, October 2016
Mental Health Informatics - What we can learn from the past and where we can beHimanshu Tyagi
This document discusses how mental health informatics has evolved over the past 20 years and the barriers to further advancement. It notes that while technology has accelerated, healthcare has lagged in adopting new technologies. Barriers include unrealistic expectations of technology's capabilities, challenges around customizing solutions at scale, and data explosion outpacing ability to analyze information. The author advocates learning from other industries that develop customizable solutions and integrate diverse data sources. Clinical leadership is needed to help overcome barriers and advance mental health informatics.
Digital Enlightment Forum: Towards a European ecosystem for health care data
Presentation of eStandards/Trillium II at the workshop of the Digital Enlightment Forum
Trusted! Quest for data-driven and fair health solutions Sitra / Hyvinvointi
An inspiring online event on 3 February 2021. We are discussing the future of data-driven health solutions that focus on fairness for all stakeholders: people, business and the public sector. We are asking questions such as: What is fairness in health? What role does trust play in data-driven health services? What needs to change and who needs to act? Most of all, we are launching “The Fair Health Data Challenge“.
Event speakers:
- Jaana Sinipuro, Project Director, IHAN – Human-driven data economy, Sitra
- Dipak Kalra, President, The European Institute for Innovation through Health Data (i~HD)
- Pekka Kahri, Technology Officer, HUS Helsinki University Hospital
- Markus Kalliola, Project Director, Health data 2030, Sitra
- Tiina Härkönen, Leading Specialist, Sitra
20191203 DOE Data Driven Healthcare- Expert EventDayOne
This document summarizes a presentation on data-driven healthcare given at the DayOne Experts conference in Basel on December 3rd, 2019. The presentation discusses whether the healthcare system is ready for data-driven approaches and what is needed to enable new data-driven health solutions. It notes that while digitalization in Swiss medical practices is increasing, coordination between different parts of the treatment chain remains a challenge. The Swiss Personalized Health Network aims to connect different health data sources and establish national infrastructure to enable interoperable exchange of health data for research purposes. University hospitals are working to structure, standardize and integrate different types of clinical and research data according to SPHN requirements.
This presentation outlines a mechanism for using the power of "Big Data", social networking and technology infrastructure to speed the process of curing a horrible disease.
The Dementia Intelligence Network (DIN) provides data tools and reports to help commissioners understand dementia prevalence and care in their local areas. The DIN's online Dementia Profile tool includes indicators across the dementia care pathway to assess needs, service usage, and outcomes. Recent updates include additional risk factor data and future plans involve expanding metrics on prevention, comorbidities, and health economics. The DIN aims to help local decision-makers improve dementia commissioning and care.
The IMI EHDEN project: large-scale analysis of observation data in Europe - C...Maxim Moinat
This document summarizes presentations from the CDISC 2020 Europe Interchange conference on April 1-2, 2020. It introduces Maxim Moinat who works on converting healthcare data to the OMOP Common Data Model. It also introduces Nigel Hughes who is the project lead for the IMI2 European Health Data & Evidence Network (EHDEN) and was previously involved in other real-world data projects. The document provides an agenda for the conference sessions which will discuss EHDEN, OHDSI, a EHDEN study-a-thon, using real-world data in clinical trials and for regulatory purposes.
ICG-11 - genomic data projects around the world - nov 5 2016Fiona Nielsen
How to find data for your research
Presented by Fiona Nielsen at the International Conference of Genomics 2016 www.icg-11.org in the session Data Sharing and Analysis chaired by Laurie Goodman, editor-in-chief, GigaScience
New Health Data Deluges Require Secure Information Flow Enablement Via Standa...Dana Gardner
Transcript of a BriefingsDirect podcast on how new devices and practices have the potential to expand the information available to healthcare providers and facilities.
This document discusses the need to talk about how health data is used. It notes that people currently have little understanding of how their data is used by various organizations like the NHS, academics, and commercial groups. The document outlines resources and strategies to help improve public understanding of issues like how data is kept safe, whether it is identifiable, what choices people have, and the benefits of data use. It emphasizes the importance of language and transparency to build public trust and confidence in the important uses of health data for individual care, research, and improving health services.
A look at the key trends and challenges in applying Big Data to transform healthcare by supporting research, self care, providers and building ecosystems. Purchase the report here: https://gumroad.com/l/PlXP
The document discusses patient summaries in emergency departments. It describes how patient summaries can provide key medical information to help emergency department clinicians make timely treatment decisions. However, emergency clinicians currently face barriers to accessing complete and trusted patient summary data. Standards development aims to address issues of interoperability and data sharing so that patient summaries can better support clinical decision-making in emergency care situations.
The document discusses patient summaries in emergency departments and their potential to improve care. It describes how patient summaries can provide key medical information to clinicians in emergency situations. However, effective use of patient summaries faces challenges around standards, access to data across organizations, and clinician trust in external medical records. Overcoming these challenges could help unlock the power of health data to support clinical decision making in emergency care.
Presentation done at CBEB'16 the 17th October by Vicente Traver introducing EIP-AHA and H2020 policies, the LINK project and different opportunities to cooperate together between Brazil and EU about telehealth and personal health
Slides as given for the i2b2 conference in October 2019. https://transmartfoundation.org/tubingen-symposium-2019-agenda/
Observational Health Data Sciences and Informatics (OHDSI) is a multi-stakeholder,
interdisciplinary, international collaborative with a mission to improve health by empowering
a community to collaboratively generate the evidence that promotes better health decisions
and better care. With 200 researchers from 25 countries and half a billion unique patients,
OHDSI carries out federated studies at sufficient scale to answer questions about diagnosis
and treatment. At the heart of the OHDSI platform is the OMOP Common Data Model,
currently at v6, around which a toolset is built for carrying out reusable, repeatable and
reproducible observational clinical research on a large scale.
Similar to Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation (20)
This document discusses different levels of semantics that can be used when making assertions in nanopublications. Weaker semantics include minted URIs which are machine readable but not machine interpretable. Stronger semantics involve linking concepts to existing ontologies to make assertions more machine interpretable. The document outlines approaches ranging from weakest to strongest semantics, noting tradeoffs between interpretability and difficulty.
Slides for the Technology Track of ISMB/ECCB 2013 in Berlin on digital publishing, highlighting the Research Object model, Nanopublications, and ISA as a means to capture methods and results when research is carried out digitally. This work was supported by the EU workflow forever project (http://wf4ever-project.org).
This document discusses how workflows can help biologists by allowing them to combine various computational tools and databases. It notes that individual biologists have limited time and computational skills, but can use workflows to access various expertises and resources. Workflows allow biologists to design complex computational experiments and analyze large amounts of data by connecting different services and applications in an automated, repeatable process.
Extended presentation from the enabling technology track of the BBMRI 'BioBanking for Science' conference in Amsterdam, September 2010. Feedback from the audience have been added.
This document introduces Marco Roos and discusses his transition from traditional molecular biology and bioinformatics work to e-science. It describes how e-science approaches can help address challenges in biology by enabling greater data and knowledge sharing, reuse of tools and workflows, and integrated analysis across multiple data types and sources. Examples discussed include semantic web technologies, workflow systems, and proposed e-laboratory platforms to empower scientists with virtual collaborative environments and intelligent assistance. The goal is to help biologists better exploit computational resources and expertise through enhanced and standardized e-science frameworks.
This document summarizes a presentation about using the Taverna workflow system and myExperiment repository for collaborative bioinformatics research. It discusses how Taverna allows researchers to combine multiple computational methods and online data sources into reproducible workflows. The presenter describes their own experiences with early "spaghetti code" approaches to bioinformatics and how e-Science tools now enable more insightful experiments through collaboration and sharing of workflows.
Presentation in support of AIDA demonstration at the ISMB/ECCB conference in Vienna, 2007. We demonstrated the application of AIDA web services for mining associations of proteins and diseases with an input query through a text mining workflow implemented as a workflow in Taverna. The AIDA toolkit combines services for information retrieval, information extraction, and Semantic Web modelling and storage. The services are created by experts in different fields collaborating under the name of 'Adaptive Information Disclosure' in the VL-e project (http://www.vl-e.nl).
The document summarizes the experience of a biologist in adopting an e-science approach to their work. It describes how before e-science, the biologist took an uncoordinated "spaghetti" approach using various tools without a unified strategy. The biologist then explains how adopting e-science principles like collaboration, reusable workflows, and web services helped enhance their work by allowing experts from different domains to combine their expertise. The biologist also reflects on outreach efforts to promote e-science to other researchers.
1. The document discusses how a biologist, Marco Roos, became interested in e-science through his work in molecular and cellular biology, bioinformatics, and data integration projects.
2. Roos describes how e-science allows for collaboration between different experts and disciplines through technologies like workflows, semantic web, and virtual laboratories.
3. Roos emphasizes that e-science should empower scientists by making tools and resources easy to use, share, and build upon so that scientists can focus on scientific problems rather than technical challenges.
The document discusses developments in e-Science and online tools for scientific communities. It describes how electronic lab notebooks, wikis, blogs and workflows can enable collaboration and knowledge sharing. Computational experiments using web services allow combining various experts' tools and data. E-science approaches leverage many minds to generate hypotheses, publish results and enable virtual laboratories.
When I was asked to give a companion lecture in support of ‘The Philosophy of Science’ (https://shorturl.at/4pUXz) I decided not to walk through the detail of the many methodologies in order of use. Instead, I chose to employ a long standing, and ongoing, scientific development as an exemplar. And so, I chose the ever evolving story of Thermodynamics as a scientific investigation at its best.
Conducted over a period of >200 years, Thermodynamics R&D, and application, benefitted from the highest levels of professionalism, collaboration, and technical thoroughness. New layers of application, methodology, and practice were made possible by the progressive advance of technology. In turn, this has seen measurement and modelling accuracy continually improved at a micro and macro level.
Perhaps most importantly, Thermodynamics rapidly became a primary tool in the advance of applied science/engineering/technology, spanning micro-tech, to aerospace and cosmology. I can think of no better a story to illustrate the breadth of scientific methodologies and applications at their best.
Or: Beyond linear.
Abstract: Equivariant neural networks are neural networks that incorporate symmetries. The nonlinear activation functions in these networks result in interesting nonlinear equivariant maps between simple representations, and motivate the key player of this talk: piecewise linear representation theory.
Disclaimer: No one is perfect, so please mind that there might be mistakes and typos.
dtubbenhauer@gmail.com
Corrected slides: dtubbenhauer.com/talks.html
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxMAGOTI ERNEST
Although Artemia has been known to man for centuries, its use as a food for the culture of larval organisms apparently began only in the 1930s, when several investigators found that it made an excellent food for newly hatched fish larvae (Litvinenko et al., 2023). As aquaculture developed in the 1960s and ‘70s, the use of Artemia also became more widespread, due both to its convenience and to its nutritional value for larval organisms (Arenas-Pardo et al., 2024). The fact that Artemia dormant cysts can be stored for long periods in cans, and then used as an off-the-shelf food requiring only 24 h of incubation makes them the most convenient, least labor-intensive, live food available for aquaculture (Sorgeloos & Roubach, 2021). The nutritional value of Artemia, especially for marine organisms, is not constant, but varies both geographically and temporally. During the last decade, however, both the causes of Artemia nutritional variability and methods to improve poorquality Artemia have been identified (Loufi et al., 2024).
Brine shrimp (Artemia spp.) are used in marine aquaculture worldwide. Annually, more than 2,000 metric tons of dry cysts are used for cultivation of fish, crustacean, and shellfish larva. Brine shrimp are important to aquaculture because newly hatched brine shrimp nauplii (larvae) provide a food source for many fish fry (Mozanzadeh et al., 2021). Culture and harvesting of brine shrimp eggs represents another aspect of the aquaculture industry. Nauplii and metanauplii of Artemia, commonly known as brine shrimp, play a crucial role in aquaculture due to their nutritional value and suitability as live feed for many aquatic species, particularly in larval stages (Sorgeloos & Roubach, 2021).
Current Ms word generated power point presentation covers major details about the micronuclei test. It's significance and assays to conduct it. It is used to detect the micronuclei formation inside the cells of nearly every multicellular organism. It's formation takes place during chromosomal sepration at metaphase.
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...Sérgio Sacani
Context. With a mass exceeding several 104 M⊙ and a rich and dense population of massive stars, supermassive young star clusters
represent the most massive star-forming environment that is dominated by the feedback from massive stars and gravitational interactions
among stars.
Aims. In this paper we present the Extended Westerlund 1 and 2 Open Clusters Survey (EWOCS) project, which aims to investigate
the influence of the starburst environment on the formation of stars and planets, and on the evolution of both low and high mass stars.
The primary targets of this project are Westerlund 1 and 2, the closest supermassive star clusters to the Sun.
Methods. The project is based primarily on recent observations conducted with the Chandra and JWST observatories. Specifically,
the Chandra survey of Westerlund 1 consists of 36 new ACIS-I observations, nearly co-pointed, for a total exposure time of 1 Msec.
Additionally, we included 8 archival Chandra/ACIS-S observations. This paper presents the resulting catalog of X-ray sources within
and around Westerlund 1. Sources were detected by combining various existing methods, and photon extraction and source validation
were carried out using the ACIS-Extract software.
Results. The EWOCS X-ray catalog comprises 5963 validated sources out of the 9420 initially provided to ACIS-Extract, reaching a
photon flux threshold of approximately 2 × 10−8 photons cm−2
s
−1
. The X-ray sources exhibit a highly concentrated spatial distribution,
with 1075 sources located within the central 1 arcmin. We have successfully detected X-ray emissions from 126 out of the 166 known
massive stars of the cluster, and we have collected over 71 000 photons from the magnetar CXO J164710.20-455217.
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...University of Maribor
Slides from talk:
Aleš Zamuda: Remote Sensing and Computational, Evolutionary, Supercomputing, and Intelligent Systems.
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Inter-Society Networking Panel GRSS/MTT-S/CIS Panel Session: Promoting Connection and Cooperation
https://www.etran.rs/2024/en/home-english/
The technology uses reclaimed CO₂ as the dyeing medium in a closed loop process. When pressurized, CO₂ becomes supercritical (SC-CO₂). In this state CO₂ has a very high solvent power, allowing the dye to dissolve easily.
ESR spectroscopy in liquid food and beverages.pptxPRIYANKA PATEL
With increasing population, people need to rely on packaged food stuffs. Packaging of food materials requires the preservation of food. There are various methods for the treatment of food to preserve them and irradiation treatment of food is one of them. It is the most common and the most harmless method for the food preservation as it does not alter the necessary micronutrients of food materials. Although irradiated food doesn’t cause any harm to the human health but still the quality assessment of food is required to provide consumers with necessary information about the food. ESR spectroscopy is the most sophisticated way to investigate the quality of the food and the free radicals induced during the processing of the food. ESR spin trapping technique is useful for the detection of highly unstable radicals in the food. The antioxidant capability of liquid food and beverages in mainly performed by spin trapping technique.
Phenomics assisted breeding in crop improvementIshaGoswami9
As the population is increasing and will reach about 9 billion upto 2050. Also due to climate change, it is difficult to meet the food requirement of such a large population. Facing the challenges presented by resource shortages, climate
change, and increasing global population, crop yield and quality need to be improved in a sustainable way over the coming decades. Genetic improvement by breeding is the best way to increase crop productivity. With the rapid progression of functional
genomics, an increasing number of crop genomes have been sequenced and dozens of genes influencing key agronomic traits have been identified. However, current genome sequence information has not been adequately exploited for understanding
the complex characteristics of multiple gene, owing to a lack of crop phenotypic data. Efficient, automatic, and accurate technologies and platforms that can capture phenotypic data that can
be linked to genomics information for crop improvement at all growth stages have become as important as genotyping. Thus,
high-throughput phenotyping has become the major bottleneck restricting crop breeding. Plant phenomics has been defined as the high-throughput, accurate acquisition and analysis of multi-dimensional phenotypes
during crop growing stages at the organism level, including the cell, tissue, organ, individual plant, plot, and field levels. With the rapid development of novel sensors, imaging technology,
and analysis methods, numerous infrastructure platforms have been developed for phenotyping.
ESPP presentation to EU Waste Water Network, 4th June 2024 “EU policies driving nutrient removal and recycling
and the revised UWWTD (Urban Waste Water Treatment Directive)”
Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation
1. THE RARE DISEASE DATA LINKAGE PLAN
BOOSTING RESEARCH BY MAKING DATA
RESOURCES FINDABLE, ACCESSIBLE,
INTEROPERABLE AND REUSABLE TOGETHER
I RDiRC 2017, Paris, Februar y 9, 2017
Marco Roos, David van Enckevort
Acknowledging patient representatives, ELIXIR(-EXCELERATE), RD-CONNECT, BBMRI(-NL), ODEX4ALL,
FAIRDict, the rare disease linked data and ontology task force, Mark Thompson, Robert Reihs, Rajaram
Kaliyaperumal, Pedro Sernadela, Marc Hanauer, Mark Wilkinson, Claudio Carta, Rachel Thompson, Estrella Lopes, Lorena
Casareto, Frederique Ehrhart, Roxana Merino, Luiz Bonino & team, Ronald Cornet, Peter Robinson, Mathias Brochhausen, Simon Jupp, Sira
Sarntivijai, Helen Parkinson, Ana Rath, Heimo Muller, Lucia Monaco, Domenica Taruscio, Manuel Posada, Luca Sangiorgi, Morris Swertz, José
Oliveira, Peter-Bram ‘t Hoen, Hanns Lochmuller, Larry Hunter, participants and organisers of rare disease Bring Your Own Data workshops
In this presentation, I will give
you a light-weight introduction
to the rare disease data linkage
plan to boost research on rare
diseases.
2. 2
ELIXIR
16 February
2017
A distributed
infrastructure
for life science
information
The plan is supported by several projects and
infrastructures. Here, I highlight ELIXIR, the European
infrastructure for life science information. Ivo Gut, Sergi
Beltran and Marco Roos co-lead the case for rare diseases in
this infrastructure.
3. 3
To boost rare disease research
Objective
16 February
2017
Our goal is to boost rare
disease research
4. 4
> 6000 rare diseases
16 February
2017
But then we have to take
into account that there are
over 6000 rare diseases
5. 5
> 6000 rare diseases
biobanks, registries, sequencing, OMICS, …
16 February
2017
Collecting multiple types of
materials and data, such as
biobanks, registries,
sequence data, omics,
etcetera
6. 6
> 6000 rare diseases
biobanks, registries, sequencing, OMICS, …
across countries and institutes
16 February
2017
And across many countries
and institutes
7. 7
What we want to achieve
16 February
2017
7
Rare disease researcher
What we aim to achieve is
the following; here is rare
disease researcher Claudio
8. 8
Rare disease researcher asks
Which treatments for symptoms
of other diseases may mitigate
the same symptoms of
my rare disease?
Claudio has questions like
these
9. 9
A rare disease researcher asks…
Where can I obtain biosamples of donors
with an abnormality in head or neck?
In which biobanks can I
find these samples?
Or these…
We have built a
demonstration web tool
to show how we can
answer these questions
with the technologies that
we advocate.
12. And obtain this table with
the answers.
Note that all items are blue; that means you can click on
them to get more information about them.
Note that the phenotypes are all subtypes of abnormality
of head or neck.
Also note that there are multiple diseases, multiple
biobanks, and multiple registries in the list. This shows
that the information came from different sources.
13. 13
The role of computational analysis in
boosting health care and life science will
Decrease
Increase
Stay the same
Questions to you
16 February
2017
Before I continue, think
about this question
14. 14
The role of computational analysis in
boosting health care and life science will
Decrease
Increase
Stay the same
Questions to you
16 February
2017
Most people tend to say it
will increase.
15. 15
to boost rare disease research
16 February
2017
If that is the case, then…
17. 17
The substrate for computational analysts
DATA
And then we have to look at
the substrate for
computational analysts: data
18. 18
How to boost rare disease research?
PATIENT
DATA
In the rare disease domain,
we have many different
types of data, such as
patient data
19. 19
How to boost rare disease research?
OMICS
DATA
Omics data
20. 20
How to boost rare disease research?
SAMPLE
DATA
Or biological samples
21. 21
How to boost rare disease research?
SAMPLE
DATA
And in all shapes and sizes,
different languages,
different formats
22. 22
How to boost rare disease research?
PATIENT
DATA
> 6000 rare diseases
across countries and institutes
Remember that we have
thousands of data sources in
our domain.
23. 23
How to boost rare disease research?
PATIENT
DATA
Data incompatibilities are
an enormous bottleneck for
data analysts: they spend
months per data source to
resolve them.
24. 24
How to boost rare disease research?
DA
TA
DATA
You Them
A way to address this is by letting
others make your data compatible:
‘they’ transform the data to be more
compatible.
25. 25
How to boost rare disease research?
DA
TA
You Them
DATA
They can do that with
multiple data sources, and
integrate them.
26. 26
DATA
How to boost rare disease research?
DA
TA
You Them
However, there is a big risk. When for whatever
reason, ‘they’ cannot maintain this anymore, for
instance because the funding stops…
27. 27
How to boost rare disease research?
DA
TA
You
There is nothing left.
28. 28
How to boost rare disease research?
DA
TA
You
We are back to square one:
incompatible data.
This is not good enough for data infrastructure.
International leading data experts have defined an
approach for this that I cannot explain better than is done
in the following video.
30. 30
G20 Endorsement of FAIR principles
Next to ELIXIR, EOSC, NIH Commons
We, the leaders of the G20…
facilitate appropriate access to publicly
funded research results on
findable, accessible,
interoperable and reusable
(FAIR) principles …
The FAIR principles are highly endorsed, by ELIXIR, the Open
European Science Cloud, NIH via its ‘commons’ program,
and since 2016 also the G20.
31. 31
FAIR principles applied to rare disease data
RD
DATA
FAIR
Linkable
RD
DATA
You
How do we apply them for
rare diseases?
At a high level, the steps are more-or-less the
same: your data, a transformation, but now
we have FAIR, linkable data on the right.
32. 32
FAIR principles applied to rare disease data
RD
DATA
FAIR
Linkable
RD
DATA
You YouYou and them
together
Knowledge exchange
But there are major differences: instead of ‘you’
and ‘them’ data experts and FAIR data experts do
the transformation together.
This involves substantial knowledge exchange. Another major difference is that
instead of ‘them’ there is ‘you’ on the right: data owners stay in control.
33. 33
FAIR principles applied to rare disease data
Data can be more easily combined. Each resource is an
independently FAIR resource. This is a much more
robust infrastructure.
34. 34
Creating substrate to boost rare disease research
In our rare
disease data
linkage plan
we go
through this
process, one
at a time.
Each time we
improve our
methods,
each time we
do this faster.
35. Rare disease data linkage plan - 2017
• > 7 biobanks/registries FAIR at the source
• Study FAIR pathways, Orphanet, mutation data
• Support: RD-Connect, ELIXIR, BBMRI (-NL), FAIRDict,
ODEX4All, patient organisations
David van Enckevort
Technical leader
Our aims for 2017
36. 36
Light-weight introduction to the rare
disease data linkage plan
Account for scale and sparsity of data in
rare disease domain
Federated infrastructure of local FAIR data
Summary
16-Feb-17
36
37. 37
Invitation
Contact us about making
rare disease data FAIR
Turn plan into long-
running service
Long term plans
16 February
2017
Contact us about making rare disease data FAIR. Let us know
if you would like to help turning the plan for 2017 into a long
running service for the rare disease community. We envision
a role for patient organisations in that.
38. • Mascha Jansen: FAIR data projects
and Bring Your Own Data workshops
(mascha.jansen@dtls.nl)
• David van Enckevort, Marco Roos:
Rare disease data linkage plan &
FAIR RD data projects
• Erik Schultes: FAIR data (awareness)
training; for Elixir/RD: Brane
Leskosek (DTL: Celia van Gelder)
Thank you
Acknowledging patient
representatives, ELIXIR(-
EXCELERATE), RD-CONNECT,
BBMRI(-NL), ODEX4ALL,
FAIRDict, the rare disease
linked data and ontology
task force, David van Enckevort,
Mark Thompson, Robert Reihs,
Rajaram Kaliyaperumal, Pedro
Sernadela, Marc Hanauer, Mark
Wilkinson, Claudio Carta, Rachel
Thompson, Estrella Lopes, Lorena
Casareto, Frederique Ehrhart,
Roxana Merino, Luiz Bonino & team,
Ronald Cornet, Peter Robinson,
Mathias Brochhausen, Simon Jupp,
Sira Sarntivijai, Helen Parkinson, Ana
Rath, Heimo Muller, Lucia Monaco,
Domenica Taruscio, Manuel Posada,
Luca Sangiorgi, Morris Swertz, José
Oliveira, Peter-Bram ‘t Hoen, Hanns
Lochmuller, Larry Hunter,
participants and organisers of rare
disease Bring Your Own Data
workshops
Thank you for your attention.
Editor's Notes
In this presentation, I will give you a light-weight introduction to the rare disease data linkage plan to boost research on rare diseases.
The plan is supported by several projects and infrastructures. Here, I highlight ELIXIR, the European infrastructure for life science information. Ivo Gut and Sergi Beltran from Barcelona and myself from Leiden are carrying the flag for rare diseases in this infrastructure.
Our goal is to boost rare disease research
But then we have to take into account that there are over 6000 rare diseases
For each disease we collect multiple types of materials and data, such as samples in biobanks, health information in registries, sequence data, omics data, etcetera
And across many countries and institutes
What we aim to achieve is the following; here is rare disease researcher Claudio
Claudio has questions like these: which treatments for symptoms of other diseases may mitigate the same symptoms in my disease of interest?
Or, where can I obtain biosamples of donors with an abnormality in head or neck?
And in which biobanks can I find these samples?
We have built a demonstration web tool to show how we can answer these questions with the technologies that we advocate.
Here, Claudio can select his question.
Select the symptom that he is interested in
And obtain this table with the answers.
Note that all items are blue; that means you can click on them to get more information about them.
Note that the phenotypes are all subtypes of abnormality of head or neck.
Also note that there are multiple diseases, multiple biobanks, and multiple registries in the list. This shows that the information came from different sources.
Before I continue, think about this question: do you think that the role of computational analysis will increase in health care and life science in the future?
Most people tend to say it will increase.
If that is the case, then to boost rare disease research
We need to enable computational analysts to boost rare disease research
And then we have to look at the substrate for computational analysts: data
In the rare disease domain, we have many different types of data, such as patient data
Omics data
Or biosamples
And in all shapes and sizes, different languages, different formats
Remember that we have thousands of data sources in our domain.
Data incompatibilities are an enormous bottleneck for data analysts: they spend months per data source to resolve them.
A way to address this is by letting others make your data compatible: ‘they’ transform the data to be more compatible.
They can do that with multiple data sources, and integrate them.
However, there is a big risk. When for whatever reason, ‘they’ cannot maintain this anymore, for instance because the funding stops.
There is nothing left.
We are back to square one: incompatible data.
This is not good enough for data infrastructure. International leading data experts have defined an approach for this that I cannot explain better than is done in the following video.
The FAIR principles are highly endorsed, such as by ELIXIR, a European initiative called the Open European Science Cloud, NIH via its ‘commons’ program, and since 2016 also the G20.
How do we apply them for rare diseases?
At a high level, the steps are more-or-less the same: your data, a transformation, but now we have FAIR, linkable data on the right.
But there are major differences: instead of ‘you’ and ‘them’ data experts and FAIR data experts do the transformation together. This involves substantial knowledge exchange. Another major difference is that instead of ‘them’ there is ‘you’ on the right: data owners stay in control of their data.
Data can be more easily combined. Each resource is an independently FAIR resource. This is a much more robust infrastructure.
In our rare disease data linkage plan we go through this process, one at a time. Each time we improve our methods, each time we do this faster.
In 2017 we aim at our first seven biobanks/registries, we will study pathways, orphaned and mutation data.
We have support from multiple projects, including patient organisations who we ask to also invest into the collaboration.
In summary: I have given a light-weight introduction to the rare disease data linkage pan.
I have shown how we account for the scale and sparsity of data in the rare disease domain,
By federated infrastructure of local FAIR data
I invite you to contact us about making rare disease data FAIR
And I invite you to let us know if you would like to help us turn this plan into a long running service. We envision a role for patient organisations in that.
Thank you for your attention. Here are some contact points.