Introducing the HPC challenges associated with developing a set of clinical microbial genomics services in the NHS in Wales. Demonstrating the potential of these technologies, and the impact it is already having for the patients of the Welsh NHS.
I spoke on "Big Data in Biology". The talk basically concentrates on how biology has affected big data and how big data has become a key player in biology. I have also covered how DNA storage can address long term archival storage.
Genetic engineers are developing new techniques to turn cells into miniature drug factories by delivering synthetic DNA. This is done using either viral or plasmid vectors to transport the DNA past the cell's lipid bilayer and into the nucleus. Techniques like electroporation, gene guns, and novel injection devices use physical forces to disrupt the bilayer and allow DNA entry. Team is working with ChronTech Pharma to optimize a new injection device called IVIN that uses an array of needles to inject DNA vaccines into muscle tissue with improved transfection results. Overcoming the challenge of efficiently delivering genetic material into cells opens possibilities for treating previously untreatable diseases.
This document discusses the importance of computational tools for biological research. It provides an overview of how computer applications are used in areas like the Human Genome Project, transcriptomics, proteomics, and systems biology. The document also notes challenges for biological research in Thailand, including a lack of background knowledge in computer science and limited access to free and easy-to-use computational tools, especially in the Thai language. It argues that biology students in Thailand should be taught bioinformatics and computational biology skills to better facilitate biological research.
Brief presentation on the challenges and current state of play with regards to the bioinformatics of a pathogen, M. tuberculosis. Presented at the UWC/UCT Big Data workshop in January 2015
Federal Research & Development for the Florida system Sept 2014 Warren Kibbe
This document discusses challenges in cancer data integration and analysis. It proposes the development of open science models, standardized data elements, and sustainable informatics infrastructure. Emerging technologies like mobile devices, social media, and cloud computing create opportunities to build a national "learning health system" for cancer. The National Cancer Institute is pursuing initiatives like the Cancer Genomics Data Commons and cloud pilots to leverage large genomic and clinical datasets using these technologies and develop predictive models to improve outcomes. The ultimate goal is a system that facilitates data sharing, continuous learning from all cancer patients, and personalized, predictive oncology.
GxAlert Papua New Guinea Case Study 072518SystemOne
GxAlert's use in Papua New Guinea for disease surveillance and response. Initially used for Tuberculosis response, device management and second-line drug forecasting and stockage.
I spoke on "Big Data in Biology". The talk basically concentrates on how biology has affected big data and how big data has become a key player in biology. I have also covered how DNA storage can address long term archival storage.
Genetic engineers are developing new techniques to turn cells into miniature drug factories by delivering synthetic DNA. This is done using either viral or plasmid vectors to transport the DNA past the cell's lipid bilayer and into the nucleus. Techniques like electroporation, gene guns, and novel injection devices use physical forces to disrupt the bilayer and allow DNA entry. Team is working with ChronTech Pharma to optimize a new injection device called IVIN that uses an array of needles to inject DNA vaccines into muscle tissue with improved transfection results. Overcoming the challenge of efficiently delivering genetic material into cells opens possibilities for treating previously untreatable diseases.
This document discusses the importance of computational tools for biological research. It provides an overview of how computer applications are used in areas like the Human Genome Project, transcriptomics, proteomics, and systems biology. The document also notes challenges for biological research in Thailand, including a lack of background knowledge in computer science and limited access to free and easy-to-use computational tools, especially in the Thai language. It argues that biology students in Thailand should be taught bioinformatics and computational biology skills to better facilitate biological research.
Brief presentation on the challenges and current state of play with regards to the bioinformatics of a pathogen, M. tuberculosis. Presented at the UWC/UCT Big Data workshop in January 2015
Federal Research & Development for the Florida system Sept 2014 Warren Kibbe
This document discusses challenges in cancer data integration and analysis. It proposes the development of open science models, standardized data elements, and sustainable informatics infrastructure. Emerging technologies like mobile devices, social media, and cloud computing create opportunities to build a national "learning health system" for cancer. The National Cancer Institute is pursuing initiatives like the Cancer Genomics Data Commons and cloud pilots to leverage large genomic and clinical datasets using these technologies and develop predictive models to improve outcomes. The ultimate goal is a system that facilitates data sharing, continuous learning from all cancer patients, and personalized, predictive oncology.
GxAlert Papua New Guinea Case Study 072518SystemOne
GxAlert's use in Papua New Guinea for disease surveillance and response. Initially used for Tuberculosis response, device management and second-line drug forecasting and stockage.
The Swaziland National Tuberculosis Program (NTP) implemented a new diagnostic device management platform called GxAlert to help manage their fleet of GeneXpert devices across 28 laboratories in the country. This was necessary to maximize the potential of the new diagnostics by allowing remote monitoring, error detection, targeted training, and data-driven decision making. With support from SystemOne and a skilled local NTP team, 15 laboratories were successfully connected to GxAlert in just 3 days, despite some initial challenges. GxAlert has since helped the NTP optimize inventory, monitor the drug resistance survey, and collect over 48,000 diagnostic results to improve patient care and understand disease trends.
The document discusses a knowledge management platform developed at Genentech to manage pre-clinical animal studies.
The platform called DIVOS manages over 12,000 in vivo studies dating back to 1998 across multiple therapeutic areas conducted both in-house and by CROs. The technical approach involved developing a structured yet flexible platform to capture study details while enabling data reuse. People were key to change through various teams guiding strategy, tactics and operations. New capabilities like improved study logistics, enhanced collaboration and new insights from data analysis provide evidence of the platform's success.
From Digitally Enabled Genomic Medicineto Personalized HealthcareLarry Smarr
The document discusses the future of personalized healthcare through digital health technologies and genomic medicine. It describes how continuous monitoring of various biological sensors can capture temporal data on factors like physical activity, diet, sleep, environmental exposures and more. This comprehensive data combined with clinical records, genetic information, and microbial metagenomic analysis can enable true preventative medicine through early detection, feedback loops, and tuning of lifestyle and medical factors.
Classifying lymphoma and tuberculosis case reports using machine learning alg...journalBEEI
Available literature reports several lymphoma cases misdiagnosed as tuberculosis, especially in countries with a heavy TB burden. This frequent misdiagnosis is due to the fact that the two diseases can present with similar symptoms. The present study therefore aims to analyse and explore TB as well as lymphoma case reports using Natural Language Processing tools and evaluate the use of machine learning to differentiate between the two diseases. As a starting point in the study, case reports were collected for each disease using web scraping. Natural language processing tools and text clustering were then used to explore the created dataset. Finally, six machine learning algorithms were trained and tested on the collected data, which contained 765 lymphoma and 546 tuberculosis case reports. Each method was evaluated using various performance metrics. The results indicated that the multi-layer perceptron model achieved the best accuracy (93.1%), recall (91.9%) and precision score (93.7%), thus outperforming other algorithms in terms of correctly classifying the different case reports.
Microsoft genomics to advance clinical scienceBruno Denys
Microsoft invested mlassively into advanced clinical genomics services in the public cloud Azure. This is a healthcare revolution allowing to create new treatment based on better Genomic information
Bangladesh case study on how we implemented a connected diagnostics solution using GxAlert for infectious disease surveillance, TB program optimization and leveraging diagnostic data.
Presentation about how much bioinformatics involved in the medical field. This was presented at the University of Colombo in 2007 for an undergraduate seminar
This document discusses complex metagenome assembly and career thoughts in bioinformatics. It begins with the speaker's research background and then discusses two main topics: 1) challenges with metagenome assembly due to low coverage regions and strain variation in sequencing data, and approaches using assembly graphs, and 2) the need for more "bioinformaticians in the middle" who are comfortable with both biology and computational analysis to integrate large-scale data into their research. The speaker provides advice for embracing computation and seeking formal training opportunities to develop skills at this intersection of disciplines.
The Application of Next Generation Sequencing (NGS) in cancer treatmentPremadarshini Sai
Next-generation sequencing (NGS) has several advantages for cancer treatment including high throughput sequencing, screening of multiple genes simultaneously, and decreased costs. NGS faces challenges from complex data analysis and validation of new technologies. Key clinical applications of NGS include whole genome sequencing, transcriptome analysis via RNA-seq, and sequencing of cell-free DNA. Future areas of development include immunotherapy, epigenetics research, and using circulating tumor cells to detect early relapse. More research is still needed to fully realize the potential of NGS in personalized cancer treatment.
Week 1 lecture for High School Bioinformatics course; covers why we need to use computers in biology, what bioinformatics/computational biology is, an introduction to machine learning, and examples from current research
We can aid decision making from the pre-clinical to the clinical setting, supporting line of sight to the clinic, by identifying and translating crucial biomarker approaches into the real world.
The document discusses how artificial intelligence can be used for human welfare in various fields such as biology, medicine, and agriculture. It provides examples of how AI is inspired by biological systems to make intelligent decisions. AI is being used in medical applications such as cancer treatment, regenerative medicine, and precision agriculture to increase crop yields in a sustainable way. The document concludes that AI systems have great potential to help address challenges in healthcare access and delivery in India by powering virtual assistants and precision farming technologies.
Storage and Analysis of Sensitive Large-Scale Biomedical Data in SwedenOla Spjuth
This document summarizes the storage and analysis of sensitive large-scale biomedical data in Sweden. It discusses:
1) The Science for Life Laboratory, established in 2010, which is a leading center for developing and applying large-scale technologies for molecular biosciences.
2) Hardware resources available for bioinformatics analysis including high-performance computing clusters and large-scale storage facilities.
3) The generation of over 120 terabases of sequencing data in 2014 alone and resources for ongoing large-scale human genome sequencing efforts.
4) The SNIC-Sens project, a 4-year national security project funded to develop specifications for analyzing sensitive biomedical data within legal and ethical guidelines.
Whole Exome and Genome sequencing in Neurological disorders. The document discusses the techniques of next generation sequencing (NGS), including whole exome sequencing (WES) and whole genome sequencing (WGS). It provides examples of studies applying WES/WGS to diagnose neurological disorders, finding diagnostic variants in 25-60% of cases of leukoencephalopathy, limb-girdle muscular dystrophy, cerebellar ataxia, and more. The limitations and clinical applications of WES/WGS are also reviewed.
Feasibility of an SMS intervention to deliver tuberculosis testing results in...SystemOne
Pre-treatment loss to follow-up is common for patients diagnosed with tuberculosis (TB) in high-burden countries. Delivering test results by SMS is increasingly being considered as a solution, but there is limited information about its feasibility as a public health tool in low resourced settings. It was found that reporting Xpert results via automated SMS is technically feasible and results in approximately half of patients receiving their test results immediately. Additional research should be done to address process inefficiencies in order to maximize impact of this technology and link its successful utilization to improved patient outcomes.
Healthcare Conference 2013 : Genes, Clouds and Cancer - dr. Andrew LittD3 Consutling
Dell Healthcare provides IT services to healthcare organizations worldwide. They serve over 50% of US hospitals, the top 10 pharmaceutical companies, and 100 insurance organizations. Dell Healthcare manages billions of medical images in the cloud, billions of security events daily, and provides genomic sequencing services. They are sponsoring the first FDA-approved clinical trial using whole genome sequencing to provide personalized cancer treatment to children with neuroblastoma. The trial aims to reduce analysis time from weeks to hours using Dell's high performance computing capabilities and improve collaboration using their genomics cloud. The goal is to expand personalized medicine from treating a few children to hundreds and thousands.
Machine learning has many applications and opportunities in biology, though also faces challenges. It can be used for tasks like disease detection from medical images. Deep learning models like convolutional neural networks have achieved performance exceeding human experts in detecting pneumonia from chest X-rays. Frameworks like DeepChem apply deep learning to problems in drug discovery, while platforms like Open Targets integrate data on drug targets and their relationships to diseases. Overall, machine learning shows promise for advancing biological research, though developing expertise through learning resources and implementing models to solve real-world problems is important.
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...Warren Kibbe
The promise of precision medicine in oncology is predicated on the availability of accurate, high quality data from the clinic and the laboratory. Likewise, a Learning Health System is one in which we use data to monitor that we are following guidelines and care pathways to deliver the best care and not revert to prior practices (regression testing for care!) and also provide real world evidence to determine effectiveness and identify populations that would benefit from novel therapies. Into this mix of clinical drivers are the rapidly changing capabilities in instrumentation, computing, computation, and the pervasive use of sensors and smart devices. I will highlight a few of the obvious and perhaps not as obvious opportunities in leveraging the increasingly digital landscape in healthcare and biomedical research as we move toward a national learning health system for cancer.
Making your science powerful : an introduction to NGS experimental designjelena121
A basic overview of considerations for designing genomics experiments using Next Generation Sequencing (NGS). Includes a discussion of power, accuracy, what samples to collect, and what sequencing parameters to use.
My talk from the IBMS Congress 2019, outlining key challenges and advice, based on our experience, for people who want to implement pathogen genomics services.
The Swaziland National Tuberculosis Program (NTP) implemented a new diagnostic device management platform called GxAlert to help manage their fleet of GeneXpert devices across 28 laboratories in the country. This was necessary to maximize the potential of the new diagnostics by allowing remote monitoring, error detection, targeted training, and data-driven decision making. With support from SystemOne and a skilled local NTP team, 15 laboratories were successfully connected to GxAlert in just 3 days, despite some initial challenges. GxAlert has since helped the NTP optimize inventory, monitor the drug resistance survey, and collect over 48,000 diagnostic results to improve patient care and understand disease trends.
The document discusses a knowledge management platform developed at Genentech to manage pre-clinical animal studies.
The platform called DIVOS manages over 12,000 in vivo studies dating back to 1998 across multiple therapeutic areas conducted both in-house and by CROs. The technical approach involved developing a structured yet flexible platform to capture study details while enabling data reuse. People were key to change through various teams guiding strategy, tactics and operations. New capabilities like improved study logistics, enhanced collaboration and new insights from data analysis provide evidence of the platform's success.
From Digitally Enabled Genomic Medicineto Personalized HealthcareLarry Smarr
The document discusses the future of personalized healthcare through digital health technologies and genomic medicine. It describes how continuous monitoring of various biological sensors can capture temporal data on factors like physical activity, diet, sleep, environmental exposures and more. This comprehensive data combined with clinical records, genetic information, and microbial metagenomic analysis can enable true preventative medicine through early detection, feedback loops, and tuning of lifestyle and medical factors.
Classifying lymphoma and tuberculosis case reports using machine learning alg...journalBEEI
Available literature reports several lymphoma cases misdiagnosed as tuberculosis, especially in countries with a heavy TB burden. This frequent misdiagnosis is due to the fact that the two diseases can present with similar symptoms. The present study therefore aims to analyse and explore TB as well as lymphoma case reports using Natural Language Processing tools and evaluate the use of machine learning to differentiate between the two diseases. As a starting point in the study, case reports were collected for each disease using web scraping. Natural language processing tools and text clustering were then used to explore the created dataset. Finally, six machine learning algorithms were trained and tested on the collected data, which contained 765 lymphoma and 546 tuberculosis case reports. Each method was evaluated using various performance metrics. The results indicated that the multi-layer perceptron model achieved the best accuracy (93.1%), recall (91.9%) and precision score (93.7%), thus outperforming other algorithms in terms of correctly classifying the different case reports.
Microsoft genomics to advance clinical scienceBruno Denys
Microsoft invested mlassively into advanced clinical genomics services in the public cloud Azure. This is a healthcare revolution allowing to create new treatment based on better Genomic information
Bangladesh case study on how we implemented a connected diagnostics solution using GxAlert for infectious disease surveillance, TB program optimization and leveraging diagnostic data.
Presentation about how much bioinformatics involved in the medical field. This was presented at the University of Colombo in 2007 for an undergraduate seminar
This document discusses complex metagenome assembly and career thoughts in bioinformatics. It begins with the speaker's research background and then discusses two main topics: 1) challenges with metagenome assembly due to low coverage regions and strain variation in sequencing data, and approaches using assembly graphs, and 2) the need for more "bioinformaticians in the middle" who are comfortable with both biology and computational analysis to integrate large-scale data into their research. The speaker provides advice for embracing computation and seeking formal training opportunities to develop skills at this intersection of disciplines.
The Application of Next Generation Sequencing (NGS) in cancer treatmentPremadarshini Sai
Next-generation sequencing (NGS) has several advantages for cancer treatment including high throughput sequencing, screening of multiple genes simultaneously, and decreased costs. NGS faces challenges from complex data analysis and validation of new technologies. Key clinical applications of NGS include whole genome sequencing, transcriptome analysis via RNA-seq, and sequencing of cell-free DNA. Future areas of development include immunotherapy, epigenetics research, and using circulating tumor cells to detect early relapse. More research is still needed to fully realize the potential of NGS in personalized cancer treatment.
Week 1 lecture for High School Bioinformatics course; covers why we need to use computers in biology, what bioinformatics/computational biology is, an introduction to machine learning, and examples from current research
We can aid decision making from the pre-clinical to the clinical setting, supporting line of sight to the clinic, by identifying and translating crucial biomarker approaches into the real world.
The document discusses how artificial intelligence can be used for human welfare in various fields such as biology, medicine, and agriculture. It provides examples of how AI is inspired by biological systems to make intelligent decisions. AI is being used in medical applications such as cancer treatment, regenerative medicine, and precision agriculture to increase crop yields in a sustainable way. The document concludes that AI systems have great potential to help address challenges in healthcare access and delivery in India by powering virtual assistants and precision farming technologies.
Storage and Analysis of Sensitive Large-Scale Biomedical Data in SwedenOla Spjuth
This document summarizes the storage and analysis of sensitive large-scale biomedical data in Sweden. It discusses:
1) The Science for Life Laboratory, established in 2010, which is a leading center for developing and applying large-scale technologies for molecular biosciences.
2) Hardware resources available for bioinformatics analysis including high-performance computing clusters and large-scale storage facilities.
3) The generation of over 120 terabases of sequencing data in 2014 alone and resources for ongoing large-scale human genome sequencing efforts.
4) The SNIC-Sens project, a 4-year national security project funded to develop specifications for analyzing sensitive biomedical data within legal and ethical guidelines.
Whole Exome and Genome sequencing in Neurological disorders. The document discusses the techniques of next generation sequencing (NGS), including whole exome sequencing (WES) and whole genome sequencing (WGS). It provides examples of studies applying WES/WGS to diagnose neurological disorders, finding diagnostic variants in 25-60% of cases of leukoencephalopathy, limb-girdle muscular dystrophy, cerebellar ataxia, and more. The limitations and clinical applications of WES/WGS are also reviewed.
Feasibility of an SMS intervention to deliver tuberculosis testing results in...SystemOne
Pre-treatment loss to follow-up is common for patients diagnosed with tuberculosis (TB) in high-burden countries. Delivering test results by SMS is increasingly being considered as a solution, but there is limited information about its feasibility as a public health tool in low resourced settings. It was found that reporting Xpert results via automated SMS is technically feasible and results in approximately half of patients receiving their test results immediately. Additional research should be done to address process inefficiencies in order to maximize impact of this technology and link its successful utilization to improved patient outcomes.
Healthcare Conference 2013 : Genes, Clouds and Cancer - dr. Andrew LittD3 Consutling
Dell Healthcare provides IT services to healthcare organizations worldwide. They serve over 50% of US hospitals, the top 10 pharmaceutical companies, and 100 insurance organizations. Dell Healthcare manages billions of medical images in the cloud, billions of security events daily, and provides genomic sequencing services. They are sponsoring the first FDA-approved clinical trial using whole genome sequencing to provide personalized cancer treatment to children with neuroblastoma. The trial aims to reduce analysis time from weeks to hours using Dell's high performance computing capabilities and improve collaboration using their genomics cloud. The goal is to expand personalized medicine from treating a few children to hundreds and thousands.
Machine learning has many applications and opportunities in biology, though also faces challenges. It can be used for tasks like disease detection from medical images. Deep learning models like convolutional neural networks have achieved performance exceeding human experts in detecting pneumonia from chest X-rays. Frameworks like DeepChem apply deep learning to problems in drug discovery, while platforms like Open Targets integrate data on drug targets and their relationships to diseases. Overall, machine learning shows promise for advancing biological research, though developing expertise through learning resources and implementing models to solve real-world problems is important.
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...Warren Kibbe
The promise of precision medicine in oncology is predicated on the availability of accurate, high quality data from the clinic and the laboratory. Likewise, a Learning Health System is one in which we use data to monitor that we are following guidelines and care pathways to deliver the best care and not revert to prior practices (regression testing for care!) and also provide real world evidence to determine effectiveness and identify populations that would benefit from novel therapies. Into this mix of clinical drivers are the rapidly changing capabilities in instrumentation, computing, computation, and the pervasive use of sensors and smart devices. I will highlight a few of the obvious and perhaps not as obvious opportunities in leveraging the increasingly digital landscape in healthcare and biomedical research as we move toward a national learning health system for cancer.
Making your science powerful : an introduction to NGS experimental designjelena121
A basic overview of considerations for designing genomics experiments using Next Generation Sequencing (NGS). Includes a discussion of power, accuracy, what samples to collect, and what sequencing parameters to use.
My talk from the IBMS Congress 2019, outlining key challenges and advice, based on our experience, for people who want to implement pathogen genomics services.
Clinical trial data wants to be free: Lessons from the ImmPort Immunology Dat...Barry Smith
Presentation to the Clinical and Research Ethics Seminar, Clinical and Translational Science Center, Buffalo, January 21, 2014
https://immport.niaid.nih.gov/
http://youtu.be/booqxkpvJMg
Will Biomedical Research Fundamentally Change in the Era of Big Data?Philip Bourne
This document discusses how biomedical research may fundamentally change in the era of big data. It notes that biomedical research has always been data-driven, but the scope, variety, complexity and volume of data is now much greater. It also discusses the need for more open data sharing and new tools and methods for large-scale analysis. The document suggests biomedical research may move towards a more collaborative "platform" model, as seen with companies like Airbnb, with the goal of improving data access, reuse and reproducibility of research. However, overcoming challenges like incentives, trust and work practices will be important for any new platform to succeed.
Genomic epidemiology uses whole genome sequencing data from pathogens combined with epidemiological investigations to track the spread of infectious diseases. The document discusses making genomic epidemiology a widespread reality in public health. It outlines key requirements including building a user-friendly analysis platform, developing portable analysis pipelines, providing training to public health personnel, and improving information sharing between organizations.
How Can We Make Genomic Epidemiology a Widespread Reality? - William HsiaoWilliam Hsiao
The document discusses genomic epidemiology and the requirements to bring genomic sequencing into routine public health practice. It outlines two parts: (1) what genomic epidemiology is and why it is important; and (2) the requirements for genomic sequencing to be used routinely in public health. Whole genome sequencing is seen as a way to generate high quality pathogen genomes quickly and allow for more detailed tracking of disease spread compared to traditional methods. However, bringing genomic sequencing into public health practice requires overcoming barriers such as the need for user-friendly analysis platforms, training public health personnel in genomics, and improving information sharing between organizations.
We enable UK life science companies to develop their drug discovery projects. And through networks of expert labs and CROs our Virtual R&D team can access and provide:
> industrially rigorous advice in drug discovery
> clinical and commercial insight
> expertise in delivery and project management
If you are an SME with a drug discovery project, or a CRO with expertise to provide, attend this event and find out how we can help you.
MedChemica BigData What Is That All About?Al Dossetter
A light look at the world of BigData for the lay person - a look at a couple of examples and what we do in MedChemica to speed up drug discovery. First presented at Macclesfield SciBar, and then Knutsford SciBar.
- National challenges in cancer research include lowering barriers to data access and analysis, and integrating clinical and basic research data to enable improved outcomes.
- Disruptive technologies like high-throughput biology and ubiquitous computing are generating large amounts of molecular and clinical cancer data.
- The NCI is working to build infrastructure like the Genomics Data Commons and Cloud Pilots to make these data widely accessible and support data analysis.
- The goal is to develop a national "learning health system" that applies insights from real-world cancer data to research and clinical practice to continuously improve patient care and outcomes.
My Personal Odyssey with Big Data - Brad PopovichCityAge
Brad Popovich underwent early exploration of his personal genome in the 1970s-2000s by analyzing genes related to various conditions. In 2013-2014, he obtained whole genome sequencing through Illumina's Understanding Your Genome program to learn more about his genetic blueprint and risks. While whole genome sequencing provides potential for informed decisions, interpretation challenges remain due to limited comparison data and phenotypic information. Popovich hopes to learn how it feels to be a consumer of genomic data and whether the healthcare system can accommodate medically actionable follow-up of any concerning variants found.
IRIDA: A Federated Bioinformatics Platform Enabling Richer Genomic Epidemiolo...William Hsiao
Introducing BCCDC and Public Health Microbiology (PHM)
Current State of PHM
Sequence Technology Advancement -> revolution of PHM
Genomic Epidemiology
Amount of Sequence Data Produced
Need to Process the data – Introduction to IRIDA
Need of Metadata and Ontology
Software to improve data sharing
How research microbiology and PHM can joint effort
Hannes smarason next code-wuxi combined technologiesHannes Smárason
Lower-cost genome sequencing has reached a point of strong commercial viability. The remaining 2 legs of the “3-legged stool” of genomics-enabling technologies —genomic analysis tools database storage—are rapidly evolving to support the use of genomic information in medical care.
Talk entitled "from the Virtual Human to a Digital Me" presented at the Virtual Physiological Human 2012 Conference held at IET Savoy, Savoy Place, London, 18-20 September 2012.
Genome sequencing technology available today can accurately sequence a whole genome from an individual’s test sample for a surprisingly low cost.
As a result, the adoption of this technology is rapidly expanding as medical centers around the world embrace its utility in informing healthcare decisions—an emerging reality of personalized medicine.
Accelerating the benefits of genomics worldwideJoaquin Dopazo
Grand Challenges in Genomics
A Joint NHGRI and Wellcome Trust Strategic Meeting
25 and 26 February 2019
https://www.wellcomeevents.org/WELLCOME/media/uploaded/EVWELLCOME/event_661/Draft_agenda_for_WT_December_2018.pdf
Join lecture: Nicky Mulder, Han Brunner and Joaquin Dopazo
Utilization of virtual microscopy in a cooperative group settingBIT002
The document discusses the use of virtual microscopy in cooperative cancer research groups. It summarizes the Research Informatics Core's role in developing digital pathology solutions to improve review times and access for cooperative groups like the Children's Oncology Group. Key applications discussed include the Virtual Imaging for Pathology, Education and Research (VIPER) system and efforts to integrate virtual microscopy with gene expression data through the Virtual Microscopy to Microarray (VM2M) project. Future goals include incorporating additional data types and developing more sophisticated analysis and search capabilities.
Accelerating the translation of medical research - 27 JuneInnovation Agency
Slides from the event focusing on translational research in Liverpool and North of England and why companies are establishing and growing operations in the region.
This document provides an overview of the field of bioinformatics. It defines bioinformatics as using computational techniques to solve biological problems by analyzing large amounts of biological data like DNA sequences, amino acid sequences, and more. It discusses the need for bioinformatics due to the exponential growth of biological data from sequencing projects. Some key applications of bioinformatics mentioned include data management, knowledge discovery, drug discovery, proteomics, personalized medicine, agriculture, and its use in systems biology.
Similar to Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomics in the NHS in Wales (20)
Are you looking for a long-lasting solution to your missing tooth?
Dental implants are the most common type of method for replacing the missing tooth. Unlike dentures or bridges, implants are surgically placed in the jawbone. In layman’s terms, a dental implant is similar to the natural root of the tooth. It offers a stable foundation for the artificial tooth giving it the look, feel, and function similar to the natural tooth.
Histololgy of Female Reproductive System.pptxAyeshaZaid1
Dive into an in-depth exploration of the histological structure of female reproductive system with this comprehensive lecture. Presented by Dr. Ayesha Irfan, Assistant Professor of Anatomy, this presentation covers the Gross anatomy and functional histology of the female reproductive organs. Ideal for students, educators, and anyone interested in medical science, this lecture provides clear explanations, detailed diagrams, and valuable insights into female reproductive system. Enhance your knowledge and understanding of this essential aspect of human biology.
Travel Clinic Cardiff: Health Advice for International TravelersNX Healthcare
Travel Clinic Cardiff offers comprehensive travel health services, including vaccinations, travel advice, and preventive care for international travelers. Our expert team ensures you are well-prepared and protected for your journey, providing personalized consultations tailored to your destination. Conveniently located in Cardiff, we help you travel with confidence and peace of mind. Visit us: www.nxhealthcare.co.uk
Summer is a time for fun in the sun, but the heat and humidity can also wreak havoc on your skin. From itchy rashes to unwanted pigmentation, several skin conditions become more prevalent during these warmer months.
Osteoporosis - Definition , Evaluation and Management .pdfJim Jacob Roy
Osteoporosis is an increasing cause of morbidity among the elderly.
In this document , a brief outline of osteoporosis is given , including the risk factors of osteoporosis fractures , the indications for testing bone mineral density and the management of osteoporosis
These lecture slides, by Dr Sidra Arshad, offer a simplified look into the mechanisms involved in the regulation of respiration:
Learning objectives:
1. Describe the organisation of respiratory center
2. Describe the nervous control of inspiration and respiratory rhythm
3. Describe the functions of the dorsal and respiratory groups of neurons
4. Describe the influences of the Pneumotaxic and Apneustic centers
5. Explain the role of Hering-Breur inflation reflex in regulation of inspiration
6. Explain the role of central chemoreceptors in regulation of respiration
7. Explain the role of peripheral chemoreceptors in regulation of respiration
8. Explain the regulation of respiration during exercise
9. Integrate the respiratory regulatory mechanisms
10. Describe the Cheyne-Stokes breathing
Study Resources:
1. Chapter 42, Guyton and Hall Textbook of Medical Physiology, 14th edition
2. Chapter 36, Ganong’s Review of Medical Physiology, 26th edition
3. Chapter 13, Human Physiology by Lauralee Sherwood, 9th edition
10 Benefits an EPCR Software should Bring to EMS Organizations Traumasoft LLC
The benefits of an ePCR solution should extend to the whole EMS organization, not just certain groups of people or certain departments. It should provide more than just a form for entering and a database for storing information. It should also include a workflow of how information is communicated, used and stored across the entire organization.
The skin is the largest organ and its health plays a vital role among the other sense organs. The skin concerns like acne breakout, psoriasis, or anything similar along the lines, finding a qualified and experienced dermatologist becomes paramount.
Nano-gold for Cancer Therapy chemistry investigatory projectSIVAVINAYAKPK
chemistry investigatory project
The development of nanogold-based cancer therapy could revolutionize oncology by providing a more targeted, less invasive treatment option. This project contributes to the growing body of research aimed at harnessing nanotechnology for medical applications, paving the way for future clinical trials and potential commercial applications.
Cancer remains one of the leading causes of death worldwide, prompting the need for innovative treatment methods. Nanotechnology offers promising new approaches, including the use of gold nanoparticles (nanogold) for targeted cancer therapy. Nanogold particles possess unique physical and chemical properties that make them suitable for drug delivery, imaging, and photothermal therapy.
Cell Therapy Expansion and Challenges in Autoimmune DiseaseHealth Advances
There is increasing confidence that cell therapies will soon play a role in the treatment of autoimmune disorders, but the extent of this impact remains to be seen. Early readouts on autologous CAR-Ts in lupus are encouraging, but manufacturing and cost limitations are likely to restrict access to highly refractory patients. Allogeneic CAR-Ts have the potential to broaden access to earlier lines of treatment due to their inherent cost benefits, however they will need to demonstrate comparable or improved efficacy to established modalities.
In addition to infrastructure and capacity constraints, CAR-Ts face a very different risk-benefit dynamic in autoimmune compared to oncology, highlighting the need for tolerable therapies with low adverse event risk. CAR-NK and Treg-based therapies are also being developed in certain autoimmune disorders and may demonstrate favorable safety profiles. Several novel non-cell therapies such as bispecific antibodies, nanobodies, and RNAi drugs, may also offer future alternative competitive solutions with variable value propositions.
Widespread adoption of cell therapies will not only require strong efficacy and safety data, but also adapted pricing and access strategies. At oncology-based price points, CAR-Ts are unlikely to achieve broad market access in autoimmune disorders, with eligible patient populations that are potentially orders of magnitude greater than the number of currently addressable cancer patients. Developers have made strides towards reducing cell therapy COGS while improving manufacturing efficiency, but payors will inevitably restrict access until more sustainable pricing is achieved.
Despite these headwinds, industry leaders and investors remain confident that cell therapies are poised to address significant unmet need in patients suffering from autoimmune disorders. However, the extent of this impact on the treatment landscape remains to be seen, as the industry rapidly approaches an inflection point.
Cell Therapy Expansion and Challenges in Autoimmune Disease
Beating Bugs with Big Data: Harnessing HPC to Realize the Potential of Genomics in the NHS in Wales
1. Beating Bugs with Big Data: Harnessing HPC
to Realize the Potential of Genomics in the
NHS in Wales
Dr Tom Connor
Bioinformatics Lead, Public Health Wales,
Reader, Cardiff University,
Group Leader, Quadram Institute Bioscience
Supported by
3. Acknowledgements
Other PHW Colleagues
Dr Matt Backx
Dr Catherine Moore
Trefor Morris
Michael Perry
Dr Harriet Hughes
Dr Noel Craine
Dr Simon Cottrell
Helen Adams
David Heyburn
Fatima Downing
Sue Edwards
Cardiff University
Dr Matt Bull
Dr Anna Price
The ARCCA team
CU IT Networking
CLIMB Collaborators
Professor Nick Loman
Simon Thompson
Radoslaw Poplawski
Simon Ellwood-Thompson
PHW Pathogen Genomics
Dr Sally Corden
Joanne Watkins
Lee Graham
Alec Birchley
Bree Wilcox
Jason Coombes
Lauren Gilbert
PHW Genomics
Workstreams (ARGENT,
DIGEST, WCM, HIV,
Influenza)
5. The NHS in Wales
• National Health Service, universal
healthcare, free at the point of use
• The NHS is a devolved area
• Public Health Wales is one of the 11
organisations that makes up the NHS in
Wales
• Has a wide remit including:
• Runs the microbiological diagnostic labs
for most of Wales’ hospitals and GPs
• Public health and surveillance
• PHW offers fully national services for the
whole 3.25 million population of Wales
6. Why does microbiology matter?
• Infectious disease accounts for ~7%
of all deaths in the UK and over
£30bn of economic costs per year
• Antibiotic resistance alone has a
projected cumulative global cost of
$100 Trillion by 2050
• We need better tools and systems to
prevent, diagnose, track and treat
infectious disease
7. Quick biology lesson; introducing microbial genomes
A bacterial cell
The bacterial chromosome for this
bacterial cell – its blueprint
That blueprint includes the plans for
proteins (“genes”) and other information
relating to how and when these need to
be made
This genetic information is the
organisms genome
8. So, what is Genomics?
Sequencing instruments read DNA, and
by extension, enable us to read the
genome of an organism of interest
However, what really defines the field is
the use of sequencing instruments
Put simply, genomics is the branch of
biology that is concerned with studying
genomes
From that we can make predictions
about the organism of interest, or
compare it with others
9. That means genomics has a lot of potential in healthcare
Faster results
More clinically
actionable information
per test
Clearer, better
diagnostics
- Genomics enables medicine to become more personalized
Simultaneously
- Genomics also gives us unprecedented tools to track and
combat outbreaks/epidemics on a population level
Data is digital and has many
uses
10. Why now? The reduction in the raw cost of sequencing the
human genome
20192003
* Humans are boring. For the
same money we can sequence
~50 bacterial genomes
11. Genomics Partnership Wales
• In July 2017 the Welsh Government launched the
Genomics for Precision Medicine Strategy, with over
£10M spent so far
• Covers both human and pathogen genomics
• PHW is leading the Pathogen Genomics elements,
with a Pathogen Genomics Unit launched in 2018
• Current development areas
– AMR bacterial surveillance and characterisation
– Cystic Fibrosis polymicrobial infection
diagnostics
– Enterovirus surveillance
• Production systems
– C. difficile surveillance and outbreak support
– Mycobacteria identification and characterisation
– Influenza surveillance
– HIV susceptibility testing
Accreditation in process
System design/needs assessment in
progress
Pilot system development
12. The (clinical) sequencing process has 5 main elements
1. Sample to Sequencer
2. Sequencing
3. Reads to reports
4. Interpretation
5. Fixing it when it breaks
Labwork Data science/Bioinformatics
13. However, the major costs and difficulties do not lie with
the generation of data, they lie with how we share, store
and analyse the data we generate
Bioinformatics expertise
User accessibility of software/hardware
Appropriate compute capacity
Software development
Storage availability
Network capacity
Sequencing is now relatively cheap and easy ; we can
sequence large numbers of strains for modest amounts
of money
These
account
for up to
90%
of the
costs of
doing
genomics
work
Our problem: The sequencing iceberg
15. 320 samples
Approx 6-
700GB
uncompressed
data Sequence
Assembly
Each job 4-8GB
RAM
1 CPU core
Each job generates
intermediate files of
~6GB
Runtime: 1+
hours/job
Sequence
mapping 320 jobs
Each job 4GB RAM
1 CPU core
Each job generates
intermediate files of
~3GB
Runtime: 1
hour/job
Phylogenomics
1 job, 1+ cores, up
to 128GB RAM
Intermediate file
size ~2+GB
Output file ~2GB
Runtime 1-2 days
Virulence and
antimicrobial
resistance
screening
320 jobs, single
core
100MB ram
Runtime: 5
mins/job
Generates 10-20
small files per job
Bayesian
modelling
3 jobs, 1 core+, up
to 1 GB RAM
CPU intensive
Runtime: 2 days
per job
Output file ~10GB
Can use GPUs
Written in Java
Larger RAM HTC HTC
HPC Possibly HPC
Not so pretty workload
17. Our challenges
• We need to be able to rapidly process patient samples
• We need to be able to deal with large volumes of
complex data
• Need to be able to compare new samples with old
• Need to have locked down pipelines that don’t change
and can work anywhere
• Need to be able to deal with very mixed workloads
• Need the ability to scale our analyses
• Need to get around issues with physical infrastructure
(e.g. hospital networks)
• Need to be able to build upon what researchers have
done, but in a more enterprise way
• Need to do this in an environment (the NHS) which only
understands Windows
BIOINFORMATICS
18. MRC CLIMB
• 4 sites (1 in Cardiff)
each with ~1920 vCPU
cores including 3x 3TB
RAM nodes
• 1.7PB Ceph/site and
500TB of GPFS
• Runs OpenStack
• Also enables data
sharing with researchers
PH CLIMB
• 1280 vCPU cores
• 1PB Object and 80TB SSD
block/posix storage running
Ceph
• Runs OpenStack and
Kubernetes
In-lab cluster
• 160 CPU core cluster
• ~300TB of medium tier
NFS storage
• 24TB of SSD storage
running Ceph
Data, Software
19. Enabling personalised medicine - HIV
• HIV is an RNA virus that attacks the immune
system, eventually if untreated, causing AIDS
• HIV spread widely in the 1980’s and attracted
a lot of hysteria
• Today, HIV can be controlled through
medication
• The medication that a patient takes targets key
proteins encoded by the virus
• For an individual patient, ensuring that they
are on the right drug is critical to ensure that
their HIV is controlled
• The mutations that create resistance to drugs
are encoded in the HIV virus genome
20. HIV resistance typing using genomcs
• Our system enables a fully
automated process
• After DNA has been loaded the
next time the lab team sees
anything, it is the reports and QC
data
• Has underpinned an improvement
in turnaround time from average
2 weeks to an average of 6 days
• Has halved the cost per sample
• Has added in integrase testing at
no extra cost, increasing the
amount of clinically actionable
information provided
• Used for all HIV patients in Wales
21. Fighting outbreaks in hospitals – Clostridioides difficile
• C. difficile is a key pathogen that
causes disease in hospitals
• Sometimes described as a
‘superbug’, it is often associated
with antibiotic use
• ~220,000 cases and almost
13,000 deaths in the US in 2017
from C. difficile
• It spreads easily, and can survive
in the vacuum of space – so is
hard to get rid of in hospitals
• Until now, tools for tracking have
been low-resolution, so don’t
allow for effective infection
control
22. Outbreak tracking and data integration with C. difficile
• Since 2017 we have been working to produce a service for C. difficile outbreak support
• Now in full parallel running
• Clustering genome sequences and linking this to other data enables patient-level
examination of causes and prevention measures
23. Tracking pathogens across healthcare systems
We can move beyond single patients to
look at clusters of disease
In North Wales, for example, 11/18
clusters crossed hospitals; the reason
for these large clusters is under
investigation
Explains why reducing C. difficile rates
is so hard – normally we only look
within hospitals
Genomics gives us a nationwide view
24. National and international surveillance of Influenza
• Not much we can do in terms of treatment – the key
for influenza is prevention
• Prevention is achieved through vaccination, so we
want to
– Get the vaccine right
– Undertake campaigns to promote vaccination as requited
Proposal for a pilot study to Welsh government in October 2017
• Real time surveillance helps with both of these
• We were asked to build a service for Influenza
sequencing and surveillance in October 2017
• Went from a standing start to production clinical
service in less than 12 months with a team of 2
• Now have one of the fastest turnaround times in the
world for routine Influenza surveillance
• That was all underpinned by a system that has been
engineered for automation and reproducibility from
the ground up
25. Global collaboration for improved global health
• Lots of our work to date has been about
meeting local needs
• Two big things coming internationally that we
will be feeding in to
– First is a new system (called SP3) which will
provide a cloud agnostic unified pathogen
pipeline platform for use in clinical settings
• Will provide a system for the global community,
combining enterprise-grade software, documented
best practice and validation/verification datasets
– Second is the Public Health Alliance for Genomic
Epidemiology
• A new initiative to build standards and best practice
around microbial bioinformatics in public health
• These are larger open projects with significant
buy-in from national and international
organisations that will help spread the adoption
of genomics in public health
26. Summary
• Building clinical genomics services pose big
challenges in terms of infrastructure
• The system we have built, that integrates
HPC and on premise cloud has enabled us to
– Build world-leading clinical genomics services
– Analyse over 8,000 sequenced patient samples
in the last 12 months
– Support more than 10 distinct analysis
pipelines
– Track outbreaks across multiple species
– Provide better, faster diagnostics for multiple
patient groups in Wales
• The work we have done is now feeding into
international efforts to define and build upon
best practice in this area, in which
HPC/Infrastructure is a key component
0 SNPs
1 SNP Source
Cluster
of
Patient
Samples
27. Acknowledgements
Other PHW Colleagues
Dr Matt Backx
Dr Catherine Moore
Trefor Morris
Michael Perry
Dr Harriet Hughes
Dr Noel Craine
Dr Simon Cottrell
Helen Adams
David Heyburn
Fatima Downing
Sue Edwards
Cardiff University
Dr Matt Bull
Dr Anna Price
The ARCCA team
CU IT Networking
CLIMB Collaborators
Professor Nick Loman
Simon Thompson
Radoslaw Poplawski
Simon Ellwood-Thompson
PHW Pathogen Genomics
Dr Sally Corden
Joanne Watkins
Lee Graham
Alec Birchley
Bree Wilcox
Jason Coombes
Lauren Gilbert
PHW Genomics
Workstreams (ARGENT,
DIGEST, WCM, HIV,
Influenza)