This document summarizes web-based tools for integrative analysis of pancreatic cancer data. It discusses the move from monolithic cancer analysis portals to modular apps. Specific apps developed at the Wolfson Wohl Cancer Research Centre are described, including pathway analysis, gene variants, and survival apps. The pathway analysis app allows searching pathways, viewing heatmaps of patients vs. genes, and overlaying expression data on pathway diagrams. The gene variants app allows filtering and visualizing variants. The survival app generates Kaplan-Meier curves. Future directions include incorporating more data and making the apps available on a shared network site.
Web-based Tools for Integrative Analysis of Pancreatic Cancer Data
1. Web-based Tools for Integrative Analysis of Pancreatic Cancer Data
Derek Wright
Wolfson Wohl Cancer Research Centre
Visualisation in Science Conference
5th April 2017
2.
3. Cancer Analysis Apps
• Monolithic vs Modular
• Architecture moving to connected apps/microservices
• Portals
• ICGC Data Portal
• cBioPortal
• generic, many capabilities, many classes of user
• Apps - our approach
• bespoke, use case driven, needs of user group
4. • Rapid development of interactive web applications in R
• Incorporate existing R analysis scripts
• No need to create separate web server/front-end layers
• Extend using custom JavaScript/CSS/HTML
• Powerful data visualisation: ggplot2, ggvis
• Database access with dplyr
• Hosting and deployment locally, on own servers, on cloud
• Free, open source (RStudio's shinyapps.io service is paid)
17. 3rd BiVi Annual Meeting (2017)
http://bivi.co
20th-21st April
Edinburgh Napier University
Craiglockhart Campus
Editor's Notes
I’m a bioinformatician in Andrew Biankin’s group at the Wolfson Wohl Cancer Research Centre.
I’m here to talk to you today about interactive web applications we have been developing for cancer bioinformatics.
University of Glasgow is at the forefront of research into precision medicine. Our project forms the pancreatic cancer stream, known as Precision-Panc. This is a project involving academia, NHS and the private sector.
Pancreatic cancer has seen little improvement in mortality rates over the years and we hope to make inroads into this.
Our project was in the news recently as we received funding from Cancer Research UK.
Patients will be recruited on to the project, and tumour sequencing data and clinical data will be collected and analysed. We intend to produce a molecular profile detailing the mutational landscape of the individual patient, guiding clinical trial options for the consultant to offer to the patient.
The traditional approach for developing cancer web applications has been to build large and complicated portals with multiple functions, such as cBioPortal and the ICGC Data Portal. In software development generally, there is a move towards smaller applications or services for more specific use cases.
I’m aware that today’s audience is multidisciplinary, with researchers, medics and people from the creative industries, so I would like to talk a bit about our approach to app development, using the Shiny framework, which many of you may find useful in the data visualisation arena.
R is a statistical programming language that is popular in bioinformatics. It is free and open source, with many packages (or libraries) available for statistics, big data and bioinformatics. There are excellent packages available for data visualisation - in particular ggplot2 for static visualisations and the newer ggvis which produces interactive plots. There are also packages for geographical mapping which are useful for public health studies.
A bioinformatician typically has a toolbox of analysis scripts that they run each time a wet lab scientist wants data analysed. Shiny is a server environment that allows R scripts to be turned into interactive web applications, promoting code reuse and empowering users to perform their own analyses.
Shiny enables the entire application to be written in R. Typically a web application will have layers of HTML and JavaScript on the front end, a software framework written in PHP, Java, Ruby etc. and a database access layer to translate between software objects and SQL. I have worked with these kinds of systems in my previous life as a software engineer and R/Shiny is an absolute breeze in comparison.
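To make this concrete, here is a minimal sketch of a complete Shiny app; the expression table and bar plot are invented placeholders standing in for a real analysis script, not one of the apps described in this talk:

    library(shiny)
    library(ggplot2)

    # Placeholder data standing in for an analysis script's output
    expr <- data.frame(
      gene = rep(c("KRAS", "TP53", "SMAD4"), each = 20),
      sample = rep(paste0("S", 1:20), times = 3),
      expression = rnorm(60)
    )

    ui <- fluidPage(
      selectInput("gene", "Gene of interest", choices = unique(expr$gene)),
      plotOutput("exprPlot")
    )

    server <- function(input, output) {
      output$exprPlot <- renderPlot({
        # Re-runs automatically whenever the selected gene changes
        ggplot(subset(expr, gene == input$gene),
               aes(x = sample, y = expression)) +
          geom_col() +
          labs(title = input$gene)
      })
    }

    shinyApp(ui, server)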
HTML is generated from R code, which creates page layouts using the popular Bootstrap framework. Pages are responsive and will work readily on mobile devices. However, you don’t have to go with the Shiny look and feel and it is possible to customise the presentation layer with your own front-end HTML code, style sheets and JavaScript visualisation frameworks such as D3.
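As a small illustration of mixing custom markup into a Shiny UI (the d3.min.js file name and CSS class here are hypothetical; a real app would serve the script from its www/ directory):

    library(shiny)

    ui <- fluidPage(
      tags$head(tags$script(src = "d3.min.js")),   # hypothetical local copy of D3
      tags$div(class = "my-custom-panel",          # styled by your own CSS
               h2("Custom markup"),
               HTML("<em>raw HTML can be mixed in too</em>"))
    )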
Plots are generated dynamically according to the inputs that the user selects.
The basic data structure in R is tabular, and the excellent dplyr package allows you to treat R data tables like a database, performing selects, joins, group bys and so on. It is also possible to visualise data stored in a relational database: dplyr can be linked to tables in a database such as Postgres, MySQL or SQLite. You manipulate R's tabular data structures, and dplyr constructs the SQL automatically and executes the query, bringing the results back into R. Database queries may thus be performed transparently, without the need to write SQL.
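A minimal sketch of this pattern, assuming a hypothetical SQLite file and variants table (the SQL translation is provided by dplyr's dbplyr backend):

    library(dplyr)
    library(dbplyr)
    library(DBI)

    con <- DBI::dbConnect(RSQLite::SQLite(), "variants.db")  # hypothetical file
    variants <- tbl(con, "variants")

    # Ordinary dplyr verbs; translated to SQL and executed inside SQLite
    counts <- variants %>%
      filter(gene %in% c("KRAS", "TP53")) %>%
      group_by(gene, variant_type) %>%
      summarise(n = n()) %>%
      collect()  # pulls the result back into R as a data frame

    DBI::dbDisconnect(con)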
We have developed 3 initial apps as part of this workflow to perform genomic analysis. I will talk about each one in more detail.
Our Pathway app is based on RNA-Seq data from tumour cell lines.
You can visualise the activity of your gene of interest in the context of the other genes it interacts with in a pathway. We overlay gene expression activity onto pathway diagrams retrieved from the KEGG biological pathway database using a web service. We also draw heat maps showing up- or down-regulated gene expression in a pathway, using the gene set variation analysis (GSVA) ranking algorithm. Cell lines with similar patterns of activity cluster together.
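As a hedged sketch, both visualisations could be built from the Bioconductor packages pathview and GSVA; these are plausible building blocks rather than the app's confirmed implementation, and the expression matrix and gene sets below are invented:

    library(pathview)
    library(GSVA)
    library(pheatmap)

    # genes x cell-lines matrix of log expression (placeholder values;
    # pathview expects rownames to be KEGG-compatible IDs, Entrez by default)
    expr <- matrix(rnorm(200), nrow = 20,
                   dimnames = list(paste0("gene", 1:20), paste0("cl", 1:10)))

    # Colour one sample's values onto a KEGG diagram fetched over the web
    # (hsa04151 is the PI3K-Akt signalling pathway)
    pathview(gene.data = expr[, 1], pathway.id = "hsa04151", species = "hsa")

    # Per-pathway enrichment score per cell line (hypothetical gene sets;
    # recent GSVA releases use gsva(gsvaParam(expr, gene_sets)) instead)
    gene_sets <- list(pathwayA = paste0("gene", 1:8),
                      pathwayB = paste0("gene", 9:20))
    scores <- gsva(expr, gene_sets)

    # pheatmap clusters rows and columns by default, so cell lines with
    # similar pathway activity group together
    pheatmap(scores)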
Our group have identified key pathways and genes involved in pancreatic cancer, detailed by my colleague Peter Bailey in his Nature paper and shown in this figure from the paper.
Four molecular subtypes of pancreatic cancer, derived from clustering analysis of RNA-Seq, were identified in the paper and are shown in these figures.
These key pathways and molecular subtypes have been incorporated into our apps.
The Gene Variants app allows browsing and visualisation of single nucleotide, structural and copy number variants in the APGI patients, from tumour or cell line datasets. I have added filtering options, such as the key genes and pathways identified in Peter's Nature paper, and a copy number slider.
Underlying data is stored in a SQLite database.
The Gene Variants app provides an interactive Circos plot showing an overview of the variants for an individual patient.
Circos is a popular visualisation technique where a network is displayed with a circular layout. It is often used to show structural variation between chromosomes in genomics with translocations and so on shown as arcs. Additional features may be added as concentric layers.
A patient's variants may be visualised in an interactive Circos plot. The outer track shows copy number variants as a line plot; clicking a chromosome expands it to reveal individual copy number gains and losses as green or red ticks. Single nucleotide variants are shown in the scatter plot, and structural variants as arcs in the centre.
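The app's plot is interactive, but the same track types can be sketched statically with the circlize R package (one plausible choice; the coordinates below are invented):

    library(circlize)

    circos.initializeWithIdeogram(species = "hg19")  # ideogram outer track

    # Copy number as a line plot in a genomic track (bed-style data frame)
    cn <- data.frame(chr = "chr1",
                     start = seq(1e6, 200e6, by = 1e6),
                     end = seq(1e6, 200e6, by = 1e6) + 5e5,
                     value = rnorm(200))
    circos.genomicTrack(cn, panel.fun = function(region, value, ...) {
      circos.genomicLines(region, value, ...)
    })

    # A structural variant drawn as an arc between two regions
    sv1 <- data.frame(chr = "chr1", start = 1e7, end = 1.1e7)
    sv2 <- data.frame(chr = "chr8", start = 5e7, end = 5.1e7)
    circos.genomicLink(sv1, sv2)

    circos.clear()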
Our Survival app draws gene-centric Kaplan-Meier survival curves. The app lets you look at the association of expression of an individual gene with patient survival. Patients are subdivided into the 4 cancer subtypes, identified in our studies, and differential gene expression between these groups is visualised as a boxplot.
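A minimal sketch of a gene-centric Kaplan-Meier curve with the survival and survminer packages, splitting simulated patients at median expression; the median split is a common convention used here for illustration, whereas the app groups patients by the four subtypes:

    library(survival)
    library(survminer)

    df <- data.frame(
      time   = rexp(100, rate = 0.01),  # follow-up time (simulated)
      status = rbinom(100, 1, 0.7),     # 1 = event observed, 0 = censored
      expr   = rnorm(100)               # expression of the gene of interest
    )
    df$group <- ifelse(df$expr > median(df$expr), "high", "low")

    fit <- survfit(Surv(time, status) ~ group, data = df)
    ggsurvplot(fit, data = df, pval = TRUE)  # KM curves with log-rank p-value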
These 3 apps from the project have been made available for users on the University of Glasgow network. We are currently not permitted to host the data on a public web server due to data sharing agreements. We hope to make these more widely available in future, possibly publishing a more generic toolkit.
Future directions – we will incorporate new data generated by PRECISION-Panc.
I am also currently analysing the PCAWG dataset. This is whole genome sequencing data from about 3000 cancer patients from various cancer projects around the world and I would like to incorporate some of this data into the apps.
Our project website precisionpanc.org is where you can learn more about the project and pancreatic cancer in general and we have associated social media accounts you can follow.
Very briefly before I finish, I would like to draw your attention to another data visualisation conference taking place later this month in Edinburgh. It's a great meeting with some excellent speakers and tutorials. Registration fees are very reasonable at £50 and registration closes on 10th April.