SlideShare a Scribd company logo
Mining complex relationships
• Data mining heterogeneous sources for many to
many relationships.
• CCB (Center for Cancer Biology)
– Regulatory relationships between microRNAs,
transcription factors, and genes.
– Data sources:
• DNA sequences
• Gene expression data
• Multiple labs
• Domain knowledge.
• One ARC project
• Causal inference
• Discovery of group-group relationships
Heterogeneous data
Inferring miRNA-mRNA regulatory relationships
Gene regulatory relationships
Causal inference based approaches
Why interested in causal relationships?
• Gene regulatory relationships are causal by nature
• Most existing work identifies only statistical associations/correlations
Gene C
Gene A Gene B
What’s the catch?
• Gold standard of causal discovery is controlled random trials
• RCTs are expensive and not always possible
• We want to discover causal relationships from observational data
Causal inference– Do calculus
Judea Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, 2000.
X1 X2 … Xn-1 Xn
5.2 7.5 6.5 5.2
5.6 7.2 6.6 5.3
… … … … …
5.4 7.1 7.1 5.7
5.7 6.9 6.9 5.8
+1
+0.8
Methods
– IDA
– Maathuis, H. M.,
Colombo, D., Kalisch, M.,
and Buhlmann, P. (2010).
Predicting causal effects
in large-scale systems
from observational data.
Nature Methods, 7(4),
247–249.
5
6
Causal inference based approaches
• We also applied the causal inference method to
detect condition specific regulatory relationships
• The steps:
˗ Split samples into to two parts according to conditions
(cancer or normal)
˗ Detect causal regulatory relationships in each condition
˗ A relationship (miR_i, mR_j) detected in condition 1 but
not in condition 2 is specific to condition 1, and miR_i is
an active microRNA in condition 1
Causal inference based approaches
Knowledge + Data Mining
Idea from information retrieval
• Correspondence Latent Dirichlet Allocation
(Corr-LDA)
– Automatic annotations of images (Blei et al.
2004)
images
words
miRNAs
mRNAs
Model migration
11
FMRMs DependencyTopics
FMRMs
Generative process
12
• Each miRNA or mRNA is drawn from one of the
modules;
• Each sample is a random mixture of miRNAs and
mRNAs expressed in different modules;
• Samples may associate with multiple functional
modules;
Results
13
FMRM# c x Mouse model class Tumor subtype p-value
3 10 3 C3TAg Basal 0.0081
4 8 3 MMTV_Wnt Luminal 0.004
5 10 3 Hras Luminal 0.0081
6 14 3 p53 Basal 0.0222
11 10 3 C3TAg Basal 0.0081
13 14 3 p53 Basal 0.0222
19 10 3 BRCA_p53 Basal 0.0081
Causal inference based approaches
Causal inference based approaches

More Related Content

What's hot

Structural genomics
Structural genomicsStructural genomics
Structural genomics
Vaibhav Maurya
 
Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...
SOYEON KIM
 
Systems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systemsSystems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systems
Lars Juhl Jensen
 
Introduction to systems biology
Introduction to systems biologyIntroduction to systems biology
Introduction to systems biology
lemberger
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
prashant tripathi
 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a survey
SOYEON KIM
 
Report on System Biology Funding from BMBF
Report on System Biology Funding from BMBFReport on System Biology Funding from BMBF
Report on System Biology Funding from BMBF
EuroBioForum
 
NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina Kutmon
Alexander Pico
 
20042016_pizzaclub_part2
20042016_pizzaclub_part220042016_pizzaclub_part2
20042016_pizzaclub_part2
RSG Luxembourg
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to Cancer
Raunak Shrestha
 
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic MethodsAnalytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
cscpconf
 
DREAM Challenge
DREAM ChallengeDREAM Challenge
DREAM Challenge
Tulip Nandu
 
Microarray
MicroarrayMicroarray
Microarray
Sunil Bhandari
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
Daniela Rotariu
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
KarthigaRavichandran3
 
IRJET- Disease Identification using Proteins Values and Regulatory Modules
IRJET-  	  Disease Identification using Proteins Values and Regulatory  ModulesIRJET-  	  Disease Identification using Proteins Values and Regulatory  Modules
IRJET- Disease Identification using Proteins Values and Regulatory Modules
IRJET Journal
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
KarthigaRavichandran3
 
STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...
Lars Juhl Jensen
 
From systems biology
From systems biologyFrom systems biology
From systems biology
brnbarcelona
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems Biology
Mrinal Vashisth
 

What's hot (20)

Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...
 
Systems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systemsSystems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systems
 
Introduction to systems biology
Introduction to systems biologyIntroduction to systems biology
Introduction to systems biology
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a survey
 
Report on System Biology Funding from BMBF
Report on System Biology Funding from BMBFReport on System Biology Funding from BMBF
Report on System Biology Funding from BMBF
 
NetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina KutmonNetBioSIG2013-Talk Martina Kutmon
NetBioSIG2013-Talk Martina Kutmon
 
20042016_pizzaclub_part2
20042016_pizzaclub_part220042016_pizzaclub_part2
20042016_pizzaclub_part2
 
Systems Biology Approaches to Cancer
Systems Biology Approaches to CancerSystems Biology Approaches to Cancer
Systems Biology Approaches to Cancer
 
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic MethodsAnalytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
 
DREAM Challenge
DREAM ChallengeDREAM Challenge
DREAM Challenge
 
Microarray
MicroarrayMicroarray
Microarray
 
Project report-on-bio-informatics
Project report-on-bio-informaticsProject report-on-bio-informatics
Project report-on-bio-informatics
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
 
IRJET- Disease Identification using Proteins Values and Regulatory Modules
IRJET-  	  Disease Identification using Proteins Values and Regulatory  ModulesIRJET-  	  Disease Identification using Proteins Values and Regulatory  Modules
IRJET- Disease Identification using Proteins Values and Regulatory Modules
 
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
Genetic disease identification and medical diagnosis using MF, CC, BF, MicroR...
 
STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...STRING - Modeling of pathways through cross-species integration of large-scal...
STRING - Modeling of pathways through cross-species integration of large-scal...
 
From systems biology
From systems biologyFrom systems biology
From systems biology
 
Introduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems BiologyIntroduction to Systemics with focus on Systems Biology
Introduction to Systemics with focus on Systems Biology
 

Similar to Bioinformatics

Slides
SlidesSlides
Qi liu 08.08.2014
Qi liu 08.08.2014Qi liu 08.08.2014
Qi liu 08.08.2014
Hyun Wong Choi
 
Experimental methods and the big data sets
Experimental methods and the big data sets Experimental methods and the big data sets
Experimental methods and the big data sets
improvemed
 
overview on Next generation sequencing in breast csncer
overview on Next generation sequencing in breast csnceroverview on Next generation sequencing in breast csncer
overview on Next generation sequencing in breast csncer
Seham Al-Shehri
 
Toxicogenomics: microarray
Toxicogenomics: microarrayToxicogenomics: microarray
Toxicogenomics: microarray
Eden D'souza
 
miRNA Breast Cancer Prognosis -- Ingenuity Systems
miRNA Breast Cancer Prognosis -- Ingenuity SystemsmiRNA Breast Cancer Prognosis -- Ingenuity Systems
miRNA Breast Cancer Prognosis -- Ingenuity Systems
Natalie Ng
 
Vanderbilt b
Vanderbilt bVanderbilt b
Vanderbilt b
Claudine Garcia
 
Assays for protein dna interactions
Assays for protein dna interactionsAssays for protein dna interactions
Assays for protein dna interactions
oikawa
 
Microarray CGH
Microarray CGHMicroarray CGH
Microarray CGH
Pinal Chaudhari
 
Genomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptxGenomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptx
AishwaryaTeli5
 
presentation
presentationpresentation
presentation
Peter Langfelder
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
nadeem akhter
 
The Role of Statistician in Personalized Medicine: An Overview of Statistical...
The Role of Statistician in Personalized Medicine: An Overview of Statistical...The Role of Statistician in Personalized Medicine: An Overview of Statistical...
The Role of Statistician in Personalized Medicine: An Overview of Statistical...
Setia Pramana
 
High throughput Data Analysis
High throughput Data AnalysisHigh throughput Data Analysis
High throughput Data Analysis
Setia Pramana
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
Pawan Kumar
 
Predicting phenotype from genotype with machine learning
Predicting phenotype from genotype with machine learningPredicting phenotype from genotype with machine learning
Predicting phenotype from genotype with machine learning
Patricia Francis-Lyon
 
KnetMiner - EBI Workshop 2017
KnetMiner - EBI Workshop 2017KnetMiner - EBI Workshop 2017
KnetMiner - EBI Workshop 2017
Keywan Hassani-Pak
 
Genomics seminar copy
Genomics seminar   copyGenomics seminar   copy
Genomics seminar copy
manjunatha s s
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
University of Allahabad
 
MICROARRAY.pptx
MICROARRAY.pptxMICROARRAY.pptx
MICROARRAY.pptx
ANJUHARIDAS5
 

Similar to Bioinformatics (20)

Slides
SlidesSlides
Slides
 
Qi liu 08.08.2014
Qi liu 08.08.2014Qi liu 08.08.2014
Qi liu 08.08.2014
 
Experimental methods and the big data sets
Experimental methods and the big data sets Experimental methods and the big data sets
Experimental methods and the big data sets
 
overview on Next generation sequencing in breast csncer
overview on Next generation sequencing in breast csnceroverview on Next generation sequencing in breast csncer
overview on Next generation sequencing in breast csncer
 
Toxicogenomics: microarray
Toxicogenomics: microarrayToxicogenomics: microarray
Toxicogenomics: microarray
 
miRNA Breast Cancer Prognosis -- Ingenuity Systems
miRNA Breast Cancer Prognosis -- Ingenuity SystemsmiRNA Breast Cancer Prognosis -- Ingenuity Systems
miRNA Breast Cancer Prognosis -- Ingenuity Systems
 
Vanderbilt b
Vanderbilt bVanderbilt b
Vanderbilt b
 
Assays for protein dna interactions
Assays for protein dna interactionsAssays for protein dna interactions
Assays for protein dna interactions
 
Microarray CGH
Microarray CGHMicroarray CGH
Microarray CGH
 
Genomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptxGenomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptx
 
presentation
presentationpresentation
presentation
 
bioinformatics simple
bioinformatics simple bioinformatics simple
bioinformatics simple
 
The Role of Statistician in Personalized Medicine: An Overview of Statistical...
The Role of Statistician in Personalized Medicine: An Overview of Statistical...The Role of Statistician in Personalized Medicine: An Overview of Statistical...
The Role of Statistician in Personalized Medicine: An Overview of Statistical...
 
High throughput Data Analysis
High throughput Data AnalysisHigh throughput Data Analysis
High throughput Data Analysis
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Predicting phenotype from genotype with machine learning
Predicting phenotype from genotype with machine learningPredicting phenotype from genotype with machine learning
Predicting phenotype from genotype with machine learning
 
KnetMiner - EBI Workshop 2017
KnetMiner - EBI Workshop 2017KnetMiner - EBI Workshop 2017
KnetMiner - EBI Workshop 2017
 
Genomics seminar copy
Genomics seminar   copyGenomics seminar   copy
Genomics seminar copy
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
 
MICROARRAY.pptx
MICROARRAY.pptxMICROARRAY.pptx
MICROARRAY.pptx
 

Recently uploaded

DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 

Recently uploaded (20)

DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 

Bioinformatics

  • 1. Mining complex relationships • Data mining heterogeneous sources for many to many relationships. • CCB (Center for Cancer Biology) – Regulatory relationships between microRNAs, transcription factors, and genes. – Data sources: • DNA sequences • Gene expression data • Multiple labs • Domain knowledge. • One ARC project
  • 2. • Causal inference • Discovery of group-group relationships Heterogeneous data Inferring miRNA-mRNA regulatory relationships Gene regulatory relationships
  • 3. Causal inference based approaches Why interested in causal relationships? • Gene regulatory relationships are causal by nature • Most existing work identifies only statistical associations/correlations Gene C Gene A Gene B What’s the catch? • Gold standard of causal discovery is controlled random trials • RCTs are expensive and not always possible • We want to discover causal relationships from observational data
  • 4. Causal inference– Do calculus Judea Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, 2000. X1 X2 … Xn-1 Xn 5.2 7.5 6.5 5.2 5.6 7.2 6.6 5.3 … … … … … 5.4 7.1 7.1 5.7 5.7 6.9 6.9 5.8 +1 +0.8
  • 5. Methods – IDA – Maathuis, H. M., Colombo, D., Kalisch, M., and Buhlmann, P. (2010). Predicting causal effects in large-scale systems from observational data. Nature Methods, 7(4), 247–249. 5
  • 6. 6
  • 7. Causal inference based approaches • We also applied the causal inference method to detect condition specific regulatory relationships • The steps: ˗ Split samples into to two parts according to conditions (cancer or normal) ˗ Detect causal regulatory relationships in each condition ˗ A relationship (miR_i, mR_j) detected in condition 1 but not in condition 2 is specific to condition 1, and miR_i is an active microRNA in condition 1
  • 10. Idea from information retrieval • Correspondence Latent Dirichlet Allocation (Corr-LDA) – Automatic annotations of images (Blei et al. 2004)
  • 12. Generative process 12 • Each miRNA or mRNA is drawn from one of the modules; • Each sample is a random mixture of miRNAs and mRNAs expressed in different modules; • Samples may associate with multiple functional modules;
  • 13. Results 13 FMRM# c x Mouse model class Tumor subtype p-value 3 10 3 C3TAg Basal 0.0081 4 8 3 MMTV_Wnt Luminal 0.004 5 10 3 Hras Luminal 0.0081 6 14 3 p53 Basal 0.0222 11 10 3 C3TAg Basal 0.0081 13 14 3 p53 Basal 0.0222 19 10 3 BRCA_p53 Basal 0.0081

Editor's Notes

  1. X causes Y iff there is some manipulation of X leading to a change in the probability distribution of Y. (Judea Pearl, 2000; Neapolitan, 2003)
  2. Completed Partially Directed Acyclic Graph
  3. Correspond -> correspondence May say: what’s given (input), what’s to be obtained (output), how (rough idea) – may use a diagram ?
  4. May be swap the three dot points around so the flow is like: Assume that functional modules exist -> then each sample is obtained by drawing miRNAs and mRNAs from the modules (so a sample is a random mixture ...) Not quite sure where to put the first dot point
  5. Assigning biological conditions to FMRMs. The y-axis on the right side of the figure denotes sample names, mouse model types, and breast cancer subtypes in three columns. Using the parameter , the likelihood that a particular sample is associated with a specific module, the top 5% samples associated with each module are displayed using the grey scale. These samples are considered to map modules to biological conditions. Samples may occur more than once in the y-axis because some samples are significantly associated with more than one module. Some modules, such as module-11, have only rather low probability of association with samples, and thus have nearly white shading even for their top 5 samples. Significant mapping of FMRMs to conditions is highlighted.