SlideShare a Scribd company logo
1 of 33
Download to read offline
The Language Diversity
of Computing
Or, how to talk with a computer.
Jeremy Yang
(Mgr., Systems & Programming)
Translational Informatics Div.
Dept. of Internal Medicine
University of New Mexico
BioMISS -- Thursday, Oct 15, 2015 1
Language Diversity Examples
Python Perl Fortran C R
C++ Java Basic SQL Sparql
XML XSD XPath URLs bash
HTML HTTP ASCII UTF-8 regex
Scala ICD-10 Ruby OWL RDF
2
A Working Definition of “Language”
● Coherent symbology (symbolic system)
3
Languages: Some major advances
COBOL
(1960) Sparql
(2008)
Java
(1995)
4
1950
FORTRAN
(1953)
1960 1970 1980 1990 2000 2010
SQL
(1979)
C
(1969)
C++
(1979)
Perl
(1987)
Python
(1989)
HTML
(1990)
XML
(1997)
RDF
(1999)
Language merit vs. elitism
5
Why do we care about languages?
● Compatibility
● Efficiency
● Usability
● Knowledge
representation
● Intelligence
● Evolution
Naturellement!
6
7
℅ Prof Harald Sack,
Hasso Plattner
Institute, U.
Potsdam, MOOC:
“Semantic Web
Technologies”
Programming paradigms
Object Oriented
● classes
● instances
● methods
● ~ nouns
8
Functional
● functions
● routines
● parameters
● ~ verbs
Programming paradigms are language paradigms.
9
Object Oriented
Example:
CDK = Chemistry
Development Kit
Open source Java
package & API
Computers have
“evolved” from
numerical calculators to
knowledge processors.
Knowledge
representation and
processing via
language!
10
Italian Music Terms
Choice of language should be guided by the domain.
Q: So what is the problem?
A: Language gaps
CODE
JARGON
MEANING
“Interpretation”
MATH
11
Q: So what is the problem?
A: Standards (so many!)
“Why can’t my iPhone talk to my ...”
● TV
● Audio system
● Car
● Medical records
12
Q: So what is the problem?
A: Language shapes, empowers,
limits thought. (Sapir-Whorf
Hypothesis, aka Linguistic
Relativity)
13
Q: So what is the problem?
A: Abstraction
● Overgeneralizing
● Reality is concrete!
● But: abstraction organizes knowledge
● (a feature, not a bug!)
14
“We think in generalities, but we live in detail.” -- Alfred North Whitehead
15
Abstraction:
Shakespeare quotes
“Full of sound and fury, signifying nothing.”
16
"On to this one quicker than
a jackrabbit on a hot date.
Look at this finish! That is
beyond world class."
"Braver than a matador in a
pink tutu he was."
"Racing Santander’s butcher
men tried to hack down Xavi.
Xavi dancing over the
combine harvesters that are
coming after him."
“He could make an onion
cry.” (on Lionel Messi) "Where the insane
becomes the routine with
this man. He is nothing less
than a ball whisperer."
Abstraction:
Ray Hudson Quotes
17
“You campaign in poetry. You
govern in prose.” - Mario Cuomo
But maybe all language is poetic.
Languages of Biomedical Knowledge
18
19
Which cirrhosis?
Specificity?
http://apps.who.int/classifications/icd10
Translation
and
mapping
terms
20
story
history
Our project:
Illuminating the Druggable Genome (IDG)
$4.9M
21
Illuminating the Druggable Genome
Knowledge Management Center (IDG-KMC)
Translational Informatics Division
Chief: Tudor Oprea, MD, PhD
IDG-KMC Workflow
22
IDG-KMC Collaborator Network
23
Slide ℅ Tudor Oprea
24
Heterogeneous data integration. Language diversity.
IDG-KMC
Language
Challenge:
Case #1:
Drug
Nomenclature
25http://pasilla.health.unm.edu/tomcat/drugdb
IDG-KMC
Language
Challenge:
Case #2:
Disease
Nomenclature
26
27
ICD Disease Ontology
● The International Classification of
Diseases (ICD) is the standard
diagnostic tool for epidemiology,
health management and clinical
purposes.
● WHO
● Clinical emphasis
● Procedures (CM)
● EMR
● Versions
● The mission the Disease Ontology
(DO) is to provide an open source
ontology for the integration of
biomedical data that is associated
with human disease.
● Academic network
● Research emphasis
● Community driven
● Continual updates
Disease nomenclature
● Nosology, classification, ontology
● 17k codes in ICD-9. 155k codes in ICD-10.
● Implicit: Disease model of medicine
28
My recent
Dx: Otitis
Disease vs.
Condition vs.
Symptom vs.
Phenotype
29
℅ WebMD
30
IDG KMC: Gene expression vs. Tissues; Different sources, tissue terms.
IDG-KMC: TCRD - Target Central Research Db
+------------+------------+--------+------+------------------------------------------------------------------+--------+-------+
| doid | Disease | zscore | conf | Protein | idgfam | tdl |
+------------+------------+--------+------+------------------------------------------------------------------+--------+-------+
| DOID:13189 | Gout | 3.512 | 1.8 | Alpha-protein kinase 1 | Kinase | Tbio |
| DOID:13189 | Gout | 3.214 | 1.6 | Serine/threonine-protein kinase SIK1 | Kinase | Tchem |
| DOID:13189 | Gout | 2.922 | 1.5 | Melanocortin receptor 3 | GPCR | Tchem |
| DOID:13189 | Gout | 2.797 | 1.4 | Taste receptor type 2 member 30 | GPCR | Tbio |
| DOID:13189 | Gout | 2.576 | 1.3 | Taste receptor type 2 member 16 | GPCR | Tbio |
| DOID:13189 | Gout | 2.379 | 1.2 | Hepatocyte nuclear factor 4-gamma | NR | Tbio |
| DOID:13189 | Gout | 2.441 | 1.2 | Tyrosine-protein kinase SYK | Kinase | Tchem |
| DOID:13189 | Gout | 1.948 | 1.0 | cGMP-dependent protein kinase 2 | Kinase | Tchem |
| DOID:13189 | Gout | 1.798 | 0.9 | Pannexin-1 | IC | Tbio |
| DOID:13189 | Gout | 1.517 | 0.8 | Taste receptor type 2 member 38 | GPCR | Tbio |
| DOID:13189 | Gout | 1.565 | 0.8 | Transient receptor potential cation channel subfamily A member 1 | IC | Tclin |
| DOID:13189 | Gout | 1.531 | 0.8 | Transient receptor potential cation channel subfamily V member 1 | IC | Tclin |
| DOID:13189 | Gout | 1.388 | 0.7 | Adenosine kinase | Kinase | Tchem |
| DOID:13189 | Gout | 1.427 | 0.7 | Interleukin-1 receptor-associated kinase 1 | Kinase | Tchem |
| DOID:13189 | Gout | 1.375 | 0.7 | Transient receptor potential cation channel subfamily M member 3 | IC | Tbio |
| DOID:13189 | Gout | 1.255 | 0.6 | Free fatty acid receptor 4 | GPCR | Tchem |
| DOID:13189 | Gout | 1.231 | 0.6 | P2X purinoceptor 2 | IC | Tbio |
| DOID:13189 | Gout | 1.198 | 0.6 | Proto-oncogene tyrosine-protein kinase Src | Kinase | Tclin |
| DOID:13189 | Gout | 1.108 | 0.6 | Tribbles homolog 1 | Kinase | Tbio |
| DOID:13189 | Gout | 1.093 | 0.5 | Activin receptor type-1B | Kinase | Tchem |
| DOID:13189 | Gout | 1.048 | 0.5 | Transient receptor potential cation channel subfamily V member 2 | IC | Tbio |
+------------+------------+--------+------+------------------------------------------------------------------+--------+-------+
Disease-gene associations via literature text mining. 31
32
Text mining, named entity recognition, term frequency
Natural language processing, Google, Watson, Siri, and the state of the art
Language Diversity of Computers
Final Thought:
“Can we talk?”*
℅ Joan Rivers, 1933-2014
33

More Related Content

Similar to BioMISS: Language Diversity of Computing

Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptxSilpa87
 
Bda2015 tutorial-part2-data&databases
Bda2015 tutorial-part2-data&databasesBda2015 tutorial-part2-data&databases
Bda2015 tutorial-part2-data&databasesInterpretOmics
 
Open Science and Data Sharing - CERF
Open Science and Data Sharing - CERFOpen Science and Data Sharing - CERF
Open Science and Data Sharing - CERFKaitlin Thaney
 
NetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderNetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderAlexander Pico
 
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...InsideScientific
 
Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1Robert (Rob) Salomon
 
GS1 Standards for Smart Agriculture and Food Safety Systems
GS1 Standards for Smart Agriculture and Food Safety SystemsGS1 Standards for Smart Agriculture and Food Safety Systems
GS1 Standards for Smart Agriculture and Food Safety SystemsDaeyoung Kim
 
SNOMED CT Demo Intro / SNOMED CT 2019 -seminaari (29.3.2019)
SNOMED CT Demo Intro / SNOMED CT 2019 -seminaari (29.3.2019)SNOMED CT Demo Intro / SNOMED CT 2019 -seminaari (29.3.2019)
SNOMED CT Demo Intro / SNOMED CT 2019 -seminaari (29.3.2019)THL
 
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...Martin Krallinger
 
Data standards in bioinformatics
Data standards in bioinformatics	Data standards in bioinformatics
Data standards in bioinformatics Rafael C. Jimenez
 
Unevenly Distributed
Unevenly DistributedUnevenly Distributed
Unevenly DistributedC4Media
 
Big (chemical) data? No Problem!
Big (chemical) data? No Problem!Big (chemical) data? No Problem!
Big (chemical) data? No Problem!Greg Landrum
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astrowebuploader
 
Knowledge Sharing - aCCCeso
Knowledge Sharing - aCCCesoKnowledge Sharing - aCCCeso
Knowledge Sharing - aCCCesoKaitlin Thaney
 
Reproducible research - to infinity
Reproducible research - to infinityReproducible research - to infinity
Reproducible research - to infinityPeterMorrell4
 

Similar to BioMISS: Language Diversity of Computing (20)

Data integration
Data integrationData integration
Data integration
 
Databases_CSS2.pptx
Databases_CSS2.pptxDatabases_CSS2.pptx
Databases_CSS2.pptx
 
Bda2015 tutorial-part2-data&databases
Bda2015 tutorial-part2-data&databasesBda2015 tutorial-part2-data&databases
Bda2015 tutorial-part2-data&databases
 
Open Science and Data Sharing - CERF
Open Science and Data Sharing - CERFOpen Science and Data Sharing - CERF
Open Science and Data Sharing - CERF
 
NetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderNetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas Kelder
 
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
 
HUG @ NGCLE@e-Novia 15.11.2017
HUG @ NGCLE@e-Novia 15.11.2017HUG @ NGCLE@e-Novia 15.11.2017
HUG @ NGCLE@e-Novia 15.11.2017
 
ChIP-seq Theory
ChIP-seq TheoryChIP-seq Theory
ChIP-seq Theory
 
Data integration
Data integrationData integration
Data integration
 
Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1Flow Cytometry Training : Introduction day 1 session 1
Flow Cytometry Training : Introduction day 1 session 1
 
GS1 Standards for Smart Agriculture and Food Safety Systems
GS1 Standards for Smart Agriculture and Food Safety SystemsGS1 Standards for Smart Agriculture and Food Safety Systems
GS1 Standards for Smart Agriculture and Food Safety Systems
 
Practical semantics in the pharmaceutical industry - the Open PHACTS project
Practical semantics in the pharmaceutical industry - the Open PHACTS projectPractical semantics in the pharmaceutical industry - the Open PHACTS project
Practical semantics in the pharmaceutical industry - the Open PHACTS project
 
SNOMED CT Demo Intro / SNOMED CT 2019 -seminaari (29.3.2019)
SNOMED CT Demo Intro / SNOMED CT 2019 -seminaari (29.3.2019)SNOMED CT Demo Intro / SNOMED CT 2019 -seminaari (29.3.2019)
SNOMED CT Demo Intro / SNOMED CT 2019 -seminaari (29.3.2019)
 
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
Overview of DisTEMIST at BioASQ: Automatic detection and normalization of dis...
 
Data standards in bioinformatics
Data standards in bioinformatics	Data standards in bioinformatics
Data standards in bioinformatics
 
Unevenly Distributed
Unevenly DistributedUnevenly Distributed
Unevenly Distributed
 
Big (chemical) data? No Problem!
Big (chemical) data? No Problem!Big (chemical) data? No Problem!
Big (chemical) data? No Problem!
 
wolstencroft-ogf20-astro
wolstencroft-ogf20-astrowolstencroft-ogf20-astro
wolstencroft-ogf20-astro
 
Knowledge Sharing - aCCCeso
Knowledge Sharing - aCCCesoKnowledge Sharing - aCCCeso
Knowledge Sharing - aCCCeso
 
Reproducible research - to infinity
Reproducible research - to infinityReproducible research - to infinity
Reproducible research - to infinity
 

More from Jeremy Yang

TIGA: Target Illumination GWAS Analytics
TIGA: Target Illumination GWAS AnalyticsTIGA: Target Illumination GWAS Analytics
TIGA: Target Illumination GWAS AnalyticsJeremy Yang
 
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizerDrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizerJeremy Yang
 
Mining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
Mining ClinicalTrials.gov via CTTI AACT for drug target hypothesesMining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
Mining ClinicalTrials.gov via CTTI AACT for drug target hypothesesJeremy Yang
 
TIN-X v2: modernized architecture with REST API
TIN-X v2: modernized architecture with REST APITIN-X v2: modernized architecture with REST API
TIN-X v2: modernized architecture with REST APIJeremy Yang
 
Ex-files: Sex-Specific Gene Expression Profiles Explorer
Ex-files: Sex-Specific Gene Expression Profiles ExplorerEx-files: Sex-Specific Gene Expression Profiles Explorer
Ex-files: Sex-Specific Gene Expression Profiles ExplorerJeremy Yang
 
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Jeremy Yang
 
Open Phenotypic Drug Discovery Resource poster
Open Phenotypic Drug Discovery Resource posterOpen Phenotypic Drug Discovery Resource poster
Open Phenotypic Drug Discovery Resource posterJeremy Yang
 
Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)Jeremy Yang
 
Bibliological data science and drug discovery
Bibliological data science and drug discoveryBibliological data science and drug discovery
Bibliological data science and drug discoveryJeremy Yang
 
RMSD: routine measure stirs doubts
RMSD: routine measure stirs doubtsRMSD: routine measure stirs doubts
RMSD: routine measure stirs doubtsJeremy Yang
 
Canonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsCanonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsJeremy Yang
 
Molecular scaffolds poster
Molecular scaffolds posterMolecular scaffolds poster
Molecular scaffolds posterJeremy Yang
 
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryJeremy Yang
 
The BADAPPLE promiscuity plugin for BARD
The BADAPPLE promiscuity plugin for BARDThe BADAPPLE promiscuity plugin for BARD
The BADAPPLE promiscuity plugin for BARDJeremy Yang
 
Cheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case StudiesCheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case StudiesJeremy Yang
 
How am I supposed to organize a protein database when I can't even organize m...
How am I supposed to organize a protein database when I can't even organize m...How am I supposed to organize a protein database when I can't even organize m...
How am I supposed to organize a protein database when I can't even organize m...Jeremy Yang
 
UNM Division of Biocomputing public web applications
UNM Division of Biocomputing public web applicationsUNM Division of Biocomputing public web applications
UNM Division of Biocomputing public web applicationsJeremy Yang
 
Cyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingCyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingJeremy Yang
 
Promiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNPromiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNJeremy Yang
 

More from Jeremy Yang (19)

TIGA: Target Illumination GWAS Analytics
TIGA: Target Illumination GWAS AnalyticsTIGA: Target Illumination GWAS Analytics
TIGA: Target Illumination GWAS Analytics
 
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizerDrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
DrugCentralDb and BioClients: Dockerized PostgreSql with Python API-tizer
 
Mining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
Mining ClinicalTrials.gov via CTTI AACT for drug target hypothesesMining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
Mining ClinicalTrials.gov via CTTI AACT for drug target hypotheses
 
TIN-X v2: modernized architecture with REST API
TIN-X v2: modernized architecture with REST APITIN-X v2: modernized architecture with REST API
TIN-X v2: modernized architecture with REST API
 
Ex-files: Sex-Specific Gene Expression Profiles Explorer
Ex-files: Sex-Specific Gene Expression Profiles ExplorerEx-files: Sex-Specific Gene Expression Profiles Explorer
Ex-files: Sex-Specific Gene Expression Profiles Explorer
 
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
 
Open Phenotypic Drug Discovery Resource poster
Open Phenotypic Drug Discovery Resource posterOpen Phenotypic Drug Discovery Resource poster
Open Phenotypic Drug Discovery Resource poster
 
Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)
 
Bibliological data science and drug discovery
Bibliological data science and drug discoveryBibliological data science and drug discovery
Bibliological data science and drug discovery
 
RMSD: routine measure stirs doubts
RMSD: routine measure stirs doubtsRMSD: routine measure stirs doubts
RMSD: routine measure stirs doubts
 
Canonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformaticsCanonicalized systematic nomenclature in cheminformatics
Canonicalized systematic nomenclature in cheminformatics
 
Molecular scaffolds poster
Molecular scaffolds posterMolecular scaffolds poster
Molecular scaffolds poster
 
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discovery
 
The BADAPPLE promiscuity plugin for BARD
The BADAPPLE promiscuity plugin for BARDThe BADAPPLE promiscuity plugin for BARD
The BADAPPLE promiscuity plugin for BARD
 
Cheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case StudiesCheminformatics Software Development: Case Studies
Cheminformatics Software Development: Case Studies
 
How am I supposed to organize a protein database when I can't even organize m...
How am I supposed to organize a protein database when I can't even organize m...How am I supposed to organize a protein database when I can't even organize m...
How am I supposed to organize a protein database when I can't even organize m...
 
UNM Division of Biocomputing public web applications
UNM Division of Biocomputing public web applicationsUNM Division of Biocomputing public web applications
UNM Division of Biocomputing public web applications
 
Cyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in BiocomputingCyberinfrastructure Day 2010: Applications in Biocomputing
Cyberinfrastructure Day 2010: Applications in Biocomputing
 
Promiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNPromiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCN
 

Recently uploaded

Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionPriyansha Singh
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 

Recently uploaded (20)

Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorption
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 

BioMISS: Language Diversity of Computing

  • 1. The Language Diversity of Computing Or, how to talk with a computer. Jeremy Yang (Mgr., Systems & Programming) Translational Informatics Div. Dept. of Internal Medicine University of New Mexico BioMISS -- Thursday, Oct 15, 2015 1
  • 2. Language Diversity Examples Python Perl Fortran C R C++ Java Basic SQL Sparql XML XSD XPath URLs bash HTML HTTP ASCII UTF-8 regex Scala ICD-10 Ruby OWL RDF 2
  • 3. A Working Definition of “Language” ● Coherent symbology (symbolic system) 3
  • 4. Languages: Some major advances COBOL (1960) Sparql (2008) Java (1995) 4 1950 FORTRAN (1953) 1960 1970 1980 1990 2000 2010 SQL (1979) C (1969) C++ (1979) Perl (1987) Python (1989) HTML (1990) XML (1997) RDF (1999)
  • 5. Language merit vs. elitism 5
  • 6. Why do we care about languages? ● Compatibility ● Efficiency ● Usability ● Knowledge representation ● Intelligence ● Evolution Naturellement! 6
  • 7. 7 ℅ Prof Harald Sack, Hasso Plattner Institute, U. Potsdam, MOOC: “Semantic Web Technologies”
  • 8. Programming paradigms Object Oriented ● classes ● instances ● methods ● ~ nouns 8 Functional ● functions ● routines ● parameters ● ~ verbs Programming paradigms are language paradigms.
  • 9. 9 Object Oriented Example: CDK = Chemistry Development Kit Open source Java package & API Computers have “evolved” from numerical calculators to knowledge processors. Knowledge representation and processing via language!
  • 10. 10 Italian Music Terms Choice of language should be guided by the domain.
  • 11. Q: So what is the problem? A: Language gaps CODE JARGON MEANING “Interpretation” MATH 11
  • 12. Q: So what is the problem? A: Standards (so many!) “Why can’t my iPhone talk to my ...” ● TV ● Audio system ● Car ● Medical records 12
  • 13. Q: So what is the problem? A: Language shapes, empowers, limits thought. (Sapir-Whorf Hypothesis, aka Linguistic Relativity) 13
  • 14. Q: So what is the problem? A: Abstraction ● Overgeneralizing ● Reality is concrete! ● But: abstraction organizes knowledge ● (a feature, not a bug!) 14 “We think in generalities, but we live in detail.” -- Alfred North Whitehead
  • 15. 15 Abstraction: Shakespeare quotes “Full of sound and fury, signifying nothing.”
  • 16. 16 "On to this one quicker than a jackrabbit on a hot date. Look at this finish! That is beyond world class." "Braver than a matador in a pink tutu he was." "Racing Santander’s butcher men tried to hack down Xavi. Xavi dancing over the combine harvesters that are coming after him." “He could make an onion cry.” (on Lionel Messi) "Where the insane becomes the routine with this man. He is nothing less than a ball whisperer." Abstraction: Ray Hudson Quotes
  • 17. 17 “You campaign in poetry. You govern in prose.” - Mario Cuomo But maybe all language is poetic.
  • 18. Languages of Biomedical Knowledge 18
  • 21. Our project: Illuminating the Druggable Genome (IDG) $4.9M 21
  • 22. Illuminating the Druggable Genome Knowledge Management Center (IDG-KMC) Translational Informatics Division Chief: Tudor Oprea, MD, PhD IDG-KMC Workflow 22
  • 24. Slide ℅ Tudor Oprea 24 Heterogeneous data integration. Language diversity.
  • 27. 27 ICD Disease Ontology ● The International Classification of Diseases (ICD) is the standard diagnostic tool for epidemiology, health management and clinical purposes. ● WHO ● Clinical emphasis ● Procedures (CM) ● EMR ● Versions ● The mission the Disease Ontology (DO) is to provide an open source ontology for the integration of biomedical data that is associated with human disease. ● Academic network ● Research emphasis ● Community driven ● Continual updates
  • 28. Disease nomenclature ● Nosology, classification, ontology ● 17k codes in ICD-9. 155k codes in ICD-10. ● Implicit: Disease model of medicine 28
  • 29. My recent Dx: Otitis Disease vs. Condition vs. Symptom vs. Phenotype 29 ℅ WebMD
  • 30. 30 IDG KMC: Gene expression vs. Tissues; Different sources, tissue terms.
  • 31. IDG-KMC: TCRD - Target Central Research Db +------------+------------+--------+------+------------------------------------------------------------------+--------+-------+ | doid | Disease | zscore | conf | Protein | idgfam | tdl | +------------+------------+--------+------+------------------------------------------------------------------+--------+-------+ | DOID:13189 | Gout | 3.512 | 1.8 | Alpha-protein kinase 1 | Kinase | Tbio | | DOID:13189 | Gout | 3.214 | 1.6 | Serine/threonine-protein kinase SIK1 | Kinase | Tchem | | DOID:13189 | Gout | 2.922 | 1.5 | Melanocortin receptor 3 | GPCR | Tchem | | DOID:13189 | Gout | 2.797 | 1.4 | Taste receptor type 2 member 30 | GPCR | Tbio | | DOID:13189 | Gout | 2.576 | 1.3 | Taste receptor type 2 member 16 | GPCR | Tbio | | DOID:13189 | Gout | 2.379 | 1.2 | Hepatocyte nuclear factor 4-gamma | NR | Tbio | | DOID:13189 | Gout | 2.441 | 1.2 | Tyrosine-protein kinase SYK | Kinase | Tchem | | DOID:13189 | Gout | 1.948 | 1.0 | cGMP-dependent protein kinase 2 | Kinase | Tchem | | DOID:13189 | Gout | 1.798 | 0.9 | Pannexin-1 | IC | Tbio | | DOID:13189 | Gout | 1.517 | 0.8 | Taste receptor type 2 member 38 | GPCR | Tbio | | DOID:13189 | Gout | 1.565 | 0.8 | Transient receptor potential cation channel subfamily A member 1 | IC | Tclin | | DOID:13189 | Gout | 1.531 | 0.8 | Transient receptor potential cation channel subfamily V member 1 | IC | Tclin | | DOID:13189 | Gout | 1.388 | 0.7 | Adenosine kinase | Kinase | Tchem | | DOID:13189 | Gout | 1.427 | 0.7 | Interleukin-1 receptor-associated kinase 1 | Kinase | Tchem | | DOID:13189 | Gout | 1.375 | 0.7 | Transient receptor potential cation channel subfamily M member 3 | IC | Tbio | | DOID:13189 | Gout | 1.255 | 0.6 | Free fatty acid receptor 4 | GPCR | Tchem | | DOID:13189 | Gout | 1.231 | 0.6 | P2X purinoceptor 2 | IC | Tbio | | DOID:13189 | Gout | 1.198 | 0.6 | Proto-oncogene tyrosine-protein kinase Src | Kinase | Tclin | | DOID:13189 | Gout | 1.108 | 0.6 | Tribbles homolog 1 | Kinase | Tbio | | DOID:13189 | Gout | 1.093 | 0.5 | Activin receptor type-1B | Kinase | Tchem | | DOID:13189 | Gout | 1.048 | 0.5 | Transient receptor potential cation channel subfamily V member 2 | IC | Tbio | +------------+------------+--------+------+------------------------------------------------------------------+--------+-------+ Disease-gene associations via literature text mining. 31
  • 32. 32 Text mining, named entity recognition, term frequency Natural language processing, Google, Watson, Siri, and the state of the art
  • 33. Language Diversity of Computers Final Thought: “Can we talk?”* ℅ Joan Rivers, 1933-2014 33