SlideShare a Scribd company logo
1 of 29
Transcriptomics and Lexico-syntactic Analysis (Yet another meaning of the TLA homonym) Lars Juhl Jensen EMBL Heidelberg
A brief history of TLA ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The context – what I actually work on (When I’m not telling other people to work on my IE project) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
“Biologists would rather share their toothbrush than share a gene name” ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The synonyms and orthologs lists can be downloaded from: http://www.bork.embl.de/synonyms
Number of uniquely resolvable names for each species 40,038 18,702 7.7 48,291 6,210 S. cerevisiae 15,865 116,712 6.6 132,577 20,006 M. musculus 18,944 181,186 7.1 200,130 27,936 H. sapiens 14,072 77,757 22,707 6.1 103,208 16,871 D. melanogaster 18,214 65,749 45,835 5.4 110,602 20,348 C. elegans 20,158 118,818 5.3 138,976 25,957 A. thaliana Uni-Gene SWALL Species specific Ratio Names Proteins
Orthographic variations of gene names ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Retraining TreeTagger for Medline abstracts ,[object Object],[object Object],[object Object],[object Object]
Tagging is really easy ... compared to extracting the information you are after ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
A mini-ontology of transcription regulation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Parsing abstracts to identify relationships between genes/proteins ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
TIGERSearch is used for searching and browsing the large processed text corpus
Pattern recognize sentences in both active and passive voice
Typical results are shown ,[object Object],[object Object],[object Object]
We can only wish that all biologists mention their results twice ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Two out of three is not bad at all ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Life is unfair ,[object Object],[object Object],[object Object],[object Object]
Why not extract phosphorylations while we are at it? ,[object Object],[object Object]
Using text mining of Medline abstract to support predicted regulatory interactions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Microarrays 101 ,[object Object],[object Object],[object Object],[object Object]
Non-linear normalization of intensities and correction for spatial effects Downloaded SMD data After intensity normalization Spatial bias estimate After spatial normalization
Combining arrays from multiple experiments into one gene expression matrix ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
“And now we cluster correlated expression profiles ... no, wait a second!” ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Singular value decomposition – letting the data speak for themselves ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],   Sporulation  Polysomes  Salt treatment  Heat-shock    Starvation      RNA stability 8 7 6 5 4 3 2 1
Inferring functional links from projections of genes onto singular vectors ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Proteins linked to the human mitotic checkpoint protein BUB1 Comments Description Identifier Cyclin-dependent kinases regulatory subunit CKS1_HUMAN Involved in cell cycle arrest Serine/threonine-protein kinase Chk1 CHK1_HUMAN Involved in mitotic regulation Serine/threonine-protein kinase NEK2 NEK2_HUMAN Cell cycle-dependent expression CRM1 protein O14980 May act as a negative regulator of entry into mitosis Wee1-like protein kinase WEE1_HUMAN Kinesin-like protein 2 KNS2_HUMAN Kinesin-like protein 2 Q96SE4 Kinesin-like protein KIF14 KF14_HUMAN HCAP-H protein Q15003 Contains six WD40 repeats L2DTL protein Q9NZJ0 Cyclin A2 CGA2_HUMAN Contains a PRY and a SPRY domain Hypothetical protein Q8N324 Polymyositis/scleroderma autoantigen 1 PMC1_HUMAN M-phase inducer phosphatase 1 MPI1_HUMAN DNA topoisomerase II TP2A_HUMAN Cell cycle regulated kinase, inhibits Cdc2 Membrane-associated kinase O14731 Phosphorylated by Cdk2 during S-phase Myb-related protein B MYBB_HUMAN Associated with "growth cones" Brain acid soluble protein 1 BASP_HUMAN Phosphorylated in M-phase Forkhead box protein M1 FXM1_HUMAN High mobility group protein 2 HMG2_HUMAN Cyclin-dependent kinase inhibitor 3 CDN3_HUMAN Mitotic kinesin-like protein 1 Kinesin-like 5 Q8WVP0
Tha-tha-tha-that’s all folks! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The STRING web service ,[object Object],[object Object],[object Object],[object Object],STRING is accessible at: http:// www.bork.embl.de/ STRING
Honestly – it’s not my fault! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Questions?

More Related Content

What's hot

What's hot (20)

iGEM Paper (more pretty)
iGEM Paper (more pretty)iGEM Paper (more pretty)
iGEM Paper (more pretty)
 
Characterisation of 5-HT3 receptor subunits
Characterisation of 5-HT3 receptor subunitsCharacterisation of 5-HT3 receptor subunits
Characterisation of 5-HT3 receptor subunits
 
Bocca et al BBRC
Bocca et al BBRCBocca et al BBRC
Bocca et al BBRC
 
Gene mutation
Gene mutationGene mutation
Gene mutation
 
RNAP_paper
RNAP_paperRNAP_paper
RNAP_paper
 
Will the real proteins please stand up
Will the real proteins please stand upWill the real proteins please stand up
Will the real proteins please stand up
 
Giuliano SIRE
Giuliano SIREGiuliano SIRE
Giuliano SIRE
 
Curriculum Vitae.
Curriculum Vitae.Curriculum Vitae.
Curriculum Vitae.
 
Complementation test
Complementation testComplementation test
Complementation test
 
Essential Biology 4.3 Theoretical Genetics
Essential Biology 4.3 Theoretical GeneticsEssential Biology 4.3 Theoretical Genetics
Essential Biology 4.3 Theoretical Genetics
 
Lesson 13.3
Lesson 13.3Lesson 13.3
Lesson 13.3
 
projekt final
projekt finalprojekt final
projekt final
 
Thesis Project Luke Morton 2016
Thesis Project Luke Morton 2016Thesis Project Luke Morton 2016
Thesis Project Luke Morton 2016
 
Molecular Basis of Mutation
Molecular Basis of MutationMolecular Basis of Mutation
Molecular Basis of Mutation
 
assignment on inheritance and expressio of organeller dna 1
 assignment on inheritance and expressio of organeller dna 1 assignment on inheritance and expressio of organeller dna 1
assignment on inheritance and expressio of organeller dna 1
 
transposon mediated mutagenesis
transposon mediated mutagenesistransposon mediated mutagenesis
transposon mediated mutagenesis
 
melissa Poster SGM mel[1]
melissa Poster SGM mel[1]melissa Poster SGM mel[1]
melissa Poster SGM mel[1]
 
15.1 3 study guide ans
15.1 3 study guide ans15.1 3 study guide ans
15.1 3 study guide ans
 
CREE_poster_v8
CREE_poster_v8CREE_poster_v8
CREE_poster_v8
 
P53_Final_Presentation
P53_Final_PresentationP53_Final_Presentation
P53_Final_Presentation
 

Viewers also liked

Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...Natalio Krasnogor
 
Al ramtha city jordan surface potential map, final2
Al ramtha city jordan surface potential map, final2Al ramtha city jordan surface potential map, final2
Al ramtha city jordan surface potential map, final2Shomou' Aljizawi
 
Urban Data Requirements by Anna Rose (Space Syntax)
Urban Data Requirements by Anna Rose (Space Syntax)Urban Data Requirements by Anna Rose (Space Syntax)
Urban Data Requirements by Anna Rose (Space Syntax)plan4business
 
Port-City governance. A comparative analysis in the European context.
Port-City governance. A comparative analysis in the European context.Port-City governance. A comparative analysis in the European context.
Port-City governance. A comparative analysis in the European context.José Manuel Pagés Sánchez
 
Al ramtha city in jordan analysis by space syntax
Al ramtha city in jordan analysis by  space syntaxAl ramtha city in jordan analysis by  space syntax
Al ramtha city in jordan analysis by space syntaxShomou' Aljizawi
 
Space syntax
Space syntaxSpace syntax
Space syntaxmaklipu
 
Space syntax introduction & overview
Space syntax introduction & overviewSpace syntax introduction & overview
Space syntax introduction & overviewTim Stonor
 
PERI-URBAN LAND USE CHANGE IN LAGOS THE MEGA-CITY SEMINAR 2
PERI-URBAN LAND USE CHANGE IN LAGOS THE MEGA-CITY SEMINAR 2PERI-URBAN LAND USE CHANGE IN LAGOS THE MEGA-CITY SEMINAR 2
PERI-URBAN LAND USE CHANGE IN LAGOS THE MEGA-CITY SEMINAR 2Samuel Dekolo
 
Waterfront Development Principles
Waterfront Development PrinciplesWaterfront Development Principles
Waterfront Development PrinciplesNicholas Socrates
 
K-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsK-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsVarad Meru
 
Urban land use
Urban land useUrban land use
Urban land useSarahDee24
 
Forward Thinking: A Study In Transportation, Land Use And Urban Design In Nag...
Forward Thinking: A Study In Transportation, Land Use And Urban Design In Nag...Forward Thinking: A Study In Transportation, Land Use And Urban Design In Nag...
Forward Thinking: A Study In Transportation, Land Use And Urban Design In Nag...Willy Prilles
 

Viewers also liked (20)

Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
Integrative analysis of transcriptomics and proteomics data with ArrayMining ...
 
Al ramtha city jordan surface potential map, final2
Al ramtha city jordan surface potential map, final2Al ramtha city jordan surface potential map, final2
Al ramtha city jordan surface potential map, final2
 
Map
Map Map
Map
 
Mes welcome day
Mes welcome dayMes welcome day
Mes welcome day
 
Land use analysis in GMS
Land use analysis in GMSLand use analysis in GMS
Land use analysis in GMS
 
Urban Data Requirements by Anna Rose (Space Syntax)
Urban Data Requirements by Anna Rose (Space Syntax)Urban Data Requirements by Anna Rose (Space Syntax)
Urban Data Requirements by Anna Rose (Space Syntax)
 
Port-City governance. A comparative analysis in the European context.
Port-City governance. A comparative analysis in the European context.Port-City governance. A comparative analysis in the European context.
Port-City governance. A comparative analysis in the European context.
 
Isocarp jose sanchez
Isocarp jose sanchezIsocarp jose sanchez
Isocarp jose sanchez
 
Al ramtha city in jordan analysis by space syntax
Al ramtha city in jordan analysis by  space syntaxAl ramtha city in jordan analysis by  space syntax
Al ramtha city in jordan analysis by space syntax
 
Maps
MapsMaps
Maps
 
Space syntax
Space syntaxSpace syntax
Space syntax
 
Space syntax introduction & overview
Space syntax introduction & overviewSpace syntax introduction & overview
Space syntax introduction & overview
 
Port of Rotterdam 3
Port of Rotterdam 3Port of Rotterdam 3
Port of Rotterdam 3
 
PERI-URBAN LAND USE CHANGE IN LAGOS THE MEGA-CITY SEMINAR 2
PERI-URBAN LAND USE CHANGE IN LAGOS THE MEGA-CITY SEMINAR 2PERI-URBAN LAND USE CHANGE IN LAGOS THE MEGA-CITY SEMINAR 2
PERI-URBAN LAND USE CHANGE IN LAGOS THE MEGA-CITY SEMINAR 2
 
Case study space syntax
Case study space syntaxCase study space syntax
Case study space syntax
 
Space syntax
Space syntaxSpace syntax
Space syntax
 
Waterfront Development Principles
Waterfront Development PrinciplesWaterfront Development Principles
Waterfront Development Principles
 
K-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsK-Means, its Variants and its Applications
K-Means, its Variants and its Applications
 
Urban land use
Urban land useUrban land use
Urban land use
 
Forward Thinking: A Study In Transportation, Land Use And Urban Design In Nag...
Forward Thinking: A Study In Transportation, Land Use And Urban Design In Nag...Forward Thinking: A Study In Transportation, Land Use And Urban Design In Nag...
Forward Thinking: A Study In Transportation, Land Use And Urban Design In Nag...
 

Similar to Transcriptomics Analysis of Gene Regulation Using Text Mining

Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein functionLars Juhl Jensen
 
Utilizing literature for biological discovery
Utilizing literature for biological discoveryUtilizing literature for biological discovery
Utilizing literature for biological discoveryLars Juhl Jensen
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...Anita de Waard
 
NCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesNCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesJackie Wirz, PhD
 
STRING: Protein association networks
STRING: Protein association networksSTRING: Protein association networks
STRING: Protein association networksLars Juhl Jensen
 
STRING: protein association networks
STRING: protein association networksSTRING: protein association networks
STRING: protein association networksLars Juhl Jensen
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityMonica Munoz-Torres
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data setsimprovemed
 
MathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaperMathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaperMathias Hibbard
 
Multilocus sequence typin1
Multilocus sequence typin1Multilocus sequence typin1
Multilocus sequence typin1Manash Debbarma
 
DNA Sequencing in Phylogeny
DNA Sequencing in PhylogenyDNA Sequencing in Phylogeny
DNA Sequencing in PhylogenyBikash1489
 
12.03.13 - Journal Club
12.03.13 - Journal Club12.03.13 - Journal Club
12.03.13 - Journal ClubFarhoud Faraji
 
Apollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityApollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityMonica Munoz-Torres
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination NetworkMonica Munoz-Torres
 
Mar Gonzales Porta, One gene One transcript, fged_seattle_2013
Mar Gonzales Porta, One gene One transcript, fged_seattle_2013Mar Gonzales Porta, One gene One transcript, fged_seattle_2013
Mar Gonzales Porta, One gene One transcript, fged_seattle_2013Functional Genomics Data Society
 
SALK seaside symposium
SALK seaside symposiumSALK seaside symposium
SALK seaside symposiumJING GU
 
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and textNetwork biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and textLars Juhl Jensen
 
Genes, Genomics, and Chromosomes computational biology introduction .ppt
Genes, Genomics, and Chromosomes computational biology introduction .pptGenes, Genomics, and Chromosomes computational biology introduction .ppt
Genes, Genomics, and Chromosomes computational biology introduction .pptMohamedHasan816582
 

Similar to Transcriptomics Analysis of Gene Regulation Using Text Mining (20)

Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
Utilizing literature for biological discovery
Utilizing literature for biological discoveryUtilizing literature for biological discovery
Utilizing literature for biological discovery
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 
NCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners SlidesNCBI Boot Camp for Beginners Slides
NCBI Boot Camp for Beginners Slides
 
Comparitive genomics
Comparitive genomicsComparitive genomics
Comparitive genomics
 
STRING: Protein association networks
STRING: Protein association networksSTRING: Protein association networks
STRING: Protein association networks
 
STRING: protein association networks
STRING: protein association networksSTRING: protein association networks
STRING: protein association networks
 
Apollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research communityApollo - A webinar for the Phascolarctos cinereus research community
Apollo - A webinar for the Phascolarctos cinereus research community
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
 
How to analyse large data sets
How to analyse large data setsHow to analyse large data sets
How to analyse large data sets
 
MathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaperMathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaper
 
Multilocus sequence typin1
Multilocus sequence typin1Multilocus sequence typin1
Multilocus sequence typin1
 
DNA Sequencing in Phylogeny
DNA Sequencing in PhylogenyDNA Sequencing in Phylogeny
DNA Sequencing in Phylogeny
 
12.03.13 - Journal Club
12.03.13 - Journal Club12.03.13 - Journal Club
12.03.13 - Journal Club
 
Apollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research CommunityApollo Introduction for the Chestnut Research Community
Apollo Introduction for the Chestnut Research Community
 
Apollo : A workshop for the Manakin Research Coordination Network
Apollo: A workshop for the Manakin Research Coordination NetworkApollo: A workshop for the Manakin Research Coordination Network
Apollo : A workshop for the Manakin Research Coordination Network
 
Mar Gonzales Porta, One gene One transcript, fged_seattle_2013
Mar Gonzales Porta, One gene One transcript, fged_seattle_2013Mar Gonzales Porta, One gene One transcript, fged_seattle_2013
Mar Gonzales Porta, One gene One transcript, fged_seattle_2013
 
SALK seaside symposium
SALK seaside symposiumSALK seaside symposium
SALK seaside symposium
 
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and textNetwork biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and text
 
Genes, Genomics, and Chromosomes computational biology introduction .ppt
Genes, Genomics, and Chromosomes computational biology introduction .pptGenes, Genomics, and Chromosomes computational biology introduction .ppt
Genes, Genomics, and Chromosomes computational biology introduction .ppt
 

More from Lars Juhl Jensen

One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Illustrating the power of dictionary-based named entit...One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Illustrating the power of dictionary-based named entit...Lars Juhl Jensen
 
One tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicineOne tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicineLars Juhl Jensen
 
Extract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotationExtract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotationLars Juhl Jensen
 
Network visualization: A crash course on using Cytoscape
Network visualization: A crash course on using CytoscapeNetwork visualization: A crash course on using Cytoscape
Network visualization: A crash course on using CytoscapeLars Juhl Jensen
 
STRING & STITCH : Network integration of heterogeneous data
STRING & STITCH: Network integration of heterogeneous dataSTRING & STITCH: Network integration of heterogeneous data
STRING & STITCH : Network integration of heterogeneous dataLars Juhl Jensen
 
Biomedical text mining: Automatic processing of unstructured text
Biomedical text mining: Automatic processing of unstructured textBiomedical text mining: Automatic processing of unstructured text
Biomedical text mining: Automatic processing of unstructured textLars Juhl Jensen
 
Medical network analysis: Linking diseases and genes through data and text mi...
Medical network analysis: Linking diseases and genes through data and text mi...Medical network analysis: Linking diseases and genes through data and text mi...
Medical network analysis: Linking diseases and genes through data and text mi...Lars Juhl Jensen
 
Network Biology: A crash course on STRING and Cytoscape
Network Biology: A crash course on STRING and CytoscapeNetwork Biology: A crash course on STRING and Cytoscape
Network Biology: A crash course on STRING and CytoscapeLars Juhl Jensen
 
Cellular Network Biology: Large-scale integration of data and text
Cellular Network Biology: Large-scale integration of data and textCellular Network Biology: Large-scale integration of data and text
Cellular Network Biology: Large-scale integration of data and textLars Juhl Jensen
 
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...Lars Juhl Jensen
 
STRING & related databases: Large-scale integration of heterogeneous data
STRING & related databases: Large-scale integration of heterogeneous dataSTRING & related databases: Large-scale integration of heterogeneous data
STRING & related databases: Large-scale integration of heterogeneous dataLars Juhl Jensen
 
Tagger: Rapid dictionary-based named entity recognition
Tagger: Rapid dictionary-based named entity recognitionTagger: Rapid dictionary-based named entity recognition
Tagger: Rapid dictionary-based named entity recognitionLars Juhl Jensen
 
Network Biology: Large-scale integration of data and text
Network Biology: Large-scale integration of data and textNetwork Biology: Large-scale integration of data and text
Network Biology: Large-scale integration of data and textLars Juhl Jensen
 
Medical text mining: Linking diseases, drugs, and adverse reactions
Medical text mining: Linking diseases, drugs, and adverse reactionsMedical text mining: Linking diseases, drugs, and adverse reactions
Medical text mining: Linking diseases, drugs, and adverse reactionsLars Juhl Jensen
 
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and textNetwork biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and textLars Juhl Jensen
 
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactionsMedical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactionsLars Juhl Jensen
 
Biomarker bioinformatics: Network-based candidate prioritization
Biomarker bioinformatics: Network-based candidate prioritizationBiomarker bioinformatics: Network-based candidate prioritization
Biomarker bioinformatics: Network-based candidate prioritizationLars Juhl Jensen
 
The Art of Counting: Scoring and ranking co-occurrences in literature
The Art of Counting: Scoring and ranking co-occurrences in literatureThe Art of Counting: Scoring and ranking co-occurrences in literature
The Art of Counting: Scoring and ranking co-occurrences in literatureLars Juhl Jensen
 

More from Lars Juhl Jensen (20)

One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Illustrating the power of dictionary-based named entit...One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Illustrating the power of dictionary-based named entit...
 
One tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicineOne tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicine
 
Extract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotationExtract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotation
 
Network visualization: A crash course on using Cytoscape
Network visualization: A crash course on using CytoscapeNetwork visualization: A crash course on using Cytoscape
Network visualization: A crash course on using Cytoscape
 
STRING & STITCH : Network integration of heterogeneous data
STRING & STITCH: Network integration of heterogeneous dataSTRING & STITCH: Network integration of heterogeneous data
STRING & STITCH : Network integration of heterogeneous data
 
Biomedical text mining: Automatic processing of unstructured text
Biomedical text mining: Automatic processing of unstructured textBiomedical text mining: Automatic processing of unstructured text
Biomedical text mining: Automatic processing of unstructured text
 
Medical network analysis: Linking diseases and genes through data and text mi...
Medical network analysis: Linking diseases and genes through data and text mi...Medical network analysis: Linking diseases and genes through data and text mi...
Medical network analysis: Linking diseases and genes through data and text mi...
 
Network Biology: A crash course on STRING and Cytoscape
Network Biology: A crash course on STRING and CytoscapeNetwork Biology: A crash course on STRING and Cytoscape
Network Biology: A crash course on STRING and Cytoscape
 
Cellular networks
Cellular networksCellular networks
Cellular networks
 
Cellular Network Biology: Large-scale integration of data and text
Cellular Network Biology: Large-scale integration of data and textCellular Network Biology: Large-scale integration of data and text
Cellular Network Biology: Large-scale integration of data and text
 
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
 
STRING & related databases: Large-scale integration of heterogeneous data
STRING & related databases: Large-scale integration of heterogeneous dataSTRING & related databases: Large-scale integration of heterogeneous data
STRING & related databases: Large-scale integration of heterogeneous data
 
Tagger: Rapid dictionary-based named entity recognition
Tagger: Rapid dictionary-based named entity recognitionTagger: Rapid dictionary-based named entity recognition
Tagger: Rapid dictionary-based named entity recognition
 
Network Biology: Large-scale integration of data and text
Network Biology: Large-scale integration of data and textNetwork Biology: Large-scale integration of data and text
Network Biology: Large-scale integration of data and text
 
Medical text mining: Linking diseases, drugs, and adverse reactions
Medical text mining: Linking diseases, drugs, and adverse reactionsMedical text mining: Linking diseases, drugs, and adverse reactions
Medical text mining: Linking diseases, drugs, and adverse reactions
 
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and textNetwork biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and text
 
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactionsMedical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions
 
Cellular Network Biology
Cellular Network BiologyCellular Network Biology
Cellular Network Biology
 
Biomarker bioinformatics: Network-based candidate prioritization
Biomarker bioinformatics: Network-based candidate prioritizationBiomarker bioinformatics: Network-based candidate prioritization
Biomarker bioinformatics: Network-based candidate prioritization
 
The Art of Counting: Scoring and ranking co-occurrences in literature
The Art of Counting: Scoring and ranking co-occurrences in literatureThe Art of Counting: Scoring and ranking co-occurrences in literature
The Art of Counting: Scoring and ranking co-occurrences in literature
 

Recently uploaded

How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxsqpmdrvczh
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 

Recently uploaded (20)

How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 

Transcriptomics Analysis of Gene Regulation Using Text Mining

  • 1. Transcriptomics and Lexico-syntactic Analysis (Yet another meaning of the TLA homonym) Lars Juhl Jensen EMBL Heidelberg
  • 2.
  • 3.
  • 4.
  • 5. Number of uniquely resolvable names for each species 40,038 18,702 7.7 48,291 6,210 S. cerevisiae 15,865 116,712 6.6 132,577 20,006 M. musculus 18,944 181,186 7.1 200,130 27,936 H. sapiens 14,072 77,757 22,707 6.1 103,208 16,871 D. melanogaster 18,214 65,749 45,835 5.4 110,602 20,348 C. elegans 20,158 118,818 5.3 138,976 25,957 A. thaliana Uni-Gene SWALL Species specific Ratio Names Proteins
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11. TIGERSearch is used for searching and browsing the large processed text corpus
  • 12. Pattern recognize sentences in both active and passive voice
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Non-linear normalization of intensities and correction for spatial effects Downloaded SMD data After intensity normalization Spatial bias estimate After spatial normalization
  • 21.
  • 22.
  • 23.
  • 24.
  • 25. Proteins linked to the human mitotic checkpoint protein BUB1 Comments Description Identifier Cyclin-dependent kinases regulatory subunit CKS1_HUMAN Involved in cell cycle arrest Serine/threonine-protein kinase Chk1 CHK1_HUMAN Involved in mitotic regulation Serine/threonine-protein kinase NEK2 NEK2_HUMAN Cell cycle-dependent expression CRM1 protein O14980 May act as a negative regulator of entry into mitosis Wee1-like protein kinase WEE1_HUMAN Kinesin-like protein 2 KNS2_HUMAN Kinesin-like protein 2 Q96SE4 Kinesin-like protein KIF14 KF14_HUMAN HCAP-H protein Q15003 Contains six WD40 repeats L2DTL protein Q9NZJ0 Cyclin A2 CGA2_HUMAN Contains a PRY and a SPRY domain Hypothetical protein Q8N324 Polymyositis/scleroderma autoantigen 1 PMC1_HUMAN M-phase inducer phosphatase 1 MPI1_HUMAN DNA topoisomerase II TP2A_HUMAN Cell cycle regulated kinase, inhibits Cdc2 Membrane-associated kinase O14731 Phosphorylated by Cdk2 during S-phase Myb-related protein B MYBB_HUMAN Associated with "growth cones" Brain acid soluble protein 1 BASP_HUMAN Phosphorylated in M-phase Forkhead box protein M1 FXM1_HUMAN High mobility group protein 2 HMG2_HUMAN Cyclin-dependent kinase inhibitor 3 CDN3_HUMAN Mitotic kinesin-like protein 1 Kinesin-like 5 Q8WVP0
  • 26.
  • 27.
  • 28.