Submit Search
Upload
Tagger: Rapid dictionary-based named entity recognition
•
Download as PPT, PDF
•
1 like
•
92 views
Lars Juhl Jensen
Follow
Tagger: Rapid dictionary-based named entity recognition
Read less
Read more
Science
Report
Share
Report
Share
1 of 35
Download now
Recommended
Extract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotation
Lars Juhl Jensen
One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Illustrating the power of dictionary-based named entit...
Lars Juhl Jensen
Cellular Network Biology: Large-scale integration of data and text
Cellular Network Biology: Large-scale integration of data and text
Lars Juhl Jensen
Large-scale integration of data and text
Large-scale integration of data and text
Lars Juhl Jensen
Real-time tagging of biomedical entities
Real-time tagging of biomedical entities
Lars Juhl Jensen
Prediction of protein networks through data integration
Prediction of protein networks through data integration
Lars Juhl Jensen
Functional association networks - The STRING and STITCH web resources
Functional association networks - The STRING and STITCH web resources
Lars Juhl Jensen
Kefed introduction 12-05-10-2224
Kefed introduction 12-05-10-2224
Gully Burns
Recommended
Extract 2.0: Text-mining-assisted interactive annotation
Extract 2.0: Text-mining-assisted interactive annotation
Lars Juhl Jensen
One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Illustrating the power of dictionary-based named entit...
Lars Juhl Jensen
Cellular Network Biology: Large-scale integration of data and text
Cellular Network Biology: Large-scale integration of data and text
Lars Juhl Jensen
Large-scale integration of data and text
Large-scale integration of data and text
Lars Juhl Jensen
Real-time tagging of biomedical entities
Real-time tagging of biomedical entities
Lars Juhl Jensen
Prediction of protein networks through data integration
Prediction of protein networks through data integration
Lars Juhl Jensen
Functional association networks - The STRING and STITCH web resources
Functional association networks - The STRING and STITCH web resources
Lars Juhl Jensen
Kefed introduction 12-05-10-2224
Kefed introduction 12-05-10-2224
Gully Burns
Genomes On Rails
Genomes On Rails
Matt Wood
Welch Wordifier Bosc2009
Welch Wordifier Bosc2009
bosc
2014 Taverna Tutorial Introduction to eScience and workflows
2014 Taverna Tutorial Introduction to eScience and workflows
myGrid team
One tagger, many uses - Illustrating the power of ontologies in named entity ...
One tagger, many uses - Illustrating the power of ontologies in named entity ...
Lars Juhl Jensen
Computation and Knowledge
Computation and Knowledge
Ian Foster
Next generation sequencing & microarray-- Genotypic Technology
Next generation sequencing & microarray-- Genotypic Technology
Genotypic Technology
Text mining for organism and environment names
Text mining for organism and environment names
Lars Juhl Jensen
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Ian Foster
OVium Bioinformatic Solutions
OVium Bioinformatic Solutions
OVium Solutions
Scientific Data Management
Scientific Data Management
Alberto Labarga
The pragmatic text miner: It's just another type of poorly standardized data
The pragmatic text miner: It's just another type of poorly standardized data
Lars Juhl Jensen
The STITCH and Reflect web resources
The STITCH and Reflect web resources
Lars Juhl Jensen
Bioinformatics Data Pipelines built by CSIRO on AWS
Bioinformatics Data Pipelines built by CSIRO on AWS
Lynn Langit
One tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicine
Lars Juhl Jensen
Semantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical Informatics
Amit Sheth
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
Fabio Caligaris
Jan2016 pac bio giab
Jan2016 pac bio giab
GenomeInABottle
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
GigaScience, BGI Hong Kong
Optimizing queries via search server ElasticSearch: a study applied to large ...
Optimizing queries via search server ElasticSearch: a study applied to large ...
Alex Camargo
The pragmatic text miner: It’s just another type of poorly standardized data
The pragmatic text miner: It’s just another type of poorly standardized data
Lars Juhl Jensen
Network visualization: A crash course on using Cytoscape
Network visualization: A crash course on using Cytoscape
Lars Juhl Jensen
STRING & STITCH: Network integration of heterogeneous data
STRING & STITCH: Network integration of heterogeneous data
Lars Juhl Jensen
More Related Content
Similar to Tagger: Rapid dictionary-based named entity recognition
Genomes On Rails
Genomes On Rails
Matt Wood
Welch Wordifier Bosc2009
Welch Wordifier Bosc2009
bosc
2014 Taverna Tutorial Introduction to eScience and workflows
2014 Taverna Tutorial Introduction to eScience and workflows
myGrid team
One tagger, many uses - Illustrating the power of ontologies in named entity ...
One tagger, many uses - Illustrating the power of ontologies in named entity ...
Lars Juhl Jensen
Computation and Knowledge
Computation and Knowledge
Ian Foster
Next generation sequencing & microarray-- Genotypic Technology
Next generation sequencing & microarray-- Genotypic Technology
Genotypic Technology
Text mining for organism and environment names
Text mining for organism and environment names
Lars Juhl Jensen
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Ian Foster
OVium Bioinformatic Solutions
OVium Bioinformatic Solutions
OVium Solutions
Scientific Data Management
Scientific Data Management
Alberto Labarga
The pragmatic text miner: It's just another type of poorly standardized data
The pragmatic text miner: It's just another type of poorly standardized data
Lars Juhl Jensen
The STITCH and Reflect web resources
The STITCH and Reflect web resources
Lars Juhl Jensen
Bioinformatics Data Pipelines built by CSIRO on AWS
Bioinformatics Data Pipelines built by CSIRO on AWS
Lynn Langit
One tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicine
Lars Juhl Jensen
Semantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical Informatics
Amit Sheth
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
Fabio Caligaris
Jan2016 pac bio giab
Jan2016 pac bio giab
GenomeInABottle
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
GigaScience, BGI Hong Kong
Optimizing queries via search server ElasticSearch: a study applied to large ...
Optimizing queries via search server ElasticSearch: a study applied to large ...
Alex Camargo
The pragmatic text miner: It’s just another type of poorly standardized data
The pragmatic text miner: It’s just another type of poorly standardized data
Lars Juhl Jensen
Similar to Tagger: Rapid dictionary-based named entity recognition
(20)
Genomes On Rails
Genomes On Rails
Welch Wordifier Bosc2009
Welch Wordifier Bosc2009
2014 Taverna Tutorial Introduction to eScience and workflows
2014 Taverna Tutorial Introduction to eScience and workflows
One tagger, many uses - Illustrating the power of ontologies in named entity ...
One tagger, many uses - Illustrating the power of ontologies in named entity ...
Computation and Knowledge
Computation and Knowledge
Next generation sequencing & microarray-- Genotypic Technology
Next generation sequencing & microarray-- Genotypic Technology
Text mining for organism and environment names
Text mining for organism and environment names
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
Discovery Engines for Big Data: Accelerating Discovery in Basic Energy Sciences
OVium Bioinformatic Solutions
OVium Bioinformatic Solutions
Scientific Data Management
Scientific Data Management
The pragmatic text miner: It's just another type of poorly standardized data
The pragmatic text miner: It's just another type of poorly standardized data
The STITCH and Reflect web resources
The STITCH and Reflect web resources
Bioinformatics Data Pipelines built by CSIRO on AWS
Bioinformatics Data Pipelines built by CSIRO on AWS
One tagger, many uses: Simple text-mining strategies for biomedicine
One tagger, many uses: Simple text-mining strategies for biomedicine
Semantic Web for Health Care and Biomedical Informatics
Semantic Web for Health Care and Biomedical Informatics
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
Jan2016 pac bio giab
Jan2016 pac bio giab
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Optimizing queries via search server ElasticSearch: a study applied to large ...
Optimizing queries via search server ElasticSearch: a study applied to large ...
The pragmatic text miner: It’s just another type of poorly standardized data
The pragmatic text miner: It’s just another type of poorly standardized data
More from Lars Juhl Jensen
Network visualization: A crash course on using Cytoscape
Network visualization: A crash course on using Cytoscape
Lars Juhl Jensen
STRING & STITCH: Network integration of heterogeneous data
STRING & STITCH: Network integration of heterogeneous data
Lars Juhl Jensen
Biomedical text mining: Automatic processing of unstructured text
Biomedical text mining: Automatic processing of unstructured text
Lars Juhl Jensen
Medical network analysis: Linking diseases and genes through data and text mi...
Medical network analysis: Linking diseases and genes through data and text mi...
Lars Juhl Jensen
Network Biology: A crash course on STRING and Cytoscape
Network Biology: A crash course on STRING and Cytoscape
Lars Juhl Jensen
Cellular networks
Cellular networks
Lars Juhl Jensen
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Lars Juhl Jensen
STRING & related databases: Large-scale integration of heterogeneous data
STRING & related databases: Large-scale integration of heterogeneous data
Lars Juhl Jensen
Network Biology: Large-scale integration of data and text
Network Biology: Large-scale integration of data and text
Lars Juhl Jensen
Medical text mining: Linking diseases, drugs, and adverse reactions
Medical text mining: Linking diseases, drugs, and adverse reactions
Lars Juhl Jensen
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and text
Lars Juhl Jensen
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Lars Juhl Jensen
Cellular Network Biology
Cellular Network Biology
Lars Juhl Jensen
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and text
Lars Juhl Jensen
Biomarker bioinformatics: Network-based candidate prioritization
Biomarker bioinformatics: Network-based candidate prioritization
Lars Juhl Jensen
The Art of Counting: Scoring and ranking co-occurrences in literature
The Art of Counting: Scoring and ranking co-occurrences in literature
Lars Juhl Jensen
Text-mining-based retrieval of protein networks
Text-mining-based retrieval of protein networks
Lars Juhl Jensen
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Lars Juhl Jensen
Gene association networks: Large-scale integration of data and text
Gene association networks: Large-scale integration of data and text
Lars Juhl Jensen
Protein association networks: Large-scale integration of data and text
Protein association networks: Large-scale integration of data and text
Lars Juhl Jensen
More from Lars Juhl Jensen
(20)
Network visualization: A crash course on using Cytoscape
Network visualization: A crash course on using Cytoscape
STRING & STITCH: Network integration of heterogeneous data
STRING & STITCH: Network integration of heterogeneous data
Biomedical text mining: Automatic processing of unstructured text
Biomedical text mining: Automatic processing of unstructured text
Medical network analysis: Linking diseases and genes through data and text mi...
Medical network analysis: Linking diseases and genes through data and text mi...
Network Biology: A crash course on STRING and Cytoscape
Network Biology: A crash course on STRING and Cytoscape
Cellular networks
Cellular networks
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
STRING & related databases: Large-scale integration of heterogeneous data
STRING & related databases: Large-scale integration of heterogeneous data
Network Biology: Large-scale integration of data and text
Network Biology: Large-scale integration of data and text
Medical text mining: Linking diseases, drugs, and adverse reactions
Medical text mining: Linking diseases, drugs, and adverse reactions
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and text
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Cellular Network Biology
Cellular Network Biology
Network biology: Large-scale integration of data and text
Network biology: Large-scale integration of data and text
Biomarker bioinformatics: Network-based candidate prioritization
Biomarker bioinformatics: Network-based candidate prioritization
The Art of Counting: Scoring and ranking co-occurrences in literature
The Art of Counting: Scoring and ranking co-occurrences in literature
Text-mining-based retrieval of protein networks
Text-mining-based retrieval of protein networks
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Gene association networks: Large-scale integration of data and text
Gene association networks: Large-scale integration of data and text
Protein association networks: Large-scale integration of data and text
Protein association networks: Large-scale integration of data and text
Recently uploaded
Early Development of Mammals (Mouse and Human).pdf
Early Development of Mammals (Mouse and Human).pdf
Department of Education Philippines
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
Dr. TATHAGAT KHOBRAGADE
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
DiariAli
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx .
Poonam Aher Patil
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNA
Cherry
Concept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdf
Cherry
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
Alex Henderson
PODOCARPUS...........................pptx
PODOCARPUS...........................pptx
Cherry
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
Areesha Ahmad
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Cherry
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
Goa Call Girls High Profile Escorts
Site specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdf
Cherry
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
muralinath2
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Deepika Singh
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
Cherry
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
bassianu17
FS P2 COMBO MSTA LAST PUSH past exam papers.
FS P2 COMBO MSTA LAST PUSH past exam papers.
takadzanijustinmaime
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
Recently uploaded
(20)
Early Development of Mammals (Mouse and Human).pdf
Early Development of Mammals (Mouse and Human).pdf
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx .
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Cot curve, melting temperature, unique and repetitive DNA
Cot curve, melting temperature, unique and repetitive DNA
Concept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdf
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
PODOCARPUS...........................pptx
PODOCARPUS...........................pptx
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
Site specific recombination and transposition.........pdf
Site specific recombination and transposition.........pdf
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Kanchipuram Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
FS P2 COMBO MSTA LAST PUSH past exam papers.
FS P2 COMBO MSTA LAST PUSH past exam papers.
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
Tagger: Rapid dictionary-based named entity recognition
1.
Lars Juhl Jensen Tagger Rapid
dictionary-based named entity recognition
2.
C++ engine
3.
flexible matching
4.
>1000 abstracts /
second
5.
inherently thread-safe
6.
comprehensive dictionary
7.
70–80% recall
8.
expansion rules
9.
80–90% precision
10.
curated blacklist
11.
entity types &
use cases
12.
genes/proteins
13.
assess studiedness
14.
Cannon et al.,
Bioinformatics, 2017newdrugtargets.org
15.
construct networks
16.
string-db.org Szklarczyk et
al., Nucleic Acids Research, 2017
17.
chemicals
18.
cellular components
19.
tissues
20.
diseases
21.
organisms
22.
environments
23.
interactive annotation
24.
Pafilis et al.,
Proceedings of BioCreative V, 2015extract.hcmr.gr
25.
real-time tagger
26.
BeCalm API
27.
Python wrapper
28.
163 lines of
code
29.
performance
30.
tagger.jensenlab.org/BeCalm
31.
32.
bottlenecks
33.
network latency
34.
text retrieval
35.
Acknowledgments Evangelos Pafilis Sune Pletscher- Frankild Damian
Szklarczyk Michael Kuhn
Download now