Global Names Recognition and
Discovery (GNRD)
• High-throughput, queue-based "skin" on multiple processes of scientific name-finding engines
– NetiNeti: Python, machine-learning-based
– TaxonFinder: Perl, dictionary-based
• Inputs: any file, URL, free-form text
– Uses Docsplit gem (Tesseract OCR as needed)
– Can send gzip request
• Outputs: JSON/XML
– Scientific names & their character offsets
– OCR text
– Resolved names
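To make "scientific names & their character offsets" concrete, here is a toy dictionary-based finder in Python. It is only a sketch of the general idea and does not reproduce TaxonFinder, NetiNeti, or the GNRD response format; the dictionary, the function name `find_names`, and the output keys `offsetStart` / `offsetEnd` are all assumptions made for illustration.

```python
import json
import re

# Hypothetical mini-dictionary; a real engine would use a far larger one.
KNOWN_NAMES = ["Ursus arctos", "Ursus", "Camponotus claripes"]


def find_names(text, names=KNOWN_NAMES):
    """Return dictionary matches with their start/end character offsets."""
    # Try longer names first so "Ursus arctos" wins over the bare genus "Ursus".
    alternatives = sorted(names, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(n) for n in alternatives) + r")\b"
    )
    return [
        {
            "scientificName": m.group(0),
            "offsetStart": m.start(),  # index of the first character
            "offsetEnd": m.end(),      # index one past the last character
        }
        for m in pattern.finditer(text)
    ]


text = "The brown bear (Ursus arctos) is one species of Ursus."
found = find_names(text)
print(json.dumps(found, indent=2))
```

With offsets in hand, a client can slice the original (or OCR'd) text to highlight or link each name without searching for it again.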
GNRD Clients & Applications
15,000 OCR’d articles, 1868 - 2002
All with DOIs
158,000 unique scientific names
92,000 vernaculars
20,000 entities
No Consistency in Search APIs
{
  "totalResults": 152,
  "startIndex": 1,
  "itemsPerPage": 30,
  "results": [
    {
      "id": 14349,
      "title": "Ursus",
      "link": "http://eol.org/14349?action=overview&controller=taxa",
      "content": "Ursus Linnaeus, 1758; Ursus; Ursus (genus); Ursus (genus) Linnaeus, 1758; Ursus Arctos Bruinosus"
    },
    { ... }
  ],
  "first": "http://eol.org/api/search/Ursus.json?page=1",
  "self": "http://eol.org/api/search/Ursus.json?page=1",
  "next": "http://eol.org/api/search/Ursus.json?page=2",
  "last": "http://eol.org/api/search/Ursus.json?page=6"
}
EOL (response above): http://eol.org/api/search/1.0.json?q=Ursus
GBIF (response below): http://api.gbif.org/name_usage/search?q=Ursus
{
  offset: 0,
  limit: 20,
  endOfRecords: false,
  count: 77,
  results: [
    {
      datasetTitle: "English Wikipedia Species Pages",
      parent: "Ursidae",
      kingdom: "Animalia",
      phylum: "Chordata",
      clazz: "Mammalia",
      order: "Carnivora",
      family: "Ursidae",
      genus: "Ursus",
      scientificName: "Ursus",
      canonicalName: "Ursus",
      authorship: "",
      nameType: "WELLFORMED",
      rank: "GENUS",
      …
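The practical cost of this inconsistency is that every client needs one adapter per provider. A minimal Python sketch follows, using only the paging fields visible in the two responses; the function names, the common output shape, and the reading of `startIndex` as a 1-based item index are assumptions, and the sample records are trimmed from the responses shown.

```python
def normalize_eol(resp):
    """Map the EOL-style paging fields onto one common shape."""
    return {
        # Assumption: startIndex is 1-based (the sample shows 1 on page 1).
        "offset": resp["startIndex"] - 1,
        "limit": resp["itemsPerPage"],
        "total": resp["totalResults"],
        "results": resp["results"],
    }


def normalize_gbif(resp):
    """Map the GBIF-style paging fields onto the same shape."""
    return {
        "offset": resp["offset"],
        "limit": resp["limit"],
        "total": resp["count"],
        "results": resp["results"],
    }


# Trimmed stand-ins for the two responses.
eol = {"totalResults": 152, "startIndex": 1, "itemsPerPage": 30,
       "results": [{"id": 14349, "title": "Ursus"}]}
gbif = {"offset": 0, "limit": 20, "endOfRecords": False, "count": 77,
        "results": [{"scientificName": "Ursus", "rank": "GENUS"}]}

for page in (normalize_eol(eol), normalize_gbif(gbif)):
    print(page["offset"], page["limit"], page["total"], len(page["results"]))
```

The record-level keys ("title" versus "scientificName") still differ after this step, which is where shared Darwin Core terms would help.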
Use Darwin Core Terms
OpenURL
• Created in late 1990s by a Flemish librarian
• e.g., v0.1:
http://resolver.example.edu/cgi?genre=book&isbn=0836218310&title=The+Far+Side+Gallery+3
• But no specification for response structure!!!
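The request side of OpenURL is just a set of key/value pairs in a query string, so it is easy to build with Python's standard library. This sketch reconstructs the example above; the resolver address is the placeholder from the slide, not a real service.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

base = "http://resolver.example.edu/cgi"  # placeholder resolver from the slide
params = {
    "genre": "book",
    "isbn": "0836218310",
    "title": "The Far Side Gallery 3",
}

# urlencode() encodes spaces as "+", as in the v0.1 example.
url = base + "?" + urlencode(params)
print(url)

# Reading the request back is equally mechanical; the response is the part
# that has no agreed structure.
print(parse_qs(urlsplit(url).query))
```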
bibJSON
{
"title": "Open Bibliography for Science, Technology and Medicine",
"author":[
{"name": "Richard Jones"},
{"name": "Mark MacGillivray"},
{"name": "Peter Murray-Rust"},
{"name": "Jim Pitman"},
{"name": "Peter Sefton"},
{"name": "Ben O'Steen"},
{"name": "William Waites"}
],
"type": "article",
"year": "2011",
"journal": {"name": "Journal of Cheminformatics"},
"link": [{"url":"http://www.jcheminf.com/content/3/1/47"}],
"identifier": [{"type":"doi","id":"10.1186/1758-2946-3-47"}]
}
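Because the keys are predictable, a client can pull out what it needs with a few lines. A small Python sketch over a trimmed copy of the record above; the helper name `identifiers_of` is hypothetical.

```python
import json

# Trimmed copy of the bibJSON record shown above.
record = json.loads("""
{
  "title": "Open Bibliography for Science, Technology and Medicine",
  "author": [{"name": "Richard Jones"}, {"name": "Mark MacGillivray"}],
  "type": "article",
  "year": "2011",
  "identifier": [{"type": "doi", "id": "10.1186/1758-2946-3-47"}]
}
""")


def identifiers_of(rec, id_type):
    """Return every identifier of the given type (e.g. "doi")."""
    return [i["id"] for i in rec.get("identifier", []) if i.get("type") == id_type]


authors = [a["name"] for a in record.get("author", [])]
dois = identifiers_of(record, "doi")
print(authors)
print(dois)
```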
Recommendation
• Use DwC terms as query params for find or ‘q’ for search
• Use DwC terms as keys in JSON responses
http://www.antweb.org/description.do?name=claripes%20orbiculatopunctatus&genus=camponotus&rank=species&project=worldants
http://www.antweb.org/description.do?specificEpithet=claripes&infraspecificEpithet=orbiculatopunctatus&genus=camponotus&taxonRank=species&project=worldants
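The change from the first URL to the second can be expressed as a small parameter mapping. The Python sketch below is illustrative only: the function name is hypothetical, and splitting `name` on whitespace into a species epithet and an infraspecific epithet is an assumption that fits this example rather than a general rule.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def to_dwc_query(url):
    """Rewrite ad hoc query params (name, rank) as Darwin Core terms."""
    parts = urlsplit(url)
    out = []
    for key, value in parse_qsl(parts.query):
        if key == "name":
            # Assumption: "name" holds one or two space-separated epithets.
            epithets = value.split()
            if epithets:
                out.append(("specificEpithet", epithets[0]))
            if len(epithets) > 1:
                out.append(("infraspecificEpithet", epithets[1]))
        elif key == "rank":
            out.append(("taxonRank", value))
        else:
            out.append((key, value))  # genus, project, ... pass through
    return urlunsplit(parts._replace(query=urlencode(out)))


old = ("http://www.antweb.org/description.do?name=claripes%20orbiculatopunctatus"
       "&genus=camponotus&rank=species&project=worldants")
print(to_dwc_query(old))
```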
Canadensys:
Vascular Plants of Canada
(VASCAN)
Luc Brouillet, Peter Desmet, et al.
http://data.canadensys.net/vascan
http://data.canadensys.net/vascan/name/Carex%20abbreviata
http://data.canadensys.net/vascan/taxon/26512
http://doi.org/10.3897/phytokeys.25.3100
http://creativecommons.org/publicdomain/zero/1.0/
Suggestions for AntCat
• Run literature through GNRD
• Simplify web presence with concentration on search as the entry point
– Index all available content
– Present "pages" as declarations of relationships
• Use Darwin Core terms in "find" and "search" services
• Produce a Darwin Core Archive (DwC-A), apply a CC0 waiver, write a data paper, publish to GBIF, and make the data accessible to Global Names (GN)

GlobalNames - Canadensys - Shorthouse
