BUILDING A CROWDSOURCED
CHEMICAL DATABASE FROM
THE WEB
Árpád Figyelmesi
BACKGROUND
Chemistry in the deep
The Deep Web is the part of the World Wide Web not indexed by standard search engines.
• Limited access or scripted
• Web archives
• Chemistry is hardly indexed
• Buried under the waste
Chemicalize original concept
Free, web-based, experimental demonstration and advertising application for non-commercial use only.
chemicalize.org
beta
Eight years ago…
History
• 2008 Alpha release
• 2009 Webpage annotation
• 2010 Property calculation
• 2011 Chemical & Web search
Crowdsourced web exploration
Public pages visited by Chemicalize users
Auto-annotation scripts
Search results
Contribution to PubChem (2013)
• 300k structures
• 350k web pages
• 100k novel
Popularity (2015)
• 25k users / month
• 1 million structures, 2 million visited URLs
• A dozen blog posts and journal references
• Continuous valuable user feedback
Dark side:
• Scalability & performance
• Maintenance & operation
• Abuse and unfair usage
NEW CHEMICALIZE
Vision
Preserve current values but make Chemicalize a professional and much more powerful platform.
• Improve reliability
• Extend functionality
• Know and understand users
Development
• Secure
• Reliable
• Scalable
• Extensible
• Simple
• Fast
Full redesign and enterprise-ready reimplementation in a modular cloud architecture.
New business model
• Free registration
• Free basic functions
• Free credits monthly
• Pay-per-use
• Credit package system
Enough for most typical use cases
For more intensive usage
Instant cheminformatics solutions
Current modules
• Calculation: names, identifiers, physicochemical properties, e.g. pKa, logP/logD, solubility…
• Annotation: chemical structure recognition and extraction from web pages
• Search: combined chemical and text search with relevance scoring, hit highlighting…
• Compliance: compliance check with regulations on psychotropic drugs, explosives, toxic agents
+ Extensible with any further modules
NEW HEART
Annotation
Improved annotation view for modern web pages with better CSS and JS support
• GooglePatents
• ScienceDirect
• Wiley Online Library
Content
More preloaded content and proactive web exploration in addition to crowdsourcing
Processed in the first stage:
• English Wikipedia: 5 million articles
• USPTO grants: last 5 years
• Chemicalize: 800k URLs
Search
New engine offering unlimited combination of chemical and keyword search
• Substructure, full, similarity
• Name, SMILES, InChI, CAS
• Full text, field
• Boolean, proximity, wildcard
Query examples
acetylsalicylic acid AND fever
Pages mentioning aspirin, acetylsalicylic acid, 2-(acetyloxy)benzoic acid or any chemically equivalent term together with fever.
SUB:benzene
Pages containing any structure that has benzene as a substructure, for example toluene, phenol or benzoic acid.
SIM:viagra AND "half-life" AND "pulmonary arterial hypertension"
Pages containing structures chemically similar to Viagra and the phrases "half-life" and "pulmonary arterial hypertension".
(c?emotherap* AND ("Phosphoinositide 3-kinases"~3 OR Pi3K)) AND FULL:idelalisib
Wildcard operators: ? for one character, * for multiple characters. Proximity operator: "term1 term2"~distance. Phrase: "term1 term2".
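The wildcard and proximity operators can be illustrated with a small Python sketch. This is not Chemicalize's implementation: the function names are hypothetical, `*` is assumed to match zero or more characters, and proximity is assumed to mean a simple difference in word positions, since the slides do not specify the exact semantics.

```python
import re


def wildcard_to_regex(pattern: str) -> re.Pattern:
    """Translate a wildcard term: ? = exactly one character,
    * = any run of characters (zero or more is an assumption)."""
    parts = []
    for ch in pattern:
        if ch == "?":
            parts.append(".")
        elif ch == "*":
            parts.append(".*")
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.IGNORECASE)


def wildcard_match(pattern: str, word: str) -> bool:
    """True if the whole word matches the wildcard pattern."""
    return wildcard_to_regex(pattern).fullmatch(word) is not None


def within_distance(text: str, term1: str, term2: str, distance: int) -> bool:
    """True if term1 and term2 occur at most `distance` word positions apart.
    Assumed semantics; a real engine may count distance differently."""
    words = re.findall(r"[\w-]+", text.lower())
    pos1 = [i for i, w in enumerate(words) if w == term1.lower()]
    pos2 = [i for i, w in enumerate(words) if w == term2.lower()]
    return any(abs(i - j) <= distance for i in pos1 for j in pos2)


if __name__ == "__main__":
    print(wildcard_match("c?emotherap*", "chemotherapy"))
    print(within_distance("half-life measured in plasma", "half-life", "plasma", 3))
```

Under these assumptions, `c?emotherap*` accepts both "chemotherapy" and "chemotherapeutic" but rejects "cemotherapy", because `?` requires exactly one character in that position.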
chemicalize.com
THANK YOU
Árpád Figyelmesi

ICIC 2016: Building a Crowdsourced Chemical Database from the Web (Bring Deep Web Content to the Surface)

AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
Dr. Haxel Consult
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
Dr. Haxel Consult
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
Dr. Haxel Consult
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
Dr. Haxel Consult
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
Dr. Haxel Consult
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
Dr. Haxel Consult
 

ICIC 2016: Building a Crowdsourced Chemical Database from the Web (Bring Deep Web Content to the Surface)

  • 1. BUILDING A CROWDSOURCED CHEMICAL DATABASE FROM THE WEB Árpád Figyelmesi
  • 3. Chemistry in the deep: The Deep Web is the part of the World Wide Web that is not indexed by standard search engines. • Limited access or scripted • Web archives • Chemistry is hardly indexed • Buried under the waste
  • 4. Chemicalize original concept Free, web based, experimental, demonstration and advertising application for non-commercial use only. chemicalize.org beta Eight years ago…
  • 5. History • 2008 Alpha release • 2009 Webpage annotation • 2010 Property calculation • 2011 Chemical & Web search
  • 6. Crowdsourced web exploration Public pages visited by Chemicalize users Auto annotations scripts Search results
  • 7. Contribution to PubChem (2013) • 300k structures • 350k web pages • 100k novel
  • 8. Popularity (2015) • 25k users / month • 1 million structures • 2 million visited URLs • A dozen blog posts and journal references • Continuous valuable user feedback Dark side: • Scalability & performance • Maintenance & operation • Abuse and unfair usage
  • 10. Vision Preserve current values but make Chemicalize a professional and much more powerful platform. • Improve reliability • Extend functionality • Know and understand users
  • 11. Development • Secure • Reliable • Scalable • Extensible • Simple • Fast Full redesign and enterprise ready reimplementation in a modular cloud architecture.
  • 12. New business model: Instant cheminformatics solutions • Free registration • Free basic functions • Free credits monthly (enough for most typical use cases) • Pay-per-use • Credit package system (for more intensive usage)
  • 13. Current modules • Calculation: names, identifiers, physicochemical properties, e.g. pKa, logP/logD, solubility… • Annotation: chemical structure recognition and extraction from web pages • Search: combined chemical and text search with relevance scoring, hit highlighting… • Compliance: compliance check with regulations on psychotropic drugs, explosives, toxic agents + Extensible with any further modules
  • 15. Annotation: Improved annotation view for modern web pages with better CSS and JS support • Google Patents • ScienceDirect • Wiley Online Library
  • 16. Content: More preloaded content and proactive web exploration in addition to crowdsourcing. Processed in the first stage: • English Wikipedia: 5 million articles • USPTO grants: last 5 years • Chemicalize: 800k URLs
  • 17. Search: New engine offering unlimited combinations of chemical and keyword search • Substructure, full, similarity • Name, SMILES, InChI, CAS • Full text, field • Boolean, proximity, wildcard
  • 18. Query examples
      • acetylsalicylic acid AND fever: containing Aspirin, acetylsalicylic acid, 2-(acetyloxy)benzoic acid or any chemically equivalent term, together with "fever".
      • SUB:benzene: containing any structure that has benzene as a substructure, for example toluene, phenol, benzoic acid.
      • SIM:viagra AND "half-life" AND "pulmonary arterial hypertension": containing structures chemically similar to Viagra and containing "half-life" and "pulmonary arterial hypertension".
      • (c?emotherap* AND ("Phosphoinositide 3-kinases"~3 OR Pi3K)) AND FULL:idelalisib: wildcard operators are ? for one character and * for multiple characters; the proximity operator is "term1 term2"~distance; a phrase is "term1 term2".
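
The operators in the query examples can be illustrated with a minimal Python sketch. It is not the Chemicalize engine: the function names are hypothetical, `*` is assumed to match zero or more characters, the proximity distance is assumed to be counted in word positions regardless of order, and the `TEXT` label for unprefixed terms is invented for illustration.

```python
import re

# Prefixes that appear in the slide's query examples.
CHEMICAL_PREFIXES = {"SUB", "SIM", "FULL"}


def split_prefix(term: str) -> tuple[str, str]:
    """Split 'SUB:benzene' into ('SUB', 'benzene').

    Unprefixed terms get the hypothetical label 'TEXT'.
    """
    prefix, sep, rest = term.partition(":")
    if sep and prefix in CHEMICAL_PREFIXES:
        return prefix, rest
    return "TEXT", term


def wildcard_to_regex(pattern: str) -> re.Pattern:
    """Translate a wildcard term into a regex.

    '?' stands for exactly one character; '*' is assumed to stand
    for any run of characters, including an empty one.
    """
    parts = []
    for ch in pattern:
        if ch == "?":
            parts.append(".")
        elif ch == "*":
            parts.append(".*")
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.IGNORECASE)


def matches_wildcard(pattern: str, word: str) -> bool:
    """True if the whole word matches the wildcard pattern."""
    return wildcard_to_regex(pattern).fullmatch(word) is not None


def within_proximity(text: str, term1: str, term2: str, distance: int) -> bool:
    """True if term1 and term2 occur at most `distance` word positions apart.

    This is one assumed reading of "term1 term2"~distance.
    """
    words = re.findall(r"\w+", text.lower())
    pos1 = [i for i, w in enumerate(words) if w == term1.lower()]
    pos2 = [i for i, w in enumerate(words) if w == term2.lower()]
    return any(abs(i - j) <= distance for i in pos1 for j in pos2)
```

For example, `matches_wildcard("c?emotherap*", "chemotherapy")` holds under these assumptions, and `split_prefix("FULL:idelalisib")` separates the search mode from the structure name.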