SlideShare a Scribd company logo
1 of 40
Download to read offline
Dr. Alexander Klenner-Bajaja 30-31 March 2017Data Scientist Requirements Engineering-Solution Design | Dir. 2.8.3.3
Towards semantic search at the European
Patent Office
II-SDV 2017
European Patent Office
About me
Alexander Klenner-Bajaja
▪ Bioinformatics at Goethe University Frankfurt
▪ PhD ETH Zurich & Goethe University –
in Cheminformatics
▪ PostDoc at Fraunhofer Society SCAI in Cologne
− Chemical entity recognition in patents
▪ Data Scientist, Search & Knowledge, DG2, EPO
− automated search
− search benchmarking
− new search technologies
2
2006
2011
2013
current
European Patent Office
▪ Anecdotal Evidence and the need for a Benchmarking Environment
▪ Keywords and query generation in (automatic) search scenarios
(Project ERa)
▪ Introducing machine learned semantic search technologies and
(automatic) query expansion
▪ Introducing terminology based semantics within the APL project
▪ (Automated) Search result confidence
3
European Patent Office
Anecdotal Evidence and Benchmarking
▪ How can two different search
strategies be compared?
▪ Historically decision are taken
based on expert feedback
▪ Valuable information that can
be complemented with
measurements
▪ Distribution shown as Box-plot
4
median:
5 citations
in search
reports
European Patent Office
Search Benchmarking
5
▪ We implemented a holistic prototyping and benchmark system
▪ We have access to all distributed system in the EPO IT landscape
and can connect them in "visual" workflows
European Patent Office
Automated Queries exploiting the shown tools
6
Application
Automated
Query
Formulation
relevant documents
Evaluation and
Feedback
Search
PriorArt
European Patent Office
Benchmarking Environment
▪ From anecdotal to statistical evidence
▪ "Towards Automated Prior Art Search" (TAPAS)
7
Patent Corpus
Citations
Split Date
Applications
Method 1 Method 2
Ø Recall@100: 0.25 Ø Recall@100: 0.17
Search
Corpus
Result 1 Result 2
Data Set
Generation
Retrospective
Search
trec_eval trec_eval
European Patent Office
▪ Retrieval success measured by different metrics
▪ Computed by the trec_eval tool
▪ Average over all n simulated applications
Benchmarking Environment
8
Top 50 Documents Returned Cited Documents not Returned
Recall @ 50
Hit Rate @ 50
European Patent Office
Benchmarking Environment
9
▪ Search reports for about 40 million simple patent families
▪ The relevant documents are mentioned in the search report as
either
– X or A citations
▪ Only few citations per document (2 X-category, 3 A-category)
0 40x106patent documents
X X
A
relevant
European Patent Office
Benchmarking Environment
10
▪ Search reports for about 40 million simple patent families
▪ The relevant documents are mentioned in the search report as either
– X or A citations
▪ Only few citations per document (2 X-category, 3 A-category)
▪ But more "soft" information available: Returned during search (R), put
aside for detailed inspection (Ins) , printed (Pr), stored for citation (Cs)
relevance
0 80x106patent documents
X X
A
R
Ins
Pr
European Patent Office
▪ Anecdotal Evidence and the need for a Benchmarking Environment
▪ Keywords and query generation in (automatic) search scenarios
(Project ERa)
▪ Introducing machine learned semantic search technologies and
(automatic) query expansion
▪ Introducing terminology based semantics within the APL project
▪ (Automated) Search result confidence
11
European Patent Office
Term-Based Search
▪ Can we hope to find all citations with keywords?
 overlap in vocabulary (non-stopwords)
12
Application Citation
Overlap
European Patent Office
Term-Based Search
13
▪ However: also overlap with random documents
▪ Random: Y-Scrambling
CitedDocumentsRandomDocuments
Frequency
Overlapping Terms
All Terms Nouns, no Stop Words Keywords (TF)
Overlapping Terms Overlapping Terms
Overlapping Terms Overlapping Terms Overlapping Terms
Frequency
Frequency
Frequency
Frequency
Frequency
European Patent Office
Term-Based Search fully automated
14
▪ How good can we get?
▪ More Like This on Title / Abstract / Description / Claims
▪ Percentage of citations found in the top k
Top 100 Top 1 000 Top 10 000 Top 100 000
0%
29%
57%
80%
European Patent Office
Term-Based Search fully automated
▪ Can we close the gap?
... using semantic search?
15
Top 100 Top 1 000 Top 10 000 Top 100 000
0%
29%
57%
80%
European Patent Office
▪ Anecdotal Evidence and the need for a Benchmarking Environment
▪ Keywords and query generation in (automatic) search scenarios
(Project ERa)
▪ Introducing machine learned semantic search technologies and
(automatic) query expansion
▪ Introducing terminology based semantics within the APL project
▪ (Automated) Search result confidence
16
European Patent Office
Introducing "Semantics"
17
Virus
European Patent Office
Automated Query Expansion
▪ Analyze word embeddings
▪ Words with similar meaning have similar context
▪ Train neural network  words represented by vectors
▪ Words with similar meaning have similar vectors
 allows distance calculations between words
 semantic similarity
18
Word
eating apples is healthy
Context
eating fruits is healthy
apple = [0.253, 1.793, ..., 5.555, 3,142]
tomato
apple fruit
banana
steak
liver
European Patent Office
Automated Query Expansion
19
Patent
Corpus
CPC w2v model
A01B M1
A02B M2
A02C M3
H05K MN-1
All MN
Query Word
top X
relevant
words per
class
European Patent Office
Automated Query Expansion
A61K
A virus is a small infectious agent
that replicates only inside the
living cells of other organisms.
G06F
A computer virus is a type of
malicious software program
("malware") that, when executed,
replicates by reproducing itself
(copying its own source code) or
infecting other computer programs
by modifying them.
20
hiv, viral, human
immunodeficiency, viral
genome, genome, replication,
pathogen, adenovirus,
replicating infected, virus infection, malicious,
anti-virus, malware, virus-
scanning, virus worm, macro virus,
virus scanner
European Patent Office
Semantics through Machine Learning
Example Queries: descriptions from Wikipedia
.
21
A virus is a small infectious
agent that replicates only
inside the living cells of other
organisms.
A computer virus is a type of
malicious software program
("malware") that, when executed,
replicates by reproducing itself
(copying its own source code) or
infecting other computer programs
by modifying them.
Electrical digital
data processing
Preparations for medical,
dental or toilet purposes
European Patent Office
Automated Query Expansion
Inter model similarity in
CPC A Human Necessities
based on vocabularies
22
A47KA45DA47L
European Patent Office
Automated Query Expansion
▪ Word2Vec Models applied to (100) extracted TF-IDF Keywords
23
"Touch sensing system and display apparatus"
EP2869168A120150506
TF-IDF Keywords
(10 of 100)
TF-IDF
Scores
Word2Vec Enrichments
(G06F)
max 5 terms, at least Cosine 0.6
possessory
sensing
touch
bridge
nodes
shared
electrodes
controller
instructing
masters
memory
sensor
algorithm
senses
data
...
49.01
33.04
31.36
27.88
27.59
26.86
25.52
23.18
22.11
21.23
20.39
20.38
20.02
20.02
19.91
...
possessory
sensing OR sensed OR sensor OR touch
touch OR touched OR touch screen OR touches OR touch panel OR touching
bridge OR bridges OR pci bus OR bus OR interfaces 22a-22b OR pci-pci bridge
nodes OR node OR nodes n2
shared OR share OR sharing OR non-shared
electrodes OR electrode OR electrodes y2
controller OR control OR controllers OR bus OR unit OR processor
instructing OR instruct OR instructs
masters
memory OR ram OR non-volatile OR memories OR processor OR flash
sensor OR sensors OR sensing OR humidity sensor OR sensed OR sensor senses
algorithm OR algorithms
senses OR sensed
data OR stored OR stores OR store OR storing
...
European Patent Office
Automated Query Expansion
▪ Query Enrichment brings increased performance
EP2869168A120150506
24
"Touch sensing system and display apparatus"
Recall@100: 0
(0 of 2)
Recall@100: 1
(2 of 2)
Keyword
Extraction
W2V
Enrichment
TF-IDF Query
W2V Query
European Patent Office
Automated Query Expansion
▪ Documents found with increased Term Overlap with Query
25
39100958
48143418
Recall@100: 0
(0 of 2)
Recall@100: 1
(2 of 2)
39 80
43 80
Query Citation
TF-IDF
Query
W2V Query
European Patent Office
Automated Query Expansion
▪ Word2Vec Expansion Terms in Overlap
▪ Re-Suggestion of known Query Terms
26
outputs
apparatus
screen
increasing
serially
signal OR signals OR circuit OR output OR clock OR outputs
controlling OR control OR controlled OR apparatus
display OR screen OR displays OR displaying OR displayed OR lcd
increases OR increase OR decreases OR increased OR increasing OR decrease
decreasing OR increasing OR decrease OR increase OR decreases OR decreased
parallel OR serially
New Overlap Original Keyword Word2Vec Enrichment
transmission transmitted OR transmits OR received OR transmit OR transmission OR transmitting
reception OR transmission OR transmitted OR transmits
transmitting OR receiving OR transmitted OR transmission OR sending OR transmit
Term Original Keyword Word2Vec Enrichment
▪ Should this result in higher weight?
▪ Models heavily suggest inflections
European Patent Office
▪ Anecdotal Evidence and the need for a Benchmarking Environment
▪ Keywords and query generation in (automatic) search scenarios
(Project ERa)
▪ Introducing machine learned semantic search technologies and
(automatic) query expansion
▪ Introducing terminology based semantics within the APL project
▪ (Automated) Search result confidence
27
European Patent Office
Introducing semantics through annotations
We are in the process of annotating the full prior art collection with
normalised
▪ Chemical Entities
▪ Physical Units
▪ Citations
▪ Controlled Terminologies (e.g. MeSH)
to enable a semantic search of those entities.
28
Patent
Collection
Patent
Collection
Annotation Pipeline
Aspirin
400 mm
solar panel
European Patent Office
Introducing annotations and semantics
▪ Our annotation platform's key elements
− Based on the open source framework UIMA
− Plug and play of new components (analysis engines)
− Quality Evaluation Workbench
− Knowledge Base for controlled vocabulary
− Scaling architecture
− Data and annotations stored in noSQL databases
29
Application Citation
acetylsalicylic acid
(D001241)
matching terms through
controlled vocabulary
MeSH Unique ID: D001241
Aspirin
(D001241)
European Patent Office
Introducing annotations and semantics
Project in Execution - Platform installed and tested at this moment
30
1. Define (sub)set of documents to work on
2. Define an annotation Pipeline
European Patent Office
Word2Vec & APL
31
Application Citation
Aspirin acetylsalicylic acid
matching terms through
controlled vocabulary
Aspirin = acetylsalicylic acid
(would have been
False Negative)
Virus Virus
separating identical terms
with different meaning
Virus = Virus
(would have been
False Positive)
European Patent Office
▪ Anecdotal Evidence and the need for a Benchmarking Environment
▪ Keywords and query generation in (automatic) search scenarios
(Project ERa)
▪ Introducing machine learned semantic search technologies and
(automatic) query expansion
▪ Introducing terminology based semantics within the APL project
▪ (Automated) Search result confidence
32
European Patent Office
Identify Successful (Automated) Searches
Machine-Learning: look at the documents
33
Word2Vec Heatmap
Sequence Alignment
(Bioinformatics)
document1document2
document1document2
document 0
document 0
document 0
document 0
Citation
Random
European Patent Office
Can we identify successful searches? (Machine learned)
▪ Prediction Model for Citability based on Distribution Statistics of
heat maps
▪ 10-fold Cross Validation
(Y-Scrambling)
34
[mean, variance, skewness, kurtosis, min, q1, q2, q3, max]
Random ForestMulti-Layer Perceptron
▪ ensemble of decision trees
▪ bootstrap aggregating
▪ random selection of features
Training Set
Validation Set
Round 1
Error
Rate 1
Round 2
Error
Rate 2
Round 3
Error
Rate 3
Round 10
Error
Rate 10
~8% ~5%
Predicted
Relevance
European Patent Office
Identify Successful (Automated) Searches
▪ Meta Data: same author as in application occurs in result set
▪ Interesting Item Set Mining & Subgroup Discovery
35
Searches with same author in
result set (35%)
Recall@100 0.2357
Searches without same author in
result set (65%)
Recall@100 0.1209
All searches (100%)
Recall@100 0.1608
Only documents from same author
Recall@100 0.0884
MLT Search
0 100
Document Rank
Recall
European Patent Office
Complementariness of Results
36
"Touch sensing system and display apparatus"Query
Method 1 Method 2
European Patent Office
Complementariness of Results
37
"Touch sensing system and display apparatus"Query
Method 1 Method 2 Method 3 Method i...
European Patent Office
Complementariness of Results
38
"Touch sensing system and display apparatus"Query
Method 1 Method 2 Method 3 Method i...
European Patent Office
Conclusion and outline
▪ We are exploiting state-of-the-art semantic- and other search
technologies
▪ We have created a prototyping and benchmarking environment to
evaluate the performance and quality of a new search algorithm
learning from prior searches
▪ Bringing all these new technologies together in a productive search
environment is the biggest challenge ahead
▪ Semantic (text) search is one small mosaic piece in the complex
search environment at the EPO
39
European Patent Office
Acknowledgments
▪ The Search & Knowledge Directorate
− Domenico Golzio Director
− Stefan Klocke Head of Department
− Volker Hähnke Ranking and Confidence
− Matthias Wirth Neo4J, CPC classification
− Pavel Goncharik Ansera, ranking, lucene
▪ Examiners
− Robert Herman ERa
− Ype Kingma ERa
− Markus Arndt ERa
− Christos Stergio APL
40

More Related Content

What's hot

II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceDr. Haxel Consult
 
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...Dr. Haxel Consult
 
II-PIC 2017: Artificial Intelligence, Machine Learning, And Deep Neural Netwo...
II-PIC 2017: Artificial Intelligence, Machine Learning, And Deep Neural Netwo...II-PIC 2017: Artificial Intelligence, Machine Learning, And Deep Neural Netwo...
II-PIC 2017: Artificial Intelligence, Machine Learning, And Deep Neural Netwo...Dr. Haxel Consult
 
II-PIC 2017: Product Presentation Gridlogics
II-PIC 2017: Product Presentation GridlogicsII-PIC 2017: Product Presentation Gridlogics
II-PIC 2017: Product Presentation GridlogicsDr. Haxel Consult
 
II-SDV 2017: Applications of RNN (Recurrent Neural Networks) within Machine T...
II-SDV 2017: Applications of RNN (Recurrent Neural Networks) within Machine T...II-SDV 2017: Applications of RNN (Recurrent Neural Networks) within Machine T...
II-SDV 2017: Applications of RNN (Recurrent Neural Networks) within Machine T...Dr. Haxel Consult
 
II-PIC 2017: Gain insight into technical, legal and business information thro...
II-PIC 2017: Gain insight into technical, legal and business information thro...II-PIC 2017: Gain insight into technical, legal and business information thro...
II-PIC 2017: Gain insight into technical, legal and business information thro...Dr. Haxel Consult
 
II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...
II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...
II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...Dr. Haxel Consult
 
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...Dr. Haxel Consult
 
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...Dr. Haxel Consult
 
II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patent...
II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patent...II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patent...
II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patent...Dr. Haxel Consult
 
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical ResearchII-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical ResearchDr. Haxel Consult
 
II-PIC 2017: To err is human – growing in experience as a patent information ...
II-PIC 2017: To err is human – growing in experience as a patent information ...II-PIC 2017: To err is human – growing in experience as a patent information ...
II-PIC 2017: To err is human – growing in experience as a patent information ...Dr. Haxel Consult
 
II-PIC 2017: Product Presentation LexisNexis
II-PIC 2017: Product Presentation LexisNexisII-PIC 2017: Product Presentation LexisNexis
II-PIC 2017: Product Presentation LexisNexisDr. Haxel Consult
 
II-SDV 2017: How Visualisation of Open Patent Data can help with Strategic De...
II-SDV 2017: How Visualisation of Open Patent Data can help with Strategic De...II-SDV 2017: How Visualisation of Open Patent Data can help with Strategic De...
II-SDV 2017: How Visualisation of Open Patent Data can help with Strategic De...Dr. Haxel Consult
 
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...Dr. Haxel Consult
 
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterOSTHUS
 

What's hot (20)

II-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in NiceII-SDV 2015, 20 - 21 April, in Nice
II-SDV 2015, 20 - 21 April, in Nice
 
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
II-SDV 2012 Dealing with Large Data Volumes in Statistical Analysis and Text ...
 
II-PIC 2017: Artificial Intelligence, Machine Learning, And Deep Neural Netwo...
II-PIC 2017: Artificial Intelligence, Machine Learning, And Deep Neural Netwo...II-PIC 2017: Artificial Intelligence, Machine Learning, And Deep Neural Netwo...
II-PIC 2017: Artificial Intelligence, Machine Learning, And Deep Neural Netwo...
 
AI-SDV 2020: Kairntech
AI-SDV 2020: KairntechAI-SDV 2020: Kairntech
AI-SDV 2020: Kairntech
 
II-PIC 2017: Product Presentation Gridlogics
II-PIC 2017: Product Presentation GridlogicsII-PIC 2017: Product Presentation Gridlogics
II-PIC 2017: Product Presentation Gridlogics
 
IC-SDV 2018: Averbis
IC-SDV 2018: AverbisIC-SDV 2018: Averbis
IC-SDV 2018: Averbis
 
II-SDV 2017: Applications of RNN (Recurrent Neural Networks) within Machine T...
II-SDV 2017: Applications of RNN (Recurrent Neural Networks) within Machine T...II-SDV 2017: Applications of RNN (Recurrent Neural Networks) within Machine T...
II-SDV 2017: Applications of RNN (Recurrent Neural Networks) within Machine T...
 
SciBite
SciBiteSciBite
SciBite
 
II-PIC 2017: Gain insight into technical, legal and business information thro...
II-PIC 2017: Gain insight into technical, legal and business information thro...II-PIC 2017: Gain insight into technical, legal and business information thro...
II-PIC 2017: Gain insight into technical, legal and business information thro...
 
II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...
II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...
II-PIC 2017: The Use of Patent Information for Innovation and Competitive Int...
 
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
 
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
IC-SDV 2018: Aleksandar Kapisoda (Boehringer) Using Machine Learning for Auto...
 
II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patent...
II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patent...II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patent...
II-SDV 2014 The Challenges of Managing “Big Data” in the Patent Field: Patent...
 
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical ResearchII-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
 
II-PIC 2017: To err is human – growing in experience as a patent information ...
II-PIC 2017: To err is human – growing in experience as a patent information ...II-PIC 2017: To err is human – growing in experience as a patent information ...
II-PIC 2017: To err is human – growing in experience as a patent information ...
 
II-PIC 2017: Product Presentation LexisNexis
II-PIC 2017: Product Presentation LexisNexisII-PIC 2017: Product Presentation LexisNexis
II-PIC 2017: Product Presentation LexisNexis
 
II-SDV 2017: How Visualisation of Open Patent Data can help with Strategic De...
II-SDV 2017: How Visualisation of Open Patent Data can help with Strategic De...II-SDV 2017: How Visualisation of Open Patent Data can help with Strategic De...
II-SDV 2017: How Visualisation of Open Patent Data can help with Strategic De...
 
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
II-SDV 2012 Automatic Query Re-Ranking in a Patent Database by Local Frequenc...
 
IC-SDV 2018: Deep Search 9
IC-SDV 2018: Deep Search 9IC-SDV 2018: Deep Search 9
IC-SDV 2018: Deep Search 9
 
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
 

Similar to II-SDV 2017: Towards Semantic Search at the European Patent Office

How to valuate and determine standard essential patents
How to valuate and determine standard essential patentsHow to valuate and determine standard essential patents
How to valuate and determine standard essential patentsMIPLM
 
InnovationQ Plus by Mr. Paul Henriques, ex-USPTO and Product specialist/ IEEE...
InnovationQ Plus by Mr. Paul Henriques, ex-USPTO and Product specialist/ IEEE...InnovationQ Plus by Mr. Paul Henriques, ex-USPTO and Product specialist/ IEEE...
InnovationQ Plus by Mr. Paul Henriques, ex-USPTO and Product specialist/ IEEE...Rolsy Mavi
 
El nuevo superordenador Mare Nostrum y el futuro procesador europeo
El nuevo superordenador Mare Nostrum y el futuro procesador europeoEl nuevo superordenador Mare Nostrum y el futuro procesador europeo
El nuevo superordenador Mare Nostrum y el futuro procesador europeoAMETIC
 
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...Pedro Príncipe
 
PPEPR Project Presentation
PPEPR Project PresentationPPEPR Project Presentation
PPEPR Project PresentationMuntazir Mehdi
 
Epo info resources & espacenet & search techniques
Epo info resources & espacenet & search techniquesEpo info resources & espacenet & search techniques
Epo info resources & espacenet & search techniquesLATIPAT
 
Epo info resources & espacenet & search techniques
Epo info resources & espacenet & search techniquesEpo info resources & espacenet & search techniques
Epo info resources & espacenet & search techniquesLATIPAT
 
Enhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataEnhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataBarry Smith
 
The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologySnow Owl
 
5G Standard Essential Patents
5G Standard Essential Patents5G Standard Essential Patents
5G Standard Essential PatentsTim Pohlmann
 
2017 Atlanta Regional User Seminar Introduction
2017 Atlanta Regional User Seminar Introduction2017 Atlanta Regional User Seminar Introduction
2017 Atlanta Regional User Seminar IntroductionOPAL-RT TECHNOLOGIES
 
Structural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsStructural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsAlexandreBonvin2
 
Benchmarking Commercial RDF Stores with Publications Office Dataset
Benchmarking Commercial RDF Stores with Publications Office DatasetBenchmarking Commercial RDF Stores with Publications Office Dataset
Benchmarking Commercial RDF Stores with Publications Office DatasetGhislain Atemezing
 
Speeding up information extraction programs: a holistic optimizer and a learn...
Speeding up information extraction programs: a holistic optimizer and a learn...Speeding up information extraction programs: a holistic optimizer and a learn...
Speeding up information extraction programs: a holistic optimizer and a learn...INRIA-OAK
 
Scaling Up AI Research to Production with PyTorch and MLFlow
Scaling Up AI Research to Production with PyTorch and MLFlowScaling Up AI Research to Production with PyTorch and MLFlow
Scaling Up AI Research to Production with PyTorch and MLFlowDatabricks
 
EUGM 2014 - Alfonso Pozzan (Aptuit): Expanding the scope of “literature data”...
EUGM 2014 - Alfonso Pozzan (Aptuit): Expanding the scope of “literature data”...EUGM 2014 - Alfonso Pozzan (Aptuit): Expanding the scope of “literature data”...
EUGM 2014 - Alfonso Pozzan (Aptuit): Expanding the scope of “literature data”...ChemAxon
 
Presentation iiia
Presentation iiiaPresentation iiia
Presentation iiiancaste
 
Presentation iiia
Presentation iiiaPresentation iiia
Presentation iiiaelenaific
 

Similar to II-SDV 2017: Towards Semantic Search at the European Patent Office (20)

How to valuate and determine standard essential patents
How to valuate and determine standard essential patentsHow to valuate and determine standard essential patents
How to valuate and determine standard essential patents
 
InnovationQ Plus by Mr. Paul Henriques, ex-USPTO and Product specialist/ IEEE...
InnovationQ Plus by Mr. Paul Henriques, ex-USPTO and Product specialist/ IEEE...InnovationQ Plus by Mr. Paul Henriques, ex-USPTO and Product specialist/ IEEE...
InnovationQ Plus by Mr. Paul Henriques, ex-USPTO and Product specialist/ IEEE...
 
El nuevo superordenador Mare Nostrum y el futuro procesador europeo
El nuevo superordenador Mare Nostrum y el futuro procesador europeoEl nuevo superordenador Mare Nostrum y el futuro procesador europeo
El nuevo superordenador Mare Nostrum y el futuro procesador europeo
 
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
Infraestrutura para a Ciência Aberta na Europa - OpenAIRE: O poder dos reposi...
 
PPEPR Project Presentation
PPEPR Project PresentationPPEPR Project Presentation
PPEPR Project Presentation
 
Epo info resources & espacenet & search techniques
Epo info resources & espacenet & search techniquesEpo info resources & espacenet & search techniques
Epo info resources & espacenet & search techniques
 
Epo info resources & espacenet & search techniques
Epo info resources & espacenet & search techniquesEpo info resources & espacenet & search techniques
Epo info resources & espacenet & search techniques
 
Enhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataEnhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort Data
 
Towards processing and reasoning streams of events in knowledge driven manufa...
Towards processing and reasoning streams of events in knowledge driven manufa...Towards processing and reasoning streams of events in knowledge driven manufa...
Towards processing and reasoning streams of events in knowledge driven manufa...
 
The Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to TerminologyThe Logical Model Designer - Binding Information Models to Terminology
The Logical Model Designer - Binding Information Models to Terminology
 
5G Standard Essential Patents
5G Standard Essential Patents5G Standard Essential Patents
5G Standard Essential Patents
 
2017 Atlanta Regional User Seminar Introduction
2017 Atlanta Regional User Seminar Introduction2017 Atlanta Regional User Seminar Introduction
2017 Atlanta Regional User Seminar Introduction
 
Structural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsStructural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 years
 
Benchmarking Commercial RDF Stores with Publications Office Dataset
Benchmarking Commercial RDF Stores with Publications Office DatasetBenchmarking Commercial RDF Stores with Publications Office Dataset
Benchmarking Commercial RDF Stores with Publications Office Dataset
 
Speeding up information extraction programs: a holistic optimizer and a learn...
Speeding up information extraction programs: a holistic optimizer and a learn...Speeding up information extraction programs: a holistic optimizer and a learn...
Speeding up information extraction programs: a holistic optimizer and a learn...
 
CORRECT-ICSE2016
CORRECT-ICSE2016CORRECT-ICSE2016
CORRECT-ICSE2016
 
Scaling Up AI Research to Production with PyTorch and MLFlow
Scaling Up AI Research to Production with PyTorch and MLFlowScaling Up AI Research to Production with PyTorch and MLFlow
Scaling Up AI Research to Production with PyTorch and MLFlow
 
EUGM 2014 - Alfonso Pozzan (Aptuit): Expanding the scope of “literature data”...
EUGM 2014 - Alfonso Pozzan (Aptuit): Expanding the scope of “literature data”...EUGM 2014 - Alfonso Pozzan (Aptuit): Expanding the scope of “literature data”...
EUGM 2014 - Alfonso Pozzan (Aptuit): Expanding the scope of “literature data”...
 
Presentation iiia
Presentation iiiaPresentation iiia
Presentation iiia
 
Presentation iiia
Presentation iiiaPresentation iiia
Presentation iiia
 

More from Dr. Haxel Consult

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementDr. Haxel Consult
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...Dr. Haxel Consult
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...Dr. Haxel Consult
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...Dr. Haxel Consult
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...Dr. Haxel Consult
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...Dr. Haxel Consult
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...Dr. Haxel Consult
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...Dr. Haxel Consult
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...Dr. Haxel Consult
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterDr. Haxel Consult
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCDr. Haxel Consult
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...Dr. Haxel Consult
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...Dr. Haxel Consult
 

More from Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
 
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT – Will AI help? Nils Newman (Search Technology,...
 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
 
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CE...
 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
 
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer – Turning bits into insight...
 

Recently uploaded

APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC
 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...tanu pandey
 
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceDelhi Call girls
 
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...tanu pandey
 
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...nilamkumrai
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfJOHNBEBONYAP1
 
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋nirzagarg
 
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...roncy bisnoi
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirtrahman018755
 
VIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdfMatthew Sinclair
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...tanu pandey
 
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls DubaiDubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubaikojalkojal131
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtrahman018755
 
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...SUHANI PANDEY
 

Recently uploaded (20)

APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
 
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
 
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
 
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...
 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
 
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfpdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
pdfcoffee.com_business-ethics-q3m7-pdf-free.pdf
 
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱
 
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋
 
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
 
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirt
 
VIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 Booking
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls DubaiDubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
 
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirt
 
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
 

II-SDV 2017: Towards Semantic Search at the European Patent Office

  • 1. Dr. Alexander Klenner-Bajaja 30-31 March 2017Data Scientist Requirements Engineering-Solution Design | Dir. 2.8.3.3 Towards semantic search at the European Patent Office II-SDV 2017
  • 2. European Patent Office About me Alexander Klenner-Bajaja ▪ Bioinformatics at Goethe University Frankfurt ▪ PhD ETH Zurich & Goethe University – in Cheminformatics ▪ PostDoc at Fraunhofer Society SCAI in Cologne − Chemical entity recognition in patents ▪ Data Scientist, Search & Knowledge, DG2, EPO − automated search − search benchmarking − new search technologies 2 2006 2011 2013 current
  • 3. European Patent Office ▪ Anecdotal Evidence and the need for a Benchmarking Environment ▪ Keywords and query generation in (automatic) search scenarios (Project ERa) ▪ Introducing machine learned semantic search technologies and (automatic) query expansion ▪ Introducing terminology based semantics within the APL project ▪ (Automated) Search result confidence 3
  • 4. European Patent Office Anecdotal Evidence and Benchmarking ▪ How can two different search strategies be compared? ▪ Historically decision are taken based on expert feedback ▪ Valuable information that can be complemented with measurements ▪ Distribution shown as Box-plot 4 median: 5 citations in search reports
  • 5. European Patent Office Search Benchmarking 5 ▪ We implemented a holistic prototyping and benchmark system ▪ We have access to all distributed system in the EPO IT landscape and can connect them in "visual" workflows
  • 6. European Patent Office Automated Queries exploiting the shown tools 6 Application Automated Query Formulation relevant documents Evaluation and Feedback Search PriorArt
  • 7. European Patent Office Benchmarking Environment ▪ From anecdotal to statistical evidence ▪ "Towards Automated Prior Art Search" (TAPAS) 7 Patent Corpus Citations Split Date Applications Method 1 Method 2 Ø Recall@100: 0.25 Ø Recall@100: 0.17 Search Corpus Result 1 Result 2 Data Set Generation Retrospective Search trec_eval trec_eval
  • 8. European Patent Office ▪ Retrieval success measured by different metrics ▪ Computed by the trec_eval tool ▪ Average over all n simulated applications Benchmarking Environment 8 Top 50 Documents Returned Cited Documents not Returned Recall @ 50 Hit Rate @ 50
  • 9. European Patent Office Benchmarking Environment 9 ▪ Search reports for about 40 million simple patent families ▪ The relevant documents are mentioned in the search report as either – X or A citations ▪ Only few citations per document (2 X-category, 3 A-category) 0 40x106patent documents X X A relevant
  • 10. European Patent Office Benchmarking Environment 10 ▪ Search reports for about 40 million simple patent families ▪ The relevant documents are mentioned in the search report as either – X or A citations ▪ Only few citations per document (2 X-category, 3 A-category) ▪ But more "soft" information available: Returned during search (R), put aside for detailed inspection (Ins) , printed (Pr), stored for citation (Cs) relevance 0 80x106patent documents X X A R Ins Pr
  • 11. European Patent Office ▪ Anecdotal Evidence and the need for a Benchmarking Environment ▪ Keywords and query generation in (automatic) search scenarios (Project ERa) ▪ Introducing machine learned semantic search technologies and (automatic) query expansion ▪ Introducing terminology based semantics within the APL project ▪ (Automated) Search result confidence 11
  • 12. European Patent Office Term-Based Search ▪ Can we hope to find all citations with keywords?  overlap in vocabulary (non-stopwords) 12 Application Citation Overlap
  • 13. European Patent Office Term-Based Search 13 ▪ However: also overlap with random documents ▪ Random: Y-Scrambling CitedDocumentsRandomDocuments Frequency Overlapping Terms All Terms Nouns, no Stop Words Keywords (TF) Overlapping Terms Overlapping Terms Overlapping Terms Overlapping Terms Overlapping Terms Frequency Frequency Frequency Frequency Frequency
  • 14. European Patent Office Term-Based Search fully automated 14 ▪ How good can we get? ▪ More Like This on Title / Abstract / Description / Claims ▪ Percentage of citations found in the top k Top 100 Top 1 000 Top 10 000 Top 100 000 0% 29% 57% 80%
  • 15. European Patent Office Term-Based Search fully automated ▪ Can we close the gap? ... using semantic search? 15 Top 100 Top 1 000 Top 10 000 Top 100 000 0% 29% 57% 80%
  • 16. European Patent Office ▪ Anecdotal Evidence and the need for a Benchmarking Environment ▪ Keywords and query generation in (automatic) search scenarios (Project ERa) ▪ Introducing machine learned semantic search technologies and (automatic) query expansion ▪ Introducing terminology based semantics within the APL project ▪ (Automated) Search result confidence 16
  • 17. European Patent Office Introducing "Semantics" 17 Virus
  • 18. European Patent Office Automated Query Expansion ▪ Analyze word embeddings ▪ Words with similar meaning have similar context ▪ Train neural network  words represented by vectors ▪ Words with similar meaning have similar vectors  allows distance calculations between words  semantic similarity 18 Word eating apples is healthy Context eating fruits is healthy apple = [0.253, 1.793, ..., 5.555, 3,142] tomato apple fruit banana steak liver
  • 19. European Patent Office Automated Query Expansion 19 Patent Corpus CPC w2v model A01B M1 A02B M2 A02C M3 H05K MN-1 All MN Query Word top X relevant words per class
  • 20. European Patent Office Automated Query Expansion A61K A virus is a small infectious agent that replicates only inside the living cells of other organisms. G06F A computer virus is a type of malicious software program ("malware") that, when executed, replicates by reproducing itself (copying its own source code) or infecting other computer programs by modifying them. 20 hiv, viral, human immunodeficiency, viral genome, genome, replication, pathogen, adenovirus, replicating infected, virus infection, malicious, anti-virus, malware, virus- scanning, virus worm, macro virus, virus scanner
  • 21. European Patent Office Semantics through Machine Learning Example Queries: descriptions from Wikipedia . 21 A virus is a small infectious agent that replicates only inside the living cells of other organisms. A computer virus is a type of malicious software program ("malware") that, when executed, replicates by reproducing itself (copying its own source code) or infecting other computer programs by modifying them. Electrical digital data processing Preparations for medical, dental or toilet purposes
  • 22. European Patent Office Automated Query Expansion Inter model similarity in CPC A Human Necessities based on vocabularies 22 A47KA45DA47L
  • 23. European Patent Office Automated Query Expansion ▪ Word2Vec Models applied to (100) extracted TF-IDF Keywords 23 "Touch sensing system and display apparatus" EP2869168A120150506 TF-IDF Keywords (10 of 100) TF-IDF Scores Word2Vec Enrichments (G06F) max 5 terms, at least Cosine 0.6 possessory sensing touch bridge nodes shared electrodes controller instructing masters memory sensor algorithm senses data ... 49.01 33.04 31.36 27.88 27.59 26.86 25.52 23.18 22.11 21.23 20.39 20.38 20.02 20.02 19.91 ... possessory sensing OR sensed OR sensor OR touch touch OR touched OR touch screen OR touches OR touch panel OR touching bridge OR bridges OR pci bus OR bus OR interfaces 22a-22b OR pci-pci bridge nodes OR node OR nodes n2 shared OR share OR sharing OR non-shared electrodes OR electrode OR electrodes y2 controller OR control OR controllers OR bus OR unit OR processor instructing OR instruct OR instructs masters memory OR ram OR non-volatile OR memories OR processor OR flash sensor OR sensors OR sensing OR humidity sensor OR sensed OR sensor senses algorithm OR algorithms senses OR sensed data OR stored OR stores OR store OR storing ...
  • 24. European Patent Office Automated Query Expansion ▪ Query Enrichment brings increased performance EP2869168A120150506 24 "Touch sensing system and display apparatus" Recall@100: 0 (0 of 2) Recall@100: 1 (2 of 2) Keyword Extraction W2V Enrichment TF-IDF Query W2V Query
  • 25. European Patent Office Automated Query Expansion ▪ Documents found with increased Term Overlap with Query 25 39100958 48143418 Recall@100: 0 (0 of 2) Recall@100: 1 (2 of 2) 39 80 43 80 Query Citation TF-IDF Query W2V Query
  • 26. European Patent Office Automated Query Expansion ▪ Word2Vec Expansion Terms in Overlap ▪ Re-Suggestion of known Query Terms 26 outputs apparatus screen increasing serially signal OR signals OR circuit OR output OR clock OR outputs controlling OR control OR controlled OR apparatus display OR screen OR displays OR displaying OR displayed OR lcd increases OR increase OR decreases OR increased OR increasing OR decrease decreasing OR increasing OR decrease OR increase OR decreases OR decreased parallel OR serially New Overlap Original Keyword Word2Vec Enrichment transmission transmitted OR transmits OR received OR transmit OR transmission OR transmitting reception OR transmission OR transmitted OR transmits transmitting OR receiving OR transmitted OR transmission OR sending OR transmit Term Original Keyword Word2Vec Enrichment ▪ Should this result in higher weight? ▪ Models heavily suggest inflections
  • 27. European Patent Office ▪ Anecdotal Evidence and the need for a Benchmarking Environment ▪ Keywords and query generation in (automatic) search scenarios (Project ERa) ▪ Introducing machine learned semantic search technologies and (automatic) query expansion ▪ Introducing terminology based semantics within the APL project ▪ (Automated) Search result confidence 27
  • 28. European Patent Office Introducing semantics through annotations We are in the process of annotating the full prior art collection with normalised ▪ Chemical Entities ▪ Physical Units ▪ Citations ▪ Controlled Terminologies (e.g. MeSH) to enable a semantic search of those entities. 28 Patent Collection Patent Collection Annotation Pipeline Aspirin 400 mm solar panel
  • 29. European Patent Office Introducing annotations and semantics ▪ Our annotation platform's key elements − Based on the open source framework UIMA − Plug and play of new components (analysis engines) − Quality Evaluation Workbench − Knowledge Base for controlled vocabulary − Scaling architecture − Data and annotations stored in noSQL databases 29 Application Citation acetylsalicylic acid (D001241) matching terms through controlled vocabulary MeSH Unique ID: D001241 Aspirin (D001241)
  • 30. European Patent Office Introducing annotations and semantics Project in Execution - Platform installed and tested at this moment 30 1. Define (sub)set of documents to work on 2. Define an annotation Pipeline
  • 31. European Patent Office Word2Vec & APL 31 Application Citation Aspirin acetylsalicylic acid matching terms through controlled vocabulary Aspirin = acetylsalicylic acid (would have been False Negative) Virus Virus separating identical terms with different meaning Virus = Virus (would have been False Positive)
  • 32. European Patent Office ▪ Anecdotal Evidence and the need for a Benchmarking Environment ▪ Keywords and query generation in (automatic) search scenarios (Project ERa) ▪ Introducing machine learned semantic search technologies and (automatic) query expansion ▪ Introducing terminology based semantics within the APL project ▪ (Automated) Search result confidence 32
  • 33. European Patent Office Identify Successful (Automated) Searches Machine-Learning: look at the documents 33 Word2Vec Heatmap Sequence Alignment (Bioinformatics) document1document2 document1document2 document 0 document 0 document 0 document 0 Citation Random
  • 34. European Patent Office Can we identify successful searches? (Machine learned) ▪ Prediction Model for Citability based on Distribution Statistics of heat maps ▪ 10-fold Cross Validation (Y-Scrambling) 34 [mean, variance, skewness, kurtosis, min, q1, q2, q3, max] Random ForestMulti-Layer Perceptron ▪ ensemble of decision trees ▪ bootstrap aggregating ▪ random selection of features Training Set Validation Set Round 1 Error Rate 1 Round 2 Error Rate 2 Round 3 Error Rate 3 Round 10 Error Rate 10 ~8% ~5% Predicted Relevance
  • 35. European Patent Office Identify Successful (Automated) Searches ▪ Meta Data: same author as in application occurs in result set ▪ Interesting Item Set Mining & Subgroup Discovery 35 Searches with same author in result set (35%) Recall@100 0.2357 Searches without same author in result set (65%) Recall@100 0.1209 All searches (100%) Recall@100 0.1608 Only documents from same author Recall@100 0.0884 MLT Search 0 100 Document Rank Recall
  • 36. European Patent Office Complementariness of Results 36 "Touch sensing system and display apparatus"Query Method 1 Method 2
  • 37. European Patent Office Complementariness of Results 37 "Touch sensing system and display apparatus"Query Method 1 Method 2 Method 3 Method i...
  • 38. European Patent Office Complementariness of Results 38 "Touch sensing system and display apparatus"Query Method 1 Method 2 Method 3 Method i...
  • 39. European Patent Office Conclusion and outline ▪ We are exploiting state-of-the-art semantic- and other search technologies ▪ We have created a prototyping and benchmarking environment to evaluate the performance and quality of a new search algorithm learning from prior searches ▪ Bringing all these new technologies together in a productive search environment is the biggest challenge ahead ▪ Semantic (text) search is one small mosaic piece in the complex search environment at the EPO 39
  • 40. European Patent Office Acknowledgments ▪ The Search & Knowledge Directorate − Domenico Golzio Director − Stefan Klocke Head of Department − Volker Hähnke Ranking and Confidence − Matthias Wirth Neo4J, CPC classification − Pavel Goncharik Ansera, ranking, lucene ▪ Examiners − Robert Herman ERa − Ype Kingma ERa − Markus Arndt ERa − Christos Stergio APL 40