SlideShare a Scribd company logo
1 of 30
Download to read offline
Linked Patent Data:
opportunities and challenges
for patent professionals
CEPIUG  Conference  
Milano,  9  -­ 11  September  2018
Alberto.Ciaramella @intellisemantic.com
This  presentation  and  its  objectives              
n This  presentation  is
n a  lean  tutorial  about  linked  data
n a  status  of  the  art  about  linked  data  in  patents  
n a  position  paper
n to  identify  how  today  technology  can  cover  
patent  professionals  needs
n to  encourage patent offices to  extend the  
adoption of  this paradigm in  patent information  
they publish
a  network  effect can  occur
IntelliSemantic 2
Index              
n The  framework:  
n what  are  Linked  Data?    
n why  and  when  to  use  them?    
n A Linked  Data  technical  introduction
n Linked  Data  in  patents
n EPO
n Others
n Opportunities  and  challenges  in  patents
n Conclusions
IntelliSemantic 3
A framework and
a technical introduction
What  are  Linked  Data?              
n Linked  Data  (LD)  is  a  new  more  powerful  and  flexible  
technology  of  publishing  and  accessing  data  bases
n Linked  data  is  backed  by a  solid background  of  
international standards,  developed by  the  Word  Wide  
Web  Consortium (W3C)  
n “Linked  data”  were  originally  proposed  by  Berners  
Lee  (2006),  as  Linked  Open  Data  (LOD),  but  it  is  now  
well  recognized  that  this  is  only  a  specific  case  of  use:
n Linked  data  can  also  be  private,  of  course.
n More  recently,  the  focus  of  Linked  Data  research  
moved  to  engineering  aspects  as  privacy,  security,  
efficiency,  quality  of  data
n Linked  data  is  one  of  the  enabling  technologies  for  
Big  Data  applications,  more  specifically  to  solve  the  
challenge  of  the  variety  of  data
IntelliSemantic 5
Who,  where,  why  linked  data  (e.g.  in  patents)          
IntelliSemantic 6
Who Where	
  	
  (layer) Why	
  
Data	
  base	
  
supplier
Data	
  base To	
  facilitate the	
  verification	
  
and	
  cleaning	
  of	
  	
  a	
  data	
  base
Application	
  
developer
Data	
  bases	
  
integration
To	
  facilitate	
  the integration	
  
of	
  different	
  data	
  bases	
  
(different	
  patent	
  offices,	
  	
  
non patent	
  literature	
  ..)	
  
Application	
  
developer	
  /	
  
users
Query	
  expression To	
  express	
  more	
  powerful	
  
queries	
  
Application	
  
developer	
  
/users	
  
Results	
  analysis To	
  facilitate the	
  inclusion	
  of	
  
more	
  intelligent	
  applications
Evolutions  of  Linked  Data  in  patents
n At  EPOPIC  Kilkenny  (2011)  an  invited speech by  
Nigel  Shadbolt presented the  results achieved and/or  
achievable with  open  data  in  different fields
n since that,  Open  Data  and  Linked Open  Data  
entered as a  discussion topic in  patent conferences
n some  LOD  demo  were released from  2013  to  2016,  
typically including a  patent set  sample
n notable demos by  KIPO,  USPTO  and  EPO  
n in  April  2018  the  EPO  launched its product-­level
solution LOD,  regularly updated (weekly)    
n this seems to  be  a  turning point!  
IntelliSemantic 7
A  step  back  -­
Data  base  models:  from  tables  to  graphs    
n A  well  established  and  used  model  in  data  bases  is    
the  relational  model,  defined  in  the  ‘70s  
n in  these  systems  (RDBMS),  data  are  represented  as  
a  set  of  related  tables.  As  an  example:
n a  first  table  associates  to  any  patent  its  data,  
including  applicant  and  inventor  
n other  related  tables  include  further  details  about  
applicants  and  inventors
n this  paradigm  (table-­based)  is  efficient,  but,  once  
designed,  it  is  more  difficult  to  evolve  and  scale  
n in  any  case,  it  is  straightforward  to  convert  tables  to  
graphs (see  next  slide),  and  this  adds  a  lot  of  flexibility:  
this  conversion  is  the  basic  concept  of  Linked  Data
IntelliSemantic 8
From  tables  to  (triple)  graphs:  an  example    
If  we  have  this  table  chunk:
IntelliSemantic 9
Patent id Application date applicant -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐
XYZ12488 2018/09/01 Company	
  A -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐
-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐
Any  cell in  the  table  can  be  represented  as  a  triple  of  values:
a) Subject  (the  heading  of  a row,  as  XYZ12488):  represented  as  a  graph  node
b) Predicate  (the  heading  of  a  column,  as  application  date):  represented  as  a  
graph  edge
c) Object  (the  cell  value,  as  Company  A):  represented  as  a  graph  node
Hence  these  two  cells  in  the  table  can  also  be  represented  by  this  graph  chunk  
If  we  apply  this  trasformation to  any  cell.  
the  table is transformed to  a  labeled
graph,  which includes nodes (subjects,  
objects) and  connections (predicates)
From  triple  graphs  to  LD  on  the  web      
Linked  Data  (LD)  implement  on  the  net  (internet  or  
intranet)  the  graph  representation  just  described.  To  do  
that:  
n reuse well  known  internet  standards  
n to  name  things  (with  URI)  
n to  access  them  (with  HTTP)    
n define new  kind  of  information,  i.e.  triples
n triples  are  not  required  in  the  “web  of  documents”
n include  external  references  to  easily  reuse  
knowledge  already  defined  in  other  graphs,  as:  
n “same  level”  complementary  data,  e.g.  data  from  
scientific  papers to  integrate  patent  data
n “higher  level”  common  vocabularies,  e.g.  the  
definition  of  author.    
IntelliSemantic 10
SPARQL  to  query  LD:  what  and  why      
n what  is
n SPARQL  is  the  standard  query  language  for  LD
n it  means  “SPARQL  Protocol and  RDF  Query  Language”
n SPARQL  has  been  defined  by  W3C
n the  last  version  1.1  has  being  released  on  2013  
n why  to  use
n to  facilitate  the  integration  of  different  data  bases,  
independently  from  their  structure
n to  facilitate  the  identification  of  eventual  missing  or  bad  
data  in  the  original  data  bases  
n to  implement  more  efficiently  advanced  applications  on  
the  top  of  your  data  
n how  to  use
n a  LD  is  typically  provided  by  a  SPARQL  end  point,  i.e.  
an  internet  address  which  accepts  SPARQL  queries
IntelliSemantic 11
SPARQL:  the  query
n The  query  is  represented  by  a  subgraph on  the  whole  
graph,  having  defined  fixed  point(s)  in  the  whole  graph  
n e.g.  the  patent  applicant
n Different  kind  of  queries  can  be  represented,  from  
“usual”  queries  to  more  involved,  still  interesting  queries,  
as:  
n example  1:  identify  all  patents  of  the  applicant  x  
published  by  the  authority  y  in  the  years  zi to  zn
n this  is  a  more  usual  example
n example  2:  identify  all  patents  which  are  co-­cited  by  
at  least  a  member  of  the  patent  family  A  and  by  at  least  
a  member  of  the  patent  family  B  
n this  is  a  little  more  advanced  example
IntelliSemantic 12
SPARQL:  results    
n SPARQL  results  are  typically  presented  by  tables
n e.g.  the  table  of  patents  satisfying  a  query
n native  SPARQL  commands  allow  
n to  select  the  columns  in  the  results  table
n to  order  rows  in  the  results  table
n to  extract  summary  values  from  rows,  
n hence  to  extract  statistics  
n it  is  also  possible  to  make  inferences  from  your  data,  
n in  any  case  under  your  control  
IntelliSemantic 13
SPARQL:  a  simplified  example    
n the  question  to  solve:  identify  all  patents  citing  patent  X,  
directly  or  indirectly  (i.e.  through  a  chain  of  citations)
n the  kernel  of  the  SPARQL  query  is
SELECT  DISTINCT  ?s
WHERE    {	
  ?s      cites+    patentX.  }
n the  WHERE  clause  identifies  the  subgraph  in  which
n the  patentX is  the  object:  it  is  the  patent  you  know  
n the  predicate  is  “cites”,  which  includes  in  this  case  
the  recursion  symbol  +
n the  ?s  are  the  unknown  subjects  to  identify
n the  SELECT  DISTINCT  clause  produces  the  output
IntelliSemantic 14
Linked  Data  accesses        
Linked  Data  are  accessed  at  specific  web  addresses  (end-­
point).  Depending  from  the  end-­point,  they  can  be:
n linked  public  data,  used  for
n open  and  reliable  vocabularies  and  ontologies
n open  and  reliable  information,  e.g.  DBpedia
provided  that  the  performance  is  adequate  for  you
n linked  restricted  data
n data  restricted  to  you  and  to  your  business  partners  
n linked  internal  data  (behind  your  firewall)  to  access  
n your  internal  company  data    
Whilst  Linked  Data  emerged  first  as  Linked  Open  Data,  it  
is  misleading  to  assume  that  Linked  Data  have  to  be  
necessarily  open  as  well.  
IntelliSemantic 15
What is available in patents
The  EPO  LOD:  an  overview        
n It  is  one of  the  three kind of  services provided by  EPO  to  
application integrators
n It is available since April  2018,  at
https://www.epo.org/searching-­for-­patents/data/linked-­
open-­data.html#tab-­1
n It is provided under  the  Creative  Common  Attribution 4.0  
License,  hence the  integrator:
n is free  to  share  and  adapt these data
n giving credit  and  without adding restrictions
n User  manual and  support forum  available
n Panel  access at https://data.epo.org/linked-­data/
n it provides a  SPARQL  query suggestion interface
n it provides access to  a  SPARQL  endpoint
IntelliSemantic 17
The  EPO  LOD:  technical  details    
n The  data  base  is EPO-­specific and  includes:    
n data  provided by  EPO,  i.e.
n EPO  application and  publication bibliographic
information,  including text
n weekly updated
n the  CPCs tree
n a  vocabulary of  patent concepts defined by  EPO
n links to  other  LOD  key  vocabularies/concepts
n most  of  which  originated  by  W3C
n Text  search  also  available
n which  is  an  interesting  extension  
n The  data  base  does  not  include  (yet?):
n information  about  the  legal  status    
IntelliSemantic 18
An  example:  the  EPO  LOD  patent  vocabulary  
IntelliSemantic 19
This	
  vocabulary	
  developed	
  by	
  the	
  EPO	
  defines	
  some	
  key	
  concepts	
  
in	
  patents	
  (e.g.	
  application,	
  publication)	
  and	
  can	
  be	
  shared	
  with	
  
other	
  patent	
  information	
  provided	
  as	
  linked	
  data.	
  This	
  is	
  only	
  a	
  
simplified	
  sketch,	
  indeed.
The  EPO  LOD:  the  business  model      
n free  data  and  access
n to  be  used  according  to  the  fair  use  policy,  hence  for  
casual  users  and  experiments:  
n best  effort  policy  is  applied  for  results:1  minute  
after  the  query,  results  are  terminated  in  any  case  
n for  production  level  applications,  the  LOD  data  base  
has  to  be  transferred  to  the  server  of  the  application  
integrator
n to  the  purists  of  the  open  web,  this  is  a  limit
n for  the  real  business,  this  is  the  example  of  a  
realistic  compromise,  which  can  enable  some  
more  flexibility  on  the  business  model
IntelliSemantic 20
Other  LOD  examples  for  patents          
n KIPO  LOD  
n info  and  data  at http://lod.kipo.kr/
n SPARQL  query access at http://lod.kipo.kr/data/sparql
n the  data  base  includes bibliographic and  legal KIPO  info
n LOD  support is mentioned the  annual KIPO  report  
n it is a  beta  now
n US  LOD
n info  and  data  at https://old.datahub.io/dataset/linked-­‐uspto-­‐
patent-­‐data
n SPARQL  query access at
http://us.patents.aksw.org/sparql
n it seems still a  demo,  since the  data  content is not
updated recently,  and  the  kind of  licence is not mentioned
IntelliSemantic 21
Opportunities  
and  challenges
Conclusions
Bulk  data  base  providers  (patent  offices)        
IntelliSemantic 23
n Opportunities  
n technical:  the  adoption  of  the  LD  paradigm    
facilitates the  verification  and  cleaning  of  your  own  
data  base,  hence  it  should  be  promoted  also  as  an  
internal  best  practice
n Challenges
n business:  if a  provider  delivers LD  to  its customers,    
it has to  define its license and  its commercial  
conditions
n beware:  the  provider  is not forced by  the  
technology itself to  offer its data  for  free!  This a  
misleading objection.
Application  developers          
n Technical  opportunities:  
n the  integration of  patent data  bases provided by  
different patent officies will become easier
n the  integration of  patent data  with  other information  
(e.g.  scientific or  business)  becomes easier
n this will support the  extension of  patent
platforms to  business  intelligence  platforms
n more  advanced functions can  be  provided to  patent
professionals and  their environment (i.e.  colleagues
and  managers)
n Business  challenges:
n all the  previous mentioned opportunities will
materialize easier if the  LD  paradigm will suitably
adopted by  data  base  providers
IntelliSemantic 24
Patent  professionals        
n Technical  opportunities:  have to  opportunity to  rely on:
n more  powerful and  diversified queries,  e.g.  to  
suitable explore the  citations network  
n more  advanced and  flexible analytics tools,  for  
deeper patent sets  analyses
n more  transparent and  well agreed definition of  
concepts,  to  support a  new  generation  of  semantic
solutions in  patent informatics
n I  define this semantic 3.0  in  patent informatics
n more  information  rich tools,  which can  help  them
to  support better a  broader set  of  users
n Challenges:  
n motivations and  willingness to  improve the  
profession
IntelliSemantic 25
Semantics  in  Patent  Informatics:  a  view      
Semantics in  Patent Informatics can  be  categorized also by  
generations:  
n Semantics 1.0  (implicit):  Solutions  and  best  practices in  
patent informatics include  some  semantic concepts,  but
they are  not mentioned under  this term.  For  example:  IPCs
and  CPCs are  actually ontologies.
n Semantics 2.0  (explicit):  Natural  language semantics is
included in  solutions,  to  add new  ways  of  searching and/or  
to  analyze results.  The  word  “semantic”  is now explicitly
advertised as such,  for  very diversified solutions indeed.    
Some  concerns arise too,  e.g.  about “which kind of  
semantic”  is in  place (for  searching?  for  analyzing?).
n Semantics 3.0  (standardized):   The  inclusion of  explicit,  
agreed and  transparent standard knowledge bases will
establish semantics as a  true value added for  the  
professional,  not as a  question mark issue.  
IntelliSemantic 26
Conclusions        
n The  adoption of  Linked Data  is a  win /  win
opportunity for  all players:
n patent offices
n application developers
n professional users
n their environment (colleagues and  managers)  
n but the  awareness of  this possible evolution in  
patent informatics must  sparkle now!
n Linked Data  in  fact:
n uses well established international standards
n facilitates the  reuse of  existing knowledge
data  bases (e.g.  vocabularies)
IntelliSemantic 27
Questions        
n technology-­related questions?
n application-­related questions?  
n business  model-­related questions?
n bulk  data  base  providers?
n application developers?  
n other questions?
IntelliSemantic 28
Acknowledgment  and  extensions        
n since  the  topic  is  definitely  challenging  to  be  
summarized  in  a  20  minute  presentation,  an  open  
access paper with  the  same title and  coauthored with  
Marco  Ciaramella  will extends this presentation with:  
n more  details
n patent  specific  examples
n bibliography    
n the  paper  will  be  accessed  at  https://arxiv.org/
n in  the  section  computers  /  digital  libraries
n I  do  acknowledge  Marco  for  the  contribution  to  this  
paper  as  well  to  this  presentation  
IntelliSemantic 29
Contact  information        
a)  For  specific  questions  and  information  requests
n e-­mail:  alberto.ciaramella@intellisemantic.com
n tel.  +39  011  9550  380
b)  For  registering  to  the  IntelliSemantic newsletter,  
including  references  to  new  IntelliSemantic papers  and  
webinars  announcements  
n e-­mail:  newsletter@intellisemantic.com
n next  issue:  September  2018
c)  For  general  information  and  first  level  overview:
n site  http://www.intellisemantic.com
IntelliSemantic 30

More Related Content

What's hot

Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...Vyacheslav Tykhonov
 
Scalable and privacy-preserving data integration - part 1
Scalable and privacy-preserving data integration - part 1Scalable and privacy-preserving data integration - part 1
Scalable and privacy-preserving data integration - part 1ErhardRahm
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE
 
Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesSrinath Srinivasa
 
Using the Semantic Web Stack to Make Big Data Smarter
Using the Semantic Web Stack to Make  Big Data SmarterUsing the Semantic Web Stack to Make  Big Data Smarter
Using the Semantic Web Stack to Make Big Data SmarterMatheus Mota
 
Linked Open Data (LOD) part 3
Linked Open Data (LOD)  part 3Linked Open Data (LOD)  part 3
Linked Open Data (LOD) part 3IPLODProject
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataversevty
 
CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse vty
 
Controlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repositoryControlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repositoryvty
 
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionLinking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionRonald Ashri
 
15. STL - Data Structures using C++ by Varsha Patil
15. STL - Data Structures using C++ by Varsha Patil15. STL - Data Structures using C++ by Varsha Patil
15. STL - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge ScientistEthics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge ScientistStratos Kontopoulos
 
Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases Zakaria Zubi
 
ROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackMartin Voigt
 
The world of Docker and Kubernetes
The world of Docker and Kubernetes The world of Docker and Kubernetes
The world of Docker and Kubernetes vty
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligencevty
 

What's hot (20)

LOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and SparqlifyLOD2 Webinar Series: D2R and Sparqlify
LOD2 Webinar Series: D2R and Sparqlify
 
LOD2 Webinar: SIREn
LOD2 Webinar: SIREnLOD2 Webinar: SIREn
LOD2 Webinar: SIREn
 
Lod2 review meeting
Lod2 review meetingLod2 review meeting
Lod2 review meeting
 
Flexible metadata schemes for research data repositories - Clarin Conference...
Flexible metadata schemes for research data repositories  - Clarin Conference...Flexible metadata schemes for research data repositories  - Clarin Conference...
Flexible metadata schemes for research data repositories - Clarin Conference...
 
Scalable and privacy-preserving data integration - part 1
Scalable and privacy-preserving data integration - part 1Scalable and privacy-preserving data integration - part 1
Scalable and privacy-preserving data integration - part 1
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
 
Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and Opportunities
 
Using the Semantic Web Stack to Make Big Data Smarter
Using the Semantic Web Stack to Make  Big Data SmarterUsing the Semantic Web Stack to Make  Big Data Smarter
Using the Semantic Web Stack to Make Big Data Smarter
 
Linked Open Data (LOD) part 3
Linked Open Data (LOD)  part 3Linked Open Data (LOD)  part 3
Linked Open Data (LOD) part 3
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataverse
 
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and RepairLOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
 
CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse
 
Controlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repositoryControlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repository
 
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An IntroductionLinking Open, Big Data Using Semantic Web Technologies - An Introduction
Linking Open, Big Data Using Semantic Web Technologies - An Introduction
 
15. STL - Data Structures using C++ by Varsha Patil
15. STL - Data Structures using C++ by Varsha Patil15. STL - Data Structures using C++ by Varsha Patil
15. STL - Data Structures using C++ by Varsha Patil
 
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge ScientistEthics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
Ethics & (Explainable) AI – Semantic AI & the Role of the Knowledge Scientist
 
Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases Knowledge Discovery in Remote Access Databases
Knowledge Discovery in Remote Access Databases
 
ROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data StackROI in Linking Content to CRM by Applying the Linked Data Stack
ROI in Linking Content to CRM by Applying the Linked Data Stack
 
The world of Docker and Kubernetes
The world of Docker and Kubernetes The world of Docker and Kubernetes
The world of Docker and Kubernetes
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligence
 

Similar to Linked Patent Data Opportunities and Challenges

Towards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTTowards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTHong-Linh Truong
 
Computer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringComputer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringuniversity of sust.
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformSanjay Padhi, Ph.D
 
Semantically Enriched Knowledge Extraction With Data Mining
Semantically Enriched Knowledge Extraction With Data MiningSemantically Enriched Knowledge Extraction With Data Mining
Semantically Enriched Knowledge Extraction With Data MiningEditor IJCATR
 
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
OSFair2017 Workshop | Brokering services facilitating interoperability and da...OSFair2017 Workshop | Brokering services facilitating interoperability and da...
OSFair2017 Workshop | Brokering services facilitating interoperability and da...Open Science Fair
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutionsOpen Data Support
 
Data Wrangling and Visualization Using Python
Data Wrangling and Visualization Using PythonData Wrangling and Visualization Using Python
Data Wrangling and Visualization Using PythonMOHITKUMAR1379
 
Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsDenodo
 
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES
STUDY ON EMERGING APPLICATIONS ON DATA  PLANE AND OPTIMIZATION POSSIBILITIES STUDY ON EMERGING APPLICATIONS ON DATA  PLANE AND OPTIMIZATION POSSIBILITIES
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES ijdpsjournal
 
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIESSTUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIESijdpsjournal
 
EDI Training Module 12: Learn to Cite and Link Your Data
EDI Training Module 12:  Learn to Cite and Link Your DataEDI Training Module 12:  Learn to Cite and Link Your Data
EDI Training Module 12: Learn to Cite and Link Your DataEnvironmental Data Initiative
 
01VD062009003760042.pdf
01VD062009003760042.pdf01VD062009003760042.pdf
01VD062009003760042.pdfSunilMatsagar1
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataStuart Chalk
 
ICWE2017 BigDataEurope
ICWE2017 BigDataEuropeICWE2017 BigDataEurope
ICWE2017 BigDataEuropeBigData_Europe
 
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE
 
OpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesOpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesRIANIreland
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 

Similar to Linked Patent Data Opportunities and Challenges (20)

Towards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoTTowards a Resource Slice Interoperability Hub for IoT
Towards a Resource Slice Interoperability Hub for IoT
 
Computer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineeringComputer aided design, computer aided manufacturing, computer aided engineering
Computer aided design, computer aided manufacturing, computer aided engineering
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Planetdata simpda
Planetdata simpdaPlanetdata simpda
Planetdata simpda
 
PlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web ScalePlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web Scale
 
Netsoft19 Keynote: Fluid Network Planes
Netsoft19 Keynote: Fluid Network PlanesNetsoft19 Keynote: Fluid Network Planes
Netsoft19 Keynote: Fluid Network Planes
 
Semantically Enriched Knowledge Extraction With Data Mining
Semantically Enriched Knowledge Extraction With Data MiningSemantically Enriched Knowledge Extraction With Data Mining
Semantically Enriched Knowledge Extraction With Data Mining
 
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
OSFair2017 Workshop | Brokering services facilitating interoperability and da...OSFair2017 Workshop | Brokering services facilitating interoperability and da...
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
 
Llinked open data training for EU institutions
Llinked open data training for EU institutionsLlinked open data training for EU institutions
Llinked open data training for EU institutions
 
Data Wrangling and Visualization Using Python
Data Wrangling and Visualization Using PythonData Wrangling and Visualization Using Python
Data Wrangling and Visualization Using Python
 
Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural Components
 
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES
STUDY ON EMERGING APPLICATIONS ON DATA  PLANE AND OPTIMIZATION POSSIBILITIES STUDY ON EMERGING APPLICATIONS ON DATA  PLANE AND OPTIMIZATION POSSIBILITIES
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES
 
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIESSTUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES
STUDY ON EMERGING APPLICATIONS ON DATA PLANE AND OPTIMIZATION POSSIBILITIES
 
EDI Training Module 12: Learn to Cite and Link Your Data
EDI Training Module 12:  Learn to Cite and Link Your DataEDI Training Module 12:  Learn to Cite and Link Your Data
EDI Training Module 12: Learn to Cite and Link Your Data
 
01VD062009003760042.pdf
01VD062009003760042.pdf01VD062009003760042.pdf
01VD062009003760042.pdf
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 
ICWE2017 BigDataEurope
ICWE2017 BigDataEuropeICWE2017 BigDataEurope
ICWE2017 BigDataEurope
 
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
 
OpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesOpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish Repositories
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 

Recently uploaded

04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 

Recently uploaded (20)

04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 

Linked Patent Data Opportunities and Challenges

  • 1. Linked Patent Data: opportunities and challenges for patent professionals CEPIUG  Conference   Milano,  9  -­ 11  September  2018 Alberto.Ciaramella @intellisemantic.com
  • 2. This  presentation  and  its  objectives               n This  presentation  is n a  lean  tutorial  about  linked  data n a  status  of  the  art  about  linked  data  in  patents   n a  position  paper n to  identify  how  today  technology  can  cover   patent  professionals  needs n to  encourage patent offices to  extend the   adoption of  this paradigm in  patent information   they publish a  network  effect can  occur IntelliSemantic 2
  • 3. Index               n The  framework:   n what  are  Linked  Data?     n why  and  when  to  use  them?     n A Linked  Data  technical  introduction n Linked  Data  in  patents n EPO n Others n Opportunities  and  challenges  in  patents n Conclusions IntelliSemantic 3
  • 4. A framework and a technical introduction
  • 5. What  are  Linked  Data?               n Linked  Data  (LD)  is  a  new  more  powerful  and  flexible   technology  of  publishing  and  accessing  data  bases n Linked  data  is  backed  by a  solid background  of   international standards,  developed by  the  Word  Wide   Web  Consortium (W3C)   n “Linked  data”  were  originally  proposed  by  Berners   Lee  (2006),  as  Linked  Open  Data  (LOD),  but  it  is  now   well  recognized  that  this  is  only  a  specific  case  of  use: n Linked  data  can  also  be  private,  of  course. n More  recently,  the  focus  of  Linked  Data  research   moved  to  engineering  aspects  as  privacy,  security,   efficiency,  quality  of  data n Linked  data  is  one  of  the  enabling  technologies  for   Big  Data  applications,  more  specifically  to  solve  the   challenge  of  the  variety  of  data IntelliSemantic 5
  • 6. Who,  where,  why  linked  data  (e.g.  in  patents)           IntelliSemantic 6 Who Where    (layer) Why   Data  base   supplier Data  base To  facilitate the  verification   and  cleaning  of    a  data  base Application   developer Data  bases   integration To  facilitate  the integration   of  different  data  bases   (different  patent  offices,     non patent  literature  ..)   Application   developer  /   users Query  expression To  express  more  powerful   queries   Application   developer   /users   Results  analysis To  facilitate the  inclusion  of   more  intelligent  applications
  • 7. Evolutions  of  Linked  Data  in  patents n At  EPOPIC  Kilkenny  (2011)  an  invited speech by   Nigel  Shadbolt presented the  results achieved and/or   achievable with  open  data  in  different fields n since that,  Open  Data  and  Linked Open  Data   entered as a  discussion topic in  patent conferences n some  LOD  demo  were released from  2013  to  2016,   typically including a  patent set  sample n notable demos by  KIPO,  USPTO  and  EPO   n in  April  2018  the  EPO  launched its product-­level solution LOD,  regularly updated (weekly)     n this seems to  be  a  turning point!   IntelliSemantic 7
  • 8. A  step  back  -­ Data  base  models:  from  tables  to  graphs     n A  well  established  and  used  model  in  data  bases  is     the  relational  model,  defined  in  the  ‘70s   n in  these  systems  (RDBMS),  data  are  represented  as   a  set  of  related  tables.  As  an  example: n a  first  table  associates  to  any  patent  its  data,   including  applicant  and  inventor   n other  related  tables  include  further  details  about   applicants  and  inventors n this  paradigm  (table-­based)  is  efficient,  but,  once   designed,  it  is  more  difficult  to  evolve  and  scale   n in  any  case,  it  is  straightforward  to  convert  tables  to   graphs (see  next  slide),  and  this  adds  a  lot  of  flexibility:   this  conversion  is  the  basic  concept  of  Linked  Data IntelliSemantic 8
  • 9. From  tables  to  (triple)  graphs:  an  example     If  we  have  this  table  chunk: IntelliSemantic 9 Patent id Application date applicant -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ XYZ12488 2018/09/01 Company  A -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ -­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐-­‐ Any  cell in  the  table  can  be  represented  as  a  triple  of  values: a) Subject  (the  heading  of  a row,  as  XYZ12488):  represented  as  a  graph  node b) Predicate  (the  heading  of  a  column,  as  application  date):  represented  as  a   graph  edge c) Object  (the  cell  value,  as  Company  A):  represented  as  a  graph  node Hence  these  two  cells  in  the  table  can  also  be  represented  by  this  graph  chunk   If  we  apply  this  trasformation to  any  cell.   the  table is transformed to  a  labeled graph,  which includes nodes (subjects,   objects) and  connections (predicates)
  • 10. From  triple  graphs  to  LD  on  the  web       Linked  Data  (LD)  implement  on  the  net  (internet  or   intranet)  the  graph  representation  just  described.  To  do   that:   n reuse well  known  internet  standards   n to  name  things  (with  URI)   n to  access  them  (with  HTTP)     n define new  kind  of  information,  i.e.  triples n triples  are  not  required  in  the  “web  of  documents” n include  external  references  to  easily  reuse   knowledge  already  defined  in  other  graphs,  as:   n “same  level”  complementary  data,  e.g.  data  from   scientific  papers to  integrate  patent  data n “higher  level”  common  vocabularies,  e.g.  the   definition  of  author.     IntelliSemantic 10
  • 11. SPARQL  to  query  LD:  what  and  why       n what  is n SPARQL  is  the  standard  query  language  for  LD n it  means  “SPARQL  Protocol and  RDF  Query  Language” n SPARQL  has  been  defined  by  W3C n the  last  version  1.1  has  being  released  on  2013   n why  to  use n to  facilitate  the  integration  of  different  data  bases,   independently  from  their  structure n to  facilitate  the  identification  of  eventual  missing  or  bad   data  in  the  original  data  bases   n to  implement  more  efficiently  advanced  applications  on   the  top  of  your  data   n how  to  use n a  LD  is  typically  provided  by  a  SPARQL  end  point,  i.e.   an  internet  address  which  accepts  SPARQL  queries IntelliSemantic 11
  • 12. SPARQL:  the  query n The  query  is  represented  by  a  subgraph on  the  whole   graph,  having  defined  fixed  point(s)  in  the  whole  graph   n e.g.  the  patent  applicant n Different  kind  of  queries  can  be  represented,  from   “usual”  queries  to  more  involved,  still  interesting  queries,   as:   n example  1:  identify  all  patents  of  the  applicant  x   published  by  the  authority  y  in  the  years  zi to  zn n this  is  a  more  usual  example n example  2:  identify  all  patents  which  are  co-­cited  by   at  least  a  member  of  the  patent  family  A  and  by  at  least   a  member  of  the  patent  family  B   n this  is  a  little  more  advanced  example IntelliSemantic 12
  • 13. SPARQL:  results     n SPARQL  results  are  typically  presented  by  tables n e.g.  the  table  of  patents  satisfying  a  query n native  SPARQL  commands  allow   n to  select  the  columns  in  the  results  table n to  order  rows  in  the  results  table n to  extract  summary  values  from  rows,   n hence  to  extract  statistics   n it  is  also  possible  to  make  inferences  from  your  data,   n in  any  case  under  your  control   IntelliSemantic 13
  • 14. SPARQL:  a  simplified  example     n the  question  to  solve:  identify  all  patents  citing  patent  X,   directly  or  indirectly  (i.e.  through  a  chain  of  citations) n the  kernel  of  the  SPARQL  query  is SELECT  DISTINCT  ?s WHERE    {  ?s      cites+    patentX.  } n the  WHERE  clause  identifies  the  subgraph  in  which n the  patentX is  the  object:  it  is  the  patent  you  know   n the  predicate  is  “cites”,  which  includes  in  this  case   the  recursion  symbol  + n the  ?s  are  the  unknown  subjects  to  identify n the  SELECT  DISTINCT  clause  produces  the  output IntelliSemantic 14
  • 15. Linked  Data  accesses         Linked  Data  are  accessed  at  specific  web  addresses  (end-­ point).  Depending  from  the  end-­point,  they  can  be: n linked  public  data,  used  for n open  and  reliable  vocabularies  and  ontologies n open  and  reliable  information,  e.g.  DBpedia provided  that  the  performance  is  adequate  for  you n linked  restricted  data n data  restricted  to  you  and  to  your  business  partners   n linked  internal  data  (behind  your  firewall)  to  access   n your  internal  company  data     Whilst  Linked  Data  emerged  first  as  Linked  Open  Data,  it   is  misleading  to  assume  that  Linked  Data  have  to  be   necessarily  open  as  well.   IntelliSemantic 15
  • 16. What is available in patents
  • 17. The  EPO  LOD:  an  overview         n It  is  one of  the  three kind of  services provided by  EPO  to   application integrators n It is available since April  2018,  at https://www.epo.org/searching-­for-­patents/data/linked-­ open-­data.html#tab-­1 n It is provided under  the  Creative  Common  Attribution 4.0   License,  hence the  integrator: n is free  to  share  and  adapt these data n giving credit  and  without adding restrictions n User  manual and  support forum  available n Panel  access at https://data.epo.org/linked-­data/ n it provides a  SPARQL  query suggestion interface n it provides access to  a  SPARQL  endpoint IntelliSemantic 17
  • 18. The  EPO  LOD:  technical  details     n The  data  base  is EPO-­specific and  includes:     n data  provided by  EPO,  i.e. n EPO  application and  publication bibliographic information,  including text n weekly updated n the  CPCs tree n a  vocabulary of  patent concepts defined by  EPO n links to  other  LOD  key  vocabularies/concepts n most  of  which  originated  by  W3C n Text  search  also  available n which  is  an  interesting  extension   n The  data  base  does  not  include  (yet?): n information  about  the  legal  status     IntelliSemantic 18
  • 19. An  example:  the  EPO  LOD  patent  vocabulary   IntelliSemantic 19 This  vocabulary  developed  by  the  EPO  defines  some  key  concepts   in  patents  (e.g.  application,  publication)  and  can  be  shared  with   other  patent  information  provided  as  linked  data.  This  is  only  a   simplified  sketch,  indeed.
  • 20. The  EPO  LOD:  the  business  model       n free  data  and  access n to  be  used  according  to  the  fair  use  policy,  hence  for   casual  users  and  experiments:   n best  effort  policy  is  applied  for  results:1  minute   after  the  query,  results  are  terminated  in  any  case   n for  production  level  applications,  the  LOD  data  base   has  to  be  transferred  to  the  server  of  the  application   integrator n to  the  purists  of  the  open  web,  this  is  a  limit n for  the  real  business,  this  is  the  example  of  a   realistic  compromise,  which  can  enable  some   more  flexibility  on  the  business  model IntelliSemantic 20
  • 21. Other  LOD  examples  for  patents           n KIPO  LOD   n info  and  data  at http://lod.kipo.kr/ n SPARQL  query access at http://lod.kipo.kr/data/sparql n the  data  base  includes bibliographic and  legal KIPO  info n LOD  support is mentioned the  annual KIPO  report   n it is a  beta  now n US  LOD n info  and  data  at https://old.datahub.io/dataset/linked-­‐uspto-­‐ patent-­‐data n SPARQL  query access at http://us.patents.aksw.org/sparql n it seems still a  demo,  since the  data  content is not updated recently,  and  the  kind of  licence is not mentioned IntelliSemantic 21
  • 23. Bulk  data  base  providers  (patent  offices)         IntelliSemantic 23 n Opportunities   n technical:  the  adoption  of  the  LD  paradigm     facilitates the  verification  and  cleaning  of  your  own   data  base,  hence  it  should  be  promoted  also  as  an   internal  best  practice n Challenges n business:  if a  provider  delivers LD  to  its customers,     it has to  define its license and  its commercial   conditions n beware:  the  provider  is not forced by  the   technology itself to  offer its data  for  free!  This a   misleading objection.
  • 24. Application  developers           n Technical  opportunities:   n the  integration of  patent data  bases provided by   different patent officies will become easier n the  integration of  patent data  with  other information   (e.g.  scientific or  business)  becomes easier n this will support the  extension of  patent platforms to  business  intelligence  platforms n more  advanced functions can  be  provided to  patent professionals and  their environment (i.e.  colleagues and  managers) n Business  challenges: n all the  previous mentioned opportunities will materialize easier if the  LD  paradigm will suitably adopted by  data  base  providers IntelliSemantic 24
  • 25. Patent  professionals         n Technical  opportunities:  have to  opportunity to  rely on: n more  powerful and  diversified queries,  e.g.  to   suitable explore the  citations network   n more  advanced and  flexible analytics tools,  for   deeper patent sets  analyses n more  transparent and  well agreed definition of   concepts,  to  support a  new  generation  of  semantic solutions in  patent informatics n I  define this semantic 3.0  in  patent informatics n more  information  rich tools,  which can  help  them to  support better a  broader set  of  users n Challenges:   n motivations and  willingness to  improve the   profession IntelliSemantic 25
  • 26. Semantics  in  Patent  Informatics:  a  view       Semantics in  Patent Informatics can  be  categorized also by   generations:   n Semantics 1.0  (implicit):  Solutions  and  best  practices in   patent informatics include  some  semantic concepts,  but they are  not mentioned under  this term.  For  example:  IPCs and  CPCs are  actually ontologies. n Semantics 2.0  (explicit):  Natural  language semantics is included in  solutions,  to  add new  ways  of  searching and/or   to  analyze results.  The  word  “semantic”  is now explicitly advertised as such,  for  very diversified solutions indeed.     Some  concerns arise too,  e.g.  about “which kind of   semantic”  is in  place (for  searching?  for  analyzing?). n Semantics 3.0  (standardized):   The  inclusion of  explicit,   agreed and  transparent standard knowledge bases will establish semantics as a  true value added for  the   professional,  not as a  question mark issue.   IntelliSemantic 26
  • 27. Conclusions         n The  adoption of  Linked Data  is a  win /  win opportunity for  all players: n patent offices n application developers n professional users n their environment (colleagues and  managers)   n but the  awareness of  this possible evolution in   patent informatics must  sparkle now! n Linked Data  in  fact: n uses well established international standards n facilitates the  reuse of  existing knowledge data  bases (e.g.  vocabularies) IntelliSemantic 27
  • 28. Questions         n technology-­related questions? n application-­related questions?   n business  model-­related questions? n bulk  data  base  providers? n application developers?   n other questions? IntelliSemantic 28
  • 29. Acknowledgment  and  extensions         n since  the  topic  is  definitely  challenging  to  be   summarized  in  a  20  minute  presentation,  an  open   access paper with  the  same title and  coauthored with   Marco  Ciaramella  will extends this presentation with:   n more  details n patent  specific  examples n bibliography     n the  paper  will  be  accessed  at  https://arxiv.org/ n in  the  section  computers  /  digital  libraries n I  do  acknowledge  Marco  for  the  contribution  to  this   paper  as  well  to  this  presentation   IntelliSemantic 29
  • 30. Contact  information         a)  For  specific  questions  and  information  requests n e-­mail:  alberto.ciaramella@intellisemantic.com n tel.  +39  011  9550  380 b)  For  registering  to  the  IntelliSemantic newsletter,   including  references  to  new  IntelliSemantic papers  and   webinars  announcements   n e-­mail:  newsletter@intellisemantic.com n next  issue:  September  2018 c)  For  general  information  and  first  level  overview: n site  http://www.intellisemantic.com IntelliSemantic 30