SlideShare a Scribd company logo
1 of 21
Download to read offline
Deconstructing Freebase
Manuel de la Villa
Department of Information Technologies
University of Huelva                     Medical-Miner Project
                                         Huelva 22/7/2010
Deconstructing Freebase
• What is it
  – A net? A tool? A kind of Data? wikipedia-turned-
    database?
• The Schema,
  – domains, types, properties. Sources.
• How it works
• How can we use it
  – MQL, API, Acre, RDF

                                               Medical-Miner Project
                                               Huelva 22/7/2010
What is it
• Freebase is a large public database that collects three
  kinds of information:
   – data;
   – texts; and
   – media, that references…
• …entities or topics (≈ 12 million). An entity is a unique
  single person, place, or thing.
       • A single concept or real-world thing.
       • A topic could also be called an entity, resource or element or thing, it
         is a fundamental unit in Freebase.
       • /common/topic
       • Each topic has a Guid or globally unique ID
           – http://www.freebase.com/view/en/barack_obama
           – http://www.freebase.com/guid/9202a8c04000641f800000000029c277


                                                                        Medical-Miner Project
                                                                        Huelva 22/7/2010
What is it (and II)
• Freebase connects entities together as a graph,
   – defines its data structure as a set of nodes and a set of
     links that establish relationships between the nodes.
• Most of our topics are associated with one or
  more types (such as people, places, books,
  films, etc) and may have additional properties like "date of
  birth" for a person or latitude and longitude for a location.
  These types and properties and related concepts are called
  Schema.


                                                          Medical-Miner Project
                                                          Huelva 22/7/2010
The Schema
• We use the term Schema to refer to the way Freebase's data is laid out or
  structured. In the Semantic web world, this is referred to as Ontology.
• Schema in Freebase is expressed through Types and Properties. Types are
  grouped together in Domains.

              What is Schema?
                   Schema describes how data in Freebase is structured.

              Schema is made by creating:
                   Types, which define a particular kind of person, place or thing and contain

                   Properties, which define the unique qualities of a type.



                                       William Shakespeare
                                              is a  →      type     → Person
                                              has a →      property → Date of Birth


                                                                                          Medical-Miner Project
                                                                                          Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema: Medicine




                  Medical-Miner Project
                  Huelva 22/7/2010
Then… is an Ontology?
• Yes and No,

          is a kind of Folksonomy…




                                     Medical-Miner Project
                                     Huelva 22/7/2010
The sources
• Wikipedia, ChefMoz, NNDB, MusicBrainz…
• The Freebase community, with a 2.0 spirit, are:
       •   data contributors,
       •   schema builders,
       •   data curators,
       •   application developers,
       •   and many others who use Freebase
   – Expert status
   – Augmentative edition
• Creative Commons Attribution Licensing, which means
  that it's free for you to browse, query, copy, and even use the data
  in your own systems or software, even for commercial use; all we
  ask in return is that you mention that

                                                               Medical-Miner Project
                                                               Huelva 22/7/2010
How can we use it…
• As a reference or information source
• Create interesting Views and Visualizations and
  share them with others
• Embed Freebase data in your website
• Use our API or Acre, our hosted app development
  platform, to build apps that use Freebase data
• Download our Data dumps
• Use Freebase's RDF for Semantic Web applications


                                              Medical-Miner Project
                                              Huelva 22/7/2010
How can we use it…
• Ways to use Freebase:
  – Use Freebase's Ids to uniquely identify entities
    anywhere on the web
  – Query Freebase's data using MQL
  – Build applications using our API or Acre, the
    hosted development platform




                                                 Medical-Miner Project
                                                 Huelva 22/7/2010
MQL (Metaweb Query Language)
• http://api.freebase.com/api/service/mqlread?query={"query"
  :{"type":"/music/artist","name":"U2","album":[]}}
• Query Editor




                                                     Medical-Miner Project
                                                     Huelva 22/7/2010
MQL (and II)




               Medical-Miner Project
               Huelva 22/7/2010
Freebase API
Freebase's API allows you:
    –  to perform searches and queries against
      Freebase's data, or
    – to write data to Freebase.
    – There are Libraries available for many
      languages.

Searching with Search API
• The search API lets you do a fuzzily-matched
   search for a string that occurs in a topic's
   name, aliases, or description.
Reading
• The mqlread API is for doing structured
   queries against Freebase's schema.
Writing
• The mqlwrite API lets you write data to
   Freebase.


                                                     Medical-Miner Project
                                                     Huelva 22/7/2010
Acre
A hosted application
development platform,
Acre:
   • makes it easy to build and
   share Freebase applications.
   •It encapsulates many of the
   trickier steps (like OAuth
   access for writing to
   Freebase) and
   •lets you view other people's
   apps, clone them, modify
   them, and share your
   improvements.

                                          Medical-Miner Project
                                          Huelva 22/7/2010
Applications… with Acre
• Powerset (NLP search engine)

• Fmdb

• Freebase
     Sets



                                 Medical-Miner Project
                                 Huelva 22/7/2010
RDF




      Medical-Miner Project
      Huelva 22/7/2010
Summarizing
"Freebase is the bridge between the bottom up vision
  of Web 2.0 collective intelligence and the more
  structured world of the semantic web." Tim O'Reilly




                                                Medical-Miner Project
                                                Huelva 22/7/2010

More Related Content

Similar to Deconstructing freebase

Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
Oss and libraries enabling arabic libraries and creating opportunities
Oss and libraries   enabling arabic libraries and creating opportunitiesOss and libraries   enabling arabic libraries and creating opportunities
Oss and libraries enabling arabic libraries and creating opportunitiesMassoud AlShareef
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsJohn Breslin
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web ServicesJose Enrique Ruiz
 
Global lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataGlobal lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataMinerva Lin
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSWSören Auer
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and TechniquesBernhard Haslhofer
 
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012lljohnston
 
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsLinked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsJon Voss
 

Similar to Deconstructing freebase (20)

Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 
Oss and libraries enabling arabic libraries and creating opportunities
Oss and libraries   enabling arabic libraries and creating opportunitiesOss and libraries   enabling arabic libraries and creating opportunities
Oss and libraries enabling arabic libraries and creating opportunities
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social Semantics
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
 
Here Comes Everything
Here Comes EverythingHere Comes Everything
Here Comes Everything
 
Global lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataGlobal lodlam_communities and open cultural data
Global lodlam_communities and open cultural data
 
Linked Data
Linked DataLinked Data
Linked Data
 
International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)
 
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti... NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSW
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
 
CAEPIA 2011
CAEPIA 2011CAEPIA 2011
CAEPIA 2011
 
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
 
International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)
 
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsLinked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & Museums
 
Digital Content Management
Digital Content ManagementDigital Content Management
Digital Content Management
 

More from Manuel de la Villa

Presentación TFG Informes de Alta Automáticos
Presentación TFG Informes de Alta AutomáticosPresentación TFG Informes de Alta Automáticos
Presentación TFG Informes de Alta AutomáticosManuel de la Villa
 
Presentación programa Social Media UHU
Presentación programa Social Media UHUPresentación programa Social Media UHU
Presentación programa Social Media UHUManuel de la Villa
 
Marca personal para community managers
Marca personal para community managersMarca personal para community managers
Marca personal para community managersManuel de la Villa
 
Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 2Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 2Manuel de la Villa
 
Taller Facebook #SMUHU parte 1
Taller Facebook #SMUHU parte 1Taller Facebook #SMUHU parte 1
Taller Facebook #SMUHU parte 1Manuel de la Villa
 
Taller de Presentaciones efectivas
Taller de Presentaciones efectivasTaller de Presentaciones efectivas
Taller de Presentaciones efectivasManuel de la Villa
 
Presentacion Grado en Ingeniería Informática UHU
Presentacion Grado en Ingeniería Informática UHUPresentacion Grado en Ingeniería Informática UHU
Presentacion Grado en Ingeniería Informática UHUManuel de la Villa
 
Curso personal branding profesores
Curso personal branding profesoresCurso personal branding profesores
Curso personal branding profesoresManuel de la Villa
 
Experiences on integrating explicit knowledge on information access tools in ...
Experiences on integrating explicit knowledge on information access tools in ...Experiences on integrating explicit knowledge on information access tools in ...
Experiences on integrating explicit knowledge on information access tools in ...Manuel de la Villa
 
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...
A Biomedical Information Retrieval System  based on Clustering for Mobile Dev...A Biomedical Information Retrieval System  based on Clustering for Mobile Dev...
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...Manuel de la Villa
 
A critical and comparative study about ISO 9001, CMMI and ISO 15504
A critical and comparative study about  ISO 9001, CMMI and ISO 15504A critical and comparative study about  ISO 9001, CMMI and ISO 15504
A critical and comparative study about ISO 9001, CMMI and ISO 15504Manuel de la Villa
 

More from Manuel de la Villa (17)

Mantenimiento del software
Mantenimiento del softwareMantenimiento del software
Mantenimiento del software
 
Presentación TFG Informes de Alta Automáticos
Presentación TFG Informes de Alta AutomáticosPresentación TFG Informes de Alta Automáticos
Presentación TFG Informes de Alta Automáticos
 
Presentación programa Social Media UHU
Presentación programa Social Media UHUPresentación programa Social Media UHU
Presentación programa Social Media UHU
 
Marca personal para community managers
Marca personal para community managersMarca personal para community managers
Marca personal para community managers
 
Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 2Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 2
 
Taller Facebook #SMUHU parte 1
Taller Facebook #SMUHU parte 1Taller Facebook #SMUHU parte 1
Taller Facebook #SMUHU parte 1
 
Personal branding
Personal brandingPersonal branding
Personal branding
 
Taller de Presentaciones efectivas
Taller de Presentaciones efectivasTaller de Presentaciones efectivas
Taller de Presentaciones efectivas
 
Presentacion Grado en Ingeniería Informática UHU
Presentacion Grado en Ingeniería Informática UHUPresentacion Grado en Ingeniería Informática UHU
Presentacion Grado en Ingeniería Informática UHU
 
Curso personal branding profesores
Curso personal branding profesoresCurso personal branding profesores
Curso personal branding profesores
 
Herramientas web 2.0 parte 2
Herramientas web 2.0 parte 2Herramientas web 2.0 parte 2
Herramientas web 2.0 parte 2
 
Herramientas web 2.0 Parte 1
Herramientas web 2.0 Parte 1Herramientas web 2.0 Parte 1
Herramientas web 2.0 Parte 1
 
MVilla IUI 2012 Lisbon
MVilla IUI 2012 LisbonMVilla IUI 2012 Lisbon
MVilla IUI 2012 Lisbon
 
Experiences on integrating explicit knowledge on information access tools in ...
Experiences on integrating explicit knowledge on information access tools in ...Experiences on integrating explicit knowledge on information access tools in ...
Experiences on integrating explicit knowledge on information access tools in ...
 
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...
A Biomedical Information Retrieval System  based on Clustering for Mobile Dev...A Biomedical Information Retrieval System  based on Clustering for Mobile Dev...
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...
 
A critical and comparative study about ISO 9001, CMMI and ISO 15504
A critical and comparative study about  ISO 9001, CMMI and ISO 15504A critical and comparative study about  ISO 9001, CMMI and ISO 15504
A critical and comparative study about ISO 9001, CMMI and ISO 15504
 
Tesina08
Tesina08Tesina08
Tesina08
 

Recently uploaded

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMKumar Satyam
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 

Recently uploaded (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

Deconstructing freebase

  • 1. Deconstructing Freebase Manuel de la Villa Department of Information Technologies University of Huelva Medical-Miner Project Huelva 22/7/2010
  • 2. Deconstructing Freebase • What is it – A net? A tool? A kind of Data? wikipedia-turned- database? • The Schema, – domains, types, properties. Sources. • How it works • How can we use it – MQL, API, Acre, RDF Medical-Miner Project Huelva 22/7/2010
  • 3. What is it • Freebase is a large public database that collects three kinds of information: – data; – texts; and – media, that references… • …entities or topics (≈ 12 million). An entity is a unique single person, place, or thing. • A single concept or real-world thing. • A topic could also be called an entity, resource or element or thing, it is a fundamental unit in Freebase. • /common/topic • Each topic has a Guid or globally unique ID – http://www.freebase.com/view/en/barack_obama – http://www.freebase.com/guid/9202a8c04000641f800000000029c277 Medical-Miner Project Huelva 22/7/2010
  • 4. What is it (and II) • Freebase connects entities together as a graph, – defines its data structure as a set of nodes and a set of links that establish relationships between the nodes. • Most of our topics are associated with one or more types (such as people, places, books, films, etc) and may have additional properties like "date of birth" for a person or latitude and longitude for a location. These types and properties and related concepts are called Schema. Medical-Miner Project Huelva 22/7/2010
  • 5. The Schema • We use the term Schema to refer to the way Freebase's data is laid out or structured. In the Semantic web world, this is referred to as Ontology. • Schema in Freebase is expressed through Types and Properties. Types are grouped together in Domains. What is Schema? Schema describes how data in Freebase is structured. Schema is made by creating: Types, which define a particular kind of person, place or thing and contain Properties, which define the unique qualities of a type. William Shakespeare is a → type → Person has a → property → Date of Birth Medical-Miner Project Huelva 22/7/2010
  • 6. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 7. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 8. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 9. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 10. The Schema: Medicine Medical-Miner Project Huelva 22/7/2010
  • 11. Then… is an Ontology? • Yes and No, is a kind of Folksonomy… Medical-Miner Project Huelva 22/7/2010
  • 12. The sources • Wikipedia, ChefMoz, NNDB, MusicBrainz… • The Freebase community, with a 2.0 spirit, are: • data contributors, • schema builders, • data curators, • application developers, • and many others who use Freebase – Expert status – Augmentative edition • Creative Commons Attribution Licensing, which means that it's free for you to browse, query, copy, and even use the data in your own systems or software, even for commercial use; all we ask in return is that you mention that Medical-Miner Project Huelva 22/7/2010
  • 13. How can we use it… • As a reference or information source • Create interesting Views and Visualizations and share them with others • Embed Freebase data in your website • Use our API or Acre, our hosted app development platform, to build apps that use Freebase data • Download our Data dumps • Use Freebase's RDF for Semantic Web applications Medical-Miner Project Huelva 22/7/2010
  • 14. How can we use it… • Ways to use Freebase: – Use Freebase's Ids to uniquely identify entities anywhere on the web – Query Freebase's data using MQL – Build applications using our API or Acre, the hosted development platform Medical-Miner Project Huelva 22/7/2010
  • 15. MQL (Metaweb Query Language) • http://api.freebase.com/api/service/mqlread?query={"query" :{"type":"/music/artist","name":"U2","album":[]}} • Query Editor Medical-Miner Project Huelva 22/7/2010
  • 16. MQL (and II) Medical-Miner Project Huelva 22/7/2010
  • 17. Freebase API Freebase's API allows you: – to perform searches and queries against Freebase's data, or – to write data to Freebase. – There are Libraries available for many languages. Searching with Search API • The search API lets you do a fuzzily-matched search for a string that occurs in a topic's name, aliases, or description. Reading • The mqlread API is for doing structured queries against Freebase's schema. Writing • The mqlwrite API lets you write data to Freebase. Medical-Miner Project Huelva 22/7/2010
  • 18. Acre A hosted application development platform, Acre: • makes it easy to build and share Freebase applications. •It encapsulates many of the trickier steps (like OAuth access for writing to Freebase) and •lets you view other people's apps, clone them, modify them, and share your improvements. Medical-Miner Project Huelva 22/7/2010
  • 19. Applications… with Acre • Powerset (NLP search engine) • Fmdb • Freebase Sets Medical-Miner Project Huelva 22/7/2010
  • 20. RDF Medical-Miner Project Huelva 22/7/2010
  • 21. Summarizing "Freebase is the bridge between the bottom up vision of Web 2.0 collective intelligence and the more structured world of the semantic web." Tim O'Reilly Medical-Miner Project Huelva 22/7/2010