SlideShare a Scribd company logo
1 of 69
Algorithms and Tools
Information Extraction
from the Web
Benjamin Habegger
University of Lyon, CNRS, INSA-Lyon, LIRIS, UMR5205
Seminary on Information Extraction from the Web
ENSIAS, Rabat, Morocco - June 19, 2013
About Me
@b_habegger
http://www.linkedin.com/in/benjaminhabegger
benjamin.habegger@insa-lyon.fr
Overview
● Fundamentals of information extraction from the web
– Document representations
– Approaches
● Algorithms to extract information from semi-structured web content
– Wien, Stalker, DIPRE, IERel
● Tools to describe and web scrappers
– WetDL, WebSource
● Applications and extensions of information extraction
– Making our human web smarter
– Learning mappings for data integration
What types of data are we taking about ?
Types of data on the Web
● Structured
● Unstructured
● Semi-structured
Types of data on the Web
● Structured
● Unstructured
● Semi-structured
Types of data on the Web
● Structured
● Unstructured
● Semi-structured
Semi-structured data
● Usually, but not limited to, data from a
database formatted as HTML
● Listings of entities
● Presented in a “regular” presentation format
Multiple possible representations
(DOM) Tree
Rendered page
<tr class="participant">
  <td class="pname" id="part1968752570">
     […]
     <div class="pname">Benjamin</div>
  </td>
  […]
</tr>
String
<tr class="participant">
  <td class="pname" id="part1968752570">
     […]
     <div class="pname">Benjamin</div>
  </td>
  […]
</tr>
HTML string
What do we want to do with those documents ?
Information extraction from the web
monster.frmonster.fr apec.frapec.fr remixjobs.comremixjobs.com
Job DatabaseJob Database
Information extraction from the web
● Extract data from one or more web sites
● Wrap it into a predefined target format
How do we do this ?
Wrappers (scrapper)
monster.fr apec.fr remixjobs.com
Job Database
Algorithms to learn wrappers
● Wien
● Stalker
● SoftMealy
● IEPad
● RoadRunner
● DIPRE
● IERel
● TreePat Miner
● Squirrel
Wrapper representations
● A program
● A transducer (string or tree)
● A regular expression
● A tree pattern
● A query
Document and wrapper
representations
Algorithm Document
Model
Query/Wrapper
Model
Wien [Kushmerick] String LR-Patterns
Stalker [Muslea] String Delimiter-rules
SoftMealy [Hsu & Dung] Analysed String Transducer
IERel [Habegger] HTML String *-Patterns
Squirrel [Carme] DOM Tree Tree Automata
Habegger & Debarbieux DOM Tree Tree-Pattern Queries
SoftMealy
SoftMealy
● Input:
– Completely labeled document
● Preprocessing:
– Tokenize input string
● Output:
– A transducer
SoftMealy: Document
Representation
Symbol Description
CAlph(x) String composed of only capitals
C1Alph(x) Strinng starting with a capital
Num(x) Numerical string
Html(x) An HTML tag
OAlph(x) String of alpha-numerical characters
Punc(x) Punctuation symbol
NL(n) n line feeds
Tab(n) n tabulations
Spc(n) n spaces
SoftMealy: Algorithm
N E O
SoftMealy: Results
SoftMealy: Conclusion
● String-based wrapper induction algorithm
● Patterns which take format into account
→ Improvement over WIEN
● As WIEN & Stalker
– imposes much labeling
– “batch” approach
RoadRunner
RoadRunner
● Input:
– Collection of sample pages
● Algorithm
– Induce structural pattern from the pages
● Output
– A DTD-like schema structure for the documents
RoadRunner: Example
RoadRunner: Results
RoadRunner
● Wraps regularities into a page pattern
– Compacts structure
● Structural item of the found schema NOT
mapped to a target schema
● Option: uses output as input of a mapping
mining algorithm
DIPRE
Dipre [Brin1998]
● Input:
– Example instances of a relation to be extracted
– A collection of web documents
● Output:
– Patterns to be applied to the collection
– (New) instances extracted using the patterns
DIPRE: Relation extraction from a
web cache
Web Cache
Relation
Instances
Very Basic
Extraction
Patterns
Dipre
● Interesting cyclic process
● Very (too) simple patterns for IE
● Problem of over-generalizations
● Pattern set drifting from their extraction target
IERel
IERel
● Input:
– Examples of a relation to be extracted
● Algorithm
– Extract patterns & generalize them
● Output
– Extraction patterns
IERel: Document representation
<tr class="participant">
<td class="pname" id="part1968752570">
[…]
<div class="pname">
B
e
n
j
a
m
i
n
</div>
</td>
[…]
</tr>
§1§
§2§
[…]
§3§
B
e
n
j
a
m
i
n
§4§
§5§
[…]
§6§
IERel: Generalization
<tr class="participant">
<td class="pname" id="part825438027">
[…]
<div class="pname">
M
o
h
a
m
e
d
</div>
</td>
[…]
</tr>
§1§
§7§
[…]
§3§
M
o
h
a
m
e
d
§4§
§5§
[…]
§6§
IERel: Generalization
§1§
§7§
[…]
§3§
M
o
h
a
m
e
d
§4§
§5§
[…]
§6§
§1§
§2§
[…]
§3§
B
e
n
j
a
m
i
n
§4§
§5§
[…]
§6§
§1§
*
[…]
§3§
*
§4§
§5§
[…]
§6§
IERel: Interactive Learning
Examples
Extracted Results
Patterns
Refined
Patterns
Refined
Patterns
New examples / Negate wrong ones
Results using refined patterns
Coping with over-generalization
Learn a set of patterns
i.e.
a disjunction of conjunctions
IERel: Pattern construction
IERel: Evaluation
● Multiple tested domains
– Online directories
– Search engine results
– Product catalogs
Demo
IERel: Example entropy
IERel: Conclusion
● Labeling can be limited
● Underlines the interest for interactive learning
Other representations
Learning Tree Pattern Queries
Maximal weight generalization
Other algorithms on trees
● Carme et al.
– inducing node selecting tree automata
● Marty et al.
– Tabluar descriptions of nodes to be selected
– Using classification techiques
We can extract data from the web.
Now what ?
Extraction is not all
WetDL
– Query
– Fetch
– Parse
– Extract
– Transform
– External
● Workflow description of a web navigation patterns
● An execution model
● A collection of meta-operators
Semantics of a WetDL workflow
● Nodes are processors
– Receive messages through a queue
– Process and dispatch the result messages
● A processor may generate 0, 1 or n messages
● Workflow terminates when all queues are empty
WebSource: execute WetDL flows
● Each node can:
– enqueue data (push)
– generate data (pull)
● Processing can occur:
– on push (forward chaining)
– on pull (backward chaining)
WetDL
● Simple description of navigation patterns
– Straightforward operators in the context of IE
● Powerful expressiveness (in particular for IE)
– We can describe most (if not all) web information
extraction tasks
WebSource
Open-source WetDL interpreter
http://websource.sf.net/
Applications and extensions
Semabot: Motivation
What does the following query give ?
“lyon informatique emploi”
Semabot: Motivation
A list of documents containing the terms
“lyon”
“informatique”
“emploi”
Semabot: Objectives
The query “lyon informatique emploi”
should give:
A list of computer engineer job offers
Semabot
● Registry of “object” schemas and wrappers
● Wrappers generate “objects”
– Job offers, People, Products, etc.
● Crawler wraps pages and indexes objects
Semabot: Open problems
● Wrap the web into objects
– i.e. what we have seen in this seminar ;)
● Interpret (some of) the terms of the query
– “lyon” => http://en.wikipedia.org/wiki/Lyon
– “emploi” => http://en.wikipedia.org/wiki/Job_(role)
Information Extraction
● WHAT ?
– Make content adapted to human consumption as
content consumable by a target schema
● HOW ?
– Using machine learning approaches
Data Integration
● WHAT ?
– Make content adapted to human consumption as
content consumable by a target schema
● HOW ?
– Using machine learning approaches
to a source schema
Data Integration
DB 1
Schema 1
App ASchema 2
Mappings Query Rewriting
Extracting = Mapping
Data model Query Super Model
String Regular Expressions / Automata
Tree Xpath Expressions
Relational data SQL/SPARQL Expressions
Wrapping HTML to RDF
<li id=”gs2”>
<b>Samsung Galaxy S II</b>
<i>300 EUR</i> <br />
Vendor: charly@example.com
</li>
● Samsung Galaxy S 300 EUR
Vendor: charly@example.com
http://phones.example.com/samsung/charly/#gs2
name price vendor
Samsung Galaxy S II
300 EUR
charly@example.com
Wrap-up
● Tour of information extraction
– Learning wrappers
– Building IE tasks
● Link with semantic web/open data
● Link with data integration
Perspectives
● Further explore the potential interactive learning
● Learning navigation patterns
● Search of “objects” rather than documents
● Extension of interaction cycle
– pattern generation
– some form of automated pattern evaluation
– continuous (re)learning
Thank you
@b_habegger
http://www.linkedin.com/in/benjaminhabegger
benjamin.habegger@insa-lyon.fr

More Related Content

What's hot

Email Data Cleaning
Email Data CleaningEmail Data Cleaning
Email Data Cleaning
feiwin
 
Presentation of OpenNLP
Presentation of OpenNLPPresentation of OpenNLP
Presentation of OpenNLP
Robert Viseur
 
Question Answering with Lydia
Question Answering with LydiaQuestion Answering with Lydia
Question Answering with Lydia
Jae Hong Kil
 
Unknown Word 08
Unknown Word 08Unknown Word 08
Unknown Word 08
Jason Yang
 
Query Translation for Data Sources with Heterogeneous Content Semantics
Query Translation for Data Sources with Heterogeneous Content Semantics Query Translation for Data Sources with Heterogeneous Content Semantics
Query Translation for Data Sources with Heterogeneous Content Semantics
Jie Bao
 
Scalable Discovery Of Hidden Emails From Large Folders
Scalable Discovery Of Hidden Emails From Large FoldersScalable Discovery Of Hidden Emails From Large Folders
Scalable Discovery Of Hidden Emails From Large Folders
feiwin
 
Pattern Mining To Unknown Word Extraction (10
Pattern Mining To Unknown Word Extraction (10Pattern Mining To Unknown Word Extraction (10
Pattern Mining To Unknown Word Extraction (10
Jason Yang
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Lucidworks
 

What's hot (20)

Email Data Cleaning
Email Data CleaningEmail Data Cleaning
Email Data Cleaning
 
Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Tex...
Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Tex...Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Tex...
Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Tex...
 
NLP Project Full Cycle
NLP Project Full CycleNLP Project Full Cycle
NLP Project Full Cycle
 
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...
RuleML2015: Rule Generalization Strategies in Incremental Learning of Disjunc...
 
Presentation of OpenNLP
Presentation of OpenNLPPresentation of OpenNLP
Presentation of OpenNLP
 
Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)
 
Applications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and ClassificationApplications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and Classification
 
Can functional programming be liberated from static typing?
Can functional programming be liberated from static typing?Can functional programming be liberated from static typing?
Can functional programming be liberated from static typing?
 
Question Answering with Lydia
Question Answering with LydiaQuestion Answering with Lydia
Question Answering with Lydia
 
Unknown Word 08
Unknown Word 08Unknown Word 08
Unknown Word 08
 
Arcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedArcomem training entities-and-events_advanced
Arcomem training entities-and-events_advanced
 
OUTDATED Text Mining 4/5: Text Classification
OUTDATED Text Mining 4/5: Text ClassificationOUTDATED Text Mining 4/5: Text Classification
OUTDATED Text Mining 4/5: Text Classification
 
Entity Linking
Entity LinkingEntity Linking
Entity Linking
 
Query Translation for Data Sources with Heterogeneous Content Semantics
Query Translation for Data Sources with Heterogeneous Content Semantics Query Translation for Data Sources with Heterogeneous Content Semantics
Query Translation for Data Sources with Heterogeneous Content Semantics
 
Scalable Discovery Of Hidden Emails From Large Folders
Scalable Discovery Of Hidden Emails From Large FoldersScalable Discovery Of Hidden Emails From Large Folders
Scalable Discovery Of Hidden Emails From Large Folders
 
Webinar: Simpler Semantic Search with Solr
Webinar: Simpler Semantic Search with SolrWebinar: Simpler Semantic Search with Solr
Webinar: Simpler Semantic Search with Solr
 
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasksTopic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
 
Word Tagging with Foundational Ontology Classes
Word Tagging with Foundational Ontology ClassesWord Tagging with Foundational Ontology Classes
Word Tagging with Foundational Ontology Classes
 
Pattern Mining To Unknown Word Extraction (10
Pattern Mining To Unknown Word Extraction (10Pattern Mining To Unknown Word Extraction (10
Pattern Mining To Unknown Word Extraction (10
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
 

Viewers also liked

Be able to extract information from written sources
Be able to extract information from written sourcesBe able to extract information from written sources
Be able to extract information from written sources
kim2612
 
Multimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location RetrievalMultimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location Retrieval
Svitlana volkova
 
Group-13 Project 15 Sub event detection on social media
Group-13 Project 15 Sub event detection on social mediaGroup-13 Project 15 Sub event detection on social media
Group-13 Project 15 Sub event detection on social media
Ahmedali Durga
 
[EN] Capture Indexing & Auto-Classification | DLM Forum Industry Whitepaper 0...
[EN] Capture Indexing & Auto-Classification | DLM Forum Industry Whitepaper 0...[EN] Capture Indexing & Auto-Classification | DLM Forum Industry Whitepaper 0...
[EN] Capture Indexing & Auto-Classification | DLM Forum Industry Whitepaper 0...
PROJECT CONSULT Unternehmensberatung Dr. Ulrich Kampffmeyer GmbH
 
Web Information Extraction Learning based on Probabilistic Graphical Models
Web Information Extraction Learning based on Probabilistic Graphical ModelsWeb Information Extraction Learning based on Probabilistic Graphical Models
Web Information Extraction Learning based on Probabilistic Graphical Models
GUANBO
 
System for-health-diagnosis
System for-health-diagnosisSystem for-health-diagnosis
System for-health-diagnosis
ask2372
 
A survey of_eigenvector_methods_for_web_information_retrieval
A survey of_eigenvector_methods_for_web_information_retrievalA survey of_eigenvector_methods_for_web_information_retrieval
A survey of_eigenvector_methods_for_web_information_retrieval
Chen Xi
 
Open Information Extraction 2nd
Open Information Extraction 2ndOpen Information Extraction 2nd
Open Information Extraction 2nd
hit_alex
 

Viewers also liked (20)

Information Extraction
Information ExtractionInformation Extraction
Information Extraction
 
Named Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationNamed Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 Presentation
 
Data and Information Extraction on the Web
Data and Information Extraction on the WebData and Information Extraction on the Web
Data and Information Extraction on the Web
 
IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)IE: Named Entity Recognition (NER)
IE: Named Entity Recognition (NER)
 
Enterprise information extraction: recent developments and open challenges
Enterprise information extraction: recent developments and open challengesEnterprise information extraction: recent developments and open challenges
Enterprise information extraction: recent developments and open challenges
 
Information Extraction from Web-Scale N-Gram Data
Information Extraction from Web-Scale N-Gram DataInformation Extraction from Web-Scale N-Gram Data
Information Extraction from Web-Scale N-Gram Data
 
Be able to extract information from written sources
Be able to extract information from written sourcesBe able to extract information from written sources
Be able to extract information from written sources
 
Multimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location RetrievalMultimodal Information Extraction: Disease, Date and Location Retrieval
Multimodal Information Extraction: Disease, Date and Location Retrieval
 
IRE- Algorithm Name Detection in Research Papers
IRE- Algorithm Name Detection in Research PapersIRE- Algorithm Name Detection in Research Papers
IRE- Algorithm Name Detection in Research Papers
 
Group-13 Project 15 Sub event detection on social media
Group-13 Project 15 Sub event detection on social mediaGroup-13 Project 15 Sub event detection on social media
Group-13 Project 15 Sub event detection on social media
 
[EN] Capture Indexing & Auto-Classification | DLM Forum Industry Whitepaper 0...
[EN] Capture Indexing & Auto-Classification | DLM Forum Industry Whitepaper 0...[EN] Capture Indexing & Auto-Classification | DLM Forum Industry Whitepaper 0...
[EN] Capture Indexing & Auto-Classification | DLM Forum Industry Whitepaper 0...
 
Mining Product Synonyms - Slides
Mining Product Synonyms - SlidesMining Product Synonyms - Slides
Mining Product Synonyms - Slides
 
Web Information Retrieval and Mining
Web Information Retrieval and MiningWeb Information Retrieval and Mining
Web Information Retrieval and Mining
 
Web Information Extraction Learning based on Probabilistic Graphical Models
Web Information Extraction Learning based on Probabilistic Graphical ModelsWeb Information Extraction Learning based on Probabilistic Graphical Models
Web Information Extraction Learning based on Probabilistic Graphical Models
 
System for-health-diagnosis
System for-health-diagnosisSystem for-health-diagnosis
System for-health-diagnosis
 
A survey of_eigenvector_methods_for_web_information_retrieval
A survey of_eigenvector_methods_for_web_information_retrievalA survey of_eigenvector_methods_for_web_information_retrieval
A survey of_eigenvector_methods_for_web_information_retrieval
 
Information_retrieval_and_extraction_IIIT
Information_retrieval_and_extraction_IIITInformation_retrieval_and_extraction_IIIT
Information_retrieval_and_extraction_IIIT
 
Open Information Extraction 2nd
Open Information Extraction 2ndOpen Information Extraction 2nd
Open Information Extraction 2nd
 
Information Retrieval and Extraction
Information Retrieval and ExtractionInformation Retrieval and Extraction
Information Retrieval and Extraction
 
Algorithm Name Detection & Extraction
Algorithm Name Detection & ExtractionAlgorithm Name Detection & Extraction
Algorithm Name Detection & Extraction
 

Similar to Information Extraction from the Web - Algorithms and Tools

PhD Presentation
PhD PresentationPhD Presentation
PhD Presentation
mskayed
 

Similar to Information Extraction from the Web - Algorithms and Tools (20)

Webinar: Scaling MongoDB
Webinar: Scaling MongoDBWebinar: Scaling MongoDB
Webinar: Scaling MongoDB
 
FDMEE Scripting - Cloud and On-Premises - It Ain't Groovy, But It's My Bread ...
FDMEE Scripting - Cloud and On-Premises - It Ain't Groovy, But It's My Bread ...FDMEE Scripting - Cloud and On-Premises - It Ain't Groovy, But It's My Bread ...
FDMEE Scripting - Cloud and On-Premises - It Ain't Groovy, But It's My Bread ...
 
Big data analysis in python @ PyCon.tw 2013
Big data analysis in python @ PyCon.tw 2013Big data analysis in python @ PyCon.tw 2013
Big data analysis in python @ PyCon.tw 2013
 
Software Engineering
Software EngineeringSoftware Engineering
Software Engineering
 
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
 
The Quest for an Open Source Data Science Platform
 The Quest for an Open Source Data Science Platform The Quest for an Open Source Data Science Platform
The Quest for an Open Source Data Science Platform
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
More about PHP
More about PHPMore about PHP
More about PHP
 
What's Your Super-Power? Mine is Machine Learning with Oracle Autonomous DB.
What's Your Super-Power? Mine is Machine Learning with Oracle Autonomous DB.What's Your Super-Power? Mine is Machine Learning with Oracle Autonomous DB.
What's Your Super-Power? Mine is Machine Learning with Oracle Autonomous DB.
 
Intro_2.ppt
Intro_2.pptIntro_2.ppt
Intro_2.ppt
 
Intro.ppt
Intro.pptIntro.ppt
Intro.ppt
 
Intro.ppt
Intro.pptIntro.ppt
Intro.ppt
 
Continuous delivery for machine learning
Continuous delivery for machine learningContinuous delivery for machine learning
Continuous delivery for machine learning
 
Company Visitor Management System Report.docx
Company Visitor Management System Report.docxCompany Visitor Management System Report.docx
Company Visitor Management System Report.docx
 
Python ml
Python mlPython ml
Python ml
 
JavaScripts & jQuery
JavaScripts & jQueryJavaScripts & jQuery
JavaScripts & jQuery
 
PhD Presentation
PhD PresentationPhD Presentation
PhD Presentation
 
Intro to web scraping with Python
Intro to web scraping with PythonIntro to web scraping with Python
Intro to web scraping with Python
 
Introduction to AngularJS
Introduction to AngularJSIntroduction to AngularJS
Introduction to AngularJS
 
MongoDB Days Germany: Data Processing with MongoDB
MongoDB Days Germany: Data Processing with MongoDBMongoDB Days Germany: Data Processing with MongoDB
MongoDB Days Germany: Data Processing with MongoDB
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Recently uploaded (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 

Information Extraction from the Web - Algorithms and Tools