1. Ontology creation, extraction, and maintenance
6th AOS Workshop
Vila Real (Portugal)
26-27 July 2005
Food and Agriculture Organization of the UN
Library and Documentation Systems Division
Discussion forum n.1
Chair: Anita Liang
July 2005
2. The AOS/CS Workbench
• Support and manage the multi-language terminology work of information management specialists in the development, maintenance, and quality assurance of the AOS/CS
3. The AOS/CS Workbench
• Features
– Text Processing
– Corpus Creation
– Corpus Analysis
– Term/Relationship Management
– Quality Assurance
– Versioning and Deployment
4. Concept Hierarchy
[Diagram: text corpus (.doc, .pdf, .xml, etc.); AOS/CS Workbench (concordance, pattern-matching, multilingual)]
5. Tool features: Text Processing Capabilities
• Multilingual
• Font support for Chinese and Arabic at minimum, also Lao, Thai
• Other
– Entity extraction
– POS tagging
– Parsing
6. Tool Features: Corpus Creation and Maintenance
• Spidering tools (http://www.manageability.org/blog/stuff/open-source-web-crawlers-java/view)
• Document input and storage
– .doc, .pdf, .html, .xml
• Text extraction (http://multivalent.sourceforge.net/)
• Domain-specific repositories
– specifiable: agriculture, chemistry
– combine and remove
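The "combine and remove" operations on domain-specific repositories could look roughly like the following. This is a minimal sketch, assuming a repository is simply a named set of document paths. The class `Corpus`, the functions `combine` and `remove`, and the sample file names are hypothetical and are not taken from the Workbench.

```python
from dataclasses import dataclass, field


@dataclass
class Corpus:
    """A named collection of document paths for one domain (hypothetical model)."""
    domain: str
    documents: set[str] = field(default_factory=set)


def combine(name: str, *corpora: Corpus) -> Corpus:
    """Merge several corpora into a new one; duplicate paths are kept once."""
    merged = Corpus(domain=name)
    for corpus in corpora:
        merged.documents |= corpus.documents
    return merged


def remove(corpus: Corpus, unwanted: Corpus) -> Corpus:
    """Return a copy of `corpus` without the documents found in `unwanted`."""
    return Corpus(corpus.domain, corpus.documents - unwanted.documents)


# Made-up sample data for illustration only.
agriculture = Corpus("agriculture", {"maize.pdf", "soil.doc", "pesticides.xml"})
chemistry = Corpus("chemistry", {"pesticides.xml", "solvents.html"})

both = combine("agriculture+chemistry", agriculture, chemistry)
agri_only = remove(agriculture, chemistry)
```

Both functions return new objects, so the source repositories stay unchanged and can be recombined later.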
7. Tool Features: Corpus Analysis
• Text file management
– Add/delete files
– Add/delete directories
– Add/delete URLs
• Search:
– Word (or part-word) or phrase; string, regular expression, tag search
– Number of hits
– Case (in)sensitive
• Display:
– hide keyword option
– toggle between a KWIC format and sentence mode
• Sorting
– 1L (First Left), 1R, 2L, 2R, as well as by search word and by text order;
– primary and secondary sorts (e.g., first right, then first left).
• Frequency information
– display in alphabetical or frequency order of words
• Collocates
– Search collocates of spans from 1L-1R to 4L-4R
– Collocate highlighting
• Output: The concordance results can be saved to a file and/or printed.
• Pattern-matching with pattern language
• Other: pattern-matching using POS tags, parallel text concordancing
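The KWIC display, the 1L/1R sorting, and the collocate spans listed above can be illustrated with a short sketch. It assumes a deliberately simple word tokenizer and exact keyword matching. The function names (`tokenize`, `kwic`, `sort_hits`, `collocates`) and the sample sentence are hypothetical and do not describe the Workbench's actual concordancer.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase word tokens (a deliberately simple tokenizer)."""
    return re.findall(r"\w+", text.lower())


def kwic(tokens, keyword, width=2):
    """Return (left context, keyword, right context) for every hit of `keyword`."""
    hits = []
    for i, token in enumerate(tokens):
        if token == keyword:
            left = tokens[max(0, i - width):i]
            right = tokens[i + 1:i + 1 + width]
            hits.append((left, token, right))
    return hits


def sort_hits(hits, primary="1R", secondary="1L"):
    """Sort hits by positions such as 1L (first left) or 2R (second right)."""
    def word_at(hit, position):
        left, _, right = hit
        offset, side = int(position[:-1]), position[-1]
        context = left[::-1] if side == "L" else right  # nearest word first
        return context[offset - 1] if offset <= len(context) else ""
    return sorted(hits, key=lambda h: (word_at(h, primary), word_at(h, secondary)))


def collocates(hits, span=1):
    """Count words within `span` (>= 1) positions left or right of the keyword."""
    counts = Counter()
    for left, _, right in hits:
        counts.update(left[-span:] + right[:span])
    return counts


# Made-up sample text for illustration only.
text = "Rice yield rose. Maize yield fell. Rice yield data improved."
hits = sort_hits(kwic(tokenize(text), "yield"))
for left, key, right in hits:
    print(f"{' '.join(left):>12} [{key}] {' '.join(right)}")
```

Here the primary sort is first right and the secondary sort is first left, matching the example on the slide. Passing `span=4` to `collocates` (with `width=4` in `kwic`) would cover the 4L-4R case.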
8. Tool Features: Automatic KOS Search
• Specify online KOS URLs
• Automatic suggested parent and placement within hierarchy
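One way to picture automatic parent suggestion is a head-word heuristic: a new multi-word term is placed under the longest known term that matches its trailing words. The sketch below is only an assumption about how such a suggestion could work, not the Workbench's method. It also assumes the terms have already been retrieved from the specified KOS URLs into a dictionary. The function `suggest_parent` and the sample hierarchy are hypothetical.

```python
def suggest_parent(new_term, known_terms):
    """Suggest the longest known term that forms the head (trailing words) of `new_term`."""
    words = new_term.lower().split()
    # Try progressively shorter tails: "sweet yellow maize" -> "yellow maize" -> "maize".
    for start in range(1, len(words)):
        candidate = " ".join(words[start:])
        if candidate in known_terms:
            return candidate
    return None


# Hypothetical fragment of a hierarchy: child term -> parent term.
hierarchy = {"cereals": None, "maize": "cereals", "rice": "cereals"}

parent = suggest_parent("sweet maize", hierarchy)
if parent is not None:
    hierarchy["sweet maize"] = parent  # place the new term under the suggested parent
```

A single-word term such as "barley" yields no suggestion under this heuristic, so a human editor would still need to place it.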
9. Tool Features: Term/Relationship Management
• Modifications (AGROVOC Maintenance Tool)
– term: add, delete, edit
– relationship: add, delete, edit
• Machine learning (Annotation Tool)
– WordNet
– AGROVOC itself
– other thesauri
• Batch/bulk modifications based on patterns and structure (rules-as-you-go)
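The add/delete/edit operations and pattern-based bulk modification can be sketched with a small in-memory store. This is an illustrative model under stated assumptions: the class `Thesaurus`, its method names, the "BT" (broader term) relation label, and the sample terms are hypothetical and are not the AGROVOC Maintenance Tool's API.

```python
import re


class Thesaurus:
    """In-memory store of terms and typed relationships (hypothetical model)."""

    def __init__(self):
        self.terms = set()
        self.relations = set()  # (source term, relation type, target term)

    def add_term(self, term):
        self.terms.add(term)

    def delete_term(self, term):
        """Delete a term together with every relationship that mentions it."""
        self.terms.discard(term)
        self.relations = {r for r in self.relations if term not in (r[0], r[2])}

    def edit_term(self, old, new):
        """Rename a term and rewrite the relationships that point to it."""
        self.terms.discard(old)
        self.terms.add(new)
        self.relations = {
            (new if s == old else s, kind, new if t == old else t)
            for s, kind, t in self.relations
        }

    def add_relation(self, source, kind, target):
        if source not in self.terms or target not in self.terms:
            raise KeyError("both terms must exist before they can be related")
        self.relations.add((source, kind, target))

    def bulk_edit(self, pattern, replacement):
        """Apply one regular-expression rewrite to every matching term."""
        for term in sorted(self.terms):  # iterate over a copy while renaming
            renamed = re.sub(pattern, replacement, term)
            if renamed != term:
                self.edit_term(term, renamed)


thesaurus = Thesaurus()
for term in ("cereals", "maize (plant)", "rice (plant)"):
    thesaurus.add_term(term)
thesaurus.add_relation("maize (plant)", "BT", "cereals")
thesaurus.bulk_edit(r" \(plant\)$", "")  # strip a qualifier from all terms at once
```

Because `edit_term` rewrites relationships as well as labels, a bulk rename keeps the existing links intact.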
10. AOS/CS Workbench
11. Tool Features: Versioning and deployment
– CVS-type system to check out and check in changes
– Administrator-level functionalities for publishing versions
– Language-level versioning
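A check-out/check-in cycle might be sketched as follows. For simplicity the sketch uses an exclusive lock per record, which is stricter than CVS's own merge-based model. The class `VersionedTerm` and the user name are hypothetical.

```python
class VersionedTerm:
    """One term record with an exclusive check-out lock and a revision history."""

    def __init__(self, label):
        self.history = [label]   # revision 0 is the initial label
        self.locked_by = None

    def check_out(self, user):
        if self.locked_by is not None:
            raise RuntimeError(f"already checked out by {self.locked_by}")
        self.locked_by = user
        return self.history[-1]  # working copy of the latest revision

    def check_in(self, user, new_label):
        if self.locked_by != user:
            raise PermissionError("check in requires a prior check-out by the same user")
        self.history.append(new_label)
        self.locked_by = None
        return len(self.history) - 1  # number of the new revision


record = VersionedTerm("maize")
working_copy = record.check_out("editor_a")
revision = record.check_in("editor_a", working_copy.capitalize())
```

Keeping the full history in a list means any earlier revision can be inspected or restored. Language-level versioning could, under the same assumptions, keep one such record per language.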
12. Tool Features: Quality Assurance
• Logging and reporting of user actions
• Confirmation/verification of user actions
• User rights management
• Ownership
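Logging, reporting, and rights management can be combined in one small sketch: every attempted action is checked against a role table and recorded whether or not it was allowed. The role names, the `RIGHTS` table, and the functions `perform` and `report` are assumptions made for illustration only.

```python
from datetime import datetime, timezone

# Hypothetical role table: which actions each role may perform.
RIGHTS = {
    "viewer": {"read"},
    "editor": {"read", "edit"},
    "administrator": {"read", "edit", "publish"},
}

audit_log = []  # one entry per attempted action, allowed or not


def perform(user, role, action, target):
    """Check the user's rights, record the attempt, and report whether it was allowed."""
    allowed = action in RIGHTS.get(role, set())
    audit_log.append({
        "time": datetime.now(timezone.utc).isoformat(),
        "user": user,
        "action": action,
        "target": target,
        "allowed": allowed,
    })
    return allowed


def report(user):
    """Return the logged actions of one user, oldest first."""
    return [entry for entry in audit_log if entry["user"] == user]


perform("editor_a", "editor", "edit", "maize")
perform("editor_a", "editor", "publish", "maize")  # refused: editors cannot publish
```

Recording refused attempts as well as successful ones gives reviewers a complete trail for quality assurance.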
13. Technical Platform Details
• Centralized relational database backend
– local and remote databases?
– is there a need for (1) referential integrity, triggers, etc., or (2) can we get by with publishing and storage?
• (1): PostgreSQL
• (2): MySQL
• Web-based GUI
• Distributed client-server architecture
• Java-based
• Scalability and performance for network
– web services
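To make the referential-integrity question concrete, the sketch below shows a database rejecting a relationship that points to a nonexistent term. It uses Python's built-in `sqlite3` module purely as a stand-in because it needs no server. The slide itself names PostgreSQL for this option. The table and column names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript("""
    CREATE TABLE term (
        id    INTEGER PRIMARY KEY,
        label TEXT NOT NULL UNIQUE
    );
    CREATE TABLE relation (
        source_id INTEGER NOT NULL REFERENCES term(id),
        kind      TEXT    NOT NULL,
        target_id INTEGER NOT NULL REFERENCES term(id)
    );
""")

conn.execute("INSERT INTO term (id, label) VALUES (1, 'cereals'), (2, 'maize')")
conn.execute("INSERT INTO relation VALUES (2, 'BT', 1)")

try:
    # Term 99 does not exist, so the database rejects the dangling relationship.
    conn.execute("INSERT INTO relation VALUES (2, 'BT', 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True
```

With option (2), checks of this kind would have to be done in application code instead of by the database.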
14. Workshops & Training
• Promote the Workbench
• Contact AGROVOC center of excellence
• Nominate AGROVOC managers
• Organize workshop
• Organize training
15. Discussion points
• Tools (workbench)
• Modeling concept / term / string
• Concepts vs instances
• Knowledge representation language (OWL, SKOS)
• Updating: is it a problem?