Information in textual data doesn’t cross-borders. Semantic interoperability is not a translation problem but rather an issue of meaning equivalence across languages. This can be achieved by multilingual linking stored in a knowledge system.
Schema on read is obsolete. Welcome metaprogramming..pdf
Semantic Interoperability Explained
1. Semantic Interoperability for Multilingual DSIs ‒ Brussels, 18 Oct 2018
Semantic Interoperability Explained
Jochen Hummel
CEO ESTeam AB
language technology
software
CEF Info Days
4. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Principles
4
1. Subsidiarity and proportionality
2. Openness
3. Transparency
4. Reusability
5. Technological neutrality and data portability
6. User-centricity
7. Inclusion and accessibility
8. Security and privacy
9. Multilingualism
10. Administrative simplification
11. Preservation of information
12. Assessment of Effectiveness and Efficiency
5. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Semantic Interoperability
The precise meaning of exchanged information
is understood by all stakeholders…
…and preserved throughout exchanges.
Exchanged information is described in terms of
meaning, relationships, grammar and format.
At back-office level, the underlying information
architectures should be linguistically neutral so
that multilingualism is not a blocking issue.
Agreement on reference data.
Data and information is a valuable public
asset.
5
6. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Semantic versus Syntactic
Interoperability
Syntatic
describes instantiation of data
format and grammar
metadata
Semantic
defines meaning of data elements
relationships between elements
common understanding by all
parties
6
7. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Recommendation 30 and 31
Perceive data and information as a public asset that should be
appropriately generated, collected, managed, shared, protected and
preserved.
7
Semantic Interoperability Assets
taxonomies
controlled vocabularies
thesauri
code lists
reusable data structures/models
Put in place an information management strategy at the highest
possible level to avoid fragmentation and duplication. Management of
metadata, master data and reference data should be prioritised.*
* European Interoperability Framework - Implementation
Strategy
March 2017
8. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Recommendation 32
Support the establishment of sector-specific and cross-sectoral
communities that aim to create open information specifications and
encourage relevant communities to share their results on national and
European platforms.
8
9. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Status
Relatively new undertaking, few success stories.
Diverse linguistic, cultural, legal, and administrative environments make it hard.
Multilingualism adds further complexity to the problem.
For decades, robust, coherent and universally applicable information standards an
specifications haven’t really evolved.
Power of Semantic Interoperability Assets not yet understood.
9
10. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Semantic Interoperability and Language
10
Monolingual Interoperability
Many countries operate in regions
Different terms used by different stakeholders
Multilingual Interoperability
Cross-border (mostly)
Complexity of monolingual multiplied by
languages
15. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
TMClass Statistics
15
Started with >110,000 English terms
Other languages ranging from 8,000 to 60,000 (DE)
Cleaning up resources – Reduce EN to 93,000
Language Equality
93,000 terms in all EU languages too €€€ without automation
Mining data for translations
DB grew from 400,000 to 2.1m terms
Conceptual clustering
Non essential entries removed – DB shrunk to 1.5m terms
Semi-automatic creation of 50,000 meaning clusters
Taxonomy built for the top layers and concepts linked
16. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Success Story Interoperability @ EUIPO
60+ participating countries, all EU and
USA, China, Japan, Korea, Brazil, etc.
Project costs including internal work:
~€30m
Direct operational savings:
> €25m annually
Economical savings in EU member
states administration of IP:
€ tens of millions annually
Economical gains of industry having
their IP rights much faster validated:
€ billions annually
The world following European
Goods & Service classification:
priceless
16
18. @JochenHummelSemantic Interoperability ExplainedWorkshop Multilingual DSIs
Saving the Union with Interoperability
“We have to share ideas
and try to align, map
between concepts,
rather than trying
to standardize
different world perspectives.”
Tom van Engers,
Leibniz Center for Law, Amsterdam
18
Editor's Notes
Many of you probably think translation is the answer
Google Translate – Problem fixed!
But actually semantic interoperability is not a question of translation.
It is a question of understanding
What we need is to map terms from on concept system to another.
And from one language to the other.
This way you make non-standardized documents or procedures searchable and comparable cross-border, cross language