An Ontology for K-12 Education and the NIEM

The following was presented at the Semantic Technology conference in March of 2006 in San Jose, California. This case study examines the extension of the National
Information Exchange Model (NIEM) to include K-12
education metadata. NIEM’s compliance with ISO/IEC
11179 metadata standards was found to be critical for
cost-effective system interoperability. This study indicates
that extending the NIEM can be compatible with newer
RDF and OWL metadata standards. We discuss how this
strategy will dramatically lower data integration costs and
make longitudinal data analysis more cost-effective. We
make recommendations for state education agencies,
federal policy makers, and metadata standards
organizations. The conclusion discusses the possible
impacts of recent innovations in collaborative metadata
standards efforts.

  • This is a case study. It is a story of how a team tried to build a data dictionary for the Minnesota Department of Education, driven by the need for accurate long-term student assessment. I have struggled with whether to use “Ontology” in the title. I did not want to scare people off, but I feel that is what we are building. Some sources indicate that the difference between a taxonomy and an ontology is the difference between a tree and a graph, but since my representation is much more complex than just a simple tree of data elements, I really should use the word Ontology.
  • Transcript

    • 1. Case Study: Integrating K-12 Education into the National Information Exchange Model Dan McCreary Dan McCreary & Associates
    • 2. Background
      • Dan McCreary - Dan McCreary & Associates
      • President of a consulting firm that focuses on metadata-driven IT strategy development infrastructures for:
        • Service Oriented Architectures (SOA)
        • Model Driven Architecture and Development (MDA, MDD)
        • Data warehousing and Business Intelligence (BI)
        • Metadata management training
      • Hired in January of 2005 to build and populate an enterprise-wide metadata registry for the Minnesota Department of Education in partnership with the Wisconsin Department of Public Instruction and the Michigan Department of Education
      • Presentation Web site:
        • http://www.danmccreary.com/presentations/semweb2006
    • 3. Agenda
      • Case study of building a “semantic garden” for K-12 metadata with a modest budget for a state agency (~$150K)
      • A place where your metadata can take root, grow and bloom
      • Target a broad audience with the goal of concept retention – use of images and metaphors
    • 4. Overview of Presentation
    • 5. 1970 Sci-Fi Classic: “The Forbin Project” A New Intersystem Language! Lesson: Before you take over the world you must exchange semantically precise metadata!
    • 6. Big Hairy Audacious Goals: Search Agents Legislator: What statewide programs increase test scores? District Superintendent: What “subgroups” in my district need the most help in math to meet NCLB guidelines? School Principal: What areas do new teachers need help in? Teacher: What areas do my students need the most help to pass statewide assessments?
    • 7. “Shopping” for Metadata: Your “shopping cart” is full of Data Elements
    • 8. Key Business Drivers
      • Emphasis on “data driven decision making”
      • Need for longitudinal data analysis (i.e. a data warehouse) driven by the No Child Left Behind (NCLB) act
      • Required consistency across:
        • Time
        • School districts
        • Grade-levels (K-12)
        • Assessment-subjects (reading, writing, math)
      • Need for cost-effective application interoperability and the desire to “break down application silos”
    • 9. Technology Drivers
      • Desire to promote Service Oriented Architectures (SOA)
        • Web services
        • Build a library of exchange documents
        • Consistent web-form definitions
      • Desire to promote Model-driven Architecture (MDA)
        • Model driven development (MDD)
        • Model driven testing (MDT)
      • Migration from “procedural” to “declarative” programming
        • Procedural programming is over-emphasized and makes business logic maintainable only by programmers
        • Declarative programming and transformation is much more appropriate when large metadata databases are available
        • Metadata driven systems allow more non-programmers to maintain business logic
      • Avoid invention of new standards
        • Desire to “build upon” other machine-readable standards
        • ISO metadata registries do exist
    • 10. Promotion of Loosely Coupled Systems
      • Tightly Coupled
        • Like a wine glass
        • Fragile
        • Breaks easily when there are changes in either the source or destination system
      • Loosely Coupled
        • Like a rubber ball
        • Resilient
        • Allows change and interoperability regression testing without breaking interfaces
        • Example: the addition of new data elements
    • 11. NCLB
      • US Department of Education effort to measure student “proficiency” deltas for nine subgroup populations (Asian, Black, Hispanic, Native American, Special Ed etc.) within each state over time and measure incremental gains in achievement levels
      • Introduced the concept of Adequate Yearly Progress (AYP) for a School and School District (if any subgroup fails, your school and district fail)
      • Each state defines “proficiency” independently so state-to-state comparisons are not practical at this time
      • Multiple political interpretations of NCLB not discussed here:
        • Republican vs. Democratic
        • Rural vs. Suburban vs. Inner City
        • Public vs. Private Educational Funding
      • US Dept of Ed. releasing $53 million in grants for longitudinal data systems to individual states
      • Message from the Department of Education: “Build your statewide assessment metadata garden”
    • 12.
      • US Department of Justice/Department of Homeland Security initiative to build a federal metadata registry based on the Global Justice XML Data Model (GJXDM) project
      • Complies with federal ISO/IEC 11179 metadata registry guidelines (with a few exceptions)
      • Introduced very successful tools for subschema generation in conjunction with large ontologies in building XML exchange documents
      • Introduced concepts of “Universal” and “Core” classification schemes
      • Available today in an XML Schema and an Excel spreadsheet
      • Subschema generation tools may be available in 3Q of 2006
    • 13. NIEM Scope Source: http://www.niem.gov/implementation.php You Are Here
    • 14. NIEM Type “Classification Scheme” Domain Specific Student Teacher Common Aircraft Assessment Boat Case Clothing Activity Address Document Event Image Long/Lat Location Organization Person Residence Street Vehicle Universal Contact
    • 15. High Level Structure of the NIEM
      • The NIEM loosely follows ISO-11179 metadata registry guidelines
      • The structure is a subclass hierarchy of “Concepts”
      • Start with an abstract Thing
      • Start with shared upper-ontology “Concepts” (blue)
      • Add properties that each have Representation Terms (orange)
      • Add subclasses and then subclass properties (yellow)
      • Diagram labels: Thing, ActivityStartDate, ActivityEndDate, PersonBirthDate, PropertyType, Activity, Document, Person, Organization, ConceptType, StudentStateAssignedID, EnrollmentStateDate, Student, Teacher, Education Extensions, Enrollment
    • 16. Reuse and Extension Strategy
      • Match: If a NIEM data element met our needs, we used the NIEM data element and created an OWL sameAs statement with a high-precision match (Note: The definitions must match exactly)
      • Trim: If a NIEM data element had more detail than we needed, we created a local definition and a sameAs link with a lower-precision match level.
      • Extend: If the NIEM did not have everything we needed, we created a local definition, added to the definition, and created a sameAs link with a medium match level.
      • New: If there was no data element that matched what we needed, we created a new one and put it in our local namespace.
      • Submit: If a new data element is not state-specific and we think other states may use it, we can submit it to the NIEM for inclusion.
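The reuse rules above can be sketched as a small decision function. This is an illustrative sketch only, not the project's tooling: it assumes each data element can be summarized as a set of detail fields (a hypothetical simplification of comparing definitions), and the names `Action`, `choose_action`, and `should_submit` are invented for the example.

```python
from enum import Enum
from typing import Optional, Set


class Action(Enum):
    MATCH = "match"
    TRIM = "trim"
    EXTEND = "extend"
    NEW = "new"


# Precision recorded on the sameAs link for each action, as described on the slide.
SAME_AS_PRECISION = {
    Action.MATCH: "high",
    Action.EXTEND: "medium",
    Action.TRIM: "low",
    Action.NEW: None,  # no NIEM counterpart, so no sameAs link is created
}


def choose_action(niem_fields: Optional[Set[str]], needed_fields: Set[str]) -> Action:
    """Pick a reuse action by comparing what the NIEM offers with what is needed.

    niem_fields is None when the NIEM has no candidate data element at all.
    """
    if niem_fields is None:
        return Action.NEW
    if niem_fields == needed_fields:
        return Action.MATCH
    if needed_fields < niem_fields:  # the NIEM carries more detail than needed
        return Action.TRIM
    return Action.EXTEND  # the NIEM is missing something we need


def should_submit(action: Action, state_specific: bool) -> bool:
    """A new element is a candidate for NIEM submission only if other states could use it."""
    return action is Action.NEW and not state_specific


print(choose_action({"name", "birth_date"}, {"name", "birth_date"}))
print(choose_action({"name", "birth_date", "height"}, {"name"}))
print(choose_action({"name"}, {"name", "grade_level"}))
print(choose_action(None, {"locker_number"}))
```

Keeping the precision level in a separate table mirrors the slide's point that the match level is metadata about the link, not about the element itself.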
    • 17. A Semantic Equivalence Registry
      • Goal: create semantic maps to a single federal metadata standard, not many standards
      • Diagram: mapping from Minnesota's metadata registry to N other metadata registries (R2, R3, ..., RN): the O(N²) problem
      • Diagram: mapping from Minnesota's metadata registry to the NIEM (with R2, R3, ..., RN also mapped to the NIEM): the O(N) problem
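The difference between the two diagrams is simple arithmetic. The sketch below counts mappings under one assumption that is not stated on the slide: each pair of registries needs one bidirectional mapping. The function names are made up for illustration.

```python
def pairwise_mappings(n: int) -> int:
    """Mappings needed if every one of n registries maps directly to every other one."""
    return n * (n - 1) // 2


def hub_mappings(n: int) -> int:
    """Mappings needed if each of n registries maps once to a shared hub such as the NIEM."""
    return n


for n in (5, 10, 50):
    print(n, pairwise_mappings(n), hub_mappings(n))
# prints:
# 5 10 5
# 10 45 10
# 50 1225 50
```

With 50 registries (roughly one per state), the direct approach needs 1,225 mappings against 50 for the hub approach.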
    • 18. ISO/IEC 11179 XML Tag Name
      • A standard naming convention for all XML data elements that “cross the wire”, used by most state and federal agencies that follow the ISO guidelines
      • Frequently called the “Data Element ISO name”
      • Example: niem:PersonBirthDate
        • Namespace (domain): niem
        • Object Class Term (leftmost): Person
        • Property Term (follows object class term): Birth
        • Representation Term (rightmost): Date
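As a rough illustration of the naming convention above, the following sketch splits a tag name into its four parts. It assumes the object class term and the representation term are each a single word, which will not hold for every real data element; `parse_iso_name` and the `mn` prefix are hypothetical.

```python
import re
from typing import NamedTuple


class IsoName(NamedTuple):
    namespace: str
    object_class: str
    property_term: str
    representation: str


def parse_iso_name(tag: str) -> IsoName:
    """Split a tag such as 'niem:PersonBirthDate' into its naming parts."""
    namespace, sep, local = tag.partition(":")
    if not sep:
        raise ValueError(f"missing namespace prefix in {tag!r}")
    # Split UpperCamelCase into words, keeping acronyms such as 'ID' together.
    words = re.findall(r"[A-Z]+(?![a-z])|[A-Z][a-z0-9]*", local)
    if len(words) < 3:
        raise ValueError(f"expected at least three terms in {tag!r}")
    # Leftmost word: object class. Rightmost word: representation term.
    # Everything in between: property term.
    return IsoName(namespace, words[0], "".join(words[1:-1]), words[-1])


print(parse_iso_name("niem:PersonBirthDate"))
print(parse_iso_name("mn:StudentStateAssignedID"))
```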
    • 19. The Data Mapping: The “Frontline” of Semantics
      • Left: A sample School District “flat file dump” from the Learning Management System (e.g. Moodle) of one school district (many data elements omitted for clarity)
      • Right: A mapping to an ISO-named and defined statewide XML schema standard for online learning classes. Note how much easier the names and definitions make it to quickly tell the semantics of each data element.
      • Figure: screenshot from Altova MapForce™
    • 20. Need a “Semantically Aware” Mapper
      • Mapping tools have “auto connect matching children” but they require that the data element names be identical
      • They do not yet have the ability to “look up synonyms” in a metadata registry to establish the equivalence of two data elements
      • We need semantic-aware tools!
      • Goal: Add menu item for “Consult Semantic Broker”
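A minimal sketch of what a “Consult Semantic Broker” step could look like, assuming the synonym registry can be reduced to a plain dictionary from local names to registry names. The function `auto_connect` and the sample source names are hypothetical; no real mapping tool's API is implied.

```python
from typing import Dict, Iterable, Optional


def auto_connect(
    source_names: Iterable[str],
    target_names: Iterable[str],
    synonyms: Dict[str, str],
) -> Dict[str, Optional[str]]:
    """Map each source name to a target name, or to None when nothing matches."""
    targets = set(target_names)
    mapping: Dict[str, Optional[str]] = {}
    for name in source_names:
        if name in targets:
            # Identical names: what "auto connect matching children" already does.
            mapping[name] = name
        elif synonyms.get(name) in targets:
            # Registry lookup: the proposed semantic-broker step.
            mapping[name] = synonyms[name]
        else:
            mapping[name] = None  # left for a human to map
    return mapping


# Hypothetical synonym entries for a district flat file.
synonyms = {"dob": "PersonBirthDate", "fname": "PersonGivenName"}
targets = ["PersonBirthDate", "PersonGivenName", "PersonSurName"]
print(auto_connect(["dob", "fname", "PersonSurName", "locker"], targets, synonyms))
```

Unmatched names are returned as `None` rather than dropped, so an analyst can see what still needs manual attention.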
    • 21. Constrain Exchange Document Data Element Selection
      • When creating an exchange document, we can now quickly select data elements from a list derived from a metadata registry that has semantically-precise definitions and namespaces
      • This can be done by business analysts (B.A.s) with under a week of training and does not require programmers
      • Constraints can be added to this document or a second constraint schema
      • Figure: schema creation using Altova XMLSpy™ and importing a GJXDM subschema
    • 22. Hypertext Links and Data Element Links
      • The Hypertext Web: The current web is focused on linking published documents with HTML
      • The Semantic Web: The semantic web is about linking data elements in published metadata registries (diagram: Metadata Registry A, Metadata Registry B)
    • 23. Challenges: Education Standards
      • Lack of machine-readable metadata registries for K-12 metadata with synonyms
      • Many standards
        • Minnesota historical 80-column fixed-width punch-card driven file format standards
        • US Dept of Ed. National Center for Education Statistics (NCES)
        • Common Core of Data (CCD)
        • Education Data Exchange Network (EDEN)
        • SchoolMatters
        • Schools Interoperability Framework (SIF)
        • eXtensible Business Reporting Language (XBRL)
      • No published synonyms in any of the above standards
      • As of December 2005, no K-12 education-specific data elements in the NIEM metadata registry
      • Lack of useful data element definitions:
        • Document: “Details about inherent and frequently used characteristics of a document.”
    • 24. Metadata Publishing Standards
      • Lack of a single standard to publish metadata elements (XML Schema, Topic Maps, ISO/IEC-11179, OWL, XMDR) that includes metadata registry concepts
      • OWL is one of the few standards with “synonym” statements, but few tools currently support OWL and inter-metadata registry synonym statements
      • OWL appears to be the best candidate for “over the wire” representations and the most easily extensible, but it is not a metadata registry standard
    • 25. Challenge: We Need Semantic Aware Tools
      • Lack of semantically-precise production tools
        • Altova XMLSpy™ – excellent graphical schema design and management but no semantics in the XML schema standards
        • Stanford Medical Informatics Protégé (Open-Source)
        • Altova SemanticWorks™ (1st release in October of 2005)
      • ISO/IEC 11179 metadata registry tools are expensive
        • Frequently above $100K before customization
        • Some lack workflow and public/private publishing
        • Several excellent solutions if you have >$1M budget and consulting dollars
      • Ideal: A zero-footprint, AJAX-based, drag-and-drop, semantically-aware Open-Source schema design and data mapping tool that consults one or more synonym registries
      • Predict this is 3-4 years away (unless I get a grant)
    • 26. Tools Used
      • Built initial version using a collection of Open-Source tools and inexpensive Altova tools (XMLSpy™, MapForce™ and SemanticWorks™)
      • Model-driven development using XML Schemas for the model of the registry
        • Define XML Schemas for all metadata registry structures (meta-metadata)
        • XSL transforms of the data dictionary schema
        • XSL transforms of the XSL transforms for impact analysis
      • XML Transforms for metadata publishing and visualizations
      • Apache Ant build scripts to publish to public web site and private intranet site
      • Eclipse 3.1 IDE to build and maintain Ant scripts
      • Saxon 8 XSLT Java libraries
      • Extensive use of XSLT 2.0 and XPath 2.0
      • FreeMind open source mind mapping tool with excellent XML interfaces
      • Various data element editing forms
        • (Castor, Struts, JSP, ASP, MS-Access)
    • 27. Diagram From ISO-11179 Specification
      • Figure: a Data Element Concept combines an Object Class and a Property; a Data Element combines an Object Class, a Property and a Representation (relationships labeled 1:1 and 1:N)
      • Taken from Figure 1 "Fundamental Model for Data Elements", ISO/IEC 11179-1:2004(E), page 11 (non-normative)
    • 28. UML Model for RDF RDF Statement Subject Predicate ResourceValuedStatement LiteralValuedStatement Object Resource Property Literal TypedLiteral Object See Lee W. Lacy: OWL: Representing Information Using the Web Ontology Language p 82
    • 29. UML Model of Metadata Registry
      • A Data Dictionary is composed of many Data Elements
      • All Data Elements must have required names and ISO definitions
      • Each Data Element must be either a Concept or a Property of a Data Element Concept
      • Each property is associated with a single concept and has a Property Name and a Representation Term
      • Some properties (where the representation term is of type Code) have one or more Enumerated Values
      • Diagram labels (simplified for clarity): Data Dictionary, Data Element (Name, ISO-Definition), Concept (subClassOf), Property (Property Name, Representation Term), Enumerated Value (Code, Definition)
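The model above translates fairly directly into a few classes. The sketch below is one possible reading of the slide, not the project's actual schema (which, per the tools slide, was defined in XML Schema); the class layout, validation rules, and sample definitions are assumptions for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class DataElement:
    name: str
    iso_definition: str

    def __post_init__(self) -> None:
        # Every data element needs both a name and an ISO definition.
        if not self.name or not self.iso_definition:
            raise ValueError("name and ISO definition are both required")


@dataclass
class Concept(DataElement):
    sub_class_of: Optional["Concept"] = None


@dataclass
class Property(DataElement):
    concept: Concept
    property_name: str
    representation_term: str
    # Maps each enumerated code to its definition; only allowed for Code properties.
    enumerated_values: Dict[str, str] = field(default_factory=dict)

    def __post_init__(self) -> None:
        super().__post_init__()
        if self.enumerated_values and self.representation_term != "Code":
            raise ValueError("only Code properties may have enumerated values")


@dataclass
class DataDictionary:
    elements: List[DataElement] = field(default_factory=list)


# Sample entries; the definitions here are placeholders, not registry text.
person = Concept("Person", "A human being.")
student = Concept("Student", "A person enrolled in a school.", sub_class_of=person)
birth_date = Property("PersonBirthDate", "The date a person was born.", person, "Birth", "Date")
dictionary = DataDictionary([person, student, birth_date])
print(len(dictionary.elements))
```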
    • 30. Representation Terms (ebXML Core Component Tech Spec v1.9)
      • Amount – Monetary value with units of currency.
      • BinaryObject – Set of finite-length sequences of binary octets. (secondary: Graphic, Picture, Sound, Video)
      • Code – Character string that for brevity represents a specific meaning, where the values are enumerated and each value has a clear definition.
      • DateAndTime – Date + time; a point in time where both date and time are known. (secondary: Date, Time)
      • Identifier – Character string used to establish the identity of, and uniquely distinguish, one instance of an object within an ID scheme. (authorized abbreviation: ID)
      • Indicator – Boolean (exactly two mutually exclusive values).
      • Measure – Numeric value determined by measurement with units.
      • Number – Assigned or determined by calculation. (secondary: Value, Rate, Percent)
      • Quantity – Non-monetary numeric value or count with units.
      • Text – Character string generally in the form of words. (secondary: Name)
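The primary and secondary terms listed above can be captured as a small lookup table, for example to check the suffix of a proposed data element name. The table contents come from the slide; the function name is made up for this sketch.

```python
PRIMARY_TERMS = {
    "Amount", "BinaryObject", "Code", "DateAndTime", "Identifier",
    "Indicator", "Measure", "Number", "Quantity", "Text",
}

# Secondary terms and the authorized abbreviation, mapped to their primary term.
SECONDARY_TO_PRIMARY = {
    "Graphic": "BinaryObject",
    "Picture": "BinaryObject",
    "Sound": "BinaryObject",
    "Video": "BinaryObject",
    "Date": "DateAndTime",
    "Time": "DateAndTime",
    "ID": "Identifier",
    "Value": "Number",
    "Rate": "Number",
    "Percent": "Number",
    "Name": "Text",
}


def primary_representation_term(term: str) -> str:
    """Return the primary representation term for a primary or secondary term."""
    if term in PRIMARY_TERMS:
        return term
    if term in SECONDARY_TO_PRIMARY:
        return SECONDARY_TO_PRIMARY[term]
    raise ValueError(f"{term!r} is not a recognized representation term")


print(primary_representation_term("Date"))
print(primary_representation_term("ID"))
```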
    • 31. Publishing Metaphor
      • Publishing implies high-quality information is shared with a large audience
      • Emphasis on multi-state reviews and clarity to a diverse base of consumers
      • Commitment to accuracy and change control
    • 32. The Psychology of Sharing and Trust
      • Research done in mid-1990s by Adele Goldberg and others
      • Groups tend to share objects only with other people or systems they trust
      • We need to create systems for building trust
        • Define a peer review process (see 11179 standards)
        • Have experts with credibility play a role in approval
        • Publish list of users of metadata
        • Publish test cases
        • Publish change control process
        • Publish success stories
    • 33. Metadata Publishing Workflow Funnel
      • Develop a simple workflow system for publishing data elements
      • Include harvesting areas for simple glossaries of terms found in documentation and web sites, and use metadata “scrapers” to inventory all columns in relational database systems
      • Get stakeholder teams to “accept” data elements, review them and take on the data stewardship role for these data elements
      • Commit to change-control only after data elements are marked “approved for publication” by over 50% of the stewardship team
      • Exclude sensitive information from public web sites (data sources)
      • Diagram labels: Glossary Of Terms, Metadata Harvesters, Initial Draft, Under Review, Approved for Publication
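A simple way to picture the funnel is as an ordered list of stages with an approval gate. The sketch below assumes the stage order implied by the diagram labels and represents votes as a dictionary from steward name to a yes/no value; both are assumptions, and the function names are hypothetical.

```python
from typing import Dict

# Assumed stage order, read off the funnel diagram labels.
STAGES = ["Glossary Of Terms", "Initial Draft", "Under Review", "Approved for Publication"]


def is_approved(votes: Dict[str, bool]) -> bool:
    """True when over 50% of the stewardship team has approved."""
    if not votes:
        return False
    return sum(votes.values()) * 2 > len(votes)


def next_stage(stage: str, votes: Dict[str, bool]) -> str:
    """Advance one stage, holding at 'Under Review' until the vote passes."""
    index = STAGES.index(stage)
    if stage == "Under Review" and not is_approved(votes):
        return stage
    return STAGES[min(index + 1, len(STAGES) - 1)]


print(next_stage("Under Review", {"ann": True, "raj": False}))
print(next_stage("Under Review", {"ann": True, "raj": True, "lee": False}))
```

An exact 50/50 split does not pass, since the slide asks for over 50%.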
    • 34. Model-Driven Development XML Form Editors Data Elements (500 Small XML Files) Data Dictionary (Single, Large XML File) Transforms (Saxon 8) Apache Ant HTML OWL FreeMind PDF MindManager Excel SQL Subversion RDBMS OLAP Cubes SemanticWorks Protégé Intranet Public Web Server
    • 35. Visualization
      • People will not trust what they don’t understand
      • They tend to understand concepts if you make them clear
      • Visualizations are the best way to promote clarity to a subgroup
      • Focus attention and remove “chart junk”
      • Quickly display a subgroup’s data elements under review
      • Let them pick the colors!
      • 50-line XSLT
      • Figure: sample from FreeMind, an Open Source mind mapping tool
    • 36. Results http://education.state.mn.us/datadictionary
    • 37. Store Semantic Mappings to Foreign Data Elements Directly in the Metadata Registry Current metadata registry standards do not clearly specify where and how semantic equivalence and precision is stored.
    • 38. owl:sameAs and owl:equivalentClass
      • OWL is different from XML Schema because it addresses data element semantics
        • XML Schema has no way of declaring two data types as "equivalent"
        • XML Schema was designed to create a way to validate a data set used in messaging systems
      • OWL was designed to manage metadata
        • OWL class equivalency operator: “equivalentClass”
        • OWL “sameAs” operator for instance equivalence
        • Example: NIEM:Person = SUMO:Human = CYC:Individual
      • Diagram: Metadata Registry A and Metadata Registry B, linked by Metadata Equivalence Mappings
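Because OWL equivalence is symmetric and transitive, a chain such as NIEM:Person = SUMO:Human = CYC:Individual collapses into one group of interchangeable terms. The sketch below shows that grouping with a small union-find structure. It treats every stated mapping as an exact equivalence and ignores the precision levels discussed earlier, which is a simplification; `equivalence_groups` is a made-up name, not part of any OWL library.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple


def equivalence_groups(pairs: Iterable[Tuple[str, str]]) -> List[Set[str]]:
    """Group terms connected by equivalence statements (symmetric and transitive)."""
    parent: Dict[str, str] = {}

    def find(term: str) -> str:
        parent.setdefault(term, term)
        while parent[term] != term:
            parent[term] = parent[parent[term]]  # path halving
            term = parent[term]
        return term

    for left, right in pairs:
        root_right = find(right)
        parent[find(left)] = root_right  # union the two groups

    groups: Dict[str, Set[str]] = defaultdict(set)
    for term in list(parent):
        groups[find(term)].add(term)
    return list(groups.values())


statements = [("NIEM:Person", "SUMO:Human"), ("SUMO:Human", "CYC:Individual")]
print(equivalence_groups(statements))
```

Note that NIEM:Person and CYC:Individual end up in the same group even though no statement links them directly.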
    • 39. Future: Semantic Mappers and Semantic Brokers Report Request In Model A Gartner: Vocabulary-based transformation XMLA: XML for Analysis Metadata Translation Service XML Response In Model A TDS In Model B Metadata Registry Model A Model B Metadata Mappings RDF Queries XML Results Data Warehouse (RDBMS) SQL or XMLA Queries In Model B
    • 40. What Data Elements Are Important?
      • It costs time and money for each data element you add to your metadata registry (over $1,000 per data element)
      • The more unimportant data elements are in your metadata registry, the harder it becomes to detect duplicates
      • Prioritization criteria should be developed to determine what Data Elements should have priority
      • Metadata “scraping tools” were developed to pull candidate Data Elements from databases, spreadsheets and documents
      • We developed six-step criteria for determining the value of a data element in the data dictionary
      • Anything can be in a Glossary but only about 10% of Glossary data items are promoted to a data element
      • Diagram labels: Low Value Data Elements, High Value Data Elements
    • 41. Wikipedia Rocks!
      • It is currently burdensome to add new metadata to the registry
      • Would like to add “Edit this data element” (à la wikis)
      • Ideally a “Semantic Wiki”
      • See: Wikipedia: “Semantic Wiki”
    • 42. Wantlist Standards
        <?xml version="1.0" encoding="UTF-8"?>
        <w:wantList w:release="3.0.3" xmlns:w="http://gjxdmtools.gtri.gatech.edu/wantList/1">
          <w:element w:prefix="j" w:name="ContactEmailID" w:isReference="false"/>
          <w:element w:prefix="j" w:name="ContactTelephoneNumber" w:isReference="false"/>
          <w:element w:prefix="j" w:name="Person" w:isReference="false"/>
          <w:element w:prefix="j" w:name="PersonBirthDate" w:isReference="false"/>
          <w:element w:prefix="j" w:name="PersonGivenName" w:isReference="false"/>
          <w:element w:prefix="j" w:name="PersonSurName" w:isReference="false"/>
        </w:wantList>
      • Metadata management tools could share data element wantlists with other tools.
      • If you don’t have an appropriate data element, you should be able to look it up in a clearinghouse of metadata with precise ISO definitions (e.g. Swoogle)
      • Web service queries and metadata translation services could be used
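To show how another tool might consume a wantlist like the one above, here is a short sketch using Python's standard-library XML parser. The sample document is trimmed to two elements, and `wanted_names` is a hypothetical helper, not part of the GJXDM tooling.

```python
import xml.etree.ElementTree as ET
from typing import List

WANTLIST_NS = "http://gjxdmtools.gtri.gatech.edu/wantList/1"

SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<w:wantList w:release="3.0.3" xmlns:w="http://gjxdmtools.gtri.gatech.edu/wantList/1">
  <w:element w:prefix="j" w:name="PersonBirthDate" w:isReference="false"/>
  <w:element w:prefix="j" w:name="PersonGivenName" w:isReference="false"/>
</w:wantList>"""


def qualified(local_name: str) -> str:
    """Build the '{namespace}name' key ElementTree uses for namespaced names."""
    return f"{{{WANTLIST_NS}}}{local_name}"


def wanted_names(xml_bytes: bytes) -> List[str]:
    """Return the requested data elements as 'prefix:Name' strings."""
    root = ET.fromstring(xml_bytes)
    return [
        f"{element.get(qualified('prefix'))}:{element.get(qualified('name'))}"
        for element in root.iter(qualified("element"))
    ]


print(wanted_names(SAMPLE.encode("utf-8")))
```

The attributes carry the `w:` prefix in the wantlist, so they must be looked up by their namespace-qualified names rather than as plain `prefix` and `name`.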
    • 43. McCreary’s Top 10 Recommendations
      • Organizations and applications that exchange data should be encouraged to publish their metadata in a machine-readable format to facilitate agent interoperability
      • Published data dictionaries should drive exchange document creation standards and published web services, and metadata registry “shopping cart” tools should be accessible to non-programmers
      • Data warehouse initiatives should attempt to reuse and integrate existing federal metadata standards
      • Federal and state agencies should follow ISO/IEC 11179 and Data Reference Model (DRM) guidelines and use formal representation terms for all data element properties
      • Fundamentals of metadata publishing and transformation training should be encouraged by data architects and integration managers
      • Metadata standards should continue to be developed with the goal of building semantic integration brokers and agents
      • Producers of data mapping software should integrate semantic equivalency statements into automated mapping systems
      • XML integration appliance vendors should include semantic integration services to make integration easier
      • Organizations should perform ROI analysis on semantic integration
      • Awards should be given to organizations that publish useful and high-quality metadata
    • 44. Things to Ponder…
      • Just like the ARPANET and DAML, some worthy standards come from US federally funded efforts. But they will need to “evolve” before they are widely adopted outside government projects.
      • Before you “take over the world”, you need to publish your metadata with your stakeholders
      • Metadata publishing is 80% social engineering and 20% technical engineering and is achieved through building shared meaning via trust building systems
      • Standards are complex. Sometimes the more general they are, the more widely adopted they are but the more abstract they become. Some standards frequently need an expert interpreter to adjust for local business needs
      • People need to understand something before they trust it. One of the best ways is to build tools to allow users to visualize their data elements
      • When planting a metadata garden, start small and keep weeding out the unimportant and redundant data elements
    • 45. Open The Door To The Semantic Web!
      • Metadata publishing is hard
      • It is a foundation upon which the Semantic Web will be built
      • The benefits are indirect and need strong executive sponsorship
      • Metadata publishing is no “silver bullet”
      • I believe it is the most direct way to get to the Semantic Web
      • This will be the most practical way to build intelligent agents
      • Diagram labels: Agents, Metadata Publishing
    • 46. References
      • Web site for paper:
        • www.danmccreary.com/presentations/semweb2006
      • Data dictionary for Minnesota Department of Education
        • education.state.mn.us/datadictionary
      • ISO-11179 metadata registry standards
      • National Information Exchange Model (NIEM.gov)
      • Wikipedia Articles
        • Metadata registry
        • ISO/IEC 11179
        • Representation term
        • Metadata publishing
        • Semantic broker
    • 47. Questions & Answers
      • “If software is ever going to be able to effectively inter-operate (in ways that were not explicitly preconceived and engineered), it will be because applications share enough of the semantics of their data elements.”
      • Doug Lenat, Cycorp, Semantic Technology Conference, 2005
    • 48. Contact Information
      • Dan McCreary, President
      • Dan McCreary & Associates
      • Dan <at> danmccreary.com
      • http://www.danmccreary.com
      • also: http://www.LinkedIn.com
