(More) Transparency Transformation

Presentation given Spring/Summer/Fall 09 various venues

Published in: Technology, Education

  1. transparency transformation: solution architecture, tools and techniques (george thomas, #gtra-sym09)
  2. agenda
     - An overview of Web Oriented Architecture (WOA) design principles that have made the Web the most successful distributed computing platform ever created will be given.
     - Technologies for exposing raw data and publishing semantically enriched structured data for persistence and syndication on the Web as public records will be described.
     - Technologies that enable interoperability across these published assets and currently disparate data sources to achieve low cost, large scale data federation will be described.
     - Widgets and services that consume and transform this data for interactive and integration purposes will be discussed in the context of different stakeholder views.
     - A Web-scale approach to Business Intelligence leveraging Cloud Computing approaches to data archive analysis will be described.
     - Finally, the applicability of the proposed solution architecture to the Federal Segment Architecture Methodology and tools like Visualization to Understand Expenditures in IT will be discussed.
  3. agenda (repeated from slide 2)
  4. Web Oriented Architecture (WOA)
     - REpresentational State Transfer (REST)
       - the architectural style of the World Wide Web
       - aka Resource Oriented Architecture (ROA)
     - hyperlinks dereference (information) resource representations
       - HTTP URIs and content negotiation
         - user agent prefers .htm, .xml, .rdf, .etc
     - statefulness
       - servers maintain resource state, clients maintain application state
     - RESTful Web services
       - HTTP uniform interface
         - CRUD analog to HTTP PUT/GET/POST/DELETE
       - contrast to Remote Procedure Call (RPC) style Web services
         - SOAP/WSDL, you design the methods to invoke
     - global visibility (the Web) and persistence (permalinks)
       - caching, crawling, indexing
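
The content negotiation and uniform interface points above can be illustrated with a minimal sketch. It is not taken from the slides: the function names, the media types offered, and the CRUD-to-verb pairing are assumptions chosen for illustration, and the matching logic is a simplification of what real HTTP servers do.

```python
# Conventional CRUD-to-HTTP pairing. This pairing is an assumption: the
# slide lists the four verbs without saying which operation each maps to.
CRUD_TO_HTTP = {"create": "POST", "read": "GET", "update": "PUT", "delete": "DELETE"}


def parse_accept(header):
    """Parse an Accept header into (media_type, q) pairs, highest q first."""
    prefs = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0  # default quality when no q parameter is given
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0
        prefs.append((media_type, q))
    # sorted() is stable, so equal q values keep their header order
    return sorted(prefs, key=lambda pref: pref[1], reverse=True)


def negotiate(accept_header, available):
    """Return the first available media type the user agent accepts, or None."""
    for media_type, q in parse_accept(accept_header):
        if q <= 0:
            continue
        if media_type in available:
            return media_type
        if media_type == "*/*" and available:
            return available[0]
        if media_type.endswith("/*"):
            prefix = media_type[:-1]  # e.g. "text/"
            for candidate in available:
                if candidate.startswith(prefix):
                    return candidate
    return None


# Hypothetical representations a server might offer for one resource URI.
offered = ["text/html", "application/xml", "application/rdf+xml"]
chosen = negotiate("application/rdf+xml, text/html;q=0.5", offered)
```

The same URI can then serve a browser and an RDF-aware client, each receiving the representation it prefers.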
  5. agenda (repeated from slide 2)
  6. XForms - human data capture
     - Orbeon server-side XForms engine, Ajax browser GUIs
       - catalog and builder apps
       - create new XSD bound forms
       - populate, persist, search
       - Tomcat and eXist
       - off-line capability
       - transformation pipeline
  7. Atom Publishing Protocol (APP)
     - automated invocation of the RESTful Web service
       - HTTP PUT/POST the spreadsheet or XML instance doc
         - to atomserver.codehaus.org
     - where else is APP used?
       - Google Data APIs, Microsoft Live Framework
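
As a rough sketch of the "HTTP POST the XML instance doc" step, the following builds an Atom entry and prepares (but does not send) a POST request using only the Python standard library. The collection URL, identifiers, and field values are hypothetical placeholders, not details of the service named on the slide.

```python
import urllib.request
import xml.etree.ElementTree as ET

ATOM_NS = "http://www.w3.org/2005/Atom"


def build_entry(title, entry_id, updated, author, content_text):
    """Serialize a minimal Atom entry to UTF-8 bytes."""
    ET.register_namespace("", ATOM_NS)  # emit Atom as the default namespace
    entry = ET.Element(f"{{{ATOM_NS}}}entry")
    ET.SubElement(entry, f"{{{ATOM_NS}}}title").text = title
    ET.SubElement(entry, f"{{{ATOM_NS}}}id").text = entry_id
    ET.SubElement(entry, f"{{{ATOM_NS}}}updated").text = updated
    author_el = ET.SubElement(entry, f"{{{ATOM_NS}}}author")
    ET.SubElement(author_el, f"{{{ATOM_NS}}}name").text = author
    content = ET.SubElement(entry, f"{{{ATOM_NS}}}content", {"type": "text"})
    content.text = content_text
    return ET.tostring(entry, encoding="utf-8")


def build_post(collection_url, entry_bytes):
    """Prepare a POST of the entry to an APP collection (nothing is sent)."""
    return urllib.request.Request(
        collection_url,
        data=entry_bytes,
        method="POST",
        headers={"Content-Type": "application/atom+xml;type=entry"},
    )


# All values below are made up for illustration.
entry_bytes = build_entry(
    title="Example record",
    entry_id="urn:example:record:1",
    updated="2009-06-01T00:00:00Z",
    author="Example Author",
    content_text="captured form data goes here",
)
request = build_post("http://example.org/collection", entry_bytes)
# urllib.request.urlopen(request) would actually send it.
```

An APP server that accepts the entry normally answers with `201 Created` and a `Location` header giving the new member URI, which is what makes the record addressable afterwards.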
  8. Atom Syndication Format
     - transform XForm or APP captured info into XHTML+RDFa
     - (permalinked) public recordset in feed entry <content>
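
To make the XHTML+RDFa-in-`<content>` idea concrete, here is a minimal sketch that wraps a record in an Atom `content` element of type `xhtml` and reads the annotated values back out. The subject URI, the `dc:title` and `dc:publisher` property names, and both helper functions are assumptions for illustration; a complete document would also need to declare the `dc` prefix, and real RDFa processing covers far more than this.

```python
import xml.etree.ElementTree as ET

ATOM_NS = "http://www.w3.org/2005/Atom"
XHTML_NS = "http://www.w3.org/1999/xhtml"


def rdfa_content(subject_uri, properties):
    """Build an Atom <content type="xhtml"> element holding RDFa-annotated spans."""
    content = ET.Element(f"{{{ATOM_NS}}}content", {"type": "xhtml"})
    # The "about" attribute names the resource the enclosed properties describe.
    div = ET.SubElement(content, f"{{{XHTML_NS}}}div", {"about": subject_uri})
    for prop, value in properties:
        span = ET.SubElement(div, f"{{{XHTML_NS}}}span", {"property": prop})
        span.text = value
    return content


def extract_literals(content):
    """Recover (subject, property, literal) triples from the simple layout above."""
    triples = []
    for div in content.iter(f"{{{XHTML_NS}}}div"):
        subject = div.get("about")
        for span in div.iter(f"{{{XHTML_NS}}}span"):
            triples.append((subject, span.get("property"), span.text))
    return triples


# Hypothetical record; the URI and values are placeholders.
content = rdfa_content(
    "http://example.org/notice/1",
    [("dc:title", "Example notice"), ("dc:publisher", "Example Agency")],
)
triples = extract_literals(content)
```

The point of the round trip is that the same markup a person reads in a browser also carries machine-readable statements.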
  9. RDFa enabled 'deep link' discovery
     - Rich Snippets from Google
     - SearchMonkey from Yahoo
  10. the london-gazette.co.uk
  11. london-gazette.co.uk/listing
  12. web page = web service
  13. agenda (repeated from slide 2)
  14. Web as DB - Web API
      - Linked Open (Government) Data (LOD)
      - SPARQL endpoints
      - linkeddata.org
  15. linkedmdb.org - LOD example
  16. RDF/XML from W3C Tabulator
  17. from web of docs to web of data
  18. querying the Web
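
Querying a SPARQL endpoint is itself a plain HTTP request: the SPARQL Protocol passes the query text in a `query` parameter. The sketch below only builds such a request URL; the endpoint address and the query are hypothetical and nothing is sent over the network.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def sparql_get_url(endpoint, query):
    """Build a SPARQL Protocol GET URL carrying the query in the 'query' parameter."""
    separator = "&" if "?" in endpoint else "?"
    return endpoint + separator + urlencode({"query": query})


def query_from_url(url):
    """Recover the query text from a URL built by sparql_get_url."""
    return parse_qs(urlsplit(url).query)["query"][0]


# Hypothetical endpoint and query, for illustration only.
EXAMPLE_QUERY = (
    "PREFIX dc: <http://purl.org/dc/terms/> "
    "SELECT ?film ?title WHERE { ?film dc:title ?title } LIMIT 5"
)
url = sparql_get_url("http://example.org/sparql", EXAMPLE_QUERY)
```

Because the query lives in a URL, it can be bookmarked, linked, and cached like any other Web resource.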
  19. faceted browsing
      - click through on this (S-P-O) result set, use as query
  20. federated correlation
      - graph based dynamic schema evolution across silos
        - centralization/normalization not required (or realistic/practical!)
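
The subject-predicate-object (S-P-O) pattern and the claim that graphs from separate silos combine without a central schema can be sketched in a few lines. This is a toy model, not an RDF library: triples are plain tuples, and the `ex:` identifiers and values are invented for illustration.

```python
def merge_graphs(*graphs):
    """Union several triple sets; shared identifiers link the data automatically."""
    merged = set()
    for graph in graphs:
        merged |= set(graph)
    return merged


def match(graph, s=None, p=None, o=None):
    """Return triples matching an S-P-O pattern, where None acts as a wildcard."""
    return sorted(
        triple
        for triple in graph
        if (s is None or triple[0] == s)
        and (p is None or triple[1] == p)
        and (o is None or triple[2] == o)
    )


# Two hypothetical silos describing the same resource with different properties.
silo_a = {("ex:agency1", "ex:name", "Agency One")}
silo_b = {
    ("ex:agency1", "ex:budget", "100"),
    ("ex:agency2", "ex:budget", "250"),
}

merged = merge_graphs(silo_a, silo_b)
about_agency1 = match(merged, s="ex:agency1")  # everything known about one subject
budgets = match(merged, p="ex:budget")         # one facet across all subjects
```

Neither silo had to change its schema: the merge works because both use the same identifier for the same thing, and each match result can be fed back in as the next query, which is the click-through behavior of faceted browsing.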
  21. agenda (repeated from slide 2)
  22. many to many set visualization
  23. ad-hoc analyst/end-user 'meshups'
  24. agenda (repeated from slide 2)
  25. analytics in the wild
  26. cloud scale analytics
      - Google
        - GFS and MapReduce
          - proprietary
          - page rank impl
      - Apache Hadoop
        - HDFS and MapReduce
          - open source
      - Amazon EMR, Cloudera
      - facebook.com
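
The MapReduce model named above can be shown with a single-process toy version. This is only a sketch of the programming model (map, group by key, reduce), not of Hadoop's or Google's implementation; the function names and the word-count example are chosen for illustration.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run a map phase, group values by key (the shuffle), then reduce each group."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    return {key: reducer(key, values) for key, values in groups.items()}


def word_mapper(line):
    """Emit (word, 1) for every whitespace-separated word in a line."""
    for word in line.split():
        yield word.lower(), 1


def count_reducer(key, values):
    """Sum the counts collected for one word."""
    return sum(values)


# Made-up input lines standing in for a large data archive.
lines = ["open data open government", "linked open data"]
counts = map_reduce(lines, word_mapper, count_reducer)
```

In a real cluster the map and reduce calls run on many machines over data stored in a distributed file system, but the contract between the two functions stays the same.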
  27. agenda (repeated from slide 2)
  28. application to EA discipline
      - getting there from here
        - stop:
          - publishing / analyzing / visualizing unstructured data
          - using structured data only in file or message exchanges
        - start:
          - align Gov and Web architecture (including EA KBs!)
          - divide and conquer / publish component ontologies
          - linking them
        - continue:
          - embracing emergent structure and continuous improvement
          - using open source and enabling crowd-sourcing
  29. q&a - discussion
      - thanks for your time and attention!
      - contact me
        - http://xri.net/=george.thomas
        - GSA OCIO Enterprise Chief Architect
        - FCIOC-AIC Services Subcommittee Chair
        - W3C eGov IG invited expert
        - OMG GovDTF Steering Committee
        - Graduate School Faculty SOA Instructor
