DataCenter (C) 2012 Michael Szalay
What is the data center




      Data integration solution
      Web-based application
      Modular
What is the data center




      Integrated data model
      Integrated search
      Integrated view
Data Model



  Design the data model
Data Model



  Design the data model
      OWL syntax
      Open source tool Protege can be used
      Dynamic upload / download in the datacenter
Thesauri



   Define thesauri
       Use SKOS Thesauri to define your glossar
       automatic search term expansion
       Dynamic upload / download in the datacenter
Data Sources




   Define data sources
       databases
       documents
       linked data on the web
       user generated content
Data Sources



   Define databases
       all databases with JDBC drivers possible
Data annotation




   Define annotators
       keyword extraction
       categorization
       enrich data
       link data
Data annotation




   Define annotators
       per default, there is a simple keyword extraction for
       documents
       classifier has to be trained
       you may want to have custom annotators for your data
Data annotation


   Training
Data annotation
   Classification can be trained on the fly by drag and drop elements
Data annotation
   Manual annotation always possible (rating, keywords, comments)
Data views




   Define custom views
       There is a default view for data
       You may want to have custom views for different types of
       elements
       Dynamic replacement of views possible at runtime
Search

   Semantic search
         Automatic search term expansion according to the glossary
         Facets and filters
Search




   Search is extendable
Briefcase



   Collect links to documents
Dashboard




      Replaceable modules in the dashboard
      You may want to have custom modules
      Gadget compatible!
Dashboard
Dashboard


  Standard gadgets can be plugged in
Dashboard


  The Leo translator gadget
Monitoring

   Web-based monitoring
Architecture




      OSGi based on Apache Karaf
      VAADIN ui framework
      OpenRDF Sesame triple store (replacable)
      Apache JackRabbit content store
Roadmap




     Internationalization
     Collaboration

DataCenter

  • 1.
    DataCenter (C) 2012Michael Szalay
  • 2.
    What is thedata center Data integration solution Web-based application Modular
  • 3.
    What is thedata center Integrated data model Integrated search Integrated view
  • 4.
    Data Model Design the data model
  • 5.
    Data Model Design the data model OWL syntax Open source tool Protege can be used Dynamic upload / download in the datacenter
  • 6.
    Thesauri Define thesauri Use SKOS Thesauri to define your glossar automatic search term expansion Dynamic upload / download in the datacenter
  • 7.
    Data Sources Define data sources databases documents linked data on the web user generated content
  • 8.
    Data Sources Define databases all databases with JDBC drivers possible
  • 9.
    Data annotation Define annotators keyword extraction categorization enrich data link data
  • 10.
    Data annotation Define annotators per default, there is a simple keyword extraction for documents classifier has to be trained you may want to have custom annotators for your data
  • 11.
  • 12.
    Data annotation Classification can be trained on the fly by drag and drop elements
  • 13.
    Data annotation Manual annotation always possible (rating, keywords, comments)
  • 14.
    Data views Define custom views There is a default view for data You may want to have custom views for different types of elements Dynamic replacement of views possible at runtime
  • 15.
    Search Semantic search Automatic search term expansion according to the glossary Facets and filters
  • 16.
    Search Search is extendable
  • 17.
    Briefcase Collect links to documents
  • 18.
    Dashboard Replaceable modules in the dashboard You may want to have custom modules Gadget compatible!
  • 19.
  • 20.
    Dashboard Standardgadgets can be plugged in
  • 21.
    Dashboard TheLeo translator gadget
  • 22.
    Monitoring Web-based monitoring
  • 23.
    Architecture OSGi based on Apache Karaf VAADIN ui framework OpenRDF Sesame triple store (replacable) Apache JackRabbit content store
  • 24.
    Roadmap Internationalization Collaboration