SlideShare a Scribd company logo
Pathways to SEASR


                       National Center for Supercomputing Applications!
                          University of Illinois at Urbana-Champaign
                                                                   




The SEASR project and its Meandre infrastructure!
are sponsored by The Andrew W. Mellon Foundation
SEASR
     This project will focus on developing, integrating, deploying, and sustaining a
• 
     set of reusable and expandable software components and a supporting
     framework, SEASR that will benefit a broad set of data mining applications for
     scholars in humanities
     The key goals established for this effort are a set of software centric directives:
• 

     Support the development of a state-of-the-art software environment for unstructured
• 
     data management and analysis of digital libraries, repositories and archives, as well as
     educational platforms that are expected to contribute to many of the humanities
     breakthroughs of the 21st century.

     Support the continued development, expansion, and maintenance of end-to-end
• 
     software system – user interfaces, workflow engines, data management, analysis and
     visualization tools, collaborative tools, and other software integrated into a complete
     environment SEASR – to bring the full power of data analytics to the scholars. 

     Support education and training for use of this software environment for analysis through
• 
     workshops to promote its usage among scholars.

                               +INTERMEDIATE LAYER + EOT
     
APPLICATIONS
Agenda Day 1 Morning
Wednesday April 22, 2009

•  8:30am Registration and Breakfast

•  9:00am SEASR Overview

•  10:00am Break
•  10:30am SEASR Application Examples and Demonstrations 
    –  Zotero and SEASR

    –  Text Analysis and SEASR

    –  Audio Analysis and SEASR (NEMA, NESTER)

    –  Fedora and SEASR

•  Noon Lunch
Agenda Day 1 Afternoon
     1:00pm SEASR Architecture, Installation
• 

     2:00pm SEASR Tools
• 

     2:30pm Break
• 

     3:00pm Breakout Session
• 

      –  Humanities: SEASR Tools with Hand On Demo

      –  Developers: SEASR Technical Details

     4:30pm End of Day
• 
Workshop Objective
The objective of the workshop is:


•  To explain and demonstrate the utility of SEASR
   for digital humanities, and to bring you to a point
   where you could deploy, contribute and utilize the
   SEASR environment.


SEASR + TOOLS + EXEMPLARS + HANDS ON
Workshop Goals
The goals of the workshop are:
•  LEARN: Provide a detailed understanding of the SEASR framework 
•  LEARN: Provide a foundation and examples for participant teams to
   use SEASR in a study or inquiry
•  ADOPT: Share participant generated research plans to utilize SEASR
•  INSTALL: Provide detailed instructions on how to install, build
   components, integrate existing applications, and maintain the SEASR
   environment 
•  SUPPORT: Develop plans for resolution of issues raised by the user
   community in utilization of SEASR 
•  SUSTAIN: Develop a plan for community driven future development
   and dissemination of SEASR 

Learn + Adopt + Sustain
The SEASR Picture
SEASR: Reach + Relevance + Reuse + Repeatability 

    
SEASR emphasizes flexibility, scalability, modularity,
     provides community hub and access to heterogeneous
     data and computational systems
     –  Semantic driven environment for SOA interoperability
     –  Encourages sharing and participation for building communities
     –  Modular construction allows flows to be modified and configured
        to encourage reusability within and across domains
     –  Enables a mashup and integration of tools
     –  Data-intensive flows can be executed on a simple desktop or a
        large cluster(s) without modification
     –  Computation can be created for distributed execution on servers
        where the content lives
     –  User accessibility to control trust and compliance with required
        copyright license of content
     –  Relies on standardized Resource Description Framework (RDF) to
        define components and flow
SEASR
Apps
                     SEASR
Plugins
             SEASR
Web
Apps
                SEASR
Services



                                                      Meandre
Data‐Intensive
Flows


                                                          SEASR
Components

Developer
Tools





                                Data
                           Analy5cs
                    Visualiza5on

                         Gateway
Connec5ons
                 Descrip5ve
Sta5s5cs
                   Graphing

                           Data
Persistence
                 Predic5ve
Modeling
                 Modeling
Vis

                         Data
Transforma5on
                      Discovery
               Info
Vis
(small
mul5ples)

                        Natural
Lang
Processing





                                Component
Repository
                               Component
Discovery


                                                         Meandre
Infrastructure




              Shared
Stores
                       File
Systems
              Metadata
Stores
              SOA
Gateways



                                                       Virtualiza5on
Infrastructure

Data Driven Models
Workbench
•  Web-based UI
•  Components and flows
   are retrieved from server
•  Additional locations of
   components and flows
   can be added to server
•  Create flow using a
   graphical drag and drop
   interface
•  Change property values
•  Execute the flow
Community Hub
SEASR @ Work – Zotero
•  Plugin to Firefox 
•  Zotero manages the
   collection
•  Launch SEASR Analytics 
   –  Citation Analysis uses the
      JUNG network importance
      algorithms to rank the authors
      in the citation network that is
      exported as RDF data from
      Zotero to SEASR
   –  Zotero Export to Fedora
      through SEASR
   –  Saves results from SEASR
      Analytics to a Collection
•  Launch MONK Processing
   –  MONK DB Ingestion Workflow
SEASR @ Work – Fedora




                         Interactive Web
                            Application




Web Service
SEASR @ Work – Entity Mash-up
•  Entity Extraction
   with OpenNLP
•  Locations
   viewed on
   Google Map 
•  Dates viewed on
   Simile Timeline
SEASR @ Work – Audio Analysis
•  NEMA: Executes a
   SEASR flow for each
   run
   –  Loads audio data
   –  Extracts features for
      every 10 sec moving
      window of audio
   –  Loads and applies the
      models
   –  Sends results back to the
      WebUI
•  NESTER: Annotation of
   Audio via Spectral
   Analysis
SEASR @ Work – MONK
Executes flows for
  each analysis
  requested
  –  Predictive
     modeling using
     Naïve Bayes
  –  Predictive
     modeling using
     Support Vector
     Machines (SVM)
SEASR @ Work – DISCUS
     On-demand usage of
• 
     analytics while surfing
      –  While navigating
         request analytics to be
         performed on page
      –  Text extraction and
         cleaning
     Summarization and key
• 
     work extraction
      –  List the important
         terms on the page
         being analyzed
      –  Provide relevant short
         summaries 
     Visual maps
• 
      –  Provide a visual
         representation of the
         key concepts
      –  Show the graph of
         relations between
         concepts
SEASR and UIMA : Emotion Tracking
 
Goal is to have this type of Visualization to track emotions across a
  text document (Leveraging flare.prefuse.org)
Tag Cloud of All RegForm Text
Pathways to SEASR


                           National Center for Supercomputing Applications!
                              University of Illinois at Urbana-Champaign
                                                                       




The SEASR project and its Meandre infrastructure!
are sponsored by The Andrew W. Mellon Foundation

More Related Content

Similar to Seasr Overview Ws April 2009

SEASR-Meandre Architecture Ws Jan 2009
SEASR-Meandre Architecture Ws Jan 2009SEASR-Meandre Architecture Ws Jan 2009
SEASR-Meandre Architecture Ws Jan 2009Loretta Auvil
 
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsMeandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsXavier Llorà
 
Exploring Data Visualization
Exploring Data VisualizationExploring Data Visualization
Exploring Data VisualizationJim Jenkins
 
Text Mining and SEASR
Text Mining and SEASRText Mining and SEASR
Text Mining and SEASRLoretta Auvil
 
Fedora App Slide 2009 Hastac
Fedora App Slide 2009 HastacFedora App Slide 2009 Hastac
Fedora App Slide 2009 HastacLoretta Auvil
 
Apache Spark in Scientific Applications
Apache Spark in Scientific ApplicationsApache Spark in Scientific Applications
Apache Spark in Scientific ApplicationsDr. Mirko Kämpf
 
Apache Spark in Scientific Applciations
Apache Spark in Scientific ApplciationsApache Spark in Scientific Applciations
Apache Spark in Scientific ApplciationsDr. Mirko Kämpf
 
7 Ways To Leverage SP for PM Success
7 Ways To Leverage SP for PM Success7 Ways To Leverage SP for PM Success
7 Ways To Leverage SP for PM SuccessDux Raymond Sy
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...Ilkay Altintas, Ph.D.
 
The Server Side of Responsive Web Design
The Server Side of Responsive Web DesignThe Server Side of Responsive Web Design
The Server Side of Responsive Web DesignDave Olsen
 
Bringing Wireless Sensing to its full potential
Bringing Wireless Sensing to its full potentialBringing Wireless Sensing to its full potential
Bringing Wireless Sensing to its full potentialAdrian Hornsby
 
Web 2 0 Data Visualization With Jsf
Web 2 0 Data Visualization With JsfWeb 2 0 Data Visualization With Jsf
Web 2 0 Data Visualization With Jsfrajivmordani
 
Applications of the REST Principle
Applications of the REST PrincipleApplications of the REST Principle
Applications of the REST Principleelliando dias
 

Similar to Seasr Overview Ws April 2009 (20)

SEASR eScience 2008
SEASR eScience 2008SEASR eScience 2008
SEASR eScience 2008
 
SEASR-Meandre Architecture Ws Jan 2009
SEASR-Meandre Architecture Ws Jan 2009SEASR-Meandre Architecture Ws Jan 2009
SEASR-Meandre Architecture Ws Jan 2009
 
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the CloudsMeandre: Semantic-Driven Data-Intensive Flows in the Clouds
Meandre: Semantic-Driven Data-Intensive Flows in the Clouds
 
SEASR and Zotero
SEASR and ZoteroSEASR and Zotero
SEASR and Zotero
 
SEASR Overview
SEASR OverviewSEASR Overview
SEASR Overview
 
Exploring Data Visualization
Exploring Data VisualizationExploring Data Visualization
Exploring Data Visualization
 
SEASR Installation
SEASR InstallationSEASR Installation
SEASR Installation
 
Text Mining and SEASR
Text Mining and SEASRText Mining and SEASR
Text Mining and SEASR
 
Huu Bang's Résumé
Huu Bang's RésuméHuu Bang's Résumé
Huu Bang's Résumé
 
Fedora App Slide 2009 Hastac
Fedora App Slide 2009 HastacFedora App Slide 2009 Hastac
Fedora App Slide 2009 Hastac
 
Apache Spark in Scientific Applications
Apache Spark in Scientific ApplicationsApache Spark in Scientific Applications
Apache Spark in Scientific Applications
 
Apache Spark in Scientific Applciations
Apache Spark in Scientific ApplciationsApache Spark in Scientific Applciations
Apache Spark in Scientific Applciations
 
7 Ways To Leverage SP for PM Success
7 Ways To Leverage SP for PM Success7 Ways To Leverage SP for PM Success
7 Ways To Leverage SP for PM Success
 
A View on eScience
A View on eScienceA View on eScience
A View on eScience
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
 
Nuxeo JavaOne 2007
Nuxeo JavaOne 2007Nuxeo JavaOne 2007
Nuxeo JavaOne 2007
 
The Server Side of Responsive Web Design
The Server Side of Responsive Web DesignThe Server Side of Responsive Web Design
The Server Side of Responsive Web Design
 
Bringing Wireless Sensing to its full potential
Bringing Wireless Sensing to its full potentialBringing Wireless Sensing to its full potential
Bringing Wireless Sensing to its full potential
 
Web 2 0 Data Visualization With Jsf
Web 2 0 Data Visualization With JsfWeb 2 0 Data Visualization With Jsf
Web 2 0 Data Visualization With Jsf
 
Applications of the REST Principle
Applications of the REST PrincipleApplications of the REST Principle
Applications of the REST Principle
 

More from Loretta Auvil

More from Loretta Auvil (9)

SEASR Text
SEASR TextSEASR Text
SEASR Text
 
SEASR-Fedora App
SEASR-Fedora AppSEASR-Fedora App
SEASR-Fedora App
 
SEASR Community Hub
SEASR Community HubSEASR Community Hub
SEASR Community Hub
 
Meandre Workbench Ws Jan 2009
Meandre Workbench Ws Jan 2009Meandre Workbench Ws Jan 2009
Meandre Workbench Ws Jan 2009
 
SEASR and UIMA
SEASR and UIMASEASR and UIMA
SEASR and UIMA
 
ICHASS Workshop Lab
ICHASS Workshop LabICHASS Workshop Lab
ICHASS Workshop Lab
 
ICHASS Workshop Seasr
ICHASS Workshop SeasrICHASS Workshop Seasr
ICHASS Workshop Seasr
 
ICHASS Workshop Text Mining
ICHASS Workshop Text MiningICHASS Workshop Text Mining
ICHASS Workshop Text Mining
 
Text Mining Wksp Auvil
Text Mining Wksp AuvilText Mining Wksp Auvil
Text Mining Wksp Auvil
 

Recently uploaded

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Product School
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaRTTS
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxDavid Michel
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIES VE
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomCzechDreamin
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsPaul Groth
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...CzechDreamin
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Alison B. Lowndes
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka DoktorováCzechDreamin
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfChristopherTHyatt
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaCzechDreamin
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoTAnalytics
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutesconfluent
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Product School
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsExpeed Software
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsStefano
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...Product School
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyJohn Staveley
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Julian Hyde
 

Recently uploaded (20)

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Demystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John StaveleyDemystifying gRPC in .Net by John Staveley
Demystifying gRPC in .Net by John Staveley
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 

Seasr Overview Ws April 2009

  • 1. Pathways to SEASR National Center for Supercomputing Applications! University of Illinois at Urbana-Champaign The SEASR project and its Meandre infrastructure! are sponsored by The Andrew W. Mellon Foundation
  • 2. SEASR This project will focus on developing, integrating, deploying, and sustaining a •  set of reusable and expandable software components and a supporting framework, SEASR that will benefit a broad set of data mining applications for scholars in humanities The key goals established for this effort are a set of software centric directives: •  Support the development of a state-of-the-art software environment for unstructured •  data management and analysis of digital libraries, repositories and archives, as well as educational platforms that are expected to contribute to many of the humanities breakthroughs of the 21st century. Support the continued development, expansion, and maintenance of end-to-end •  software system – user interfaces, workflow engines, data management, analysis and visualization tools, collaborative tools, and other software integrated into a complete environment SEASR – to bring the full power of data analytics to the scholars.  Support education and training for use of this software environment for analysis through •  workshops to promote its usage among scholars. +INTERMEDIATE LAYER + EOT APPLICATIONS
  • 3. Agenda Day 1 Morning Wednesday April 22, 2009 •  8:30am Registration and Breakfast •  9:00am SEASR Overview •  10:00am Break •  10:30am SEASR Application Examples and Demonstrations –  Zotero and SEASR –  Text Analysis and SEASR –  Audio Analysis and SEASR (NEMA, NESTER) –  Fedora and SEASR •  Noon Lunch
  • 4. Agenda Day 1 Afternoon 1:00pm SEASR Architecture, Installation •  2:00pm SEASR Tools •  2:30pm Break •  3:00pm Breakout Session •  –  Humanities: SEASR Tools with Hand On Demo –  Developers: SEASR Technical Details 4:30pm End of Day • 
  • 5. Workshop Objective The objective of the workshop is: •  To explain and demonstrate the utility of SEASR for digital humanities, and to bring you to a point where you could deploy, contribute and utilize the SEASR environment. SEASR + TOOLS + EXEMPLARS + HANDS ON
  • 6. Workshop Goals The goals of the workshop are: •  LEARN: Provide a detailed understanding of the SEASR framework •  LEARN: Provide a foundation and examples for participant teams to use SEASR in a study or inquiry •  ADOPT: Share participant generated research plans to utilize SEASR •  INSTALL: Provide detailed instructions on how to install, build components, integrate existing applications, and maintain the SEASR environment •  SUPPORT: Develop plans for resolution of issues raised by the user community in utilization of SEASR •  SUSTAIN: Develop a plan for community driven future development and dissemination of SEASR Learn + Adopt + Sustain
  • 8. SEASR: Reach + Relevance + Reuse + Repeatability SEASR emphasizes flexibility, scalability, modularity, provides community hub and access to heterogeneous data and computational systems –  Semantic driven environment for SOA interoperability –  Encourages sharing and participation for building communities –  Modular construction allows flows to be modified and configured to encourage reusability within and across domains –  Enables a mashup and integration of tools –  Data-intensive flows can be executed on a simple desktop or a large cluster(s) without modification –  Computation can be created for distributed execution on servers where the content lives –  User accessibility to control trust and compliance with required copyright license of content –  Relies on standardized Resource Description Framework (RDF) to define components and flow
  • 9. SEASR
Apps
 SEASR
Plugins
 SEASR
Web
Apps
 SEASR
Services
 Meandre
Data‐Intensive
Flows
 SEASR
Components
 Developer
Tools
 Data
 Analy5cs
 Visualiza5on
 Gateway
Connec5ons
 Descrip5ve
Sta5s5cs
 Graphing
 Data
Persistence
 Predic5ve
Modeling
 Modeling
Vis
 Data
Transforma5on
 Discovery
 Info
Vis
(small
mul5ples)
 Natural
Lang
Processing
 Component
Repository
 Component
Discovery
 Meandre
Infrastructure
 Shared
Stores
 File
Systems
 Metadata
Stores
 SOA
Gateways
 Virtualiza5on
Infrastructure

  • 11. Workbench •  Web-based UI •  Components and flows are retrieved from server •  Additional locations of components and flows can be added to server •  Create flow using a graphical drag and drop interface •  Change property values •  Execute the flow
  • 13. SEASR @ Work – Zotero •  Plugin to Firefox •  Zotero manages the collection •  Launch SEASR Analytics –  Citation Analysis uses the JUNG network importance algorithms to rank the authors in the citation network that is exported as RDF data from Zotero to SEASR –  Zotero Export to Fedora through SEASR –  Saves results from SEASR Analytics to a Collection •  Launch MONK Processing –  MONK DB Ingestion Workflow
  • 14. SEASR @ Work – Fedora Interactive Web Application Web Service
  • 15. SEASR @ Work – Entity Mash-up •  Entity Extraction with OpenNLP •  Locations viewed on Google Map •  Dates viewed on Simile Timeline
  • 16. SEASR @ Work – Audio Analysis •  NEMA: Executes a SEASR flow for each run –  Loads audio data –  Extracts features for every 10 sec moving window of audio –  Loads and applies the models –  Sends results back to the WebUI •  NESTER: Annotation of Audio via Spectral Analysis
  • 17. SEASR @ Work – MONK Executes flows for each analysis requested –  Predictive modeling using Naïve Bayes –  Predictive modeling using Support Vector Machines (SVM)
  • 18. SEASR @ Work – DISCUS On-demand usage of •  analytics while surfing –  While navigating request analytics to be performed on page –  Text extraction and cleaning Summarization and key •  work extraction –  List the important terms on the page being analyzed –  Provide relevant short summaries Visual maps •  –  Provide a visual representation of the key concepts –  Show the graph of relations between concepts
  • 19. SEASR and UIMA : Emotion Tracking Goal is to have this type of Visualization to track emotions across a text document (Leveraging flare.prefuse.org)
  • 20. Tag Cloud of All RegForm Text
  • 21. Pathways to SEASR National Center for Supercomputing Applications! University of Illinois at Urbana-Champaign The SEASR project and its Meandre infrastructure! are sponsored by The Andrew W. Mellon Foundation