Loading…

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

Like this presentation? Why not share!

Building the Inform Semantic Publishing Ecosystem: from Author to Audience

on

  • 1,765 views

ISWC Presentation for Inform Technologies

ISWC Presentation for Inform Technologies

Statistics

Views

Total Views
1,765
Views on SlideShare
1,765
Embed Views
0

Actions

Likes
1
Downloads
16
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Building the Inform Semantic Publishing Ecosystem: from Author to Audience Building the Inform Semantic Publishing Ecosystem: from Author to Audience Presentation Transcript

  • More Meaning. Better Results. Building the Inform Semantic Publishing Ecosystem: from Author to Audience Marc Hadfield VP, Research & Development [email_address]
      • Marc Hadfield
      • Semantic Technology, Computer Science
      • Inform Technologies (Head of R&D)
        • Semantic Technologies applied to Content Analysis & Distribution
      • Alitora Systems (Co-Founder / CTO)
        • Life Science Semantic Technology, Research, Big Data Analytics, Semantic HPC
        • Life Science Natural Language Processing
      • Columbia Genome Center
        • NLP applied to Life Science Research Articles
      • LCconnect (CTO)
        • Letter-of-Credit Exchange
  • Semantics in Publishing…
      • Ongoing Theme at ISWC 2010…
        • NY Times
        • Facebook (OpenGraph)
        • Elsevier
        • BBC
  • What is Inform?
      • Inform is a content enrichment solution designed to increase consumer engagement, page views and revenue.
      • We provide a hosted Semantic Web Service for content publishers that:
        • Reads your article before you publish it
        • Turns main topics and entities (people, places, companies, organizations) into links
        • Provides feeds of related web content when you publish it
      • New Direction : Optimizing Content Distribution via Direct Channels
        • Web users moving away from destination web sites, but still want the destination web site content.
      • Companies utilizing Inform include:
  • Connecting your content Audio, Video & Blogs from the Web Articles from the Web Content from Inform Your Affiliates’ Content Your Content Affiliated Content Your Content Licensed Content Google Street View Topic 0.90 Google Company 1.00 Ireland Place 0.70 Norway Place 0.70 South Africa Place 0.70 Sweden Place 0.70 Brian McClendon Person 0.80 Mountain View, California Place 0.60 Wi-Fi Topic 0.50
  • Related Content Widgets
  • Inform Topic Pages, Micro Sites
  • My Job: Building the Semantic Platform…
      • “ Silo”-ed Semantic Technology  Semantic Web
        • Aligned with Wikipedia, Leverage Linked Data for Mash-Ups
        • RDFa, SKOS, Semantic SEO
      • Semantic / NLP Engine
        • Improve Features, Quality
      • Semantic Data Infrastructure
        • Scalable Infrastructure
      • Semantic Data Analysis
        • Algorithms (Topology of Graphs), Inference
        • “ PageRank” on semantic data
      • Personalization, Usage Analysis
      • Micro Sites
        • Clusters of Topics, Generating Rich Content Experience
      • Distributing to Social Platforms
        • i.e. Facebook
  • Inform: Author to Audience
  • Leverage Inform Taxonomy
        • Author 
        • Content Creation Services
        • Semantic Data Repository
        • Semantic Data Analysis
        • Content Selection Algorithms
        • Webservices
        • Content Distribution Services
        •  Audience
    Inside the Semantic System Architecture
  • Content Creation
      • Article Creation Tool (ACT)
        • Author Tools
        • Embed in CMS, Tumblr / Wordpress Plugin
      • Publisher Portal
        • Editorial Tool
        • Content Feeds
      • Web Crawl
      • Summarizer
        • Create smart “blurbs” to advertise article
      • LinkedData
        • Freebase, Wikipedia, DBPedia, et cetera.
  • ACT Tool
  • ACT Tool
  • ACT Tool, Tumblr, Wordpress
  • Publisher Portal
  • Summarizer
  • Semantic Data Repository
      • Data Master / Data Node
        • Federated Semantic Data Managers
        • SPARQL Triplestore (scalable cluster)
        • Semantic Search
          • Search Indexes (Semi-Structured and Full-Text Search)
            • Lucene/Siren (Sindice)
          • Facets, Frequency Counts
        • Cache (In-Memory)
        • Blob Store (Voldemort)
        • Listener to Activity (Flume)
          • User Activity (clicks)
          • Content Activity (content updates)
        • Near Real-Time Trends, Analysis
        • Compute Algorithms (Stored Procedures in Groovy)
        • Long Term Content Archive (offline)
  • Semantic Data Analysis
      • Natural Language Processing
          • Rules & Machine Learning, Training
          • 500K articles per day, 4,000 unique sites
        • Text Extraction, Section/Sentence Extraction
        • Tokenization, Part-of-Speech, Noun/Verb Phrases
        • Entity Extraction, Entity Normalization
        • Topic Extraction, Summarization, Clustering
      • User Activity
        • User Model (Personalization)
      • Semantic Inference
        • F-Logic, Multi-Domain
        • Linked Data Mash-Ups
      • Semantic Graph Topology
        • Entity / Property Importance Metrics, Ranking, “PageRank”
        • Which triples in LinkedData are interesting?
  • Content Selection Algorithms
      • Model of User, Personalization
        • Social Networks provide Context
      • Semantic Analysis of Content
      • Algorithms
        • Maximize Relevancy / Relatedness (Meets Editorial Criteria)
        • Maximize Click-Through
          • Cute Kitten vs. Engagement Issue
        • Maximize Monetization
          • Goal: Content Exchange
  • Webservices
      • REST
        • Outputs RDF / JSON Data
      • Natural Language Processing
        • Article to Semantic MetaData
      • Related Content
        • Inputs: Content, Personalization, Algorithm
        • Articles
        • Semantic Mash-Ups
        • Topics
        • Entities
      • Semantic Query, Site Search
      • Storage, Content Repository
  • Content Distribution Services
      • Customer Destinations (Traditional Business)
        • Deep Integration
      • Publisher Widgets
        • Levels of Lightweight Integration
        • Example: Related-Content-Widget in JavaScript
      • Inform.com
        • Topic Pages
      • Micro Sites
        • Several Thousand Owned-and-Operated Domains/Sites, Topic Driven
      • Social Networks
        • Facebook
      • Tools:
      • Semantic SEO
        • RDFa, SKOS
  • Semantic MetaData, RDFa http://inspector.sindice.com
  • Facebook App
  • Using Facebook OpenGraph
    • Relevancy Algorithm:
    • Combine:
    • Trending / Popular Topics
    • Trending / Popular Articles
    • Personalization “Liked” Topics
    • Personalization “Liked” Articles
    • User Profiles (“Users like you…”)
  • Facebook “Liked” Topics
  • Facebook Article Stream
  • Inform: Author to Audience via Semantics
  • Thanks for your attention!
      • Questions?
      • Contact Information:
      • Marc Hadfield
      • [email_address]