• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
100608 temis marklogic joint webinar slideshare
 

100608 temis marklogic joint webinar slideshare

on

  • 1,965 views

The slides from a joint TEMIS - Marklogic webinar with

The slides from a joint TEMIS - Marklogic webinar with
Guest speaker : Amanda Ward (Nature Publishing Group)
MarkLogic : David Wormald
TEMIS : Daniel Mayer (myself)

Statistics

Views

Total Views
1,965
Views on SlideShare
1,944
Embed Views
21

Actions

Likes
2
Downloads
60
Comments
0

3 Embeds 21

http://www.slideshare.net 18
http://www.slashdocs.com 2
http://breakthroughpractices.com 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • Experience in BI, DBMS, App Servers, Search US based company Sequoia Capital: a few small successes like Oracle, Google, Yahoo, Youtube, Apple 4 th fastest by Deloitte
  • Breadth of industries, companies, usages… Major companies are betting on Mark Logic to build next generation information applications
  • Our XML Server is unique and powerful - Combine Database, Search, and Application Server Built together, not bolted on – a true hybrid technology, we didn’t just take a search engine, a DBMS and a blow torch and weld them together Without one of the components, there is no XML server Uses standard XML formats and languages for data, querying, and logic It turns out this was a very good idea. - There is *a lot* of information that is XML or is naturally modeled by XML.
  • Beautiful!
  • Beautiful!
  • This slide talks to why MarkLogic Server is the ideal platform for information applications that leverage entity enrichment
  • Don’t forget: Information Infrastructures Start with information as the fundamental unit The XML Server is flexible Capable of massive scale and speed Unifies the information spectrum Information-based Alerting Relevant data finds the user Send email, IM, call a web service, … Run additional queries… Update the database… Exploitation of Location Information Search using Geospatial markup in the data Search based on proximity or area Real-time query and analytics
  • Don’t forget: Information Infrastructures Start with information as the fundamental unit The XML Server is flexible Capable of massive scale and speed Unifies the information spectrum Information-based Alerting Relevant data finds the user Send email, IM, call a web service, … Run additional queries… Update the database… Exploitation of Location Information Search using Geospatial markup in the data Search based on proximity or area Real-time query and analytics

100608 temis marklogic joint webinar slideshare 100608 temis marklogic joint webinar slideshare Presentation Transcript

  • The Smart Content Advantage June 8 th 2010 Daniel Mayer Product Marketing Manager David Wormald Sales Director UK, Information & Media Amanda Ward Head of platform technologies
  • Outline
    • Speaker introductions (5’)
    • Introducing TEMIS and MarkLogic (10’)
      • Corporate overview
      • Digital Publishing Challenges & Opportunities
      • Our Solutions and their Benefits
    • Joint customer showcase : Nature Publishing Group (25’)
    • Summary (5’)
    • Q&A
  • Introducing TEMIS and MarkLogic
    • Experienced management team: Business Objects, Oracle, Sun, Verity
    • Over 200 customers, over 160 employees
    • Headquarters in San Carlos, California (Silicon Valley)
    • Lead investor: Sequoia Capital
    • Named 4 th fastest growing IT company in Silicon Valley
    About MarkLogic MarkLogic Corporation is a leading provider of infrastructure software for information applications
  • About TEMIS New York Heidelberg 2000 TEMIS is a leading provider of content enrichment & discovery solutions Paris Grenoble London 70
  • MarkLogic & TEMIS Team-Up
    • Active collaboration
      • Long-standing relationship
      • Shared vision
      • Joint go-to-market
    • Complementary solutions
      • Integrated
      • Best-of-Breed strategy
    • Joint commitment to Publishing
      • World-class solution & expertise
      • Prestigious references
  • Selected Customers
  • Information Providers Need Clarity of Purpose facebook
  • “ Please help me…”
    • Create new information products
      • Repackage existing content
      • Develop applications more quickly
      • Integrate new content easily
      • Be more productive
    • Differentiate my information products
      • Add value to our content
      • Enable user interaction with content
    • Take advantage of disruptive trends
      • Social Media
      • Mobile
  • MarkLogic Server – Unique Approach Application Server DBMS Search
  • What Does MarkLogic Server Do? Manipulate Store Deliver Search
  • An Agile Information Infrastructure Application Services MarkLogic Infrastructure HTML / XML Reports Briefings Policies Metadata Connectors And Toolkits Customer Applications Custom Publishing Content Analytics Metadata catalogs Search- Based Apps Mobile Content Delivery
  • The 3 Ingredients of Content Enrichment 1. Information Extraction HTC Strikes Back Against Apple – Wall Street Journal - NIRAJ SHETH May 13 th 2010 - HTC Corp., the maker of several phones that run on Google Inc.'s Android platform, filed a complaint against Apple Inc. alleging patent infringement, a move that follows an intellectual-property suit that the iPhone maker filed against HTC two months ago. HTC's complaint with the U.S. International Trade Commission alleges that Apple has violated five patents held by the Taiwan-based electronics manufacturer and asks the trade court to stop Apple from selling the iPhone, the iPad and iPod in the U.S. Apple in March filed complaints with the ITC, and in a federal court in Delaware, outlining a total of 20 patents related to touch-screen technology and mobile computing that it alleges HTC infringed. In its filing, HTC cites patents related to power consumption in smartphones and how cellphones dial contacts from an address book. Some of the patents cited by Apple in its suit also relate to power usage, but it wasn't immediately clear how similar those claims are to HTC's. "We are taking this action against Apple to protect our intellectual property, our industry partners, and most importantly our customers that use HTC phones," HTC's North American vice president, Jason Mackenzie, said in a written statement. In another sign of the growing legal pressure on Google and its Android partners, HTC recently announced a licensing deal with Microsoft Corp., which said it believes that HTC's Android phones infringes on a range of Microsoft patents including user interfaces. The share of Android-based smartphones in the U.S. grew to 28% in the first quarter, according to research firm NPD, for the first time edging out the iPhone's 21% share. HTC Apple Google US International Trade Commission Microsoft Corp Jason Mackenzie NIRAJ SHETH Company names US International Trade Commission Organization names People names Niraj Sheth iPhone iPad iPod Product names Jason Mackenzie NPD filed complaints with the ITC filed a complaint against Apple Inc. alleging patent infringement Relationships Court Case ITC recently announced a licensing deal with HTC Licensing American vice president ‘ s North HTC HR iPhone iPad iPod Android smartphones cellphones user interfaces power usage HTC Apple Google Microsoft Corp. NPD Technology names Android smartphones power usage user interfaces The share of Android-based smartphones in the U.S. grew to 28% in the first quarter Market share
  • The 3 Ingredients of Content Enrichment 2. Knowledge Insertion
      • Prior (authoritative) Knowledge about the entity
      • Internal / Proprietary
      • External / Commercial or Open Source
    HTC Strikes Back Against Apple Apple Apple
  • The 3 Ingredients of Content Enrichment 3. Content Networking
    • Create links based on dynamic queries that leverage context
      • Extracted Concepts (« metadata »)
      • User preferences & areas of interest
      • Available Knowledge & Content Assets
    HTC Strikes Back Against Apple – Wall Street Journal - NIRAJ SHETH May 13 th 2010 - HTC Corp., the maker of several phones that run on Google Inc.'s Android platform, filed a complaint against Apple Inc. alleging patent infringement, a move that follows an intellectual-property suit that the iPhone maker filed against HTC two months ago. HTC's complaint with the U.S. International Trade Commission alleges that Apple has violated five patents held by the Taiwan-based electronics manufacturer and asks the trade court to stop Apple from selling the iPhone, the iPad and iPod in the U.S. Apple in March filed complaints with the ITC, and in a federal court in Delaware, outlining a total of 20 patents related to touch-screen technology and mobile computing that it alleges HTC infringed. In its filing, HTC cites patents related to power consumption in smartphones and how cellphones dial contacts from an address book. Some of the patents cited by Apple in its suit also relate to power usage, but it wasn't immediately clear how similar those claims are to HTC's. "We are taking this action against Apple to protect our intellectual property, our industry partners, and most importantly our customers that use HTC phones," HTC's North American vice president, Jason Mackenzie, said in a written statement. In another sign of the growing legal pressure on Google and its Android partners, HTC recently announced a licensing deal with Microsoft Corp., which said it believes that HTC's Android phones infringes on a range of Microsoft patents including user interfaces. The share of Android-based smartphones in the U.S. grew to 28% in the first quarter, according to research firm NPD, for the first time edging out the iPhone's 21% share. filed complaints with the ITC filed a complaint against Apple Inc. alleging patent infringement ITC recently announced a licensing deal with HTC American vice president ‘ s North HTC HTC Apple Google US International Trade Commission Microsoft Corp iPhone iPad iPod Jason Mackenzie NIRAJ SHETH Android smartphones cellphones user interfaces power usage The share of Android-based smartphones in the U.S. grew to 28% in the first quarter NPD other articles discussing Apple http://news.google.com/news/search?q = Apple Biography of Jason Mackenzie http://en.wikipedia.org/wiki/ Jason Mackenzie other articles discussing Apple and HTC http://news.google.com/news/search?q= + Apple HTC patents held by Apple related to smartphones http://patft.uspto.gov/netacgi/nph-Parser?S1=( +AND+ ) smartphones Apple
  • The 3 Ingredients of Content Enrichment 3. Content Networking HTC Strikes Back Against Apple – Wall Street Journal - NIRAJ SHETH May 13 th 2010 - HTC Corp., the maker of several phones that run on Google Inc.'s Android platform, filed a complaint against Apple Inc. alleging patent infringement, a move that follows an intellectual-property suit that the iPhone maker filed against HTC two months ago. HTC's complaint with the U.S. International Trade Commission alleges that Apple has violated five patents held by the Taiwan-based electronics manufacturer and asks the trade court to stop Apple from selling the iPhone, the iPad and iPod in the U.S. Apple in March filed complaints with the ITC, and in a federal court in Delaware, outlining a total of 20 patents related to touch-screen technology and mobile computing that it alleges HTC infringed. In its filing, HTC cites patents related to power consumption in smartphones and how cellphones dial contacts from an address book. Some of the patents cited by Apple in its suit also relate to power usage, but it wasn't immediately clear how similar those claims are to HTC's. "We are taking this action against Apple to protect our intellectual property, our industry partners, and most importantly our customers that use HTC phones," HTC's North American vice president, Jason Mackenzie, said in a written statement. In another sign of the growing legal pressure on Google and its Android partners, HTC recently announced a licensing deal with Microsoft Corp., which said it believes that HTC's Android phones infringes on a range of Microsoft patents including user interfaces. The share of Android-based smartphones in the U.S. grew to 28% in the first quarter, according to research firm NPD, for the first time edging out the iPhone's 21% share. filed complaints with the ITC filed a complaint against Apple Inc. alleging patent infringement ITC recently announced a licensing deal with HTC American vice president ‘ s North HTC HTC Apple Google US International Trade Commission Microsoft Corp iPhone iPad iPod Jason Mackenzie NIRAJ SHETH Android smartphones cellphones user interfaces power usage The share of Android-based smartphones in the U.S. grew to 28% in the first quarter NPD other articles discussing Apple http://news.google.com/news/search?q = Apple Biography of Jason Mackenzie http://en.wikipedia.org/wiki/ Jason Mackenzie other articles discussing Apple and HTC http://news.google.com/news/search?q= + Apple HTC patents held by Apple related to smartphones http://patft.uspto.gov/netacgi/nph-Parser?S1=( +AND+ ) smartphones Apple Your Content Repo SELECT  ?content WHERE { ?content c:topic  “ ” ?content c:Author “ ” ?content c:keyplayer “ ” } Market share NPD Android SELECT  ?content WHERE { ?content c:topic  “ ” ?content c:keyplayer “ ” } Court Case Apple
  • Product Architecture Enrich
  • Product Architecture Luxid® Content Enrichment Platform Enrich Luxid® Content Pipeline Luxid® Information Analytics
  • Luxid® Content Enrichment Platform Luxid® Annotation Factory Enrich Skill Cartridge® Library
  • Luxid® Content Enrichment Platform Luxid® Annotation Factory Enrich Skill Cartridge® Library Luxid® Knowledge Studio
  • Luxid® Platform features
    • Genuine semantic models for deep information extraction
      • Extract entities, relationships, topics, sentiment, categories
    • Off-the-shelf Skill Cartridges®
      • Generic and specialized applications
    • Deeply Customizable and Extendable
      • With Luxid® Knowledge Studio
    • Platform approach
      • Editorial and Product Development
      • Batch and real-time processing
      • On-Premise and SaaS
    Enrich Skill Cartridge® Library Luxid® Knowledge Studio Luxid® Annotation Factory
  • MarkLogic Server for Content Enrichment
    • Inline Enrichment
      • Entities become embedded, flexible, extensible metadata
      • Efficiency gains and future-proofing
    • Open Enrichment Framework
      • Production-ready
      • Deploy on-load, dynamically or retrospectively
    • MarkLogic understands structure (and text)
      • Gives added context to entity tags
  • Information Infrastructure Of A Joint Deployment Custom Publishing Content Analytics Metadata catalogs Search- Based Apps Mobile Content Delivery
  • Configuration Of The Enrichment Pipeline
  • Joint Value Proposition
    • Editorial Productivity
      • Automate the tagging process
      • Develop faster and deeper insight into content archives
    • Make existing content more compelling
      • Improve findability
      • Enhance insight with Context & Analytics
      • Engage & Retain your Audience
    • Develop new products
      • Agile prototyping & response to market
      • Slice and Dice Content
      • Serve dynamic Topic Pages
  • Customer Applications
  • Deployment at
  • Nature Publishing Group : The Situation In 2008
    • Publisher of a diverse range of over 15 000 articles per year, article content range from news to original research and include a range of multimedia
    • Known for excellent editorial content and for good use of technology around this content to provide new exciting services to scientists e.g. Nature network, Connotea, Nature Precedings
    • Less able to easily manage fresh interpretations of our core publishing program
      • Content all stored flat on a file server - reuse painfully manual
      • No real ability to better interpret data and find new hooks - slicing and dicing hard to do
  • Publishing Program Goals
    • “ Article Particles”
      • Managing content below the article level, while retaining relationship with parent article
      • Slicing and dicing at various levels
    • Collections
      • Special relationships not necessarily explicit in the article content - e.g. disease portals
      • Tools and workflows in 2008 could not support the above without huge amounts of manual involvement
    • Querying/Search/Retrieval
      • Content in XML but not taking advantage of powerful search capabilities and /or ability to be easily represented in numerous different formats – OAI/RSS etc.
    • Persistent validation to retain content integrity
      • No semantic validation
      • Curated information – wanted to extend ‘information within an article
  • Publishing Program Motivations
    • Provision of new and evolving products and services
      • Subject portals
      • Dynamic publishing/re-publishing - articles no longer siloed at point of publication
      • Integrate non NPG resources into a paper
    • Customer retention
      • Increased time spent on site by users
      • Improved CPD for site license customers
    • Revenue generation
      • More advertising inventory – new products
      • Higher CPM for advertising based on better targeting
      • New sponsorship opportunities
  • Fast Forward to 2010 : TEMIS Deployment
    • Luxid® installed at NPG in January 2009
    • Went live in March 2009 with new journal Nature Chemistry
    • Excellent first candidate for ‘entity’ identification
      • Chemical structures have good standard identification system acting as a unique hook
      • Chemistry is a visual science : structure identification has become a requirement
  • Fast Forward to 2010 : TEMIS Deployment
    • Luxid® installed at NPG in January 2009
    • Went live in March 2009 with new journal Nature Chemistry
    • Excellent first candidate for ‘entity’ identification
    • Structure annotation now a feature on five of our titles, and we are looking at biological entity extraction with Luxid®
  • Fast Forward to 2010 : TEMIS Deployment
    • Luxid® installed at NPG in January 2009
    • Went live in March 2009 with new journal Nature Chemistry
    • Excellent first candidate for ‘entity’ identification
    • Structure annotation now a feature on five of our titles, and we are looking at biological entity extraction with Luxid®
    • Benefits
      • Users have increased functionality
      • We can target advertising more appropriately
      • We can create better links through our data corpus
      • We can curate location entities following a similar process that we set up for chemical structures
  • Nature Publishing Group in 2010 : TEMIS Deployment
    • Articles are automatically annotated to identify all chemical structures mentioned in the text.
    • N2S conversion enables InChI look-up
  • Fast Forward to 2010 : MarkLogic Deployment
    • MarkLogic integrated end 2008
    • Full data corpus is stored in MarkLogic
    • Capable of leveraging all available metadata
  • Fast Forward to 2010 : MarkLogic Deployment
    • MarkLogic integrated end 2008
    • Full data corpus is stored in MarkLogic
    • Capable of leveraging all available metadata
    • Benefits
      • Full search service : Bespoke, Machine to machine, Mobile, Desktop based widgets, … more to come
      • Easy to access/organise content with subject-based metadata
      • Allows new connections between articles to be made – related articles based on metadata
      • Allows new products on nature.com created and easily published – content queried and new collections created
      • New off platform products can be created via easy data reuse e.g. data source for Nature iPhone app
  • Integrating both platforms
    • Looking into the future: demo of how annotated entities could drive new products published out of MarkLogic
    • Currently we identify chemical structures http://www.nature.com/nchem/journal/v1/n1/full/nchem.100.html
    • Will soon extend this to genes/proteins/species/diseases
    • We already create ‘portal’ pages created with ease by non technical users (again using metadata hooks in articles)
    • http://www.nature.com/subject/nextgenseq , http://www.nature.com/subject/interferon , http://www.nature.com/subject/methylation
    • Current pages are generally sponsored, but ease of creation means these could be created at many different levels with many different use cases
    • More hooks in your article the easier it is to slice and dice
  •  
  •  
  • Measuring the impact
    • Advertising
      • Improved targeting means CPM can be increased 3 to 5 times
    • Soon to launch Entity Pages
      • Dynamic product driven from ‘entity hooks’
      • Entity pages for structures will be a resource on nature.com providing a broad range of information about ‘structure x’
      • Envisage these pages to be available in their 1000’s
      • Increase site stickiness and hence value of CPD for our librarian customers
    • Content has better value thanks to explicit tagging
      • E.g. datamining
  • Measuring the impact
    • Advertising
      • Improved targeting means CPM can be increased 3 to 5 times
    • Soon to launch Entity Pages
      • Dynamic product driven from ‘entity hooks’
      • Entity pages for structures will be a resource on nature.com providing a broad range of information about ‘structure x’
      • Envisage these pages to be available in their 1000’s
      • Increase site stickiness and hence value of CPD for our librarian customers
    • Content has better value thanks to explicit tagging
      • E.g. datamining
  • Looking into the future
    • We intend on modularizing / organizing this in our workflow
      • The current “XML on file server” workflow is limited
  • Looking into the future
    • The modular workflow will bring the following benefits :
      • Wider range of annotated entities
      • Curation by different sources
      • New services potentially driven/created by users from article hooks
      • Semantic understanding of our data corpus – become more diverse publishers
  • Nature 462 , 1011-1015 (24 December 2009) | doi:10.1038/nature08588; Subject Categories: Biochemistry Databases Ontology linking Topic modelling Rights metadata Entity page Dataset Storage
  • Future-proof your digital business
    • What’s at stake?
      • Leverage current assets (customers & products)
      • Create new information products
      • Take advantage of disruptive trends
    • Best-of-breed solutions for Information Applications
      • Scale-out MarkLogic Server
      • Off-the-shelf & extendable Luxid® Content Enrichment Platform
    • Joint & value proposition
      • Support agility & reduce time to market
      • Engage & retain audience
      • Fuel new business opportunities
  • Q & A Daniel Mayer [email_address] David Wormald [email_address] Amanda Ward [email_address]