An Overview of Semantic Technologies at Dow Jones. New York Semantic Web Meetup. Christine Connors, Global Director, Semantic Technology Solutions, Dow Jones. February 19, 2009.
Christine Connors: Who? B.S. Theatre Arts Management. Survived SABRE. M.S. Library & Information Science. Survived DIALOG. Hated Cataloging & Classification class. Analyst firm --> Dot-Com --> Library --> IT --> Content Operations/IKM --> Management. “Over” taxonomies at Raytheon; started learning about the semantic web. Prefer the hybrid, practical approach. Need to solve UI/IxD and security/DAM issues. Content, not technology. Many hats…
The World of News Corporation
Dow Jones. Founded: 1882. First year distributed ‘electronically’: 1897. Full-time employees: 7000+. Business & Financial News Staff: 1900. Local Media Group; Enterprise Media Group; Consumer Media Group.
Dow Jones Enterprise Media Group: Business & Relationship Intelligence
What is the Semantic Web? A universal medium for exchanging information that can be processed electronically and still have meaning and relevance. It provides a common, standardized framework that allows data to be shared and reused across application, enterprise, and community boundaries. Why do we care about it for our solutions? We need to provide import and export support for the Semantic Web to enable easier data exchange. Greater interoperability means better standardization and integration into Web-based applications, so more customers can use it!
Massive amounts of real-time data. On average we code 5 million articles per month. 180,000+ articles per day. 9000+ images per week. Publications and pictures from more than 159 countries and 350 geographical regions worldwide. Sources: 5500 journals, 2300 newspapers, 400+ newswires, 300 media transcripts, 15,000 websites, blogs & forums, 2.4M company reports, 1.3M people profiles. Eight 24x7 coding desks around the globe. Real-time delivery: scrolling headlines, tickers, feeds. Some products' delivery time is measured in milliseconds.
Committed to the process. Assess: Business Goals; Content; IT; Metadata; Taxonomy; Standards & Best Practices; Users. Design: Audience Segmentation & Definition; Facet Analysis; Information Architecture; Editorial Guidelines & Workflow. Build: Entity Extraction (machine and/or human); Content Tagging Rules (machine and/or human); Taxonomy Construction & Mapping. Maintain: Continuous Work-in-progress; Engage end-users (query log analysis, focus groups, folksonomy); Governance Process.
Information Processing model: Dow Jones Intelligent Indexing. [Diagram labels: Information Providers; Data Capture; Data Capture Normalizer; Normalization; Coding; Entity Extraction; Categorizer; Rules-based Coding; Expansion/Validation; Manual Coding Queue; Manual Coding Interface; Quality Control; Distribution; Editorial Interfaces; Content Monitoring & Alerting.]
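As a rough illustration of how rules-based coding and a manual coding queue might fit together, the sketch below applies a few pattern rules to article text and sends anything no rule matches to a queue for human coders. Everything here is assumed for the example: the rules, the code names, and the routing policy are hypothetical and are not taken from Dow Jones Intelligent Indexing.

```python
import re

# Hypothetical rules: a regular expression and the code it assigns.
RULES = [
    (re.compile(r"\bmerger\b|\bacquisition\b", re.I), "subject:mergers-acquisitions"),
    (re.compile(r"\bNew York\b"), "region:new-york"),
]


def code_article(text):
    """Return the sorted list of codes whose rule matches the text."""
    return sorted({code for pattern, code in RULES if pattern.search(text)})


def route(articles):
    """Split articles into machine-coded results and a manual coding queue.

    Assumed policy: an article that matches no rule goes to human coders.
    """
    coded, manual_queue = {}, []
    for article_id, text in articles.items():
        codes = code_article(text)
        if codes:
            coded[article_id] = codes
        else:
            manual_queue.append(article_id)
    return coded, manual_queue


articles = {
    "a1": "ABC Corporation announces acquisition in New York",
    "a2": "Mild weather expected this weekend",
}
coded, manual_queue = route(articles)
```

In this sketch `coded` holds the machine-assigned codes for `a1`, and `a2` ends up in `manual_queue` because no rule fired.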
It’s all about the metadata. Dow Jones Intelligent Indexing™: 7 million+ company codes, 7 million+ executives, 900+ industry codes, 535+ region codes, 680+ subject codes, 22 language codes. Generate Indexing: 11.5 million+ companies, 5 million+ person profiles.
Why is Vocabulary Management Valuable to your organization? Standardize your corporate vocabulary and improve information management: adding structure to information assets streamlines product development, regulatory compliance and risk mitigation. Improve product development with a global business vocabulary that feeds right into downstream applications such as portals, reporting programs and CRMs. Enable corporate intranet or portal users to easily and quickly locate relevant information by leveraging taxonomy to improve productivity. Gain valuable productivity benefits by connecting users and workgroups with the relevant information they need, when and where they need it. Streamline information access by licensing pre-built taxonomies that can be easily integrated into your enterprise search technology. Drive online revenue and increase customer satisfaction by improving how online customers find your information, products or services, cross-referencing industry classification codes and brand names. Monetize media content by unifying digital platforms to enable a better search infrastructure and a competitive advantage in the media marketplace.
A managed taxonomy connects information to build a global semantic network. [Diagram: entities John Doe, ABC Corporation, New York, Widgets, PQR Corporation and XYZ Corporation, linked by the reciprocal relationships Employed By / Employer Of, Located In / Location Of, Manufacturer Of / Manufactured By, and Vendor To / Client Of.]
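The reciprocal relationships on this slide can be sketched as a small set of triples in which asserting one direction automatically records its inverse. This is only an illustration: which entities are connected to which is assumed, since the transcript lists the labels without the connecting lines, and the `SemanticNetwork` class is hypothetical rather than anything from Synaptica.

```python
from collections import defaultdict

# Reciprocal relationship labels taken from the slide.
INVERSES = {
    "Employed By": "Employer Of",
    "Located In": "Location Of",
    "Manufacturer Of": "Manufactured By",
    "Vendor To": "Client Of",
}
# Make the lookup work in both directions.
INVERSES.update({v: k for k, v in list(INVERSES.items())})


class SemanticNetwork:
    """Hypothetical in-memory store of (subject, relation, object) triples."""

    def __init__(self):
        self._edges = defaultdict(set)

    def relate(self, subject, relation, obj):
        """Record a relationship and its reciprocal in one step."""
        self._edges[subject].add((relation, obj))
        self._edges[obj].add((INVERSES[relation], subject))

    def related(self, term, relation):
        """Return the terms linked to `term` by `relation`, sorted."""
        return sorted(o for r, o in self._edges[term] if r == relation)


net = SemanticNetwork()
# Assumed connections; the slide does not show which entities are linked.
net.relate("John Doe", "Employed By", "ABC Corporation")
net.relate("ABC Corporation", "Located In", "New York")
net.relate("ABC Corporation", "Manufacturer Of", "Widgets")
net.relate("ABC Corporation", "Vendor To", "PQR Corporation")
net.relate("ABC Corporation", "Vendor To", "XYZ Corporation")
```

Storing the inverse at write time means a query such as `net.related("PQR Corporation", "Client Of")` can be answered from either end of the relationship without a second pass over the data.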
Global Organizations Need Synaptica as a centralized repository for multi-lingual semantic management that is: - Independent from web-portal search and categorization systems - Scalable; capable of evolving with emerging corporate semantic standards. Formats and interfaces: HTML, CSV, MS Word, XML, Z39.5, SKOS, RDF, OWL, Web Services. [Diagram: Synaptica® as the Centralized Taxonomy Management System, connected to Portals, Categorizers, Search Engines and Content; multiple users working in collaborative and compartmentalized space, governed by Permissions.]
Use Case Example: How a taxonomy will help your corporate intranet succeed. Adds structure to a portal/intranet project: focuses on user requirements; discovers what content is valuable; achieves convergence on issues like semantics & terminology; manages organizational change. Drives people to the portal/intranet: improves user experience and increases usage. Makes information professionals more valuable to the business: info pros become information architects; enables them to drive actionable information to specific user environments.
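To make the SKOS item in the format list concrete, here is a minimal sketch of what exporting a tiny taxonomy as SKOS in Turtle syntax could look like. It is not Synaptica's export: the `to_skos_turtle` function, the `http://example.org/taxonomy/` base URI and the sample terms are all hypothetical, and the sketch assumes labels contain no characters that need escaping. Only the SKOS namespace and the `skos:Concept`, `skos:prefLabel` and `skos:broader` terms come from the W3C vocabulary.

```python
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."


def to_skos_turtle(base_uri, concepts):
    """Serialize {id: {"label": str, "broader": id or None}} as SKOS Turtle.

    Assumes ids are valid Turtle local names and labels need no escaping.
    """
    lines = [SKOS_PREFIX, f"@prefix ex: <{base_uri}> .", ""]
    for concept_id, data in sorted(concepts.items()):
        label = data["label"]
        parts = [
            f"ex:{concept_id} a skos:Concept",
            f'    skos:prefLabel "{label}"@en',
        ]
        broader = data.get("broader")
        if broader:
            # Hierarchy is expressed as a link to the parent concept.
            parts.append(f"    skos:broader ex:{broader}")
        lines.append(" ;\n".join(parts) + " .")
    return "\n".join(lines)


# Hypothetical two-term vocabulary.
taxonomy = {
    "media": {"label": "Media", "broader": None},
    "newspapers": {"label": "Newspapers", "broader": "media"},
}
turtle = to_skos_turtle("http://example.org/taxonomy/", taxonomy)
print(turtle)
```

A production export would normally go through an RDF library so that escaping, language tags and URI handling are done properly; the string-building here is only meant to show the shape of the output.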
Synaptica’s new side-by-side relationship editor makes the creation and editing of terms a one-step process. Save time finding, creating and editing terms with a new one-step process.
Synaptica drag-and-drop hierarchical relationship editing provides a simple, convenient way to manage vocabulary hierarchies. Managing and editing vocabulary hierarchies just got easier.
The Term Information Summary Window provides quick views of term details. Gain quick views of term information right inside your current window.
In addition to CSV, HTML and XML formats, reports may be created in Microsoft Word and Excel. Leverage expanded reporting functionality for easier, more flexible information sharing.
View Comprehensive Executive Profiles: Biographies, Bio Clips, News, Relationships, Affiliation Strength, Web Search.
Dynamic Analysis of Any Web Page
Access Dow Jones g2 at the Click of a Button. Single-click access to trusted information in the context of the news article, in seconds, with no typing.
Gain a Strategic Vantage Point Discovery Search Alert Visualization Discovery
Drive Profitable Decisions Across Enterprise
Create Intelligence with Innovative Tools. Technologies to help people quickly analyze and understand large volumes of information. Content whenever, wherever, however: intranets, portals, public websites, trading applications, CRM, mobile devices. Design and build custom solutions.
Time is Money!
References
DowJones.com Fact Sheets. http://www.dowjones.com/Pressroom/FactSheets.htm
Dow Jones Elementized News Feed. http://www.djnewswires.com/us/djenf.htm
Dow Jones Enterprise Media Group. http://www.dowjones.com/Products_Services/ElectronicPublishing/EnterpriseMedia.htm
ICB Industry Classification Benchmark. http://www.icbenchmark.com/
Outsell. Market Intelligence Service: Market Size, Share & Forecast Report - Information Industry Market Size and Share Rankings: Preliminary 2007 Results. Volume 2, January 10, 2008.
Celent. IT Spending in Financial Services: A Global Perspective. December 2007.
Thank You! [email_address]

