NYC Sem Web Meetup 20090219

  • 1,394 views
Uploaded on

Presentation to the New York Semantic Web Meetup in February 2009 at Dow Jones' offices in 1155 Ave of the Americas

Presentation to the New York Semantic Web Meetup in February 2009 at Dow Jones' offices in 1155 Ave of the Americas

More in: Technology , Education
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
1,394
On Slideshare
0
From Embeds
0
Number of Embeds
2

Actions

Shares
Downloads
0
Comments
0
Likes
4

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. An Overview of Semantic Technologies at Dow Jones New York Semantic Web Meetup Christine Connors Global Director, Semantic Technology Solutions Dow Jones February 19, 2009
  • 2. Christine Connors - Who?
    • B.S. Theatre Arts Management
    • Survived SABRE
    • M.S. Library & Information Science
    • Survived DIALOG
    • Hated Cataloging & Classification class
    • Analyst firm --> Dot-Com --> Library --> IT --> Content Operations/IKM -->Management
    • “ Over” taxonomies at Raytheon, started learning about semantic web
    • Prefer the hybrid, practical approach
    • Need to solve UI/IxD and security/DAM issues
    • Content not technology
    • Many hats…
  • 3. The World of News Corporation
  • 4. Dow Jones
    • Founded: 1882
    • First year distributed ‘electronically’: 1897
    • Full-time employees: 7000+
    • Business & Financial News Staff: 1900
    Local Media Group Enterprise Media Group Consumer Media Group
  • 5. Dow Jones Enterprise Media Group Business & Relationship Intelligence
  • 6. What is the Semantic Web ?
    • A universal medium for exchanging information that can be processed electronically and still have meaning and relevance
    • It provides a common, standardized framework that allows data to be shared and reused across applications, enterprises, and community boundaries.
    • Why do we care about it for our solutions ?
    • We need to provide import and export support for the Semantic Web to enable easier data exchange
    • Greater interoperability means better standardization and integration into Web based applications – more customers can use it!
  • 7. Massive amounts of real-time data
    • On average we code 5 million articles per month
      • 180,000+ articles per day
    • 9000+ images per week
    • Publications and pictures from more than 159 countries and 350 geographical regions worldwide
    • 5500 journals, 2300 newspapers, 400+ newswires, 300 media transcripts, 15,000 websites, blogs & forums, 2.4M company reports, 1.3M people profiles
    • 8 24x7 coding desks around the globe
    • Real-time delivery
      • Scrolling headlines
      • Tickers
      • Feeds
      • Some product’s delivery time is measured in milli-seconds
  • 8. Committed to the process Assess Business Goals Content IT Metadata Taxonomy Standards & Best Practices Users Design Audience Segmentation & Definition Facet Analysis Information Architecture Editorial Guidelines & Workflow Build Entity Extraction (machine and/or human) Content Tagging Rules (machine and/or human) Taxonomy Construction & Mapping Maintain Continuous Work-in-progress Engage end-users (query log analysis, focus groups, folksonomy) Governance Process
  • 9. Information Processing model Dow Jones Intelligent Indexing Data Capture Normalizer Data Capture Normalization Quality Control Distribution Editorial Interfaces Content Monitoring & Alerting Information Providers Coding Manual Coding Interface Manual Coding Queue Entity Extraction Categorizer Rules-based Coding Expansion/ Validation
  • 10. It’s all about the metadata
    • Dow Jones Intelligent Indexing™
      • 7 million+ company codes
      • 7 million+ executives
      • 900+ industry codes
      • 535+ region codes
      • 680+ subject codes
      • 22 language codes
    • Generate Indexing
      • 11.5 million+ companies
      • 5 million+ person profiles
  • 11. Why is Vocabulary Management Valuable to your organization?
    • Standardize your corporate vocabulary and improve information management. Adding structure to information assets streamlines product development, regulatory compliance and risk mitigation
    • Improve product development with a global business vocabulary that feeds right into downstream applications such as portals, reporting programs and CRMs
    • Enable Corporate intranet or portal users to easily and quickly locate relevant information by leveraging taxonomy to improve productivity
    • Gain valuable productivity benefits by connecting users and workgroups with the relevant information they need, when and where they need it
    • Streamline information access and license pre-built taxonomies that can be easily integrated into your enterprise search technology to streamline information access
    • Drive online revenue and increase customer satisfaction by improving how online customers find your information, products or services by cross-referencing industry classification codes and brand names
    • Monetize media content by unifying digital platforms to enable a better search infrastructure and a competitive advantage in the media marketplace
  • 12. A managed taxonomy using connects information to build a global semantic network John Doe ABC Corporation New York Employed By Employer Of Located In Location Of Manufacturer Of Vendor To Widgets Manufactured By PQR Corporation XYZ Corporation Client Of Client Of Vendor To
  • 13. Global Organizations Need Synaptica: As a centralized repository for multi-lingual semantic management that is: - Independent from web-portal search and categorization systems - Scalable ; capable of evolving with emerging corporate semantic standards HTML CSV MS Word XML Z39.5 SKOS RDF OWL Web Services Centralized Taxonomy Management System Synaptica ® Portals Portals Categorizers Portals Portals Search Engines Portals Portals Content Portals Multiple users working in collaborative and compartmentalized space P e r m i s s i o n s
  • 14. Use Case Example: How a taxonomy will help your corporate intranet succeed:
    • Adds structure to a portal/intranet project
      • Focuses on user requirements
      • Discovers what content is valuable
      • Achieves convergence on issues like semantics & terminology
      • Manages organizational change
    • Drives people to the portal/intranet
      • Improves user experience and increases usage
    • Makes information professionals more valuable to the business
      • Info pros become information architects
      • Enables them to drive actionable information to specific user environments
  • 15. Synaptica’s new side by side relationship editor makes the creation and editing of terms a one step process. Save time finding, creating and editing terms with a new one step process
  • 16. Synaptica drag and drop hierarchical relationship editing provides a simple, convenient way to manage vocabulary hierarchies. Managing and editing vocabulary hierarchies just got easier
  • 17. Term Information Summary Window provides quick views of term details Gain quick views of term information right inside your current window
  • 18. In addition to CSV, HTML and XML formats, reports may be created in Microsoft Word and Excel. Leverage Expanded Reporting Functionality for Easier, More Flexible Information Sharing
  • 19. View Comprehensive Executive Profiles
    • Biographies
    • Bio Clips
    • News
    • Relationships
    • Affiliation Strength
    • Web Search
  • 20. Dynamic Analysis of Any Web Page
  • 21. Access Dow Jones g2 at the Click of a Button Single click access to trusted information in context to the news article in seconds with no typing
  • 22. Gain a Strategic Vantage Point Discovery Search Alert Visualization Discovery
  • 23. Drive Profitable Decisions Across Enterprise
  • 24. Create Intelligence with Innovative Tools
    • Technologies to help people quickly analyze and understand large volumes of information
    • Content whenever, wherever, however
      • Intranets, portals, public Websites, trading applications, CRM, mobile devices
    • Design and build custom solutions
  • 25. Time is Money!
  • 26. References
    • DowJones.com
      • Fact Sheets http://www. dowjones . com/Pressroom/FactSheets . htm
      • DowJones Elementized News Feed http://www.djnewswires.com/us/djenf.htm
      • Dow Jones Enterprise Media Group http://www.dowjones.com/Products_Services/ElectronicPublishing/EnterpriseMedia.htm
      • icb Industry Classification Benchmark http://www.icbenchmark.com/
      • Outsell. Market Intelligence Service: Market Size, Share & Forecast Report - Information Industry Market Size and Share Rankings: Preliminary 2007 Results . Volume 2, January 10, 2008.
      • Celent. IT Spending in Financial Services: A Global Perspective . December 2007.
  • 27. Thank You!
    • [email_address]