rNews: Embedding Metadata in On-line News
From the talk at SemTech
Wednesday, June 8, 2011
09:45 AM - 10:35 AM
Level: Business / Non-Technical
Case Study
Location: Yosemite A
The IPTC, a consortium of the world's major news agencies, news publishers and news industry vendors, recently released rNews, a semantic standard for on-line news. rNews uses RDFa to annotate HTML documents with news-specific metadata, to help with search, ad placement, aggregation and the sharing of on-line news. Jayson Lorenzen, a software engineer with Business Wire and one of the IPTC Member organization delegates working on rNews, will give an overview of the IPTC, the rNews standard, why rNews is needed and how the standard was eventually created. The talk will include use cases and live demonstrations of rNews and will end with a call to action for you to participate; rNews is currently at version 0.5 and the IPTC is looking for feedback on how to improve the standard.
31. IPTC: XML Standards NewsML Structure for packages of news items containing text, photo, graphics, and video components <NewsML Version="1.2"> ... <NewsItem> <Identification> <NewsIdentifier> <ProviderId>businesswire.com</ProviderId><NewsItemId>20100809006755 ... http://www.newsml.org
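As a rough illustration of the nesting shown on this slide, the sketch below reads the provider and item identifiers from a NewsML 1.2 style fragment using Python's standard library. The sample document is an assumption: it keeps only the elements named on the slide and adds closing tags so that it parses, and everything else a real NewsItem carries is left out. The function name `read_identifiers` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Simplified, hypothetical fragment: only the elements named on the slide,
# closed so that the document is well-formed. Not a complete NewsML item.
SAMPLE = """
<NewsML Version="1.2">
  <NewsItem>
    <Identification>
      <NewsIdentifier>
        <ProviderId>businesswire.com</ProviderId>
        <NewsItemId>20100809006755</NewsItemId>
      </NewsIdentifier>
    </Identification>
  </NewsItem>
</NewsML>
""".strip()


def read_identifiers(xml_text):
    """Return a (ProviderId, NewsItemId) pair for every NewsIdentifier found."""
    root = ET.fromstring(xml_text)
    pairs = []
    for ident in root.iter("NewsIdentifier"):
        pairs.append((ident.findtext("ProviderId"), ident.findtext("NewsItemId")))
    return pairs


identifiers = read_identifiers(SAMPLE)
```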
32. IPTC: XML Standards G2 Family of standards XML Schema based components shared by entire family: (NewsML-G2, EventsML-G2 & SportsML-G2) <newsItem guid="urn:newsml:acmenews.com:20081125T1205:US-FINANCE-PAULSON" version="3" xmlns="http://iptc.org/std/nar/2006-10-01/" standard="NewsML-G2" http://iptc.org/site/News_Exchange_Formats/IPTC_G2-Standards_-_about_which_one_do_you_want_to_know_more
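The G2 opening tag on this slide puts the element in an XML namespace, which changes how a parser reports the element name. A minimal sketch, assuming the tag is closed as an empty element purely so that it parses (a real G2 item has required content that is omitted here); `describe_item` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Namespace URI copied from the slide.
G2_NS = "http://iptc.org/std/nar/2006-10-01/"

# The slide's opening tag, closed here as an empty element (an assumption
# made only so the sample is well-formed).
SAMPLE = (
    '<newsItem guid="urn:newsml:acmenews.com:20081125T1205:US-FINANCE-PAULSON" '
    'version="3" xmlns="http://iptc.org/std/nar/2006-10-01/" '
    'standard="NewsML-G2"/>'
)


def describe_item(xml_text):
    """Check the namespaced element name and return the three attributes."""
    root = ET.fromstring(xml_text)
    # ElementTree reports namespaced names as "{namespace}localname".
    if root.tag != "{%s}newsItem" % G2_NS:
        raise ValueError("not a G2 newsItem: %s" % root.tag)
    return {
        "guid": root.get("guid"),
        "version": int(root.get("version")),
        "standard": root.get("standard"),
    }


item = describe_item(SAMPLE)
```

The attributes carry no prefix, so they are read by their plain names even though the element itself is namespaced.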
62. rNews: Design Strategy Reuse existing IPTC Standards IPTC standards widely used in the industry Familiar to implementors Familiar to the IPTC
63. rNews: Design Strategy Use Controlled Vocabularies to minimize the number of objects and properties cv.iptc.org/newscodes/format/ cv.iptc.org/newscodes/audiocodec/ cv.iptc.org/newscodes/videocodec/ cv.iptc.org/newscodes/mediatype/
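One way to read this design point: a single property can take a vocabulary URI as its value, instead of the standard defining a separate object or property for every format or codec. The sketch below builds and splits such URIs. The four scheme names come from the slide; the `http://` prefix, the helper names and the code value `examplecode` are assumptions for illustration, not real NewsCodes entries.

```python
# Base path from the slide; the "http://" prefix is an assumption.
BASE = "http://cv.iptc.org/newscodes/"

# The four vocabularies listed on the slide.
SCHEMES = ("format", "audiocodec", "videocodec", "mediatype")


def newscode_uri(scheme, code):
    """Build a vocabulary URI of the form BASE + scheme + "/" + code."""
    if scheme not in SCHEMES:
        raise ValueError("unknown scheme: %s" % scheme)
    return "%s%s/%s" % (BASE, scheme, code)


def split_newscode(uri):
    """Split a vocabulary URI back into (scheme, code)."""
    if not uri.startswith(BASE):
        raise ValueError("not a newscodes URI: %s" % uri)
    scheme, _, code = uri[len(BASE):].partition("/")
    if scheme not in SCHEMES or not code:
        raise ValueError("unrecognised newscodes URI: %s" % uri)
    return scheme, code


# "examplecode" is a placeholder, not an actual entry in the vocabulary.
example_uri = newscode_uri("format", "examplecode")
example_parts = split_newscode(example_uri)
```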
79. rNews: Timeline. September 2010: rNews proposed to IPTC at the fall meeting. (End of timeline: STANDARD)
80. rNews: Timeline. September 2010: rNews proposed to IPTC at the fall meeting. March 2011: rNews draft v0.1 approved by IPTC at the spring meeting. (End of timeline: STANDARD)
81. rNews: Timeline. September 2010: rNews proposed to IPTC at the fall meeting. March 2011: rNews draft v0.1 approved by IPTC at the spring meeting. March - May 2011: public testing & feedback. (End of timeline: STANDARD)
82. rNews: Timeline. September 2010: rNews proposed to IPTC at the fall meeting. March 2011: rNews draft v0.1 approved by IPTC at the spring meeting. March - May 2011: public testing & feedback. (End of timeline: STANDARD) Many news organizations & invited experts involved: IPTC and Member companies (AP, NYTimes, EBU, Getty Images, Business Wire, Press Association, Transtel, Thomson Reuters, XML Team and more ...); guests and forum members; semantic-web and linked-data W3C mailing lists: lists.w3.org/Archives/Public/semantic-web/2011Apr/0050.html lists.w3.org/Archives/Public/public-lod/2011Apr/0055.html
83. rNews: Timeline. September 2010: rNews proposed to IPTC at the fall meeting. March 2011: rNews draft v0.1 approved by IPTC at the spring meeting. March - May 2011: public testing & feedback. June 2011: IPTC to vote on rNews draft v0.5 at the summer meeting. (End of timeline: STANDARD)
84. rNews: Timeline. September 2010: rNews proposed to IPTC at the fall meeting. March 2011: rNews draft v0.1 approved by IPTC at the spring meeting. March - May 2011: public testing & feedback. June 2011: IPTC to vote on rNews draft v0.5 at the summer meeting. June - September 2011: request public testing & feedback. (End of timeline: STANDARD)
85. rNews: Timeline. September 2010: rNews proposed to IPTC at the fall meeting. March 2011: rNews draft v0.1 approved by IPTC at the spring meeting. March - May 2011: public testing & feedback. June 2011: IPTC to vote on rNews draft v0.5 at the summer meeting. June - September 2011: request public testing & feedback. September 2011: final IPTC vote on rNews.
87. rNews: Use Case / Demo Business Wire & rNews (we leave presentation for this one) http://www.businesswire.com
88. rNews: Use Case / Demo rNews and TinyMCE http://www.ontos.com
89. Editor I (manual)… In a WYSIWYG tool, the user selects text and chooses the rNews type from the drop-down. When saving/submitting, the tool creates the HTML, including the RDFa/rNews. WYSIWYG Editor (e.g. TinyMCE)
90. Editor I (manual)… Close-up showing the triple browser and the property chooser menus. WYSIWYG Editor (e.g. TinyMCE)
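The manual flow on these two slides (select text, choose a type, save as HTML with RDFa) can be approximated with a small helper. This is only a sketch of the idea, not how TinyMCE or the Ontos tool is implemented: the function name is hypothetical, and `rnews:exampleProperty` is a placeholder rather than a term from the rNews vocabulary.

```python
from html import escape


def annotate_selection(text, start, end, prop):
    """Wrap text[start:end] in a span carrying an RDFa `property` attribute."""
    if not (0 <= start < end <= len(text)):
        raise ValueError("selection out of range")
    before, selected, after = text[:start], text[start:end], text[end:]
    # Escape every piece so that characters such as "&" stay valid HTML.
    return '{}<span property="{}">{}</span>{}'.format(
        escape(before), escape(prop), escape(selected), escape(after)
    )


sentence = "Acme & Co. opens a new office"
# Characters 0..10 are the "selected" text; the property name is a placeholder.
markup = annotate_selection(sentence, 0, 10, "rnews:exampleProperty")
```

A real editor plugin would work on the document tree rather than on character offsets, but the saved output has the same shape: ordinary HTML with extra attributes on the wrapped text.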
91. Editor II (semi-)automatic… Same WYSIWYG editor, but using a feature to "Automatically annotate content". Calls an annotation system (e.g. from Ontos) that will analyze the article and suggest the tags/values. The user can still change the tags before saving.
95. rNews: Call To Action How you can continue the community involvement
96. rNews: Call To Action Today: Try rNews on your site and continue sending us feedback: what works well, what needs work. Fall 11: Implement rNews version 1.0 on your web site. Build tools to utilize news data made available via rNews.
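A tool of the kind this call to action asks for could start by collecting the annotated values from a page. The sketch below uses only Python's standard `html.parser` and is not a conforming RDFa processor: it ignores prefixes, `content` attributes and subject resolution, and assumes well-nested markup. The class name and the two property names are placeholders, not rNews terms.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag and so must not go on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class PropertyCollector(HTMLParser):
    """Collect (property, text) pairs from elements with a `property` attribute."""

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._stack = []  # one [property-or-None, text parts] entry per open element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        self._stack.append([dict(attrs).get("property"), []])

    def handle_data(self, data):
        # Text belongs to every open element that carries a property.
        for prop, parts in self._stack:
            if prop is not None:
                parts.append(data)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self._stack:
            return
        prop, parts = self._stack.pop()
        if prop is not None:
            self.pairs.append((prop, "".join(parts).strip()))


# Hypothetical page fragment; the property names are placeholders.
SAMPLE = (
    "<div>"
    '<h1 property="rnews:exampleHeadline">Acme opens office</h1>'
    '<p>By <span property="rnews:exampleAuthor">Jane Doe</span></p>'
    "</div>"
)

collector = PropertyCollector()
collector.feed(SAMPLE)
collector.close()
pairs = collector.pairs
```

For production use, a full RDFa parser would be the safer choice; this only shows that the annotations are readable with very little machinery.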
- IPTC (The International Press Telecommunications Council): a non-profit consortium of the world's major news agencies and news industry vendors. Creates and maintains standards for the exchange of news (and for embedding metadata in photos) used by virtually every major news organization in the world.
- Established in 1965 by a group of news organisations to safeguard the telecommunications interests of the world's press. The group included the Alliance Européenne des Agences de Presse, ANPA (now NAA), FIEJ (now WAN) and the North American News Agencies (a joint committee of Associated Press, Canadian Press and United Press International).
- IPTC 7901: internationalized version of ANPA 1312 from the Newspaper Association of America (NAA), formerly the American Newspaper Publishers Association (ANPA).
- Since the 1990s, IPTC's standardization work has been based on open standards (first SGML, then the XML family of standards, MIME, Unicode, and so on).
- NITF (News Industry Text Format): IPTC's first XML standard for exchanging news; it started as SGML (Standard Generalized Markup Language).
- 2001: NewsML, IPTC's first standard for exchanging multimedia news (XML).
- IPTC 7901, the internationalized version of ANPA 1312 from the Newspaper Association of America (NAA), formerly the American Newspaper Publishers Association (ANPA), is used for the transmission of text content to newspapers, news agencies and other recipients.
- Initially released in the early eighties and last updated in 1995, though it is still in use by many news organizations around the world.
- Structured text format, using mainly whitespace delimiters (space and CR LF characters).
- The IPTC 7901 format is composed of four sections: preheader information, message header, message text and post-text information.
If the IPTC decided to use metadata from different namespaces, we would have to ask: which ones to choose? The W3C (Media Annotations group) is working on a news ontology and lists twenty (20) different existing metadata schemas, and there are others not listed there. That was another good reason to start with only a single namespace; some mapping would be required in this case, but also in a multi-namespace case. So ... Q: Why haven't we sub-classed or aligned to existing vocabularies? A: We want to, but felt that this work would best be carried out in collaboration with the broader semantic web community: you. If anybody would like to propose such an alignment, we'd be happy to consider it.
- rNews work started in or around September 2010 via conference calls (20 people on the first call, including from AP, BBC, BW, EBU, Getty, IFRA, NYT, PA, TR and Xinhua).
- Work was begun to make the IPTC controlled vocabularies linkable and to align them with other sources of linked data (DBpedia).
- Hosting and formats for the data were discussed.
- An effort was begun to create an ontology for news, using NewsML-G2, the latest IPTC standard for the exchange of news, as a starting point.
- RDFa was chosen as the vehicle.
TinyMCE WYSIWYG Editor: an open source software project. The Ontos rNews solution is based on TinyMCE and adds rNews tagging.