News in JSON Activity
http://www.flickr.com/photos/jondresner/5789254800/
The ninjs Approach to ...
News in JSON
What ninjs is Not
• Not a restricted news data model
• Not XML in JSON
• Not RDF in JSON
© 2017 IPTC (www.iptc.org) All rights reserved 2
ninjs is comprehensive
http://groups.yahoo.com/neo/groups/iptc-news-in-json-dev
ninjs Data Model
http://dev.iptc.org/ninjs
Data Model ninjs
• The ninjs data model is more comprehensive than other
IPTC data models
– We selected a set of priority properties to represent
– NewsML-G2, NewsML 1, rNews, NITF
– We are ready to add more
• ninjs is a JSON representation of a news item
– Text, Photo, Graphic, Video, Audio, Package
– You can represent a complete item, with all properties
– Or you may want to convey key properties
– Associations are themselves ninjs documents
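Because associations are themselves ninjs documents, a consumer can walk an item recursively. The sketch below is only an illustration: it assumes associations live in an `associations` object keyed by an arbitrary name, and the `walk` helper, the `featured` key, the photo URI and the photo headline are all hypothetical.

```python
import json

# Hypothetical item: the "featured" association and its values are invented.
item = json.loads("""
{
  "uri": "http://ninjs.example.com/newsitems/20130709simp123",
  "type": "text",
  "headline": "Captain of wrecked cruise ship on trial in Italy",
  "associations": {
    "featured": {
      "uri": "http://ninjs.example.com/newsitems/photo456",
      "type": "picture",
      "headline": "Courtroom in Grosseto"
    }
  }
}
""")


def walk(doc, depth=0):
    """Yield (depth, uri, type) for an item and everything associated with it."""
    yield depth, doc["uri"], doc.get("type")
    # Assumption: each value under "associations" is itself a ninjs document.
    for child in doc.get("associations", {}).values():
        yield from walk(child, depth + 1)


found = list(walk(item))
for depth, uri, kind in found:
    print("  " * depth + f"{kind}: {uri}")
```

The same function handles any nesting depth, since every association is treated exactly like a top-level item.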
A Complete NINJS 1.1 Article
{
"uri" : "http://ninjs.example.com/newsitems/20130709simp123",
"type" : "text",
"versioncreated" : "2013-07-09T10:37:00Z",
"byline" : "Paulo Santalucia and Frances d'Emilio",
"headline" : "Captain of wrecked cruise ship on trial in Italy",
"body_text" : "GROSSETO, Italy (EP) -- The trial of the captain of the shipwrecked Costa Concordia cruise liner has begun in a theater converted into a courtroom …"
}
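A short sketch of how a consumer might read such an article, using only the Python standard library. The property names come from the example above; the variable names are our own, and the body text is left out to keep the sample short.

```python
import json
from datetime import datetime

raw = """
{
  "uri": "http://ninjs.example.com/newsitems/20130709simp123",
  "type": "text",
  "versioncreated": "2013-07-09T10:37:00Z",
  "byline": "Paulo Santalucia and Frances d'Emilio",
  "headline": "Captain of wrecked cruise ship on trial in Italy"
}
"""

# The whole item becomes an ordinary dict; no schema or binding layer is needed.
article = json.loads(raw)

# Swap the trailing "Z" for an explicit offset so older Python versions
# can parse the timestamp with fromisoformat.
created = datetime.fromisoformat(article["versioncreated"].replace("Z", "+00:00"))

print(article["headline"])
print(article["byline"])
print(created.isoformat())
```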
XML and RDF: Powerful Tools
• XML
– Namespaces
– XSLT
– XPath and XQuery
– Schema Validation
• RDF
– Object graphs
– Sets of triples
– Object lists
And Yet Developers Prefer JSON
• How to measure “preferences”?
• https://www.programmableweb.com/apis
– “Most Popular” 5/10 JSON only, 4/10 JSON+XML, 1 XML only
– JSON only: Facebook Graph, Google Maps, Twitter,
AccuWeather, Pinterest, Reddit, Foursquare
– XML and JSON: Google Cloud Storage, LinkedIn, Flickr
• Databases – trend towards JSON
– JSON only: MongoDB, CouchDB, Elasticsearch
– Added JSON: eXistDB, BaseX, MarkLogic, Oracle Database,
PostgreSQL
• For AP – the number one request is “can we get this in
JSON instead?”
Why JSON?
• Maps easily into modern programming data structures
– Feels “more natural” to developers
• No namespaces
– Biggest strength of XML and RDF
– Biggest headache for developers
• JSON ecosystem is improving (XML history repeating)
– Elasticsearch dominates
– Improved developer tools e.g. jq, XQuery support for JSON
• Many developers see JSON as simpler and better than
XML and may never have heard of RDF
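The namespace point can be illustrated by reading the same headline from XML and from JSON. This is a sketch with Python's standard library; the XML fragment and its namespace URI are invented for the example and are not taken from any IPTC standard.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical XML fragment with a default namespace (URI is made up).
xml_doc = """<item xmlns="http://example.com/ns/news">
  <headline>Captain of wrecked cruise ship on trial in Italy</headline>
</item>"""

json_doc = '{"headline": "Captain of wrecked cruise ship on trial in Italy"}'

root = ET.fromstring(xml_doc)

# A lookup by local name alone finds nothing, without raising an error.
missing = root.find("headline")

# The element is only found once the namespace is spelled out.
ns = {"n": "http://example.com/ns/news"}
xml_headline = root.find("n:headline", ns).text

# The JSON version maps straight onto a dict lookup.
json_headline = json.loads(json_doc)["headline"]

print(missing)
print(xml_headline == json_headline)
```

The namespace is what lets XML vocabularies be mixed safely, but it is also the step that newcomers tend to trip over.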
News in JSON Approach
• Create a JSON representation of news that feels natural
– Not XML or RDF mapped into JSON
– But JSON “hand crafted” from scratch
• Process – educate ourselves on JSON best practices
– Select the news feature to model in JSON
– Identify various representation alternatives in JSON
– Try them out with a variety of tools
– Pick the “best” one
• Goal - JSON developer would look at ninjs and
recognize it as a native implementation
Text Markup in JSON
• How to represent richly marked up text in JSON?
• A sweet spot for document-oriented XML
• Could be HTML, XHTML, NITF ...
• We experiment with two existing text markup examples
• NITF: http://www.iptc.org/std/NITF/3.2/examples/nitf-fishing.xml
• HTML: http://dev.iptc.org/Implementation-Guide-HTML-5-Microdata-in-IPTC-namespace
Text Markup Options in JSON
• Plain text, stripped of markup
• Preserved but escaped markup
– HTML: https://gist.github.com/anonymous/4996653
– XML: https://gist.github.com/anonymous/4996676
– See http://stackoverflow.com/questions/993970/what-do-i-need-to-escape-in-my-html-json-response
for a discussion of how to escape markup in JSON
• Mechanically create JSON structures to mimic the
original markup
– We used JSONML as an example http://www.jsonml.org/
– NITF : https://gist.github.com/anonymous/4996697
– HTML: https://gist.github.com/anonymous/4996720
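The three options can be compared side by side on one small fragment. This is a rough sketch, not the code behind the gists above: the markup sample is invented, the `body_xhtml` property name is an assumption, and the JSONML-style builder handles only well-nested tags (no void elements such as `<br>`).

```python
import json
from html.parser import HTMLParser

# Hypothetical marked-up fragment used for all three options.
markup = '<p class="lead">The trial has <em>begun</em> in a theater.</p>'


class PlainText(HTMLParser):
    """Option 1: keep character data only, dropping all tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


class JsonMLBuilder(HTMLParser):
    """Option 3: build a JSONML-style nested list [tag, {attrs}, children...]."""

    def __init__(self):
        super().__init__()
        self.root = []
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = [tag]
        if attrs:
            node.append(dict(attrs))
        self.stack[-1].append(node)
        self.stack.append(node)

    def handle_endtag(self, tag):
        # Assumes every start tag has a matching end tag.
        self.stack.pop()

    def handle_data(self, data):
        self.stack[-1].append(data)


# Option 1: plain text, stripped of markup.
plain = PlainText()
plain.feed(markup)
plain.close()
plain_text = "".join(plain.parts)

# Option 2: markup preserved; the JSON serializer escapes the quotes.
escaped = json.dumps({"body_xhtml": markup})

# Option 3: markup mechanically mirrored as JSON structures.
builder = JsonMLBuilder()
builder.feed(markup)
builder.close()
jsonml = builder.root[0]

print(plain_text)
print(escaped)
print(json.dumps(jsonml))
```

Options 1 and 2 need nothing beyond a JSON serializer on the consuming side, whereas option 3 only makes sense if both ends share the conversion rules.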
What We Learnt
• Both plain text (no markup) and escaped markup have
clear use cases
– Plain text can be useful for search, for example
– Escaped markup works well for direct display on a webpage
• Markup translated (like JSONML) works OK if you have
a library to implement the rules
– But what is the added benefit beyond just working directly with
XML or HTML?
– Who will write and maintain the libraries for every language?
• ninjs supports both plain and escaped text via pattern
properties
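A consumer can pick out whichever body variants an item carries by matching property names against a pattern. The sketch below assumes the pattern is "`body_` plus a suffix" and that an escaped-markup variant is named `body_xhtml`; check both against the published schema before relying on them.

```python
import re

# Assumed pattern for body properties; not copied from the ninjs schema.
BODY_PATTERN = re.compile(r"^body_[a-zA-Z0-9_]+$")

item = {
    "uri": "http://ninjs.example.com/newsitems/20130709simp123",
    "type": "text",
    "body_text": "GROSSETO, Italy (EP) -- The trial of the captain ...",
    # Hypothetical escaped-markup variant of the same text.
    "body_xhtml": "<p>GROSSETO, Italy (EP) -- The trial of the captain ...</p>",
}

# Keep only the properties whose names match the body pattern.
bodies = {name: value for name, value in item.items() if BODY_PATTERN.match(name)}

for name in sorted(bodies):
    print(name)
```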
Things We Considered But
Decided Against
• Translating from an existing XML standard into JSON
– Not all IPTC standards are XML
– Not all publishers use the same IPTC standards
– Not all publishers use any IPTC standards
• “Mechanically” translating from XML into JSON
– There are many libraries that can do this
– Different choices for how to represent certain XML features
– So each technique results in a slightly different JSON
– We felt that a more “natural” JSON would be more valuable
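To see why mechanical translation gives inconsistent results, consider one XML element converted under two plausible conventions. Both conventions here are illustrative only and do not reproduce the output of any particular library; the `lang` attribute is invented for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical element with one attribute and some text.
xml_doc = '<headline lang="en">Captain of wrecked cruise ship on trial in Italy</headline>'
elem = ET.fromstring(xml_doc)


def attrs_prefixed(e):
    """Convention A: attributes get an "@" prefix, text goes under "#text"."""
    body = {"@" + key: value for key, value in e.attrib.items()}
    body["#text"] = e.text
    return {e.tag: body}


def attrs_flattened(e):
    """Convention B: attributes keep their names, text goes under "value"."""
    return {e.tag: {**e.attrib, "value": e.text}}


a = attrs_prefixed(elem)
b = attrs_flattened(elem)

print(a)
print(b)
print(a == b)
```

Both results are valid JSON for the same input, yet a consumer written against one cannot read the other.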
Beyond JSON – Binary Formats
• IPTC started on ninjs in 2012 – five years ago!
– Developer interest is moving on…
• AP is looking at binary formats
– Row-based – e.g. Avro
– Columnar – e.g. ORC or Parquet
• Same issues
– Mechanically translate from other XML or (more likely) JSON?
– Or handcraft for most natural / best benefit?
– How to quickly become experts in the best practices?
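The difference between the two layouts can be sketched in plain Python, without any Avro, ORC or Parquet library. The records and the `to_columns` helper are hypothetical, and the helper assumes every record has the same fields.

```python
# Row-based layout: one complete record after another.
rows = [
    {"uri": "http://ninjs.example.com/newsitems/a1", "type": "text"},
    {"uri": "http://ninjs.example.com/newsitems/a2", "type": "picture"},
]


def to_columns(records):
    """Pivot a list of records into one list per field (columnar layout)."""
    columns = {}
    for record in records:
        for key, value in record.items():
            columns.setdefault(key, []).append(value)
    return columns


# Columnar layout: all values of one field stored together.
columns = to_columns(rows)

print(columns["type"])
```

Reading a single field across many items touches one list in the columnar layout, while reading one whole item is simpler in the row layout.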
News in JSON / ninjs
• IPTC already has a lot of overlapping standards
• Let’s try to avoid creating duplicate JSON standards too
• Let’s build on what we have
– For example, incorporating IKOS into ninjs
– Solve problems of co-branding (ninjs vs NewsML-G2 in JSON)
IPTC Approach to News in JSON

  • 1. News in JSON Activity: The ninjs Approach to ... News in JSON
      http://www.flickr.com/photos/jondresner/5789254800/
  • 2. What ninjs is Not
      • Not a restricted news data model
      • Not XML in JSON
      • Not RDF in JSON
      © 2017 IPTC (www.iptc.org) All rights reserved
  • 3. ninjs is comprehensive
      http://groups.yahoo.com/neo/groups/iptc-news-in-json-dev
  • 4. ninjs Data Model
      http://dev.iptc.org/ninjs
  • 5. Data Model ninjs
      • The ninjs data model is more comprehensive than other IPTC data models
          – We selected a set of priority properties to represent
          – NewsML-G2, NewsML 1, rNews, NITF
          – We are ready to add more
      • ninjs is a JSON representation of a news item
          – Text, Photo, Graphic, Video, Audio, Package
          – You can represent a complete item, with all properties
          – Or you may want to convey key properties
          – Associations are themselves ninjs documents
  • 6. A Complete NINJS 1.1 Article
      {
        "uri" : "http://ninjs.example.com/newsitems/20130709simp123",
        "type" : "text",
        "versioncreated" : "2013-07-09T10:37:00Z",
        "byline" : "Paulo Santalucia and Frances d'Emilio",
        "headline" : "Captain of wrecked cruise ship on trial in Italy",
        "body_text" : "GROSSETO, Italy (EP) -- The trial of the captain of the shipwrecked Costa Concordia cruise liner has begun in a theater converted into a courtroom …"
      }
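
As a rough illustration of how an item like the one on slide 6 is consumed, here is a minimal Python sketch using only the standard library. The body text is shortened for the example, and the variable names are illustrative rather than part of ninjs.

```python
import json
from datetime import datetime

# The ninjs item from the slide, with body_text shortened for this example.
NINJS_ITEM = """
{
  "uri": "http://ninjs.example.com/newsitems/20130709simp123",
  "type": "text",
  "versioncreated": "2013-07-09T10:37:00Z",
  "byline": "Paulo Santalucia and Frances d'Emilio",
  "headline": "Captain of wrecked cruise ship on trial in Italy",
  "body_text": "GROSSETO, Italy (EP) -- The trial of the captain ..."
}
"""

# Parsing yields a plain dictionary; no schema or binding layer is needed.
item = json.loads(NINJS_ITEM)

# versioncreated is an ISO 8601 timestamp. The trailing "Z" is rewritten as an
# explicit UTC offset so that fromisoformat also accepts it on older Python 3.
created = datetime.fromisoformat(item["versioncreated"].replace("Z", "+00:00"))

print(item["type"], "|", item["headline"])
print(created.isoformat())
```

Every property is reachable as an ordinary dictionary key, which is the "native" feel the deck is aiming for.
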
  • 7. XML and RDF: Powerful Tools
      • XML
          – Namespaces
          – XSLT
          – XPath and XQuery
          – Schema Validation
      • RDF
          – Object graphs
          – Sets of triples
          – Object lists
  • 8. And Yet Developers Prefer JSON
      • How to measure “preferences”?
      • https://www.programmableweb.com/apis
          – “Most Popular”: 5/10 JSON only, 4/10 JSON+XML, 1/10 XML only
          – JSON only: Facebook Graph, Google Maps, Twitter, AccuWeather, Pinterest, Reddit, Foursquare
          – XML and JSON: Google Cloud Storage, LinkedIn, Flickr
      • Databases: the trend is towards JSON
          – Only JSON: MongoDB, CouchDB, Elasticsearch
          – Added JSON: eXistDB, BaseX, MarkLogic, Oracle Database, PostgreSQL
      • For AP, the number one request is “can we get this in JSON instead?”
  • 9. Why JSON?
      • Maps easily into modern programming data structures
          – Feels “more natural” to developers
      • No namespaces
          – Namespaces are the biggest strength of XML and RDF
          – They are also the biggest headache for developers
      • JSON ecosystem is improving (XML history repeating)
          – Elasticsearch dominates
          – Improved developer tools, e.g. jq, XQuery support for JSON
      • Many developers see JSON as simpler and better than XML and may never have heard of RDF
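
The namespace point on slide 9 can be sketched with Python's standard library. The XML vocabulary and namespace URI below are made up for the example; they are not taken from any IPTC standard.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical namespaced XML and its JSON counterpart (names are made up).
XML_DOC = (
    '<n:item xmlns:n="http://example.com/news">'
    "<n:headline>Hello</n:headline>"
    "</n:item>"
)
JSON_DOC = '{"headline": "Hello"}'

root = ET.fromstring(XML_DOC)

# An unqualified lookup finds nothing, because the element lives in a namespace.
unqualified = root.find("headline")

# The lookup only succeeds once the prefix is mapped to the namespace URI.
qualified = root.find("n:headline", {"n": "http://example.com/news"})

# JSON has no namespaces: the property is a plain dictionary key.
headline = json.loads(JSON_DOC)["headline"]

print(unqualified, qualified.text, headline)
```

The trade-off cuts both ways: the JSON lookup is simpler, but nothing in the JSON document says which vocabulary `headline` belongs to.
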
  • 10. News in JSON Approach
      • Create a JSON representation of news that feels natural
          – Not XML or RDF mapped into JSON
          – But a “hand crafted” JSON built from scratch
      • Process
          – Educate ourselves on JSON best practices
          – Select the news feature to model in JSON
          – Identify various representation alternatives in JSON
          – Try them out with a variety of tools
          – Pick the “best” one
      • Goal: a JSON developer would look at ninjs and recognize it as a native implementation
  • 11. Text Markup in JSON
      • How to represent richly marked up text in JSON?
      • A sweet spot for document-oriented XML
      • Could be HTML, XHTML, NITF ...
      • We experimented with two existing text markup examples
      • NITF: http://www.iptc.org/std/NITF/3.2/examples/nitf-fishing.xml
      • HTML: http://dev.iptc.org/Implementation-Guide-HTML-5-Microdata-in-IPTC-namespace
  • 12. Text Markup Options in JSON
      • Plain text, stripped of markup
      • Preserved but escaped markup
          – HTML: https://gist.github.com/anonymous/4996653
          – XML: https://gist.github.com/anonymous/4996676
          – See http://stackoverflow.com/questions/993970/what-do-i-need-to-escape-in-my-html-json-response for a discussion of how to escape markup in JSON
      • Mechanically create JSON structures to mimic the original markup
          – We used JSONML as an example http://www.jsonml.org/
          – NITF: https://gist.github.com/anonymous/4996697
          – HTML: https://gist.github.com/anonymous/4996720
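
The three options on slide 12 can be compared side by side. The sketch below uses a made-up, well-formed fragment in place of the NITF and HTML examples, and `to_jsonml` is a simplified JSONML-style converter written for this illustration, not the reference implementation.

```python
import json
import xml.etree.ElementTree as ET

# A made-up fragment standing in for the NITF/HTML examples linked above.
MARKUP = '<p class="lead">The trial has <em>begun</em> in Grosseto.</p>'


def to_jsonml(elem):
    """Convert an element to a JSONML-style list: [tag, {attributes}, children...]."""
    node = [elem.tag]
    if elem.attrib:
        node.append(dict(elem.attrib))
    if elem.text:
        node.append(elem.text)
    for child in elem:
        node.append(to_jsonml(child))
        if child.tail:
            # Text that follows a child element belongs to the parent.
            node.append(child.tail)
    return node


root = ET.fromstring(MARKUP)

# Option 1: plain text, stripped of markup.
plain = "".join(root.itertext())

# Option 2: markup preserved as a string; json.dumps escapes the quotes.
escaped = json.dumps({"body": MARKUP})

# Option 3: JSON structures that mimic the original markup.
structured = json.dumps(to_jsonml(root))

print(plain)
print(escaped)
print(structured)
```

Options 1 and 2 need nothing beyond a JSON parser on the consuming side, whereas option 3 only pays off if the consumer has a matching library to turn the lists back into markup.
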
  • 13. What We Learnt
      • Both plain text (no markup) and escaped markup have clear use cases
          – Plain text can be useful for search, for example
          – Escaped markup works well for direct display on a webpage
      • Translated markup (like JSONML) works OK if you have a library to implement the rules
          – But what is the added benefit beyond just working directly with XML or HTML?
          – Who will write and maintain the libraries for every language?
      • ninjs supports both plain and escaped text via pattern properties
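
A consumer might handle the plain and escaped variants from slide 13 roughly as follows. This is a sketch under assumptions: the regular expression is modelled on the idea of pattern properties and is not copied from the ninjs schema, and the item content is invented for the example.

```python
import re

# Assumed pattern for body variants; consult the ninjs schema for the real one.
BODY_PATTERN = re.compile(r"^body_[a-zA-Z0-9_]+$")

# Invented item carrying the same body in two variants.
item = {
    "uri": "http://ninjs.example.com/newsitems/20130709simp123",
    "headline": "Captain of wrecked cruise ship on trial in Italy",
    "body_text": "The trial has begun in Grosseto.",
    "body_xhtml": "<p>The trial has <em>begun</em> in Grosseto.</p>",
}

# Collect every property whose name matches the pattern, keyed by its suffix.
bodies = {
    key[len("body_"):]: value
    for key, value in item.items()
    if BODY_PATTERN.match(key)
}

search_text = bodies["text"]    # plain text, e.g. for a search index
display_html = bodies["xhtml"]  # markup, e.g. for rendering on a page

print(sorted(bodies))
```

Because the variant is encoded in the property name, a consumer can pick the form it needs without inspecting the content.
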
  • 14. Things We Considered But Decided Against
      • Translating from an existing XML standard into JSON
          – Not all IPTC standards are XML
          – Not all publishers use the same IPTC standards
          – Not all publishers use any IPTC standards
      • “Mechanically” translating from XML into JSON
          – There are many libraries that can do this
          – Different choices for how to represent certain XML features
          – So each technique results in a slightly different JSON
          – We felt that a more “natural” JSON would be more valuable
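
To make the "slightly different JSON" point on slide 14 concrete, the sketch below converts one XML element using two simplified conventions and contrasts them with a hand-crafted form. Both conventions are written for this example and do not reproduce the output of any particular library; the XML is made up.

```python
import json
import xml.etree.ElementTree as ET

# Made-up XML; not taken from any IPTC standard.
XML_DOC = '<headline lang="en">Captain on trial</headline>'


def prefix_style(elem):
    """Convention A: attributes become "@name" keys, text goes under "#text"."""
    body = {"@" + name: value for name, value in elem.attrib.items()}
    if elem.text:
        body["#text"] = elem.text
    return {elem.tag: body}


def verbose_style(elem):
    """Convention B: each element is an explicit name/attributes/text record."""
    return {"name": elem.tag, "attributes": dict(elem.attrib), "text": elem.text or ""}


root = ET.fromstring(XML_DOC)

a = prefix_style(root)
b = verbose_style(root)

# A hand-crafted alternative: flat properties chosen for the JSON consumer.
handcrafted = {"headline": root.text, "language": root.get("lang")}

for result in (a, b, handcrafted):
    print(json.dumps(result))
```

All three carry the same information, but code written against one shape will not read the others, which is the interoperability cost of leaving the mapping to each converter.
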
  • 15. Beyond JSON: Binary Formats
      • IPTC started on ninjs in 2012, five years ago!
          – Developer interest is moving on…
      • AP is looking at binary formats
          – Row-based, e.g. Avro
          – Columnar, e.g. ORC or Parquet
      • Same issues
          – Mechanically translate from other XML or (more likely) JSON?
          – Or handcraft for most natural / best benefit?
          – How to quickly become experts in the best practices?
  • 16. News in JSON / ninjs
      • IPTC already has a lot of overlapping standards
      • Let’s try to avoid creating duplicate JSON standards too
      • Let’s build on what we have
          – For example, incorporating IKOS into ninjs
          – Solve problems of co-branding (ninjs vs NewsML-G2 in JSON)