SlideShare a Scribd company logo
1 of 3
Download to read offline
Alexander (‘Sasha’) Schwarzman, AGU Balisage 2011: The Markup Conference, Montréal, Canada Page 1 of 3
(sschwarzman@agu.org) August 2–5, 2011
Using Schematron for Appropriate Layer
Validation: a Case Study
Alexander (‘Sasha’) Schwarzman, AGU (sschwarzman@agu.org)
Balisage 2011: The Markup Conference, Montréal, Canada
August 2 – 5, 2011
Appropriate layer validation—advantages
 Even the most “Prussian” DTD cannot enforce all business rules, data types, and house style
 Rules-based checking needed anyway
 May use a “Californian” DTD, such as JATS: de facto industry standard adopted by publishers,
conversion and composition vendors, archives, etc.
 Can use tools developed for JATS: Preview XSLT stylesheets, EPUB conversion processes, etc.
Why Schematron?
 Multiple genres (document types)
 Journal article
 Book chapter
 Book
 Newspaper article
 Different lifecycle phases
 Papers in press (journal article)
 Initial validation (journal article, book chapter)
 Final validation (all genres)
Journal article
Alexander (‘Sasha’) Schwarzman, AGU Balisage 2011: The Markup Conference, Montréal, Canada Page 2 of 3
(sschwarzman@agu.org) August 2–5, 2011
Book chaper and book
Newspaper article
Alexander (‘Sasha’) Schwarzman, AGU Balisage 2011: The Markup Conference, Montréal, Canada Page 3 of 3
(sschwarzman@agu.org) August 2–5, 2011
AGU Schematrons
FBA IJA FJA SWN FBK EOM EOS PIP IBA
AGUcontribs.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
bibr-adhoc.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
bibr-ids.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
bibr-italics.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
bibr-structures.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
book-bookarticle.sch ✓
book-meta.sch ✓
bookarticle-meta-final.sch ✓ ✓
bookarticle-meta.sch ✓ ✓ ✓
common-back.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
common-final.sch ✓ ✓ ✓ ✓ ✓ ✓
common-meta.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
common.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
dates.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓
eos-only.sch ✓ ✓
filetypes.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
global.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
index-codes.sch ✓ ✓ ✓
index-terms.sch ✓ ✓ ✓ ✓ ✓ ✓
journalarticle-meta-final.sch ✓ ✓ ✓ ✓
journalarticle-meta.sch ✓ ✓ ✓ ✓ ✓ ✓
journalarticle-tech.sch ✓ ✓
mddb-ws.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
names.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
print-final.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
ref-misc.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
setup.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
 9 top-level Schematrons, 27 modules
 350+ requirements checked
 Perform Web Services-based verifications against relational metadata database
 Check data quality, markup integrity, business rules, data types, and house style
 Provide control over production processes
 Work best in oXygen (context-sensitive), can be compiled and integrated into pipeline scripts
Paradigm shift: validation focus moves from XML parser to Schematron engine
☛ Content may be valid to the DTD but make no sense
☛ Semantic integrity now depends on Schematron
☛ Should each Schematron release be preserved and the version info added to metadata?
☛ Constraints on business partners: they must be Schematron-capable and have tools
☛ Schematron does not “fix” problems—people do! Processes & procedures must be defined
How to build a good
 Elicit, document, convey, and clarify the Requirements
Schematron
 Ensure Schematron fits into your workflow
 Modularize Schematron
 Ensure that individual Schematron rules aren’t in conflict
 Optimize Schematron performance
 Employ XSLT 2.0
 Test, test, test
 Cultivate Schematron & XSLT 2.0 expertise in-house

More Related Content

Similar to 2011-Balisage-Poster-Schwarzman

Document Object Model
Document Object ModelDocument Object Model
Document Object Modelchomas kandar
 
Document Object Model
Document Object ModelDocument Object Model
Document Object Modelchomas kandar
 
Html and css easy steps
Html and css easy stepsHtml and css easy steps
Html and css easy stepsBiswa Ranjan
 
Building a Scalable XML-based Dynamic Delivery Architecture: Standards and Be...
Building a Scalable XML-based Dynamic Delivery Architecture: Standards and Be...Building a Scalable XML-based Dynamic Delivery Architecture: Standards and Be...
Building a Scalable XML-based Dynamic Delivery Architecture: Standards and Be...Jerry SILVER
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerDatabricks
 
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Zaloni
 
Extreme-ML-2006-Poster-A-Schwarzman
Extreme-ML-2006-Poster-A-SchwarzmanExtreme-ML-2006-Poster-A-Schwarzman
Extreme-ML-2006-Poster-A-Schwarzmanaschwarzman
 
Ruby on Rails: Coding Guideline
Ruby on Rails: Coding GuidelineRuby on Rails: Coding Guideline
Ruby on Rails: Coding GuidelineNascenia IT
 
Scalable And Incremental Data Profiling With Spark
Scalable And Incremental Data Profiling With SparkScalable And Incremental Data Profiling With Spark
Scalable And Incremental Data Profiling With SparkJen Aman
 
HTML5, CSS3 & Responsive Design
HTML5, CSS3 & Responsive DesignHTML5, CSS3 & Responsive Design
HTML5, CSS3 & Responsive DesignFawzia Essa
 
Deep dive into the native multi model database ArangoDB
Deep dive into the native multi model database ArangoDBDeep dive into the native multi model database ArangoDB
Deep dive into the native multi model database ArangoDBArangoDB Database
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Migration Best Practices - Search Y 2019, Paris
Migration Best Practices - Search Y 2019, ParisMigration Best Practices - Search Y 2019, Paris
Migration Best Practices - Search Y 2019, ParisBastian Grimm
 
Mapping Data Flows Training April 2021
Mapping Data Flows Training April 2021Mapping Data Flows Training April 2021
Mapping Data Flows Training April 2021Mark Kromer
 

Similar to 2011-Balisage-Poster-Schwarzman (20)

Document Object Model
Document Object ModelDocument Object Model
Document Object Model
 
Document Object Model
Document Object ModelDocument Object Model
Document Object Model
 
Html css
Html cssHtml css
Html css
 
Html and css easy steps
Html and css easy stepsHtml and css easy steps
Html and css easy steps
 
Xml Overview
Xml OverviewXml Overview
Xml Overview
 
Building a Scalable XML-based Dynamic Delivery Architecture: Standards and Be...
Building a Scalable XML-based Dynamic Delivery Architecture: Standards and Be...Building a Scalable XML-based Dynamic Delivery Architecture: Standards and Be...
Building a Scalable XML-based Dynamic Delivery Architecture: Standards and Be...
 
Sas training in hyderabad
Sas training in hyderabadSas training in hyderabad
Sas training in hyderabad
 
Markup For Dummies (Russ Ward)
Markup For Dummies (Russ Ward)Markup For Dummies (Russ Ward)
Markup For Dummies (Russ Ward)
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
 
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...
 
Extreme-ML-2006-Poster-A-Schwarzman
Extreme-ML-2006-Poster-A-SchwarzmanExtreme-ML-2006-Poster-A-Schwarzman
Extreme-ML-2006-Poster-A-Schwarzman
 
Ruby on Rails: Coding Guideline
Ruby on Rails: Coding GuidelineRuby on Rails: Coding Guideline
Ruby on Rails: Coding Guideline
 
Scalable And Incremental Data Profiling With Spark
Scalable And Incremental Data Profiling With SparkScalable And Incremental Data Profiling With Spark
Scalable And Incremental Data Profiling With Spark
 
HTML5, CSS3 & Responsive Design
HTML5, CSS3 & Responsive DesignHTML5, CSS3 & Responsive Design
HTML5, CSS3 & Responsive Design
 
Deep dive into the native multi model database ArangoDB
Deep dive into the native multi model database ArangoDBDeep dive into the native multi model database ArangoDB
Deep dive into the native multi model database ArangoDB
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Migration Best Practices - Search Y 2019, Paris
Migration Best Practices - Search Y 2019, ParisMigration Best Practices - Search Y 2019, Paris
Migration Best Practices - Search Y 2019, Paris
 
Mapping Data Flows Training April 2021
Mapping Data Flows Training April 2021Mapping Data Flows Training April 2021
Mapping Data Flows Training April 2021
 
Web technology Unit-II Part-C
Web technology Unit-II Part-CWeb technology Unit-II Part-C
Web technology Unit-II Part-C
 
Coding standards
Coding standards Coding standards
Coding standards
 

More from aschwarzman

2012-08-14-OSA-Pubs-IT_Presentation
2012-08-14-OSA-Pubs-IT_Presentation2012-08-14-OSA-Pubs-IT_Presentation
2012-08-14-OSA-Pubs-IT_Presentationaschwarzman
 
2012-05-20-CSE-2012_Schwarzman
2012-05-20-CSE-2012_Schwarzman2012-05-20-CSE-2012_Schwarzman
2012-05-20-CSE-2012_Schwarzmanaschwarzman
 
2012-03-20-AGU-Librarians_Presentation
2012-03-20-AGU-Librarians_Presentation2012-03-20-AGU-Librarians_Presentation
2012-03-20-AGU-Librarians_Presentationaschwarzman
 
2011-11-14-CrossRef-Workshops_Schwarzman
2011-11-14-CrossRef-Workshops_Schwarzman2011-11-14-CrossRef-Workshops_Schwarzman
2011-11-14-CrossRef-Workshops_Schwarzmanaschwarzman
 
2011-09-27-JATS-Con-Presentation_Schwarzman
2011-09-27-JATS-Con-Presentation_Schwarzman2011-09-27-JATS-Con-Presentation_Schwarzman
2011-09-27-JATS-Con-Presentation_Schwarzmanaschwarzman
 
Schwarzman-CSE2011
Schwarzman-CSE2011Schwarzman-CSE2011
Schwarzman-CSE2011aschwarzman
 
Schwarzman-JATS-Con-slides
Schwarzman-JATS-Con-slidesSchwarzman-JATS-Con-slides
Schwarzman-JATS-Con-slidesaschwarzman
 
XML2004-schwarzman
XML2004-schwarzmanXML2004-schwarzman
XML2004-schwarzmanaschwarzman
 
JATS-Con-Schwarzman-slides_corr-2016-04-29
JATS-Con-Schwarzman-slides_corr-2016-04-29JATS-Con-Schwarzman-slides_corr-2016-04-29
JATS-Con-Schwarzman-slides_corr-2016-04-29aschwarzman
 
Balisage_2011-08-03_Schwarzman
Balisage_2011-08-03_SchwarzmanBalisage_2011-08-03_Schwarzman
Balisage_2011-08-03_Schwarzmanaschwarzman
 
Balisage-2015-funding-poster
Balisage-2015-funding-posterBalisage-2015-funding-poster
Balisage-2015-funding-posteraschwarzman
 
Balisage-2015-sup-mat-poster
Balisage-2015-sup-mat-posterBalisage-2015-sup-mat-poster
Balisage-2015-sup-mat-posteraschwarzman
 
NISO-NFAIS Supplemental Journal Article Materials Working Group: An Update o...
NISO-NFAIS Supplemental Journal Article Materials Working Group: An Update o...NISO-NFAIS Supplemental Journal Article Materials Working Group: An Update o...
NISO-NFAIS Supplemental Journal Article Materials Working Group: An Update o...aschwarzman
 
NISO-NFAIS Supplemental Journal Article Materials Working Group
NISO-NFAIS Supplemental Journal Article Materials Working GroupNISO-NFAIS Supplemental Journal Article Materials Working Group
NISO-NFAIS Supplemental Journal Article Materials Working Groupaschwarzman
 

More from aschwarzman (17)

dineen2013
dineen2013dineen2013
dineen2013
 
XML-talk
XML-talkXML-talk
XML-talk
 
2012-08-14-OSA-Pubs-IT_Presentation
2012-08-14-OSA-Pubs-IT_Presentation2012-08-14-OSA-Pubs-IT_Presentation
2012-08-14-OSA-Pubs-IT_Presentation
 
2012-05-20-CSE-2012_Schwarzman
2012-05-20-CSE-2012_Schwarzman2012-05-20-CSE-2012_Schwarzman
2012-05-20-CSE-2012_Schwarzman
 
2012-03-20-AGU-Librarians_Presentation
2012-03-20-AGU-Librarians_Presentation2012-03-20-AGU-Librarians_Presentation
2012-03-20-AGU-Librarians_Presentation
 
2011-11-14-CrossRef-Workshops_Schwarzman
2011-11-14-CrossRef-Workshops_Schwarzman2011-11-14-CrossRef-Workshops_Schwarzman
2011-11-14-CrossRef-Workshops_Schwarzman
 
2011-09-27-JATS-Con-Presentation_Schwarzman
2011-09-27-JATS-Con-Presentation_Schwarzman2011-09-27-JATS-Con-Presentation_Schwarzman
2011-09-27-JATS-Con-Presentation_Schwarzman
 
Schwarzman-CSE2011
Schwarzman-CSE2011Schwarzman-CSE2011
Schwarzman-CSE2011
 
Schwarzman-JATS-Con-slides
Schwarzman-JATS-Con-slidesSchwarzman-JATS-Con-slides
Schwarzman-JATS-Con-slides
 
XML2004
XML2004XML2004
XML2004
 
XML2004-schwarzman
XML2004-schwarzmanXML2004-schwarzman
XML2004-schwarzman
 
JATS-Con-Schwarzman-slides_corr-2016-04-29
JATS-Con-Schwarzman-slides_corr-2016-04-29JATS-Con-Schwarzman-slides_corr-2016-04-29
JATS-Con-Schwarzman-slides_corr-2016-04-29
 
Balisage_2011-08-03_Schwarzman
Balisage_2011-08-03_SchwarzmanBalisage_2011-08-03_Schwarzman
Balisage_2011-08-03_Schwarzman
 
Balisage-2015-funding-poster
Balisage-2015-funding-posterBalisage-2015-funding-poster
Balisage-2015-funding-poster
 
Balisage-2015-sup-mat-poster
Balisage-2015-sup-mat-posterBalisage-2015-sup-mat-poster
Balisage-2015-sup-mat-poster
 
NISO-NFAIS Supplemental Journal Article Materials Working Group: An Update o...
NISO-NFAIS Supplemental Journal Article Materials Working Group: An Update o...NISO-NFAIS Supplemental Journal Article Materials Working Group: An Update o...
NISO-NFAIS Supplemental Journal Article Materials Working Group: An Update o...
 
NISO-NFAIS Supplemental Journal Article Materials Working Group
NISO-NFAIS Supplemental Journal Article Materials Working GroupNISO-NFAIS Supplemental Journal Article Materials Working Group
NISO-NFAIS Supplemental Journal Article Materials Working Group
 

2011-Balisage-Poster-Schwarzman

  • 1. Alexander (‘Sasha’) Schwarzman, AGU Balisage 2011: The Markup Conference, Montréal, Canada Page 1 of 3 (sschwarzman@agu.org) August 2–5, 2011 Using Schematron for Appropriate Layer Validation: a Case Study Alexander (‘Sasha’) Schwarzman, AGU (sschwarzman@agu.org) Balisage 2011: The Markup Conference, Montréal, Canada August 2 – 5, 2011 Appropriate layer validation—advantages  Even the most “Prussian” DTD cannot enforce all business rules, data types, and house style  Rules-based checking needed anyway  May use a “Californian” DTD, such as JATS: de facto industry standard adopted by publishers, conversion and composition vendors, archives, etc.  Can use tools developed for JATS: Preview XSLT stylesheets, EPUB conversion processes, etc. Why Schematron?  Multiple genres (document types)  Journal article  Book chapter  Book  Newspaper article  Different lifecycle phases  Papers in press (journal article)  Initial validation (journal article, book chapter)  Final validation (all genres) Journal article
  • 2. Alexander (‘Sasha’) Schwarzman, AGU Balisage 2011: The Markup Conference, Montréal, Canada Page 2 of 3 (sschwarzman@agu.org) August 2–5, 2011 Book chaper and book Newspaper article
  • 3. Alexander (‘Sasha’) Schwarzman, AGU Balisage 2011: The Markup Conference, Montréal, Canada Page 3 of 3 (sschwarzman@agu.org) August 2–5, 2011 AGU Schematrons FBA IJA FJA SWN FBK EOM EOS PIP IBA AGUcontribs.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ bibr-adhoc.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ bibr-ids.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ bibr-italics.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ bibr-structures.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ book-bookarticle.sch ✓ book-meta.sch ✓ bookarticle-meta-final.sch ✓ ✓ bookarticle-meta.sch ✓ ✓ ✓ common-back.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ common-final.sch ✓ ✓ ✓ ✓ ✓ ✓ common-meta.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ common.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ dates.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ eos-only.sch ✓ ✓ filetypes.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ global.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ index-codes.sch ✓ ✓ ✓ index-terms.sch ✓ ✓ ✓ ✓ ✓ ✓ journalarticle-meta-final.sch ✓ ✓ ✓ ✓ journalarticle-meta.sch ✓ ✓ ✓ ✓ ✓ ✓ journalarticle-tech.sch ✓ ✓ mddb-ws.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ names.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ print-final.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ref-misc.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ setup.sch ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓  9 top-level Schematrons, 27 modules  350+ requirements checked  Perform Web Services-based verifications against relational metadata database  Check data quality, markup integrity, business rules, data types, and house style  Provide control over production processes  Work best in oXygen (context-sensitive), can be compiled and integrated into pipeline scripts Paradigm shift: validation focus moves from XML parser to Schematron engine ☛ Content may be valid to the DTD but make no sense ☛ Semantic integrity now depends on Schematron ☛ Should each Schematron release be preserved and the version info added to metadata? ☛ Constraints on business partners: they must be Schematron-capable and have tools ☛ Schematron does not “fix” problems—people do! Processes & procedures must be defined How to build a good  Elicit, document, convey, and clarify the Requirements Schematron  Ensure Schematron fits into your workflow  Modularize Schematron  Ensure that individual Schematron rules aren’t in conflict  Optimize Schematron performance  Employ XSLT 2.0  Test, test, test  Cultivate Schematron & XSLT 2.0 expertise in-house