Introduction to
WordprocessingML
A high-level overview of the
structure of a word processing
document
Ecma/TC45/2006/010 (Rev.)
The ‘Document’
• A WordprocessingML document file is a
collection of multiple ‘subdocuments’,
formally called stories:
– The main story
– Header(s) / Footer(s)
– Footnote(s) / Endnote(s)
– Subdocuments
– Frame(s)
– Comment(s)
Shared Story Properties
• All stories* in a document share a common
set of properties:
– Style information
– Numbering definitions
– Font information
– Document settings
*with one exception, which we’ll discuss later
Style Information
• A style defines a specific set of formatting
properties
– For example, the Normal style in Word 2003 is
defined as:
• Font = Times New Roman
• Font Size = 12 point
• Font Language = Language of Word (English (US) for me)
• Justification = Left
• Line Spacing = Single
Style Types
• Word supports six different types of styles:
– Paragraph styles
– Character styles
– Linked styles (paragraph + character)
– Table styles
– Numbering styles
– Default paragraph and character properties
Style Cascading/Inheritance
• Multiple style ‘types’ can be applied to the same part
of a file, so properties are applied in a specific order.
• The properties set by one type can be removed or
supplemented by following types.
• As well, styles of any given type can inherit from
other styles of that type.
– e.g. The Heading 1 paragraph style inherits properties
from the Normal paragraph style
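The inheritance rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `STYLES` table, the property names, and the Heading 1 values are hypothetical, and only the Normal font and size come from the earlier slide.

```python
# Hypothetical, simplified style table: style ID -> (parent style ID, own properties).
# Property names are illustrative and are not WordprocessingML element names.
STYLES = {
    "Normal": (None, {"font": "Times New Roman", "size_pt": 12, "bold": False}),
    "Heading1": ("Normal", {"size_pt": 16, "bold": True}),  # assumed values
}


def resolve_style(style_id, styles):
    """Merge properties from the root ancestor down to the requested style."""
    chain = []
    seen = set()
    while style_id is not None:
        if style_id in seen:
            raise ValueError(f"inheritance cycle at {style_id!r}")
        seen.add(style_id)
        parent_id, props = styles[style_id]
        chain.append(props)
        style_id = parent_id

    resolved = {}
    for props in reversed(chain):  # ancestors first, so children override them
        resolved.update(props)
    return resolved


heading = resolve_style("Heading1", STYLES)
print(heading["font"], heading["size_pt"], heading["bold"])
```

Heading1 never names a font, so the font comes from Normal, while its own size and bold settings replace the inherited ones.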
Style Application
[Figure: chart of the application order of the property layers (Document Defaults, Table, Numbering, Paragraph, Character, Direct Formatting) for each content type (Table, Characters, Paragraph, List Item)]
Style Example
• Styles are then applied to text via the style’s
ID
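A rough sketch of what a style reference by ID might look like, parsed with Python's standard library. The namespace URI is the one used by the published standard and is assumed here; the fragments themselves are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI (the one used by the published standard).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

# Illustrative fragments; in a real package these would live in separate parts.
STYLES_XML = f"""
<w:styles xmlns:w="{W}">
  <w:style w:type="paragraph" w:styleId="Heading1">
    <w:name w:val="heading 1"/>
    <w:basedOn w:val="Normal"/>
  </w:style>
</w:styles>
"""
PARAGRAPH_XML = f"""
<w:p xmlns:w="{W}">
  <w:pPr><w:pStyle w:val="Heading1"/></w:pPr>
  <w:r><w:t>Chapter One</w:t></w:r>
</w:p>
"""


def attr(name):
    """Build the fully qualified name of a w: attribute."""
    return f"{{{W}}}{name}"


styles = ET.fromstring(STYLES_XML)
paragraph = ET.fromstring(PARAGRAPH_XML)

# The paragraph names the style by ID...
style_id = paragraph.find("w:pPr/w:pStyle", NS).get(attr("val"))
# ...and the ID selects the definition in the style information.
by_id = {s.get(attr("styleId")): s for s in styles.findall("w:style", NS)}
style_name = by_id[style_id].find("w:name", NS).get(attr("val"))
print(style_id, "->", style_name)
```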
Numbering Definitions
• A numbering definition consists of nine levels,
each of which has formatting properties
– Paragraph properties (e.g. margins)
– Number properties (e.g. number text,
justification, character formatting, etc.)
• A numbered paragraph is specified in two
parts:
– The numbering definition instance
– The numbering level
Abstract Numbering Definition
• The abstract numbering definition specifies
the properties for any or all of the nine levels
in the list
• A numbering definition instance specifies the
properties for a specific numbering definition
by inheritance:
– References an abstract list definition
– Provides overrides for zero or more levels in the
numbering definition
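The relationship between an abstract definition and its instances can be modeled as a lookup followed by an override. The sketch below is a simplified model, not the file format itself: the dictionaries, IDs, and property names are hypothetical.

```python
# Hypothetical abstract numbering definition 0: nine levels, all decimal.
ABSTRACT_NUMS = {
    0: {level: {"format": "decimal", "text": f"%{level + 1}."} for level in range(9)},
}

# Hypothetical instances: each references an abstract definition and may
# override zero or more of its levels.
NUM_INSTANCES = {
    1: {"abstract_id": 0, "overrides": {}},
    2: {"abstract_id": 0, "overrides": {0: {"format": "upperRoman"}}},
}


def resolve_level(num_id, level):
    """Return the effective properties of one level of a numbering instance."""
    instance = NUM_INSTANCES[num_id]
    props = dict(ABSTRACT_NUMS[instance["abstract_id"]][level])  # inherited copy
    props.update(instance["overrides"].get(level, {}))           # instance overrides
    return props


print(resolve_level(1, 0))
print(resolve_level(2, 0))
```

Instance 2 changes only the format of its first level; every other level and property still comes from the shared abstract definition.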
Numbering Example
• Paragraphs are associated with a particular
numbering definition instance and level.
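A numbered paragraph might carry its two-part reference roughly as follows. The element names follow the published standard and the namespace URI is assumed; the paragraph content is invented for the example.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI (the one used by the published standard).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

PARAGRAPH_XML = f"""
<w:p xmlns:w="{W}">
  <w:pPr>
    <w:numPr>
      <w:ilvl w:val="0"/>
      <w:numId w:val="1"/>
    </w:numPr>
  </w:pPr>
  <w:r><w:t>First item</w:t></w:r>
</w:p>
"""


def numbering_reference(paragraph):
    """Return (numbering instance ID, level), or None for an unnumbered paragraph."""
    num_pr = paragraph.find("w:pPr/w:numPr", NS)
    if num_pr is None:
        return None
    val = f"{{{W}}}val"
    num_id = int(num_pr.find("w:numId", NS).get(val))
    level = int(num_pr.find("w:ilvl", NS).get(val))
    return num_id, level


print(numbering_reference(ET.fromstring(PARAGRAPH_XML)))
```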
Font Information
• The font information stores two distinct
pieces of information:
– Embedded fonts (when the producer chooses to
embed them)
– Font type data
• The latter provides characteristics of the font
which are used to find a suitable replacement
when the specified font is unavailable
Document Settings
• All settings pertinent to the document are stored in
separate parts
• These settings can be divided into two groups:
– Those which affect presentation
• Web settings (e.g. HTML <DIV> and <FRAMESET> data)
• Compatibility options
– ‘Pure’ settings
• View, zoom state
• Defaults
• User preferences (i.e. ‘don’t ask me this again’)
Story Content
• Within each story is the actual content, which
consists of what are formally called block level
structures:
– Paragraphs
– Tables
– Custom Markup (structured document tags,
custom XML)
– Range Permissions
Story Content
• Within each paragraph are what are formally
called inline structures:
– Runs
– Custom Markup (structured document tags,
custom XML)
– Annotations (comments, tracked changes,
bookmarks)
– DrawingML elements
– Fields
– Hyperlinks
Basic Structural Rules
• All text in a word processing document is
contained within runs
– A run is a region of text with a common set of
properties
• All runs must be contained within a paragraph
– A paragraph is a collection of one or more runs
that is displayed as a unit (analogous to the HTML
<P> tag)
Example
• A basic paragraph with three different text
formats:
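A paragraph of this kind could look like the fragment below: one run per format, with the formatting held in each run's properties. This is an illustrative reconstruction, not the example from the original slide, and the namespace URI is assumed.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI (the one used by the published standard).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

PARAGRAPH_XML = f"""
<w:p xmlns:w="{W}">
  <w:r><w:t xml:space="preserve">Plain, </w:t></w:r>
  <w:r><w:rPr><w:b/></w:rPr><w:t xml:space="preserve">bold, </w:t></w:r>
  <w:r><w:rPr><w:i/></w:rPr><w:t>italic</w:t></w:r>
</w:p>
"""


def describe_runs(paragraph):
    """List (text, [property element names]) for every run in a paragraph."""
    described = []
    for run in paragraph.findall("w:r", NS):
        text = "".join(t.text or "" for t in run.findall("w:t", NS))
        rpr = run.find("w:rPr", NS)
        props = [] if rpr is None else [child.tag.split("}")[1] for child in rpr]
        described.append((text, props))
    return described


for text, props in describe_runs(ET.fromstring(PARAGRAPH_XML)):
    print(repr(text), props)
```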
Basic Structural Rules
• A paragraph may itself be at any location
which allows block level content:
– At the top-most level within a story (e.g. header,
footer, main document)
– Nested within a table cell
– Nested within a structured document tag or
annotation markers
Tables
• Similar to HTML tables, a Word table consists
of the table, its properties, rows, and cells.
[Figure: a table with its Properties, a Row, and a Cell labeled]
Tables
• Individual table cells can themselves contain
any block level content
– This means that tables can be nested to an
arbitrary depth
[Figure: a nested table]
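Because a cell holds block level content, a table can appear directly inside a cell. The sketch below measures nesting depth on a made-up fragment; it assumes the namespace URI of the published standard and only looks for tables that are direct children of a cell.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI (the one used by the published standard).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

TABLE_XML = f"""
<w:tbl xmlns:w="{W}">
  <w:tblPr/>
  <w:tr>
    <w:tc>
      <w:p><w:r><w:t>Outer cell</w:t></w:r></w:p>
    </w:tc>
    <w:tc>
      <w:tbl>
        <w:tblPr/>
        <w:tr>
          <w:tc><w:p><w:r><w:t>Inner cell</w:t></w:r></w:p></w:tc>
        </w:tr>
      </w:tbl>
      <w:p/>
    </w:tc>
  </w:tr>
</w:tbl>
"""


def table_depth(tbl):
    """Return 1 for a flat table, 2 if a cell contains a table, and so on."""
    nested = [table_depth(inner) for inner in tbl.findall("w:tr/w:tc/w:tbl", NS)]
    return 1 + max(nested, default=0)


print(table_depth(ET.fromstring(TABLE_XML)))
```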
Custom Markup
• Custom markup can be applied within the
contents of any story in a document
• These tags can take one of three forms:
– Smart tags
– Custom XML markup
– Structured document tags
Custom Defined XML
• A facility for embedding arbitrary user XML
within the document at either block or inline
levels
Structured Document Tags
• Provide granular semantics at either the block
or inline levels
– e.g. region can/cannot be edited; region
can/cannot be deleted; region should show a date
picker/drop-down list/textbox
– Do not affect layout
• Similar to custom XML, but without the XML
schema semantics, and with presentation data
and more granular properties
Sections
• Sections in a word processing document
specify:
– Page properties
• Page size
• Page orientation
• Margins
– Header/footer references
– Footnote/endnote properties
– Column properties
Sections
• Sections specify (cont'd):
– Line numbering
– Text direction (RTL vs. LTR; top-to-bottom vs.
bottom-to-top)
Sections
• Four types of sections:
– Continuous
– Next page (start on next page)
– Even (start on next even page)
– Odd (start on next odd page)
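A section's properties might be read as in the sketch below. The element names, the type values, and the page-size unit (twentieths of a point, so 1440 per inch) follow the published standard as assumed here; the fragment itself is invented.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI (the one used by the published standard).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

SECTION_XML = f"""
<w:sectPr xmlns:w="{W}">
  <w:type w:val="oddPage"/>
  <w:pgSz w:w="12240" w:h="15840"/>
</w:sectPr>
"""

# The four section types from the slide, keyed by their assumed attribute values.
SECTION_TYPES = {
    "continuous": "Continuous",
    "nextPage": "Next page (start on next page)",
    "evenPage": "Even (start on next even page)",
    "oddPage": "Odd (start on next odd page)",
}

TWIPS_PER_INCH = 1440  # 20 twips per point, 72 points per inch


def describe_section(sect_pr):
    """Return (section type label or None, page width in inches, page height in inches)."""
    def val(element, name):
        return element.get(f"{{{W}}}{name}")

    type_el = sect_pr.find("w:type", NS)
    kind = None if type_el is None else SECTION_TYPES.get(val(type_el, "val"))
    size = sect_pr.find("w:pgSz", NS)
    width_in = int(val(size, "w")) / TWIPS_PER_INCH
    height_in = int(val(size, "h")) / TWIPS_PER_INCH
    return kind, width_in, height_in


print(describe_section(ET.fromstring(SECTION_XML)))
```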
Annotations
• Annotations in a word processing document
store markup information:
– Tracked revisions (insertion, deletion, move)
– Comments
– Bookmarks
Annotations
• Annotation markup can be represented in
three states:
1 – Inline
Annotations
2 – ‘Non-wellformed’
• The markup does not encapsulate the content –
there is a start and end marker.
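A bookmark that spans two paragraphs shows why separate start and end markers are needed: no single element could wrap the range without breaking the paragraph structure. The fragment is illustrative, with element names taken from the published standard and the namespace URI assumed.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI (the one used by the published standard).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
START = f"{{{W}}}bookmarkStart"
END = f"{{{W}}}bookmarkEnd"
TEXT = f"{{{W}}}t"
ID = f"{{{W}}}id"

BODY_XML = f"""
<w:body xmlns:w="{W}">
  <w:p>
    <w:r><w:t>Before </w:t></w:r>
    <w:bookmarkStart w:id="0" w:name="example"/>
    <w:r><w:t>inside, first paragraph</w:t></w:r>
  </w:p>
  <w:p>
    <w:r><w:t>inside, second paragraph</w:t></w:r>
    <w:bookmarkEnd w:id="0"/>
    <w:r><w:t> after</w:t></w:r>
  </w:p>
</w:body>
"""


def text_in_bookmark(body, bookmark_id):
    """Collect text between matching start and end markers, in document order."""
    inside = False
    collected = []
    for elem in body.iter():
        if elem.tag == START and elem.get(ID) == bookmark_id:
            inside = True
        elif elem.tag == END and elem.get(ID) == bookmark_id:
            inside = False
        elif inside and elem.tag == TEXT:
            collected.append(elem.text or "")
    return collected


print(text_in_bookmark(ET.fromstring(BODY_XML), "0"))
```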
Annotations
3 – Property
• The deletion of a paragraph mark is in the
paragraph’s property set
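In the property state, the revision is recorded on the properties rather than around content. A sketch of a deleted paragraph mark follows; the element placement follows the published standard as assumed here, and the author and date values are placeholders.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI (the one used by the published standard).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

# Placeholder author and date; the w:del sits in the paragraph mark's run properties.
PARAGRAPH_XML = f"""
<w:p xmlns:w="{W}">
  <w:pPr>
    <w:rPr>
      <w:del w:id="1" w:author="Example Author" w:date="2006-01-01T00:00:00Z"/>
    </w:rPr>
  </w:pPr>
  <w:r><w:t>This paragraph was merged with the next one.</w:t></w:r>
</w:p>
"""


def paragraph_mark_deleted(paragraph):
    """True if the paragraph's property set records deletion of its mark."""
    return paragraph.find("w:pPr/w:rPr/w:del", NS) is not None


print(paragraph_mark_deleted(ET.fromstring(PARAGRAPH_XML)))
```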
Headers/Footers
• There are three types of headers and footers
in Word:
– Odd page header
– Even page header (optional)
– First page header (optional)
• If one of the optional types is not specified,
the odd page header is used
Headers/Footers
• Headers and footers are stored in separate
parts, one per header or footer
• Each section refers to its header(s)/footer(s)
by an explicit relationship reference:
Headers/Footers
• The type is declared in the header/footer part:
Footnotes/Endnotes
• All footnotes are stored in a single part
– Same applies to all endnotes
• Footnote references are positioned by a
special tag in run content, which specifies the
footnote to reference:
Footnotes/Endnotes
• Within the footnotes part, the actual footnote
story content is found via the ID:
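The two halves of a footnote, the reference in run content and the story in the footnotes part, can be joined on the ID roughly as follows. Both fragments are invented for illustration and the namespace URI is assumed.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URI (the one used by the published standard).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}
ID = f"{{{W}}}id"

# A run in the main story that positions the footnote reference.
RUN_XML = f'<w:r xmlns:w="{W}"><w:footnoteReference w:id="2"/></w:r>'

# The separate footnotes part, holding the story content for every footnote.
FOOTNOTES_XML = f"""
<w:footnotes xmlns:w="{W}">
  <w:footnote w:id="2">
    <w:p><w:r><w:t>See the specification.</w:t></w:r></w:p>
  </w:footnote>
</w:footnotes>
"""


def footnote_text(run, footnotes):
    """Follow a run's footnote reference to the text of the referenced footnote."""
    reference = run.find("w:footnoteReference", NS)
    if reference is None:
        return None
    wanted = reference.get(ID)
    for footnote in footnotes.findall("w:footnote", NS):
        if footnote.get(ID) == wanted:
            return "".join(t.text or "" for t in footnote.iter(f"{{{W}}}t"))
    return None


print(footnote_text(ET.fromstring(RUN_XML), ET.fromstring(FOOTNOTES_XML)))
```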
Glossary Document
• Remember that exception to the ‘all stories
share the same data’ rule?
• The glossary document is a completely distinct
main story
– Specifies its own styles, lists, fonts, settings
• This story is used to store document
fragments which may be inserted at a later
time
File Format Types
• Template (DOTX) – classic “DOT”
• Document (DOCX) – classic “DOC”
• Both utilize the same file format –
differentiation is a function of the main
content type and file extension only
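Telling a template from a document by content type could be sketched as below. The content type strings and the content-types namespace are quoted from the published standard as best recalled and should be treated as assumptions; the part name is a common convention, not a requirement.

```python
import xml.etree.ElementTree as ET

# Assumed values from the published standard.
CT = "http://schemas.openxmlformats.org/package/2006/content-types"
PREFIX = "application/vnd.openxmlformats-officedocument.wordprocessingml."
MAIN_TYPES = {
    PREFIX + "document.main+xml": "document",
    PREFIX + "template.main+xml": "template",
}

# Illustrative content-types stream for a template package.
CONTENT_TYPES_XML = f"""
<Types xmlns="{CT}">
  <Default Extension="xml" ContentType="application/xml"/>
  <Override PartName="/word/document.xml"
            ContentType="{PREFIX}template.main+xml"/>
</Types>
"""


def classify(types_root):
    """Return 'document' or 'template' based on the main part's content type."""
    for override in types_root.findall(f"{{{CT}}}Override"):
        kind = MAIN_TYPES.get(override.get("ContentType"))
        if kind is not None:
            return kind
    return None


print(classify(ET.fromstring(CONTENT_TYPES_XML)))
```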
Disclaimer
This presentation is for informational purposes only, and should
not be relied upon as a substitute or replacement for Microsoft
formal file format documentation, which is available at the
following website:
https://msdn.microsoft.com/en-us/library/cc313118(v=office.12).aspx. Any views or opinions
presented in this material are solely those of the author and do
not necessarily represent those of Microsoft. Microsoft
disclaims all liability for mistakes or inaccuracies in this
presentation.

More Related Content

What's hot

What's hot (7)

PresentationML Subject: Comments
PresentationML Subject: CommentsPresentationML Subject: Comments
PresentationML Subject: Comments
 
Css
CssCss
Css
 
12 wordprocessing ml subject - annotations
12   wordprocessing ml subject - annotations12   wordprocessing ml subject - annotations
12 wordprocessing ml subject - annotations
 
Mdst 3559-02-01-html
Mdst 3559-02-01-htmlMdst 3559-02-01-html
Mdst 3559-02-01-html
 
XML
XMLXML
XML
 
Xml
XmlXml
Xml
 
Introduction to CSS
Introduction to CSSIntroduction to CSS
Introduction to CSS
 

Viewers also liked

Spreadsheet ml subject pivottable
Spreadsheet ml subject   pivottableSpreadsheet ml subject   pivottable
Spreadsheet ml subject pivottableShawn Villaron
 
Spreadsheet ml subject xml-mapping
Spreadsheet ml subject   xml-mappingSpreadsheet ml subject   xml-mapping
Spreadsheet ml subject xml-mappingShawn Villaron
 
Spreadsheet ml subject shared workbooks
Spreadsheet ml subject   shared workbooksSpreadsheet ml subject   shared workbooks
Spreadsheet ml subject shared workbooksShawn Villaron
 
Spreadsheet ml subject metadata
Spreadsheet ml subject   metadataSpreadsheet ml subject   metadata
Spreadsheet ml subject metadataShawn Villaron
 
Spreadsheet ml subject volatile dependencies
Spreadsheet ml subject   volatile dependenciesSpreadsheet ml subject   volatile dependencies
Spreadsheet ml subject volatile dependenciesShawn Villaron
 
DrawingML Introduction
DrawingML IntroductionDrawingML Introduction
DrawingML IntroductionShawn Villaron
 
DrawingML Subject: Text
DrawingML Subject: TextDrawingML Subject: Text
DrawingML Subject: TextShawn Villaron
 
Spreadsheet ml subject sheet
Spreadsheet ml subject   sheetSpreadsheet ml subject   sheet
Spreadsheet ml subject sheetShawn Villaron
 
Spreadsheet ml subject query table
Spreadsheet ml subject   query tableSpreadsheet ml subject   query table
Spreadsheet ml subject query tableShawn Villaron
 
PresentationML Subject: Presentation
PresentationML Subject: PresentationPresentationML Subject: Presentation
PresentationML Subject: PresentationShawn Villaron
 
DrawingML Subject: Tables
DrawingML Subject: TablesDrawingML Subject: Tables
DrawingML Subject: TablesShawn Villaron
 
PresentationML Subject: Slides
PresentationML Subject: SlidesPresentationML Subject: Slides
PresentationML Subject: SlidesShawn Villaron
 
Spreadsheet ml overview
Spreadsheet ml overviewSpreadsheet ml overview
Spreadsheet ml overviewShawn Villaron
 

Viewers also liked (13)

Spreadsheet ml subject pivottable
Spreadsheet ml subject   pivottableSpreadsheet ml subject   pivottable
Spreadsheet ml subject pivottable
 
Spreadsheet ml subject xml-mapping
Spreadsheet ml subject   xml-mappingSpreadsheet ml subject   xml-mapping
Spreadsheet ml subject xml-mapping
 
Spreadsheet ml subject shared workbooks
Spreadsheet ml subject   shared workbooksSpreadsheet ml subject   shared workbooks
Spreadsheet ml subject shared workbooks
 
Spreadsheet ml subject metadata
Spreadsheet ml subject   metadataSpreadsheet ml subject   metadata
Spreadsheet ml subject metadata
 
Spreadsheet ml subject volatile dependencies
Spreadsheet ml subject   volatile dependenciesSpreadsheet ml subject   volatile dependencies
Spreadsheet ml subject volatile dependencies
 
DrawingML Introduction
DrawingML IntroductionDrawingML Introduction
DrawingML Introduction
 
DrawingML Subject: Text
DrawingML Subject: TextDrawingML Subject: Text
DrawingML Subject: Text
 
Spreadsheet ml subject sheet
Spreadsheet ml subject   sheetSpreadsheet ml subject   sheet
Spreadsheet ml subject sheet
 
Spreadsheet ml subject query table
Spreadsheet ml subject   query tableSpreadsheet ml subject   query table
Spreadsheet ml subject query table
 
PresentationML Subject: Presentation
PresentationML Subject: PresentationPresentationML Subject: Presentation
PresentationML Subject: Presentation
 
DrawingML Subject: Tables
DrawingML Subject: TablesDrawingML Subject: Tables
DrawingML Subject: Tables
 
PresentationML Subject: Slides
PresentationML Subject: SlidesPresentationML Subject: Slides
PresentationML Subject: Slides
 
Spreadsheet ml overview
Spreadsheet ml overviewSpreadsheet ml overview
Spreadsheet ml overview
 

Similar to 0 wordprocessing ml overview

1 wordprocessing ml subject - main document
1   wordprocessing ml subject - main document1   wordprocessing ml subject - main document
1 wordprocessing ml subject - main documentShawn Villaron
 
11 wordprocessing ml subject - glossary document
11   wordprocessing ml subject - glossary document11   wordprocessing ml subject - glossary document
11 wordprocessing ml subject - glossary documentShawn Villaron
 
5 wordprocessing ml subject - sections
5   wordprocessing ml subject - sections5   wordprocessing ml subject - sections
5 wordprocessing ml subject - sectionsShawn Villaron
 
6 wordprocessing ml subject - styles
6   wordprocessing ml subject - styles6   wordprocessing ml subject - styles
6 wordprocessing ml subject - stylesShawn Villaron
 
2 wordprocessing ml subject - paragraphs and rich formatting
2   wordprocessing ml subject - paragraphs and rich formatting2   wordprocessing ml subject - paragraphs and rich formatting
2 wordprocessing ml subject - paragraphs and rich formattingShawn Villaron
 
XML-Extensible Markup Language
XML-Extensible Markup Language XML-Extensible Markup Language
XML-Extensible Markup Language Ann Joseph
 
Publishing xml
Publishing xmlPublishing xml
Publishing xmlKumar
 
7 wordprocessing ml subject - fonts
7   wordprocessing ml subject - fonts7   wordprocessing ml subject - fonts
7 wordprocessing ml subject - fontsShawn Villaron
 
Xml basics
Xml basicsXml basics
Xml basicsKumar
 
M.FLORENCE DAYANA WEB DESIGN -Unit 5 XML
M.FLORENCE DAYANA WEB DESIGN -Unit 5   XMLM.FLORENCE DAYANA WEB DESIGN -Unit 5   XML
M.FLORENCE DAYANA WEB DESIGN -Unit 5 XMLDr.Florence Dayana
 
overview of database concept
overview of database conceptoverview of database concept
overview of database conceptgourav kottawar
 

Similar to 0 wordprocessing ml overview (20)

1 wordprocessing ml subject - main document
1   wordprocessing ml subject - main document1   wordprocessing ml subject - main document
1 wordprocessing ml subject - main document
 
11 wordprocessing ml subject - glossary document
11   wordprocessing ml subject - glossary document11   wordprocessing ml subject - glossary document
11 wordprocessing ml subject - glossary document
 
5 wordprocessing ml subject - sections
5   wordprocessing ml subject - sections5   wordprocessing ml subject - sections
5 wordprocessing ml subject - sections
 
6 wordprocessing ml subject - styles
6   wordprocessing ml subject - styles6   wordprocessing ml subject - styles
6 wordprocessing ml subject - styles
 
2 wordprocessing ml subject - paragraphs and rich formatting
2   wordprocessing ml subject - paragraphs and rich formatting2   wordprocessing ml subject - paragraphs and rich formatting
2 wordprocessing ml subject - paragraphs and rich formatting
 
Ch2 neworder
Ch2 neworderCh2 neworder
Ch2 neworder
 
XML-Extensible Markup Language
XML-Extensible Markup Language XML-Extensible Markup Language
XML-Extensible Markup Language
 
1 xml fundamentals
1 xml fundamentals1 xml fundamentals
1 xml fundamentals
 
Publishing xml
Publishing xmlPublishing xml
Publishing xml
 
7 wordprocessing ml subject - fonts
7   wordprocessing ml subject - fonts7   wordprocessing ml subject - fonts
7 wordprocessing ml subject - fonts
 
Xml basics
Xml basicsXml basics
Xml basics
 
Intro xml
Intro xmlIntro xml
Intro xml
 
Xml
XmlXml
Xml
 
M.FLORENCE DAYANA WEB DESIGN -Unit 5 XML
M.FLORENCE DAYANA WEB DESIGN -Unit 5   XMLM.FLORENCE DAYANA WEB DESIGN -Unit 5   XML
M.FLORENCE DAYANA WEB DESIGN -Unit 5 XML
 
Html
HtmlHtml
Html
 
overview of database concept
overview of database conceptoverview of database concept
overview of database concept
 
Xml iet 2015
Xml iet 2015Xml iet 2015
Xml iet 2015
 
Introduction to XML.ppt
Introduction to XML.pptIntroduction to XML.ppt
Introduction to XML.ppt
 
Introduction to XML.ppt
Introduction to XML.pptIntroduction to XML.ppt
Introduction to XML.ppt
 
WT UNIT-2 XML.pdf
WT UNIT-2 XML.pdfWT UNIT-2 XML.pdf
WT UNIT-2 XML.pdf
 

More from Shawn Villaron

Spreadsheet ml subject external links
Spreadsheet ml subject   external linksSpreadsheet ml subject   external links
Spreadsheet ml subject external linksShawn Villaron
 
Spreadsheet ml subject comments
Spreadsheet ml subject   commentsSpreadsheet ml subject   comments
Spreadsheet ml subject commentsShawn Villaron
 
Spreadsheet ml subject calc chain
Spreadsheet ml subject   calc chainSpreadsheet ml subject   calc chain
Spreadsheet ml subject calc chainShawn Villaron
 
Spreadsheet ml subject workbook
Spreadsheet ml subject   workbookSpreadsheet ml subject   workbook
Spreadsheet ml subject workbookShawn Villaron
 
Spreadsheet ml subject workbook connections
Spreadsheet ml subject   workbook connectionsSpreadsheet ml subject   workbook connections
Spreadsheet ml subject workbook connectionsShawn Villaron
 
Spreadsheet ml subject tables
Spreadsheet ml subject   tablesSpreadsheet ml subject   tables
Spreadsheet ml subject tablesShawn Villaron
 
Spreadsheet ml subject styles
Spreadsheet ml subject   stylesSpreadsheet ml subject   styles
Spreadsheet ml subject stylesShawn Villaron
 
Spreadsheet ml subject strings
Spreadsheet ml subject   stringsSpreadsheet ml subject   strings
Spreadsheet ml subject stringsShawn Villaron
 
3 wordprocessing ml subject - tables
3   wordprocessing ml subject - tables3   wordprocessing ml subject - tables
3 wordprocessing ml subject - tablesShawn Villaron
 
16 wordprocessing ml subject - odds and ends
16   wordprocessing ml subject - odds and ends16   wordprocessing ml subject - odds and ends
16 wordprocessing ml subject - odds and endsShawn Villaron
 
15 wordprocessing ml subject - fields and hyperlinks
15   wordprocessing ml subject - fields and hyperlinks15   wordprocessing ml subject - fields and hyperlinks
15 wordprocessing ml subject - fields and hyperlinksShawn Villaron
 
14 wordprocessing ml subject - settings
14   wordprocessing ml subject - settings14   wordprocessing ml subject - settings
14 wordprocessing ml subject - settingsShawn Villaron
 
13 wordprocessing ml subject - mail merge
13   wordprocessing ml subject - mail merge13   wordprocessing ml subject - mail merge
13 wordprocessing ml subject - mail mergeShawn Villaron
 
8 wordprocessing ml subject - numbering
8   wordprocessing ml subject - numbering8   wordprocessing ml subject - numbering
8 wordprocessing ml subject - numberingShawn Villaron
 

More from Shawn Villaron (14)

Spreadsheet ml subject external links
Spreadsheet ml subject   external linksSpreadsheet ml subject   external links
Spreadsheet ml subject external links
 
Spreadsheet ml subject comments
Spreadsheet ml subject   commentsSpreadsheet ml subject   comments
Spreadsheet ml subject comments
 
Spreadsheet ml subject calc chain
Spreadsheet ml subject   calc chainSpreadsheet ml subject   calc chain
Spreadsheet ml subject calc chain
 
Spreadsheet ml subject workbook
Spreadsheet ml subject   workbookSpreadsheet ml subject   workbook
Spreadsheet ml subject workbook
 
Spreadsheet ml subject workbook connections
Spreadsheet ml subject   workbook connectionsSpreadsheet ml subject   workbook connections
Spreadsheet ml subject workbook connections
 
Spreadsheet ml subject tables
Spreadsheet ml subject   tablesSpreadsheet ml subject   tables
Spreadsheet ml subject tables
 
Spreadsheet ml subject styles
Spreadsheet ml subject   stylesSpreadsheet ml subject   styles
Spreadsheet ml subject styles
 
Spreadsheet ml subject strings
Spreadsheet ml subject   stringsSpreadsheet ml subject   strings
Spreadsheet ml subject strings
 
3 wordprocessing ml subject - tables
3   wordprocessing ml subject - tables3   wordprocessing ml subject - tables
3 wordprocessing ml subject - tables
 
16 wordprocessing ml subject - odds and ends
16   wordprocessing ml subject - odds and ends16   wordprocessing ml subject - odds and ends
16 wordprocessing ml subject - odds and ends
 
15 wordprocessing ml subject - fields and hyperlinks
15   wordprocessing ml subject - fields and hyperlinks15   wordprocessing ml subject - fields and hyperlinks
15 wordprocessing ml subject - fields and hyperlinks
 
14 wordprocessing ml subject - settings
14   wordprocessing ml subject - settings14   wordprocessing ml subject - settings
14 wordprocessing ml subject - settings
 
13 wordprocessing ml subject - mail merge
13   wordprocessing ml subject - mail merge13   wordprocessing ml subject - mail merge
13 wordprocessing ml subject - mail merge
 
8 wordprocessing ml subject - numbering
8   wordprocessing ml subject - numbering8   wordprocessing ml subject - numbering
8 wordprocessing ml subject - numbering
 

Recently uploaded

ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?Watsoo Telematics
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 

Recently uploaded (20)

ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?What are the features of Vehicle Tracking System?
What are the features of Vehicle Tracking System?
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 

0 wordprocessing ml overview

  • 1. Introduction to WordprocessingML A high-level overview of the structure of a word processing document Ecma/TC45/2006/010 (Rev.)
  • 2. The ‘Document’ • A WordprocessingML document file is a collection of multiple ‘subdocuments’, formally called stories: – The main story – Header(s) / Footer(s) – Footnote(s) / Endnote(s) – Subdocuments – Frame(s) – Comment(s)
  • 3. Shared Story Properties • All stories* in a document share a common set of properties: – Style information – Numbering definitions – Font information – Document settings *with one exception, which we’ll discuss later
  • 4. Style Information • A style defines a specific set of formatting properties – For example, the Normal style in Word 2003 is defined as: • Font = Times New Roman • Font Size = 12 point • Font Language = anguage of Word (English (US) for me) • Justification = Left • Line Spacing = Single
  • 5. Style Types • Word supports six different types of styles: – Paragraph styles – Character styles – Linked styles (paragraph + character) – Table styles – Numbering styles – Default paragraph and character properties
  • 6. Style Cascading/Inheritance • Multiple style ‘types’ can be applied to the same part of a file, so properties are applied in a specific order. • The properties set by one type can be removed or supplemented by following types. • As well, styles of any given type can inherit from other styles of that type. – e.g. The Heading 1 paragraph style inherits properties from the Normal paragraph style
  • 7. Style Application Table Characters Paragraph List Item Table Paragraph Character Direct Formatting Numbering Applicationorder Document Defaults
  • 8. Style Example • Styles are then applied to text via the style’s ID
  • 9. Numbering Definitions • A numbering definition consists of nine levels, each of which have formatting properties – Paragraph properties (e.g. margins) – Number properties (e.g. number text, justification, character formatting, etc.) • A numbered paragraph is specified in two parts: – The numbering definition instance – The numbering level
  • 10. Abstract Numbering Definition • The abstract numbering definition specifies the properties for any or all of the nine levels in the list • A numbering definition instance specifies the properties for a specific numbering definition by inheritance: – References an abstract list definition – Provides overrides for zero or more levels in the numbering definition
  • 11. Numbering Example • Paragraphs are associated with a particular numbering definition instance and level.
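The slide's original example is not preserved in this transcript. The sketch below illustrates the lookup the two numbering slides describe: a paragraph names an instance and a level, the instance points at an abstract definition, and the instance may override individual levels. Element names (`w:abstractNum`, `w:num`, `w:lvlOverride`, `w:startOverride`, `w:numPr`) and the namespace are assumed from the published ECMA-376 markup. Only a start-value override is handled, and the sample data is invented.

```python
import xml.etree.ElementTree as ET

# Assumption: namespace of the published ECMA-376 WordprocessingML schema.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

def qn(local_name):
    """Qualified attribute or element name in the w: namespace."""
    return f"{{{W}}}{local_name}"

NUMBERING_XML = f"""
<w:numbering xmlns:w="{W}">
  <w:abstractNum w:abstractNumId="0">
    <w:lvl w:ilvl="0">
      <w:start w:val="1"/>
      <w:numFmt w:val="decimal"/>
      <w:lvlText w:val="%1."/>
    </w:lvl>
  </w:abstractNum>
  <w:num w:numId="1">
    <w:abstractNumId w:val="0"/>
    <w:lvlOverride w:ilvl="0"><w:startOverride w:val="5"/></w:lvlOverride>
  </w:num>
</w:numbering>
"""

PARAGRAPH_XML = f"""
<w:p xmlns:w="{W}">
  <w:pPr><w:numPr><w:ilvl w:val="0"/><w:numId w:val="1"/></w:numPr></w:pPr>
  <w:r><w:t>First item</w:t></w:r>
</w:p>
"""

def level_properties(numbering, paragraph):
    """Resolve a paragraph's (instance, level) pair to level properties."""
    num_pr = paragraph.find("w:pPr/w:numPr", NS)
    num_id = num_pr.find("w:numId", NS).get(qn("val"))
    ilvl = num_pr.find("w:ilvl", NS).get(qn("val"))
    # Instance -> abstract definition -> requested level.
    num = next(n for n in numbering.findall("w:num", NS) if n.get(qn("numId")) == num_id)
    abstract_id = num.find("w:abstractNumId", NS).get(qn("val"))
    abstract = next(a for a in numbering.findall("w:abstractNum", NS)
                    if a.get(qn("abstractNumId")) == abstract_id)
    lvl = next(l for l in abstract.findall("w:lvl", NS) if l.get(qn("ilvl")) == ilvl)
    props = {child.tag.split("}")[1]: child.get(qn("val")) for child in lvl}
    # Apply the instance's override for this level, if any (start value only).
    for override in num.findall("w:lvlOverride", NS):
        start = override.find("w:startOverride", NS)
        if override.get(qn("ilvl")) == ilvl and start is not None:
            props["start"] = start.get(qn("val"))
    return props

resolved_level = level_properties(ET.fromstring(NUMBERING_XML), ET.fromstring(PARAGRAPH_XML))
print(resolved_level)
```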
  • 12. Font Information • The font information stores two distinct pieces of information: – Embedded fonts (when the producer chooses to embed them) – Font type data • The latter provides characteristics of the font which are used to find a suitable replacement when the specified font is unavailable
  • 13. Document Settings • All settings pertinent to the document are stored in separate parts • These settings can be divided into two groups: – Those which affect presentation • Web settings (e.g. HTML <DIV> and <FRAMESET> data) • Compatibility options – ‘Pure’ settings • View, zoom state • Defaults • User preferences (i.e. ‘don’t ask me this again’)
  • 14. Story Content • Within each story is the actual content, which consists of what are formally called block level structures: – Paragraphs – Tables – Custom Markup (structured document tags, custom XML) – Range Permissions
  • 15. Story Content • Within each paragraph are what are formally called inline structures: – Runs – Custom Markup (structured document tags, custom XML) – Annotations (comments, tracked changes, bookmarks) – DrawingML elements – Fields – Hyperlinks
  • 16. Basic Structural Rules • All text in a word processing document is contained within runs – A run is a region of text with a common set of properties • All runs must be contained within a paragraph – A paragraph is a collection of one or more runs that is displayed as a unit (analogous to the HTML <P> tag)
  • 17. Example • A basic paragraph with three different text formats:
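The slide's original example is not preserved in this transcript. A stand-in sketch follows: one paragraph holding three runs (plain, bold, italic), with a helper that lists each run's text and formatting. The namespace and element names (`w:p`, `w:r`, `w:rPr`, `w:b`, `w:i`, `w:t`) are assumed from the published ECMA-376 markup, and the text is invented.

```python
import xml.etree.ElementTree as ET

# Assumption: namespace of the published ECMA-376 WordprocessingML schema.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

# One paragraph, three runs, each with its own set of run properties.
PARAGRAPH_XML = f"""
<w:p xmlns:w="{W}">
  <w:r><w:t xml:space="preserve">Plain, </w:t></w:r>
  <w:r><w:rPr><w:b/></w:rPr><w:t xml:space="preserve">bold, </w:t></w:r>
  <w:r><w:rPr><w:i/></w:rPr><w:t>italic.</w:t></w:r>
</w:p>
"""

def describe_runs(paragraph):
    """Return a (text, [property element names]) tuple for each run."""
    described = []
    for run in paragraph.findall("w:r", NS):
        formats = sorted(child.tag.split("}")[1] for child in run.findall("w:rPr/*", NS))
        text = "".join(t.text or "" for t in run.findall("w:t", NS))
        described.append((text, formats))
    return described

runs = describe_runs(ET.fromstring(PARAGRAPH_XML))
print(runs)
```

Splitting the paragraph into runs at each formatting change matches the rule that a run is a region of text with a common set of properties.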
  • 18. Basic Structural Rules • A paragraph may itself be at any location which allows block level content: – At the top-most level within a story (e.g. header, footer, main document) – Nested within a table cell – Nested within a structured document tag or annotation markers
  • 19. Tables • Similar to HTML tables, a Word table consists of the table, its properties, rows, and cells. [Diagram labels: Properties, Row, Cell]
  • 20. Tables • Individual table cells can themselves contain any block level content – This means that tables can be nested arbitrarily. [Diagram: nested table]
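Because a cell may hold any block level content, a table can sit inside a cell of another table. The sketch below measures how deeply tables are nested in a small invented fragment. The namespace and element names (`w:tbl`, `w:tr`, `w:tc`) are assumed from the published ECMA-376 markup, and table properties are omitted for brevity.

```python
import xml.etree.ElementTree as ET

# Assumption: namespace of the published ECMA-376 WordprocessingML schema.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
TBL = f"{{{W}}}tbl"

# Invented sample: an outer table whose only cell contains an inner table.
BODY_XML = f"""
<w:body xmlns:w="{W}">
  <w:tbl>
    <w:tr>
      <w:tc>
        <w:tbl>
          <w:tr><w:tc><w:p><w:r><w:t>Inner cell</w:t></w:r></w:p></w:tc></w:tr>
        </w:tbl>
        <w:p/>
      </w:tc>
    </w:tr>
  </w:tbl>
</w:body>
"""

def table_depth(element):
    """Return the maximum number of tables nested below `element`."""
    depth = 0
    for child in element:
        child_depth = table_depth(child)
        if child.tag == TBL:
            child_depth += 1  # count the table itself
        depth = max(depth, child_depth)
    return depth

nesting = table_depth(ET.fromstring(BODY_XML))
print(nesting)
```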
  • 21. Custom Markup • Custom markup can be applied within the contents of any story in a document • These tags can take one of three forms: – Smart tags – Custom XML markup – Structured document tags
  • 22. Custom Defined XML • A facility for embedding arbitrary user XML within the document at either block or inline levels
  • 23. Structured Document Tags • Provide granular semantics at either the block or inline levels – e.g. region can/cannot be edited; region can/cannot be deleted; region should show a date picker/drop-down list/textbox – Do not affect layout • Similar to custom XML, but without the XML schema semantics and with presentation data and more granular properties
  • 24. Sections • Sections in a word processing document specify: – Page properties • Page size • Page orientation • Margins – Header/footer references – Footnote/endnote properties – Column properties
  • 25. Sections • Sections specify (cont'd): – Line numbering – Text direction (RTL vs. LTR; top-to-bottom vs. bottom-to-top)
  • 26. Sections • Four types of sections: – Continuous – Next page (start on next page) – Even (start on next even page) – Odd (start on next odd page)
  • 27. Annotations • Annotations in a word processing document store markup information: – Tracked revisions (insertion, deletion, move) – Comments – Bookmarks
  • 28. Annotations • Annotation markup can be represented in three states: 1 – Inline
  • 29. Annotations 2 – ‘Non-wellformed’ • The markup does not encapsulate the content; instead, there is a start marker and an end marker.
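A bookmark is a convenient illustration of the start/end marker form, because its range can begin in one paragraph and end in another, which a single enclosing element could not express. The sketch below collects the text between a pair of markers by walking the tree in document order. The namespace and element names (`w:bookmarkStart`, `w:bookmarkEnd`) are assumed from the published ECMA-376 markup, and the content is invented.

```python
import xml.etree.ElementTree as ET

# Assumption: namespace of the published ECMA-376 WordprocessingML schema.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
START = f"{{{W}}}bookmarkStart"
END = f"{{{W}}}bookmarkEnd"
TEXT = f"{{{W}}}t"
ID = f"{{{W}}}id"

# Invented sample: the bookmark starts in the first paragraph, ends in the second.
BODY_XML = f"""
<w:body xmlns:w="{W}">
  <w:p>
    <w:r><w:t>Before </w:t></w:r>
    <w:bookmarkStart w:id="0" w:name="clause"/>
    <w:r><w:t>first part</w:t></w:r>
  </w:p>
  <w:p>
    <w:r><w:t>second part</w:t></w:r>
    <w:bookmarkEnd w:id="0"/>
    <w:r><w:t> after</w:t></w:r>
  </w:p>
</w:body>
"""

def bookmarked_text(root, bookmark_id):
    """Collect text nodes found between the matching start and end markers."""
    inside = False
    pieces = []
    for element in root.iter():  # iter() visits elements in document order
        if element.tag == START and element.get(ID) == bookmark_id:
            inside = True
        elif element.tag == END and element.get(ID) == bookmark_id:
            break
        elif inside and element.tag == TEXT:
            pieces.append(element.text or "")
    return pieces

marked = bookmarked_text(ET.fromstring(BODY_XML), "0")
print(marked)
```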
  • 30. Annotations 3 – Property • The deletion of a paragraph mark is in the paragraph’s property set
  • 31. Headers/Footers • There are three types of headers and footers in Word: – Odd page header – Even page header (optional) – First page header (optional) • If one of the optional types is not specified, the odd page header is used
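The fallback rule on this slide can be sketched as a small selection function. The keys `"odd"`, `"even"`, and `"first"` simply mirror the slide's wording and are not the attribute values used in the file format. The function also ignores any document-level setting that might switch distinct even or first-page headers on or off.

```python
def pick_header(headers, page_number, is_first_page=False):
    """Choose a header for a page, falling back to the odd page header.

    `headers` maps "odd", "even", and "first" to header content. Only "odd"
    is required; the other two are optional.
    """
    if is_first_page and "first" in headers:
        return headers["first"]
    if page_number % 2 == 0 and "even" in headers:
        return headers["even"]
    return headers["odd"]

# Only the odd page header is defined, so every page uses it.
only_odd = {"odd": "Report title"}
print(pick_header(only_odd, 2))

# With all three defined, each kind of page gets its own header.
all_three = {"odd": "Report title", "even": "Chapter name", "first": "Cover"}
print(pick_header(all_three, 1, is_first_page=True))
```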
  • 32. Headers/Footers • Headers and footers are stored in separate parts, one per header or footer • Each section refers to its header(s)/footer(s) by an explicit relationship reference:
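The slide's original markup is not preserved in this transcript. The sketch below shows how such a relationship reference might be resolved: the section properties carry a relationship ID, and the relationships part maps that ID to the header part. Everything here is assumed from the published ECMA-376 and Open Packaging Conventions markup, which may differ from the draft these slides describe. In particular, the published schema places a `w:type` attribute on the reference element. The ID `rId7` and the target `header1.xml` are illustrative.

```python
import xml.etree.ElementTree as ET

# Assumptions: namespaces from the published ECMA-376 / OPC specifications.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
R = "http://schemas.openxmlformats.org/officeDocument/2006/relationships"
PKG_RELS = "http://schemas.openxmlformats.org/package/2006/relationships"

# Invented section properties referring to a header part by relationship ID.
SECT_PR_XML = f"""
<w:sectPr xmlns:w="{W}" xmlns:r="{R}">
  <w:headerReference w:type="default" r:id="rId7"/>
</w:sectPr>
"""

# Invented relationships part for the main document.
RELS_XML = f"""
<Relationships xmlns="{PKG_RELS}">
  <Relationship Id="rId7" Type="{R}/header" Target="header1.xml"/>
</Relationships>
"""

def header_targets(sect_pr, rels):
    """Map each header reference in a section to its target part name."""
    targets_by_id = {
        rel.get("Id"): rel.get("Target")
        for rel in rels.findall(f"{{{PKG_RELS}}}Relationship")
    }
    return [
        targets_by_id[ref.get(f"{{{R}}}id")]
        for ref in sect_pr.findall(f"{{{W}}}headerReference")
    ]

targets = header_targets(ET.fromstring(SECT_PR_XML), ET.fromstring(RELS_XML))
print(targets)
```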
  • 33. Headers/Footers • The type is declared in the header/footer part:
  • 34. Footnotes/Endnotes • All footnotes are stored in a single part – Same applies to all endnotes • Footnote references are positioned by a special tag in run content, which specifies the footnote to reference:
  • 35. Footnotes/Endnotes • Within the footnotes part, the actual footnote story content is found via the ID:
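The slide's original markup is not preserved in this transcript. The sketch below shows the ID lookup the two footnote slides describe: a run in the main story carries a reference with an ID, and the footnotes part holds the story with the same ID. The namespace and element names (`w:footnoteReference`, `w:footnotes`, `w:footnote`) are assumed from the published ECMA-376 markup, and the IDs and text are invented.

```python
import xml.etree.ElementTree as ET

# Assumption: namespace of the published ECMA-376 WordprocessingML schema.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}
ID = f"{{{W}}}id"

# Invented run from the main story that references footnote 2.
RUN_XML = f"""
<w:r xmlns:w="{W}"><w:footnoteReference w:id="2"/></w:r>
"""

# Invented footnotes part: every footnote story lives in this one part.
FOOTNOTES_XML = f"""
<w:footnotes xmlns:w="{W}">
  <w:footnote w:id="2">
    <w:p><w:r><w:t>See the specification.</w:t></w:r></w:p>
  </w:footnote>
  <w:footnote w:id="3">
    <w:p><w:r><w:t>Another note.</w:t></w:r></w:p>
  </w:footnote>
</w:footnotes>
"""

def footnote_text(run, footnotes):
    """Return the text of the footnote story that a run references."""
    ref_id = run.find("w:footnoteReference", NS).get(ID)
    for footnote in footnotes.findall("w:footnote", NS):
        if footnote.get(ID) == ref_id:
            return "".join(t.text or "" for t in footnote.iter(f"{{{W}}}t"))
    raise KeyError(ref_id)

note = footnote_text(ET.fromstring(RUN_XML), ET.fromstring(FOOTNOTES_XML))
print(note)
```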
  • 36. Glossary Document • Remember that exception to the ‘all stories share the same data’ rule? • The glossary document is a completely distinct main story – Specifies its own styles, lists, fonts, settings • This story is used to store document fragments which may be inserted at a later time
  • 37. File Format Types • Template (DOTX) – classic “DOT” • Document (DOCX) – classic “DOC” • Both utilize the same file format – differentiation is a function of the main content type and file extension only
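Telling a template from a document by the main part's content type could look like the sketch below. The two content type strings and the `[Content_Types].xml` layout are assumed from the published Office Open XML packages, not from these slides, and the sample part name is illustrative.

```python
import xml.etree.ElementTree as ET

# Assumption: content-types namespace from the Open Packaging Conventions.
CT = "http://schemas.openxmlformats.org/package/2006/content-types"

# Assumption: main-part content types used by published DOCX and DOTX files.
MAIN_TYPES = {
    "application/vnd.openxmlformats-officedocument.wordprocessingml.document.main+xml": "document",
    "application/vnd.openxmlformats-officedocument.wordprocessingml.template.main+xml": "template",
}

# Invented [Content_Types].xml content for a template package.
CONTENT_TYPES_XML = f"""
<Types xmlns="{CT}">
  <Default Extension="xml" ContentType="application/xml"/>
  <Override PartName="/word/document.xml"
            ContentType="application/vnd.openxmlformats-officedocument.wordprocessingml.template.main+xml"/>
</Types>
"""

def classify_package(content_types_root):
    """Return "document", "template", or None based on the main content type."""
    for override in content_types_root.findall(f"{{{CT}}}Override"):
        kind = MAIN_TYPES.get(override.get("ContentType"))
        if kind is not None:
            return kind
    return None

package_kind = classify_package(ET.fromstring(CONTENT_TYPES_XML))
print(package_kind)
```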
  • 38. Disclaimer This presentation is for informational purposes only, and should not be relied upon as a substitute or replacement for Microsoft formal file format documentation, which is available at the following website: https://msdn.microsoft.com/en-us/library/cc313118(v=office.12).aspx. Any views or opinions presented in this material are solely those of the author and do not necessarily represent those of Microsoft. Microsoft disclaims all liability for mistakes or inaccuracies in this presentation.