XML Work Flow

Published in: Technology

  1. 1. SOFTWARE SOLUTIONS HEYDAY
  2. 2. Current Projects
      • E-Publishing
          • IMF
          • Wiley UK
          • VST
  3. 3. PDF to XML Work Flow
      • Data Capture
      • Coding
      • Validation
      • E-Deployment
  4. 4. Data Capture
      • Capture Text, Box-Text and Box-Footnotes from Source PDF (only Chapters)
      • Capture Chapter/Article Footnotes from Source PDF (only Chapters)
      • Capture Images & Tables as JPG and Image-Related Text as HTML from Source PDF
      • Capture Table Content from the Source PDF as Text and add IMF Tags
  5. 5. Coding
      • Merging of all the Data Capture tasks as per IMF specification
      • Creating Front Matter from source PDF parts (TOC, Preface, Abbreviations, Main Messages)
      • Creating Back Matter from source PDF parts (Appendixes, Glossary, References)
      • Image Editing as per IMF specification
      • Merging of all the above tasks as per IMF specification
  6. 6. Validation
      • QC with Epsilon
      • QC with Browsers for Desired View
      • QC with Oxygen
  7. 7. E-Deployment
      • Deploy in Customer-Desired Format
  8. 8. PDF to XML Work Flow
      [Flowchart: INPUT (PDF), TASK 1 through TASK 13, QC DEPART, DELIVERABLE (XML)]
  9. 9. Detailed Work Flow
      • TASK 1: Capture Text, Box-Text and Box-Footnotes from Source PDF (Chapters)
      • TASK 2: Capture Footnotes of Chapter/Article from Source PDF
      • TASK 3: Capture Images & Tables as JPG from Source PDF (Chapters)
      • TASK 4: Capture Table Data as Text from Source PDF and add IMF Table Tags
      • TASK 6: Capture Front Matter from Source PDF (TOC, Preface, Abbreviations, Main Messages)
      • TASK 7: Capture Back Matter from Source PDF (Appendixes, Glossaries and References)
      • TASK 5: Merge the output of all previous Tasks into one and add the required IMF Tags
      • TASK 8: Edit all Images to set the required Resolution and Size
      • TASK 9: Merge Tasks 5 to 8 to get the final output
      • Validation through Epsilon
      • Validation through Browser for Desired View
      • Validation against the IMF DTD using Oxygen
  10. 10. Tasks Distribution and Methodology
      • Team Members do Task 1, Task 2, Task 3, Task 4: Capturing Various Types of Data
      • Team Leaders do Task 5, Task 6, Task 7, Task 8, Task 9: Code around the Data
      • Quality Analyst does Task 10, Task 11, Task 12, Task 13: Validate the Code and Data
      • Using: Abbyy FineReader, Epsilon Editor, Epsilon, DTD, XSL, Oxygen
  11. 11. TASK 1: SAMPLE
      Description: Capture Text from Source PDF (only Chapters) using OCR Tool
      Input: Source PDF
  12. 12. TASK 1: SAMPLE
      Output: One HTML file for each Chapter/Article
  13. 13. TASK 2: SAMPLE
      Description: Capture Chapter/Article Footnotes from Source PDF (only Chapters)
      Input: Source PDF
  14. 14. TASK 2: SAMPLE
      Output: One HTML file for each Chapter/Article, or multiple HTML files when a footnote ID repeats
  15. 15. TASK 3: SAMPLE
      Description: Capture Images & Tables as JPG and Image-Related Text as HTML from Source PDF
      Input: Source PDF
  16. 16. TASK 3: SAMPLE
      Output: Multiple JPGs & One HTML
  17. 17. TASK 4: SAMPLE
      Description: Capture Table Content from the Source PDF as Text and add IMF Tags
      Input: Source PDF
  18. 18. TASK 4: SAMPLE
      Output: HTML
  19. 19. TASK 5: SAMPLE
      Description: Merging of all the above Tasks (1 to 4) as per IMF specification
      Input: Task 1 to Task 4
      Output: HTML
  20. 20. TASK 6: SAMPLE
      Description: Capture Front Matter from source PDF parts (TOC, Preface, Abbreviations, Main Messages)
      Input: Source PDF
  21. 21. TASK 6: SAMPLE
      Output: HTML
  22. 22. TASK 7: SAMPLE
      Description: Capture Back Matter from source PDF parts (Appendixes, Glossary, References)
      Input: Source PDF
  23. 23. TASK 7: SAMPLE
      Output: HTML
  24. 24. TASK 8: SAMPLE
      Description: Image Editing as per IMF specification
      Input: Source PDF
      Output: Final JPGs
  25. 25. TASK 9: SAMPLE
      Description: Merging of all the above Tasks (5, 6, 7, 8) as per IMF specification
      Input: Task 5 to Task 8
      Output: Final XML without Validation
  26. 26. TASK 10: SAMPLE
      Description: First-Level Validation with Epsilon
      Input: Task 9 XML
      Output: XML
  27. 27. TASK 11: SAMPLE
      Description: Validation with Browsers for Desired View
      Output: Final XML
      Validation: Second Level
  28. 28. TASK 12: SAMPLE
      Description: Validation with Oxygen against the IMF DTD
      Output: Final XML
      Validation: Third Level
  29. 29. TASK 13: SAMPLE
      Description: Packing Process in Desired Manner
      Output: Deliverable Product
  30. 30. Thank You [email_address]
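
The task slides state which inputs each task consumes (Tasks 1 to 4 feed Task 5, Tasks 5 to 8 feed Task 9, and Task 9 feeds the validation levels). A minimal sketch of that dependency structure, assuming Python 3.9+ for the standard `graphlib` module, might look like the following. The `DEPENDS_ON` table and the `execution_order` helper are hypothetical names introduced here for illustration, and the strictly sequential order of Tasks 10 to 13 is an assumption based on the "First/Second/Third Level" labels, not something the slides spell out.

```python
from graphlib import TopologicalSorter

# Each task maps to the set of inputs it consumes, following the
# "Input" lines on the task slides. "PDF" stands for the source PDF.
# Assumption: Tasks 10 to 13 run strictly one after another.
DEPENDS_ON = {
    "TASK 1": {"PDF"},
    "TASK 2": {"PDF"},
    "TASK 3": {"PDF"},
    "TASK 4": {"PDF"},
    "TASK 5": {"TASK 1", "TASK 2", "TASK 3", "TASK 4"},
    "TASK 6": {"PDF"},
    "TASK 7": {"PDF"},
    "TASK 8": {"PDF"},
    "TASK 9": {"TASK 5", "TASK 6", "TASK 7", "TASK 8"},
    "TASK 10": {"TASK 9"},
    "TASK 11": {"TASK 10"},
    "TASK 12": {"TASK 11"},
    "TASK 13": {"TASK 12"},
}


def execution_order(depends_on):
    """Return one valid order in which the tasks can be run."""
    return list(TopologicalSorter(depends_on).static_order())


if __name__ == "__main__":
    print(" -> ".join(execution_order(DEPENDS_ON)))
```

Tasks that do not depend on each other (for example Tasks 1 to 4) may come out in any relative order, which is consistent with handing them to different team members in parallel.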

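Tasks 10 to 12 validate the merged XML at three levels. As a rough illustration of the simplest check that must pass before any of them, the sketch below tests whether a string is well-formed XML using Python's standard library. It does not reproduce what Epsilon or Oxygen do, and it does not validate against the IMF DTD (the standard-library parser performs no DTD validation). The function name `check_well_formed` and the sample element names are hypothetical.

```python
import xml.etree.ElementTree as ET


def check_well_formed(xml_text):
    """Return (True, None) if xml_text parses, else (False, error message)."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError as err:
        # The message includes the line and column of the first problem.
        return False, str(err)
    return True, None


if __name__ == "__main__":
    # Hypothetical sample documents, not taken from the IMF DTD.
    good = "<chapter><title>Overview</title></chapter>"
    bad = "<chapter><title>Overview</chapter>"
    print(check_well_formed(good))
    print(check_well_formed(bad))
```

A file that fails this check would also fail DTD validation, so running it first gives faster feedback on merge errors from Task 9.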