SlideShare a Scribd company logo
1 of 30
SOFTWARE SOLUTIONS HEYDAY
Current Projects ,[object Object],[object Object],[object Object],[object Object]
PDF to XML  Work Flow ,[object Object],[object Object],[object Object],[object Object]
Data Capture ,[object Object],[object Object],[object Object],[object Object]
Coding ,[object Object],[object Object],[object Object],[object Object],[object Object]
Validation ,[object Object],[object Object],[object Object]
E-Deployment ,[object Object]
TASK 2 TASK 3 TASK 8 TASK 7 TASK 4 TASK 6 TASK 12 TASK 11 TASK 10 TASK 9 QC DEPART TASK 13 DELIVERABLE (XML) INPUT (PDF) TASK 1 TASK 5 Pdf to XML WORK FLOW
TASK 1 Capture Text, Box-Text and Box-Footnotes  from Source PDF-Chapters TASK 2 Capture Footnotes of Chapter/Article  from Source PDF TASK 3 Capture Images & Tables as JPG  from Source PDF-Chapters TASK 4 Capture Table Data as Text from Source PDF and Add IMF-Table Tags TASK 6 Capture Front Matter from Source PDF (TOC,Preface,Abbrevations,Main Messages) TASK 7 Capture Back Matter from Source PDF (Appendixes, Glossaries and References) TASK 5 Merge all previous Tasks output into one and add Required IMF Tags TASK 8 Edit all Images to set required resolution and Size TASK 9 Merge Tasks (from 5 to 8) to get final output Validation Through Epsilon Validation Through Browser for Desired View Validation against of IMF- DTD using Oxygen Detailed Work Flow
Team Members Team Leaders Quality Analyst Abbyy FineReader Epsilon Editor Epsilon DTD XSL Oxygen Task 1, Task 2, Task 3, Task 4 Task 5, Task 6, Task 7, Task 8, Task 9 Task 10, Task 11, Task 12, Task 13 Do Do Do Using Using Using Tasks Distribution and Methodology Capturing Various Type of Data Code around the Data Validate the Code and Data
TASK 1 :  SAMPLE  Description :  Capture Text from Source PDF (Only Chapters) Using OCR Tool Input :   Source PDF
TASK 1 :  SAMPLE  Output :  One HTML file for each Chapter/Article
TASK 2 :   SAMPLE Description :   Capture Chapter/Article-Foot Notes from Source PDF- Only Chapters Input :   Source PDF
TASK 2 :  SAMPLE Output :   One html or multiple html when footnote repeats its ID for each Chapter/Article
TASK 3 :   SAMPLE Description :   Capture Images & Tables as JPG and Image Related Text as HTML    from Source PDF Input :   Source PDF
TASK 3 :   SAMPLE Output :  Multiple JPG’s & One HTML
TASK 4 :   SAMPLE Description :   Capture Table Content from the Source PDF as Text and add IMF TAGS Input :   Source PDF
TASK 4 :   SAMPLE Output :  HTML
TASK 5 :   SAMPLE Description :   Merging of all the above Tasks(1 to 4) as per IMF specification Input :   Task 1 to Task 4 Output:   HTML
TASK 6 :   SAMPLE  Input :  Source PDF Description :   Capture Front Matter from source PDF parts ( TOC, Preface,      Abbreviations, Main Messages)
TASK 6 :   SAMPLE Output :  HTML
TASK 7 :   SAMPLE Description :   Capture Back Matter from source PDF parts (Appendixes, Glossary,    References) Input :   Source PDF
TASK 7 :   SAMPLE Output : HTML
TASK 8 :   SAMPLE Description :   Image Editing as per IMF specification Output :  Final JPG’s Input :   Source PDF
TASK 9  :   SAMPLE Description :   Merging of all the above tasks(5,6,7,8) as per IMF specification Output :  Final XML without Validation Input :   Task 5 to Task 8
TASK 10 :   SAMPLE Description :   First Level  Validation With Epsilon Output :   XML Input :   Task 9 - XML
TASK 11 :   SAMPLE Description :   Validation With Browsers for desired View  Output :  Final XML Validation- Second Level
TASK 12 :   SAMPLE Description :   Validation With Oxygen against of IMF-DTD Output :  Final XML Validation- Third Level
TASK 13 :   SAMPLE Description :   Packing Process in Desired Manner Output :  Deliverable Product
Thank You [email_address]

More Related Content

Similar to Xml Work Flow

Phpconf taiwan-2012
Phpconf taiwan-2012Phpconf taiwan-2012
Phpconf taiwan-2012
Hash Lin
 
Spotfire Integration & Dynamic Output creation
Spotfire Integration & Dynamic Output creationSpotfire Integration & Dynamic Output creation
Spotfire Integration & Dynamic Output creation
Ambareesh Kulkarni
 
ODTUG KSCOPE 2018 - REST APIs for FDMEE and Cloud Data Management
ODTUG KSCOPE 2018 - REST APIs for FDMEE and Cloud Data ManagementODTUG KSCOPE 2018 - REST APIs for FDMEE and Cloud Data Management
ODTUG KSCOPE 2018 - REST APIs for FDMEE and Cloud Data Management
Francisco Amores
 
Make everything realtime & collaborative - JS Summit 2014
Make everything realtime & collaborative - JS Summit 2014Make everything realtime & collaborative - JS Summit 2014
Make everything realtime & collaborative - JS Summit 2014
Joseph Gentle
 

Similar to Xml Work Flow (20)

14 Easy Steps to End-User Empowerment: Convert Custom Reports to BI Publisher
14 Easy Steps to End-User Empowerment: Convert Custom Reports to BI Publisher14 Easy Steps to End-User Empowerment: Convert Custom Reports to BI Publisher
14 Easy Steps to End-User Empowerment: Convert Custom Reports to BI Publisher
 
Dxl As A Lotus Domino Integration Tool
Dxl As A Lotus Domino Integration ToolDxl As A Lotus Domino Integration Tool
Dxl As A Lotus Domino Integration Tool
 
Phpconf taiwan-2012
Phpconf taiwan-2012Phpconf taiwan-2012
Phpconf taiwan-2012
 
Xml Publisher And Reporting To Excel
Xml Publisher And Reporting To ExcelXml Publisher And Reporting To Excel
Xml Publisher And Reporting To Excel
 
Tcs3 stc creating e pub and apps with tcs3
Tcs3 stc creating e pub and apps with tcs3Tcs3 stc creating e pub and apps with tcs3
Tcs3 stc creating e pub and apps with tcs3
 
Apex RnD APEX 5 - Printing
Apex RnD APEX 5 - PrintingApex RnD APEX 5 - Printing
Apex RnD APEX 5 - Printing
 
Web Data & Reporting Zipline – FME Summer Camp
Web Data & Reporting Zipline – FME Summer CampWeb Data & Reporting Zipline – FME Summer Camp
Web Data & Reporting Zipline – FME Summer Camp
 
Spotfire Integration & Dynamic Output creation
Spotfire Integration & Dynamic Output creationSpotfire Integration & Dynamic Output creation
Spotfire Integration & Dynamic Output creation
 
ODTUG KSCOPE 2018 - REST APIs for FDMEE and Cloud Data Management
ODTUG KSCOPE 2018 - REST APIs for FDMEE and Cloud Data ManagementODTUG KSCOPE 2018 - REST APIs for FDMEE and Cloud Data Management
ODTUG KSCOPE 2018 - REST APIs for FDMEE and Cloud Data Management
 
PDF Generation in Rails with Prawn and Prawn-to: John McCaffrey
PDF Generation in Rails with Prawn and Prawn-to: John McCaffreyPDF Generation in Rails with Prawn and Prawn-to: John McCaffrey
PDF Generation in Rails with Prawn and Prawn-to: John McCaffrey
 
Capturing requirements: Importing documents
Capturing requirements: Importing documentsCapturing requirements: Importing documents
Capturing requirements: Importing documents
 
Bi
BiBi
Bi
 
ODF Toolkit with .NET Support
ODF Toolkit with .NET SupportODF Toolkit with .NET Support
ODF Toolkit with .NET Support
 
Make everything realtime & collaborative - JS Summit 2014
Make everything realtime & collaborative - JS Summit 2014Make everything realtime & collaborative - JS Summit 2014
Make everything realtime & collaborative - JS Summit 2014
 
The future will be Realtime & Collaborative
The future will be Realtime & CollaborativeThe future will be Realtime & Collaborative
The future will be Realtime & Collaborative
 
ILUG 2007 - Notes and Office Integration
ILUG 2007 - Notes and Office IntegrationILUG 2007 - Notes and Office Integration
ILUG 2007 - Notes and Office Integration
 
spraa64
spraa64spraa64
spraa64
 
spraa64
spraa64spraa64
spraa64
 
spraa64
spraa64spraa64
spraa64
 
spraa64
spraa64spraa64
spraa64
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Recently uploaded (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Xml Work Flow

  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8. TASK 2 TASK 3 TASK 8 TASK 7 TASK 4 TASK 6 TASK 12 TASK 11 TASK 10 TASK 9 QC DEPART TASK 13 DELIVERABLE (XML) INPUT (PDF) TASK 1 TASK 5 Pdf to XML WORK FLOW
  • 9. TASK 1 Capture Text, Box-Text and Box-Footnotes from Source PDF-Chapters TASK 2 Capture Footnotes of Chapter/Article from Source PDF TASK 3 Capture Images & Tables as JPG from Source PDF-Chapters TASK 4 Capture Table Data as Text from Source PDF and Add IMF-Table Tags TASK 6 Capture Front Matter from Source PDF (TOC,Preface,Abbrevations,Main Messages) TASK 7 Capture Back Matter from Source PDF (Appendixes, Glossaries and References) TASK 5 Merge all previous Tasks output into one and add Required IMF Tags TASK 8 Edit all Images to set required resolution and Size TASK 9 Merge Tasks (from 5 to 8) to get final output Validation Through Epsilon Validation Through Browser for Desired View Validation against of IMF- DTD using Oxygen Detailed Work Flow
  • 10. Team Members Team Leaders Quality Analyst Abbyy FineReader Epsilon Editor Epsilon DTD XSL Oxygen Task 1, Task 2, Task 3, Task 4 Task 5, Task 6, Task 7, Task 8, Task 9 Task 10, Task 11, Task 12, Task 13 Do Do Do Using Using Using Tasks Distribution and Methodology Capturing Various Type of Data Code around the Data Validate the Code and Data
  • 11. TASK 1 : SAMPLE Description : Capture Text from Source PDF (Only Chapters) Using OCR Tool Input : Source PDF
  • 12. TASK 1 : SAMPLE Output : One HTML file for each Chapter/Article
  • 13. TASK 2 : SAMPLE Description : Capture Chapter/Article-Foot Notes from Source PDF- Only Chapters Input : Source PDF
  • 14. TASK 2 : SAMPLE Output : One html or multiple html when footnote repeats its ID for each Chapter/Article
  • 15. TASK 3 : SAMPLE Description : Capture Images & Tables as JPG and Image Related Text as HTML from Source PDF Input : Source PDF
  • 16. TASK 3 : SAMPLE Output : Multiple JPG’s & One HTML
  • 17. TASK 4 : SAMPLE Description : Capture Table Content from the Source PDF as Text and add IMF TAGS Input : Source PDF
  • 18. TASK 4 : SAMPLE Output : HTML
  • 19. TASK 5 : SAMPLE Description : Merging of all the above Tasks(1 to 4) as per IMF specification Input : Task 1 to Task 4 Output: HTML
  • 20. TASK 6 : SAMPLE Input : Source PDF Description : Capture Front Matter from source PDF parts ( TOC, Preface, Abbreviations, Main Messages)
  • 21. TASK 6 : SAMPLE Output : HTML
  • 22. TASK 7 : SAMPLE Description : Capture Back Matter from source PDF parts (Appendixes, Glossary, References) Input : Source PDF
  • 23. TASK 7 : SAMPLE Output : HTML
  • 24. TASK 8 : SAMPLE Description : Image Editing as per IMF specification Output : Final JPG’s Input : Source PDF
  • 25. TASK 9 : SAMPLE Description : Merging of all the above tasks(5,6,7,8) as per IMF specification Output : Final XML without Validation Input : Task 5 to Task 8
  • 26. TASK 10 : SAMPLE Description : First Level Validation With Epsilon Output : XML Input : Task 9 - XML
  • 27. TASK 11 : SAMPLE Description : Validation With Browsers for desired View Output : Final XML Validation- Second Level
  • 28. TASK 12 : SAMPLE Description : Validation With Oxygen against of IMF-DTD Output : Final XML Validation- Third Level
  • 29. TASK 13 : SAMPLE Description : Packing Process in Desired Manner Output : Deliverable Product