1
Unlocking the Power of UiPath:
A Journey into Document Understanding
UiPath St. Louis Chapter
2
A little bit about me…
• Ian Wickline; Senior Trainer, Partner Enablement
• 20+ Year Information Technology Professional
• Adjunct Professor: Computer Science, Information Technology, and Business Management
• Extensive Software Development Background
• Mainframe Developer
• Modern Day Web Development
• Various Data Analytics Roles
• Most Recently: UiPath!
• Lead Automation Engineer
• Developed and implemented numerous UiPath-based production automations
• Solution Architecture for many more UiPath-based production automations
• Development of a 5-day training program
3
Agenda
• Introduction to Document Understanding
• What is Document Understanding?
• Document Types
• Document Processing Methodologies
• Model-Based Versus Rule-Based Extraction
• Document Understanding: Part of an End-to-end Automation Process
• Document Understanding: Typical Workflow
• Classification and Extraction: Success Criteria
• Example: Classification and Extraction: Invoice Post-processing
• Document Understanding Architectures
4
Introduction to Document Understanding
Get your documents processed intelligently.
Teach your robots to understand documents using rules, templates, or skills enhanced by Artificial Intelligence (AI) for data extraction and interpretation.
Drag and drop these capabilities directly into your Robotic Process Automation (RPA) workflows to combine AI and RPA.
5
What Is Document Understanding?
[Diagram: Document Understanding sits where Artificial Intelligence (AI), Document Processing, and Robotic Process Automation (RPA) overlap.]
Document Understanding is the ability to extract and interpret information and meaning from a wide range of documents. It emerges at the intersection of document processing, Artificial Intelligence (AI), and RPA.
6
Document Types
Structured
▪ Required information found in the same place
▪ Fixed in format
▪ Examples: Forms, passports, licenses, and time sheets containing handwritten text, signatures, checkboxes
Semi-structured
▪ Repetitive information each time
▪ Found in fixed and variable document parts such as tables
▪ Examples: Invoices, receipts, purchase orders, medical bills, bank statements, utility bills
Unstructured
▪ No fixed format
▪ Examples: Contracts, agreements, emails, disease descriptions, drug prescriptions, news, voice scripts
7
Document Processing Methodologies
Rule-based
▪ RegEx Based Extractor: Structured fields, mostly used for structured documents
▪ Form Extractor: Mostly structured documents, tables, checkboxes, handwriting, signatures
Model-based
▪ Forms AI: Most structured documents (forms)
▪ Machine Learning Extractor: Mostly semi-structured documents
Hybrid
▪ A combination of both rule-based and model-based extractors
▪ Mostly documents combining both structured and less structured formats
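One way to picture the hybrid approach is to try a rule first and ask a model only for the fields the rule misses. The sketch below is plain Python for illustration, not the UiPath API: the field names, the regular expressions, and the `fake_model` stand-in for a trained extractor are all hypothetical.

```python
import re
from typing import Callable, Dict, List, Optional

# Hypothetical rules: one regular expression per field, each with one capture group.
RULES: Dict[str, "re.Pattern[str]"] = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:No\.?|#)\s*[:\-]?\s*([A-Z0-9\-]+)", re.IGNORECASE
    ),
    "total": re.compile(
        r"Total\s*[:\-]?\s*\$?\s*([0-9]+(?:\.[0-9]{2})?)", re.IGNORECASE
    ),
}


def rule_based_extract(text: str) -> Dict[str, Optional[str]]:
    """Apply each regex rule; a field is None when its rule does not match."""
    result: Dict[str, Optional[str]] = {}
    for field, pattern in RULES.items():
        match = pattern.search(text)
        result[field] = match.group(1) if match else None
    return result


def hybrid_extract(
    text: str, model_extract: Callable[[str, List[str]], Dict[str, str]]
) -> Dict[str, Optional[str]]:
    """Keep rule results where available and ask the model only for the gaps."""
    fields = rule_based_extract(text)
    missing = [name for name, value in fields.items() if value is None]
    if missing:
        predicted = model_extract(text, missing)
        for name in missing:
            fields[name] = predicted.get(name)
    return fields


def fake_model(text: str, wanted: List[str]) -> Dict[str, str]:
    """Placeholder for a trained ML extractor; returns canned values."""
    return {name: "UNKNOWN" for name in wanted}


# A layout the rules know, and one they do not.
known = hybrid_extract("Invoice No: INV-1042\nTotal: $250.00", fake_model)
unknown = hybrid_extract("Ref INV-1042, amount due 250.00", fake_model)
```

On the known layout the rules fill every field and the model is never called. On the unfamiliar layout both rules miss, which is the case where a model-based extractor earns its place.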
8
Model-based Versus Rule-based Extraction
Rule-based extraction (template-based approach)
▪ Relies on rules or templates.
▪ Processes documents in a fixed format for structured data like forms or licenses.
▪ Has high accuracy for already known documents.
▪ Requires extra costs for the addition of templates or rules and ongoing maintenance.
▪ Is unable to work on unknown documents.
Model-based extraction (template-less approach)
▪ Relies on ML models.
▪ Processes less structured documents with varying layouts, such as invoices or receipts.
▪ Understands even the most obscure and complex documents.
▪ Requires pre-trained ML models with further retraining capabilities.
9
Document Understanding: Part of an End-to-end Automation Process
Document Understanding processes aren’t standalone. They are a part of bigger business processes to be automated.
Here is the architecture for an end-to-end business process involving Document Understanding.
▪ “Upstream” automation, prior to Document Understanding: acts as dispatcher for the Document Understanding process.
▪ Document Understanding process: runs one job for each target file and acts as dispatcher for the subsequent processes.
▪ “Downstream” automation, post Document Understanding: utilizes extracted data.
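The dispatcher relationship between the three stages can be sketched with two in-memory queues. This is only an illustration in Python: the function names, the queue items, and the `process_file` callable are hypothetical, and a real deployment would use its orchestration platform's queues and jobs rather than `collections.deque`.

```python
from collections import deque
from typing import Callable, Deque, Dict, List, Tuple


def upstream(files: List[str], du_queue: Deque[Dict]) -> None:
    """Upstream automation: dispatch one work item per target file."""
    for path in files:
        du_queue.append({"file": path})


def document_understanding(
    du_queue: Deque[Dict],
    downstream_queue: Deque[Dict],
    process_file: Callable[[str], Dict],
) -> None:
    """Run one job per file, then dispatch the result to the next process."""
    while du_queue:
        item = du_queue.popleft()
        extracted = process_file(item["file"])
        downstream_queue.append({"file": item["file"], "data": extracted})


def downstream(downstream_queue: Deque[Dict]) -> List[Tuple[str, Dict]]:
    """Downstream automation: consume the extracted data."""
    handled = []
    while downstream_queue:
        item = downstream_queue.popleft()
        handled.append((item["file"], item["data"]))
    return handled


du_queue: Deque[Dict] = deque()
downstream_queue: Deque[Dict] = deque()

upstream(["a.pdf", "b.pdf"], du_queue)
document_understanding(
    du_queue, downstream_queue, lambda path: {"pages": 1, "source": path}
)
results = downstream(downstream_queue)
```

Keeping each stage behind its own queue means the Document Understanding step can fail or be retried for a single file without blocking the files around it.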
10
Document Understanding Typical Workflow
▪ Load taxonomy: defines document types and fields for processing.
▪ Digitize documents using Optical Character Recognition (OCR) to make them machine-readable.
▪ Classify and split the files into document types.
▪ Extract information from the documents.
▪ Export the extracted data for further usage.
Human review and training steps:
▪ Validate classification results (human review).
▪ Train classifiers based on the validated data.
▪ Validate extractor results (human review).
▪ Train extractors based on the validated data.
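The main steps of the workflow can be sketched as a chain of small functions. This is a toy illustration in Python under loud assumptions: the taxonomy contents, the keyword classifier, and the regular expressions are hypothetical, `digitize` merely decodes text where a real pipeline would run OCR, and none of this is the UiPath activity API.

```python
import re
from typing import Dict, Optional

# Hypothetical taxonomy: document types and the fields expected for each.
TAXONOMY = {
    "invoice": ["invoice_number", "total"],
    "receipt": ["total"],
}

# Hypothetical extraction patterns, one per field.
PATTERNS = {
    "invoice_number": r"Invoice\s+([A-Z0-9-]+)",
    "total": r"Total\s+([0-9.]+)",
}


def digitize(raw: bytes) -> str:
    """Stand-in for OCR: here the 'scan' is already UTF-8 text."""
    return raw.decode("utf-8")


def classify(text: str) -> str:
    """Toy keyword classifier; a real classifier would be trained."""
    lowered = text.lower()
    for doc_type in TAXONOMY:
        if doc_type in lowered:
            return doc_type
    return "unknown"


def extract(text: str, doc_type: str) -> Dict[str, Optional[str]]:
    """Pull out only the fields the taxonomy defines for this document type."""
    fields: Dict[str, Optional[str]] = {}
    for name in TAXONOMY.get(doc_type, []):
        match = re.search(PATTERNS[name], text)
        fields[name] = match.group(1) if match else None
    return fields


def export(doc_type: str, fields: Dict[str, Optional[str]]) -> Dict:
    """Package the result for whatever consumes it next."""
    return {"document_type": doc_type, "fields": fields}


def run_pipeline(raw: bytes) -> Dict:
    text = digitize(raw)
    doc_type = classify(text)
    # Human validation and retraining would hook in after classify and extract.
    fields = extract(text, doc_type)
    return export(doc_type, fields)


output = run_pipeline(b"Invoice INV-7\nTotal 19.99")
```

The taxonomy drives both classification and extraction here, which mirrors why it is loaded first: every later step needs to know which document types and fields exist.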
11
Classification and Extraction Success Criteria
▪ After both the classification and extraction steps, there are dedicated workflows wherein you can implement the business rule validation.
▪ Based on the outcome of the validation workflows, a decision is taken whether the document will be sent for human validation or not.
▪ If no logic is implemented, the documents will be sent, by default, for human validation.
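The routing decision described above can be sketched as a single function. This is an illustrative Python sketch, not UiPath code: `needs_human_validation` and the example rule `total_is_present` are hypothetical names, and the only behavior taken from the slide is the default of sending documents to a human when no rule is implemented.

```python
from typing import Callable, Optional


def needs_human_validation(
    document: dict,
    business_rule: Optional[Callable[[dict], bool]] = None,
) -> bool:
    """Return True when the document should go to human validation.

    With no business rule implemented, the document is sent for human
    review by default. Otherwise it is sent only when the rule fails.
    """
    if business_rule is None:
        return True
    return not business_rule(document)


def total_is_present(document: dict) -> bool:
    """Hypothetical business rule: the 'total' field must have a value."""
    return document.get("total") is not None


no_rule = needs_human_validation({"total": "10.00"})
rule_passes = needs_human_validation({"total": "10.00"}, total_is_present)
rule_fails = needs_human_validation({}, total_is_present)
```

Defaulting to human review is the conservative choice: a missing rule costs reviewer time rather than letting unchecked data flow downstream.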
12
Invoice Post-processing
These rules should NOT be used as-is, except for demo purposes. For a real implementation, post-processing and validation should be tailored to the specifics of the business process.
For the invoice document type, a standard workflow is available that validates six business rules.
▪ Verify that all mandatory fields and columns are extracted.
▪ Verify that all table rows match the rule: Quantity * Unit Price = Line Amount.
▪ Verify that the sum of all Line Amounts = Net Amount.
▪ Verify the sum of the Net Amount and all fields defined as SubTotalAdditions = Total.
▪ Verify the extraction confidence of all defined ‘ConfidenceFields’ against their individual confidence thresholds.
▪ Verify the extraction confidence of all the other fields against the ‘other-Confidence’ threshold.
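The six rules above can be sketched as one validation function. This is a demo-only Python sketch under stated assumptions: the field names, the configuration values, the thresholds, and the shape of the `invoice` dictionary are all hypothetical, and it is not the standard workflow itself. Amounts use `Decimal` so the equality checks are exact.

```python
from decimal import Decimal
from typing import List

# Hypothetical configuration; a real project would tailor all of this.
MANDATORY_FIELDS = ["net_amount", "total"]
MANDATORY_COLUMNS = ["quantity", "unit_price", "line_amount"]
SUBTOTAL_ADDITIONS = ["tax", "shipping"]
CONFIDENCE_FIELDS = {"total": 0.90}  # per-field confidence thresholds
OTHER_CONFIDENCE = 0.70              # threshold for every other field


def validate_invoice(invoice: dict) -> List[str]:
    """Return the names of failed rules; an empty list means all six passed."""
    fields, rows = invoice["fields"], invoice["rows"]

    # Rule 1: all mandatory fields and columns are extracted.
    fields_ok = all(
        fields.get(name, {}).get("value") is not None for name in MANDATORY_FIELDS
    )
    columns_ok = all(
        row.get(col) is not None for row in rows for col in MANDATORY_COLUMNS
    )
    if not (fields_ok and columns_ok):
        return ["mandatory"]  # the arithmetic rules need these values

    failures = []

    # Rule 2: Quantity * Unit Price = Line Amount on every row.
    if any(r["quantity"] * r["unit_price"] != r["line_amount"] for r in rows):
        failures.append("line_amount")

    # Rule 3: the sum of all Line Amounts = Net Amount.
    net = fields["net_amount"]["value"]
    if sum(r["line_amount"] for r in rows) != net:
        failures.append("net_amount")

    # Rule 4: Net Amount plus all SubTotalAdditions = Total.
    additions = sum(
        fields[name]["value"]
        for name in SUBTOTAL_ADDITIONS
        if fields.get(name, {}).get("value") is not None
    )
    if net + additions != fields["total"]["value"]:
        failures.append("total")

    # Rule 5: each ConfidenceField meets its own threshold.
    if any(
        name in fields and fields[name]["confidence"] < threshold
        for name, threshold in CONFIDENCE_FIELDS.items()
    ):
        failures.append("field_confidence")

    # Rule 6: every other field meets the shared threshold.
    if any(
        f["confidence"] < OTHER_CONFIDENCE
        for name, f in fields.items()
        if name not in CONFIDENCE_FIELDS
    ):
        failures.append("other_confidence")

    return failures


invoice = {
    "fields": {
        "net_amount": {"value": Decimal("30.00"), "confidence": 0.95},
        "tax": {"value": Decimal("3.00"), "confidence": 0.88},
        "total": {"value": Decimal("33.00"), "confidence": 0.97},
    },
    "rows": [
        {"quantity": Decimal("2"), "unit_price": Decimal("10.00"), "line_amount": Decimal("20.00")},
        {"quantity": Decimal("1"), "unit_price": Decimal("10.00"), "line_amount": Decimal("10.00")},
    ],
}
failed_rules = validate_invoice(invoice)
```

A non-empty result would be the signal to route the invoice to human validation instead of exporting it directly.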
13
Document Understanding Architectures
Currently, Document Understanding is available both in the cloud and on-premises.
[Diagram: deployment options. In Cloud; On-premises (Online or Air-gapped); Hybrid. Additional labels: Using Public Services, Using the Cloud Tenant.]
14
Review/Recap
• Introduction to Document Understanding
• What is Document Understanding?
• Document Types
• Document Processing Methodologies
• Model-Based Versus Rule-Based Extraction
• Document Understanding: Part of an End-to-end Automation Process
• Document Understanding: Typical Workflow
• Classification and Extraction: Success Criteria
• Example: Classification and Extraction: Invoice Post-processing
• Document Understanding Architectures

More Related Content

What's hot

Nasscom AI top 50 use cases
Nasscom AI top 50 use casesNasscom AI top 50 use cases
Nasscom AI top 50 use casesADDI AI 2050
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data sciencebhavesh lande
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceSampath Kumar
 
Analytics Overview #Predictive Analytics
Analytics Overview #Predictive AnalyticsAnalytics Overview #Predictive Analytics
Analytics Overview #Predictive AnalyticsDurga Palakurthy
 
Experimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsExperimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsDatabricks
 
Intelligent Banking: AI cases in Retail and Commercial Banking
Intelligent Banking: AI cases in Retail and Commercial BankingIntelligent Banking: AI cases in Retail and Commercial Banking
Intelligent Banking: AI cases in Retail and Commercial BankingDmitry Petukhov
 
MLOps with Azure DevOps
MLOps with Azure DevOpsMLOps with Azure DevOps
MLOps with Azure DevOpsMarco Parenzan
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Tristan Baker
 
Azure Machine Learning tutorial
Azure Machine Learning tutorialAzure Machine Learning tutorial
Azure Machine Learning tutorialGiacomo Lanciano
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptxSadhanaParameswaran
 
Machine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementMachine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementQuantUniversity
 
AI & Robotic Process Automation (RPA) to Digitally Transform Your Environment
AI & Robotic Process Automation (RPA) to Digitally Transform Your EnvironmentAI & Robotic Process Automation (RPA) to Digitally Transform Your Environment
AI & Robotic Process Automation (RPA) to Digitally Transform Your EnvironmentCprime
 
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesBest Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesEric Kavanagh
 
The Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationThe Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationDatabricks
 
Software Bots as Superheroes in the SPACE of Developer Productivity
Software Bots as Superheroes in the SPACE of Developer ProductivitySoftware Bots as Superheroes in the SPACE of Developer Productivity
Software Bots as Superheroes in the SPACE of Developer ProductivityMargaret-Anne Storey
 
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Edureka!
 
Multidimensional DB design, revolving TPC-H benchmark into OLAP bench
Multidimensional DB design, revolving TPC-H benchmark into OLAP benchMultidimensional DB design, revolving TPC-H benchmark into OLAP bench
Multidimensional DB design, revolving TPC-H benchmark into OLAP benchRim Moussa
 

What's hot (20)

Nasscom AI top 50 use cases
Nasscom AI top 50 use casesNasscom AI top 50 use cases
Nasscom AI top 50 use cases
 
Data Science.pptx
Data Science.pptxData Science.pptx
Data Science.pptx
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Analytics Overview #Predictive Analytics
Analytics Overview #Predictive AnalyticsAnalytics Overview #Predictive Analytics
Analytics Overview #Predictive Analytics
 
Experimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsExperimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOps
 
Intelligent Banking: AI cases in Retail and Commercial Banking
Intelligent Banking: AI cases in Retail and Commercial BankingIntelligent Banking: AI cases in Retail and Commercial Banking
Intelligent Banking: AI cases in Retail and Commercial Banking
 
MLOps with Azure DevOps
MLOps with Azure DevOpsMLOps with Azure DevOps
MLOps with Azure DevOps
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
 
Azure Machine Learning tutorial
Azure Machine Learning tutorialAzure Machine Learning tutorial
Azure Machine Learning tutorial
 
Introduction to data science.pptx
Introduction to data science.pptxIntroduction to data science.pptx
Introduction to data science.pptx
 
Machine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementMachine Learning and AI in Risk Management
Machine Learning and AI in Risk Management
 
AI & Robotic Process Automation (RPA) to Digitally Transform Your Environment
AI & Robotic Process Automation (RPA) to Digitally Transform Your EnvironmentAI & Robotic Process Automation (RPA) to Digitally Transform Your Environment
AI & Robotic Process Automation (RPA) to Digitally Transform Your Environment
 
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesBest Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
 
The Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationThe Hidden Value of Hadoop Migration
The Hidden Value of Hadoop Migration
 
Software Bots as Superheroes in the SPACE of Developer Productivity
Software Bots as Superheroes in the SPACE of Developer ProductivitySoftware Bots as Superheroes in the SPACE of Developer Productivity
Software Bots as Superheroes in the SPACE of Developer Productivity
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
 
Data science
Data scienceData science
Data science
 
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
 
Multidimensional DB design, revolving TPC-H benchmark into OLAP bench
Multidimensional DB design, revolving TPC-H benchmark into OLAP benchMultidimensional DB design, revolving TPC-H benchmark into OLAP bench
Multidimensional DB design, revolving TPC-H benchmark into OLAP bench
 

Similar to Unlocking the Power of UiPath: A Journey into Document Understanding

Who Does What, When, and How for a Divestiture?
Who Does What, When, and How for a Divestiture?Who Does What, When, and How for a Divestiture?
Who Does What, When, and How for a Divestiture?eprentise
 
Dev Dives: Mastering AI-powered Document Understanding
Dev Dives: Mastering AI-powered Document UnderstandingDev Dives: Mastering AI-powered Document Understanding
Dev Dives: Mastering AI-powered Document UnderstandingUiPathCommunity
 
ZoomPowee 2019: INSZoom Product Roadmap 2020 Secrets Revealed
ZoomPowee 2019: INSZoom Product Roadmap 2020 Secrets RevealedZoomPowee 2019: INSZoom Product Roadmap 2020 Secrets Revealed
ZoomPowee 2019: INSZoom Product Roadmap 2020 Secrets RevealedINSZoom
 
Strategic Portfolio Management for IT
Strategic Portfolio Management for ITStrategic Portfolio Management for IT
Strategic Portfolio Management for ITiasaglobal
 
Unleashing the force of AI-powered intelligent document processing
Unleashing the force of AI-powered intelligent document processingUnleashing the force of AI-powered intelligent document processing
Unleashing the force of AI-powered intelligent document processingUiPathCommunity
 
Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP SolutionProvectus
 
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnWHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnRohitKumar639388
 
Beyond Automation: Extracting Actionable Intelligence from Clinical Trials
Beyond Automation: Extracting Actionable Intelligence from Clinical TrialsBeyond Automation: Extracting Actionable Intelligence from Clinical Trials
Beyond Automation: Extracting Actionable Intelligence from Clinical TrialsMontrium
 
A Brief Introduction to Enterprise Architecture
A Brief Introduction to  Enterprise Architecture A Brief Introduction to  Enterprise Architecture
A Brief Introduction to Enterprise Architecture Daljit Banger
 
Anita K Wyatt Resume (01192016)
Anita K Wyatt Resume (01192016)Anita K Wyatt Resume (01192016)
Anita K Wyatt Resume (01192016)Anita Wyatt
 
The ABCs of Clinical Trial Management Systems
The ABCs of Clinical Trial Management SystemsThe ABCs of Clinical Trial Management Systems
The ABCs of Clinical Trial Management SystemsPerficient, Inc.
 
Creating an Enterprise Content Management Strategy
Creating an Enterprise Content Management StrategyCreating an Enterprise Content Management Strategy
Creating an Enterprise Content Management StrategyKaruana Gatimu
 
A Case of Fusion Middleware (IlOUG Tech Days, July 2013)
A Case of Fusion Middleware  (IlOUG Tech Days, July 2013)A Case of Fusion Middleware  (IlOUG Tech Days, July 2013)
A Case of Fusion Middleware (IlOUG Tech Days, July 2013)Lucas Jellema
 
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics OpportunityAIIM International
 
Kiran - Senior_System_Business_Consultant_ETL_DW
Kiran - Senior_System_Business_Consultant_ETL_DWKiran - Senior_System_Business_Consultant_ETL_DW
Kiran - Senior_System_Business_Consultant_ETL_DWNagaRaghuKiranPenuju
 

Similar to Unlocking the Power of UiPath: A Journey into Document Understanding (20)

DU_SERIES_Session1.pdf
DU_SERIES_Session1.pdfDU_SERIES_Session1.pdf
DU_SERIES_Session1.pdf
 
Who Does What, When, and How for a Divestiture?
Who Does What, When, and How for a Divestiture?Who Does What, When, and How for a Divestiture?
Who Does What, When, and How for a Divestiture?
 
Dev Dives: Mastering AI-powered Document Understanding
Dev Dives: Mastering AI-powered Document UnderstandingDev Dives: Mastering AI-powered Document Understanding
Dev Dives: Mastering AI-powered Document Understanding
 
ZoomPowee 2019: INSZoom Product Roadmap 2020 Secrets Revealed
ZoomPowee 2019: INSZoom Product Roadmap 2020 Secrets RevealedZoomPowee 2019: INSZoom Product Roadmap 2020 Secrets Revealed
ZoomPowee 2019: INSZoom Product Roadmap 2020 Secrets Revealed
 
Strategic Portfolio Management for IT
Strategic Portfolio Management for ITStrategic Portfolio Management for IT
Strategic Portfolio Management for IT
 
Unleashing the force of AI-powered intelligent document processing
Unleashing the force of AI-powered intelligent document processingUnleashing the force of AI-powered intelligent document processing
Unleashing the force of AI-powered intelligent document processing
 
Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP Solution
 
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnWHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
 
A case of Fusion Middleware - iLOUG 2013
A case of Fusion Middleware - iLOUG 2013A case of Fusion Middleware - iLOUG 2013
A case of Fusion Middleware - iLOUG 2013
 
Beyond Automation: Extracting Actionable Intelligence from Clinical Trials
Beyond Automation: Extracting Actionable Intelligence from Clinical TrialsBeyond Automation: Extracting Actionable Intelligence from Clinical Trials
Beyond Automation: Extracting Actionable Intelligence from Clinical Trials
 
A Brief Introduction to Enterprise Architecture
A Brief Introduction to  Enterprise Architecture A Brief Introduction to  Enterprise Architecture
A Brief Introduction to Enterprise Architecture
 
Anita K Wyatt Resume (01192016)
Anita K Wyatt Resume (01192016)Anita K Wyatt Resume (01192016)
Anita K Wyatt Resume (01192016)
 
The ABCs of Clinical Trial Management Systems
The ABCs of Clinical Trial Management SystemsThe ABCs of Clinical Trial Management Systems
The ABCs of Clinical Trial Management Systems
 
Creating an Enterprise Content Management Strategy
Creating an Enterprise Content Management StrategyCreating an Enterprise Content Management Strategy
Creating an Enterprise Content Management Strategy
 
A Case of Fusion Middleware (IlOUG Tech Days, July 2013)
A Case of Fusion Middleware  (IlOUG Tech Days, July 2013)A Case of Fusion Middleware  (IlOUG Tech Days, July 2013)
A Case of Fusion Middleware (IlOUG Tech Days, July 2013)
 
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
[AIIM16] How Regulatory Data Can Set the Narrative for an Analytics Opportunity
 
Jgordonres jan262016
Jgordonres jan262016Jgordonres jan262016
Jgordonres jan262016
 
jgordonresJan262016
jgordonresJan262016jgordonresJan262016
jgordonresJan262016
 
Kiran - Senior_System_Business_Consultant_ETL_DW
Kiran - Senior_System_Business_Consultant_ETL_DWKiran - Senior_System_Business_Consultant_ETL_DW
Kiran - Senior_System_Business_Consultant_ETL_DW
 
Resume John Tzanetakis
Resume John TzanetakisResume John Tzanetakis
Resume John Tzanetakis
 

More from DianaGray10

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Women in Automation 2024: Career session - explore career paths in automation
Women in Automation 2024: Career session - explore career paths in automationWomen in Automation 2024: Career session - explore career paths in automation
Women in Automation 2024: Career session - explore career paths in automationDianaGray10
 
Automation Ops Series: Session 3 - Solutions management
Automation Ops Series: Session 3 - Solutions managementAutomation Ops Series: Session 3 - Solutions management
Automation Ops Series: Session 3 - Solutions managementDianaGray10
 
Efficiencies in RPA with UiPath and CyberArk Technologies - Session 2
Efficiencies in RPA with UiPath and CyberArk Technologies - Session 2Efficiencies in RPA with UiPath and CyberArk Technologies - Session 2
Efficiencies in RPA with UiPath and CyberArk Technologies - Session 2DianaGray10
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 2
UiPath Platform: The Backend Engine Powering Your Automation - Session 2UiPath Platform: The Backend Engine Powering Your Automation - Session 2
UiPath Platform: The Backend Engine Powering Your Automation - Session 2DianaGray10
 
Women in Automation 2024: Technical session - Get your career started in auto...
Women in Automation 2024: Technical session - Get your career started in auto...Women in Automation 2024: Technical session - Get your career started in auto...
Women in Automation 2024: Technical session - Get your career started in auto...DianaGray10
 
Unleashing the power of AI in UiPath Studio with UiPath Autopilot.
Unleashing the power of AI in UiPath Studio with UiPath Autopilot.Unleashing the power of AI in UiPath Studio with UiPath Autopilot.
Unleashing the power of AI in UiPath Studio with UiPath Autopilot.DianaGray10
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...DianaGray10
 
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfUiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfDianaGray10
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7DianaGray10
 
UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8DianaGray10
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1DianaGray10
 
Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1DianaGray10
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6DianaGray10
 
UiPath Clipboard AI: "A TIME Magazine Best Invention of 2023 Unveiled"
UiPath Clipboard AI: "A TIME Magazine Best Invention of 2023 Unveiled"UiPath Clipboard AI: "A TIME Magazine Best Invention of 2023 Unveiled"
UiPath Clipboard AI: "A TIME Magazine Best Invention of 2023 Unveiled"DianaGray10
 
UiPath Studio Web workshop series - Day 5
UiPath Studio Web workshop series - Day 5UiPath Studio Web workshop series - Day 5
UiPath Studio Web workshop series - Day 5DianaGray10
 
UiPath Studio Web workshop series - Day 4
UiPath Studio Web workshop series - Day 4UiPath Studio Web workshop series - Day 4
UiPath Studio Web workshop series - Day 4DianaGray10
 
Automation Ops Series: Session 2 - Governance for UiPath projects
Automation Ops Series: Session 2 - Governance for UiPath projectsAutomation Ops Series: Session 2 - Governance for UiPath projects
Automation Ops Series: Session 2 - Governance for UiPath projectsDianaGray10
 
UiPath Studio Web workshop Series - Day 3
UiPath Studio Web workshop Series - Day 3UiPath Studio Web workshop Series - Day 3
UiPath Studio Web workshop Series - Day 3DianaGray10
 
UiPath Studio Web workshop series - Day 2
UiPath Studio Web workshop series - Day 2UiPath Studio Web workshop series - Day 2
UiPath Studio Web workshop series - Day 2DianaGray10
 

More from DianaGray10 (20)

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Women in Automation 2024: Career session - explore career paths in automation
Women in Automation 2024: Career session - explore career paths in automationWomen in Automation 2024: Career session - explore career paths in automation
Women in Automation 2024: Career session - explore career paths in automation
 
Automation Ops Series: Session 3 - Solutions management
Automation Ops Series: Session 3 - Solutions managementAutomation Ops Series: Session 3 - Solutions management
Automation Ops Series: Session 3 - Solutions management
 
Efficiencies in RPA with UiPath and CyberArk Technologies - Session 2
Efficiencies in RPA with UiPath and CyberArk Technologies - Session 2Efficiencies in RPA with UiPath and CyberArk Technologies - Session 2
Efficiencies in RPA with UiPath and CyberArk Technologies - Session 2
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 2
UiPath Platform: The Backend Engine Powering Your Automation - Session 2UiPath Platform: The Backend Engine Powering Your Automation - Session 2
UiPath Platform: The Backend Engine Powering Your Automation - Session 2
 
Women in Automation 2024: Technical session - Get your career started in auto...
Women in Automation 2024: Technical session - Get your career started in auto...Women in Automation 2024: Technical session - Get your career started in auto...
Women in Automation 2024: Technical session - Get your career started in auto...
 
Unleashing the power of AI in UiPath Studio with UiPath Autopilot.
Unleashing the power of AI in UiPath Studio with UiPath Autopilot.Unleashing the power of AI in UiPath Studio with UiPath Autopilot.
Unleashing the power of AI in UiPath Studio with UiPath Autopilot.
 
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
Connector Corner: Extending LLM automation use cases with UiPath GenAI connec...
 
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdfUiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
UiPath Solutions Management Preview - Northern CA Chapter - March 22.pdf
 
UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7UiPath Studio Web workshop series - Day 7
UiPath Studio Web workshop series - Day 7
 
UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8UiPath Studio Web workshop series - Day 8
UiPath Studio Web workshop series - Day 8
 
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1UiPath Platform: The Backend Engine Powering Your Automation - Session 1
UiPath Platform: The Backend Engine Powering Your Automation - Session 1
 
Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1
 
UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6UiPath Studio Web workshop series - Day 6
UiPath Studio Web workshop series - Day 6
 
UiPath Clipboard AI: "A TIME Magazine Best Invention of 2023 Unveiled"
UiPath Clipboard AI: "A TIME Magazine Best Invention of 2023 Unveiled"UiPath Clipboard AI: "A TIME Magazine Best Invention of 2023 Unveiled"
UiPath Clipboard AI: "A TIME Magazine Best Invention of 2023 Unveiled"
 
UiPath Studio Web workshop series - Day 5
UiPath Studio Web workshop series - Day 5UiPath Studio Web workshop series - Day 5
UiPath Studio Web workshop series - Day 5
 
UiPath Studio Web workshop series - Day 4
UiPath Studio Web workshop series - Day 4UiPath Studio Web workshop series - Day 4
UiPath Studio Web workshop series - Day 4
 
Automation Ops Series: Session 2 - Governance for UiPath projects
Automation Ops Series: Session 2 - Governance for UiPath projectsAutomation Ops Series: Session 2 - Governance for UiPath projects
Automation Ops Series: Session 2 - Governance for UiPath projects
 
UiPath Studio Web workshop Series - Day 3
UiPath Studio Web workshop Series - Day 3UiPath Studio Web workshop Series - Day 3
UiPath Studio Web workshop Series - Day 3
 
UiPath Studio Web workshop series - Day 2
UiPath Studio Web workshop series - Day 2UiPath Studio Web workshop series - Day 2
UiPath Studio Web workshop series - Day 2
 

Unlocking the Power of UiPath: A Journey into Document Understanding

  • 1. Unlocking the Power of UiPath: A Journey into Document Understanding. UiPath St. Louis Chapter
  • 2. A little bit about me…
      • Ian Wickline; Senior Trainer, Partner Enablement
      • 20+ Year Information Technology Professional
      • Adjunct Professor: Computer Science, Information Technology, and Business Management
      • Extensive Software Development Background
      • Mainframe Developer
      • Modern Day Web Development
      • Various Data Analytics Roles
      • Most Recently: UiPath!
      • Lead Automation Engineer
      • Developed and implemented numerous UiPath-based production automations
      • Solution Architecture for many more UiPath-based production automations
      • Development of a 5-day training program
  • 3. Agenda
      • Introduction to Document Understanding
      • What is Document Understanding?
      • Document Types
      • Document Processing Methodologies
      • Model-Based Versus Rule-Based Extraction
      • Document Understanding: Part of an End-to-end Automation Process
      • Document Understanding: Typical Workflow
      • Classification and Extraction: Success Criteria
      • Example: Classification and Extraction: Invoice Post-processing
      • Document Understanding Architectures
  • 4. Introduction to Document Understanding: Get your documents processed intelligently. Teach your robots to understand documents using rules, templates, or skills enhanced by Artificial Intelligence (AI) for data extraction and interpretation. Drag and drop these capabilities directly into your Robotic Process Automation (RPA) workflows to combine AI and RPA.
  • 5. What Is Document Understanding? Document Understanding is the ability to extract and interpret information and meaning from a wide range of documents. It emerges at the intersection of document processing, Artificial Intelligence (AI), and Robotic Process Automation (RPA).
  • 6. Document Types
      ▪ Structured: required information found in the same place; fixed in format. Examples: forms, passports, licenses, and time sheets containing handwritten text, signatures, checkboxes.
      ▪ Semi-structured: repetitive information each time; found in fixed and variable document parts such as tables. Examples: invoices, receipts, purchase orders, medical bills, bank statements, utility bills.
      ▪ Unstructured: no fixed format. Examples: contracts, agreements, emails, disease descriptions, drug prescriptions, news, voice scripts.
  • 7. Document Processing Methodologies
      ▪ Rule-based: RegEx Based Extractor (structured fields, mostly used for structured documents); Form Extractor (mostly structured documents, tables, checkboxes, handwriting, signatures)
      ▪ Model-based: Forms AI (most structured documents, i.e. forms); Machine Learning Extractor (mostly semi-structured documents)
      ▪ Hybrid: a combination of both rule-based and model-based extractors (mostly documents combining both structured and less structured formats)
  • 8. Model-based Versus Rule-based Extraction
      Rule-based extraction (template-based approach)
      ▪ Relies on rules or templates.
      ▪ Processes documents in a fixed format for structured data like forms or licenses.
      ▪ Has high accuracy for already known documents.
      ▪ Requires extra costs for the addition of templates or rules and ongoing maintenance.
      ▪ Is unable to work on unknown documents.
      Model-based extraction (template-less approach)
      ▪ Relies on ML models.
      ▪ Processes less structured documents with varying layouts, such as invoices or receipts.
      ▪ Understands even the most obscure and complex documents.
      ▪ Requires pre-trained ML models with further retraining capabilities.
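The rule-based trade-off on this slide can be illustrated with a minimal Python sketch. It uses only the standard `re` module; the field names, patterns, and sample texts are hypothetical and are not taken from the UiPath RegEx Based Extractor. The point is only that hand-written rules work well on a known layout and return nothing on an unknown one.

```python
import re

# Hypothetical rule set for one known invoice layout. Field names and patterns
# are illustrative assumptions, not the UiPath RegEx Based Extractor itself.
RULES = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:No\.?|#)\s*:?\s*([A-Z0-9-]+)", re.IGNORECASE
    ),
    "total": re.compile(
        r"Total\s*:?\s*\$?\s*([0-9][0-9,]*\.[0-9]{2})", re.IGNORECASE
    ),
}


def extract_with_rules(text: str) -> dict:
    """Return the first match for each rule, or None when a rule does not match."""
    result = {}
    for field_name, pattern in RULES.items():
        match = pattern.search(text)
        result[field_name] = match.group(1) if match else None
    return result


# A layout the rules were written for, and one they have never seen.
known_layout = "Invoice No: INV-1042\nTotal: $1,250.00"
unknown_layout = "Reference INV-1042, amount payable 1,250.00 USD"

print(extract_with_rules(known_layout))
print(extract_with_rules(unknown_layout))
```

Supporting the second layout would mean adding and maintaining more patterns, which is the ongoing cost the slide attributes to the template-based approach.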
  • 9. Document Understanding: Part of an End-to-end Automation Process
      Document Understanding processes aren't standalone. They are a part of bigger business processes to be automated. Here is the architecture for an end-to-end business process involving Document Understanding.
      ▪ "Upstream" automation, prior to Document Understanding: acts as dispatcher for the Document Understanding process.
      ▪ Document Understanding process: runs one job for each target file; acts as dispatcher for the subsequent processes.
      ▪ "Downstream" automation, post Document Understanding: utilizes extracted data.
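As a rough illustration of the dispatcher pattern described on this slide, the sketch below uses two in-memory queues. The function names and the payload shape are hypothetical; a real deployment would more likely use queues provided by the automation platform than Python's `collections.deque`.

```python
from collections import deque


def upstream_dispatch(files: list, du_queue: deque) -> None:
    """Upstream automation: add one work item per target file."""
    for path in files:
        du_queue.append({"file": path})


def document_understanding_job(item: dict) -> dict:
    """One Document Understanding job for one file (extraction is stubbed)."""
    return {"file": item["file"], "fields": {}}


def run(files: list) -> list:
    du_queue, downstream_queue = deque(), deque()
    upstream_dispatch(files, du_queue)
    # Each file is processed in its own job; the result is dispatched onward
    # so the downstream automation can use the extracted data.
    while du_queue:
        downstream_queue.append(document_understanding_job(du_queue.popleft()))
    return list(downstream_queue)


results = run(["invoice_a.pdf", "invoice_b.pdf"])
print([r["file"] for r in results])
```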
  • 10. Document Understanding Typical Workflow
      ▪ Load taxonomy: define document types and fields for processing.
      ▪ Digitize documents using Optical Character Recognition (OCR) to make them machine-readable.
      ▪ Classify and split the files into document types.
      ▪ Extract information from the documents.
      ▪ Export the extracted data for further usage.
      ▪ Train classifiers based on the validated data.
      ▪ Train extractors based on the validated data.
      ▪ Validate classification results (human review).
      ▪ Validate extraction results (human review).
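The workflow stages can be sketched as a small Python pipeline. Every function here is a hypothetical stub, and the stage order (validation directly after classification and after extraction, with validated data collected for training) is an assumption about how the stages connect; the slide lists the stages without fixing an order.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    path: str
    text: str = ""
    doc_type: str = ""
    fields: dict = field(default_factory=dict)


def load_taxonomy() -> dict:
    # Document types mapped to the fields to extract (illustrative values).
    return {"Invoice": ["invoice_number", "total"]}


def digitize(doc: Document) -> None:
    doc.text = f"<OCR text of {doc.path}>"  # stand-in for a real OCR engine


def classify(doc: Document, taxonomy: dict) -> None:
    doc.doc_type = next(iter(taxonomy))  # stub: always picks the first type


def validate_classification(doc: Document) -> None:
    pass  # stub: the human reviewer accepts the result unchanged


def extract(doc: Document, taxonomy: dict) -> None:
    doc.fields = {name: None for name in taxonomy[doc.doc_type]}  # stub values


def validate_extraction(doc: Document) -> None:
    pass  # stub: the human reviewer accepts the result unchanged


def export(doc: Document) -> dict:
    return {"type": doc.doc_type, "fields": doc.fields}


def process(path: str, training_data: list) -> dict:
    taxonomy = load_taxonomy()
    doc = Document(path)
    digitize(doc)
    classify(doc, taxonomy)
    validate_classification(doc)
    training_data.append(("classifier", doc.doc_type))  # validated data
    extract(doc, taxonomy)
    validate_extraction(doc)
    training_data.append(("extractor", dict(doc.fields)))  # validated data
    return export(doc)


training = []
print(process("invoice.pdf", training))
```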
  • 11. Classification and Extraction Success Criteria
      ▪ After both the classification and extraction steps, there are dedicated workflows wherein you can implement the business rule validation.
      ▪ Based on the outcome of the validation workflows, a decision is taken whether the document will be sent for human validation or not.
      ▪ If no logic is implemented, the documents will be sent, by default, for human validation.
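The routing decision described above can be sketched as follows. The function name, the document shape, and the 0.90 threshold are hypothetical; only the default behaviour (no logic implemented means human validation) comes from the slide.

```python
from typing import Callable, Optional


def needs_human_validation(
    document: dict,
    business_rules: Optional[Callable[[dict], bool]] = None,
) -> bool:
    """Decide whether a document is routed to a human reviewer."""
    if business_rules is None:
        # No validation logic implemented: send to human validation by default.
        return True
    # The rules return True when the document passes automatically.
    return not business_rules(document)


def confident_classification(document: dict) -> bool:
    # Hypothetical rule: accept when classification confidence is at least 0.90.
    return document["classification_confidence"] >= 0.90


doc = {"classification_confidence": 0.95}
print(needs_human_validation(doc))                            # no rules supplied
print(needs_human_validation(doc, confident_classification))  # rules supplied
```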
  • 12. Invoice Post-processing
      These rules should NOT be used as-is, except for demo purposes. For a real implementation, post-processing and validation should be tailored to the specifics of the business process.
      For the invoice document type, a standard workflow is available that validates six business rules.
      ▪ Verify that all mandatory fields and columns are extracted.
      ▪ Verify that all table rows match the rule: Quantity * Unit Price = Line Amount.
      ▪ Verify that the sum of all Line Amounts = Net Amount.
      ▪ Verify the sum of the Net Amount and all fields defined as SubTotalAdditions = Total.
      ▪ Verify the extraction confidence of all defined 'ConfidenceFields' against their individual confidence thresholds.
      ▪ Verify the extraction confidence of all the other fields against the 'other-Confidence' threshold.
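The six rules can be expressed compactly in Python. This is a sketch under stated assumptions, not the standard UiPath workflow: the data layout, the field names (`net_amount`, `tax`, and so on), and the threshold values are all hypothetical, and amounts are compared exactly with `Decimal`, whereas a real process may need a rounding tolerance.

```python
from decimal import Decimal

# All names and thresholds below are illustrative assumptions.
MANDATORY_FIELDS = ["net_amount", "total"]
MANDATORY_COLUMNS = ["quantity", "unit_price", "line_amount"]
SUBTOTAL_ADDITIONS = ["tax", "shipping"]
CONFIDENCE_FIELDS = {"total": 0.90}  # per-field confidence thresholds
OTHER_CONFIDENCE = 0.70              # threshold for every other field


def validate_invoice(invoice: dict) -> list:
    """Return the names of the failed rules; an empty list means all passed."""
    fields, rows = invoice["fields"], invoice["rows"]

    # Rule 1: all mandatory fields and columns are extracted.
    missing_field = any(
        fields.get(name, {}).get("value") is None for name in MANDATORY_FIELDS
    )
    missing_column = any(
        row.get(col) is None for row in rows for col in MANDATORY_COLUMNS
    )
    if missing_field or missing_column or not rows:
        return ["mandatory_fields"]  # the arithmetic rules need these values

    failed = []
    # Rule 2: Quantity * Unit Price = Line Amount for every row.
    if any(r["quantity"] * r["unit_price"] != r["line_amount"] for r in rows):
        failed.append("line_amounts")
    # Rule 3: the sum of all Line Amounts = Net Amount.
    net = fields["net_amount"]["value"]
    if sum((r["line_amount"] for r in rows), Decimal("0")) != net:
        failed.append("net_amount")
    # Rule 4: Net Amount + SubTotalAdditions = Total.
    additions = sum(
        (fields[n]["value"] for n in SUBTOTAL_ADDITIONS if n in fields),
        Decimal("0"),
    )
    if net + additions != fields["total"]["value"]:
        failed.append("total")
    # Rule 5: ConfidenceFields against their individual thresholds.
    if any(
        fields[n]["confidence"] < t
        for n, t in CONFIDENCE_FIELDS.items()
        if n in fields
    ):
        failed.append("confidence_fields")
    # Rule 6: all other fields against the shared threshold.
    if any(
        f["confidence"] < OTHER_CONFIDENCE
        for n, f in fields.items()
        if n not in CONFIDENCE_FIELDS
    ):
        failed.append("other_confidence")
    return failed


invoice = {
    "fields": {
        "net_amount": {"value": Decimal("30.00"), "confidence": 0.92},
        "tax": {"value": Decimal("3.00"), "confidence": 0.88},
        "total": {"value": Decimal("33.00"), "confidence": 0.97},
    },
    "rows": [
        {"quantity": Decimal("2"), "unit_price": Decimal("10.00"), "line_amount": Decimal("20.00")},
        {"quantity": Decimal("1"), "unit_price": Decimal("10.00"), "line_amount": Decimal("10.00")},
    ],
}
print(validate_invoice(invoice))
```

A document with an empty result could skip human validation; any failed rule would route it to a reviewer.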
  • 13. Document Understanding Architectures: Currently, Document Understanding is available both in cloud and on-premises. Diagram labels: Online; Air-gapped; Using Public Services; Using the Cloud Tenant; In Cloud; On-premises; Hybrid.
  • 14. Review/Recap
      • Introduction to Document Understanding
      • What is Document Understanding?
      • Document Types
      • Document Processing Methodologies
      • Model-Based Versus Rule-Based Extraction
      • Document Understanding: Part of an End-to-end Automation Process
      • Document Understanding: Typical Workflow
      • Classification and Extraction: Success Criteria
      • Example: Classification and Extraction: Invoice Post-processing
      • Document Understanding Architectures