Web data extraction
by Deskover Software
www.deskover.com
Overview
A lot of data is available online these days but you don't usually have
access to a web API to extract it.
Finance and stock info, real estate data, product catalogs, search
engine results, social network feeds: all come in various formats.
This is where UiPath comes to the rescue. With its powerful (and yet easy
to use) new workflow-based technology, it can automatically
recognize the repeating patterns inside structured and tabular data.
With no programming skills required, you can easily save web data in
your favorite format: Excel, CSV, text or even directly deliver it to a
database or web-service.
This tutorial
• We are going to explore the web data extraction feature in the UiPath
product and extract catalog data from an online retail store.
• First we'll do some web automation to get to the catalog web page.
• Then we'll use the "Extract Data" wizard to automatically recognize the
repetitive patterns in the data structure.
• Optionally, we can add Excel activities to save the data in XLS
format.
• Finally, we'll run the workflow to actually get the data.
Prerequisite step - web automation
The data you want to extract is usually not on the first page of the web site.
You'll have to perform some login, search, and navigation steps first.
Fortunately, all these steps are generated automatically by the UiPath web
recorder.
All your actions have been recorded and
web activities were added to the workflow.
Now it's time to start the Extract Web Data
wizard.
Extract web data wizard
Follow the self-explanatory steps in order
for the wizard to automatically recognize
patterns in your data structure.
You have to select two similar objects for
each piece of data you're interested in (like
product name and price).
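Why two similar objects are enough can be illustrated with a small sketch. This is not UiPath's actual algorithm, only one plausible way to generalize from two samples: the element paths and the `generalize` function are hypothetical, and the sketch assumes each selected object is identified by an XPath-like path with numeric indexes.

```python
import re

def generalize(path_a: str, path_b: str) -> str:
    """Return a path pattern matching both samples.

    Steps that are identical are kept as-is; where the two samples
    differ only by their numeric index, the index is dropped so the
    step matches every sibling with that tag.
    """
    steps_a = path_a.strip("/").split("/")
    steps_b = path_b.strip("/").split("/")
    if len(steps_a) != len(steps_b):
        raise ValueError("samples sit at different depths")

    pattern = []
    for a, b in zip(steps_a, steps_b):
        if a == b:
            pattern.append(a)
            continue
        # Strip a trailing "[n]" index and compare the bare tag names.
        tag_a = re.sub(r"\[\d+\]$", "", a)
        tag_b = re.sub(r"\[\d+\]$", "", b)
        if tag_a != tag_b:
            raise ValueError("samples are not similar objects")
        pattern.append(tag_a)
    return "/" + "/".join(pattern)


# Two product names picked from the first and second catalog rows.
first = "/html/body/ul/li[1]/span[1]"
second = "/html/body/ul/li[2]/span[1]"
print(generalize(first, second))
```

The resulting pattern keeps `span[1]` (the name column) fixed and lets the `li` row vary, which is the repeating part of the catalog.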
For each field you can choose the name
and the information to be extracted (like
text or url).
You can extract as many related fields as
you like.
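The choice between extracting text and extracting a URL can be made concrete with a short sketch outside UiPath. It uses only Python's standard `html.parser` module; the sample markup, the `LinkFieldParser` class and the `extract_links` helper are assumptions made for illustration, not part of the product.

```python
from html.parser import HTMLParser

class LinkFieldParser(HTMLParser):
    """Collect two related fields per link: its text and its URL."""

    def __init__(self):
        super().__init__()
        self.records = []
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.records.append(
                {"text": "".join(self._text).strip(), "url": self._href}
            )
            self._href = None


def extract_links(html: str) -> list:
    parser = LinkFieldParser()
    parser.feed(html)
    parser.close()
    return parser.records


sample = (
    '<ul>'
    '<li><a href="/p/1">Red Mug</a></li>'
    '<li><a href="/p/2">Blue Mug</a></li>'
    '</ul>'
)
for row in extract_links(sample):
    print(row["text"], row["url"])
```

Each record holds both fields for the same element, which mirrors how related fields end up as columns of one row.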
At each step the fields will be highlighted inside the web
page so you can check if the wizard correctly detected the
data pattern.
One more check to see if everything is OK and you're almost done!
Save the extracted data
By default the Extract Web Data wizard adds a CSV activity to save
the data. You can choose an Excel activity instead and specify
a .xls file location.
The connection between the output of the Extract Data activity and
the Excel activity is created automatically. The output is a DataTable,
which can also be passed to database or web service workflow
activities.
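For readers who want to see what saving tabular output to CSV amounts to, here is a minimal sketch outside UiPath. It assumes the extracted table is represented as a list of dictionaries, one per row; the field names and the `rows_to_csv` helper are hypothetical.

```python
import csv
import io

def rows_to_csv(rows, fieldnames):
    """Serialize rows (a list of dicts) to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


catalog = [
    {"name": "Red Mug", "price": "9.99"},
    {"name": "Mug, large", "price": "12.50"},
]
print(rows_to_csv(catalog, ["name", "price"]))
```

Values that contain a comma are quoted by the `csv` module, so the file stays readable by Excel and other tools.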
Run the workflow
You can launch the workflow from the UiPath designer or you may
invoke it from any .NET language (like C# or VB.NET).
At runtime, a browser is opened and all recorded web automation
steps are performed. The web data is extracted and an Excel
workbook is created.
If the data spreads across multiple pages, it is possible to specify a
repeat step so all the pages will be scraped.
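The idea behind a repeat step can be sketched as a simple loop. This is an illustration only, not how UiPath implements it: `fetch_page` is a hypothetical callable that returns the rows of one page together with a reference to the next page (or `None` on the last page), and `max_pages` is an assumed safety limit.

```python
def scrape_all(fetch_page, first_page, max_pages=100):
    """Collect rows from consecutive pages until there is no next page.

    fetch_page(page) must return a (rows, next_page) tuple, where
    next_page is None once the last page has been reached.
    """
    rows = []
    page = first_page
    visited = 0
    while page is not None and visited < max_pages:
        page_rows, page = fetch_page(page)
        rows.extend(page_rows)
        visited += 1
    return rows


# A fake three-page catalog standing in for a real site.
fake_site = {
    1: (["Red Mug", "Blue Mug"], 2),
    2: (["Green Mug"], 3),
    3: (["Black Mug"], None),
}
print(scrape_all(lambda page: fake_site[page], first_page=1))
```

The page limit guards against a site whose "next" link never ends or loops back to an earlier page.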
Thanks!
Thank you for your time.
Please feel free to download the UiPath trial from:
http://www.uipath.com/
