Web data extraction
by Deskover Software
www.deskover.com
Overview
A lot of data is available online these days, but you don't usually have
access to a web API to extract it.
Finance and stock info, real estate data, product catalogs, search
engine results, and social network feeds all come in various formats.
This is where UiPath comes to the rescue. With its powerful (yet easy
to use) new workflow-based technology, it can automatically
recognize the repeating patterns inside structured and tabular data.
With no programming skills required, you can easily save web data in
your favorite format (Excel, CSV, or text) or even deliver it directly to a
database or web service.
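To make "repeating patterns" concrete, here is a minimal Python sketch of the same idea done by hand. It is not how UiPath works internally (the slides don't describe that); it only shows what it means to pull one record out of each repeated element. The sample markup and the class names `product`, `name`, and `price` are assumptions invented for this illustration.

```python
from html.parser import HTMLParser

# Hypothetical catalog markup; the class names are invented for this sketch.
SAMPLE_HTML = """
<ul>
  <li class="product"><span class="name">Kettle</span><span class="price">19.99</span></li>
  <li class="product"><span class="name">Toaster</span><span class="price">24.50</span></li>
</ul>
"""


class ProductParser(HTMLParser):
    """Collects one dict per repeated <li class="product"> element."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._field = None  # name of the field currently being read, if any

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "li" and css_class == "product":
            # A new repeated item starts: open a new record.
            self.rows.append({})
        elif tag == "span" and css_class in ("name", "price") and self.rows:
            self._field = css_class

    def handle_data(self, data):
        # Only keep text that sits inside a field we are interested in.
        if self._field is not None:
            self.rows[-1][self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None


def extract_products(html):
    parser = ProductParser()
    parser.feed(html)
    return parser.rows


rows = extract_products(SAMPLE_HTML)
```

Each repeated list item becomes one row with the same fields, which is the tabular shape the rest of this tutorial works with.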
This tutorial
• We are going to explore the web data extraction feature in the UiPath
product by extracting catalog data from an online retail store.
• First we'll do some web automation to get to the catalog web page.
• Then we'll use the "Extract Data" wizard to automatically recognize the
repetitive patterns in the data structure.
• Optionally, you can add Excel activities to save the data in XLS
format.
• Finally, we'll run the workflow to actually get the data.
Prerequisite step - web automation
The data you want to extract is usually not on the first page of the website.
You'll have to perform some login, search, and navigation steps first.
Fortunately, all these steps are automatically generated by the UiPath web
recorder.
All your actions have been recorded and
web activities have been added to the workflow.
Now it's time to start the Extract Web Data
wizard.
Extract web data wizard
Follow the self-explanatory steps in order
for the wizard to automatically recognize
patterns in your data structure.
You have to select two similar objects for
each piece of data you're interested in (like
product name and price).
For each field you can choose the name
and the information to be extracted (like
text or URL).
You can extract as many related fields as
you like.
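Why two similar objects? One example only identifies a single element, while two examples show which part of its position varies from item to item. The slides don't say how the wizard does this internally, so the Python sketch below is only one plausible way to think about it. The path representation (a list of tag and index pairs) and the function names `generalize` and `matches` are assumptions made for this illustration.

```python
def generalize(path_a, path_b):
    """Return a pattern matching both paths; None marks an index that varies."""
    if len(path_a) != len(path_b):
        raise ValueError("examples must sit at the same depth")
    pattern = []
    for (tag_a, idx_a), (tag_b, idx_b) in zip(path_a, path_b):
        if tag_a != tag_b:
            raise ValueError("examples must share the same tag structure")
        # Keep the index where both examples agree, wildcard it otherwise.
        pattern.append((tag_a, idx_a if idx_a == idx_b else None))
    return pattern


def matches(pattern, path):
    """Check whether an element path fits the generalized pattern."""
    if len(pattern) != len(path):
        return False
    return all(
        p_tag == tag and (p_idx is None or p_idx == idx)
        for (p_tag, p_idx), (tag, idx) in zip(pattern, path)
    )


# Hypothetical paths to the product name in the first and second catalog item.
first_name = [("ul", 0), ("li", 0), ("span", 0)]
second_name = [("ul", 0), ("li", 1), ("span", 0)]

name_pattern = generalize(first_name, second_name)
```

Here the two examples differ only in the list item index, so the pattern matches the name in every item but not the price, which sits in a different span.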
At each step the fields will be highlighted inside the web
page so you can check if the wizard correctly detected the
data pattern.
One more check to see if everything is OK and you're almost done!
Save the extracted data
By default, the Extract Web Data wizard adds a CSV activity to save
the data. You can choose an Excel activity instead and specify
an .xls file location.
The connection between the output of the Extract Data activity and
the Excel activity is created automatically. The output is a DataTable,
which can also be passed to database or web service workflow
activities.
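For readers curious about what saving tabular data as CSV involves, here is a minimal Python sketch using the standard `csv` module. It is not the UiPath CSV activity; the function name `rows_to_csv` and the sample rows are assumptions made for this illustration, with a list of dictionaries standing in for the DataTable.

```python
import csv
import io


def rows_to_csv(rows, fieldnames):
    """Serialize a list of dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical extracted rows, standing in for the DataTable.
rows = [
    {"name": "Kettle", "price": "19.99"},
    {"name": "Toaster", "price": "24.50"},
]

csv_text = rows_to_csv(rows, ["name", "price"])
```

To write a file instead of building a string, you could pass a file opened with `open(path, "w", newline="")` to `csv.DictWriter` in place of the buffer.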
Run the workflow
You can launch the workflow from the UiPath designer, or you can
invoke it from any .NET language (like C# or VB.NET).
At runtime, a browser is opened and all recorded web automation
steps are performed. The web data is extracted and an Excel
workbook is created.
If the data spreads across multiple pages, you can specify a
repeat step so that all the pages are scraped.
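The repeat step boils down to a simple loop: extract the current page, follow the link to the next page, and stop when there is none. The Python sketch below illustrates that loop under stated assumptions. The in-memory `PAGES` dictionary and the `fetch_page` function are hypothetical stand-ins for a real site and a real browser, and `max_pages` is a safety limit added for this example.

```python
# Hypothetical site: each page holds some rows and the key of the next page.
PAGES = {
    "page1": {"rows": ["Kettle", "Toaster"], "next": "page2"},
    "page2": {"rows": ["Blender"], "next": "page3"},
    "page3": {"rows": ["Mixer"], "next": None},
}


def fetch_page(key):
    """Stand-in for loading a page in the browser."""
    return PAGES[key]


def scrape_all(start, max_pages=100):
    """Collect rows from every page, following 'next' until it runs out."""
    rows = []
    key = start
    visited = 0
    while key is not None and visited < max_pages:
        page = fetch_page(key)
        rows.extend(page["rows"])
        key = page["next"]
        visited += 1
    return rows


all_rows = scrape_all("page1")
```

The page limit guards against a site whose "next" links form a cycle, which would otherwise keep the loop running forever.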
Thanks!
Thank you for your time.
Please feel free to download the UiPath trial from:
http://www.uipath.com/
