Computers & Technology :: Extracting Data from an AJAX-enabled Web Site
AJAX is what makes web sites interactive. It stands for Asynchronous JavaScript and XML. Built
on the XMLHttpRequest object, it allows a web page to contact the server and retrieve data
(referred to here as an "AJAX callback") without reloading the page. In one simplified example, it
stops that flicker when you hit the "Submit" button, giving the website a more streamlined feel.
Because of this, web sites have become more complicated, and so has web mining. Visual Web Ripper
can still work through this process, although you need to know how to navigate its options.
How To
Before starting, you must tell Visual Web Ripper (VWR) what changes will occur on the web page. To
keep data from being extracted prematurely, VWR needs to wait for the callback to finish.
To build a working template for an AJAX site, go to Options and click the AJAX or
JavaScript radio button. If necessary, use the Wait for element drop-down box to select the proper
element; otherwise the default Wait Element will be the first content element in the template.
Unless told otherwise, VWR handles an AJAX action in three steps:
1. Clicks on the selected link or web form button.
2. Waits for the Wait Element and/or its container to change. This step completes
automatically if the content doesn't exist.
3. Waits for the Wait Element to appear on the web page.
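The three steps can be sketched in plain Python to make the waiting logic concrete. This is only an illustration under stated assumptions, not VWR's own code or API: `wait_until`, `click_and_wait`, and `FakePage` are hypothetical names, and the "page" is a small object that updates its content a moment after the click, the way an AJAX callback would.

```python
import threading
import time
from typing import Callable, Optional


def wait_until(condition: Callable[[], bool], timeout: float = 2.0,
               interval: float = 0.01) -> bool:
    """Poll `condition` until it returns True or `timeout` seconds pass."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if condition():
            return True
        time.sleep(interval)
    return condition()


def click_and_wait(click: Callable[[], None],
                   read_element: Callable[[], Optional[str]],
                   timeout: float = 2.0) -> Optional[str]:
    """Hypothetical helper mirroring the three steps.

    `read_element` returns the wait element's current content,
    or None if the element is not on the page.
    """
    before = read_element()
    click()                                   # step 1: trigger the callback
    if before is not None:                    # step 2: skipped if nothing existed
        wait_until(lambda: read_element() != before, timeout)
    wait_until(lambda: read_element() is not None, timeout)  # step 3
    return read_element()


class FakePage:
    """Stand-in for a web page whose content changes 50 ms after a click."""

    def __init__(self) -> None:
        self.content: Optional[str] = "page 1"

    def click_next(self) -> None:
        threading.Timer(0.05, self._finish).start()

    def _finish(self) -> None:
        self.content = "page 2"


page = FakePage()
result = click_and_wait(page.click_next, lambda: page.content)
print(result)  # prints "page 2"
```

Reading the content immediately after the click would still return "page 1", which is the premature extraction the wait is meant to prevent.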
Troubleshooting:
Many web sites show a transitional message (e.g. "Loading...") to tell users that content is being
loaded. VWR will treat this message as the change and extract it as data. To counter this, use a
Wait Script.
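The idea behind such a wait can be shown with a short Python sketch. It does not use VWR's Wait Script interface, which is not described here; `wait_for_real_content` and its parameters are hypothetical names chosen for illustration. The function keeps polling until the element's text is something other than a known placeholder.

```python
import time
from typing import Callable, Iterable


def wait_for_real_content(read_text: Callable[[], str],
                          placeholders: Iterable[str] = ("", "Loading..."),
                          timeout: float = 2.0,
                          interval: float = 0.01) -> str:
    """Poll until the element's text is not one of the known placeholders."""
    ignored = {p.strip().lower() for p in placeholders}
    deadline = time.monotonic() + timeout
    while True:
        text = read_text().strip()
        if text.lower() not in ignored:
            return text                      # real content has arrived
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"still showing placeholder {text!r} after {timeout}s")
        time.sleep(interval)


# Simulated element: the first two reads return the placeholder, then the data.
states = iter(["Loading...", "Loading...", "42 results found"])
text = wait_for_real_content(lambda: next(states))
print(text)  # prints "42 results found"
```

A plain "wait for any change" check would have stopped at the first "Loading..." value; comparing against a list of placeholders is what avoids extracting it.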
Sometimes an AJAX callback results in no change, either to the entire web page or to content
that was preloaded. If either scenario arises, check the appropriate box(es): Wait is optional
and/or Optional wait applies only to the first link in a list.
All AJAX is JavaScript, but not all JavaScript is AJAX. If the code doesn't dynamically change content,
it is plain JavaScript, so the Wait is optional and/or Optional wait applies only to the first link
in a list boxes need to be checked.
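One plausible reading of the two checkboxes is sketched below in Python. This is an assumption about the behavior, not VWR's implementation: `wait_for_change` and `should_be_optional` are hypothetical names. A mandatory wait fails when nothing changes, while an optional wait gives up quietly and lets extraction continue.

```python
import time
from typing import Callable


def wait_for_change(read: Callable[[], str], before: str,
                    timeout: float = 0.2, optional: bool = False,
                    interval: float = 0.01) -> bool:
    """Return True if the content changed within `timeout` seconds.

    If it did not, return False when the wait is optional,
    otherwise raise TimeoutError.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if read() != before:
            return True
        time.sleep(interval)
    if optional:
        return False
    raise TimeoutError("content did not change")


def should_be_optional(index: int, optional: bool, first_link_only: bool) -> bool:
    """Assumed rule: with first_link_only set, only link 0 may skip the wait."""
    return optional and (index == 0 or not first_link_only)


# Simulated tab list: "tab A" is preloaded, so clicking it changes nothing.
content = {"value": "tab A"}
results = []
for i, tab in enumerate(["tab A", "tab B"]):
    before = content["value"]
    content["value"] = tab               # the "click" (synchronous for simplicity)
    changed = wait_for_change(lambda: content["value"], before,
                              optional=should_be_optional(i, True, True))
    results.append((tab, changed))

print(results)  # prints [('tab A', False), ('tab B', True)]
```

Without the optional flag, the first iteration would raise `TimeoutError`, because the preloaded tab never changes.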
VWR can see all content on the page, including hidden content, so no separate AJAX action template
is necessary. Switch to browser mode to find what you need.
AJAX is often used to load content that appears after the page has first rendered. In this scenario,
go to Options, click Misc, and check Wait for element.
Iframes are tricky, since they look like AJAX callbacks to VWR. Use the Keep loading webpage until
manual stop button on the toolbar to work around this issue.
Though AJAX sites can present hurdles when doing web mining, combining a powerful extraction
tool such as Visual Web Ripper with the knowledge of how to navigate through it puts the target
data within reach.
