21

Impulse Technologies
Beacons U to World of technology
044-42133143, 98401 03301,9841091117 ieeeprojects@yahoo.com www.impulse.net.in
Pre-Query Discovery of Domain-specific Query Forms: A Survey
Abstract
The discovery of HTML query forms is one of the main challenges in Deep
Web crawling. Automatic solutions for this problem perform two main tasks. The
first is locating HTML forms on the Web, which is done through the use of
traditional/focused crawlers. The second is identifying which of these forms are
indeed meant for querying, which also typically involves determining a domain for
the underlying data source (and thus for the form as well). This problem has
attracted a great deal of interest, resulting in a long list of algorithms and
techniques. Some of these submit requests through the form and then analyze the
data retrieved in response, typically requiring a great deal of knowledge about the
domain as well as semantic processing. Others do not employ form submission, to
avoid such difficulties, although some techniques rely to some extent on semantics
and domain knowledge. This survey gives an up-to-date review of methods for the
discovery of domain-specific query forms that do not involve form submission. We
detail these methods and discuss how form discovery has become increasingly
more automated over time. We conclude with a forecast of what we believe are the
immediate next steps in this trend.

Your Own Ideas or Any project from any company can be Implemented
at Better price (All Projects can be done in Java or DotNet whichever the student wants)
1

21

Recommended

Recommended

More Related Content

Similar to 21

Similar to 21 (20)

More from Technology_solution

More from Technology_solution (20)

Recently uploaded

Recently uploaded (20)

21