This document describes an unsupervised approach to deduce schema and extract data from template web pages. It first checks if two input web pages are visually similar by comparing their visual block trees. For similar pages, it uses a tree merging algorithm to combine their DOM trees into a pattern tree to detect the schema. It also removes noise blocks and detects the schema to compare the time taken. The algorithm builds visual trees, checks visual similarity, applies tree merging on the DOM trees with or without noise removal, identifies the schema, and extracts data by matching the pattern tree to the HTML tree.