20091006meeting


  1. 1. Progress Report (Che-Min Liao)
  2. 2. Data Collection <ul><li>Using two similar web pages, list the features of every possible peer-node pair at each level. </li></ul><ul><ul><li>Node1.Path1 = Node2.Path2 </li></ul></ul><ul><ul><li>IE MSHTML Engine </li></ul></ul><ul><ul><li>Ignore SCRIPT, NOSCRIPT, #comment, IFRAME </li></ul></ul><ul><ul><li>Ignore leaf nodes (#text, IMG) </li></ul></ul>
  3. 3. Feature Categories <ul><li>Visual Information </li></ul><ul><li>DOM Tree Structure </li></ul><ul><li>Web Page Content </li></ul><ul><li>FivaMatchingScore </li></ul>
  4. 4. Visual Information <ul><li>Left </li></ul><ul><li>Top </li></ul><ul><li>Right </li></ul><ul><li>Bottom </li></ul><ul><li>Width </li></ul><ul><li>Height </li></ul>
  5. 5. Tree Structure <ul><li>Total Size </li></ul><ul><li>Parent Node Size </li></ul><ul><li>Node Size </li></ul><ul><li>Node Degree </li></ul><ul><ul><li>Ignore SCRIPT, NOSCRIPT, #comment </li></ul></ul><ul><li>Attribute Value </li></ul><ul><li>The Same Parent </li></ul><ul><li>The Same Page </li></ul>
  6. 6. Page Content <ul><li>String Length </li></ul><ul><li>Text Node Number </li></ul>
  7. 7. FivaMatchingScore <ul><li>FivaMatchingScore is asymmetric, so both argument orders are used </li></ul><ul><ul><li>FivaMatchingScoreAttr(N1,N2) </li></ul></ul><ul><ul><li>FivaMatchingScoreAttr(N2,N1) </li></ul></ul><ul><ul><li>FivaMatchingScore(N1,N2) </li></ul></ul><ul><ul><li>FivaMatchingScore(N2,N1) </li></ul></ul>
  8. 8. Future Work <ul><li>Label the answers </li></ul><ul><li>Find more useful features </li></ul>
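
The data collection step and the tree structure and page content features listed in the slides could be sketched roughly as follows. This is only an illustration under several assumptions that the slides do not confirm: a node's "path" is taken to be its chain of tag names from the root, "node size" is taken to be the number of nodes in its subtree, and the `Node` class and all function names are hypothetical stand-ins for whatever the MSHTML-based tool actually uses. Visual information and FivaMatchingScore are left out because they need a rendering engine and a definition the slides do not give.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# Tags the slides say to ignore, and leaf tags excluded from pairing.
IGNORED_TAGS = {"SCRIPT", "NOSCRIPT", "#comment", "IFRAME"}
LEAF_TAGS = {"#text", "IMG"}


@dataclass
class Node:
    """Hypothetical minimal DOM node (not the MSHTML interface)."""
    tag: str
    children: List["Node"] = field(default_factory=list)
    text: str = ""


def kept_children(node: Node) -> List[Node]:
    """Children that survive the ignore list."""
    return [c for c in node.children if c.tag not in IGNORED_TAGS]


def node_size(node: Node) -> int:
    """Assumed meaning of 'node size': nodes in the subtree, ignored tags skipped."""
    return 1 + sum(node_size(c) for c in kept_children(node))


def node_degree(node: Node) -> int:
    """Number of direct children, ignored tags skipped."""
    return len(kept_children(node))


def text_stats(node: Node) -> Tuple[int, int]:
    """Return (string length, text node count) for the subtree."""
    if node.tag == "#text":
        return len(node.text), 1
    length = count = 0
    for child in kept_children(node):
        child_length, child_count = text_stats(child)
        length += child_length
        count += child_count
    return length, count


def index_by_path(root: Node) -> Dict[str, List[Node]]:
    """Map a tag path such as 'HTML/BODY/DIV' to the non-leaf nodes on it."""
    index: Dict[str, List[Node]] = {}

    def walk(node: Node, prefix: str) -> None:
        path = f"{prefix}/{node.tag}" if prefix else node.tag
        if node.tag not in LEAF_TAGS:
            index.setdefault(path, []).append(node)
        for child in kept_children(node):
            walk(child, path)

    walk(root, "")
    return index


def candidate_pairs(root1: Node, root2: Node) -> List[Tuple[str, Node, Node]]:
    """All cross-page node pairs with Node1.Path1 == Node2.Path2."""
    index1, index2 = index_by_path(root1), index_by_path(root2)
    return [
        (path, n1, n2)
        for path, nodes1 in index1.items()
        for n1 in nodes1
        for n2 in index2.get(path, [])
    ]


def pair_features(n1: Node, n2: Node) -> Dict[str, Tuple[int, int]]:
    """A few tree structure and page content features for one pair."""
    length1, count1 = text_stats(n1)
    length2, count2 = text_stats(n2)
    return {
        "node_size": (node_size(n1), node_size(n2)),
        "node_degree": (node_degree(n1), node_degree(n2)),
        "string_length": (length1, length2),
        "text_node_number": (count1, count2),
    }


# Two small, made-up pages standing in for the "two similar web pages".
page1 = Node("HTML", [Node("BODY", [
    Node("DIV", [Node("#text", text="hello")]),
    Node("SCRIPT"),
    Node("DIV", [Node("IMG"), Node("#text", text="hi there")]),
])])
page2 = Node("HTML", [Node("BODY", [
    Node("DIV", [Node("#text", text="world!")]),
])])

pairs = candidate_pairs(page1, page2)
for path, n1, n2 in pairs:
    print(path, pair_features(n1, n2))
```

Each pair yields one feature row. Once the answers are labeled, such rows could serve as training examples for deciding which candidate pairs are true peer nodes.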
