
Screaming Frog + XPath (EN)

#SEO4Pirates Nr.6
How to crawl specific page elements using Screaming Frog and XPath
English version

Published in: Marketing

  1. Screaming Frog + XPath: Crawl specific page elements on your own or competitors' websites. @Urlaubspiraten #SEO4Pirates Nr.6 Sabine Langmann
  2. Level 1 23.08.2018 Sabine Langmann Bit.ly/abjsd
  3. What is this about?
  4. We'd like: to crawl specific elements on our own web pages or those of our competitors. We use: Screaming Frog's Custom Search + XPath.
  9. Level 2
  10. Who am I? Geek. SEO. Cake.
  11. Who am I? http://sabine-langmann.com https://www.linkedin.com/in/sabine-langmann/ @SabTheLa @sababeille
  12. Slides available at: https://www.slideshare.net/SabineLangmann
  13. Level 3
  14. XPath
  15. XPath (XML Path Language) is a query language for selecting nodes from an XML document. (Wikipedia)
  16. Simple syntax: node = any page element (e.g. h2, a, p, div); // addresses a certain node. attribute = an attribute of a node (e.g. class, id); @ addresses a certain attribute. count() counts the addressed nodes.
  17. Simple syntax: //node[@attribute="attribute_name"] bit.ly/2o3vJ5O
  18. Simple syntax: count(//node[@attribute="attribute_name"])
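
The two syntax patterns above can be tried outside the crawler. This is a minimal sketch, assuming Python's standard-library `xml.etree.ElementTree`, which supports only a subset of XPath: paths must start with `.//` instead of `//`, and `count()` is not available, so `len()` stands in for it. The sample markup, the `select` helper, and the `teaser` class are invented for illustration. Note that the quoted string in the predicate is compared against the attribute's value.

```python
import xml.etree.ElementTree as ET

# Invented sample markup; it must be well-formed for ElementTree to parse it.
PAGE = """
<html><body>
  <div class="teaser"><p>First teaser</p></div>
  <div class="teaser"><p>Second teaser</p></div>
  <div class="footer"><p>Imprint</p></div>
</body></html>
"""


def select(root, node, attribute, value):
    """Return all <node> elements whose attribute equals value exactly."""
    # ElementTree needs a relative path, so the leading // becomes .//
    return root.findall(f".//{node}[@{attribute}='{value}']")


root = ET.fromstring(PAGE.strip())
teasers = select(root, "div", "class", "teaser")
teaser_count = len(teasers)  # stands in for count(//div[@class="teaser"])
print(teaser_count)
```

Checking a selector against a small fragment like this can help catch typos before a full crawl is started.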
  19. Level 4
  20. Some use cases
  21. Urlaubspiraten vs. Sonnenklar TV
  23. How many images? How many H2, H3, etc.? How many words? How many links to which pages?
  26. Step #1
  28. Step #2
  30. Step #3: What am I searching for?
  31. Step #3: In <div class="htmlContent"> I'm searching for links, i.e. <a> tags.
  32. Step #3: Suitable XPath selector: //div[@class="htmlContent"]//a
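
To illustrate what the selector from Step #3 picks up, here is a small sketch under stated assumptions: it uses Python's standard-library `xml.etree.ElementTree` (relative `.//` path instead of `//`), and the sample page, its URLs, and the `content_links` helper are hypothetical. It lists the target and anchor text of every link inside the `htmlContent` container and ignores links elsewhere on the page.

```python
import xml.etree.ElementTree as ET

# Hypothetical page fragment: one navigation link outside the content
# container and two links inside it.
PAGE = """
<html><body>
  <div class="nav"><a href="/home">Home</a></div>
  <div class="htmlContent">
    <p>Read our <a href="/deals/mallorca">Mallorca deals</a>.</p>
    <ul><li><a href="/deals/crete">Crete deals</a></li></ul>
  </div>
</body></html>
"""


def content_links(markup):
    """Return (href, anchor text) pairs for links inside div.htmlContent."""
    root = ET.fromstring(markup.strip())
    anchors = root.findall(".//div[@class='htmlContent']//a")
    return [(a.get("href"), "".join(a.itertext()).strip()) for a in anchors]


links = content_links(PAGE)
for href, text in links:
    print(href, text)
```

The double slash before `a` matters here: it matches links at any depth below the container, not only its direct children.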
  33. Step #4
  36. Result
  40. etc.
  43. In <div class="container"> I'm searching for the number of H3 headings.
  44. Suitable XPath selector: count(//div[@class="container"]//h3)
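
The heading count can be reproduced roughly as follows. This sketch assumes Python's standard-library `xml.etree.ElementTree`, which has no `count()` function, so the length of the result list is used instead. The sample markup and the `count_headings` helper are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Invented sample: one h2 and three h3 inside the container,
# plus one h3 outside it that must not be counted.
PAGE = """
<html><body>
  <div class="container">
    <h2>Top deals</h2>
    <h3>Flights</h3>
    <h3>Hotels</h3>
    <div><h3>Cruises</h3></div>
  </div>
  <div class="sidebar"><h3>Newsletter</h3></div>
</body></html>
"""


def count_headings(markup, container_class="container", levels=("h2", "h3")):
    """Count heading elements per level inside the given container div."""
    root = ET.fromstring(markup.strip())
    return {
        level: len(root.findall(f".//div[@class='{container_class}']//{level}"))
        for level in levels
    }


heading_counts = count_headings(PAGE)
print(heading_counts)
```

Running the same function over two competing pages gives a quick side-by-side view of how each page structures its content.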
  46. Result
  48. Gutscheinsammler vs. Sparwelt
  50. Which star rating? How many words in the intro? How many active coupon codes?
  52. Suitable XPath selector: //button[@data-vouchertype="is__discounts"]//span
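
Selectors are not limited to `class` or `id`; any attribute can be addressed, including `data-` attributes as in the selector above. A minimal sketch, assuming Python's standard-library `xml.etree.ElementTree`; the button markup and the values inside the spans are invented, since the slide does not show what the real spans contain.

```python
import xml.etree.ElementTree as ET

# Invented markup: two filter buttons that differ only in data-vouchertype.
PAGE = """
<html><body>
  <button data-vouchertype="is__discounts"><span>12</span> Discounts</button>
  <button data-vouchertype="is__codes"><span>5</span> Codes</button>
</body></html>
"""

root = ET.fromstring(PAGE.strip())

# Same selector as on the slide, written as a relative path for ElementTree.
spans = root.findall(".//button[@data-vouchertype='is__discounts']//span")
span_texts = [(s.text or "").strip() for s in spans]
print(span_texts)
```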
  53. Result
  55. How many active coupon codes?
  58. Suitable XPath selector: count(//div[@class="media-list vouchers-active"]//div[@class="col-xs-12 col-sm-5"]//span[@class="text"])
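
One thing worth knowing about this selector: `@class="media-list vouchers-active"` compares the whole attribute string, so it only matches when the class names appear in exactly this order and spelling. The sketch below shows this with Python's standard-library `xml.etree.ElementTree` (no `count()`, so `len()` is used). The voucher markup, the codes, and the `count_active_codes` helper are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical voucher list: two active codes and one expired code.
PAGE = """
<html><body>
  <div class="media-list vouchers-active">
    <div class="row"><div class="col-xs-12 col-sm-5"><span class="text">SAVE10</span></div></div>
    <div class="row"><div class="col-xs-12 col-sm-5"><span class="text">FREESHIP</span></div></div>
  </div>
  <div class="media-list vouchers-expired">
    <div class="row"><div class="col-xs-12 col-sm-5"><span class="text">OLD5</span></div></div>
  </div>
</body></html>
"""

# The slide's selector, as a relative path for ElementTree.
SELECTOR = (
    ".//div[@class='media-list vouchers-active']"
    "//div[@class='col-xs-12 col-sm-5']"
    "//span[@class='text']"
)


def count_active_codes(markup):
    """Count the spans matched by SELECTOR; stands in for count(...)."""
    root = ET.fromstring(markup.strip())
    return len(root.findall(SELECTOR))


active = count_active_codes(PAGE)

# Same page, but with the two class names swapped on the active list.
swapped_page = PAGE.replace("media-list vouchers-active", "vouchers-active media-list")
active_after_swap = count_active_codes(swapped_page)

print(active, active_after_swap)
```

If a competitor changes the order of class names in a template, a selector like this silently returns zero, so a sudden drop in the crawl results is a reason to re-inspect the markup first.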
  59. Result
  60. Kino.de vs. Serienjunkies
  62. How many words in the description?
  63. Suitable XPath selector: //section[@class="smb-post-body"]//p
  65. Result
  67. How many words in the description?
  68. Suitable XPath selector: //section//p
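
The selectors above return paragraphs, not a word count, so the counting has to happen on the extracted text. A rough sketch of that second step, assuming Python's standard-library `xml.etree.ElementTree` and a simple whitespace split as the definition of a word; the sample markup and the `word_count` helper are invented. It also shows how the narrower and the broader selector differ on the same page.

```python
import xml.etree.ElementTree as ET

# Invented sample: a description section and an unrelated comments section.
PAGE = """
<html><body>
  <section class="smb-post-body">
    <p>A retired detective returns for one last case.</p>
    <p>Critics praised the <em>final</em> episode.</p>
  </section>
  <section class="comments"><p>Great show!</p></section>
</body></html>
"""


def word_count(markup, selector):
    """Sum the whitespace-separated words of all elements the selector matches."""
    root = ET.fromstring(markup.strip())
    total = 0
    for paragraph in root.findall(selector):
        # itertext() also collects text inside nested tags such as <em>.
        text = "".join(paragraph.itertext())
        total += len(text.split())
    return total


narrow = word_count(PAGE, ".//section[@class='smb-post-body']//p")
broad = word_count(PAGE, ".//section//p")
print(narrow, broad)
```

The broader selector is useful when a site has no stable class on the section, but it can pull in text from other sections, which inflates the count.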
  70. Result
  71. Level 5
  72. Recap: Which data do I need? Can I crawl the respective elements? What is the right XPath selector? That's it!
