The document provides information on optimizing academic documents for discovery on Google Scholar and other academic search engines. It discusses the goals of academic SEO as making scholarly documents easy to locate on Google Scholar, universal web searches, and academic search engines to increase citations and establish authority. Key principles discussed include using relevant keywords and labels, ensuring accessibility through site architecture and formatting, developing links and citations, and accommodating searcher behaviors.
Academic SEO: Optimizing Scholarly Documents for Discovery
1. Academic Search Engine Optimization of
Web Documents for Google Scholar
By
Shari Thurow, Founder and SEO Director
Omni Marketing Interactive
Copyright 1997-2015. All rights reserved.
2. About Shari Thurow:
• Author of Search Engine Visibility, translated
into French, Japanese, Korean, Russian, and
Polish languages.
• Co-author of When Search Meets Web
Usability, translated into Chinese.
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
3. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
• User Experience Professionals Association (UXPA)
Board Member
Formerly the Usability Professionals Association
• Information Architecture Institute (IAI)
Board Member and Advisory Board
• ASLIB Journal of Information Management
Editorial Advisory Board
•
Bing Ads
Accredited Professional Advisory Board
4. Table of Contents:
• Helpful Google Scholar Links
• Goals of academic SEO
• Understanding search engine optimization (SEO)
• Fundamental principles of SEO
Keywords and copywriting
Accessibility and architecture
Popularity (validation)
Searcher goals and behaviors
• Examples of PDFs and abstracts
• Q&A
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
6. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Some symbols:
This is a good idea, or this document (or web page)
implemented a best practice.
This is a bad idea; don’t do this; or this document(or web
page) did not implement a best practice well.
This is really important. Try to remember this tip or user
experience (UX) and findability best practice.
7. HELPFUL GOOGLE SCHOLAR LINKS
Academic SEO of Web Documents for Google Scholar
Copyright 1997-2015. All rights reserved.
8. For quick reference:
• About Google Scholar
https://scholar.google.com/intl/en-US/scholar/about.html
• Inclusion Guidelines for Webmasters
https://scholar.google.com/intl/en-US/scholar/inclusion.html
• Publisher Support
https://scholar.google.com/intl/en-US/scholar/publishers.html
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
9. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
10. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
11. GOALS OF ACADEMIC SEO
Academic SEO of Web Documents for Google Scholar
Copyright 1997-2015. All rights reserved.
12. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Not only to appear in Google Scholar:
13. Goals of academic SEO (ASEO):
• Make your scholarly documents easy to locate and discover via:
Google Scholar
Universal/blended web searches
• More citations:
Academic
Non-academic
• Communicate to both humans and technology that you are a topic (or
subject) authority
• Assist with your own research
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
14. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Universal/blended search:
15. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
In Google Scholar:
16. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Increase citations:
17. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Increase citations (cont’d):
18. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Communicate topic authority:
19. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Communicate topic authority (cont’d):
21. Search engine optimization (SEO) is…
• Marketing-only definition: …the process of making web
documents easy to locate/discover, interpret, and
possibly rank on the commercial web search engines
(such as Bing and Google).
• … designing, writing, formatting, programming, and
coding digital documents so that there is an increased
chance that these documents will appropriately appear
at the top of web search results.
• The optimization process also includes how one can
improve the appearance of individual search listings on
search engine results pages (SERPs).
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
22. SEO professionals are concerned with:
• Labeling website content so that it is easy to find
• Organizing website content so that it is easy to find
• Ensuring search engines have access to desired content
• Ensuring search engines don’t have access (or have limited access) to
undesirable content
• Applies to both:
Web search engines
Site search engines
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Many optimization processes and techniques for web SEO can make
the results from a site search engine more accurate.
23. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Site search engine:
PDF ranking
in
a site search
engine
24. Web search engine:
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
HTML page ranking in a
web search engine
25. Academic search engine:
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF ranking
in
an academic
search engine
26. Academic search engine (cont’d):
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Abstract (HTML)
in an academic
search engine
27. Web search engine:
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF ranking in a
web search engine
28. Web search engine (cont’d):
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Abstracts (HTML)
ranking in a web
search engine
29. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
SEO applies to many types of digital documents:
X
XLSX DOCX PPTX
30. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Remember:
SEO is NOT optimizing for search engines only.
31. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Remember:
SEO is optimizing for people who use search engines.
32. Remember:
• Optimize for people first, technology second.
• Web search usability refers to extent to which digital documents can be
discovered and located by web search-engine users to achieve specified
goals – with effectiveness, efficiency and satisfaction in a specified context
of use.
• Satisfying web searcher goals has been a ranking factor in web search
engines since the 1990s.
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
33. FUNDAMENTAL PRINCIPLES OF SEO
Academic SEO of Web Documents for Google Scholar
Copyright 1997-2015. All rights reserved.
34. Web search engines:
1. Index text
2. Follow links
3. Measure popularity (authority)
4. Accommodate searcher goals and behaviors
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
}All crawlers
}Ranking factors
35. 4 building blocks of SEO:
1. Keywords, labels, and copywriting
Do your digital documents contain the words and phrases that your
target audience types into search queries?
2. Accessibility (site architecture & formatting)
Do search engines have easy, user-friendly access to content through site
architecture, navigation, page layout, formatting, and URL structure?
3. Link development & social signals
Does your website get objective, high-quality, third-party links pointing to
it (not only your home page)? In ASEO, are your scholarly documents
cited?
4. Web searcher behaviors
Does your website accommodate known searcher goals and behaviors?
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
36. All components are essential:
• These building blocks form the fundamental core of a user-friendly and
search-friendly website.
• In order for a site to receive consistent web search-engine traffic over
time, all 4 components should be present on a website.
• Search-engine rankings and traffic depend on:
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
+ Technology
Factors
Human
Factors
37. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Link
Development
& Social
Searcher
Goals
On-the-page
criteria
Keywords
& Labels
Access &
Architecture
38. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Off-the-page
criteria
Link
Development
& Social
Searcher
Goals
Keywords
& Labels
Access &
Info Scent
40. Aboutness:
• When we refer to aboutness, most of the time we are referring to
intentional aboutness.
• Intentional aboutness describes the meaning of a document (such as a
web page, PDF, graphic image or video) from the author’s perspective.
• Please see:
Hjørland, B. (2001). Towards a theory of aboutness, subject, topicality, theme,
domain, field, content... and relevance. Journal of the American Society for
Information Science and Technology, 52(9), 774-778.
Hutchins, W. J. (1978, May). The concept of'aboutness' in subject indexing. In
Aslib proceedings (Vol. 30, No. 5, pp. 172-181). MCB UP Ltd.
Academic SEO of Web Documents Copyright 2007-2015. All rights reserved.
41. (1) Text
• Does the PDF and abstract contain the words and phrases that people
type into search engines?
• Does the PDF and abstract seem focused on specific content?
• Are keywords used in the right order (word sequence)?
Help desk software
Software help desk
• Are the words featured prominently within the PDF and abstract?
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
42. Document labels:
• Document
• Navigation
• Content
• Technical
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
43. Label details:
• Document labels
URL
Titles
• Content labels
Headings
Subheadings
Descriptions
Annotations
Captions
• Technical labels
(X)HTML
CSS & JavaScript
• Navigation labels
Global
Local
Utilities
Contextual
Supplemental
Hybrid (breadcrumbs)
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
44. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
• Title
• Primary heading (h1)
• Top of the page
• Subheadings (if used)
• Meta-tag description
(for display only)
• URL
HTML version:
45. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
46. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
http://uxpajournal.org/the-effects-of-parallax-
scrolling-on-user-experience-in-web-design/
47. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
48. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
49. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
50. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF version:
• PDF title
• Author
• Description
• Keywords
• Top of the page
• URL
51. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF title:
This is the PDF
title. Make sure it
matches the PDF’s
metadata title.
52. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
To go to the PDF’s metadata title:
53. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF metadata title:
54. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Top of the page:
55. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF author metadata:
Click here.
56. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF author title metadata:
57. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF metadata subject:
Do NOT copy/paste the PDF
title in this text field.
The subject and title can be
similar but should NOT be
exact matches.
If you have a category and
subject, this would be
appropriate text for this field.
58. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF description metadata:
Click here.
59. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF description metadata:
60. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF keywords metadata:
You can use the same keyword
list that you used in the HTML
document or abstract.
But you might want to
differentiate them a bit more
(to show that they are unique).
61. Google’s PDF metadata info:
• https://scholar.google.com/intl/en-us/scholar/inclusion.html#indexing
• When in doubt, use Dublin Core.
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
62. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
PDF URL:
http://uxpajournal.org/wp-
content/uploads/pdf/JUS_Frederick_Feb2015.
pdf
63. URL structure:
• Remember:
Short but descriptive.
Use hyphens (not underscores) to separate words.
Minimize the use of stop words.
Minimize the use of “funky” characters.
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
65. Search engines follow links:
• Search engine follow links from one web document to another.
• How your web documents link to each other communicates (to both
humans and technology) how important you believe your content is.
• In other words, navigation matters!
• Search engines and searchers look at:
Information architecture
Site navigation (global, local, contextual, supplemental)
URL structure
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
66. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Is the HTML document in the search-engine index?
67. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Is the PDF in the search-engine index?
Web analytics data can also show this
information.
The advanced search operator (site: ) is a
quick way to see if a document is in the
search index.
68. Links pointing to the PDFs:
• Here’s an example:
http://research.microsoft.com/en-us/um/people/ryenw/publications.html
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
69. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
70. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Here’s an abstract:
Abstract URL: http://www.emeraldinsight.com/toc/ajim/66/3
71. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Here’s the PDF:
http://www.emeraldinsight.com/doi/pdfplus/1
0.1108/AJIM-09-2013-0083
72. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Tip:
Remember, in PDFs, always show absolute
links, not relative links.
Absolute link:
http://research.microsoft.com/en-
us/um/people/ryenw/papers/BennettTOIS2015.pdf
73. PDF file size:
• Documents larger than 5MB, such as books and long dissertations, should
be uploaded to Google Book Search.
• Google Scholar automatically includes scholarly works from Google Book
Search.
• To minimize PDF file size:
Limit the number of fonts/typefaces used in the document.
Limit the number of graphic images used in the document.
If necessary, reformat graphic images to be smaller in file size. You can put the
higher-resolution images on your site individually.
Limit PDF metadata to essential information (author, title, description,
keywords, URL). Don’t put long paragraphs of information in the metadata.
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
74. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Use this tool:
https://developers.google.com/speed/docs/i
nsights/about
Google only has specifications for web pages here, but the tool can help you get
specific recommendations for minimizing file size.
76. What is link development?
• Link popularity
Number of (quantity) external, 3rd-party links pointing to a URL
Quality of external, 3rd-party links pointing to a URL
• Click (or click-thru) popularity
Number of times searchers click on links to your site from a search engine
How long end users stay on your site after clicking on a search listing
» If it’s a quick-fact search, a short time is good. Reason? Searchers got their
questions answered.
» If the query is exploratory, then a longer dwell time might signal that your site has
good information…
» …OR it could also mean that the site navigation and labeling is confusing.
How often people return to your site from search listings
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
77. To ALWAYS remember about link development:
• The ultimate goal is to have good quantity of high quality links pointing to
the PDF and/or the abstract.
• Quality trumps quantity.
• Citations in bibliographies help in Google Scholar.
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
78. Recommendations:
• Encourage authors and researchers to link to BOTH the HTML and PDF
versions. Show users how to do it.
• Encourage link development to PDFs via social media as well, even though
social media citations are NOT a ranking factor.
• The best benefit from Google Scholar is to encourage scholarly and book
citations in Google’s preferred format.
The text of each reference must be a formal bibliographic citation in a
commonly used format, without free-form commentary.
Chicago, ALA, and MLA (please see following slide)
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
79. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
https://scholar.google.com/intl/us/scholar/citations.html#overview
80. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Google’s preferred bibliographic citation formats:
82. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
To remember:
Navigational Informational Transactional
83. Web searcher goals:
• Try to remember: GO, KNOW, DO.
Navigational: Go to a website or specific web page.
Informational: Know or learn about a topic.
Transactional: Do some web-mediated activity (listen to music, watch video,
look at photos, etc.
• Informational queries are the most common query type on all devices
(computer, tablet, phone).
• Transactional queries are the least common query type.
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
84. Most important query to satisfy?
• Navigational queries…because the search task is specific: to go to the JUS
website or a document/page on the JUS website.
Navigational queries are difficult to displace.
Web searchers rarely look past position #3 when conducting a navigational
query.
• If aboutness and information scent is clear for navigational queries, then
optimization for informational and transactional queries will likely be clear
to users and technology.
• Quick tip: Do 2 searches.
Simple search: author + article title
Advanced search
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
85. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
What type of query is this?
86. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
87. “Download” shows transactional intent:
• If a person wants to download a specific PDF from a website, the query is
both navigational and transactional.
• Transactional indicators:
Download
PDF
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
88. To remember:
• SEO is optimizing for people who use search engines, not optimizing for
search engines (technology) only.
• Digital documents should communicate ‘aboutness’ to both humans and
technology.
HTML pages (abstracts and articles)
PDFs
• Users should have easy access to all digital documents.
• Validation comes in the form of link development as well as bibliographic
citations.
• Ensure your digital documents accommodate known web-searcher
behaviors.
• Follow search engines’ guidelines.
Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
89. Academic SEO of Web Documents Copyright 1997-2015. All rights reserved.
Questions?
Shari Thurow, Founder and SEO Director
Omni Marketing Interactive
sthurow@search-usability.com