SlideShare a Scribd company logo
Structured Data and Metadata Evaluation Methodology for 
Organizations Looking to Improve Image Findability on the Web 
School of Library and Information Studies 
LIS 5733 Taught by: Dr. Susan Burke 
Research Proposal Written by: Emily Kolvitz 
Research Setting: Primarily Geared Towards Online Ecommerce/Business Organizations, but methodology could 
easily translate to Galleries, Museums, Archives, Libraries (GLAMs) or any institution looking to evaluate their 
structured data and metadata practices on the world wide web in an effort to improve findability of product offerings, 
general information or services.
Introduction 
The current state of findability on the web for many organizations is incipient. Search 
Engine Optimization (SEO) techniques change frequently and remain much a mystery 
to many companies. The one variable in the equation of web findability that remains a 
staple is good quality metadata under the hood of the website. 
This research methodology will allow for : 
● An assessment of findability maturity on the web from an image-centric viewpoint 
● Help improve findability on the web by establishing a baseline for where your 
organization is at in terms of structured data content and visualize gaps or areas 
for improvement from a search engine neutral perspective
Introduction 
● Most Searches Start with Google now (Holman 2011) (Lippincott 2013) 
● Search Algorithms Shaping what is most Easily Accessible (Connaway, Dickey & 
Radford 2011) and they are subject to change frequently (Kritzinger 2013) 
● Search Algorithms Look for Your Structured Data and in the future and possibly 
your embedded metadata (Cazier 2014) (Beall 2010)
Literature Review 
Marshall Breeding (2013) assesses the limitations of the major search engine algorithms: 
“But even with the most sophisticated relevancy 
algorithms, index-based search and retrieval lacks the 
ability to lead users to the potential related content. 
Semantic web technologies, in conjunction with 
repositories of open linked data, promise to deliver 
significant new capabilities in exploring and exploiting 
information resources on the web.”
Literature Review 
● Semantic web is founded on good, high-quality 
structured data 
● Future technologies could potentially utilize 
embedded metadata in search (Cazier 2014) 
(Beall 2010) but there is authenticity, 
provenance and “breadcrumbs” value now 
(Reicks 2013)
Literature Review 
● Most users don’t go past the first page of 
search results (Paz 2013) 
● Structured Data Practices can help your 
organization stay relevant (and findable!) in 
the age of information overload 
● Keeping it Search Engine Neutral is 
advisable (Paz 2013)
Topic/Proposed Research 
● Methodology for establishing a baseline or benchmark of where an organization is at 
in terms of structured data pertaining to image records that ultimately helps findability 
on the web 
● By utilizing the proposed methodology for gathering this data for an organization, 
data-informed decisions can be made about structured data strategy going forward to 
maintain relevancy on the web 
● Many structured data elements can affect online findability from file-naming 
standards, presence of alt text tags in html markup, html markup itself, embedded 
metadata, schema.org markup and rich snippets, text description at or nearby images, 
and more. IEEE uses metadata or full-text for search (IEEE Xplore offers this--see 
next slide)
Full Text Search & Metadata Search
Topic/Proposed Research 
● It is also noteworthy that there are additional factors that affect findability on 
the web that do not involve structured data, but this research focuses solely on 
structured data techniques within the control of individual organizations. 
● All of these structured data techniques pertaining to image records will be 
utilized in conjunction with the relevancy of onsite and offsite search results. 
● Image search and information retrieval is a more difficult area than text search 
and retrieval because accessibility to the image content is largely dependent on 
side-car text (or metadata if you will) that describes the aboutness and 
(hopefully) the context for the image record.
Questions 
Research Questions Addressed in this Study 
1. What methods of search are available on the organization’s online website? 
1. What is the file-naming structure for images on the website? 
1. What is the quality of search engine (onsite and offsite) results? 
1. What kinds of search results appear in Image Search when searching by the 
organization’s name and product description both with onsite search and offsite 
search?
Questions 
Research Questions Addressed in this Study 
5. What kinds of search results appear in Google Image Search when searching 
by images taken from the organization’s website? 
5. What kinds of search results come up when looking for specific products 
(measure of structured data) through onsite search and offsite search? 
5. What are the results when looking for specific products on the offsite search 
engine?
Questions 
Research Questions Addressed in this Study 
8. What kinds of structured data are near or around the images on the organization’s 
website? Alt Text? Other? 
9. What file types appear on the organization’s website? (JPEG? TIFF? PNG?) 
9. What embedded metadata is available in images on the website? 
11. What does the XMP/XML/RDF for these images look like and how robust is it? 
What does the graph look like?
Variables 
Quality and number 
of alt text tags 
Type of page 
the image was 
on 
Level of description for the 
filename 
Quality and number of structured 
data tags pertaining to the images 
The image file naming 
convention/filename 
Quality and number 
of embedded 
metadata tags 
Quality and number of search 
results for onsite search 
utilizing filename or alt text 
Quality and number 
of relevant search 
results utilizing 
offsite image search 
These measures are operationalized by utilization of likert scales applied by the human researcher. For 
example, when rating the level of description for the file-name, a research could conclude that the 
filename sp_18379847923.jpg is not very descriptive filename for a human, let alone for a search engine 
(unless of course this is a product sku.) The researcher would then choose to assign it a low value on 
descriptiveness on a 1-5 likert scale.
Data Collection Methods 
Participants 
Participants will include a single institution, anonymized for the protection of their business. The sample of image records utilized 
in this study will be limited to image assets appearing on the organization’s website domain. Most data collection can take place 
from the organization’s website itself. Some procedures will take place on external sites, services, or programs. 
Randomization of Sample 
The sample of images utilized in this study can be randomized by extracting a site map of the particular organization of interest 
using xsitemap.com. After the site map is constructed, the list of URLs should be inputted into a spreadsheet program and a record 
number should be assigned to each URL. From there, the researcher can use a randomizer program to select the order of pages to 
utilize in the study (i.e. Research Randomizer Available at: http://www.randomizer.org/form.htm) This method will be utilized for 
taking a random sample of pages from the organization of interest. 
Consent 
All data collected in this study are publicly available and freely available on the web.
Data Collection Methods 
Obtaining Data on the website 
● Navigate to the URL 
● Right Click Image(s) and “Save As” 
● Right Click Page and “View Source” Save as 
.txt file 
● Collect raw data from image by either 
opening in Photoshop and Navigating to Raw 
Data Column or utilize Phil Harvey’s 
ExifTool 
Obtaining Data through Structured Data Linter 
● Navigate to the Linter website 
● Enter URL 
● Screenshot Structured Data Results -or- save 
as webpage 
Obtaining Data through W3C RDF validator 
● Copy raw data xml extracted earlier and input 
into RDF Validator 
● Select Graph Only on the Options 
● Parse RDF 
● Save Graph or Screenshot Graph 
● Store in Folder with other Data 
Answer Research Questions 
● Systematically go through the collected data 
and input findings into spreadsheet
Data Analysis Methods 
● Descriptive Statistics 
o Bell Curve - measures 
towards a central tendency 
using likert scale data 
Bell Curve Image By Vierge Marie 
(Own work) [Public domain], via 
Wikimedia Commons 
http://upload.wikimedia.org/wikipe 
dia/commons/f/f6/Gaussian_Filter 
.svg
Data Analysis Methods 
● Graphical Analysis 
(Charts and Graphs) 
● Summary Report 
● Discussion of Findings
Visualizing the Results 
The Structured Data Linter, 
utilizing URLs to display 
structured data around the images. 
Available at: 
http://linter.structured-data.org/ 
Summary analysis will be 
crafted utilizing all of these data 
points to show what we are able 
to understand about an image 
versus what a machine or search 
engine is able to know about an 
image. 
W3C RDF Validator Graph 
Visualization utilizing the raw 
data markup extracted from the 
image 
Available at: 
http://www.w3.org/RDF/Validator 
/
Structured Data Linter 
Shows all 
structured Data 
Tags around the 
images and in 
the page markup
RDF Validator 
Visualization of 
embedded data 
for images and 
their subsequent 
relationships to 
other data
Summary Report 
Complete Picture of Structured 
Data, Metadata and Analysis 
of Study
Expected Outcomes 
The anticipated results of this project include a benchmark for where this specific 
organization is at in terms of structured data in the online environment and a 
methodology for other organizations looking to assess their structured data maturity in 
the digital space. These results will be used to create a roadmap for improving resource 
findability both on the web and within websites. Other organizations may also aspire to 
reuse this methodology for assessing their own current state of structured data. Future 
areas of research could include utilizing metadata/RDF-driven search engines in 
conjuncture with Vector Space Models to assess findability of image records on the 
web and within websites.
References (Slides & Full Paper) 
Algebraix Data, Corporation. 0005. "Algebraix Data Launches Industry’s First Cost-Effective Automated Implementation 
of Schema.org." Business Wire (English), 5. 
Beall, Jeffrey. 2010. "How Google Uses Metadata to Improve Search Results." Serials Librarian 59, no. 1: 40-53. 
Breeding, Marshall. 2013. "Linked Data: The Next Big Wave or Another Tech Fad?." Computers In Libraries 33, no. 3: 
20-22. 
Cafarella, M.J., Halevy, A.Y., Zhang, Y., Wang, D.Z., and Wu, E. Uncovering the relational Web. In Proceedings of the 
11th International Workshop on the Web and Databases (Vancouver, B.C., June 13, 2008). 
http://web.eecs.umich.edu/~michjc/papers/webtables_webdb08.pdf 
Connaway, Lynn Sillipigni, Timothy J. Dickey, and Marie L. Radford. 2011. "“If it is too inconvenient I'm not going after it:” 
Convenience as 
a critical factor in information-seeking behaviors." Library & Information Science Research (07408188) 33, no. 3: 179-190.
References (Slides & Full Paper) 
Cazier, Clay, 2014. PM Digital Marketing Blog “The Future of Exif Image Data” Last accessed November 20, 2014. 
http://www.pmdigital.com/blog/2014/04/future-exif-image-data/ 
Diagram Center: Digital Image and Graphic Resources for Accessible Materials , 2014. “Content Model” Last Accessed 
November 23, 2014. http://diagramcenter.org/standards-and-practices/content-model.html 
Google. 2014. “Image Publishing Guidelines” Last accessed November 21, 2014. 
https://support.google.com/webmasters/answer/114016?hl=en 
Holman, Lucy. 2011. "Millennial Students' Mental Models of Search: Implications for Academic Librarians and Database 
Developers." Journal Of Academic Librarianship 37, no. 1: 19-27
References (Slides & Full Paper) 
International Business, Times. 0006. "Bing,Google and Yahoo merge to make search easier with schema.org." 
International Business Times, April. 
IPTC International Press Telecommunications Council, 2014. “Embedded Metadata Manifesto” Last accessed November 
20, 2014. http://www.embeddedmetadata.org/social-media-test-results.php (Embedded Metadata Manifesto 2014). 
Kritzinger, W. T. "Search Engine Optimization and Pay-per-Click Marketing Strategies." Journal of Organizational 
Computing and Electronic Commerce, no. 3 (2013): 273-86. 
Lippincott, Joan K. “Net Generation Students and Libraries,” EDUCAUSE (2005), accessed November 19, 2014, 
http://www.educause.edu/research-and-publications/books/educating-net-generation/net-generation-students-and-libraries
References (Slides & Full Paper) 
Nakanishi, T., "Semantic Context-Dependent Weighting for Vector Space Model," Semantic Computing (ICSC), 2014 
IEEE International Conference on , vol., no., pp.262,266, 16-18. June 2014. doi: 10.1109/ICSC.2014.49 
Paz, Anita. 2013. "In search of Meaning: The Written Word in the Age of Google." Italian Journal Of Library & 
Information Science 4, no. 2: 255-266. 
Priebe, T.; Schlager, C.; Pernul, G., "A search engine for RDF metadata," Database and Expert Systems Applications, 
2004. Proceedings. 15th International Workshop on , vol., no., pp.168,172, 2004. doi: 10.1109/DEXA.2004.1333468 
Reicks, David. 2010. “Why Embedded Metadata Won’t Help Your SEO,” Last Updated December 30, 2013. Last 
Accessed November 23, 2014. http://www.controlledvocabulary.com/blog/embedded-metadata-wont-help-seo.html

More Related Content

What's hot

Semantic search
Semantic searchSemantic search
Semantic search
Andreas Blumauer
 
Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document Clustering
IRJET Journal
 
Nordic health data metadata
Nordic health data   metadataNordic health data   metadata
Nordic health data metadata
Fredric Landqvist
 
Implementing Semantic Search
Implementing Semantic SearchImplementing Semantic Search
Implementing Semantic Search
Paul Wlodarczyk
 
SemTech 2011 Semantic Search tutorial
SemTech 2011 Semantic Search tutorialSemTech 2011 Semantic Search tutorial
SemTech 2011 Semantic Search tutorial
Peter Mika
 
DataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management Planning
DataONE
 
What IA, UX and SEO Can Learn from Each Other
What IA, UX and SEO Can Learn from Each OtherWhat IA, UX and SEO Can Learn from Each Other
What IA, UX and SEO Can Learn from Each Other
Ian Lurie
 
Semantic Search on the Rise
Semantic Search on the RiseSemantic Search on the Rise
Semantic Search on the Rise
Peter Mika
 
From federated to aggregated search
From federated to aggregated searchFrom federated to aggregated search
From federated to aggregated search
Mounia Lalmas-Roelleke
 
Semantic Search overview at SSSW 2012
Semantic Search overview at SSSW 2012Semantic Search overview at SSSW 2012
Semantic Search overview at SSSW 2012
Peter Mika
 
An Introduction to Entities in Semantic Search
An Introduction to Entities in Semantic SearchAn Introduction to Entities in Semantic Search
An Introduction to Entities in Semantic Search
David Amerland
 
A Novel Data Extraction and Alignment Method for Web Databases
A Novel Data Extraction and Alignment Method for Web DatabasesA Novel Data Extraction and Alignment Method for Web Databases
A Novel Data Extraction and Alignment Method for Web Databases
IJMER
 
Project Panorama: vistas on validated information
Project Panorama: vistas on validated informationProject Panorama: vistas on validated information
Project Panorama: vistas on validated information
Eric Sieverts
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012
Peter Mika
 
`A Survey on approaches of Web Mining in Varied Areas
`A Survey on approaches of Web Mining in Varied Areas`A Survey on approaches of Web Mining in Varied Areas
`A Survey on approaches of Web Mining in Varied Areas
inventionjournals
 
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEEFINALYEARSTUDENTPROJECTS
 
Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
Pistoia Alliance
 
Sekhon final 1_ppt
Sekhon final 1_pptSekhon final 1_ppt
Sekhon final 1_ppt
Manant Sweet
 
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
IOSR Journals
 

What's hot (19)

Semantic search
Semantic searchSemantic search
Semantic search
 
Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document Clustering
 
Nordic health data metadata
Nordic health data   metadataNordic health data   metadata
Nordic health data metadata
 
Implementing Semantic Search
Implementing Semantic SearchImplementing Semantic Search
Implementing Semantic Search
 
SemTech 2011 Semantic Search tutorial
SemTech 2011 Semantic Search tutorialSemTech 2011 Semantic Search tutorial
SemTech 2011 Semantic Search tutorial
 
DataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management Planning
 
What IA, UX and SEO Can Learn from Each Other
What IA, UX and SEO Can Learn from Each OtherWhat IA, UX and SEO Can Learn from Each Other
What IA, UX and SEO Can Learn from Each Other
 
Semantic Search on the Rise
Semantic Search on the RiseSemantic Search on the Rise
Semantic Search on the Rise
 
From federated to aggregated search
From federated to aggregated searchFrom federated to aggregated search
From federated to aggregated search
 
Semantic Search overview at SSSW 2012
Semantic Search overview at SSSW 2012Semantic Search overview at SSSW 2012
Semantic Search overview at SSSW 2012
 
An Introduction to Entities in Semantic Search
An Introduction to Entities in Semantic SearchAn Introduction to Entities in Semantic Search
An Introduction to Entities in Semantic Search
 
A Novel Data Extraction and Alignment Method for Web Databases
A Novel Data Extraction and Alignment Method for Web DatabasesA Novel Data Extraction and Alignment Method for Web Databases
A Novel Data Extraction and Alignment Method for Web Databases
 
Project Panorama: vistas on validated information
Project Panorama: vistas on validated informationProject Panorama: vistas on validated information
Project Panorama: vistas on validated information
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012
 
`A Survey on approaches of Web Mining in Varied Areas
`A Survey on approaches of Web Mining in Varied Areas`A Survey on approaches of Web Mining in Varied Areas
`A Survey on approaches of Web Mining in Varied Areas
 
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routingIEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
IEEE 2014 JAVA DATA MINING PROJECTS Keyword query routing
 
Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
Sekhon final 1_ppt
Sekhon final 1_pptSekhon final 1_ppt
Sekhon final 1_ppt
 
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
 

Similar to Structured data and metadata evaluation methodology for organizations looking to improve image findability on the web emily kolvitz_2014

Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3
Marianne Sweeny
 
Pratical Deep Dive into the Semantic Web - #smconnect
Pratical Deep Dive into the Semantic Web - #smconnectPratical Deep Dive into the Semantic Web - #smconnect
Pratical Deep Dive into the Semantic Web - #smconnect
Jan-Willem Bobbink - Freelance SEO Consultant
 
Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluati...
Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluati...Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluati...
Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluati...
Emily Kolvitz
 
PageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey reportPageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey report
IOSR Journals
 
Introduction abstract
Introduction abstractIntroduction abstract
Introduction abstract
Sanghvi Innovative Academy
 
Web Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics WebWeb Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics Web
Aatif19921
 
3 Understanding Search
3 Understanding Search3 Understanding Search
3 Understanding Search
masiclat
 
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET Journal
 
IRJET - Re-Ranking of Google Search Results
IRJET - Re-Ranking of Google Search ResultsIRJET - Re-Ranking of Google Search Results
IRJET - Re-Ranking of Google Search Results
IRJET Journal
 
Image Based Information Retrieval Using Deep Learning and Clustering Techniques
Image Based Information Retrieval Using Deep Learning and Clustering TechniquesImage Based Information Retrieval Using Deep Learning and Clustering Techniques
Image Based Information Retrieval Using Deep Learning and Clustering Techniques
IRJET Journal
 
Image Based Information Retrieval Using Deep Learning and Clustering Techniques
Image Based Information Retrieval Using Deep Learning and Clustering TechniquesImage Based Information Retrieval Using Deep Learning and Clustering Techniques
Image Based Information Retrieval Using Deep Learning and Clustering Techniques
IRJET Journal
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
Marianne Sweeny
 
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13
DataDryad
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE
 
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A ReviewIRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET Journal
 
Data mining in web search engine optimization
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimization
BookStoreLib
 
SharePoint 2013 search improvements
SharePoint 2013 search improvementsSharePoint 2013 search improvements
SharePoint 2013 search improvements
Kunaal Kapoor
 
Web content mining
Web content miningWeb content mining
Web content mining
Daminda Herath
 
Web Content Mining
Web Content MiningWeb Content Mining
Web Content Mining
Daminda Herath
 
Query Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering ApproachQuery Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering Approach
IRJET Journal
 

Similar to Structured data and metadata evaluation methodology for organizations looking to improve image findability on the web emily kolvitz_2014 (20)

Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3
 
Pratical Deep Dive into the Semantic Web - #smconnect
Pratical Deep Dive into the Semantic Web - #smconnectPratical Deep Dive into the Semantic Web - #smconnect
Pratical Deep Dive into the Semantic Web - #smconnect
 
Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluati...
Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluati...Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluati...
Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluati...
 
PageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey reportPageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey report
 
Introduction abstract
Introduction abstractIntroduction abstract
Introduction abstract
 
Web Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics WebWeb Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics Web
 
3 Understanding Search
3 Understanding Search3 Understanding Search
3 Understanding Search
 
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
 
IRJET - Re-Ranking of Google Search Results
IRJET - Re-Ranking of Google Search ResultsIRJET - Re-Ranking of Google Search Results
IRJET - Re-Ranking of Google Search Results
 
Image Based Information Retrieval Using Deep Learning and Clustering Techniques
Image Based Information Retrieval Using Deep Learning and Clustering TechniquesImage Based Information Retrieval Using Deep Learning and Clustering Techniques
Image Based Information Retrieval Using Deep Learning and Clustering Techniques
 
Image Based Information Retrieval Using Deep Learning and Clustering Techniques
Image Based Information Retrieval Using Deep Learning and Clustering TechniquesImage Based Information Retrieval Using Deep Learning and Clustering Techniques
Image Based Information Retrieval Using Deep Learning and Clustering Techniques
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13Fox-Keynote-Now and Now of Data Publishing-nfdp13
Fox-Keynote-Now and Now of Data Publishing-nfdp13
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
 
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A ReviewIRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
 
Data mining in web search engine optimization
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimization
 
SharePoint 2013 search improvements
SharePoint 2013 search improvementsSharePoint 2013 search improvements
SharePoint 2013 search improvements
 
Web content mining
Web content miningWeb content mining
Web content mining
 
Web Content Mining
Web Content MiningWeb Content Mining
Web Content Mining
 
Query Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering ApproachQuery Recommendation by using Collaborative Filtering Approach
Query Recommendation by using Collaborative Filtering Approach
 

Recently uploaded

制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
ukwwuq
 
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
cuobya
 
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
bseovas
 
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
fovkoyb
 
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
eutxy
 
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
CIOWomenMagazine
 
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
uehowe
 
Explore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories SecretlyExplore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories Secretly
Trending Blogers
 
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaalmanuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
wolfsoftcompanyco
 
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
cuobya
 
7 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 20247 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 2024
Danica Gill
 
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
uehowe
 
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
zyfovom
 
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
ysasp1
 
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
vmemo1
 
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdfMeet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Florence Consulting
 
Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!
Toptal Tech
 
Understanding User Behavior with Google Analytics.pdf
Understanding User Behavior with Google Analytics.pdfUnderstanding User Behavior with Google Analytics.pdf
Understanding User Behavior with Google Analytics.pdf
SEO Article Boost
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Brad Spiegel Macon GA
 

Recently uploaded (20)

制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
制作原版1:1(Monash毕业证)莫纳什大学毕业证成绩单办理假
 
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
可查真实(Monash毕业证)西澳大学毕业证成绩单退学买
 
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
留学学历(UoA毕业证)奥克兰大学毕业证成绩单官方原版办理
 
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
存档可查的(USC毕业证)南加利福尼亚大学毕业证成绩单制做办理
 
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
 
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
Internet of Things in Manufacturing: Revolutionizing Efficiency & Quality | C...
 
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
办理毕业证(UPenn毕业证)宾夕法尼亚大学毕业证成绩单快速办理
 
Explore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories SecretlyExplore-Insanony: Watch Instagram Stories Secretly
Explore-Insanony: Watch Instagram Stories Secretly
 
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaalmanuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
manuaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaal
 
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
假文凭国外(Adelaide毕业证)澳大利亚国立大学毕业证成绩单办理
 
7 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 20247 Best Cloud Hosting Services to Try Out in 2024
7 Best Cloud Hosting Services to Try Out in 2024
 
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
留学挂科(UofM毕业证)明尼苏达大学毕业证成绩单复刻办理
 
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
学位认证网(DU毕业证)迪肯大学毕业证成绩单一比一原版制作
 
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
成绩单ps(UST毕业证)圣托马斯大学毕业证成绩单快速办理
 
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
重新申请毕业证书(RMIT毕业证)皇家墨尔本理工大学毕业证成绩单精仿办理
 
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdfMeet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
Meet up Milano 14 _ Axpo Italia_ Migration from Mule3 (On-prem) to.pdf
 
Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!Ready to Unlock the Power of Blockchain!
Ready to Unlock the Power of Blockchain!
 
Understanding User Behavior with Google Analytics.pdf
Understanding User Behavior with Google Analytics.pdfUnderstanding User Behavior with Google Analytics.pdf
Understanding User Behavior with Google Analytics.pdf
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
 

Structured data and metadata evaluation methodology for organizations looking to improve image findability on the web emily kolvitz_2014

  • 1. Structured Data and Metadata Evaluation Methodology for Organizations Looking to Improve Image Findability on the Web School of Library and Information Studies LIS 5733 Taught by: Dr. Susan Burke Research Proposal Written by: Emily Kolvitz Research Setting: Primarily Geared Towards Online Ecommerce/Business Organizations, but methodology could easily translate to Galleries, Museums, Archives, Libraries (GLAMs) or any institution looking to evaluate their structured data and metadata practices on the world wide web in an effort to improve findability of product offerings, general information or services.
  • 2. Introduction The current state of findability on the web for many organizations is incipient. Search Engine Optimization (SEO) techniques change frequently and remain much a mystery to many companies. The one variable in the equation of web findability that remains a staple is good quality metadata under the hood of the website. This research methodology will allow for : ● An assessment of findability maturity on the web from an image-centric viewpoint ● Help improve findability on the web by establishing a baseline for where your organization is at in terms of structured data content and visualize gaps or areas for improvement from a search engine neutral perspective
  • 3. Introduction ● Most Searches Start with Google now (Holman 2011) (Lippincott 2013) ● Search Algorithms Shaping what is most Easily Accessible (Connaway, Dickey & Radford 2011) and they are subject to change frequently (Kritzinger 2013) ● Search Algorithms Look for Your Structured Data and in the future and possibly your embedded metadata (Cazier 2014) (Beall 2010)
  • 4. Literature Review Marshall Breeding (2013) assesses the limitations of the major search engine algorithms: “But even with the most sophisticated relevancy algorithms, index-based search and retrieval lacks the ability to lead users to the potential related content. Semantic web technologies, in conjunction with repositories of open linked data, promise to deliver significant new capabilities in exploring and exploiting information resources on the web.”
  • 5. Literature Review ● Semantic web is founded on good, high-quality structured data ● Future technologies could potentially utilize embedded metadata in search (Cazier 2014) (Beall 2010) but there is authenticity, provenance and “breadcrumbs” value now (Reicks 2013)
  • 6. Literature Review ● Most users don’t go past the first page of search results (Paz 2013) ● Structured Data Practices can help your organization stay relevant (and findable!) in the age of information overload ● Keeping it Search Engine Neutral is advisable (Paz 2013)
  • 7. Topic/Proposed Research ● Methodology for establishing a baseline or benchmark of where an organization is at in terms of structured data pertaining to image records that ultimately helps findability on the web ● By utilizing the proposed methodology for gathering this data for an organization, data-informed decisions can be made about structured data strategy going forward to maintain relevancy on the web ● Many structured data elements can affect online findability from file-naming standards, presence of alt text tags in html markup, html markup itself, embedded metadata, schema.org markup and rich snippets, text description at or nearby images, and more. IEEE uses metadata or full-text for search (IEEE Xplore offers this--see next slide)
  • 8. Full Text Search & Metadata Search
  • 9. Topic/Proposed Research ● It is also noteworthy that there are additional factors that affect findability on the web that do not involve structured data, but this research focuses solely on structured data techniques within the control of individual organizations. ● All of these structured data techniques pertaining to image records will be utilized in conjunction with the relevancy of onsite and offsite search results. ● Image search and information retrieval is a more difficult area than text search and retrieval because accessibility to the image content is largely dependent on side-car text (or metadata if you will) that describes the aboutness and (hopefully) the context for the image record.
  • 10. Questions Research Questions Addressed in this Study 1. What methods of search are available on the organization’s online website? 1. What is the file-naming structure for images on the website? 1. What is the quality of search engine (onsite and offsite) results? 1. What kinds of search results appear in Image Search when searching by the organization’s name and product description both with onsite search and offsite search?
  • 11. Questions Research Questions Addressed in this Study 5. What kinds of search results appear in Google Image Search when searching by images taken from the organization’s website? 5. What kinds of search results come up when looking for specific products (measure of structured data) through onsite search and offsite search? 5. What are the results when looking for specific products on the offsite search engine?
  • 12. Questions Research Questions Addressed in this Study 8. What kinds of structured data are near or around the images on the organization’s website? Alt Text? Other? 9. What file types appear on the organization’s website? (JPEG? TIFF? PNG?) 9. What embedded metadata is available in images on the website? 11. What does the XMP/XML/RDF for these images look like and how robust is it? What does the graph look like?
  • 13. Variables Quality and number of alt text tags Type of page the image was on Level of description for the filename Quality and number of structured data tags pertaining to the images The image file naming convention/filename Quality and number of embedded metadata tags Quality and number of search results for onsite search utilizing filename or alt text Quality and number of relevant search results utilizing offsite image search These measures are operationalized by utilization of likert scales applied by the human researcher. For example, when rating the level of description for the file-name, a research could conclude that the filename sp_18379847923.jpg is not very descriptive filename for a human, let alone for a search engine (unless of course this is a product sku.) The researcher would then choose to assign it a low value on descriptiveness on a 1-5 likert scale.
  • 14. Data Collection Methods Participants Participants will include a single institution, anonymized for the protection of their business. The sample of image records utilized in this study will be limited to image assets appearing on the organization’s website domain. Most data collection can take place from the organization’s website itself. Some procedures will take place on external sites, services, or programs. Randomization of Sample The sample of images utilized in this study can be randomized by extracting a site map of the particular organization of interest using xsitemap.com. After the site map is constructed, the list of URLs should be inputted into a spreadsheet program and a record number should be assigned to each URL. From there, the researcher can use a randomizer program to select the order of pages to utilize in the study (i.e. Research Randomizer Available at: http://www.randomizer.org/form.htm) This method will be utilized for taking a random sample of pages from the organization of interest. Consent All data collected in this study are publicly available and freely available on the web.
  • 15. Data Collection Methods Obtaining Data on the website ● Navigate to the URL ● Right Click Image(s) and “Save As” ● Right Click Page and “View Source” Save as .txt file ● Collect raw data from image by either opening in Photoshop and Navigating to Raw Data Column or utilize Phil Harvey’s ExifTool Obtaining Data through Structured Data Linter ● Navigate to the Linter website ● Enter URL ● Screenshot Structured Data Results -or- save as webpage Obtaining Data through W3C RDF validator ● Copy raw data xml extracted earlier and input into RDF Validator ● Select Graph Only on the Options ● Parse RDF ● Save Graph or Screenshot Graph ● Store in Folder with other Data Answer Research Questions ● Systematically go through the collected data and input findings into spreadsheet
  • 16. Data Analysis Methods ● Descriptive Statistics o Bell Curve - measures towards a central tendency using likert scale data Bell Curve Image By Vierge Marie (Own work) [Public domain], via Wikimedia Commons http://upload.wikimedia.org/wikipe dia/commons/f/f6/Gaussian_Filter .svg
  • 17. Data Analysis Methods ● Graphical Analysis (Charts and Graphs) ● Summary Report ● Discussion of Findings
  • 18. Visualizing the Results The Structured Data Linter, utilizing URLs to display structured data around the images. Available at: http://linter.structured-data.org/ Summary analysis will be crafted utilizing all of these data points to show what we are able to understand about an image versus what a machine or search engine is able to know about an image. W3C RDF Validator Graph Visualization utilizing the raw data markup extracted from the image Available at: http://www.w3.org/RDF/Validator /
  • 19. Structured Data Linter Shows all structured Data Tags around the images and in the page markup
  • 20. RDF Validator Visualization of embedded data for images and their subsequent relationships to other data
  • 21. Summary Report Complete Picture of Structured Data, Metadata and Analysis of Study
  • 22. Expected Outcomes The anticipated results of this project include a benchmark for where this specific organization is at in terms of structured data in the online environment and a methodology for other organizations looking to assess their structured data maturity in the digital space. These results will be used to create a roadmap for improving resource findability both on the web and within websites. Other organizations may also aspire to reuse this methodology for assessing their own current state of structured data. Future areas of research could include utilizing metadata/RDF-driven search engines in conjuncture with Vector Space Models to assess findability of image records on the web and within websites.
  • 23. References (Slides & Full Paper) Algebraix Data, Corporation. 0005. "Algebraix Data Launches Industry’s First Cost-Effective Automated Implementation of Schema.org." Business Wire (English), 5. Beall, Jeffrey. 2010. "How Google Uses Metadata to Improve Search Results." Serials Librarian 59, no. 1: 40-53. Breeding, Marshall. 2013. "Linked Data: The Next Big Wave or Another Tech Fad?." Computers In Libraries 33, no. 3: 20-22. Cafarella, M.J., Halevy, A.Y., Zhang, Y., Wang, D.Z., and Wu, E. Uncovering the relational Web. In Proceedings of the 11th International Workshop on the Web and Databases (Vancouver, B.C., June 13, 2008). http://web.eecs.umich.edu/~michjc/papers/webtables_webdb08.pdf Connaway, Lynn Sillipigni, Timothy J. Dickey, and Marie L. Radford. 2011. "“If it is too inconvenient I'm not going after it:” Convenience as a critical factor in information-seeking behaviors." Library & Information Science Research (07408188) 33, no. 3: 179-190.
  • 24. References (Slides & Full Paper) Cazier, Clay, 2014. PM Digital Marketing Blog “The Future of Exif Image Data” Last accessed November 20, 2014. http://www.pmdigital.com/blog/2014/04/future-exif-image-data/ Diagram Center: Digital Image and Graphic Resources for Accessible Materials , 2014. “Content Model” Last Accessed November 23, 2014. http://diagramcenter.org/standards-and-practices/content-model.html Google. 2014. “Image Publishing Guidelines” Last accessed November 21, 2014. https://support.google.com/webmasters/answer/114016?hl=en Holman, Lucy. 2011. "Millennial Students' Mental Models of Search: Implications for Academic Librarians and Database Developers." Journal Of Academic Librarianship 37, no. 1: 19-27
  • 25. References (Slides & Full Paper) International Business, Times. 0006. "Bing,Google and Yahoo merge to make search easier with schema.org." International Business Times, April. IPTC International Press Telecommunications Council, 2014. “Embedded Metadata Manifesto” Last accessed November 20, 2014. http://www.embeddedmetadata.org/social-media-test-results.php (Embedded Metadata Manifesto 2014). Kritzinger, W. T. "Search Engine Optimization and Pay-per-Click Marketing Strategies." Journal of Organizational Computing and Electronic Commerce, no. 3 (2013): 273-86. Lippincott, Joan K. “Net Generation Students and Libraries,” EDUCAUSE (2005), accessed November 19, 2014, http://www.educause.edu/research-and-publications/books/educating-net-generation/net-generation-students-and-libraries
  • 26. References (Slides & Full Paper) Nakanishi, T., "Semantic Context-Dependent Weighting for Vector Space Model," Semantic Computing (ICSC), 2014 IEEE International Conference on , vol., no., pp.262,266, 16-18. June 2014. doi: 10.1109/ICSC.2014.49 Paz, Anita. 2013. "In search of Meaning: The Written Word in the Age of Google." Italian Journal Of Library & Information Science 4, no. 2: 255-266. Priebe, T.; Schlager, C.; Pernul, G., "A search engine for RDF metadata," Database and Expert Systems Applications, 2004. Proceedings. 15th International Workshop on , vol., no., pp.168,172, 2004. doi: 10.1109/DEXA.2004.1333468 Reicks, David. 2010. “Why Embedded Metadata Won’t Help Your SEO,” Last Updated December 30, 2013. Last Accessed November 23, 2014. http://www.controlledvocabulary.com/blog/embedded-metadata-wont-help-seo.html

Editor's Notes

  1. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  2. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  3. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  4. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  5. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  6. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  7. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  8. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  9. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  10. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  11. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  12. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  13. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  14. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  15. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  16. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  17. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  18. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  19. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  20. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  21. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  22. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  23. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  24. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  25. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com
  26. Full Paper Available. Please Contact Emily Kolvitz at kolvitz1@gmail.com