The presentative gives research findings from the Research Libraries Group (RLG) on Social Metadata Working Group. The group worked from 2009-2010 researching sites that used social media features before making some recommendations to libraries, archives and museums.
Our original focus. Now lets just take a look at what’s happening regarding social metadata in our planet…..
DG of NLA May 2010 said in a web 2.0 strategy group meeting – “ Run free. I endorse chaos, failure and trial an error. I don’t want to impose controls from above, that would stifle creativity and new ideas. No idea is too silly, try it… Everything cannot be centrally controlled, that is unrealistic. Do not have fears and anxieties about mistakes, Don’t put boundaries around how you will work .”
The RLG Partners Social Metadata Working Group is one of the largest we’ve ever had, indicative of the great interest in this topic among the RLG Partnership. We have 21 RLG Partner staff from five countries, from a wide range of different institutions and staff with various functions.
Will now go through some of the highlights with you starting with website reviews.
Website reviews 6 to follow. Netherland Inst for Sound and Vision – Archive. Video tagging game – for videos and parts within.
“ In advance of an exhibition of Wedding Dresses in 2013 we are creating a database of photographs of clothes worn for weddings from all cultures between 1840 and the present. We include civil partnerships. This database will provide a rich record and help people date their own photographs.”
Washington State University. Members of the public who become registered users have the ability to make their own collections, add comments and add tags. Tribes can also upload their own materials to the portal, using the administrative side of the portal, allowing then to decide the level of access to their own private collections.
Users can indirectly add photos, tags, and comments to Kew's image collection through Kew’s two Flickr sites: Your Kew on Flickr , where users can share their photos of Kew Gardens and Wakehurst Place, and People's Arboretum on Flickr , where users are asked to upload photos of trees in Kew Gardens, Wakehurst Place, and around the world. Kew encourages users of their Flickr groups to add description to the images, and Kew states that contributions of images and description may be repurposed on the Kew Garden site. Flickr, Twitter, Facebook, and blogs to engage a diverse audience, including families, plant lovers, art lovers, conservationists, and scientists.
The crowd-sourcing correction of OCR’d text is impressive. On any result set you can see which texts have been corrected and compare the corrected text with the original. Tagging. High usage.
Potential, tagging, commenting, uploading photos, forum. Impressive aggregation of content – open to public. Only 6 – a further 70 were reviewed….
LibraryThing – can join and catalogue your books easily. Use tags for loans/collections no circ. Or buy LibraryThing for Libraries. Make use of the 64 million tags in your catalogue, reviews, 2 million user uploaded cover art. 1600 libs are doing this so far.
Flickr. Normal account – often used for org publicity and new shots, or do something a bit different eg Oregan state uni has put archive photos on flickr map, PA collections public images in Flickr group, or Flickr Commons – aimed at large institutions to make more widely available public domain photos from collections. Increase exposure. E.g 500 photos from nlnz get 500,000 views in 2 years (1000 views per day) = same views as 100,000 images on their own site. Users can add their knowledge content and tool to draw around items in pic is useful.
Youtube. 1. Educate your users – screencasts/tutorials; 2. promote events and exhibitions; 3. promote collections, 4. post archival footage/clips. Can set up channels easily. Youtube allows blatant advertising and bias.
Facebook. Community building. Can link your applications to facebook.’like this’ popular feature. Get a ground swell of opinion. 1. Engage community with org, 2. Events, announcements, news 3. More novel ideas e.g. Getty Museum Illuminated manuscripts image game – name that saint and his instrument of martydom. US Archives Recovery Team – thefts and recovered items – your stuff has been stolen!!
Twitter. Brief snippets of infor- followers, re-tweet. 1. Events, announcements 2. Collections. 3. Creative ideas e.g. Scott Polar Research Institute posting diary of Captain Scott expedition – linked to full version on home page, CDL – John Muir handwritten letters.
Wikipedia – 1. Org page, 2. articles on topics – populate. Must be unbiased. Until recently libs and archives were not able to create article/collection pages and link to their websites. Now there is a guideline. Wikipedians want open access to be able to use images to illustrate articles e.g. DG NLA agreed for Wikipedians to be able to use any image in NLA collection for this purpose – example mutiny on the bounty. First image is National Maritime museum second is list of mutineers NLA. Wikipedian in residence – Liam Wyatt VP Wikimedia Aus and Smithsonian following suit.
Blogs – the usual, org, collections, daily life, books, but some are creative….e.g. University of Kentucky Archives have a large collection of old photos – many of them with mustached men. They put just the moustache photos onto a moustache blog and gave very amusing descriptions to them – had an instant following…
The site managers who responded to the survey come from seven countries. Responses from U.S. site managers represent the majority (60%). Eight responses came from Australia, four from the United Kingdom, two from New Zealand, and one each from Germany, the Netherlands, and Spain. Site types: Library, archive, museum, community, discipline
More than 70% had been offering social media features for two years or less. Four sites were not even public yet at the time of the survey. On the other hand, eight sites (19%) have been offering social media features for four years or more.
Building community is a key interest across all types. Academic libraries and archives tend to be more interested in increasing traffic to their sites, providing better access to their content, and enhancing description. They are less interested in acquiring additional content from other sources. National- and state-level institutions are more likely to seek additions to their collections. The other responses came from museums interested in inspiring visitors and getting them more involved with exhibits and museum activities. Measuring success was largely subjective. The top three data elements captured are comments (76%), unique visitors (67%), and visits (64%), which are relatively easy to measure. All thought their sites were successful even if they had not yet figured out how to measure quantity or quality.
We offered a list of nineteen social media and user contribution features and asked respondents to select the ones they offered, with an option to describe a feature not listed. The top three features used by the 39 site managers who responded were comments (85%), tagging (67%), and RSS feeds (54%). Frequency of RSS feeds may be because so many open source and off-the-shelf software packages offer it ‘out of the box.’ Annotations (37%), upload materials (31%), user profiles (28%), user-contributed images (26%), bookmarks (21%), reviews (21%) and ratings (21%) made up the midrange. At the bottom of the scale are collaborative filtering and synchronous chat at 3% . This may i ncrease in the near future as social media tools such as Twitter and Facebook drive this form of interaction deeper into the public’s toolkit.
Purple = 3 sites only over 1000 contributors per month. Australian Newspapers, Distributed Proofreaders, and WorldCat.org.
We were surprised that 72% (26/36) of the respondents were not concerned about the way the site’s content is used or repurposed. Perhaps the individuals in an organization who are most concerned with data privacy and security were not those who responded to the survey. Sites that focused on music content were among those who expressed concern about the way the content was shared. Sites where scholars share their original work also have some concern. No definitive answer on integrating metadata. 39% of respondents said that they incorporated metadata into their own descriptive processes. Therefore two-thirds did not, surprising since 60% of all respondents said that improving description was one of their key motivations for offering social media features. So it is just the search that is important?
The monitoring practices are apparently successful because the spam and abuse rate is low. Only two sites reported that spam represents a serious problem; nine reported spam as an “occasional problem.” Cultural heritage organizations seem to be unlikely spam targets. Only 36% of the respondents reported abusive user contributions, which happened a few times a year or less in over half of the cases. Only three sites—AcaWiki, Digital NZ Search, and WorldCat.org—reported abusive contributions as often as “a few times per week.” More than half of the respondents who reported abuse on their sites blocked future contributions after the first infraction. These results imply that abusive user behavior is sporadic and easily managed, which should be especially encouraging to resource-strapped cultural heritage institutions.
Questions about staffing were perhaps not as clear as they should have been. The survey went to different orgs and different professionals. This table shows staff roles for those managing site.
Most had to revise or implement new policies and guidelines. Only four of the 35 respondents (11%) said that they had not implemented any policies. The majority of sites (63%) were concerned with appropriate behavior. 57% of the sites retain the right to edit or remove content; almost all of them are sites that incorporate user content. 17 had policies that were extensions of existing institutional policies and 19 had created new ones as the result of situations arising from the site. LAMs are making efforts to maintain a safe environment for users (with particular attention to under-age users) by encouraging/enforcing acceptable community behavior and appropriate content; safeguard users' privacy; indemnify or otherwise protect the institution; and upholding professional ethics and laws, particularly in regard to providing equal access and protecting intellectual property rights.
Social metadata for libraries, archives and museums: Research findings from the RLG Partners Social Metadata Working Group, October 2010
Social Metadata for Libraries, Archives and Museums. Research findings from the RLG Partners Social Metadata Working Group. Rose Holley [email_address] Karen Smith-Yoshimura [email_address] Libraries Australia Forum Canberra October 20, 2010 http://www.oclc.org/research/activities/aggregating/
Terminology: What are we talking about? <ul><li>Social media/networking </li></ul><ul><li>Ways for people to communicate online with each other e.g. Twitter, Facebook, Blogs. </li></ul><ul><li>User Generated Content (UGC) </li></ul><ul><li>Things produced by users rather than owners of the site e.g. image, video, text AND metadata – tags, comments, notes. </li></ul><ul><li>Social Metadata </li></ul><ul><li>Additional information about a resource given by online users e.g. tags, comments. </li></ul><ul><li>Social Media Features </li></ul><ul><li>Interactive features added to a site that enable virtual groups to build and communicate with each other and social metadata to be added. </li></ul><ul><li>Social Engagement </li></ul><ul><li>User interaction online e.g. communication between users, from users to site owners, from users with objects/resources. </li></ul><ul><li>Web 2.0 </li></ul><ul><li>Online applications that facilitate interactive rather than passive experiences. </li></ul>
Social Metadata Working Group Focus <ul><li>User contributions that can enrich the descriptive metadata created by libraries, archives, and museums. </li></ul><ul><li>Issues that need to be resolved to communicate and share user contributions on the network level. </li></ul>
Woohoo! I have a job!!! http://www.slideshare.net/thebrandbuilder/olivier-blanchard-basics-of-social-media-roi (Adapted from)
Dudes, we are ON THIS!!! Let’s start engagin’!!! I call dibs on the Library blog. http://www.slideshare.net/thebrandbuilder/olivier-blanchard-basics-of-social-media-roi (Adapted from) I’m a man of few words… Tweet!
All systems engage! Engage, full throttle. Mission commence. We have liftoff! We have liftoff! Crickey! I don’t know what I’m doing!!! http://www.slideshare.net/thebrandbuilder/olivier-blanchard-basics-of-social-media-roi (Adapted from)
Oh my! Look at all the new visitors to our website! and all of our FaceBook friends! Hot Damn, we even have comments on the blog! They’re tagging & commenting too! http://www.slideshare.net/thebrandbuilder/olivier-blanchard-basics-of-social-media-roi (Adapted from)
Oh wow. How am I going to measure social engagement - impressions and eyeballs? http://www.slideshare.net/thebrandbuilder/olivier-blanchard-basics-of-social-media-roi (Adapted from)
How long will all this analysis take? It’s all a process of elimination, really. Isolating patterns, quantifying deltas, proving ad-hocs… Then all we have to do is figure out what works, what doesn’t, and give our recommendations to the captain... http://www.slideshare.net/thebrandbuilder/olivier-blanchard-basics-of-social-media-roi (Adapted from)
The Wild West of Social Metadata for Libraries, Museums and Archives <ul><li>Don’t do it… </li></ul><ul><li>Do it with caution…. </li></ul><ul><li>Experimentation….. </li></ul><ul><li>Do a bit of everything – the ‘WILD WEST’ – no rules </li></ul><ul><li>Now: Review what we learnt and consolidate - plan for future, structure. </li></ul>“ With a gay bandanna around his neck, the modern cowboy presents a vivid picture in boots and spurs, and is just as skilful as an old time ‘puncher’”.
Our Research Aims ~20 QUESTIONS… <ul><li>Objectives of Social Metadata? </li></ul><ul><li>How we measure success? </li></ul><ul><li>What UGC is of most value? </li></ul><ul><li>Good examples of sites? </li></ul><ul><li>Best practice – policy, guidelines? </li></ul><ul><li>Staffing? </li></ul><ul><li>Moderation? </li></ul><ul><li>Taxonomies and vocabularies? </li></ul><ul><li>Integration/sharing of social metadata? </li></ul><ul><li>Software, technology, functionality? </li></ul>
Who we are: 21 staff from 5 countries <ul><li>Drew Bourn, Stanford </li></ul><ul><li>Douglas Campbell, National Library of New Zealand </li></ul><ul><li>Kevin Clair, Penn State </li></ul><ul><li>Chris Cronin, U. Chicago </li></ul><ul><li>Christine DeZelar-Tiedman, U. Minnesota </li></ul><ul><li>Mary Elings, UC Berkeley </li></ul><ul><li>Steve Galbraith, Folger </li></ul><ul><li>Cheryl Gowing, U. Miami </li></ul><ul><li>Rose Holley, National Library of Australia </li></ul><ul><li>Rebekah Irwin, Yale </li></ul><ul><li>Lesley Kadish, Minnesota Historical Society </li></ul><ul><li>Helice Koffler, U. Washington </li></ul><ul><li>Daniel Lovins, Yale </li></ul><ul><li>John Lowery, British Library </li></ul><ul><li>Marja Musson, International Institute of Social History </li></ul><ul><li>Henry Raine, New-York Historical Society </li></ul><ul><li>Cyndi Shein, Getty </li></ul><ul><li>Ken Varnum, U. Michigan </li></ul><ul><li>Melanie Wacker, Columbia </li></ul><ul><li>Kayla Willey, Brigham Young </li></ul><ul><li>Beth Yakel, U. Michigan, School of Information </li></ul><ul><li>Staffed by Jean Godby, John MacColl, Karen Smith-Yoshimura </li></ul>
Our Method and Process <ul><li>Identify questions </li></ul><ul><li>Find websites relevant for GLAM and review (76 sites) </li></ul><ul><li>Read, listen, observe and share (200 items) </li></ul><ul><li>Develop questionnaire for website managers and send out </li></ul><ul><li>Analyse results (42 returned) </li></ul><ul><li>Discuss all findings and write up </li></ul><ul><li>Develop recommendations </li></ul>
Our Techniques and Timing <ul><li>Timeline 2009 - 2010 </li></ul><ul><li>Sub working groups (timezones and interests) </li></ul><ul><li>Teleconferences </li></ul><ul><li>Basecamp – project management and collaboration software tool </li></ul>
Our Results <ul><li>Report 1 – Website reviews, and use of third party sites (150 pages) </li></ul><ul><li>Report 2 – Analysis of website manager survey results (50 pages) </li></ul><ul><li>Report 3 – Recommendations for social metadata and bibliography </li></ul><ul><li>Expected date of publication: November 2010 </li></ul><ul><li>NOW FOR THE PREVIEW…. </li></ul>
Use of third party sites <ul><li>LibraryThing for Libraries (LTFL) </li></ul><ul><li>Flickr and Flickr Commons </li></ul><ul><li>Youtube </li></ul><ul><li>Facebook </li></ul><ul><li>Twitter </li></ul><ul><li>Wikipedia </li></ul><ul><li>Blogs </li></ul>
Recommendations <ul><li>Prepare/train staff </li></ul><ul><li>Policies, skills, interest level. </li></ul><ul><li>Consider benefits/trade offs of using third party sites e.g. Flickr, LibraryThing </li></ul><ul><ul><li>Low cost, quick implementation, high visibility, be where your community is. </li></ul></ul><ul><ul><li>No control over how presented, no guarantee of stability/preservation, policies may change, how to get social metadata back to your site? </li></ul></ul><ul><li>Consider open source software </li></ul><ul><li>Do not worry about spam/abuse, issues – Go Ahead! </li></ul><ul><ul><li>Very little seen – fear not reality. Strategies to reduce risk (users register, take down policy, Captcha, high visibility of users and actions, user profiles open, be explicit about what you are doing and why). </li></ul></ul>
Recommendations <ul><li>Usability testing </li></ul><ul><ul><li>Continuous throughout – what works, what doesn’t. Develop with users </li></ul></ul><ul><li>Display AND index social metadata and UGC </li></ul><ul><li>Consider if/how you want to integrate UGC with your own content. </li></ul><ul><ul><li>Layers – user interface, layers behind, integrate? </li></ul></ul><ul><li>Measures for success </li></ul><ul><ul><li>Quantitative/qualitative, subjective/objective </li></ul></ul><ul><ul><li>Return on Investment </li></ul></ul>
Recommendations <ul><li>Use social networking features to build community </li></ul><ul><ul><li>Who is online, contact other users, user profiles, recommendations from other users </li></ul></ul><ul><li>Use persistent identifiers and make them visible </li></ul><ul><li>Site, objects resources (both site owners and UGC) </li></ul><ul><li>Ability to migrate/manage content (especially if using third party) </li></ul><ul><ul><li>Can you migrate to another place, how to manage/delete/modify UGC? </li></ul></ul><ul><li>Get content indexed by Google so users find it </li></ul>
Recommendations <ul><li>Site to be alive – New content </li></ul><ul><li>Make sure visible and new content can be yours or users </li></ul><ul><li>Respond quickly to feedback </li></ul><ul><li>open channels of communication with users </li></ul><ul><ul><ul><li>“ makes me feel like I have a stake in the collections” </li></ul></ul></ul><ul><ul><ul><li>“ self-aggrandizing” </li></ul></ul></ul><ul><ul><ul><li>“ my feedback makes things happen” </li></ul></ul></ul>
QUESTIONS? RLG Social Metadata Working Group Rose Holley [email_address] Karen Smith-Yoshimura [email_address] http://www.oclc.org/research/activities/aggregating/ Do we know what we’re doing now? It’s all in the report captain! Credits: UFO Series http://ufoseries.com/index.html