Worldwide Presence MSR India MSR New England Redmond• Redmond, Washington Sept 1991• Cambridge, United Kingdom July 1997 MSR Cambridge, UK• Beijing, China Nov 1998• Silicon Valley, California July 2001 MSR Asia (Beijing)• Bangalore, India Jan 2005• Cambridge, Massachusetts July 2008• New York City, NY May 2012 Silicon Valley, California
…Thus far we seem to be worse off than before—for we canenormously extend the record; yet even in its present bulk wecan hardly consult it. This is a much larger matter than merelythe extraction of data for the purposes of scientific research; itinvolves the entire process by which man profits by hisinheritance of acquired knowledge. The prime action of use isselection, and here we are halting indeed. There may bemillions of fine thoughts, and the account of the experience onwhich they are based, all encased within stone walls ofacceptable architectural form; but if the scholar can get atonly one a week by diligent search, his syntheses are not likelyto keep up with the current scene… As We May Think by Vannevar Bush The Atlantic, July 1945 http://www.theatlantic.com/doc/194507/bush
According to study called How Much Information bythe University of California at San Diego,“…consumption totaled 3.6 zettabytes and 10,845trillion words, corresponding to 100,500 words and34 gigabytes for an average person on an averageday. A zettabyte is 10 to the 21st power bytes, amillion million gigabytes. These estimates are froman analysis of more than 20 different sources ofinformation, from very old (newspapers and books)to very new (portable computer games, satelliteradio, and Internet video)."[Note: Information at work is not included!]
“It’s not information overload. It’s filter failure.” Clay Shirky at Web 2.0 Expo 2008
http://en.wikipedia.org/wiki/Big_data• information technology  data sets  • [update] petabytes exabytes  meteorology genomics  connectomics   Internet search finance business informatics• remote sensing radio-frequency identification •  [update] quintillion 
Present The Future: an Explosion of DataExperiments Simulations Archives Literature Instruments The Challenge: Enable Discovery Petabytes Deliver the capability to mine, search and analyze this data in near real-time.
DataUp:Data Curation Add-in for Microsoft Excel•• California Digital Library’s Curation Center • • DataONE•
• Figshare is the first online repository for storing and sharing all of your preliminary findings in the form of individual figures, datasets, media or filesets. Post preprint figures on Figshare to claim priority and receive feedback on your findings prior to http://figshare.com/ formal publication.• Figshare allows researchers to publish all of their research outputs in seconds in an easily citable, sharable and discoverable manner. All file formats can be published, including videos and datasets that are often demoted to the supplemental materials section in current publishing models.• Figshare uses Creative Commons licensing to allow frictionless sharing of research data while allowing users to maintain their ownership. Figshare gives http://www.digital-science.com/ users unlimited public space and 1GB of private storage space for free.
••• • http://datacite.org/ • • http://databib.org/• Registry of research data • repositories (hosted by • [Digital Purdue University) Object Identifier] • •
Developed by the Institute of Quantitative Social Science (IQSS) at Harvard University http://thedata.org/
WorldWideScience.org is a global science gateway connecting you to national andinternational scientific databases and portals. WorldWideScience.org acceleratesscientific discovery and progress by providing one-stop searching of global sciencesources. The WorldWideScience Alliance, a multilateral partnership, consists ofparticipating member countries and provides the governance structure forWorldWideScience.org.WorldWideScience.org was developed and is maintained by the Office of Scientific andTechnical Information (OSTI), an element of the Office of Science within the U.S.Department of Energy. Please contact firstname.lastname@example.org if yourepresent a national or international science database or portal and would like yoursource searched by WorldWideScience.org. In 3+ years since launch, the site has grown to 65+ countries, 400+ million pages – 96.5% of which is *not* available via commercial search engines – and can now be translated into multiple world languages (on demand).
"Data Services for the Sciences: A Needs Assessment” Study by Brian Westra (University of Oregon, July 2010) 1. Data storage and backup 2. Making this data findable by others 3. Connecting data acquisition to data storage 4. Allowing or controlling access to this data by others 5. Documenting and tracking updates to the asset 6. Data analysis/manipulation 7. Finding and accessing related data from others 8. Connecting data storage to data analysis 9. Linking this data to publications or other assets 10. Insuring data is secure/trustworthy 11. OtherWestra, B. "Data Services for the Sciences: A Needs Assessment“ (30-July-2010) Ariadne Issue 64 [URL:http://www.ariadne.ac.uk/issue64/westra/]
Workforce Demandand Career Opportunities in University and Research LibrariesNAS Symposium on Digital Curation Anne R. Kenney July 19, 2012
7 NEW ROLES FOR LIBRARIANS* 1. Acquisitions and Rights Advisors 2. Instructional Partners in Learning Spaces 3. Observers/anthropologists of Information Users and Producers 4. Systems Builders 5. Content Producers and Disseminators 6. Organizational Designers 7. Collaborative Network Creators and Participants Walters and Skinner, New Roles for New Times: Digital Curation for Preservation, ARL, Mar 2001
RATINGS OF IMPORTANCE AND FREQUENCY OFESCIENCE INTERNSHIP TASKS From Youngseek Kim, et al, “Education for eScience Professionals”, IJDC 6:1 (2011) http://www.ijdc.net/index.php/ijdc/article/view/168
MOST SIGNIFICANT SKILLS GAPS(CONTINUED) 5. Knowledge to advise on data mining 6. Knowledge to advocate, and advise on, the use of metadata 7. Ability to advise on the preservation of project records, e.g. correspondence 8. Knowledge of sources of research funding to assist researchers to identify potential funders 9. Skills to develop metadata schema, and advise on discipline/subject standards and practices, for individual research projects Mary Auckland, “Re-skilling for Research,” RLUK, January 2012
REQUISITE EXPERTISE FOR DIGITALHUMANITIES AND SOCIAL SCIENCESRequisite ExpertiseDomain/subject expertiseAnalytical expertiseData expertiseProject management expertise Williford and Henry, “One Culture: Computationally Intensive Research in the Humanities and Social Sciences,” CLIR, 2012
39 schools worldwide and growing http://www.ischools.org/The iSchools organization was founded in 2005 by a collective of Information Schools dedicated to advancing theinformation field in the 21st Century. These schools, colleges, and departments have been newly created or are evolvingfrom programs formerly focused on specific tracks such as information technology, library science, informatics, andinformation science. While each individual iSchoolhas its own strengths and specializations, together they share afundamental interest in the relationships between information, people, and technology.
Digital Curationas a Core CompetencySymposium on Digital Curation in the Era of Big Data:Career Opportunities & Educational RequirementsJuly 19, 2012Dean Elizabeth D. LiddyiSchool, Syracuse University
5 Stages of the Data Life Cycle Data Archiving / PreservationData Presentation / Visualization Data Analytics Data ManagementData Collection
Three Additional Vital Competencies Data Archiving / Preservation Data Presentation / Visualization Data Analytics Data Management Data Collection
Preserving Access to Our Digital Future: Building an International Digital Curation CurriculumDigCCurr Matrix Competencies for Curators http://www.ils.unc.edu/digccurr/digccurr-matrix.html/
http://digital-scholarship.org/dcrg/dcrg.htm Available under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
The Opportunity Before Us• Seek out and initiate data projects – Cross-domain partnerships – Enhance broad availability• Pursue value-added services – Data storage and backup services – Enhancing data mark-up and findability – Securing/controlling access to data – Maintaining provenance – Developing analytical and visualization tools – Seeking related data/research – Hosting and linking data to publications/assets – Ensuring that data is preserved for the long-term• Grow your people – Invest in training your existing staff – Change the technical profile of who you hire – Support the evolution of how we educate the field 60
– Helen Harknesshttp://www.amazon.com/Career-Chase-Creative-Control-Chaotic/dp/0891060987
"If you dont like change, youregoing to like irrelevance even less.“ —General Eric Shinseki Retired United States Army four-star general, currently US Secretary of Veterans Affairs
Paintings by Xiaoze XieXiaoze Xie immigrated from the People’s Republicof China in 1992, where he was born and studiedart and architecture. He has MFA degrees fromBeijing and North Texas University, and taught atBucknell University before assuming his currentpost at Stanford. His works are in the collections ofthe Museum of Fine Arts, Houston, the ScottsdaleMuseum of Contemporary Art and distinguishedprivate collections.Xie’s oil paintings bring together serene qualities oftraditional still-life painting and photography. Fromthe long tradition of still-life painting he employs arich and selected palette to represent the books,which take on a nearly symbolic role. Webpage: http://art.stanford.edu/profile/Xiaoze+Xie/ Email: email@example.com
“We’d now like to open the floorto shorter speeches disguised as questions.” Published in The New Yorker 10/18/2010 by Steve Macone
Thank you! Lee Dirks Directory, Portfolio Strategy Microsoft Research | Connections firstname.lastname@example.org