Leveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO Data: Visualizations, queries and reports. Paul Albert, paa2013@med.cornell.edu, Weill Cornell Medical College
Step #1: Get good [exceptional] data. Step #2: Create something useful.
What can we really learn from bad data?
Administrators: no fans of the error bar, are they?
Native Visualizations in VIVO
• Co-author visualization
• Publication sparklines
• Map of Science
• Co-investigator visualization
• Temporal graphs
Caveats for publication visualizations
• Is the data complete?
• Map of Science looks at the topic of the journal, not necessarily the article
• Agnostic about:
  - type of publication (e.g., editorial, academic article)
  - significance of finding
  - author contribution
Westheimer's Law: A few months in the laboratory can save a few hours in the library.
Corollary of Westheimer's Law: A few hours of SPARQL query construction can save you a few seconds of searching.
This does not work.
An object property should take the user to all faculty members who attended Tufts.
Strongly consider linking out your data properties as well.
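A minimal sketch of why the link matters, using plain Python tuples in place of a triple store. All URIs and property names here (`ex:almaMaterName`, `ex:attended`, `org:tufts`) are hypothetical and are not VIVO's actual ontology terms. When the school is stored as a free-text value, each spelling is its own dead end; when it is a shared resource, one lookup finds everyone.

```python
# Hypothetical triples for illustration only; not VIVO's real ontology terms.
TRIPLES = [
    # Data property: the school is a plain string, so variant spellings never meet.
    ("person:1", "ex:almaMaterName", "Tufts University"),
    ("person:2", "ex:almaMaterName", "Tufts Univ."),
    # Object property: both people point at the same resource.
    ("person:3", "ex:attended", "org:tufts"),
    ("person:4", "ex:attended", "org:tufts"),
]


def people_linked_to(triples, predicate, obj):
    """Return the subjects whose predicate points at exactly this object."""
    return sorted(s for s, p, o in triples if p == predicate and o == obj)


# The shared resource reaches every attendee in one step.
via_object = people_linked_to(TRIPLES, "ex:attended", "org:tufts")
# The string match only reaches people with the identical spelling.
via_string = people_linked_to(TRIPLES, "ex:almaMaterName", "Tufts University")
```

Here `via_object` holds both linked people, while `via_string` misses the person whose record says "Tufts Univ.".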
The tool of choice for fancy questions: SPARQL Query Builder
Most needs for data are not articulated.
Articulated question #1: Show me a current list of publications by the following people.
Articulated question #2: Which researchers have published the most research articles within a given set of basic science journals within the last five years?
Articulated question #3: How many journal articles have researchers published between October 2011 and March 2012?
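Once publication dates have been exported from VIVO, question #3 reduces to a date-window count. The sketch below assumes the dates are already available as Python `date` objects; the function name and the sample dates are made up for illustration.

```python
from datetime import date


def count_in_window(pub_dates, start, end):
    """Count publication dates that fall in [start, end], both ends inclusive."""
    return sum(1 for d in pub_dates if start <= d <= end)


# Made-up sample dates, not real publication data.
sample = [
    date(2011, 9, 30),   # one day too early
    date(2011, 10, 1),   # first day of the window
    date(2012, 1, 15),
    date(2012, 3, 31),   # last day of the window
    date(2012, 4, 1),    # one day too late
]

count = count_in_window(sample, date(2011, 10, 1), date(2012, 3, 31))
```

The window is treated as inclusive on both ends, so "between October 2011 and March 2012" covers 1 October through 31 March; a different reading of the question would need different bounds.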
Articulated question #4: What is my H-index?
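For reference, the h-index is the largest number h such that the researcher has h papers with at least h citations each. A minimal sketch, assuming the per-paper citation counts have already been gathered from some citation source (VIVO itself is not assumed to supply them):

```python
def h_index(citation_counts):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank   # the top `rank` papers all have at least `rank` citations
        else:
            break      # counts only decrease from here, so h cannot grow
    return h


# Made-up citation counts for five papers.
example = h_index([10, 8, 5, 4, 3])
```

The answer is only as good as the citation counts fed in, which ties back to the earlier point about data completeness.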
Identifying unarticulated research questions: extrapolate, investigate, hypothesize, infer, interview, collaborate
"I have not failed. I've just found 10,000 ways that won't work." Thomas A. Edison. Source: Yahoo Answers
We have proposed several research questions that elicited enthusiasm. Dean’s Representative
Proposed question #1: How has the number of publications co-authored with other institutions changed year to year?
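One way to approach this question, sketched under assumptions: each publication record carries a year and the list of author institutions, and a paper counts as externally co-authored if any listed institution differs from the home institution. The record layout and function names are hypothetical, and the sample records are invented.

```python
from collections import Counter

HOME = "Weill Cornell Medical College"


def external_coauthored_by_year(publications, home=HOME):
    """Count, per year, publications with at least one non-home institution."""
    counts = Counter()
    for pub in publications:
        if any(inst != home for inst in pub["institutions"]):
            counts[pub["year"]] += 1
    return dict(sorted(counts.items()))


def year_over_year_change(counts_by_year):
    """Difference between each year and the previous year that has data."""
    years = sorted(counts_by_year)
    return {y: counts_by_year[y] - counts_by_year[prev]
            for prev, y in zip(years, years[1:])}


# Invented sample records; "Institution X" and "Institution Y" are placeholders.
pubs = [
    {"year": 2010, "institutions": [HOME, "Institution X"]},
    {"year": 2010, "institutions": [HOME]},
    {"year": 2011, "institutions": [HOME, "Institution X"]},
    {"year": 2011, "institutions": [HOME, "Institution Y"]},
    {"year": 2012, "institutions": [HOME, "Institution X"]},
]

by_year = external_coauthored_by_year(pubs)
change = year_over_year_change(by_year)
```

Note that a year with no externally co-authored papers does not appear in `by_year`, so the change is computed against the previous year that has data; a real report would probably want to fill such gaps with zeros first.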
Proposed question #2: Publications appearing in journals of a given impact factor
Proposed question #3: In any given year, which paper has the most incoming citations?
Proposed question #4: Which papers that have received federal funding are not deposited in PubMed Central?
Proposed question #5: Who are our institutions’ open access key opinion leaders?
Proposed question #6: Which PIs have the most collaborations based on grant support?
Proposed question #7: Which faculty have dual appointments?
Modes of Delivery
• Emailed attachments
  - Spreadsheet
  - PDF
  - Word file
  - Bibliographic tool (e.g., .ris file)
• Link to VIVO web interface
• Third-party “data dashboard”
Goals for Reporting
1. Define the key questions. Goal: a set of fundamental questions we ask about our researchers.
2. Allow for more key questions to be addressed via browsing. Ingest more data in key areas and... In areas where VIVO hasn’t quite figured out how to do something important...
3. Create a third-party framework that will ingest VIVO data and readily answer these questions. The framework will be open source and could be incorporated into VIVO in future releases.
Data Dashboard Project for Our Summer CS Students
• build on Miles Worthington’s Drupal/VIVO work
• tackle some of the proposed questions that have not made it into core
• can be customized for different viewers
• collaboration welcome
Questions? Is VIVO Facebook for Researchers? • No, Mendeley is… (Ha) library.weill.cornell.edu