This document summarizes an update on the Big Data to Knowledge (BD2K) initiative at the National Institutes of Health (NIH). It discusses progress made in the first year of BD2K funding in three key areas: advancing data science research through centers and targeted awards; sharing data and software through the development of indexing tools and standards; and expanding training programs. It outlines funding amounts and recipient numbers for fiscal year 2015. Future plans are outlined through 2021 with the goals of further developing tools and applications, expanding the data sharing commons, and increasing training and sustainability efforts.
Big Data in Biomedicine – An NIH PerspectivePhilip Bourne
Keynote at the IEEE International Conference on Bioinformatics and Biomedicine, Washington DC, November 10, 2015.
https://cci.drexel.edu/ieeebibm/bibm2015/
Big Data in Biomedicine – An NIH PerspectivePhilip Bourne
Keynote at the IEEE International Conference on Bioinformatics and Biomedicine, Washington DC, November 10, 2015.
https://cci.drexel.edu/ieeebibm/bibm2015/
SCUP 2016 Mid-Atlantic Symposium: Big Data: Academy Research, Facilities, and Infrastructure Implications and Opportunities. John Hopkins, May 13, 2016
NITRD Big Data Interagency Working Group Workshop: Pioneering the Future of Federally Supported Data Repositories Jan 13, 2021 - Opening comments on where we are and one suggestion of where we might go with an International Data Science Institute (IDSI) - A blue sky view.
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and RealityPaul Courtney
Presentation made by Paul Courtney (Dana-Farber Cancer Institute, Boston, MA and OHSL, MD) and Anil Srivastava (OHSL) at the 2013 VIVO conference in St. Louis, MO. Material contributed by Rubayi Srivastava (OHSL), Swati Mehta (Centre for Development of Advanced Computing, India), Juliusz Pukacki (Poznan Supercomputing and Network Center, Poland) and Devdatt Dubhashi (Chalmers Institute of Technology, Sweden).
Ginny Pannabecker, Life Science & Scholarly Communications Librarian at Virginia Tech, is an ACRL Science and Technology Section (STS) liaison to the American Institute of Biological Sciences (AIBS). This presentation shares key points for librarians and researchers from an AIBS workshop on "Changing Practices in Data Publications," which took place in December 2014 and involved representatives from federal funding agencies; publishers and librarians; scientific societies and journals; and data services / providers.
Poster RDAP13: Data information literacy multiple paths to a single goalASIS&T
Jake Carlson, Jon Jeffryes, Brian Westra and Sarah Wright
Data Information Literacy: Multiple Paths to a Single Goal
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
Maximizing the value of data, computing, data science in an academic medical center, or 'towards a molecularly informed Learning Health System. Given in October at the University of Florida in Gainesville
SCUP 2016 Mid-Atlantic Symposium: Big Data: Academy Research, Facilities, and Infrastructure Implications and Opportunities. John Hopkins, May 13, 2016
NITRD Big Data Interagency Working Group Workshop: Pioneering the Future of Federally Supported Data Repositories Jan 13, 2021 - Opening comments on where we are and one suggestion of where we might go with an International Data Science Institute (IDSI) - A blue sky view.
A VIVO VIEW OF CANCER RESEARCH: Dream, Vision and RealityPaul Courtney
Presentation made by Paul Courtney (Dana-Farber Cancer Institute, Boston, MA and OHSL, MD) and Anil Srivastava (OHSL) at the 2013 VIVO conference in St. Louis, MO. Material contributed by Rubayi Srivastava (OHSL), Swati Mehta (Centre for Development of Advanced Computing, India), Juliusz Pukacki (Poznan Supercomputing and Network Center, Poland) and Devdatt Dubhashi (Chalmers Institute of Technology, Sweden).
Ginny Pannabecker, Life Science & Scholarly Communications Librarian at Virginia Tech, is an ACRL Science and Technology Section (STS) liaison to the American Institute of Biological Sciences (AIBS). This presentation shares key points for librarians and researchers from an AIBS workshop on "Changing Practices in Data Publications," which took place in December 2014 and involved representatives from federal funding agencies; publishers and librarians; scientific societies and journals; and data services / providers.
Poster RDAP13: Data information literacy multiple paths to a single goalASIS&T
Jake Carlson, Jon Jeffryes, Brian Westra and Sarah Wright
Data Information Literacy: Multiple Paths to a Single Goal
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
Maximizing the value of data, computing, data science in an academic medical center, or 'towards a molecularly informed Learning Health System. Given in October at the University of Florida in Gainesville
Earn more money - build your personal brand onlineKatie McGregor
Having a strong personal brand – i.e., a strong professional reputation – means more money:
- you can charge more (or demand a higher salary)
- retain more clients with less work
- attract better business opportunities
- and you will get more referrals, even from people who have never actually used your services.
THE GOOD NEWS is that the internet makes it easier than ever to BUILD AUTHORITY and GROW YOUR REPUTATION so that you can receive these rewards. This is achieved by showcasing your expertise in key “touch points” on the web so that you can be found by your prospects; at the same time you can also be proactive and reach out to targeted audiences to drive your influence.
By Katie McGregor, Conduit Communications
http://www.conduitcomms.com/pbseminar/
Layer Drupal with emerging technologies to create a performant, scalable data purveyor. Modularizing the architecture creates performant applications for all content and all users.
The Regional Marketer's Playbook - Asia Pacific - 2016Ryan Bonnici
Regional marketing is not just marketing at a regional level. It’s a highly nuanced discipline which involves combining hard data with soft skills, top-down strategy with grassroots customer engagement, consistent branding with uniquely local tonality.
The playbook provides insights from some of Asia's leading marketers, including:
- Paula Parkes, Mktg Director, Adobe
- Sandeep Pal, Mktg Director, Oracle
- Ryan Bonnici, Mktg Director, HubSpot
https://business.linkedin.com/marketing-solutions/c/16/4/regional-marketer-playbook
NIH Data Initiatives: Harnessing Big (and small) Data to Improve Health
Presentation at the internet2 Global Forum, April 28, 2015
Session NIH Perspectives
STI 2022 - Generating large-scale network analyses of scientific landscapes i...Michele Pasin
The growth of large, programatically accessible bibliometrics databases presents new opportunities for complex analyses of publication metadata. In addition to providing a wealth of information about authors and institutions, databases such as those provided by Dimensions also provide conceptual information and links to entities such as grants, funders and patents. However, data is not the only challenge in evaluating patterns in scholarly work: These large datasets can be challenging to integrate, particularly for those unfamiliar with the complex schemas necessary for accommodating such heterogeneous information, and those most comfortable with data mining may not be as experienced in data visualisation. Here, we present an open-source Python library that streamlines the process accessing and diagramming subsets of the Dimensions on Google BigQuery database and demonstrate its use on the freely available Dimensions COVID-19 dataset. We are optimistic that this tool will expand access to this valuable information by streamlining what would otherwise be multiple complex technical tasks, enabling more researchers to examine patterns in research focus and collaboration over time.
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET
Abstract
In this presentation, Susan Gregurick, Ph.D., Associate Director of Data Science and Director, Office of Data Science Strategy at the National Institutes of Health, will share the NIH’s vision for a modernized, integrated FAIR biomedical data ecosystem and the strategic roadmap that NIH is following to achieve this vision. Dr. Gregurick will highlight projects being implemented by team members across the NIH’s 27 institutes and centers and will ways that industry, academia, and other communities can help NIH enable a FAIR data ecosystem. Finally, she will weave in how this strategy is being leveraged to address the COVID-19 pandemic.
Presenter: Susan Gregurick, Ph.D., Associate Director of Data Science and Director, Office of Data Science Strategy at the National Institutes of Health
dkNET Webinar Information: https://dknet.org/about/webinar
PSB2014 A Vision for Biomedical ResearchPhilip Bourne
Some preliminary thoughts about my role as Associate Director for Data Science at the NIH so as to have a discussion with attendees at the Pacific Symposium on Biocomputing on Jan 4, 2014, The Big Island of Hawaii.
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECAProject
We live in an era of cloud computing. Many of the services in the life sciences are keenly planning cloud transformations, seeking to create globally distributed ecosystems of harmonised data based on standards from organisations like GA4GH. CINECA faces similar challenges, gathering cohort datasets from all over the globe, many of which are pinned in place, due to their size, legal restrictions, or other considerations. But is “bringing compute to the data” always the right choice? In this webinar, based on experiences from the Human Cell Atlas Data Coordination Platform and other projects from EMBL-EBI, we will explore the concept of “data gravity”: The idea that whilst there are forces that may hold data in one place, there are others that require it to be mobile. We’ll consider how effectively planning a cloud strategy requires consideration of the gravity of datasets, and the impact it may have on team skills required, incentives for good practice, and storage and compute costs.
The CINECA webinar series aims to discuss ways to address common challenges and share best practices in the field of cohort data analysis, as well as distribute CINECA project results. All CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions. Please note that all webinars are recorded and available for posterior viewing. CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions.
This webinar took place on 12th November 2020 and is part of the CINECA webinar series.
For previous and upcoming CINECA webinars see:
https://www.cineca-project.eu/webinars
Presentation of current challenges of upgrading the intrasturcture for access and preservation of social science research data and worklow in Slovene social science data archive
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...datacite
2013 DataCite Summer Meeting - Making Research better
DataCite. Co-sponsored by CODATA.
Thursday, 19 September 2013 at 13:00 - Friday, 20 September 2013 at 12:30
Washington, DC. National Academy of Sciences
http://datacite.eventbrite.co.uk/
Presented online as part of the NASM series in Advancing Drug Discovery see https://www.nationalacademies.org/event/40883_09-2023_advancing-drug-discovery-data-science-meets-drug-discovery
For a panel discussion at the Associate Research Libraries Spring meeting April 27, 2022, Montreal https://www.arl.org/schedule-for-spring-2022-association-meeting/
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
3 basic points when establishing a new biomedical initiative. Presented at Frontiers of Computing in Health and Society, George Mason University, September 21, 2021.
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxEduSkills OECD
Andreas Schleicher presents at the OECD webinar ‘Digital devices in schools: detrimental distraction or secret to success?’ on 27 May 2024. The presentation was based on findings from PISA 2022 results and the webinar helped launch the PISA in Focus ‘Managing screen time: How to protect and equip students against distraction’ https://www.oecd-ilibrary.org/education/managing-screen-time_7c225af4-en and the OECD Education Policy Perspective ‘Students, digital devices and success’ can be found here - https://oe.cd/il/5yV
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
Ethnobotany and Ethnopharmacology:
Ethnobotany in herbal drug evaluation,
Impact of Ethnobotany in traditional medicine,
New development in herbals,
Bio-prospecting tools for drug discovery,
Role of Ethnopharmacology in drug evaluation,
Reverse Pharmacology.
How to Create Map Views in the Odoo 17 ERPCeline George
The map views are useful for providing a geographical representation of data. They allow users to visualize and analyze the data in a more intuitive manner.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
We all have good and bad thoughts from time to time and situation to situation. We are bombarded daily with spiraling thoughts(both negative and positive) creating all-consuming feel , making us difficult to manage with associated suffering. Good thoughts are like our Mob Signal (Positive thought) amidst noise(negative thought) in the atmosphere. Negative thoughts like noise outweigh positive thoughts. These thoughts often create unwanted confusion, trouble, stress and frustration in our mind as well as chaos in our physical world. Negative thoughts are also known as “distorted thinking”.
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
BD2K Update
1. BD2K Update
Philip Bourne, PhD, FACMI
Associate Director for Data Science
Advisory Committee to the NIH Director
December 11, 2015
http://datascience.nih.gov
Slides: http://www.slideshare.net/pebourne
2. 439 participants
167 remote viewers
Breakout sessions
133 Posters
16 Demos
3 BOFs
One Year and Counting…
3. • “From the meeting, it was amply clear that NIH has the big data waterfront
well-populated. From imaging, to molecular, to clinical, to mobile, BD2K
has the A teams.” – Zak Kohane, Harvard
• "I have been involved in several national initiatives to bring advanced
technology into biomedical research. I have never seen one with such an
intense drive and uptake as the BD2K program. This stems not only from
excellent leadership and vision, but also from the immediate impact of the
centers.” - Scott Delp, Stanford
• 'BD2K has already changed the landscape of biomedical research in the
USA. The All-hands meeting captured the excitement and change in
culture that is happening across biomedical science, with the realisation
that sharing data lies at the heart of biomedical research today and that
establishing the international infrastructure to do so is critical. Great
science too!!!’ - Janet Thornton, EBI
• If people are the NIH's most valuable resource, then the BD2K centers
are successfully addressing its second most valuable resource: data. -
David Haussler, UCSC
• Amazing interest, support and excitement from the community – Peipei
Ping, UCLA
• ‘We can now let the data lead to the discoveries and are able to do things
we could not do before. Without the new scientific tools and strategies
developed as part of BD2K we would remain anchored in our
reductionistic past.’ - Art Toga, USC
4. Implementing ACD Big Data
Recommendations
DIWG Recommendations
1.Sharing data & software through
indexes
2.Advance big methods, tools &
applications
3.Expand data science training
4.Continued support throughout the
data & software lifecycle
4
BD2K Implementation
1.Implement the Commons (indices,
standards, etc.)
2.Data science research programs
(Centers, U01s, etc.)
3.Training and workforce
development programs
4.Addressing sustainability of
science, technology, and funding
mechanisms
5. BD2K FY15 Funding for Sharing &
Sustainability
FY15Funding($000)
26% 58% 16%
Commons Components ($20M)
• BioCADDIE (data discovery index prototype)
• Standards Coordinating Center contract
• Cloud Broker Model contract
• Supplements to support interoperability of NIH data repositories
• Supplements to MODs and BD2K awards to pilot Commons
6. BD2K FY15 Research Funding
FY15Funding($000)
26% 58% 16%
Data Science Research ($44.8M)
• 13 BD2K Centers awards, span scientific domains across NIH
• Targeted Software Awards on topics: data compression,
visualization, provenance, wrangling.
• Innovations Lab to develop new biomedical-data science
collaborative teams
7. BD2K FY15 Training Funding
FY15Funding($000)
26% 58% 16%
Training and Workforce Development ($11.8M)
• Training Coordination Center
• R25 awards for MOOCS, short courses, open educational resources
• T32 training programs in data science
• K01 career development awards
• R25s MOOCS and online resources to libraries to support data
management and curation
• R25 enhancing diversity in biomedical data science
16. Training Programs Initiated
FY14-15
Biomedical Science Specialists
Data Science Specialists
Courses (R25) [11 awards]
Open Educational Resource (R25s) [8 awards]
Career Development (K01) [20 awards]Training
Programs
(T32/T15)
[6 awards]
Diversity
(R25) [4]
Museum
[1]
17. • 2016 Lecture by Carlos Bustamante, Ph.D.
• Posters
• PiCo Lightening Talks
• Event for High School Students
• Workshop on Reproducible Research
• Pies
• Distinguished Lecture Series
• Frontiers in Data Science Lecture
Series
• Software carpentry
• Hackathons
18. Innovation Lab
• Description:
– 5-day mentored workshop facilitated
by KnowInnovation
– Joint initiative of NSF and NIH
• Purpose:
– To build interdisciplinary (biomedical
and data science) teams
– To develop teams’ research programs
• Outcome:
– New teams formed and competed for
funding
– Innovation lab teams had a higher
than average success rate
22. Community Engagement
In the Commons: Beacon
A beacon answers the simple question,
have you observed a genome with a given
mutation?
You can ask “Do you have a genome with
an A at position 100,000 on chr1?”
YES
23. Commons Credits Model
The CommonsThe Commons
Cloud Provider
A
Cloud Provider
B
Cloud Provider
C
Investigator
NIH
Provides credits Enables Search
Discovery Index
Uses credits in
the Commons
IndexesOption:
Direct Funding
24. BD2K FY17 Funding for Sharing &
Sustainability
Commons Components ($28M)
• Resource Indexing (data, software…)
• Standards coordination and community-based development
• Cloud Broker Model contract
• Reference data sets to the cloud
• Innovations in curation RFA
FY17Funding($000)
26% 57% 18%
25. BD2K FY17 Research Funding
Data Science Research (62.3$M)
• 13 BD2K Centers awards, span scientific domains across NIH
• Targeted Software Awards on topics: data privacy, repurposing,
applying metadata, interactive digital media
• Innovations Lab to develop new biomedical-data science
collaborative teams
• Professional-grade software support and services in the Commons
• CDE harmonization
FY17Funding($000)
26% 57% 18%
26. BD2K FY17 Training Funding
Training and Workforce Development (20.1$M)
• Training Coordination Center
• R25 awards for MOOCS, short courses, open educational resources
• T32 training programs in data science
• K01 career development awards
• R25s MOOCS and online resources to strengthen data science
curriculum in biomedical courses
• R25 enhancing diversity in biomedical data science
FY17Funding($000)
26% 57% 18%
29. Timeline Through 2021
• Advanced Tools & Applications
– Centers
– Software
– Other
• Sharing Data & Software
– Commons
– Credits
– Indexing
• Training
• Sustainability
FY 15 16 17 18 19 20 21
Annual Focus
Pilots
Reference
Data
Large-scale
Adoption
Pilots
Few
FOAs
Few
Inst.
Full
Scale
Prototypes Production
Intramural
Extramural
Eval. Plan Eval.
NLM Integration
Editor's Notes
Azumio – monitors a variety of features
Nhanes –National Health and nutritional examination survey – manual collection CDC
Gini Coeff measures inequality among values of a frequency distribution. 0 equality 1 total inequality
The Beacon project is a project to test the willingness of international sites to share genetic data in the simplest of all technical contexts.
It is defined as a simple public web service that any institution can implement as a service.
The service is designed merely to accept a query of the form "Do you have any genomes with an 'A' at position 100,735 on chromosome 3" (or similar data)
and responds with one of "Yes" or "No." A site offering this service is called a "beacon".