SlideShare a Scribd company logo
1 of 56
It’s hard to make predictions –
especially about the future.
-- attributed to Niels Bohr
Monday, July 11th, 2039
A History of
“Bioinformatics”
C. Titus Brown
Monday, July 11th, 2039
Monday, July 11th, 2039
Invited to reminisce!
 …and perhaps inform the BRAIN2050 initiative.
Note for the young: “bioinformatics” and “systems biology”
are now simply “biology”.
Monday, July 11th, 2039
The 20-teens and onwards
1. Too Much Data: The Datapocalypse
2. Great results, seen once: the reproducibility crisis.
3. Mind the gap: computation in biology.
Monday, July 11th, 2039
1. The Datapocalypse
Monday, July 11th, 2039
Too… much… data…
Between –omics, automated sensor data, and data
sharing, biology grew into a data-intensive science.
Volume, velocity, variety: the general problem.
But also!
Biology was optimized for hypothesis-driven investigation,
not data exploration!
Long arguments over “which is better”, with the people
who controlled the funding => winning.
Monday, July 11th, 2039
HTC, not HPC
For lots of data, High Throughput Computing was needed –
but compute was cheap, not throughput!
Monday, July 11th, 2039
Figure from bbc.co.uk
2. The reproducibility crisis
Monday, July 11th, 2039
Trials
Failed
The reproducibility crisis -
why??
 Well known fact among biotech that the majority of
published experiments were largely lab-specific.
 Neither career incentives nor funding were there! (In
fact, quite the contrary…)
 This slowly started to change later in the decade, as
the public caught on…
Monday, July 11th, 2039
Shift in “publication”
recognition
 Hard to believe now, but back then, people were
rewarded for the first (claimed) “observation” of an
effect.
 Two-lab rule was only instated as best practice in the
early 2020s, once reviewers started rejecting papers
unaccompanied by a replication report.
 Funding shift followed, of course.
Monday, July 11th, 2039
3. Computing & data in biology
Of the sciences, biology had always
been the weakest in terms of computing
education.
This became a complete disaster once
the data tsunami hit – labs generated
data sets they couldn’t analyze,
graduate students planned experiments
that relied on computing they couldn’t
do.
Monday, July 11th, 2039
Photo from Wikipedia
The “easy to use” tools fiasco
 Immense investment in late ‘teens in tools that were
“easy to use” – push-button data analysis, etc.
 This worked well outside of research; however, it turns
out you can’t place most data analysis in a black box.
 “Easy to use” tools embodied so many assumptions
that most results were simply invalid.
Monday, July 11th, 2039
=> Bioinformatics
“sweatshops”
 Cadre of students and low-paid employees devoted to
“service bioinformatics”
 No career path, no significant authorship…
 …but necessary for big labs to make progress!
Monday, July 11th, 2039
Things came to a head…
Monday, July 11th, 2039
www.sanantonio-urbanliving.com
The tipping point
 The well-trained students left for the data science
industry;
 More and more papers were being written by people
who didn’t understand the computing…
 …and an increasing number of them were being
rejected…
 …until the supply of reviewers ran out…
Monday, July 11th, 2039
And then… California.
Monday, July 11th, 2039 Map from Wikimedia
Bioinformaticians, revolt!
Bioinformatics reviewers essentially unionized and laid down
three rules:
1. All of the data and source code must be provided for any
paper.
2. Full methods sections and references are included in the
primary paper review.
3. No unpublished methods can be used in data analysis.
In the end, the only people that complained were companies
like MS Elsevier, because preprints.
Monday, July 11th, 2039
Replication “parties”
Open peer
review
Replication
study!
Bad?
Group I Group II Group III
Publication of
replication attempt
Monday, July 11th, 2039
A community of practice emerged around replication!
Part of a larger renaissance
for biology!
Starting in ~2020,
1. Biomedical enterprise rediscovers basic biology;
2. Rise and triumph of open science;
3. A transition to networked science;
4. Massive investment in the people;
Monday, July 11th, 2039
1. Rediscovering basic
biology
Monday, July 11th, 2039
The biomedical community backs away
from translational medicine.
 Several veterinary and agricultural animals proved to
be better model organisms for human disease than
mouse;
 Ecology and evolution provided valuable theoretical
and empirical observations for understanding human
genetics.
 Microbial interactions between environment and human
proved to be important as well; built environment,
disease reservoirs, etc.
 Cheap sequencing enabled a vast array of studies.
Monday, July 11th, 2039
2. Open science triumphs!
 The computational community knew this by 2016, but it
took a few years for the rest of biology…
A curious story!
1. Biotech pressured congresspeople into decreasing
funding for experiments, since analysis was usually
wrong and raw data was never available;
2. Funding crunch, more generally, tightened the screws
further;
3. Hypothesis driven labs couldn’t compete…
Monday, July 11th, 2039
…hypothesis-driven lab science joined
with discovery.
 Eventually, funders mandated
data availability;
 Labs that made use of available
data had a dramatic edge in
hypothesis-driven
experimentation;
 Data-driven modeling and
model-driven data interpretation
blossomed!
Monday, July 11th, 2039
Image from emory.edu
3. A transition to networked
science
Monday, July 11th, 2039
Universities collapsed!
 So all the senior professors and administrators retired…
 Massive brain drain… 
 … enabled a massive increase in creativity in the research
enterprise! 
Collaboration tools, data sharing, distributed team science…
Monday, July 11th, 2039
“Walled garden” model
Monday, July 11th, 2039
Pioneered by Sage Bionetworks in ~2010s
 Data collection done by small consortia;
 Data made available to all, but publication in step.
Model is of course obsolete nowadays, but was quite effective back then.
4. Massive investment in
people
The NIH finally invested heavily in training.
Among other things:
Data Carpentry
Model Carpentry
Monday, July 11th, 2039
(We won! Yay!)
There are still problems, of
course!
 What do most genes do? Functional annotations are
still poor. Some approaches --
 Biogeochemistry
 Synthetic biology
 Career paths for experimental biologists are very
uncertain.
 “Glam data”
 Cancer is cured, but many complex diseases –
especially neurodegenerative ones – remain poorly
understood.
Monday, July 11th, 2039
BRAIN2050
 Ambitious 10-year proposal to “understand the brain”
by 2050.
 Focus on neurodegenerative diseases, regeneration,
and a mechanistic understanding of intelligence.
 What mistakes can they avoid, with the benefit of
hindsight?
Monday, July 11th, 2039
Correlation is not causation
 You’d think we’d have learned this by now!?
 Original MIND project 25 years ago failed for this
reason. (“Record ALL the neurons”)
Monday, July 11th, 2039 Image from Wikipedia
(Computational) modeling is
critical
 Can we develop models that embody hypotheses that
we can then “test” against the data?
Holistic multidisciplinary research.
(Brain community has always been better off here…)
Monday, July 11th, 2039
Focus less on reproducibility
 A strict requirement for independent replication is
strangling us!
 Completely independent replication is a strong
requirement; understandable, given disasters of the
past, but also slow.
 Can we compromise?
Monday, July 11th, 2039
“Replication debt”
 Can we borrow idea of “technical debt” from software
engineering?
 Semi-independent replication after initial exploratory
phase, followed by articulation of protocols and
independent replication.
Monday, July 11th, 2039
Image from blog.crisp.se
“Replication debt”
 Semi-independent replication after initial exploratory
phase, followed by articulation of protocols and
independent replication.
 Public acknowledgement of debt is important.
Monday, July 11th, 2039
Image from blog.crisp.se
Invest in infrastructure for
collaboration and sharing
 Data sharing is a given
 But existing tools still merely support rather than drive
science with data sharing!
 Push for collaborative process from the outset.
Monday, July 11th, 2039
Can we help drive collaboration with
technology?
Monday, July 11th, 2039
Gather data
Deposit data
Compare
against other
data sets
Notify
Notify
See e.g. pebourne.wordpress.com/2014/01/04/universities-as-big-data/
Tool up! But evaluate,
compare, understand.
 Having a robust and competitive software ecosystem is
important for innovation and creativity.
 Available, open, reusable, remixable: all critical!
 Benchmarks are not always useful; understanding
always is.
Monday, July 11th, 2039
Build commercial software only when
basics are understood
Monday, July 11th, 2039
Research development
Easy-to-use
commercial software
"Popular" protocols
Invest in training as first-class
research citizen!
Monday, July 11th, 2039
Undergraduates
K-12 students
Graduate students
The high school students of yesterday are
the research scientists of tomorrow.
It’s the network, dummies.
 Single molecule full genome sequences did not provide
understanding.
 Reductionist studies of gene function did not provide
understanding.
 Neither will high resolution ensemble neuronal sampling.
 Our main obstacle in understanding aging has been that it seems
to be systemic, just like neurogeneration.
Monday, July 11th, 2039
Concluding thoughts (I)
 Many things the BRAIN2050 field can do to invest in its
own future and accelerate progress!
 Bitter lessons learned from decades of mistakes in
other fields; maybe we can do better?
Monday, July 11th, 2039
Monday, July 11th, 2039
All right…
 Future talk over 
 I thought I’d use this as a foil to highlight issues that I
think are important for the future.
But:
We have to get used to the idea that radical
change keeps happening ... even after 1997.
First published by Broadway Books on May 5, 1997. Via Erich Schwarz
We have to get used to the idea that radical
change keeps happening ... even after 1997.
"Among the pessimists, molecular biologist Gunther Stent
suggests that science is reaching a point of incremental,
diminishing returns as it comes up against the limits of
knowledge..." --review by Publishers Weekly
First published by Broadway Books on May 5, 1997. Via Erich Schwarz
Robert Heinlein's four curves of predicted human
progress (described in 1950)
Ref.: Heinlein, R.A. (1950), "Where To?".
"The solid curve ...
represents many things -
- use of power, speed of
transport, numbers of
scientific and technical
workers, advance in
communication, average
miles traveled per
person per year,
advances in
mathematics ... Call it
the curve of human
achievement."
Via Erich Schwarz
Robert Heinlein's four curves of predicted human
progress (described in 1950)
"Despite everything,
there is a stubborn
'common sense'
tendency to project it
along dotted line
number (1) like the
patent office official of a
hundred years back
who quit his job
'because everything
had already been
invented'."
Ref.: Heinlein, R.A. (1950), "Where To?". Via Erich Schwarz
Robert Heinlein's four curves of predicted human
progress (described in 1950)
"Even those who don't
expect a slowing up at
once tend to expect us
to reach a point of
diminishing returns --
dotted line number (2)."
Ref.: Heinlein, R.A. (1950), "Where To?". Via Erich Schwarz
Robert Heinlein's four curves of predicted human
progress (described in 1950)
"Very daring minds are
willing to predict that we
will continue our
present rate of progress
-- dotted line number
(3) -- a tangent."
Ref.: Heinlein, R.A. (1950), "Where To?". Via Erich Schwarz
Robert Heinlein's four curves of predicted human
progress (described in 1950)
Ref.: Heinlein, R.A. (1950), "Where To?".
"But the proper way to
project the curve is
dotted line number (4),
because there is no
reason, mathematical,
scientific, or historical,
to expect that curve to
flatten out... The correct
projection ... is for the
curve to go on up
indefinitely with
increasing steepness..."
Via Erich Schwarz
Conclusion --
 I certainly don’t know where we’re headed; no one else does
either.
 We must invest in people and process; we must help figure
out what the right process is and then provide career
incentives for people to do things that way.
This community should be leading the way:
Bioinformatics Open Source Conference
(Reminder: we will win.)
But: economics matter
50-million mark note. Weimar Germany, 1923.
Economics matter
Ref.: U.S. Government Accountability Office, Citizen's Guide of 2010.
Prospects for U.S. public funding of science
Ref.: U.S. Government Accountability Office, Citizen's Guide of 2010.
Public support for science
matters!
 Data sharing, openness => maximizing return.
 Must figure out how to align career and funding
incentives.
 We are currently doing a horrible job of this…
 …I’m looking forward to Phil Bourne’s talk :)
Monday, July 11th, 2039
Thanks!
 Discussions with Phil Bourne (NIH), Erich Schwarz (Caltech
& Cornell), Katherine Mejia-Guerra (OSU) and Jeffrey
Campbell (OSU).
 All of this will be (is already?) posted online.
 “The next 10 years of quant bio” by Mike Schatz
 …with apologies to Gary Bernhardt
(Birth & Death of JavaScript – go watch it!)
Monday, July 11th, 2039

More Related Content

What's hot

RT @twitter ist die Uni auf Facebook? #DigitalUniversity #l3t
RT @twitter ist die Uni auf Facebook? #DigitalUniversity  #l3tRT @twitter ist die Uni auf Facebook? #DigitalUniversity  #l3t
RT @twitter ist die Uni auf Facebook? #DigitalUniversity #l3tMartin Ebner
 
Teaching Kids to Code
Teaching Kids to CodeTeaching Kids to Code
Teaching Kids to CodeGrahamAttwell
 
Technological advance in a democratic society
Technological advance in a democratic societyTechnological advance in a democratic society
Technological advance in a democratic societySarah Young
 
The Creative Animal Goes Online (Part A)
The Creative Animal Goes Online (Part A)The Creative Animal Goes Online (Part A)
The Creative Animal Goes Online (Part A)Mitch Goodwin
 
2006, What's Next in Ed Tech: 5th World, National School Boards Association b...
2006, What's Next in Ed Tech: 5th World, National School Boards Association b...2006, What's Next in Ed Tech: 5th World, National School Boards Association b...
2006, What's Next in Ed Tech: 5th World, National School Boards Association b...Jim "Brodie" Brazell
 
Humanity Vs Technology - A "Quote-Unquote" Debate #edcmooc
Humanity Vs Technology - A "Quote-Unquote" Debate #edcmoocHumanity Vs Technology - A "Quote-Unquote" Debate #edcmooc
Humanity Vs Technology - A "Quote-Unquote" Debate #edcmoocRajiv Bajaj
 
40 Years of Education (Education so far)
40 Years of Education (Education so far)40 Years of Education (Education so far)
40 Years of Education (Education so far)Komal Gandhi
 
Information Overload
Information OverloadInformation Overload
Information OverloadMiro Pusnik
 
Technoanthropology 1.0
Technoanthropology 1.0Technoanthropology 1.0
Technoanthropology 1.0Artur Serra
 
Linq 2013 Keynote Rory Mc Greal
Linq 2013 Keynote Rory Mc GrealLinq 2013 Keynote Rory Mc Greal
Linq 2013 Keynote Rory Mc GrealLINQ_Conference
 
Brave New World Technology
Brave New World TechnologyBrave New World Technology
Brave New World Technologyworldofpooh
 
TSEM - Cooper Spring 2012
TSEM - Cooper Spring 2012TSEM - Cooper Spring 2012
TSEM - Cooper Spring 2012Laksamee Putnam
 
San Jose October 2009l
San  Jose  October 2009lSan  Jose  October 2009l
San Jose October 2009ldigitalschema
 

What's hot (17)

RT @twitter ist die Uni auf Facebook? #DigitalUniversity #l3t
RT @twitter ist die Uni auf Facebook? #DigitalUniversity  #l3tRT @twitter ist die Uni auf Facebook? #DigitalUniversity  #l3t
RT @twitter ist die Uni auf Facebook? #DigitalUniversity #l3t
 
Teaching Kids to Code
Teaching Kids to CodeTeaching Kids to Code
Teaching Kids to Code
 
Technological advance in a democratic society
Technological advance in a democratic societyTechnological advance in a democratic society
Technological advance in a democratic society
 
NSF SoCS 2011
NSF SoCS 2011NSF SoCS 2011
NSF SoCS 2011
 
The Creative Animal Goes Online (Part A)
The Creative Animal Goes Online (Part A)The Creative Animal Goes Online (Part A)
The Creative Animal Goes Online (Part A)
 
2006, What's Next in Ed Tech: 5th World, National School Boards Association b...
2006, What's Next in Ed Tech: 5th World, National School Boards Association b...2006, What's Next in Ed Tech: 5th World, National School Boards Association b...
2006, What's Next in Ed Tech: 5th World, National School Boards Association b...
 
Humanity Vs Technology - A "Quote-Unquote" Debate #edcmooc
Humanity Vs Technology - A "Quote-Unquote" Debate #edcmoocHumanity Vs Technology - A "Quote-Unquote" Debate #edcmooc
Humanity Vs Technology - A "Quote-Unquote" Debate #edcmooc
 
40 Years of Education (Education so far)
40 Years of Education (Education so far)40 Years of Education (Education so far)
40 Years of Education (Education so far)
 
Internet
InternetInternet
Internet
 
Information Overload
Information OverloadInformation Overload
Information Overload
 
Transcript of Zurkowski video at LILAC 2014
Transcript of Zurkowski video at LILAC 2014Transcript of Zurkowski video at LILAC 2014
Transcript of Zurkowski video at LILAC 2014
 
Technoanthropology 1.0
Technoanthropology 1.0Technoanthropology 1.0
Technoanthropology 1.0
 
Linq 2013 Keynote Rory Mc Greal
Linq 2013 Keynote Rory Mc GrealLinq 2013 Keynote Rory Mc Greal
Linq 2013 Keynote Rory Mc Greal
 
Brave New World Technology
Brave New World TechnologyBrave New World Technology
Brave New World Technology
 
TSEM - Cooper Spring 2012
TSEM - Cooper Spring 2012TSEM - Cooper Spring 2012
TSEM - Cooper Spring 2012
 
Citizen Science in the Introductory Classroom
Citizen Science in the Introductory ClassroomCitizen Science in the Introductory Classroom
Citizen Science in the Introductory Classroom
 
San Jose October 2009l
San  Jose  October 2009lSan  Jose  October 2009l
San Jose October 2009l
 

Viewers also liked

databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformaticsnadeem akhter
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformaticsbiinoida
 
Effective management of mental distress in the workplace by Helen Lockett
Effective management of mental distress in the workplace by Helen LockettEffective management of mental distress in the workplace by Helen Lockett
Effective management of mental distress in the workplace by Helen LockettAcas Comms
 
Rachel Wolfe Photography Features
Rachel Wolfe Photography FeaturesRachel Wolfe Photography Features
Rachel Wolfe Photography FeaturesRachel Wolfe
 
Rose :: Properties
Rose :: PropertiesRose :: Properties
Rose :: Propertiesrejita
 
One Step Online School Simplified
One Step Online School SimplifiedOne Step Online School Simplified
One Step Online School SimplifiedChineseTeachers.com
 
Enroll And Survey
Enroll And SurveyEnroll And Survey
Enroll And Surveybsrmailbox
 
Intellisoft ipad iphone Info March13
Intellisoft ipad iphone Info March13Intellisoft ipad iphone Info March13
Intellisoft ipad iphone Info March13Sham Yemul
 
Ondernemen kwf 26 nov 2012
Ondernemen kwf 26 nov 2012Ondernemen kwf 26 nov 2012
Ondernemen kwf 26 nov 2012Piet van Vugt
 
Trainings Evaluation Reports WPS Phase-II Bhakkar
Trainings Evaluation Reports WPS Phase-II BhakkarTrainings Evaluation Reports WPS Phase-II Bhakkar
Trainings Evaluation Reports WPS Phase-II BhakkarZafar Ahmad
 
2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotesc.titus.brown
 
33 Lead Generation Tips in 33 Minutes
33 Lead Generation Tips in 33 Minutes33 Lead Generation Tips in 33 Minutes
33 Lead Generation Tips in 33 MinutesAlex Rascanu
 
For Legal Professionals
For Legal ProfessionalsFor Legal Professionals
For Legal ProfessionalsEdmund_Wheeler
 
Understanding Facebook Places
Understanding Facebook PlacesUnderstanding Facebook Places
Understanding Facebook PlacesMarco Pacifico
 
Twitter and Webservice
Twitter and WebserviceTwitter and Webservice
Twitter and Webservicezerofe
 
AMD-V™ Nested Paging
AMD-V™ Nested PagingAMD-V™ Nested Paging
AMD-V™ Nested PagingJames Price
 

Viewers also liked (20)

databases in bioinformatics
databases in bioinformaticsdatabases in bioinformatics
databases in bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Ten Common Wage & Hour Blunders
Ten Common Wage & Hour BlundersTen Common Wage & Hour Blunders
Ten Common Wage & Hour Blunders
 
EO for Bethany
EO for BethanyEO for Bethany
EO for Bethany
 
Effective management of mental distress in the workplace by Helen Lockett
Effective management of mental distress in the workplace by Helen LockettEffective management of mental distress in the workplace by Helen Lockett
Effective management of mental distress in the workplace by Helen Lockett
 
Rachel Wolfe Photography Features
Rachel Wolfe Photography FeaturesRachel Wolfe Photography Features
Rachel Wolfe Photography Features
 
Rose :: Properties
Rose :: PropertiesRose :: Properties
Rose :: Properties
 
One Step Online School Simplified
One Step Online School SimplifiedOne Step Online School Simplified
One Step Online School Simplified
 
Enroll And Survey
Enroll And SurveyEnroll And Survey
Enroll And Survey
 
Intellisoft ipad iphone Info March13
Intellisoft ipad iphone Info March13Intellisoft ipad iphone Info March13
Intellisoft ipad iphone Info March13
 
Ondernemen kwf 26 nov 2012
Ondernemen kwf 26 nov 2012Ondernemen kwf 26 nov 2012
Ondernemen kwf 26 nov 2012
 
2013 gbmf-mmi-ci
2013 gbmf-mmi-ci2013 gbmf-mmi-ci
2013 gbmf-mmi-ci
 
Trainings Evaluation Reports WPS Phase-II Bhakkar
Trainings Evaluation Reports WPS Phase-II BhakkarTrainings Evaluation Reports WPS Phase-II Bhakkar
Trainings Evaluation Reports WPS Phase-II Bhakkar
 
2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes2013 ucdavis-smbe-eukaryotes
2013 ucdavis-smbe-eukaryotes
 
33 Lead Generation Tips in 33 Minutes
33 Lead Generation Tips in 33 Minutes33 Lead Generation Tips in 33 Minutes
33 Lead Generation Tips in 33 Minutes
 
For Legal Professionals
For Legal ProfessionalsFor Legal Professionals
For Legal Professionals
 
Understanding Facebook Places
Understanding Facebook PlacesUnderstanding Facebook Places
Understanding Facebook Places
 
Hohmann liber2006
Hohmann liber2006Hohmann liber2006
Hohmann liber2006
 
Twitter and Webservice
Twitter and WebserviceTwitter and Webservice
Twitter and Webservice
 
AMD-V™ Nested Paging
AMD-V™ Nested PagingAMD-V™ Nested Paging
AMD-V™ Nested Paging
 

Similar to 2014 bosc-keynote

Towards a scientific data policy
Towards a scientific data policy Towards a scientific data policy
Towards a scientific data policy Roberto de Pinho
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Darlene Cavalier
 
How to follow actors through their traces. Exploiting digital traceability
How to follow actors through their traces. Exploiting digital traceabilityHow to follow actors through their traces. Exploiting digital traceability
How to follow actors through their traces. Exploiting digital traceabilityINRIA - ENS Lyon
 
The Future of Life Sciences 2013 for Max Planck Institute
The Future of Life Sciences 2013 for Max Planck InstituteThe Future of Life Sciences 2013 for Max Planck Institute
The Future of Life Sciences 2013 for Max Planck InstituteMelanie Swan
 
Jim Gray Award Lecture
Jim Gray Award LectureJim Gray Award Lecture
Jim Gray Award LecturePhilip Bourne
 
"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)npinto
 
Talking Tech - the art and science of communicating complex ideas (Bristech2...
Talking Tech  - the art and science of communicating complex ideas (Bristech2...Talking Tech  - the art and science of communicating complex ideas (Bristech2...
Talking Tech - the art and science of communicating complex ideas (Bristech2...Cecilia Thirlway
 
Technology will destroy our planet
Technology will destroy our planetTechnology will destroy our planet
Technology will destroy our planetbailsy94
 
Inspirational Women in STEM and Grand Challenges for you to Solve
Inspirational Women in STEM and Grand Challenges for you to SolveInspirational Women in STEM and Grand Challenges for you to Solve
Inspirational Women in STEM and Grand Challenges for you to SolveBarry O'Sullivan
 
Aprendizaje invisible alfabetismos para un mundo plano
Aprendizaje invisible alfabetismos para un mundo planoAprendizaje invisible alfabetismos para un mundo plano
Aprendizaje invisible alfabetismos para un mundo planoFundación Forem Galicia
 
Roger hoerl say award presentation 2013
Roger hoerl say award presentation 2013Roger hoerl say award presentation 2013
Roger hoerl say award presentation 2013Roger Hoerl
 
Ben Goertzel - Singularity Summit Australia talk in 2011
Ben Goertzel - Singularity Summit Australia talk in 2011Ben Goertzel - Singularity Summit Australia talk in 2011
Ben Goertzel - Singularity Summit Australia talk in 2011Adam Ford
 
Don‘t be such a scientist annotated
Don‘t be such a scientist annotatedDon‘t be such a scientist annotated
Don‘t be such a scientist annotatedSimon Schneider
 
Data Responsibly: The next decade of data science
Data Responsibly: The next decade of data scienceData Responsibly: The next decade of data science
Data Responsibly: The next decade of data scienceUniversity of Washington
 
Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global EcosystemPhilip Bourne
 
What Should We Be Arguing About?
What Should We Be Arguing About? What Should We Be Arguing About?
What Should We Be Arguing About? Drew Endy
 
NYC Data Science Meetup: Computational Social Science
NYC Data Science Meetup: Computational Social ScienceNYC Data Science Meetup: Computational Social Science
NYC Data Science Meetup: Computational Social Sciencejakehofman
 

Similar to 2014 bosc-keynote (20)

Towards a scientific data policy
Towards a scientific data policy Towards a scientific data policy
Towards a scientific data policy
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
Citizen Science overview for ASU HSD598 graduate course, "Citizen Science"
 
How to follow actors through their traces. Exploiting digital traceability
How to follow actors through their traces. Exploiting digital traceabilityHow to follow actors through their traces. Exploiting digital traceability
How to follow actors through their traces. Exploiting digital traceability
 
The Future of Life Sciences 2013 for Max Planck Institute
The Future of Life Sciences 2013 for Max Planck InstituteThe Future of Life Sciences 2013 for Max Planck Institute
The Future of Life Sciences 2013 for Max Planck Institute
 
Jim Gray Award Lecture
Jim Gray Award LectureJim Gray Award Lecture
Jim Gray Award Lecture
 
"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)"AI" for Blockchain Security (Case Study: Cosmos)
"AI" for Blockchain Security (Case Study: Cosmos)
 
Talking Tech - the art and science of communicating complex ideas (Bristech2...
Talking Tech  - the art and science of communicating complex ideas (Bristech2...Talking Tech  - the art and science of communicating complex ideas (Bristech2...
Talking Tech - the art and science of communicating complex ideas (Bristech2...
 
More information, less knowledge
More information, less knowledgeMore information, less knowledge
More information, less knowledge
 
Technology will destroy our planet
Technology will destroy our planetTechnology will destroy our planet
Technology will destroy our planet
 
Inspirational Women in STEM and Grand Challenges for you to Solve
Inspirational Women in STEM and Grand Challenges for you to SolveInspirational Women in STEM and Grand Challenges for you to Solve
Inspirational Women in STEM and Grand Challenges for you to Solve
 
Aprendizaje invisible alfabetismos para un mundo plano
Aprendizaje invisible alfabetismos para un mundo planoAprendizaje invisible alfabetismos para un mundo plano
Aprendizaje invisible alfabetismos para un mundo plano
 
Roger hoerl say award presentation 2013
Roger hoerl say award presentation 2013Roger hoerl say award presentation 2013
Roger hoerl say award presentation 2013
 
Ben Goertzel - Singularity Summit Australia talk in 2011
Ben Goertzel - Singularity Summit Australia talk in 2011Ben Goertzel - Singularity Summit Australia talk in 2011
Ben Goertzel - Singularity Summit Australia talk in 2011
 
Don‘t be such a scientist annotated
Don‘t be such a scientist annotatedDon‘t be such a scientist annotated
Don‘t be such a scientist annotated
 
Data Responsibly: The next decade of data science
Data Responsibly: The next decade of data scienceData Responsibly: The next decade of data science
Data Responsibly: The next decade of data science
 
Better Data for a Better World
Better Data for a Better WorldBetter Data for a Better World
Better Data for a Better World
 
Open Data in a Global Ecosystem
Open Data in a Global EcosystemOpen Data in a Global Ecosystem
Open Data in a Global Ecosystem
 
What Should We Be Arguing About?
What Should We Be Arguing About? What Should We Be Arguing About?
What Should We Be Arguing About?
 
NYC Data Science Meetup: Computational Social Science
NYC Data Science Meetup: Computational Social ScienceNYC Data Science Meetup: Computational Social Science
NYC Data Science Meetup: Computational Social Science
 

More from c.titus.brown

More from c.titus.brown (20)

2016 bergen-sars
2016 bergen-sars2016 bergen-sars
2016 bergen-sars
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
2016 davis-biotech
2016 davis-biotech2016 davis-biotech
2016 davis-biotech
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 
2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial2015 beacon-metagenome-tutorial
2015 beacon-metagenome-tutorial
 
2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynote
 
2015 msu-code-review
2015 msu-code-review2015 msu-code-review
2015 msu-code-review
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
2015 mcgill-talk
2015 mcgill-talk2015 mcgill-talk
2015 mcgill-talk
 
2015 pycon-talk
2015 pycon-talk2015 pycon-talk
2015 pycon-talk
 
2015 opencon-webcast
2015 opencon-webcast2015 opencon-webcast
2015 opencon-webcast
 
2015 vancouver-vanbug
2015 vancouver-vanbug2015 vancouver-vanbug
2015 vancouver-vanbug
 
2015 osu-metagenome
2015 osu-metagenome2015 osu-metagenome
2015 osu-metagenome
 
2015 ohsu-metagenome
2015 ohsu-metagenome2015 ohsu-metagenome
2015 ohsu-metagenome
 
2015 balti-and-bioinformatics
2015 balti-and-bioinformatics2015 balti-and-bioinformatics
2015 balti-and-bioinformatics
 
2015 pag-chicken
2015 pag-chicken2015 pag-chicken
2015 pag-chicken
 
2015 pag-metagenome
2015 pag-metagenome2015 pag-metagenome
2015 pag-metagenome
 
2014 nyu-bio-talk
2014 nyu-bio-talk2014 nyu-bio-talk
2014 nyu-bio-talk
 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
 
2014 anu-canberra-streaming
2014 anu-canberra-streaming2014 anu-canberra-streaming
2014 anu-canberra-streaming
 

Recently uploaded

Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzohaibmir069
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 

Recently uploaded (20)

Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistan
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 

2014 bosc-keynote

  • 1. It’s hard to make predictions – especially about the future. -- attributed to Niels Bohr Monday, July 11th, 2039
  • 2. A History of “Bioinformatics” C. Titus Brown Monday, July 11th, 2039 Monday, July 11th, 2039
  • 3. Invited to reminisce!  …and perhaps inform the BRAIN2050 initiative. Note for the young: “bioinformatics” and “systems biology” are now simply “biology”. Monday, July 11th, 2039
  • 4. The 20-teens and onwards 1. Too Much Data: The Datapocalypse 2. Great results, seen once: the reproducibility crisis. 3. Mind the gap: computation in biology. Monday, July 11th, 2039
  • 6. Too… much… data… Between –omics, automated sensor data, and data sharing, biology grew into a data-intensive science. Volume, velocity, variety: the general problem. But also! Biology was optimized for hypothesis-driven investigation, not data exploration! Long arguments over “which is better”, with the people who controlled the funding => winning. Monday, July 11th, 2039
  • 7. HTC, not HPC For lots of data, High Throughput Computing was needed – but compute was cheap, not throughput! Monday, July 11th, 2039 Figure from bbc.co.uk
  • 8. 2. The reproducibility crisis Monday, July 11th, 2039 Trials Failed
  • 9. The reproducibility crisis - why??  Well known fact among biotech that the majority of published experiments were largely lab-specific.  Neither career incentives nor funding were there! (In fact, quite the contrary…)  This slowly started to change later in the decade, as the public caught on… Monday, July 11th, 2039
  • 10. Shift in “publication” recognition  Hard to believe now, but back then, people were rewarded for the first (claimed) “observation” of an effect.  Two-lab rule was only instated as best practice in the early 2020s, once reviewers started rejecting papers unaccompanied by a replication report.  Funding shift followed, of course. Monday, July 11th, 2039
  • 11. 3. Computing & data in biology Of the sciences, biology had always been the weakest in terms of computing education. This became a complete disaster once the data tsunami hit – labs generated data sets they couldn’t analyze, graduate students planned experiments that relied on computing they couldn’t do. Monday, July 11th, 2039 Photo from Wikipedia
  • 12. The “easy to use” tools fiasco  Immense investment in late ‘teens in tools that were “easy to use” – push-button data analysis, etc.  This worked well outside of research; however, it turns out you can’t place most data analysis in a black box.  “Easy to use” tools embodied so many assumptions that most results were simply invalid. Monday, July 11th, 2039
  • 13. => Bioinformatics “sweatshops”  Cadre of students and low-paid employees devoted to “service bioinformatics”  No career path, no significant authorship…  …but necessary for big labs to make progress! Monday, July 11th, 2039
  • 14. Things came to a head… Monday, July 11th, 2039 www.sanantonio-urbanliving.com
  • 15. The tipping point  The well-trained students left for the data science industry;  More and more papers were being written by people who didn’t understand the computing…  …and an increasing number of them were being rejected…  …until the supply of reviewers ran out… Monday, July 11th, 2039
  • 16. And then… California. Monday, July 11th, 2039 Map from Wikimedia
  • 17. Bioinformaticians, revolt! Bioinformatics reviewers essentially unionized and laid down three rules: 1. All of the data and source code must be provided for any paper. 2. Full methods sections and references are included in the primary paper review. 3. No unpublished methods can be used in data analysis. In the end, the only people that complained were companies like MS Elsevier, because preprints. Monday, July 11th, 2039
  • 18. Replication “parties” Open peer review Replication study! Bad? Group I Group II Group III Publication of replication attempt Monday, July 11th, 2039 A community of practice emerged around replication!
  • 19. Part of a larger renaissance for biology! Starting in ~2020, 1. Biomedical enterprise rediscovers basic biology; 2. Rise and triumph of open science; 3. A transition to networked science; 4. Massive investment in the people; Monday, July 11th, 2039
  • 21. The biomedical community backs away from translational medicine.  Several veterinary and agricultural animals proved to be better model organisms for human disease than mouse;  Ecology and evolution provided valuable theoretical and empirical observations for understanding human genetics.  Microbial interactions between environment and human proved to be important as well; built environment, disease reservoirs, etc.  Cheap sequencing enabled a vast array of studies. Monday, July 11th, 2039
  • 22. 2. Open science triumphs!  The computational community knew this by 2016, but it took a few years for the rest of biology… A curious story! 1. Biotech pressured congresspeople into decreasing funding for experiments, since analysis was usually wrong and raw data was never available; 2. Funding crunch, more generally, tightened the screws further; 3. Hypothesis driven labs couldn’t compete… Monday, July 11th, 2039
  • 23. …hypothesis-driven lab science joined with discovery.  Eventually, funders mandated data availability;  Labs that made use of available data had a dramatic edge in hypothesis-driven experimentation;  Data-driven modeling and model-driven data interpretation blossomed! Monday, July 11th, 2039 Image from emory.edu
  • 24. 3. A transition to networked science Monday, July 11th, 2039
  • 25. Universities collapsed!  So all the senior professors and administrators retired…  Massive brain drain…   … enabled a massive increase in creativity in the research enterprise!  Collaboration tools, data sharing, distributed team science… Monday, July 11th, 2039
  • 26. “Walled garden” model Monday, July 11th, 2039 Pioneered by Sage Bionetworks in ~2010s  Data collection done by small consortia;  Data made available to all, but publication in step. Model is of course obsolete nowadays, but was quite effective back then.
  • 27. 4. Massive investment in people The NIH finally invested heavily in training. Among other things: Data Carpentry Model Carpentry Monday, July 11th, 2039 (We won! Yay!)
  • 28. There are still problems, of course!  What do most genes do? Functional annotations are still poor. Some approaches --  Biogeochemistry  Synthetic biology  Career paths for experimental biologists are very uncertain.  “Glam data”  Cancer is cured, but many complex diseases – especially neurodegenerative ones – remain poorly understood. Monday, July 11th, 2039
  • 29. BRAIN2050  Ambitious 10-year proposal to “understand the brain” by 2050.  Focus on neurodegenerative diseases, regeneration, and a mechanistic understanding of intelligence.  What mistakes can they avoid, with the benefit of hindsight? Monday, July 11th, 2039
  • 30. Correlation is not causation  You’d think we’d have learned this by now!?  Original MIND project 25 years ago failed for this reason. (“Record ALL the neurons”) Monday, July 11th, 2039 Image from Wikipedia
  • 31. (Computational) modeling is critical  Can we develop models that embody hypotheses that we can then “test” against the data? Holistic multidisciplinary research. (Brain community has always been better off here…) Monday, July 11th, 2039
  • 32. Focus less on reproducibility  A strict requirement for independent replication is strangling us!  Completely independent replication is a strong requirement; understandable, given disasters of the past, but also slow.  Can we compromise? Monday, July 11th, 2039
  • 33. “Replication debt”  Can we borrow idea of “technical debt” from software engineering?  Semi-independent replication after initial exploratory phase, followed by articulation of protocols and independent replication. Monday, July 11th, 2039 Image from blog.crisp.se
  • 34. “Replication debt”  Semi-independent replication after initial exploratory phase, followed by articulation of protocols and independent replication.  Public acknowledgement of debt is important. Monday, July 11th, 2039 Image from blog.crisp.se
  • 35. Invest in infrastructure for collaboration and sharing  Data sharing is a given  But existing tools still merely support rather than drive science with data sharing!  Push for collaborative process from the outset. Monday, July 11th, 2039
  • 36. Can we help drive collaboration with technology? Monday, July 11th, 2039 Gather data Deposit data Compare against other data sets Notify Notify See e.g. pebourne.wordpress.com/2014/01/04/universities-as-big-data/
  • 37. Tool up! But evaluate, compare, understand.  Having a robust and competitive software ecosystem is important for innovation and creativity.  Available, open, reusable, remixable: all critical!  Benchmarks are not always useful; understanding always is. Monday, July 11th, 2039
  • 38. Build commercial software only when basics are understood Monday, July 11th, 2039 Research development Easy-to-use commercial software "Popular" protocols
  • 39. Invest in training as first-class research citizen! Monday, July 11th, 2039 Undergraduates K-12 students Graduate students The high school students of yesterday are the research scientists of tomorrow.
  • 40. It’s the network, dummies.  Single molecule full genome sequences did not provide understanding.  Reductionist studies of gene function did not provide understanding.  Neither will high resolution ensemble neuronal sampling.  Our main obstacle in understanding aging has been that it seems to be systemic, just like neurogeneration. Monday, July 11th, 2039
  • 41. Concluding thoughts (I)  Many things the BRAIN2050 field can do to invest in its own future and accelerate progress!  Bitter lessons learned from decades of mistakes in other fields; maybe we can do better? Monday, July 11th, 2039
  • 43. All right…  Future talk over   I thought I’d use this as a foil to highlight issues that I think are important for the future. But:
  • 44. We have to get used to the idea that radical change keeps happening ... even after 1997. First published by Broadway Books on May 5, 1997. Via Erich Schwarz
  • 45. We have to get used to the idea that radical change keeps happening ... even after 1997. "Among the pessimists, molecular biologist Gunther Stent suggests that science is reaching a point of incremental, diminishing returns as it comes up against the limits of knowledge..." --review by Publishers Weekly First published by Broadway Books on May 5, 1997. Via Erich Schwarz
  • 46. Robert Heinlein's four curves of predicted human progress (described in 1950) Ref.: Heinlein, R.A. (1950), "Where To?". "The solid curve ... represents many things - - use of power, speed of transport, numbers of scientific and technical workers, advance in communication, average miles traveled per person per year, advances in mathematics ... Call it the curve of human achievement." Via Erich Schwarz
  • 47. Robert Heinlein's four curves of predicted human progress (described in 1950) "Despite everything, there is a stubborn 'common sense' tendency to project it along dotted line number (1) like the patent office official of a hundred years back who quit his job 'because everything had already been invented'." Ref.: Heinlein, R.A. (1950), "Where To?". Via Erich Schwarz
  • 48. Robert Heinlein's four curves of predicted human progress (described in 1950) "Even those who don't expect a slowing up at once tend to expect us to reach a point of diminishing returns -- dotted line number (2)." Ref.: Heinlein, R.A. (1950), "Where To?". Via Erich Schwarz
  • 49. Robert Heinlein's four curves of predicted human progress (described in 1950) "Very daring minds are willing to predict that we will continue our present rate of progress -- dotted line number (3) -- a tangent." Ref.: Heinlein, R.A. (1950), "Where To?". Via Erich Schwarz
  • 50. Robert Heinlein's four curves of predicted human progress (described in 1950) Ref.: Heinlein, R.A. (1950), "Where To?". "But the proper way to project the curve is dotted line number (4), because there is no reason, mathematical, scientific, or historical, to expect that curve to flatten out... The correct projection ... is for the curve to go on up indefinitely with increasing steepness..." Via Erich Schwarz
  • 51. Conclusion --  I certainly don’t know where we’re headed; no one else does either.  We must invest in people and process; we must help figure out what the right process is and then provide career incentives for people to do things that way. This community should be leading the way: Bioinformatics Open Source Conference (Reminder: we will win.)
  • 52. But: economics matter 50-million mark note. Weimar Germany, 1923.
  • 53. Economics matter Ref.: U.S. Government Accountability Office, Citizen's Guide of 2010.
  • 54. Prospects for U.S. public funding of science Ref.: U.S. Government Accountability Office, Citizen's Guide of 2010.
  • 55. Public support for science matters!  Data sharing, openness => maximizing return.  Must figure out how to align career and funding incentives.  We are currently doing a horrible job of this…  …I’m looking forward to Phil Bourne’s talk :) Monday, July 11th, 2039
  • 56. Thanks!  Discussions with Phil Bourne (NIH), Erich Schwarz (Caltech & Cornell), Katherine Mejia-Guerra (OSU) and Jeffrey Campbell (OSU).  All of this will be (is already?) posted online.  “The next 10 years of quant bio” by Mike Schatz  …with apologies to Gary Bernhardt (Birth & Death of JavaScript – go watch it!) Monday, July 11th, 2039