Deepak Singh, Ph.D.
 
 
 
 
Picture via Eole under a CC-BY-NC-SA license
?
Via Reavel under a CC-BY-NC-ND license
biz dev manager
 
 
resizable compute capacity
scalable web sites
number crunching
but
life science industry
software
informatics
scientific programmer
product manager
strategist
 
 
 
opinions
lots of opinions
 
career choices
software development
informatics
computing
data
open data
http://mndoci.com
http://c2cbio.com or on iTunes
http://bioscreencast.com
By jasarcadia under a CC-BY-NC-ND license
A meme (pronounced /miːm/) consists of any idea or behavior that can pass from one person to another by learning or imitation.
 
big data
collective intelligence
the new science
By ~Prescott under a CC-BY-NC license
datasets
many datasets
Pfam, GenBank, Ensembl, PDB, many others
 
manageable
download
 
 
 
 
data management is not data storage
smart
 
context
Via Nature Reviews Cancer
technology
technology ? ? ? ?
technology technology technology technology
Back of the room
listening
toxicologists
 
 
experiment design
holistic
systems biology
the s*&t hits the fan
Image courtesy Matt Wood
 
 
 
genome #1
$3 billion
15 years
 
1000 genomes
http://www.1000genomes.org/
By bitterlysweet under a CC-BY-NC-ND license
75 TB / week
600 GB – 6 TB / run
200 TB drive
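To put the 75 TB / week figure in perspective, here is a minimal back-of-the-envelope sketch that converts it to an average sustained transfer rate and a yearly total. It assumes decimal units (1 TB = 10^12 bytes) and a 52-week year, neither of which the slides specify. The function names are hypothetical and purely illustrative.

```python
# Rough conversions for the weekly data volume quoted on the slide.
# Assumptions (not from the talk): 1 TB = 10**12 bytes, 52 weeks per year.

SECONDS_PER_WEEK = 7 * 24 * 60 * 60
BYTES_PER_TB = 10**12
BYTES_PER_MB = 10**6


def tb_per_week_to_mb_per_s(tb_per_week: float) -> float:
    """Average sustained rate, in MB/s, needed to move tb_per_week each week."""
    return tb_per_week * BYTES_PER_TB / SECONDS_PER_WEEK / BYTES_PER_MB


def tb_per_year(tb_per_week: float, weeks: int = 52) -> float:
    """Total volume, in TB, accumulated over a year at a constant weekly rate."""
    return tb_per_week * weeks


if __name__ == "__main__":
    rate = tb_per_week_to_mb_per_s(75)
    print(f"75 TB/week is roughly {rate:.0f} MB/s sustained")
    print(f"75 TB/week is roughly {tb_per_year(75) / 1000:.1f} PB/year")
```

Under these assumptions the weekly figure works out to a little over 120 MB/s around the clock, close to a saturated gigabit link, which is one way to read the slides' point that simply downloading the data stops being manageable.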
schema
too big to fit on a wall
implications
Via Barack Obama under a CC-BY-NC-SA license
 
utilization
capacity planning
data availability
data access
collaboration
computation
typical informatics workflow
 
 
 
 
 
distribute everything
distributed data
distributed computing
Via bionicteaching under a CC-BY license
Via bionicteaching under a CC-BY license
services everywhere
data services
application services
api
available everywhere
available all the time
 
 
 
 
 
 
 
 
sensors
adverse event reporting
research streaming
 
computing everywhere
Via Laughing Squid under a CC-BY-NC-ND license
 
 
collective intelligence is a shared or group intelligence that emerges from the collaboration and competition of many individuals.
 
networked future of science
 
protective
A biologist would rather share their toothbrush than their (gene) names -- Mike Ashburner (Cambridge)
 
 
wisdom
look elsewhere
Wherever you work, most of the smart people are somewhere else -- Bill Joy
TIMTOWTDI
data
data finds the data (Source: Jeff Jonas)
data finds the data, then people find people (Source: Jon Udell)
important
world wide web
giant global graph
 
search
traverse link graph
people
 
present
future
data in context
linked data
the artist formerly known as
the semantic web
 
entity extraction
 
follow the graph
let the data find the data
and then
people will find the people
information overload
 
filter failure
human trust networks
many ways
scientific social networks
 
why?
put people first
 
communities around data
http://ecolicommunity.org
http://ebird.org/content/ebird/
micro-communities
 
 
 
little segue
“Bursty Work”
loosely distributed collaborations
computational problems
 
 
 
back on track
I define Web 2.0 as the design of systems that harness network effects to get better the more people use them, or more colloquially, as “harnessing collective intelligence.” This includes explicit network-enabled collaboration, to be sure, but it should encompass every way that people connected to a network create synergistic effects -- Tim O’Reilly
web as platform
data driven platform
people driven platform
 
 
 
bayesian filter
find relevant information
huge amounts of data
architect for innovation
google visualization api
structured data
multiple sources
connected to the web
platform
create
share
re-use
visualizations
create
share
re-use
create
share
re-use
 
 
mashups
 
 
collect
analyze
remix
repurpose
only way
open data
obey web standards
xml
json
rdf
all this stuff
new models
research
collaboration
business
exciting times
Via The Opportunity Agenda under a CC-BY-NC-SA license
the door is open
take the step
Acknowledgements: Matt Wood, Carole Goble, Larry Lessig, The Biogang
 

Science Big, Science Connected
