Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Mass tlc big data panel sep 20
1. MassTLC Big Data Seminar
What Does @
m
as
All This #b lc
ig
st
da
Data Mean? ta
September 20, 2012
IBM Innovation Center
Waltham MA
2. What Does All This Data Mean?
Agenda
•Setting the Context
•Introducing the Panel
•Panel Discussion
•Q&A
– Hashtags: @masstlc #bigdata
3. Your Panel
• Richard Dale, Managing Director,
Big Data Boston Ventures
– Twitter: @rdale
• Irene Greif, Fellow, IBM Visualization
– Twitter: @igreif
• Martin Leach, CIO, Broad Institute
– Twitter: @mdleach
• Andrew Pandre, Principal, Sears Holding Cos
– http://apandre.wordpress.com/
4. Richard Dale
Managing Director,
Big Data Boston Ventures
Micro-VC fund investing in
big data companies located in
or connected to the regional big data cluster
Database techie turned Entrepreneur turned VC
– Database Performance Guru, SQL Solutions
– Co-founder, Phase Forward
– Principal, Sigma Partners
– Founder & Managing Director, Big Data Boston Ventures
5. Setting the Context
• What is Big Data?
• Where does Big Data come from?
• What is Big Data going?
6. What is Big Data?
a collection of data
sets so large and complex
that it becomes difficult
to process using on-hand
database management tools
(wikipedia)
7. What is Big Data?
3 V’s:
•volume
•velocity
•variety
(Doug Laney, Gartner)
8. What is Big Data?
Data easier and cheaper to
collect than to analyze
(??)
9. What is Big Data?
Data that you can’t process on a
single machine, however big your
machine (and however long you wait)
or
Data growing faster than
Moore’s law
(Richard Dale)
10. Where Does Big Data Come From?
Behavior
•Social Media
•User Generated Content
•Click streams
•Viewing, Purchasing, Liking, Sharing
•The Quantified Self
11. Where Does Big Data Come From?
Observation (in ever finer granularity)
•Machines
– Computers, Vehicles, Phones, Industrial Machines
•Environments
– RFID, Traffic flow, Nature (and our impact)
•People
– The Quantified Self
– Medical imaging
– Genetic sequencing
12. Where Does Big Data Come From?
Correlations
•Each data item, image or observation can be
cross-correlated with any other
•Even if N is tractable,
N x N x N x … is not
13. Technology Landscape
Applications: Horizontal and Vertical
business or domain applications
Data
Services:
Collecting, Analytics: Algorithms, Visualization,
Collating, Machine Learning
Correlating,
Curating
Infrastructure: Storing, Managing, Moving
Source:
14. Technology Landscape
Applications: Horizontal and Vertical
business or domain applications
Data
Services:
Collecting, Analytics: Algorithms, Visualization,
Collating, Machine Learning
Correlating,
Curating
Infrastructure: Storing, Managing, Moving
Source:
15. Technology Landscape
Applications: Horizontal and Vertical
business or domain applications
Data
Services:
Collecting, Analytics: Algorithms, Visualization,
Collating, Machine Learning
Correlating,
Curating
Infrastructure: Storing, Managing, Moving
Source:
16. A Sea of Choices for Data Viz
• BI packages
• Dashboard reporting tools
• Ad hoc infographics
• Whiteboards
• Napkin scribbles
17. Turning Big Data into Big Clarity
Art or Science? Let’s ask the Panel!
•Irene Greif, IBM Fellow
– Twitter: @igreif
•Martin Leach, CIO, Broad Institute
– Twitter: @mdleach
•Andrew Pandre, Principal, Sears Holding Cos
– http://apandre.wordpress.com/
18. Turning Big Data into Big Clarity
Art or Science? Let’s ask the Panel!
•Irene Greif, IBM Fellow
– Twitter: @igreif
•Martin Leach, CIO, Broad Institute
– Twitter: @mdleach
•Andrew Pandre, Principal, Sears Holding Cos
– http://apandre.wordpress.com/
19. IBM Center for Social Business
Irene Greif, IBM Fellow, Chief Scientist for Social Business
Many
Eyes
20. Turning Big Data into Big Clarity
Art or Science? Let’s ask the Panel!
•Irene Greif, IBM Fellow
– Twitter: @igreif
•Martin Leach, CIO, Broad Institute
– Twitter: @mdleach
•Andrew Pandre, Principal, Sears Holding Cos
– http://apandre.wordpress.com/
21. The Broad Institute of
MIT & Harvard
• The Broad Institute is a non-profit biomedical research
institute
• Ten core faculty members and approximately 150
associate members from across MIT and Harvard
• Greater than 1900 research and administrative staff
Martin Leach, CIO
Programs and Initiatives Platforms
focused on specific disease or biology areas focused technological innovation and application
Cancer Genomics Platform
Genome Biology Biological Samples
Genome Sequencing and Analysis Genome Sequencing
Cell Circuits Genetic Analysis
Psychiatric Disease Chemical Biology/Novel Therapeutics
Metabolism Imaging
Medical and Population Genetics Metabolite Profiling
Chemical Biology/Novel Therapeutics Proteomics
Infectious Disease RNAi
Epigenomics Therapeutics Discovery & Development
22. Turning Big Data into Big Clarity
Art or Science? Let’s ask the Panel!
•Irene Greif, IBM Fellow
– Twitter: @igreif
•Martin Leach, CIO, Broad Institute
– Twitter: @mdleach
•Andrew Pandre, Principal, Sears Holding Cos
– http://apandre.wordpress.com/
23. Big Data
Visualization
Andrew Pandre, Ph.D.,
Principal
Sears Holdings Corporation
Data Visualization Blog
http://apandre.wordpress.com
Google+ microblog:
http://tinyurl.com/VisibleData