SlideShare a Scribd company logo
1 of 30
Download to read offline
Netherlands eScience Center
ICT Synergy Hub, Amsterdam


Taming the Big Data Beast - Together

Nieuwjaarsbijeenkomst Kennisalliantie
Delft, 31 januari-2013

Prof. dr. Jacob de Vlieg ¹ ²
1. CEO & Scientific Director of Netherlands eScience Center, NWO-SURF
2. Head Computational Design & Discovery, CMBI, Radboud University, Medical Center, Nijmegen,
Netherlands
Agenda

• Big Data in Science: Challenges & Opportunities
  – Top Sector ICT Roadmap theme: “Data, Data, Data”


• Netherlands eScience Center (NLeSC)
  – Expert centre for Big Data Research


• Joint NWO-NLeSC “Big Data” project call
  – Public-private partnerships
Data are the lifeblood of modern science and
             the digital economy
Data are the lifeblood of modern science and
               the digital economy

Managing, analyzing, linking & re-using data to create business
  value and/or scientific breakthroughs e.g.
   –   Social media data to influence consumer choices
   –   Sensor networks data: e.g. sensor-enabled smart dikes
   –   Imaging & biobanking data in health care e.g. diagnostics, medicine
   –   And many more opportunities
Data are the lifeblood of modern science and
               the digital economy

Managing, analyzing, linking & re-using data to create business
  value and/or scientific breakthroughs e.g.
   –   Social media data to influence consumer choices
   –   Sensor networks data: e.g. sensor-enabled smart dikes
   –   Imaging & biobanking data in health care e.g. diagnostics, medicine
   –   And many more opportunities


Big Data: a complex concept
   – 4Vs: Volume, Variety, Velocity, Verification
Data are the lifeblood of modern science and
               the digital economy

Managing, analyzing, linking & re-using data to create business
  value and/or scientific breakthroughs e.g.
    –   Social media data to influence consumer choices
    –   Sensor networks data: e.g. sensor-enabled smart dikes
    –   Imaging & biobanking data in health care e.g. diagnostics, medicine
    –   And many more opportunities.


Big Data: a complex concept
    – 4Vs: Volume, Variety, Velocity, Verification


Big Data inextricably connected to eScience/HPC

ICT top sector roadmap: e-Science is about intelligent infrastructure to
   model and/or to access big data
Key eScience challenges Big Data research

–   Cross-type data integration
–   Data-driven & multi-models simulations
–   Visualization & analytics
–   High performance computing: connected computers & fast networks.
Key eScience challenges Big Data research

–   Cross-type data integration
–   Data-driven & multi-models simulations
–   Visualization & analytics
–   High performance computing: connected computers & fast networks




–   Stimulate culture of knowledge sharing: no silos; data stewardship
–   Rationalization of ICT landscapes; interoperability & industry data standards
–   Training & education
Science itself is changing …We
 need to change with it…

                     Neelie Kroes in “Giving Europe’s Scientists the
                     Tools to Deliver”




Two key words: multidisciplinary research & data-driven discovery
eScience and the mystery of the
empty labs
eScience and the mystery of the
empty labs
eScience and the mystery of the
empty labs




       • Much more data per experiment (miniaturized and/or automation)
       • External data sources & outsourcing
       • Experimental design, data management & analytics(eScience)
Quantified Self Movement -> Big Data

                 Use apps and wearable sensors to
                 monitor daily life e.g. hours of sleep, food
                 consumed, exercise taken, etc.


                 Quantified Self = Big Data + Mobile +
                 Sensors + Visualization + Gamification




                 .
eScience Hero
Fights for medical innovation; parkinson’s disease


•   Big Data

•   Pattern recognition

•   Machine learning

•   Social Media

                            Andy Grove (ex-CEO Intel)
Voice algorithms spot Parkinson's disease:
data-driven diagnostics
• Machine learning algorithms that analyse voice
  recordings to detect Parkinson's symptoms early
  on (Little at al. @ Media Lab, MIT)

• Social Media:

                       Looking for volunteers to contribute to
                       the database to improve pattern
                       recognition
Voice algorithms spot Parkinson's disease:
data-driven diagnostics
• Machine learning algorithms that analyse voice
  recordings to detect Parkinson's symptoms early
  on (Little at al. @ Media Lab, MIT)

• Social Media:

                          Looking for volunteers to contribute to
                          the database to improve pattern
                          recognition



Social networking health sites: patient-driven data collection
    •21andme
    •PatientsLikeMe.com
    •And so on
                     Big Data V= Verification: privacy, compliance, etc
'Data Scientist' is
  now the hottest
  job title in Silicon
  Valley…                                    Tim O'Reilly
                                             Founder of O'Reilly Media
                                             Supporter free software and open
                                             source movements



McKinsey projected that the US needs
140,000 to 190,000 more workers with “deep
analytical expertise”
Netherlands eScience Center

          Netherlands organization       Principal Dutch body for
           for scientific research:    ICT innovation for research




                                  NL-eSC
                                            SURF Science park, Amsterdam;
                                            SARA, EGI

                                            Networked innovation model

                                            Bridge:
                                                 •Science & advanced ICT
                                                 •Industry & Academic Research

                                            •Training & Education

New ways to do research made possible because of Big Data/eScience
NLeSC portfolio divided in themes
•Sustainability & Environment   •Life Sciences
- Climate                       - Green Genetics
- Water management              - Translational Research IT
-Energy                         - Foods
-Ecology                        - Cognition/Neuroscience

•Chemistry & Materials          •eScience Methodology & ‘Big Data’
-Chemistry                      - eScience Methodology
                                - Astronomy
•Humanities & Social Sciences
- Humanities
-Social Sciences
Can scientists from digital humanities help food
researchers?
Food Research: Food Specific Ontologies for Food Focused Text Mining

Project Leader: Wynand Alkema

Addressing absence of domain specific structured
vocabularies which limits the use of data mining &
knowledge management methods in food research.


Digital Humanities: BiographyNED

Project Leader: Guus Schreiber

Will improve current version of the Biography Portal by
incorporating analytical tools to show interconnections,
trends, geographical maps and time lines.
eScience & Big Data: providing leads for
        new food applications
NLeSC eScience engineers:
Scientists bridging research and advanced ICT




                   Deliver sustainable solutions for data-driven research

                   Work both at center and on site
Collaborative Innovation Network
     Taming the Data Beast Together

 SMEs,etc




      NLeSC eScience Engineers:

      Work both at center and on site:
      •Exchange of eScience expertise
      •Re-use of proven eScience (technology hopping)
      •Career development & training
Grand scientific challenges leads to
     innovative eScience & Big Data Research
eSalsa NLeSC project: data-driven simulations & advanced
visualization to understand Climate Change
                                                                   Prof. Henk Dijkstra,
                                                                   Univ. of Utrecht
                                                                   NLeSC Integrator Climate




                                                                    Dr. Jason Maassen
                                                                    eScience Engineer
                                                                    NLeSC




 •eScience to allow unprecedented level of detail (large scale distributed computing)

 •State-of-the-art visualization techniques to analyze hundreds of Terabytes of
 output

 •Re-use of proven eScience concepts in new areas (e.g. sector water)
The number of data-driven start-ups is growing—
particularly when it comes to social media.




              Taming the Big Data Beast
Development of a high performance
              Twitter analysis platform
Hadoop – MapReduce architecture @ a large SARA computer cluster


Smart search & analysis software



                                                                 Prof. Antal van den Bosch
Goal is to ask “Big Data” research questions e.g.                NLeSC Integrator Humanities
                                                                 Radboud University Nijmegen

 •   Ability to analyze microblogging data produced over years
 •   Time dependant                                                Dr. Erik Tjong Kim Sang
                                                                   eScience Engineer
 •   Real time sentiment analysis                                  NleSC
 •   And so on…
Cyber-common: a facility for 21st century data-
driven research and multidisciplinary team work
     The key to scientific questions y!
 To link minds and eScience




                      SURF-SARA-NLeSC
Cyber-common: a facility for 21st century data-
driven research and multidisciplinary team work
     The key to to scientific questions
       The key scientific questions y!
                yet unasked!
 To link minds and eScience




                      SURF-SARA-NLeSC
Joint NWO-NLeSC “data sciences” call
• Focus on stimulating public-private partnerships

• Three instruments:
   – Industrial Partnership Programme (IPP)
   – Technology Area’s (TA)
   – Knowledge Innovation Mapping SMEs (KIEM MKB)


           Rosemarie van der Veen-Oei (NLeSC)
           r.vanderveen@nwo.nl
           T 070 3440 851




             Mark Kas (NWO)                     www.nlesc.nl
             m.kas@nwo.nl
             T 070 3440 811, M 06 205 93 207    Netherlands eScience Center
Thank you

     www.esciencecenter.nl

More Related Content

What's hot

Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402vrij
 
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical ScienceAri Berman
 
Science Gateways for Life Sciences – Balancing Usability and Re-Usability
Science Gateways for Life Sciences – Balancing Usability and Re-Usability Science Gateways for Life Sciences – Balancing Usability and Re-Usability
Science Gateways for Life Sciences – Balancing Usability and Re-Usability Sandra Gesing
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sangerChris Dwan
 
Building the Pacific Research Platform: Supernetworks for Big Data Science
Building the Pacific Research Platform: Supernetworks for Big Data ScienceBuilding the Pacific Research Platform: Supernetworks for Big Data Science
Building the Pacific Research Platform: Supernetworks for Big Data ScienceLarry Smarr
 
NSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingNSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingDaniel S. Katz
 
Love for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLove for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLourdes Verdes-Montenegro
 
The New e-Science (Bangalore Edition)
The New e-Science (Bangalore Edition)The New e-Science (Bangalore Edition)
The New e-Science (Bangalore Edition)David De Roure
 
2019 June 27 - Big data and data science
2019 June 27 - Big data and data science2019 June 27 - Big data and data science
2019 June 27 - Big data and data scienceFabio Stella
 
Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2Dan Taylor
 
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...Amit Sheth
 
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...Andrea Wiggins
 
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...AlmereDataCapital
 
NSF SI2 program discussion at 2013 SI2 PI meeting
NSF SI2 program discussion at 2013 SI2 PI meetingNSF SI2 program discussion at 2013 SI2 PI meeting
NSF SI2 program discussion at 2013 SI2 PI meetingDaniel S. Katz
 
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...Andrea Wiggins
 

What's hot (20)

Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402
 
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
 
Science Gateways for Life Sciences – Balancing Usability and Re-Usability
Science Gateways for Life Sciences – Balancing Usability and Re-Usability Science Gateways for Life Sciences – Balancing Usability and Re-Usability
Science Gateways for Life Sciences – Balancing Usability and Re-Usability
 
2016 05 sanger
2016 05 sanger2016 05 sanger
2016 05 sanger
 
Building the Pacific Research Platform: Supernetworks for Big Data Science
Building the Pacific Research Platform: Supernetworks for Big Data ScienceBuilding the Pacific Research Platform: Supernetworks for Big Data Science
Building the Pacific Research Platform: Supernetworks for Big Data Science
 
NSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingNSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meeting
 
Data Landscapes - Addiction
Data Landscapes - AddictionData Landscapes - Addiction
Data Landscapes - Addiction
 
Love for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLove for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 version
 
NTXT4-WEB-lowres
NTXT4-WEB-lowresNTXT4-WEB-lowres
NTXT4-WEB-lowres
 
The New e-Science (Bangalore Edition)
The New e-Science (Bangalore Edition)The New e-Science (Bangalore Edition)
The New e-Science (Bangalore Edition)
 
Cognitive systems16
Cognitive systems16Cognitive systems16
Cognitive systems16
 
2019 June 27 - Big data and data science
2019 June 27 - Big data and data science2019 June 27 - Big data and data science
2019 June 27 - Big data and data science
 
Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2
 
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
 
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
Citizen Science 101: What Every Researcher Should Know About Crowdsourcing Sc...
 
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...
 
NSF SI2 program discussion at 2013 SI2 PI meeting
NSF SI2 program discussion at 2013 SI2 PI meetingNSF SI2 program discussion at 2013 SI2 PI meeting
NSF SI2 program discussion at 2013 SI2 PI meeting
 
Cyberistructure
CyberistructureCyberistructure
Cyberistructure
 
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
Crowdsourcing Scientific Work: A Comparative Study of Technologies, Processes...
 
Engaging the software in research community
Engaging the software in research communityEngaging the software in research community
Engaging the software in research community
 

Viewers also liked

Peter Bakas - Zero to Insights - Real time analytics with Kafka, C*, and Spar...
Peter Bakas - Zero to Insights - Real time analytics with Kafka, C*, and Spar...Peter Bakas - Zero to Insights - Real time analytics with Kafka, C*, and Spar...
Peter Bakas - Zero to Insights - Real time analytics with Kafka, C*, and Spar...NoSQLmatters
 
3 top tools for taming big data
3 top tools for taming big data3 top tools for taming big data
3 top tools for taming big dataHPC Asia
 
Taming the Data Deluge: How Advertisers and Publishers Can Win at the Big Dat...
Taming the Data Deluge: How Advertisers and Publishers Can Win at the Big Dat...Taming the Data Deluge: How Advertisers and Publishers Can Win at the Big Dat...
Taming the Data Deluge: How Advertisers and Publishers Can Win at the Big Dat...Signal Chicago 2012
 
Taming Big Data in the Reverse Logistics Supply Chain
Taming Big Data in the Reverse Logistics Supply ChainTaming Big Data in the Reverse Logistics Supply Chain
Taming Big Data in the Reverse Logistics Supply ChainJonathan Pine
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!Ian Foster
 
Taming Social Data: How Social Data Framing liberates analysis and accelerate...
Taming Social Data: How Social Data Framing liberates analysis and accelerate...Taming Social Data: How Social Data Framing liberates analysis and accelerate...
Taming Social Data: How Social Data Framing liberates analysis and accelerate...DataSift
 
Spark: Taming Big Data
Spark: Taming Big DataSpark: Taming Big Data
Spark: Taming Big DataLeonardo Gamas
 
Tools and techniques adopted for big data analytics
Tools and techniques adopted for big data analyticsTools and techniques adopted for big data analytics
Tools and techniques adopted for big data analyticsJOSEPH FRANCIS
 
Taming the Big Data Beast
Taming the Big Data BeastTaming the Big Data Beast
Taming the Big Data BeastBaynote
 
Big Data
Big DataBig Data
Big DataNGDATA
 

Viewers also liked (15)

Taming Big Data with NoSQL
Taming Big Data with NoSQLTaming Big Data with NoSQL
Taming Big Data with NoSQL
 
Peter Bakas - Zero to Insights - Real time analytics with Kafka, C*, and Spar...
Peter Bakas - Zero to Insights - Real time analytics with Kafka, C*, and Spar...Peter Bakas - Zero to Insights - Real time analytics with Kafka, C*, and Spar...
Peter Bakas - Zero to Insights - Real time analytics with Kafka, C*, and Spar...
 
Turning Information chaos into reliable data
Turning Information chaos into reliable dataTurning Information chaos into reliable data
Turning Information chaos into reliable data
 
3 top tools for taming big data
3 top tools for taming big data3 top tools for taming big data
3 top tools for taming big data
 
Taming the Data Deluge: How Advertisers and Publishers Can Win at the Big Dat...
Taming the Data Deluge: How Advertisers and Publishers Can Win at the Big Dat...Taming the Data Deluge: How Advertisers and Publishers Can Win at the Big Dat...
Taming the Data Deluge: How Advertisers and Publishers Can Win at the Big Dat...
 
Taming Big Data in the Reverse Logistics Supply Chain
Taming Big Data in the Reverse Logistics Supply ChainTaming Big Data in the Reverse Logistics Supply Chain
Taming Big Data in the Reverse Logistics Supply Chain
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
 
Hadoop-2.6.0 Slides
Hadoop-2.6.0 SlidesHadoop-2.6.0 Slides
Hadoop-2.6.0 Slides
 
Taming Social Data: How Social Data Framing liberates analysis and accelerate...
Taming Social Data: How Social Data Framing liberates analysis and accelerate...Taming Social Data: How Social Data Framing liberates analysis and accelerate...
Taming Social Data: How Social Data Framing liberates analysis and accelerate...
 
Spark: Taming Big Data
Spark: Taming Big DataSpark: Taming Big Data
Spark: Taming Big Data
 
Tools and techniques adopted for big data analytics
Tools and techniques adopted for big data analyticsTools and techniques adopted for big data analytics
Tools and techniques adopted for big data analytics
 
Taming the Big Data Beast
Taming the Big Data BeastTaming the Big Data Beast
Taming the Big Data Beast
 
Big Data
Big DataBig Data
Big Data
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 

Similar to Taming the Big Data Beast - Together

06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014VinothkumaR Ramu
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science James Hendler
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016Jisc
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxssuser1a4f0f
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxwahiba ben abdessalem
 
Data_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfData_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfvishal choudhary
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Jisc
 
Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptxshalini s
 
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017Sandra Gesing
 
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...e-ROSA
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data scienceJordan Engbers
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard
 
Big Data for the Social Sciences
Big Data for the Social SciencesBig Data for the Social Sciences
Big Data for the Social SciencesDavid De Roure
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey BoultonLEARN Project
 

Similar to Taming the Big Data Beast - Together (20)

06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 
The Science of Data Science
The Science of Data Science The Science of Data Science
The Science of Data Science
 
Big Data
Big Data Big Data
Big Data
 
eResearch@UCT
eResearch@UCTeResearch@UCT
eResearch@UCT
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
Data_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfData_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdf
 
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
 
Real-time applications of Data Science.pptx
Real-time applications  of Data Science.pptxReal-time applications  of Data Science.pptx
Real-time applications of Data Science.pptx
 
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017
 
Medical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructuresMedical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructures
 
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
eROSA Stakeholder WS1: Big Data and Open Science in agricultural and environm...
 
Making an impact with data science
Making an impact  with data scienceMaking an impact  with data science
Making an impact with data science
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
Big Data for the Social Sciences
Big Data for the Social SciencesBig Data for the Social Sciences
Big Data for the Social Sciences
 
Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16
 
Information entanglement
Information entanglementInformation entanglement
Information entanglement
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
 

More from Kennisalliantie

Beeldcompilatie KennisBruggen
Beeldcompilatie KennisBruggenBeeldcompilatie KennisBruggen
Beeldcompilatie KennisBruggenKennisalliantie
 
Presentatie platform organisatie en instrumenten
Presentatie platform organisatie en instrumentenPresentatie platform organisatie en instrumenten
Presentatie platform organisatie en instrumentenKennisalliantie
 
Ct & groene grondstoffen h ro
Ct & groene grondstoffen h roCt & groene grondstoffen h ro
Ct & groene grondstoffen h roKennisalliantie
 
2 d3d factsheet biobased a4_dowtk_versie02_zonder_snijtekens_hr
2 d3d factsheet biobased a4_dowtk_versie02_zonder_snijtekens_hr2 d3d factsheet biobased a4_dowtk_versie02_zonder_snijtekens_hr
2 d3d factsheet biobased a4_dowtk_versie02_zonder_snijtekens_hrKennisalliantie
 
Samenvatting swot analyse ka lei 2011
Samenvatting swot analyse ka lei 2011Samenvatting swot analyse ka lei 2011
Samenvatting swot analyse ka lei 2011Kennisalliantie
 
Presentatie rhdhv aldert van der kooij
Presentatie rhdhv aldert van der kooijPresentatie rhdhv aldert van der kooij
Presentatie rhdhv aldert van der kooijKennisalliantie
 
Presentatie platform overall
Presentatie platform overall Presentatie platform overall
Presentatie platform overall Kennisalliantie
 
Pitch kenniscentrum plantenstoffen
Pitch kenniscentrum plantenstoffenPitch kenniscentrum plantenstoffen
Pitch kenniscentrum plantenstoffenKennisalliantie
 
Pitch isobutanolplatform
Pitch isobutanolplatformPitch isobutanolplatform
Pitch isobutanolplatformKennisalliantie
 
Pitch holland rijnland biobased
Pitch holland rijnland biobasedPitch holland rijnland biobased
Pitch holland rijnland biobasedKennisalliantie
 
Pitch groene hart biobased
Pitch groene hart biobasedPitch groene hart biobased
Pitch groene hart biobasedKennisalliantie
 
Pitch clusterregeling svz april 2013
Pitch clusterregeling svz april 2013Pitch clusterregeling svz april 2013
Pitch clusterregeling svz april 2013Kennisalliantie
 
Pitch biotech campus delft
Pitch biotech campus delftPitch biotech campus delft
Pitch biotech campus delftKennisalliantie
 

More from Kennisalliantie (20)

Beeldcompilatie KennisBruggen
Beeldcompilatie KennisBruggenBeeldcompilatie KennisBruggen
Beeldcompilatie KennisBruggen
 
Presentatie platform organisatie en instrumenten
Presentatie platform organisatie en instrumentenPresentatie platform organisatie en instrumenten
Presentatie platform organisatie en instrumenten
 
Syntens go bio
Syntens go bioSyntens go bio
Syntens go bio
 
Informatie bioliquid a4
Informatie bioliquid a4Informatie bioliquid a4
Informatie bioliquid a4
 
Ct & groene grondstoffen h ro
Ct & groene grondstoffen h roCt & groene grondstoffen h ro
Ct & groene grondstoffen h ro
 
Biotech campus delft
Biotech campus delftBiotech campus delft
Biotech campus delft
 
Biobased westland
Biobased westlandBiobased westland
Biobased westland
 
A4 alphen aan den rijn
A4 alphen aan den rijnA4 alphen aan den rijn
A4 alphen aan den rijn
 
2 d3d factsheet biobased a4_dowtk_versie02_zonder_snijtekens_hr
2 d3d factsheet biobased a4_dowtk_versie02_zonder_snijtekens_hr2 d3d factsheet biobased a4_dowtk_versie02_zonder_snijtekens_hr
2 d3d factsheet biobased a4_dowtk_versie02_zonder_snijtekens_hr
 
Samenvatting swot analyse ka lei 2011
Samenvatting swot analyse ka lei 2011Samenvatting swot analyse ka lei 2011
Samenvatting swot analyse ka lei 2011
 
Presentatie rhdhv aldert van der kooij
Presentatie rhdhv aldert van der kooijPresentatie rhdhv aldert van der kooij
Presentatie rhdhv aldert van der kooij
 
Presentatie platform overall
Presentatie platform overall Presentatie platform overall
Presentatie platform overall
 
Pitch mkb go bio
Pitch mkb go bioPitch mkb go bio
Pitch mkb go bio
 
Pitch kenniscentrum plantenstoffen
Pitch kenniscentrum plantenstoffenPitch kenniscentrum plantenstoffen
Pitch kenniscentrum plantenstoffen
 
Pitch isobutanolplatform
Pitch isobutanolplatformPitch isobutanolplatform
Pitch isobutanolplatform
 
Pitch holland rijnland biobased
Pitch holland rijnland biobasedPitch holland rijnland biobased
Pitch holland rijnland biobased
 
Pitch hoeksche waard
Pitch hoeksche waardPitch hoeksche waard
Pitch hoeksche waard
 
Pitch groene hart biobased
Pitch groene hart biobasedPitch groene hart biobased
Pitch groene hart biobased
 
Pitch clusterregeling svz april 2013
Pitch clusterregeling svz april 2013Pitch clusterregeling svz april 2013
Pitch clusterregeling svz april 2013
 
Pitch biotech campus delft
Pitch biotech campus delftPitch biotech campus delft
Pitch biotech campus delft
 

Recently uploaded

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 

Recently uploaded (20)

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 

Taming the Big Data Beast - Together

  • 1. Netherlands eScience Center ICT Synergy Hub, Amsterdam Taming the Big Data Beast - Together Nieuwjaarsbijeenkomst Kennisalliantie Delft, 31 januari-2013 Prof. dr. Jacob de Vlieg ¹ ² 1. CEO & Scientific Director of Netherlands eScience Center, NWO-SURF 2. Head Computational Design & Discovery, CMBI, Radboud University, Medical Center, Nijmegen, Netherlands
  • 2. Agenda • Big Data in Science: Challenges & Opportunities – Top Sector ICT Roadmap theme: “Data, Data, Data” • Netherlands eScience Center (NLeSC) – Expert centre for Big Data Research • Joint NWO-NLeSC “Big Data” project call – Public-private partnerships
  • 3. Data are the lifeblood of modern science and the digital economy
  • 4. Data are the lifeblood of modern science and the digital economy Managing, analyzing, linking & re-using data to create business value and/or scientific breakthroughs e.g. – Social media data to influence consumer choices – Sensor networks data: e.g. sensor-enabled smart dikes – Imaging & biobanking data in health care e.g. diagnostics, medicine – And many more opportunities
  • 5. Data are the lifeblood of modern science and the digital economy Managing, analyzing, linking & re-using data to create business value and/or scientific breakthroughs e.g. – Social media data to influence consumer choices – Sensor networks data: e.g. sensor-enabled smart dikes – Imaging & biobanking data in health care e.g. diagnostics, medicine – And many more opportunities Big Data: a complex concept – 4Vs: Volume, Variety, Velocity, Verification
  • 6. Data are the lifeblood of modern science and the digital economy Managing, analyzing, linking & re-using data to create business value and/or scientific breakthroughs e.g. – Social media data to influence consumer choices – Sensor networks data: e.g. sensor-enabled smart dikes – Imaging & biobanking data in health care e.g. diagnostics, medicine – And many more opportunities. Big Data: a complex concept – 4Vs: Volume, Variety, Velocity, Verification Big Data inextricably connected to eScience/HPC ICT top sector roadmap: e-Science is about intelligent infrastructure to model and/or to access big data
  • 7. Key eScience challenges Big Data research – Cross-type data integration – Data-driven & multi-models simulations – Visualization & analytics – High performance computing: connected computers & fast networks.
  • 8. Key eScience challenges Big Data research – Cross-type data integration – Data-driven & multi-models simulations – Visualization & analytics – High performance computing: connected computers & fast networks – Stimulate culture of knowledge sharing: no silos; data stewardship – Rationalization of ICT landscapes; interoperability & industry data standards – Training & education
  • 9. Science itself is changing …We need to change with it… Neelie Kroes in “Giving Europe’s Scientists the Tools to Deliver” Two key words: multidisciplinary research & data-driven discovery
  • 10. eScience and the mystery of the empty labs
  • 11. eScience and the mystery of the empty labs
  • 12. eScience and the mystery of the empty labs • Much more data per experiment (miniaturized and/or automation) • External data sources & outsourcing • Experimental design, data management & analytics(eScience)
  • 13. Quantified Self Movement -> Big Data Use apps and wearable sensors to monitor daily life e.g. hours of sleep, food consumed, exercise taken, etc. Quantified Self = Big Data + Mobile + Sensors + Visualization + Gamification .
  • 14. eScience Hero Fights for medical innovation; parkinson’s disease • Big Data • Pattern recognition • Machine learning • Social Media Andy Grove (ex-CEO Intel)
  • 15. Voice algorithms spot Parkinson's disease: data-driven diagnostics • Machine learning algorithms that analyse voice recordings to detect Parkinson's symptoms early on (Little at al. @ Media Lab, MIT) • Social Media: Looking for volunteers to contribute to the database to improve pattern recognition
  • 16. Voice algorithms spot Parkinson's disease: data-driven diagnostics • Machine learning algorithms that analyse voice recordings to detect Parkinson's symptoms early on (Little at al. @ Media Lab, MIT) • Social Media: Looking for volunteers to contribute to the database to improve pattern recognition Social networking health sites: patient-driven data collection •21andme •PatientsLikeMe.com •And so on Big Data V= Verification: privacy, compliance, etc
  • 17. 'Data Scientist' is now the hottest job title in Silicon Valley… Tim O'Reilly Founder of O'Reilly Media Supporter free software and open source movements McKinsey projected that the US needs 140,000 to 190,000 more workers with “deep analytical expertise”
  • 18. Netherlands eScience Center Netherlands organization Principal Dutch body for for scientific research: ICT innovation for research NL-eSC SURF Science park, Amsterdam; SARA, EGI Networked innovation model Bridge: •Science & advanced ICT •Industry & Academic Research •Training & Education New ways to do research made possible because of Big Data/eScience
  • 19. NLeSC portfolio divided in themes •Sustainability & Environment •Life Sciences - Climate - Green Genetics - Water management - Translational Research IT -Energy - Foods -Ecology - Cognition/Neuroscience •Chemistry & Materials •eScience Methodology & ‘Big Data’ -Chemistry - eScience Methodology - Astronomy •Humanities & Social Sciences - Humanities -Social Sciences
  • 20. Can scientists from digital humanities help food researchers? Food Research: Food Specific Ontologies for Food Focused Text Mining Project Leader: Wynand Alkema Addressing absence of domain specific structured vocabularies which limits the use of data mining & knowledge management methods in food research. Digital Humanities: BiographyNED Project Leader: Guus Schreiber Will improve current version of the Biography Portal by incorporating analytical tools to show interconnections, trends, geographical maps and time lines.
  • 21. eScience & Big Data: providing leads for new food applications
  • 22. NLeSC eScience engineers: Scientists bridging research and advanced ICT Deliver sustainable solutions for data-driven research Work both at center and on site
  • 23. Collaborative Innovation Network Taming the Data Beast Together SMEs,etc NLeSC eScience Engineers: Work both at center and on site: •Exchange of eScience expertise •Re-use of proven eScience (technology hopping) •Career development & training
  • 24. Grand scientific challenges leads to innovative eScience & Big Data Research eSalsa NLeSC project: data-driven simulations & advanced visualization to understand Climate Change Prof. Henk Dijkstra, Univ. of Utrecht NLeSC Integrator Climate Dr. Jason Maassen eScience Engineer NLeSC •eScience to allow unprecedented level of detail (large scale distributed computing) •State-of-the-art visualization techniques to analyze hundreds of Terabytes of output •Re-use of proven eScience concepts in new areas (e.g. sector water)
  • 25. The number of data-driven start-ups is growing— particularly when it comes to social media. Taming the Big Data Beast
  • 26. Development of a high performance Twitter analysis platform Hadoop – MapReduce architecture @ a large SARA computer cluster Smart search & analysis software Prof. Antal van den Bosch Goal is to ask “Big Data” research questions e.g. NLeSC Integrator Humanities Radboud University Nijmegen • Ability to analyze microblogging data produced over years • Time dependant Dr. Erik Tjong Kim Sang eScience Engineer • Real time sentiment analysis NleSC • And so on…
  • 27. Cyber-common: a facility for 21st century data- driven research and multidisciplinary team work The key to scientific questions y! To link minds and eScience SURF-SARA-NLeSC
  • 28. Cyber-common: a facility for 21st century data- driven research and multidisciplinary team work The key to to scientific questions The key scientific questions y! yet unasked! To link minds and eScience SURF-SARA-NLeSC
  • 29. Joint NWO-NLeSC “data sciences” call • Focus on stimulating public-private partnerships • Three instruments: – Industrial Partnership Programme (IPP) – Technology Area’s (TA) – Knowledge Innovation Mapping SMEs (KIEM MKB) Rosemarie van der Veen-Oei (NLeSC) r.vanderveen@nwo.nl T 070 3440 851 Mark Kas (NWO) www.nlesc.nl m.kas@nwo.nl T 070 3440 811, M 06 205 93 207 Netherlands eScience Center
  • 30. Thank you www.esciencecenter.nl