Bioinformatics Career Day
24 May 2012




Felix Klein
Background


    • physics diploma, University of Heidelberg



    • diploma thesis in radiation dosimetry
      at DKFZ


    • measurements at HIT




Why bioinformatics?


    • interdisciplinary

    • programmed in R

    • worked on data analysis




Progress in science is driven by technology




Chromatin loops




Investigation of chromatin 3D structure
    • role of chromatin 3D structure in gene regulation

    • 4C to investigate detailed interactions of
      cis-regulatory modules (CRMs)

    • global chromatin interactome using HiC




Investigation of chromatin 3D structure




Automated analysis of microscopy-based RNAi screens
Workflow stages: Imaging, Segmentation, Feature extraction, Classification, Summary

Figure panels: Source image, Calibrated image, Segmentation mask, Object features, Object labels, Phenotypic profile (classes: aft, apt, neg, int, pos)

Object features (one row per segmented object):

            g.x        g.y  g.s  g.p     g.pdm
 [1,]  123.1391   3.288660  194   67  9.241719
 [2,]  206.7460   9.442248  961  153 20.513190
 [3,]  502.9589   7.616438  219   60  8.286918
 [4,]   20.1919  22.358418 1568  157 22.219461
 [5,]  344.7959  45.501992 2259  233 35.158966
 [6,]  188.2611  50.451863 2711  249 28.732680
 [7,]  269.7996  46.404036 2131  180 26.419631
 [8,]  106.6127  58.364243 1348  143 21.662879
 [9,]  218.5582  77.299007 1913  215 25.724580
[10,]   19.1766  81.840147 1908  209 26.303760
[11,]    6.3558  62.017647  340   68 10.314127
[12,]   58.9873  86.034128 2139  214 27.463158
[13,]  245.1087  94.387405 1048  123 18.280901
[14,]  411.2741 109.198678 2572  225 28.660816
[15,]  167.8151 107.966014 1942  160 24.671533
[16,]  281.7084 121.609892 2871  209 31.577270
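The feature extraction step can be illustrated with a minimal sketch. It assumes that g.x and g.y are the object centre coordinates, g.s the object size in pixels, and g.p the perimeter; the slide does not define these columns. The function name `object_features`, the toy mask, and the boundary-pixel perimeter estimate are hypothetical choices for illustration, not the pipeline used in the talk.

```python
from collections import defaultdict


def object_features(mask):
    """Compute simple per-object features from a labelled segmentation mask.

    mask is a list of rows; 0 is background, positive integers are object labels.
    Returns {label: {"x", "y", "size", "perimeter"}}.
    """
    rows, cols = len(mask), len(mask[0])
    # Accumulators per label: sum of x, sum of y, pixel count, boundary pixel count.
    acc = defaultdict(lambda: [0, 0, 0, 0])
    for y, row in enumerate(mask):
        for x, label in enumerate(row):
            if label == 0:
                continue
            a = acc[label]
            a[0] += x
            a[1] += y
            a[2] += 1
            # A pixel is on the boundary if any 4-neighbour lies outside
            # the image or carries a different label.
            for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                nx, ny = x + dx, y + dy
                outside = not (0 <= nx < cols and 0 <= ny < rows)
                if outside or mask[ny][nx] != label:
                    a[3] += 1
                    break
    return {
        label: {"x": sx / n, "y": sy / n, "size": n, "perimeter": p}
        for label, (sx, sy, n, p) in sorted(acc.items())
    }


# Toy mask with two objects (labels 1 and 2); values are made up.
mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0, 2],
    [0, 1, 1, 1, 0, 2],
    [0, 0, 0, 0, 0, 0],
]
features = object_features(mask)
for label, f in features.items():
    print(label, f)
```

Each object becomes one row of numbers, as in the object features table; a classifier can then assign a label to each row.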


What was important for me?
    • bioinformatics group with
      members of diverse
      backgrounds

    • PI who successfully
      trained bioinformaticians

    • well-established group in
      bioinformatics




What might be interesting for you
     • turn data into biology

     • interaction with people from biology groups

     • communication skills !!!

     • workload divides mainly into:
        • programming (50 %)
        • reports, meetings, email




Acknowledgements
Wolfgang Huber
Simon Anders
Joseph Barry
Bernd Fischer
Julian Gehring
Aleksandra Pekowska
Paul Theodor Pyl
Alejandro Reyes
Maria Secrier

Collaborators:
Michael Boutros
Christian Volz

Eileen Furlong
Yad Ghavi Helm



Data production rates
LHC: 1.8 GB/s at peak capacity (i.e. while actively running a
primary part of the LHC's four main experiments: ATLAS,
ALICE, CMS, and LHCb).
These experiments will take roughly a decade to complete, and
each of them is expected to produce over 1 PB of data per
year.

One Illumina HiSeq: up to 600 Gb/run, i.e. ~600 GB per 10 days =
18 TB/year (not including derived data, e.g. BAM)
One Digital Embryo (2008): 3.5 TB (2048 x 2048 x 370 x 1226)
EMBL-EBI: in 9/2011, data storage capacity was 14 PB
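The arithmetic behind these figures can be checked with a short sketch. Several inputs are assumptions not stated on the slide: 2 bytes per voxel for the Digital Embryo, the reading of its four dimensions as x, y, z planes, and time points, and the number of operating days per year for the sequencer. The names `tb_per_year`, `embryo_tib`, and the others are hypothetical.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600


def tb_per_year(gb_per_run, days_per_run, operating_days=365):
    """Yearly output in TB (decimal) for an instrument producing gb_per_run GB per run."""
    runs = operating_days / days_per_run
    return runs * gb_per_run / 1000


# Illumina HiSeq: ~600 GB per 10-day run.
hiseq_continuous = tb_per_year(600, 10)                      # back-to-back runs all year
hiseq_30_runs = tb_per_year(600, 10, operating_days=300)     # 30 runs per year (assumed)

# Digital Embryo: 2048 x 2048 x 370 x 1226 voxels, assuming 2 bytes per voxel.
voxels = 2048 * 2048 * 370 * 1226
embryo_tb = voxels * 2 / 1e12      # decimal terabytes
embryo_tib = voxels * 2 / 2**40    # binary tebibytes

# LHC: 1.8 GB/s if sustained for a full year (an upper bound, not a forecast).
lhc_pb_at_peak = 1.8 * SECONDS_PER_YEAR / 1e6

print(f"HiSeq, continuous: {hiseq_continuous:.1f} TB/year")
print(f"HiSeq, 30 runs:    {hiseq_30_runs:.1f} TB/year")
print(f"Digital Embryo:    {embryo_tb:.2f} TB = {embryo_tib:.2f} TiB")
print(f"LHC at peak:       {lhc_pb_at_peak:.1f} PB/year")
```

Under these assumptions, the slide's 18 TB/year matches 30 runs per year rather than uninterrupted operation. The 3.5 TB figure matches 2 bytes per voxel when counted in binary units.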
