Investigation of chromatin 3D structure • role of chromatin 3D structure in gene regulation • 4C to investigate detailed interactions of cis-regulatory modules (CRMs) • global chromatin interactome using HiC6 24.05.2012 Felix Klein
Investigation of chromatin 3D structure7 24.05.2012 Felix Klein
What was important for me? • bioinformatics group with members of diverse backgrounds • PI who successfully trained bioinformaticians • well established group in bioinformatics9 24.05.2012 Felix Klein
What might be interesting for you • turn data into biology • interaction with people from biology groups • communication skills !!! • workload divides mainly into: • programming (50 %) • reports, meetings, email10 24.05.2012 Felix Klein
AcknowledgementsWolfgang HuberSimon AndersJoseph BarryBernd FischerJulian GehringAleksandra PekowskaPaul Theodor PylAlejandro ReyesMaria SecrierCollaborators:Michael BoutrosChristian VolzEileen FurlongYad Ghavi Helm11 24.05.2012 Felix Klein
Data production ratesLHC: 1.8 GB / s at peak capacity (i.e. actively conducting aprimary aspect of the LHC’s four main experiments: ATLAS,ALICE, CMS, and LHCb).These experiments will take roughly a decade to complete, andeach of them is expected to produce over a 1 PB per year ofdata.One Illumina HiSeq: up to 600 Gb/run , i.e. ~600 GB/10 days =18 TB/year (not including derived data e.g. BAM)One Digital Embryo (2008): 3.5 TB (2048 x 2048 x 370 x 1226)EMBL-EBI: in 9/2011, data storage capacity was 14 PB
A particular slide catching your eye?
Clipping is a handy way to collect important slides you want to go back to later.