SlideShare a Scribd company logo
Unveiling Epigenetic Regulation with
Next Generation Sequencing (NGS) and Python
Ying Liu
Weill Cornell Medical College
The New York Python Meetup, 05-29-2014
About Me
• PhD candidate, Weill Cornell Medical College
• Major Area: Stem cell epigenomics, Computational Biology
• Graduation: Fall 2014
• Dream job: data scientist in biomedical informatics
• Email: yingliuyl@gmail.com
• LinkedIn: https://www.linkedin.com/pub/ying-liu/b/669/605
Reprogram > 20 days
(Thousands of genes change expression)
Induced pluripotent stem
(iPS) cells
Adult cells
Express pluripotent stem cell
specific genes (4 genes)
2012 Nobel Prize in
Physiology or Medicine
Generate Pluripotent Stem Cells
from Mature Adult Cells
Limitation
• Reprogram efficiency: 0.01 - 0.1%
• Molecular mechanism is not fully understood
?
Human Genome Project
• Human genome: ~ 3 billion DNA base pairs
• Complete sequence: 2003
First sequence draft: 2001
Nature 454, 711-715
Gene Expression
My project: Histone X
Enriches at expressing genes
Epigenetic Regulation
• Epigenetics: study of heritable changes
in gene activity that are NOT caused by
changes in the DNA sequence
• One of the major epigenetic regulators:
Histone protein
Histone
proteins
DNA
Induced pluripotent stem
(iPS) cells
Adult cells
2012 Nobel Prize in
Physiology or Medicine
Project
Detect histone X function in initiating
adult cells reprogramming to iPS cells.
Experiment
• Collect cells at the beginning (Day 0, 3, 6,
10) and after reprogramming (iPS);
• Map genome-wide histone X localization
with Next Generation Sequencing (NGS);
• Analyze the dynamic change of genome-
wide histone X localization with Python
program and framework.
Reprogram > 20 days
(Thousands of genes change expression)
Express pluripotent stem cell
specific genes (4 genes)
Generate Pluripotent Stem Cells
from Mature Adult Cells
Next Generation
DNA Sequencing
(Illumina, Inc)
Genome-wide Analysis of Epigenetic Regulation
Computation analysis (by genome)
Tools: Python, R, etc.
Align DNA sequence to chromosome
Display in genome browser (by gene)
chromosome
10 kb
Day 0
Day 3
Day 6
Day 10
Day 0
Day 3
Day 6
Day 10
Histone
X
K27me3
Pou5f1 Nanog
Histone X Enriches Near Stem Cell Specific Genes
At the Beginning of Cell Reprogramming
Genome browser (IGV)
Alignment output (BED format)
chr1 3000062 3000113 HWI-1KL117_0134:6:2101:14893:19331#ACAGTG/A..GTG. 37 +
chr1 3000113 3000164 HWI-1KL117_0134:6:2302:6790:10626#ACAGTG/A..GT.. 37 +
chr1 3000146 3000197 HWI-1KL117_0134:6:2303:8145:108924#ACAGTG/A..GT.. 37 -
chr1 3000154 3000205 HWI-1KL117_0134:6:2202:14995:109690#ACAGTG/A..GT.. 37 -
chr1 3000241 3000292 HWI-1KL117_0134:6:1304:12589:77263#ACAGTG/A..GT.. 25 -
chr1 3000311 3000362 HWI-1KL117_0134:6:1101:17212:111473#ACAGTG/A..GT.. 37 -
chr1 3000334 3000385 HWI-1KL117_0134:6:2308:10385:78074#ACAGTG/A..GT.. 25 -
chr1 3000385 3000436 HWI-1KL117_0134:6:2102:20734:102615#ACAGTG/A..GG.. 37 +
chr1 3000498 3000549 HWI-1KL117_0134:6:1203:3146:72739#ACAGTG/A..GTG. 37 -
chr1 3000538 3000589 HWI-1KL117_0134:6:1101:1921:57017#ACAGTG/A..GT.. 37 +
Chrom Start End Strand
Computational Pipeline for Genome-wide DNA
Sequence Analysis
Bardet AF, Stark A, Nature Protocols, 2012
Alignment Analysis (Python, Perl)
• BWA
• Picard
• Samtools
• MACS, Cistrome (X. Shirley Liu Lab)
• ChIPseeqer (Olivier Elemento Lab)
Peak Identification with Python Program:
Model-based Analysis of ChIP-Seq (MACS)
Zhang Y, Liu XS, et al. Genome Biology 2008
Feng J, Liu XS, et al. Nature Protocol 2012
(1)
(2)
Requirement: ~3 GB of RAM, 1.5 h per data set with
30 million DNA sequence reads.
d: estimated DNA fragment size
5’
3’
3’
5’
d
• Read distribution: Poisson distribution
• Use dynamic λlocal to capture local biases in the genome
λlocal = max (λBG, [λregion, λ1k], λ5k, λ10k)
λBG: constant estimated from the genome background
λregion: estimated from the candidate region
λ1k, λ5k, λ10k: estimated from 1kb, 5kb, 10kb local window in the control
• p-value: default threshold is 10-5
(3)
(4)
Galaxy / Cistrome
MACS integrated web-based application: http://cistrome.org/ap/
ChIPseeqer
• Graphical User Interface
• Command-line
http://physiology.med.cornell.edu/faculty/elemento/lab/chipseq.shtml
10 kb
Day 0
Day 3
Day 6
Day 10
Day 0
Day 3
Day 6
Day 10
Histone
X
K27me3
Pou5f1 Nanog
Histone X Enriches Near Stem Cell Specific Genes
At the Beginning of Cell Reprogramming
Day: 0 3 6 10 iPS E 0 3 6 10 iPS E
L H L H
Expression Histone X
ExpressionChangeExpressionStable
Pou5f1
Sox2
Cdh1
Cldn3
Jag2
Zbtb32
Elf3
Msh6
Lefty1
Piwil2
Notch4
Tjp3
Fbxo15
Cldn6
Foxh1
Zp3
Fgf15
Nodal
Tdgf1
Gdf3
Nanog
Fgf4
Dppa3
Histone X Enriches At Stem Cell Specific Gene Promoters
Prior to Gene Expression Activation
Embryonic placenta development
Stem cell maintenance
Response to nutrient
Cell-cell signaling
DNA metabolic process
DNA recombination
Formation of primary germ layer
Chromosome organization
Mesoderm development
Cell fate commitment
Stem cell differentiation
Blastocyst formation
Meiosis
Sexual reproduction
Thyroid hormone metabolic process
Cellular response to abiotic stimulus
X0
X1
X2
X3
X4
X5
X6
Embryonic placenta development, GO:0001892
Stem cell maintenance, GO:0019827
Response to nutrient, GO:0007584
Cell-cell signaling, GO:0007267
DNA metabolic process, GO:0006259
DNA recombination, GO:0006310
Formation of primary germ layer, GO:0001704
Chromosome organization, GO:0051276
Mesoderm development, GO:0007498
Cell fate commitment, GO:0045165
Stem cell differentiation, GO:0048863
Blastocyst formation, GO:0001825
Meiosis, GO:0007126
Sexual reproduction, GO:0019953
Thyroid hormone metabolic process, GO:0042403
Cellular response to abiotic stimulus, GO:0071214
5
Enrichment
-5
Depletion
X0
X1
X2
X3
X4
X5
X6
Embryonic p
Stem cell m
Response to
Cell-cell s
DNA metabol
DNA recombi
Formation o
Chromosome
Mesoderm de
Cell fate c
Stem cell d
Blastocyst
Meiosis, GO
Sexual repr
Thyroid hor
Cellular re
5
Enrichment
-5
Depletion
Expression Active Stable
Group - a b c a b c
Gene Ontology Analysis
a. Histone X enrich during Day 0 – 10
b. Histone X enrich in iPS (after Day 10)
c. Histone X not Enrich
Induced pluripotent stem
(iPS) cells
Adult cells
Limitation
• Reprogram efficiency: 0.01 - 0.1%
• Molecular mechanism is not fully understood
Our Genome-wide analysis suggests:
Histone X participates in stem cell gene activation
at the early stage of adult cell reprogram.
Express pluripotent stem cell
specific genes (4 genes)
Reprogram > 20 days
(Thousands of genes change expression)
Generate Pluripotent Stem Cells
from Mature Adult Cells

More Related Content

What's hot

Knock out mice production and its applications
Knock out mice production and its applicationsKnock out mice production and its applications
Knock out mice production and its applications
tara singh rawat
 
Knock out mice
Knock out miceKnock out mice
Knock out mice
Dr. Khushboo Bhojwani
 
Production of transgenic farm animals
Production of transgenic farm  animalsProduction of transgenic farm  animals
Production of transgenic farm animals
Mohsen Ali
 
Andrea Crisanti-Enfermedades transmitidas por vectores
Andrea Crisanti-Enfermedades transmitidas por vectoresAndrea Crisanti-Enfermedades transmitidas por vectores
Andrea Crisanti-Enfermedades transmitidas por vectores
Fundación Ramón Areces
 
Knockout mice
Knockout miceKnockout mice
Knockout mice
Lovnish Thakur
 
Transgenic animals new
Transgenic animals newTransgenic animals new
Transgenic animals new
Karthi Kumar
 
TRANSGENIC MICE
TRANSGENIC MICETRANSGENIC MICE
TRANSGENIC MICE
Mufassira Rahman
 
GENE KNOCKOUT
GENE KNOCKOUTGENE KNOCKOUT
GENE KNOCKOUT
RANA SAHA
 
Characteristics of transgenic Fish
Characteristics of transgenic FishCharacteristics of transgenic Fish
Characteristics of transgenic Fish
Sadya Laraib
 
Transgenic animals mice for students
Transgenic animals mice  for studentsTransgenic animals mice  for students
Transgenic animals mice for students
thirupathiSathya
 
Transgenic Animal (KNOCKOUT MOUSE) production and Application
Transgenic Animal (KNOCKOUT MOUSE) production and Application Transgenic Animal (KNOCKOUT MOUSE) production and Application
Transgenic Animal (KNOCKOUT MOUSE) production and Application
Nethravathi Siri
 
Emerging Clinical Applications of CRISPR-Cas9 as Promising Strategies in Gene...
Emerging Clinical Applications of CRISPR-Cas9 as Promising Strategies in Gene...Emerging Clinical Applications of CRISPR-Cas9 as Promising Strategies in Gene...
Emerging Clinical Applications of CRISPR-Cas9 as Promising Strategies in Gene...Chi-Ping Day
 
Ociit conference 2010 final
Ociit conference 2010 finalOciit conference 2010 final
Ociit conference 2010 finalbek1826
 
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Golden Helix Inc
 
TRANSGENIC TECHNIQUES AND GENE THERAPY
TRANSGENIC TECHNIQUES AND GENE THERAPYTRANSGENIC TECHNIQUES AND GENE THERAPY
TRANSGENIC TECHNIQUES AND GENE THERAPY
Aphurvikaa Manivannan
 
Transgenesis in animals
Transgenesis in animalsTransgenesis in animals
Transgenesis in animals
Dr Dhavalkumar F. Chaudhary
 
Transgenic and cloned organisms
Transgenic and cloned organismsTransgenic and cloned organisms
Transgenic and cloned organismsLegae Kebakuile
 

What's hot (20)

Genetic engineering in animal cells
Genetic engineering in animal cellsGenetic engineering in animal cells
Genetic engineering in animal cells
 
Knock out mice production and its applications
Knock out mice production and its applicationsKnock out mice production and its applications
Knock out mice production and its applications
 
Knock out mice
Knock out miceKnock out mice
Knock out mice
 
Production of transgenic farm animals
Production of transgenic farm  animalsProduction of transgenic farm  animals
Production of transgenic farm animals
 
Andrea Crisanti-Enfermedades transmitidas por vectores
Andrea Crisanti-Enfermedades transmitidas por vectoresAndrea Crisanti-Enfermedades transmitidas por vectores
Andrea Crisanti-Enfermedades transmitidas por vectores
 
Knockout mice
Knockout miceKnockout mice
Knockout mice
 
Transgenic animals new
Transgenic animals newTransgenic animals new
Transgenic animals new
 
Presentation gmo
Presentation gmoPresentation gmo
Presentation gmo
 
TRANSGENIC MICE
TRANSGENIC MICETRANSGENIC MICE
TRANSGENIC MICE
 
GENE KNOCKOUT
GENE KNOCKOUTGENE KNOCKOUT
GENE KNOCKOUT
 
Chromosome manipulations
Chromosome manipulationsChromosome manipulations
Chromosome manipulations
 
Characteristics of transgenic Fish
Characteristics of transgenic FishCharacteristics of transgenic Fish
Characteristics of transgenic Fish
 
Transgenic animals mice for students
Transgenic animals mice  for studentsTransgenic animals mice  for students
Transgenic animals mice for students
 
Transgenic Animal (KNOCKOUT MOUSE) production and Application
Transgenic Animal (KNOCKOUT MOUSE) production and Application Transgenic Animal (KNOCKOUT MOUSE) production and Application
Transgenic Animal (KNOCKOUT MOUSE) production and Application
 
Emerging Clinical Applications of CRISPR-Cas9 as Promising Strategies in Gene...
Emerging Clinical Applications of CRISPR-Cas9 as Promising Strategies in Gene...Emerging Clinical Applications of CRISPR-Cas9 as Promising Strategies in Gene...
Emerging Clinical Applications of CRISPR-Cas9 as Promising Strategies in Gene...
 
Ociit conference 2010 final
Ociit conference 2010 finalOciit conference 2010 final
Ociit conference 2010 final
 
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
 
TRANSGENIC TECHNIQUES AND GENE THERAPY
TRANSGENIC TECHNIQUES AND GENE THERAPYTRANSGENIC TECHNIQUES AND GENE THERAPY
TRANSGENIC TECHNIQUES AND GENE THERAPY
 
Transgenesis in animals
Transgenesis in animalsTransgenesis in animals
Transgenesis in animals
 
Transgenic and cloned organisms
Transgenic and cloned organismsTransgenic and cloned organisms
Transgenic and cloned organisms
 

Similar to Python Meetup2014 (Ying Liu)

Python meetup 2014
Python meetup 2014Python meetup 2014
Python meetup 2014
eilosei
 
Animal Epigenetics
Animal Epigenetics Animal Epigenetics
Animal Epigenetics
Jingyue (Ellie) Duan
 
Open biomedical knowledge using crowdsourcing and citizen science
Open biomedical knowledge using crowdsourcing and citizen scienceOpen biomedical knowledge using crowdsourcing and citizen science
Open biomedical knowledge using crowdsourcing and citizen science
Andrew Su
 
Zfn Compozr Mammalian Cle
Zfn Compozr Mammalian CleZfn Compozr Mammalian Cle
Zfn Compozr Mammalian Cle
blehr
 
Osp2 brennan SGC
Osp2 brennan SGCOsp2 brennan SGC
Osp2 brennan SGC
Alina Grenier-Arellano
 
4.6 Biotechnology And Genetic Engineering
4.6 Biotechnology And Genetic Engineering4.6 Biotechnology And Genetic Engineering
4.6 Biotechnology And Genetic EngineeringPatricia Lopez
 
4.4 Biotechnology And Genetic Engineering
4.4 Biotechnology And Genetic Engineering4.4 Biotechnology And Genetic Engineering
4.4 Biotechnology And Genetic EngineeringPatricia Lopez
 
Osteoblasts remotely supply lung tumors with cancer-promoting SiglecFhigh neu...
Osteoblasts remotely supply lung tumors with cancer-promoting SiglecFhigh neu...Osteoblasts remotely supply lung tumors with cancer-promoting SiglecFhigh neu...
Osteoblasts remotely supply lung tumors with cancer-promoting SiglecFhigh neu...
Gul Muneer
 
35_cancer2.ppt
35_cancer2.ppt35_cancer2.ppt
35_cancer2.ppt
KhalidBassiouny1
 
Understanding the mechanism of Cancer genetics
Understanding the mechanism of Cancer geneticsUnderstanding the mechanism of Cancer genetics
Understanding the mechanism of Cancer genetics
MOPHCHOLAVANAHALLY
 
article
articlearticle
article
dalya shakir
 
cloning and sub-cloning
cloning and sub-cloningcloning and sub-cloning
cloning and sub-cloning
Sandhya Talla
 
2015 07 09__epigenetic_profiling_environmental_health_sciences_v42
2015 07 09__epigenetic_profiling_environmental_health_sciences_v422015 07 09__epigenetic_profiling_environmental_health_sciences_v42
2015 07 09__epigenetic_profiling_environmental_health_sciences_v42
Prof. Wim Van Criekinge
 
Next Generation Sequencing and its Applications in Medical Research - Frances...
Next Generation Sequencing and its Applications in Medical Research - Frances...Next Generation Sequencing and its Applications in Medical Research - Frances...
Next Generation Sequencing and its Applications in Medical Research - Frances...
Sri Ambati
 
Wei Li Di-Enfermedades raras de la piel
Wei Li Di-Enfermedades raras de la pielWei Li Di-Enfermedades raras de la piel
Wei Li Di-Enfermedades raras de la piel
Fundación Ramón Areces
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdf
sabyabby
 
Oncogene
OncogeneOncogene
Oncogene
AshfaqAziz2
 
Hupo2017 wessels Bruker Lunch Seminar
Hupo2017 wessels Bruker Lunch SeminarHupo2017 wessels Bruker Lunch Seminar
Hupo2017 wessels Bruker Lunch Seminar
Hans Wessels
 
Discovering my research interest by eric garson sheffield university 2017
Discovering my research interest by eric garson   sheffield university 2017Discovering my research interest by eric garson   sheffield university 2017
Discovering my research interest by eric garson sheffield university 2017
Eric Garson
 

Similar to Python Meetup2014 (Ying Liu) (20)

Python meetup 2014
Python meetup 2014Python meetup 2014
Python meetup 2014
 
Animal Epigenetics
Animal Epigenetics Animal Epigenetics
Animal Epigenetics
 
Open biomedical knowledge using crowdsourcing and citizen science
Open biomedical knowledge using crowdsourcing and citizen scienceOpen biomedical knowledge using crowdsourcing and citizen science
Open biomedical knowledge using crowdsourcing and citizen science
 
Zfn Compozr Mammalian Cle
Zfn Compozr Mammalian CleZfn Compozr Mammalian Cle
Zfn Compozr Mammalian Cle
 
Osp2 brennan SGC
Osp2 brennan SGCOsp2 brennan SGC
Osp2 brennan SGC
 
4.6 Biotechnology And Genetic Engineering
4.6 Biotechnology And Genetic Engineering4.6 Biotechnology And Genetic Engineering
4.6 Biotechnology And Genetic Engineering
 
4.4 Biotechnology And Genetic Engineering
4.4 Biotechnology And Genetic Engineering4.4 Biotechnology And Genetic Engineering
4.4 Biotechnology And Genetic Engineering
 
Osteoblasts remotely supply lung tumors with cancer-promoting SiglecFhigh neu...
Osteoblasts remotely supply lung tumors with cancer-promoting SiglecFhigh neu...Osteoblasts remotely supply lung tumors with cancer-promoting SiglecFhigh neu...
Osteoblasts remotely supply lung tumors with cancer-promoting SiglecFhigh neu...
 
35_cancer2.ppt
35_cancer2.ppt35_cancer2.ppt
35_cancer2.ppt
 
Understanding the mechanism of Cancer genetics
Understanding the mechanism of Cancer geneticsUnderstanding the mechanism of Cancer genetics
Understanding the mechanism of Cancer genetics
 
article
articlearticle
article
 
cloning and sub-cloning
cloning and sub-cloningcloning and sub-cloning
cloning and sub-cloning
 
2015 07 09__epigenetic_profiling_environmental_health_sciences_v42
2015 07 09__epigenetic_profiling_environmental_health_sciences_v422015 07 09__epigenetic_profiling_environmental_health_sciences_v42
2015 07 09__epigenetic_profiling_environmental_health_sciences_v42
 
Next Generation Sequencing and its Applications in Medical Research - Frances...
Next Generation Sequencing and its Applications in Medical Research - Frances...Next Generation Sequencing and its Applications in Medical Research - Frances...
Next Generation Sequencing and its Applications in Medical Research - Frances...
 
Wei Li Di-Enfermedades raras de la piel
Wei Li Di-Enfermedades raras de la pielWei Li Di-Enfermedades raras de la piel
Wei Li Di-Enfermedades raras de la piel
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdf
 
Fenton opac2013
Fenton opac2013Fenton opac2013
Fenton opac2013
 
Oncogene
OncogeneOncogene
Oncogene
 
Hupo2017 wessels Bruker Lunch Seminar
Hupo2017 wessels Bruker Lunch SeminarHupo2017 wessels Bruker Lunch Seminar
Hupo2017 wessels Bruker Lunch Seminar
 
Discovering my research interest by eric garson sheffield university 2017
Discovering my research interest by eric garson   sheffield university 2017Discovering my research interest by eric garson   sheffield university 2017
Discovering my research interest by eric garson sheffield university 2017
 

Recently uploaded

Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
sanjana502982
 
Anemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptxAnemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptx
muralinath2
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
RASHMI M G
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
Wasswaderrick3
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 

Recently uploaded (20)

Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Toxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and ArsenicToxic effects of heavy metals : Lead and Arsenic
Toxic effects of heavy metals : Lead and Arsenic
 
Anemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptxAnemia_ types_clinical significance.pptx
Anemia_ types_clinical significance.pptx
 
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptx
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
DERIVATION OF MODIFIED BERNOULLI EQUATION WITH VISCOUS EFFECTS AND TERMINAL V...
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 

Python Meetup2014 (Ying Liu)

  • 1. Unveiling Epigenetic Regulation with Next Generation Sequencing (NGS) and Python Ying Liu Weill Cornell Medical College The New York Python Meetup, 05-29-2014
  • 2. About Me • PhD candidate, Weill Cornell Medical College • Major Area: Stem cell epigenomics, Computational Biology • Graduation: Fall 2014 • Dream job: data scientist in biomedical informatics • Email: yingliuyl@gmail.com • LinkedIn: https://www.linkedin.com/pub/ying-liu/b/669/605
  • 3. Reprogram > 20 days (Thousands of genes change expression) Induced pluripotent stem (iPS) cells Adult cells Express pluripotent stem cell specific genes (4 genes) 2012 Nobel Prize in Physiology or Medicine Generate Pluripotent Stem Cells from Mature Adult Cells Limitation • Reprogram efficiency: 0.01 - 0.1% • Molecular mechanism is not fully understood ?
  • 4. Human Genome Project • Human genome: ~ 3 billion DNA base pairs • Complete sequence: 2003 First sequence draft: 2001
  • 5. Nature 454, 711-715 Gene Expression My project: Histone X Enriches at expressing genes Epigenetic Regulation • Epigenetics: study of heritable changes in gene activity that are NOT caused by changes in the DNA sequence • One of the major epigenetic regulators: Histone protein Histone proteins DNA
  • 6. Induced pluripotent stem (iPS) cells Adult cells 2012 Nobel Prize in Physiology or Medicine Project Detect histone X function in initiating adult cells reprogramming to iPS cells. Experiment • Collect cells at the beginning (Day 0, 3, 6, 10) and after reprogramming (iPS); • Map genome-wide histone X localization with Next Generation Sequencing (NGS); • Analyze the dynamic change of genome- wide histone X localization with Python program and framework. Reprogram > 20 days (Thousands of genes change expression) Express pluripotent stem cell specific genes (4 genes) Generate Pluripotent Stem Cells from Mature Adult Cells
  • 7. Next Generation DNA Sequencing (Illumina, Inc) Genome-wide Analysis of Epigenetic Regulation Computation analysis (by genome) Tools: Python, R, etc. Align DNA sequence to chromosome Display in genome browser (by gene) chromosome
  • 8. 10 kb Day 0 Day 3 Day 6 Day 10 Day 0 Day 3 Day 6 Day 10 Histone X K27me3 Pou5f1 Nanog Histone X Enriches Near Stem Cell Specific Genes At the Beginning of Cell Reprogramming Genome browser (IGV) Alignment output (BED format) chr1 3000062 3000113 HWI-1KL117_0134:6:2101:14893:19331#ACAGTG/A..GTG. 37 + chr1 3000113 3000164 HWI-1KL117_0134:6:2302:6790:10626#ACAGTG/A..GT.. 37 + chr1 3000146 3000197 HWI-1KL117_0134:6:2303:8145:108924#ACAGTG/A..GT.. 37 - chr1 3000154 3000205 HWI-1KL117_0134:6:2202:14995:109690#ACAGTG/A..GT.. 37 - chr1 3000241 3000292 HWI-1KL117_0134:6:1304:12589:77263#ACAGTG/A..GT.. 25 - chr1 3000311 3000362 HWI-1KL117_0134:6:1101:17212:111473#ACAGTG/A..GT.. 37 - chr1 3000334 3000385 HWI-1KL117_0134:6:2308:10385:78074#ACAGTG/A..GT.. 25 - chr1 3000385 3000436 HWI-1KL117_0134:6:2102:20734:102615#ACAGTG/A..GG.. 37 + chr1 3000498 3000549 HWI-1KL117_0134:6:1203:3146:72739#ACAGTG/A..GTG. 37 - chr1 3000538 3000589 HWI-1KL117_0134:6:1101:1921:57017#ACAGTG/A..GT.. 37 + Chrom Start End Strand
  • 9. Computational Pipeline for Genome-wide DNA Sequence Analysis Bardet AF, Stark A, Nature Protocols, 2012 Alignment Analysis (Python, Perl) • BWA • Picard • Samtools • MACS, Cistrome (X. Shirley Liu Lab) • ChIPseeqer (Olivier Elemento Lab)
  • 10. Peak Identification with Python Program: Model-based Analysis of ChIP-Seq (MACS) Zhang Y, Liu XS, et al. Genome Biology 2008 Feng J, Liu XS, et al. Nature Protocol 2012 (1) (2) Requirement: ~3 GB of RAM, 1.5 h per data set with 30 million DNA sequence reads. d: estimated DNA fragment size 5’ 3’ 3’ 5’ d • Read distribution: Poisson distribution • Use dynamic λlocal to capture local biases in the genome λlocal = max (λBG, [λregion, λ1k], λ5k, λ10k) λBG: constant estimated from the genome background λregion: estimated from the candidate region λ1k, λ5k, λ10k: estimated from 1kb, 5kb, 10kb local window in the control • p-value: default threshold is 10-5 (3) (4)
  • 11. Galaxy / Cistrome MACS integrated web-based application: http://cistrome.org/ap/
  • 12. ChIPseeqer • Graphical User Interface • Command-line http://physiology.med.cornell.edu/faculty/elemento/lab/chipseq.shtml
  • 13. 10 kb Day 0 Day 3 Day 6 Day 10 Day 0 Day 3 Day 6 Day 10 Histone X K27me3 Pou5f1 Nanog Histone X Enriches Near Stem Cell Specific Genes At the Beginning of Cell Reprogramming
  • 14. Day: 0 3 6 10 iPS E 0 3 6 10 iPS E L H L H Expression Histone X ExpressionChangeExpressionStable Pou5f1 Sox2 Cdh1 Cldn3 Jag2 Zbtb32 Elf3 Msh6 Lefty1 Piwil2 Notch4 Tjp3 Fbxo15 Cldn6 Foxh1 Zp3 Fgf15 Nodal Tdgf1 Gdf3 Nanog Fgf4 Dppa3 Histone X Enriches At Stem Cell Specific Gene Promoters Prior to Gene Expression Activation Embryonic placenta development Stem cell maintenance Response to nutrient Cell-cell signaling DNA metabolic process DNA recombination Formation of primary germ layer Chromosome organization Mesoderm development Cell fate commitment Stem cell differentiation Blastocyst formation Meiosis Sexual reproduction Thyroid hormone metabolic process Cellular response to abiotic stimulus X0 X1 X2 X3 X4 X5 X6 Embryonic placenta development, GO:0001892 Stem cell maintenance, GO:0019827 Response to nutrient, GO:0007584 Cell-cell signaling, GO:0007267 DNA metabolic process, GO:0006259 DNA recombination, GO:0006310 Formation of primary germ layer, GO:0001704 Chromosome organization, GO:0051276 Mesoderm development, GO:0007498 Cell fate commitment, GO:0045165 Stem cell differentiation, GO:0048863 Blastocyst formation, GO:0001825 Meiosis, GO:0007126 Sexual reproduction, GO:0019953 Thyroid hormone metabolic process, GO:0042403 Cellular response to abiotic stimulus, GO:0071214 5 Enrichment -5 Depletion X0 X1 X2 X3 X4 X5 X6 Embryonic p Stem cell m Response to Cell-cell s DNA metabol DNA recombi Formation o Chromosome Mesoderm de Cell fate c Stem cell d Blastocyst Meiosis, GO Sexual repr Thyroid hor Cellular re 5 Enrichment -5 Depletion Expression Active Stable Group - a b c a b c Gene Ontology Analysis a. Histone X enrich during Day 0 – 10 b. Histone X enrich in iPS (after Day 10) c. Histone X not Enrich
  • 15. Induced pluripotent stem (iPS) cells Adult cells Limitation • Reprogram efficiency: 0.01 - 0.1% • Molecular mechanism is not fully understood Our Genome-wide analysis suggests: Histone X participates in stem cell gene activation at the early stage of adult cell reprogram. Express pluripotent stem cell specific genes (4 genes) Reprogram > 20 days (Thousands of genes change expression) Generate Pluripotent Stem Cells from Mature Adult Cells