SlideShare a Scribd company logo
1 of 12
Download to read offline
EPFL/EDCB Ph.D. Candidate
Presentation
Jérémie KALFON,
ECE paris,
University of Kent
jkobject.com, linkedin.com/jkobject, github.com/jkobject, @jkobject
CaImAn: Calcium Imaging Analysis
1. A Giovannucci, J Friedrich, P Gunn, J Kalfon, et. al. “CaImAn: An open source tool for scalable Calcium Imaging data
Analysis”, eLife
CaImAn: Calcium Imaging Analysis
1. A Giovannucci, J Friedrich, P Gunn, J Kalfon, et. al. “CaImAn: An open source tool for scalable Calcium Imaging data
Analysis”, eLife
PyCUB: Hidden Patterns of the Codon Usage Bias
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
● GC content
● tRNA pool
● replication speed
● environment temperature
● nitrogen availability
● biased random mutations
PyCUB: Hidden Patterns of the Codon Usage Bias
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
● frequency measures
● deviation-to-reference
measures
● entropy measures
PyCUB: Methods
● 500 species from
ensembl, python
pipeline, scikit learn...
● Vector comparison
● Preprocessing (wide
range of measures ~20)
PyCUB: Methods
● Entropy → Force driving the CUB
● DBscan to cluster with outliers
● t-SNE & PCA to represent the data
● modelisation of the process
Results
❖ Specific distribution by
species groups
❖ Importance sequence’s
age
❖ Correlation to
sequence’s position.
❖ multiplicity of latent
factors
❖ Most Species have
specific CUBs
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
Results
❖ Consistent results
❖ A python package to
analyse the CUB across
species
❖ A new measure of the
CUB with a fast
computation time.
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
Conclusion
❖ Not one determinant of the
CUB
❖ The entropy measure is a
suitable one
❖ There is specific distribution
across genes (SLS).
Future research and ideas:
Using more big data specific
approach to analyze other/richer
kingdoms.
Remarks:
➔ The data was displaying a lot of
improbable sequences,
homologies, etc…
➔ t-SNE allowed to see clearly
driving mechanisms
1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent.
2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
The things I loved
● Machine Learning / Data Science
● genomics / multi-omics & visual
data
● understand and model how cells
work.
● translational applications in
biomedicine
● working with teams, freedom to
explore and create
Computer Science + Biology = <3
1. see: statement of research objectives (jkobject.com)
2. VCF2ancestry, github/jkobject
Thank you!
goals > topics
🎉 reproducible
research

More Related Content

Similar to Epfl edcb ph.d. candidate presentation

J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...Jan Aerts
 
JulieKlein_Bosc2012
JulieKlein_Bosc2012JulieKlein_Bosc2012
JulieKlein_Bosc2012KUPKB_Team
 
ICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad ReportICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad ReportLeighton Pritchard
 
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
The Emerging Global Collaboratory for Microbial Metagenomics ResearchersThe Emerging Global Collaboratory for Microbial Metagenomics Researchers
The Emerging Global Collaboratory for Microbial Metagenomics ResearchersLarry Smarr
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Larry Smarr
 
Biophysics 2016_Kayla Washenberger
Biophysics 2016_Kayla WashenbergerBiophysics 2016_Kayla Washenberger
Biophysics 2016_Kayla WashenbergerKayla Washenberger
 
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...GigaScience, BGI Hong Kong
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersLarry Smarr
 
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton SeedHail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton SeedSpark Summit
 
Statistics for K-mer Based Splicing Analysis
Statistics for K-mer Based Splicing AnalysisStatistics for K-mer Based Splicing Analysis
Statistics for K-mer Based Splicing AnalysisRuofei Du
 

Similar to Epfl edcb ph.d. candidate presentation (12)

J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
J Klein - KUPKB: sharing, connecting and exposing kidney and urinary knowledg...
 
JulieKlein_Bosc2012
JulieKlein_Bosc2012JulieKlein_Bosc2012
JulieKlein_Bosc2012
 
ICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad ReportICSB 2013 - Visits Abroad Report
ICSB 2013 - Visits Abroad Report
 
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
The Emerging Global Collaboratory for Microbial Metagenomics ResearchersThe Emerging Global Collaboratory for Microbial Metagenomics Researchers
The Emerging Global Collaboratory for Microbial Metagenomics Researchers
 
Curriculum Vitae
Curriculum VitaeCurriculum Vitae
Curriculum Vitae
 
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology ...
 
DPHEP_BLUETWO_001
DPHEP_BLUETWO_001DPHEP_BLUETWO_001
DPHEP_BLUETWO_001
 
Biophysics 2016_Kayla Washenberger
Biophysics 2016_Kayla WashenbergerBiophysics 2016_Kayla Washenberger
Biophysics 2016_Kayla Washenberger
 
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...
Jack Gilbert: Welcome to the 1st International EMP Meeting: the first 10,000 ...
 
The Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics ResearchersThe Emerging Global Community of Microbial Metagenomics Researchers
The Emerging Global Community of Microbial Metagenomics Researchers
 
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton SeedHail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
Hail: SCALING GENETIC DATA ANALYSIS WITH APACHE SPARK: Keynote by Cotton Seed
 
Statistics for K-mer Based Splicing Analysis
Statistics for K-mer Based Splicing AnalysisStatistics for K-mer Based Splicing Analysis
Statistics for K-mer Based Splicing Analysis
 

Recently uploaded

GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry Areesha Ahmad
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptxSilpa
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...Monika Rani
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Silpa
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsSérgio Sacani
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxSilpa
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Silpa
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIADr. TATHAGAT KHOBRAGADE
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.Silpa
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsbassianu17
 

Recently uploaded (20)

GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 

Epfl edcb ph.d. candidate presentation

  • 1. EPFL/EDCB Ph.D. Candidate Presentation Jérémie KALFON, ECE paris, University of Kent jkobject.com, linkedin.com/jkobject, github.com/jkobject, @jkobject
  • 2. CaImAn: Calcium Imaging Analysis 1. A Giovannucci, J Friedrich, P Gunn, J Kalfon, et. al. “CaImAn: An open source tool for scalable Calcium Imaging data Analysis”, eLife
  • 3. CaImAn: Calcium Imaging Analysis 1. A Giovannucci, J Friedrich, P Gunn, J Kalfon, et. al. “CaImAn: An open source tool for scalable Calcium Imaging data Analysis”, eLife
  • 4. PyCUB: Hidden Patterns of the Codon Usage Bias 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review ● GC content ● tRNA pool ● replication speed ● environment temperature ● nitrogen availability ● biased random mutations
  • 5. PyCUB: Hidden Patterns of the Codon Usage Bias 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review ● frequency measures ● deviation-to-reference measures ● entropy measures
  • 6. PyCUB: Methods ● 500 species from ensembl, python pipeline, scikit learn... ● Vector comparison ● Preprocessing (wide range of measures ~20)
  • 7. PyCUB: Methods ● Entropy → Force driving the CUB ● DBscan to cluster with outliers ● t-SNE & PCA to represent the data ● modelisation of the process
  • 8. Results ❖ Specific distribution by species groups ❖ Importance sequence’s age ❖ Correlation to sequence’s position. ❖ multiplicity of latent factors ❖ Most Species have specific CUBs 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
  • 9. Results ❖ Consistent results ❖ A python package to analyse the CUB across species ❖ A new measure of the CUB with a fast computation time. 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
  • 10. Conclusion ❖ Not one determinant of the CUB ❖ The entropy measure is a suitable one ❖ There is specific distribution across genes (SLS). Future research and ideas: Using more big data specific approach to analyze other/richer kingdoms. Remarks: ➔ The data was displaying a lot of improbable sequences, homologies, etc… ➔ t-SNE allowed to see clearly driving mechanisms 1. J Kalfon, “PyCUB: A machine exploration of the Codon Usage Bias”, University of Kent. 2. Y Deng, J Kalfon, et. al., “Hidden pattersn of the Codon Usage Bias”, Nature Communication, in review
  • 11. The things I loved ● Machine Learning / Data Science ● genomics / multi-omics & visual data ● understand and model how cells work. ● translational applications in biomedicine ● working with teams, freedom to explore and create
  • 12. Computer Science + Biology = <3 1. see: statement of research objectives (jkobject.com) 2. VCF2ancestry, github/jkobject Thank you! goals > topics 🎉 reproducible research