SlideShare a Scribd company logo
1/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
Genome Sequences as Media Files
Towards effective, efficient, and functional
compression of genomic data
2/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Video
Coding
Adap-
tation
Analysis
Experience
Satellite
DVB-
S2
VSAT
Ground
Station
CT
Scans
Com-
pression
Trans-
mission
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
3/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
4/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Compression Triangle
Efficiency
FunctionalityEffectiveness
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
5/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Compression Triangle
Efficiency
FunctionalityEffectiveness
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
6/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Effectiveness - SOTA
• 2 bits per nucleotideBit Encoding
• Matching groups of
nucleotides
Dictionary-
based Encoding
• Prediction using a
probabilistic model
Statistical
Encoding
• Matching with another
genome
Reference-
based Encoding
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
7/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Split
sequence in
blocks
Select
prediction
tool
Encode tool
parameters
Encode
residue
Block-based compression
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
8/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Split
sequence in
blocks
Select
prediction
tool
Encode tool
parameters
Encode
residue
Block-based compression
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
9/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Prediction
Residue
Genomic
data
Prediction & Residue
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
10/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Prediction
INTRA
“AAA…AA”
“TTT…TT”
“ATAT…AT”
“CGCG…CG”
Huffman
Encode in 2-
base Huffman
INTER
Search similar
block
Search
inverse
complement
Selecting Prediction Tools
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
11/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Split
sequence in
blocks
Select
prediction
tool
Encode tool
parameters
Encode
residue
Encoding prediction tool & residue
Context Adaptive Binary Arithmetic Coding
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
12/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Compression Triangle
Efficiency
FunctionalityEffectiveness
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
13/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Efficiency - SOTA
• Hashing
• Quick match detection tools (e.g. Pattern
Hunter)
• Single-threaded processing
• Decoding & transmitting complete files
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
14/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Block-based compression
Parallel processing
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
Split
sequence in
blocks
Select
prediction
tool
Encode tool
parameters
Encode
residue
15/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Efficiency - Research
• Partial decoding (@ block level)
• Live encoding/streaming
• Smart and adaptive use of compression
tools
– Load balancing
– Compression speed vs ratio
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
16/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Compression Triangle
Efficiency
FunctionalityEffectiveness
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
17/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Functionality - SOTA
• Random access
• Metadata
• Encryption @ full-file level
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
18/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Functionality - Research
• Random access
• Metadata
– File adaptation
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
19/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Functionality - Research
• Compressed-domain analysis and
adaptation
– Selecting parts of the genome for
transmission
– Using INTER information to track repeats
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
20/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Functionality - Research
• DRM/Encryption
– @(sub)block level
• Random access
• Adaptation
• Some compressed-domain analysis
– @accuracy level
• Lossless for trusted researchers
• Near-lossless for everybody else
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
21/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Can we apply media file technology on
genomic data?
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014
Benchmarking sets?
Please…
22/59
ELIS – Multimedia Lab
<Title>
<Author>
<Date>
Compression Triangle
Efficiency
FunctionalityEffectiveness
Genome Sequences as Media Files
Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle
05-03-2014

More Related Content

Viewers also liked

Content-Driven Apps with React
Content-Driven Apps with ReactContent-Driven Apps with React
Content-Driven Apps with React
Netcetera
 
Curoverse Presentation at ICG-11 (November 2016)
Curoverse Presentation at ICG-11 (November 2016)Curoverse Presentation at ICG-11 (November 2016)
Curoverse Presentation at ICG-11 (November 2016)
Arvados
 
ACMG 2017 The Data Behind the Results - Bioinformatics for Clinicians
ACMG 2017 The Data Behind the Results - Bioinformatics for CliniciansACMG 2017 The Data Behind the Results - Bioinformatics for Clinicians
ACMG 2017 The Data Behind the Results - Bioinformatics for Clinicians
Erica Ramos
 
Netcetera Innovation Summit 2016: The Past 12 Months - What's New & Exciting
Netcetera Innovation Summit 2016: The Past 12 Months - What's New & ExcitingNetcetera Innovation Summit 2016: The Past 12 Months - What's New & Exciting
Netcetera Innovation Summit 2016: The Past 12 Months - What's New & Exciting
Netcetera
 
SwissWallet - Die digitale Währung heisst Vertrauen
SwissWallet - Die digitale Währung heisst Vertrauen SwissWallet - Die digitale Währung heisst Vertrauen
SwissWallet - Die digitale Währung heisst Vertrauen
Netcetera
 
COSCUP 2016 Workshop : 快快樂樂學Neo4j
COSCUP 2016 Workshop : 快快樂樂學Neo4jCOSCUP 2016 Workshop : 快快樂樂學Neo4j
COSCUP 2016 Workshop : 快快樂樂學Neo4j
Eric Lee
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Wesley De Neve
 
Authentication requirements and application of PSD2 in e-Commerce - Presentat...
Authentication requirements and application of PSD2 in e-Commerce - Presentat...Authentication requirements and application of PSD2 in e-Commerce - Presentat...
Authentication requirements and application of PSD2 in e-Commerce - Presentat...
Netcetera
 
SkopjePulse: Designing a better city with IoT
SkopjePulse: Designing a better city with IoTSkopjePulse: Designing a better city with IoT
SkopjePulse: Designing a better city with IoT
Netcetera
 
Polyglot Persistence e Big Data: tra innovazione e difficoltà su casi reali -...
Polyglot Persistence e Big Data: tra innovazione e difficoltà su casi reali -...Polyglot Persistence e Big Data: tra innovazione e difficoltà su casi reali -...
Polyglot Persistence e Big Data: tra innovazione e difficoltà su casi reali -...
Data Driven Innovation
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
c.titus.brown
 
Die Herausforderungen in der Payment-Industrie
Die Herausforderungen in der Payment-IndustrieDie Herausforderungen in der Payment-Industrie
Die Herausforderungen in der Payment-Industrie
Netcetera
 
Managers - The Missing Manual
Managers - The Missing ManualManagers - The Missing Manual
Managers - The Missing Manual
Netcetera
 

Viewers also liked (13)

Content-Driven Apps with React
Content-Driven Apps with ReactContent-Driven Apps with React
Content-Driven Apps with React
 
Curoverse Presentation at ICG-11 (November 2016)
Curoverse Presentation at ICG-11 (November 2016)Curoverse Presentation at ICG-11 (November 2016)
Curoverse Presentation at ICG-11 (November 2016)
 
ACMG 2017 The Data Behind the Results - Bioinformatics for Clinicians
ACMG 2017 The Data Behind the Results - Bioinformatics for CliniciansACMG 2017 The Data Behind the Results - Bioinformatics for Clinicians
ACMG 2017 The Data Behind the Results - Bioinformatics for Clinicians
 
Netcetera Innovation Summit 2016: The Past 12 Months - What's New & Exciting
Netcetera Innovation Summit 2016: The Past 12 Months - What's New & ExcitingNetcetera Innovation Summit 2016: The Past 12 Months - What's New & Exciting
Netcetera Innovation Summit 2016: The Past 12 Months - What's New & Exciting
 
SwissWallet - Die digitale Währung heisst Vertrauen
SwissWallet - Die digitale Währung heisst Vertrauen SwissWallet - Die digitale Währung heisst Vertrauen
SwissWallet - Die digitale Währung heisst Vertrauen
 
COSCUP 2016 Workshop : 快快樂樂學Neo4j
COSCUP 2016 Workshop : 快快樂樂學Neo4jCOSCUP 2016 Workshop : 快快樂樂學Neo4j
COSCUP 2016 Workshop : 快快樂樂學Neo4j
 
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
Deep Machine Learning for Making Sense of Biotech Data - From Clean Energy to...
 
Authentication requirements and application of PSD2 in e-Commerce - Presentat...
Authentication requirements and application of PSD2 in e-Commerce - Presentat...Authentication requirements and application of PSD2 in e-Commerce - Presentat...
Authentication requirements and application of PSD2 in e-Commerce - Presentat...
 
SkopjePulse: Designing a better city with IoT
SkopjePulse: Designing a better city with IoTSkopjePulse: Designing a better city with IoT
SkopjePulse: Designing a better city with IoT
 
Polyglot Persistence e Big Data: tra innovazione e difficoltà su casi reali -...
Polyglot Persistence e Big Data: tra innovazione e difficoltà su casi reali -...Polyglot Persistence e Big Data: tra innovazione e difficoltà su casi reali -...
Polyglot Persistence e Big Data: tra innovazione e difficoltà su casi reali -...
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 
Die Herausforderungen in der Payment-Industrie
Die Herausforderungen in der Payment-IndustrieDie Herausforderungen in der Payment-Industrie
Die Herausforderungen in der Payment-Industrie
 
Managers - The Missing Manual
Managers - The Missing ManualManagers - The Missing Manual
Managers - The Missing Manual
 

Similar to Genome sequences as media files

Vitus Masters Defense
Vitus Masters DefenseVitus Masters Defense
Vitus Masters Defense
derDoc
 
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN Australia
 
Text and Data Mining explained at FTDM
Text and Data Mining explained at FTDMText and Data Mining explained at FTDM
Text and Data Mining explained at FTDM
petermurrayrust
 
Content Mining of Science and Medicine
Content Mining of Science and MedicineContent Mining of Science and Medicine
Content Mining of Science and Medicine
TheContentMine
 
TERN Facility Portals - Stuart Phinn
TERN Facility Portals - Stuart PhinnTERN Facility Portals - Stuart Phinn
TERN Facility Portals - Stuart Phinn
TERN Australia
 
Odam: Open Data, Access and Mining
Odam: Open Data, Access and MiningOdam: Open Data, Access and Mining
Odam: Open Data, Access and Mining
Daniel JACOB
 
06-dash.pptx
06-dash.pptx06-dash.pptx
06-dash.pptx
AliIssa53
 
Research Data Management Implementations
Research Data Management Implementations Research Data Management Implementations
Research Data Management Implementations
Globus
 
Virtual Gov Day - IT Operations Breakout - Jennifer Green, R&D Scientist, Los...
Virtual Gov Day - IT Operations Breakout - Jennifer Green, R&D Scientist, Los...Virtual Gov Day - IT Operations Breakout - Jennifer Green, R&D Scientist, Los...
Virtual Gov Day - IT Operations Breakout - Jennifer Green, R&D Scientist, Los...
Splunk
 
Apache Tika: 1 point Oh!
Apache Tika: 1 point Oh!Apache Tika: 1 point Oh!
Apache Tika: 1 point Oh!
Chris Mattmann
 
Dynamic Adaptive Streaming over HTTP (DASH)
Dynamic Adaptive Streaming over HTTP (DASH)Dynamic Adaptive Streaming over HTTP (DASH)
Dynamic Adaptive Streaming over HTTP (DASH)Alpen-Adria-Universität
 
Is Your Data Secure
Is Your Data SecureIs Your Data Secure
Is Your Data Secure
Real-Time Innovations (RTI)
 
Jeff Grethe: CAMERA
Jeff Grethe: CAMERAJeff Grethe: CAMERA
Jeff Grethe: CAMERA
Iddo
 
Course Research Data Management
Course Research Data ManagementCourse Research Data Management
Course Research Data Management
Maarten Van Bentum
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentationTERN Australia
 
HyperTED - Searching and browsing through fragments of TED Talks
HyperTED - Searching and browsing through fragments of TED TalksHyperTED - Searching and browsing through fragments of TED Talks
HyperTED - Searching and browsing through fragments of TED Talks
Mariella Sabatino
 
ProteomeXchange: data deposition and data retrieval made easy
ProteomeXchange: data deposition and data retrieval made easyProteomeXchange: data deposition and data retrieval made easy
ProteomeXchange: data deposition and data retrieval made easy
Juan Antonio Vizcaino
 
Ph. D. Final Dissertation SLides
Ph. D. Final Dissertation SLidesPh. D. Final Dissertation SLides
Ph. D. Final Dissertation SLides
Emanuele Panigati
 
Make your data great again - Ver 2
Make your data great again - Ver 2Make your data great again - Ver 2
Make your data great again - Ver 2
Daniel JACOB
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational Science
Chelle Gentemann
 

Similar to Genome sequences as media files (20)

Vitus Masters Defense
Vitus Masters DefenseVitus Masters Defense
Vitus Masters Defense
 
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
TERN ESA Workshop 2012, 'Smarter Workflows for Ecologists'
 
Text and Data Mining explained at FTDM
Text and Data Mining explained at FTDMText and Data Mining explained at FTDM
Text and Data Mining explained at FTDM
 
Content Mining of Science and Medicine
Content Mining of Science and MedicineContent Mining of Science and Medicine
Content Mining of Science and Medicine
 
TERN Facility Portals - Stuart Phinn
TERN Facility Portals - Stuart PhinnTERN Facility Portals - Stuart Phinn
TERN Facility Portals - Stuart Phinn
 
Odam: Open Data, Access and Mining
Odam: Open Data, Access and MiningOdam: Open Data, Access and Mining
Odam: Open Data, Access and Mining
 
06-dash.pptx
06-dash.pptx06-dash.pptx
06-dash.pptx
 
Research Data Management Implementations
Research Data Management Implementations Research Data Management Implementations
Research Data Management Implementations
 
Virtual Gov Day - IT Operations Breakout - Jennifer Green, R&D Scientist, Los...
Virtual Gov Day - IT Operations Breakout - Jennifer Green, R&D Scientist, Los...Virtual Gov Day - IT Operations Breakout - Jennifer Green, R&D Scientist, Los...
Virtual Gov Day - IT Operations Breakout - Jennifer Green, R&D Scientist, Los...
 
Apache Tika: 1 point Oh!
Apache Tika: 1 point Oh!Apache Tika: 1 point Oh!
Apache Tika: 1 point Oh!
 
Dynamic Adaptive Streaming over HTTP (DASH)
Dynamic Adaptive Streaming over HTTP (DASH)Dynamic Adaptive Streaming over HTTP (DASH)
Dynamic Adaptive Streaming over HTTP (DASH)
 
Is Your Data Secure
Is Your Data SecureIs Your Data Secure
Is Your Data Secure
 
Jeff Grethe: CAMERA
Jeff Grethe: CAMERAJeff Grethe: CAMERA
Jeff Grethe: CAMERA
 
Course Research Data Management
Course Research Data ManagementCourse Research Data Management
Course Research Data Management
 
AusCover portal presentation
AusCover portal presentationAusCover portal presentation
AusCover portal presentation
 
HyperTED - Searching and browsing through fragments of TED Talks
HyperTED - Searching and browsing through fragments of TED TalksHyperTED - Searching and browsing through fragments of TED Talks
HyperTED - Searching and browsing through fragments of TED Talks
 
ProteomeXchange: data deposition and data retrieval made easy
ProteomeXchange: data deposition and data retrieval made easyProteomeXchange: data deposition and data retrieval made easy
ProteomeXchange: data deposition and data retrieval made easy
 
Ph. D. Final Dissertation SLides
Ph. D. Final Dissertation SLidesPh. D. Final Dissertation SLides
Ph. D. Final Dissertation SLides
 
Make your data great again - Ver 2
Make your data great again - Ver 2Make your data great again - Ver 2
Make your data great again - Ver 2
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational Science
 

Recently uploaded

Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
zeex60
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 

Recently uploaded (20)

Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Introduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptxIntroduction to Mean Field Theory(MFT).pptx
Introduction to Mean Field Theory(MFT).pptx
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 

Genome sequences as media files

  • 1. 1/59 ELIS – Multimedia Lab <Title> <Author> <Date> Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014 Genome Sequences as Media Files Towards effective, efficient, and functional compression of genomic data
  • 2. 2/59 ELIS – Multimedia Lab <Title> <Author> <Date> Video Coding Adap- tation Analysis Experience Satellite DVB- S2 VSAT Ground Station CT Scans Com- pression Trans- mission Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 3. 3/59 ELIS – Multimedia Lab <Title> <Author> <Date> Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 4. 4/59 ELIS – Multimedia Lab <Title> <Author> <Date> Compression Triangle Efficiency FunctionalityEffectiveness Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 5. 5/59 ELIS – Multimedia Lab <Title> <Author> <Date> Compression Triangle Efficiency FunctionalityEffectiveness Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 6. 6/59 ELIS – Multimedia Lab <Title> <Author> <Date> Effectiveness - SOTA • 2 bits per nucleotideBit Encoding • Matching groups of nucleotides Dictionary- based Encoding • Prediction using a probabilistic model Statistical Encoding • Matching with another genome Reference- based Encoding Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 7. 7/59 ELIS – Multimedia Lab <Title> <Author> <Date> Split sequence in blocks Select prediction tool Encode tool parameters Encode residue Block-based compression Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 8. 8/59 ELIS – Multimedia Lab <Title> <Author> <Date> Split sequence in blocks Select prediction tool Encode tool parameters Encode residue Block-based compression Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 9. 9/59 ELIS – Multimedia Lab <Title> <Author> <Date> Prediction Residue Genomic data Prediction & Residue Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 10. 10/59 ELIS – Multimedia Lab <Title> <Author> <Date> Prediction INTRA “AAA…AA” “TTT…TT” “ATAT…AT” “CGCG…CG” Huffman Encode in 2- base Huffman INTER Search similar block Search inverse complement Selecting Prediction Tools Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 11. 11/59 ELIS – Multimedia Lab <Title> <Author> <Date> Split sequence in blocks Select prediction tool Encode tool parameters Encode residue Encoding prediction tool & residue Context Adaptive Binary Arithmetic Coding Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 12. 12/59 ELIS – Multimedia Lab <Title> <Author> <Date> Compression Triangle Efficiency FunctionalityEffectiveness Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 13. 13/59 ELIS – Multimedia Lab <Title> <Author> <Date> Efficiency - SOTA • Hashing • Quick match detection tools (e.g. Pattern Hunter) • Single-threaded processing • Decoding & transmitting complete files Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 14. 14/59 ELIS – Multimedia Lab <Title> <Author> <Date> Block-based compression Parallel processing Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014 Split sequence in blocks Select prediction tool Encode tool parameters Encode residue
  • 15. 15/59 ELIS – Multimedia Lab <Title> <Author> <Date> Efficiency - Research • Partial decoding (@ block level) • Live encoding/streaming • Smart and adaptive use of compression tools – Load balancing – Compression speed vs ratio Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 16. 16/59 ELIS – Multimedia Lab <Title> <Author> <Date> Compression Triangle Efficiency FunctionalityEffectiveness Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 17. 17/59 ELIS – Multimedia Lab <Title> <Author> <Date> Functionality - SOTA • Random access • Metadata • Encryption @ full-file level Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 18. 18/59 ELIS – Multimedia Lab <Title> <Author> <Date> Functionality - Research • Random access • Metadata – File adaptation Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 19. 19/59 ELIS – Multimedia Lab <Title> <Author> <Date> Functionality - Research • Compressed-domain analysis and adaptation – Selecting parts of the genome for transmission – Using INTER information to track repeats Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 20. 20/59 ELIS – Multimedia Lab <Title> <Author> <Date> Functionality - Research • DRM/Encryption – @(sub)block level • Random access • Adaptation • Some compressed-domain analysis – @accuracy level • Lossless for trusted researchers • Near-lossless for everybody else Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014
  • 21. 21/59 ELIS – Multimedia Lab <Title> <Author> <Date> Can we apply media file technology on genomic data? Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014 Benchmarking sets? Please…
  • 22. 22/59 ELIS – Multimedia Lab <Title> <Author> <Date> Compression Triangle Efficiency FunctionalityEffectiveness Genome Sequences as Media Files Tom Paridaens, Wesley De Neve, Peter Lambert and Rik Van de Walle 05-03-2014

Editor's Notes

  1. Onderzoeksvraag: in welke mate kan multimediatechnologie hergebruikt worden voor het verwerken/analyseren/comprimeren en transmissie van genomische data