SlideShare a Scribd company logo
Genome Visualization: Circos Plots
Samarth Kulshrestha
Data Analyst
sk3@nibmg.ac.in
● Circos Plots: Examples
● Technical Requirements
● Chromosomes & Ideogram
● Points to remember
● Circos Configuration file format
● Circos Distribution.
– Folder Distribution
● Data format
Overview
● Circos is a softare package for visualizing data
and information. It visualize data in circular
layout.
● Circos is ideal for creating publication-quality
infographics, richly layered data and pleasant
symmetries.
● If you are a researcher, analyst,data geek or
visual artist who is seeking to communicate a
dataset, circos is for you.
Circos plots
● The biological scientific community has
adopted circos worldwide. Circos has
appeared in many top scientific journals.
Availability @
www.circos.ca
5
D
C
B
A
Available Data
Tracks/Display:
D) Stacked Histogram
C) Ideogram
B) Highlights
A) Links
Circos-plot example
6
Circos-plot example
Available Data Tracks/Display:
F) Ticks
E) Text
E
F
7
8
Nature Communications 6, Article number: 5973 doi:10.1038/ncomms6973
Circos Plot showing the
count of SNV variants
and copy number
changes.
9
1) A machine with installed perl and additional
perl modules.
2) Karyotype file
3) Configuration file
4) Genomic data files.
To generate Circos, We NEED
9
Technical Requirements
Technical Requirements
● Platform: Windows/Linux/Mac OS
● Perl 5.8 or newer version and additional
perl modules
11
..continue
12
..continue
●
Config::General
Font::TTF
GD
List::MoreUtils
Math::Bezier
Math::Vecstat
Param::Validat
e Readonly
.............. And many
more
●
●
●
●
●
●
●
For more information regarding perl
modules installation, please follow the link
http://circos.ca/tutorials/lessons/configurat
ion/perl_and_modules/
Additional Perl Modules
●
To list all the required
modules and check whether
they are installed, use
-modules.
First open a new terminal &
change directory to Circos
folder
●
Now check installed
modules using:
13
perl circos -modules
Chromosomes & Ideogram
14
vs
Traditional Way vs Circular way
The chromosome is the entire sequence structure
as defined in Karyotype file.
Ideogram is the depiction of the chromosome, or
region. Ideogram of hg 19 has been shown in the
image. (22+X,Y chr)
Format of hg19 Karyotype File: (available with
circos package in Workshop/circos/data/karyotype)
16
chr - hs1 1 0 249250621 chr1
chr - hs2 2 0 243199373 chr2
chr - hs3 3 0 198022430 chr3
chr - hs4 4 0 191154276 chr4
chr - hs5 5 0 180915260 chr5
--------------------------------------
--------------------------------------
band hs1 p36.33 p36.33 0 2300000 gneg
band hs1 p36.32 p36.32 2300000 5400000 gpos25
--------------------------------------
Example of Ideogram
Chromosome & Ideogram
Points to remember
17
● All input files are text.
● Output image formats are .png and .svg and are
static.
● It requires configuration file(s) to specify Circos
layout and data tracks.
● It does not do any analysis,only for visualization.
● Comment lines starts with a hash “#”
● Run on command-line.
Points To remember
Circos Configuration file format
19
20
●
The image generation process is driven by
a central configuration file.
●
Circos configuration (.conf) file is text file with
some parameters that define the image,
including input files, image size, formatting
etc.
●
This file usually imports other
configuration files, such as global color
and font settings.
Circos configuration (.conf) file
format
21
..continue
Configuration file syntax
➢
Blocks
<ideogram> #start of block
thickness = 30p
fill = yes
..........
</ideogram> #end of block
Nested Blocks
<plots>
<plot>
type=line
➢
file = data/file1.txt
.........
</plot>
<plot>
type=line
file = data/file2.txt
..........
</plot>
</plots>
An example of Nested Block
Under <plots> parameter there are 2
line plots, hence Nested plots. We
can add more than 2 plots in a
nested block.
An example of a simple block
This is an example of single
block for an ideogram.
Global vs Local parameters
<plots>
type
color
# start of plots block
= line
= grey
<plot> # start of first plot
file = data/file1.txt
..........
</plot>
<plot>
# end of first plot
# start of second plot
file = data/file2.txt
..........
</plot> # end of second plot
<plot>
type
color
file
</plot>
# start of third plot
= scatter
= blue
= data/file3.txt
# end of third plot
Global parameters type and
color. First two plots will be of
grey color line plots.
22</plots> # end of plots block
Local parameters type and
color are specific to third
plot only. So third plot will
be of blue scatterd.
Circos Distribution
23
Following directories comes with circos package.
bin/
data/
etc/
.........
Circos executable
Contains Karyotype files
Necessary configuration files
24
Circos Distribution
circos -conf <configFile>
Configuration file
Circos
Circos.png
Data track files 25
Circos.svg
Command to run
circos:
Simple Circos Process
circos -conf <configFile>
Configuration file
Additional conf files
Circos
Circos.svg
Circos.png
Data track files 26
Command to run
circos:
27
●
User Should always import these
additional conf files
# colors, fonts and fill patterns
<<include etc/colors_fonts_patterns.conf>>
# system and debug parameters
<<include etc/housekeeping.conf>>
# image parameters
<<include etc/image.conf>>
Data format for circos input
28NIBMG Winter School 2017
hs10 60001 134200
hs10 134201 234000
hs10 234001 317600
hs10 317601 445100
...................................
chr10 60001 134200
chr10 134201 234000
chr10 234001 317600
chr10 317601 445100
...................................
Chr Start Stop Values
Data format for different
tracks for Circos plot is 4
column data ie
Chr Start Stop Values
But before creating any
plot “chr” needs to be
change to “hs” and
header should be removed
as shown in the right hand
side image.
Data formats for Circos tracks
..continue
hs10
hs10
hs10
hs10
50632 50632 CALY
134201 234000 ------
219699 219699 UNC5B
317601 445100 ------
...................................
hs10
hs10
hs10
hs10
50632 50632
134201 234000
219699 219699
317601 445100
...................................
Chr Start Stop Values
Data format for different
tracks for Circos plot is 4
column data ie
Chr Start Stop Values
But before creating any
plot “chr” needs to be
change to “hs” and
header should be
removed as shown in the
right hand side image.
30
<ideogram>
<spacing>
default = 0.005r
</spacing>
radius = 0.90r
thickness = 30p
fill = yes
#Radius of ideogram
# thickness of ideogram
# Color in ideogram
show_label = yes
label_font = default
label_radius
label_size
label_parallel
label_case
label_center
</ideogram>
= dims(ideogram,radius)+50p
= 50
= yes
= upper
= yes
radius=0.90r
31
Parameter Explained for a simple
ideogram
Thank You

More Related Content

What's hot

Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
Athira RG
 
High Throughput Sequencing Technologies: On the path to the $0* genome
High Throughput Sequencing Technologies: On the path to the $0* genomeHigh Throughput Sequencing Technologies: On the path to the $0* genome
High Throughput Sequencing Technologies: On the path to the $0* genome
Brian Krueger
 
Gene prediction and expression
Gene prediction and expressionGene prediction and expression
Gene prediction and expression
ishi tandon
 
Genome assembly
Genome assemblyGenome assembly
Genome Database Systems
Genome Database Systems Genome Database Systems
Genome Database Systems
Harindu Chathuranga Korala
 
Genome organisation
Genome organisationGenome organisation
Genome organisation
Pram Priyanca
 
Tech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome BrowserTech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome Browser
Hoffman Lab
 
Microarray technology and applications
Microarray technology and applicationsMicroarray technology and applications
Microarray technology and applications
Purnima Kartha
 
Genome Assembly
Genome AssemblyGenome Assembly
Genome Assembly
Aureliano Bombarely
 
ILLUMINA SEQUENCE.pptx
ILLUMINA SEQUENCE.pptxILLUMINA SEQUENCE.pptx
ILLUMINA SEQUENCE.pptx
projectliberary
 
Clustal X
Clustal XClustal X
Clustal X
biinoida
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
Bhavitha Pulaparthi
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
VHIR Vall d’Hebron Institut de Recerca
 
Genetic mapping
Genetic mappingGenetic mapping
Genetic mapping
Bruno Mmassy
 
Whole genome shotgun sequencing
Whole genome shotgun sequencingWhole genome shotgun sequencing
Whole genome shotgun sequencing
Goutham Sarovar
 
Forward and reverse genetics
Forward and reverse geneticsForward and reverse genetics
Forward and reverse genetics
Sachin Ekatpure
 
microarrary
microarrarymicroarrary
microarrary
Dr Nisha Singh
 
Dna sequencing ppt
Dna sequencing pptDna sequencing ppt
Dna sequencing ppt
Siddaraj Basavaraj
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
manojjeya
 
15 molecular markers techniques
15 molecular markers techniques15 molecular markers techniques
15 molecular markers techniques
AVINASH KUSHWAHA
 

What's hot (20)

Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
High Throughput Sequencing Technologies: On the path to the $0* genome
High Throughput Sequencing Technologies: On the path to the $0* genomeHigh Throughput Sequencing Technologies: On the path to the $0* genome
High Throughput Sequencing Technologies: On the path to the $0* genome
 
Gene prediction and expression
Gene prediction and expressionGene prediction and expression
Gene prediction and expression
 
Genome assembly
Genome assemblyGenome assembly
Genome assembly
 
Genome Database Systems
Genome Database Systems Genome Database Systems
Genome Database Systems
 
Genome organisation
Genome organisationGenome organisation
Genome organisation
 
Tech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome BrowserTech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome Browser
 
Microarray technology and applications
Microarray technology and applicationsMicroarray technology and applications
Microarray technology and applications
 
Genome Assembly
Genome AssemblyGenome Assembly
Genome Assembly
 
ILLUMINA SEQUENCE.pptx
ILLUMINA SEQUENCE.pptxILLUMINA SEQUENCE.pptx
ILLUMINA SEQUENCE.pptx
 
Clustal X
Clustal XClustal X
Clustal X
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Introduction to next generation sequencing
Introduction to next generation sequencingIntroduction to next generation sequencing
Introduction to next generation sequencing
 
Genetic mapping
Genetic mappingGenetic mapping
Genetic mapping
 
Whole genome shotgun sequencing
Whole genome shotgun sequencingWhole genome shotgun sequencing
Whole genome shotgun sequencing
 
Forward and reverse genetics
Forward and reverse geneticsForward and reverse genetics
Forward and reverse genetics
 
microarrary
microarrarymicroarrary
microarrary
 
Dna sequencing ppt
Dna sequencing pptDna sequencing ppt
Dna sequencing ppt
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
 
15 molecular markers techniques
15 molecular markers techniques15 molecular markers techniques
15 molecular markers techniques
 

Similar to Circos Tutorial

Circos
CircosCircos
Circos
mkim8
 
Circos
CircosCircos
H2020-AHTOOLS Use Case 3 Functional Design
H2020-AHTOOLS Use Case 3 Functional DesignH2020-AHTOOLS Use Case 3 Functional Design
H2020-AHTOOLS Use Case 3 Functional Design
CARLOS III UNIVERSITY OF MADRID
 
Short.course.introduction.to.vhdl
Short.course.introduction.to.vhdlShort.course.introduction.to.vhdl
Short.course.introduction.to.vhdl
Ravi Sony
 
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Yuichiro Yasui
 
What is C3D?
What is C3D?What is C3D?
What is C3D?
Arkadiy Kamnev
 
Tutorial for EDA Tools
Tutorial for EDA ToolsTutorial for EDA Tools
Tutorial for EDA Tools
Minh Anh Nguyen
 
Tutorial for EDA Tools:
Tutorial for EDA Tools:Tutorial for EDA Tools:
Tutorial for EDA Tools:
Minh Anh Nguyen
 
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
RISC-V International
 
C3D Labs. Geometric Modeling Toolkit
C3D Labs. Geometric Modeling ToolkitC3D Labs. Geometric Modeling Toolkit
C3D Labs. Geometric Modeling Toolkit
Сергей Бирюков
 
C3D Labs. Geometric Modeling Toolkit
C3D Labs. Geometric Modeling ToolkitC3D Labs. Geometric Modeling Toolkit
C3D Labs. Geometric Modeling Toolkit
Arkadiy Kamnev
 
Arm: Enabling CXL devices within the Data Center with Arm Solutions
Arm: Enabling CXL devices within the Data Center with Arm SolutionsArm: Enabling CXL devices within the Data Center with Arm Solutions
Arm: Enabling CXL devices within the Data Center with Arm Solutions
Memory Fabric Forum
 
PCL (Point Cloud Library)
PCL (Point Cloud Library)PCL (Point Cloud Library)
PCL (Point Cloud Library)
University of Oklahoma
 
Algebra
AlgebraAlgebra
Algebra
Hira Maharjan
 
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStorDelivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Rebekah Rodriguez
 
Performance Evaluation & Design Methodologies for Automated 32 Bit CRC Checki...
Performance Evaluation & Design Methodologies for Automated 32 Bit CRC Checki...Performance Evaluation & Design Methodologies for Automated 32 Bit CRC Checki...
Performance Evaluation & Design Methodologies for Automated 32 Bit CRC Checki...
VIT-AP University
 
Pcbgcode
PcbgcodePcbgcode
Data Analysis with TensorFlow in PostgreSQL
Data Analysis with TensorFlow in PostgreSQLData Analysis with TensorFlow in PostgreSQL
Data Analysis with TensorFlow in PostgreSQL
EDB
 
Graphics Standards and Algorithm
Graphics Standards and AlgorithmGraphics Standards and Algorithm
Graphics Standards and Algorithm
Yatin Singh
 
Hardware accelerator for financial application in HDL and HLS, SAMOS 2017
Hardware accelerator for financial application in HDL and HLS, SAMOS 2017Hardware accelerator for financial application in HDL and HLS, SAMOS 2017
Hardware accelerator for financial application in HDL and HLS, SAMOS 2017
VINEYARD - Versatile Integrated Accelerator-based Heterogeneous Data Centres
 

Similar to Circos Tutorial (20)

Circos
CircosCircos
Circos
 
Circos
CircosCircos
Circos
 
H2020-AHTOOLS Use Case 3 Functional Design
H2020-AHTOOLS Use Case 3 Functional DesignH2020-AHTOOLS Use Case 3 Functional Design
H2020-AHTOOLS Use Case 3 Functional Design
 
Short.course.introduction.to.vhdl
Short.course.introduction.to.vhdlShort.course.introduction.to.vhdl
Short.course.introduction.to.vhdl
 
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
Graph500 and Green Graph500 benchmarks on SGI UV2000 @ SGI UG SC14
 
What is C3D?
What is C3D?What is C3D?
What is C3D?
 
Tutorial for EDA Tools
Tutorial for EDA ToolsTutorial for EDA Tools
Tutorial for EDA Tools
 
Tutorial for EDA Tools:
Tutorial for EDA Tools:Tutorial for EDA Tools:
Tutorial for EDA Tools:
 
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
Klessydra t - designing vector coprocessors for multi-threaded edge-computing...
 
C3D Labs. Geometric Modeling Toolkit
C3D Labs. Geometric Modeling ToolkitC3D Labs. Geometric Modeling Toolkit
C3D Labs. Geometric Modeling Toolkit
 
C3D Labs. Geometric Modeling Toolkit
C3D Labs. Geometric Modeling ToolkitC3D Labs. Geometric Modeling Toolkit
C3D Labs. Geometric Modeling Toolkit
 
Arm: Enabling CXL devices within the Data Center with Arm Solutions
Arm: Enabling CXL devices within the Data Center with Arm SolutionsArm: Enabling CXL devices within the Data Center with Arm Solutions
Arm: Enabling CXL devices within the Data Center with Arm Solutions
 
PCL (Point Cloud Library)
PCL (Point Cloud Library)PCL (Point Cloud Library)
PCL (Point Cloud Library)
 
Algebra
AlgebraAlgebra
Algebra
 
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStorDelivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
Delivering Supermicro Software Defined Storage Solutions with OSNexus QuantaStor
 
Performance Evaluation & Design Methodologies for Automated 32 Bit CRC Checki...
Performance Evaluation & Design Methodologies for Automated 32 Bit CRC Checki...Performance Evaluation & Design Methodologies for Automated 32 Bit CRC Checki...
Performance Evaluation & Design Methodologies for Automated 32 Bit CRC Checki...
 
Pcbgcode
PcbgcodePcbgcode
Pcbgcode
 
Data Analysis with TensorFlow in PostgreSQL
Data Analysis with TensorFlow in PostgreSQLData Analysis with TensorFlow in PostgreSQL
Data Analysis with TensorFlow in PostgreSQL
 
Graphics Standards and Algorithm
Graphics Standards and AlgorithmGraphics Standards and Algorithm
Graphics Standards and Algorithm
 
Hardware accelerator for financial application in HDL and HLS, SAMOS 2017
Hardware accelerator for financial application in HDL and HLS, SAMOS 2017Hardware accelerator for financial application in HDL and HLS, SAMOS 2017
Hardware accelerator for financial application in HDL and HLS, SAMOS 2017
 

Recently uploaded

Evaluation and Identification of J'BaFofi the Giant Spider of Congo and Moke...
Evaluation and Identification of J'BaFofi the Giant  Spider of Congo and Moke...Evaluation and Identification of J'BaFofi the Giant  Spider of Congo and Moke...
Evaluation and Identification of J'BaFofi the Giant Spider of Congo and Moke...
MrSproy
 
Post translation modification by Suyash Garg
Post translation modification by Suyash GargPost translation modification by Suyash Garg
Post translation modification by Suyash Garg
suyashempire
 
seed production, Nursery & Gardening.pdf
seed production, Nursery & Gardening.pdfseed production, Nursery & Gardening.pdf
seed production, Nursery & Gardening.pdf
Nistarini College, Purulia (W.B) India
 
AJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR NIET GreNo Guava Project File.pdfAJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR
 
一比一原版美国佩斯大学毕业证如何办理
一比一原版美国佩斯大学毕业证如何办理一比一原版美国佩斯大学毕业证如何办理
一比一原版美国佩斯大学毕业证如何办理
gyhwyo
 
Mechanics:- Simple and Compound Pendulum
Mechanics:- Simple and Compound PendulumMechanics:- Simple and Compound Pendulum
Mechanics:- Simple and Compound Pendulum
PravinHudge1
 
Analysis of Polygenic Traits (GPB-602)
Analysis of Polygenic Traits (GPB-602)Analysis of Polygenic Traits (GPB-602)
Analysis of Polygenic Traits (GPB-602)
PABOLU TEJASREE
 
Introduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptxIntroduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptx
QusayMaghayerh
 
TOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptx
TOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptxTOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptx
TOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptx
shubhijain836
 
Explainable Deepfake Image/Video Detection
Explainable Deepfake Image/Video DetectionExplainable Deepfake Image/Video Detection
Explainable Deepfake Image/Video Detection
VasileiosMezaris
 
Nutaceuticsls herbal drug technology CVS, cancer.pptx
Nutaceuticsls herbal drug technology CVS, cancer.pptxNutaceuticsls herbal drug technology CVS, cancer.pptx
Nutaceuticsls herbal drug technology CVS, cancer.pptx
vimalveerammal
 
the fundamental unit of life CBSE class 9.pptx
the fundamental unit of life CBSE class 9.pptxthe fundamental unit of life CBSE class 9.pptx
the fundamental unit of life CBSE class 9.pptx
parminder0808singh
 
Reaching the age of Adolescence- Class 8
Reaching the age of Adolescence- Class 8Reaching the age of Adolescence- Class 8
Reaching the age of Adolescence- Class 8
abhinayakamasamudram
 
Holsinger, Bruce W. - Music, body and desire in medieval culture [2001].pdf
Holsinger, Bruce W. - Music, body and desire in medieval culture [2001].pdfHolsinger, Bruce W. - Music, body and desire in medieval culture [2001].pdf
Holsinger, Bruce W. - Music, body and desire in medieval culture [2001].pdf
frank0071
 
Mites,Slug,Snail_Infesting agricultural crops.pdf
Mites,Slug,Snail_Infesting agricultural crops.pdfMites,Slug,Snail_Infesting agricultural crops.pdf
Mites,Slug,Snail_Infesting agricultural crops.pdf
PirithiRaju
 
GBSN - Microbiology (Unit 2) Antimicrobial agents
GBSN - Microbiology (Unit 2) Antimicrobial agentsGBSN - Microbiology (Unit 2) Antimicrobial agents
GBSN - Microbiology (Unit 2) Antimicrobial agents
Areesha Ahmad
 
BANANA BUNCHY TOP K R.pptx
BANANA BUNCHY  TOP               K R.pptxBANANA BUNCHY  TOP               K R.pptx
BANANA BUNCHY TOP K R.pptx
KARTHIK REDDY C A
 
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Sérgio Sacani
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
Signatures of wave erosion in Titan’s coasts
Signatures of wave erosion in Titan’s coastsSignatures of wave erosion in Titan’s coasts
Signatures of wave erosion in Titan’s coasts
Sérgio Sacani
 

Recently uploaded (20)

Evaluation and Identification of J'BaFofi the Giant Spider of Congo and Moke...
Evaluation and Identification of J'BaFofi the Giant  Spider of Congo and Moke...Evaluation and Identification of J'BaFofi the Giant  Spider of Congo and Moke...
Evaluation and Identification of J'BaFofi the Giant Spider of Congo and Moke...
 
Post translation modification by Suyash Garg
Post translation modification by Suyash GargPost translation modification by Suyash Garg
Post translation modification by Suyash Garg
 
seed production, Nursery & Gardening.pdf
seed production, Nursery & Gardening.pdfseed production, Nursery & Gardening.pdf
seed production, Nursery & Gardening.pdf
 
AJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR NIET GreNo Guava Project File.pdfAJAY KUMAR NIET GreNo Guava Project File.pdf
AJAY KUMAR NIET GreNo Guava Project File.pdf
 
一比一原版美国佩斯大学毕业证如何办理
一比一原版美国佩斯大学毕业证如何办理一比一原版美国佩斯大学毕业证如何办理
一比一原版美国佩斯大学毕业证如何办理
 
Mechanics:- Simple and Compound Pendulum
Mechanics:- Simple and Compound PendulumMechanics:- Simple and Compound Pendulum
Mechanics:- Simple and Compound Pendulum
 
Analysis of Polygenic Traits (GPB-602)
Analysis of Polygenic Traits (GPB-602)Analysis of Polygenic Traits (GPB-602)
Analysis of Polygenic Traits (GPB-602)
 
Introduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptxIntroduction_Ch_01_Biotech Biotechnology course .pptx
Introduction_Ch_01_Biotech Biotechnology course .pptx
 
TOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptx
TOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptxTOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptx
TOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptx
 
Explainable Deepfake Image/Video Detection
Explainable Deepfake Image/Video DetectionExplainable Deepfake Image/Video Detection
Explainable Deepfake Image/Video Detection
 
Nutaceuticsls herbal drug technology CVS, cancer.pptx
Nutaceuticsls herbal drug technology CVS, cancer.pptxNutaceuticsls herbal drug technology CVS, cancer.pptx
Nutaceuticsls herbal drug technology CVS, cancer.pptx
 
the fundamental unit of life CBSE class 9.pptx
the fundamental unit of life CBSE class 9.pptxthe fundamental unit of life CBSE class 9.pptx
the fundamental unit of life CBSE class 9.pptx
 
Reaching the age of Adolescence- Class 8
Reaching the age of Adolescence- Class 8Reaching the age of Adolescence- Class 8
Reaching the age of Adolescence- Class 8
 
Holsinger, Bruce W. - Music, body and desire in medieval culture [2001].pdf
Holsinger, Bruce W. - Music, body and desire in medieval culture [2001].pdfHolsinger, Bruce W. - Music, body and desire in medieval culture [2001].pdf
Holsinger, Bruce W. - Music, body and desire in medieval culture [2001].pdf
 
Mites,Slug,Snail_Infesting agricultural crops.pdf
Mites,Slug,Snail_Infesting agricultural crops.pdfMites,Slug,Snail_Infesting agricultural crops.pdf
Mites,Slug,Snail_Infesting agricultural crops.pdf
 
GBSN - Microbiology (Unit 2) Antimicrobial agents
GBSN - Microbiology (Unit 2) Antimicrobial agentsGBSN - Microbiology (Unit 2) Antimicrobial agents
GBSN - Microbiology (Unit 2) Antimicrobial agents
 
BANANA BUNCHY TOP K R.pptx
BANANA BUNCHY  TOP               K R.pptxBANANA BUNCHY  TOP               K R.pptx
BANANA BUNCHY TOP K R.pptx
 
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
Signatures of wave erosion in Titan’s coasts
Signatures of wave erosion in Titan’s coastsSignatures of wave erosion in Titan’s coasts
Signatures of wave erosion in Titan’s coasts
 

Circos Tutorial

  • 1. Genome Visualization: Circos Plots Samarth Kulshrestha Data Analyst sk3@nibmg.ac.in
  • 2. ● Circos Plots: Examples ● Technical Requirements ● Chromosomes & Ideogram ● Points to remember ● Circos Configuration file format ● Circos Distribution. – Folder Distribution ● Data format Overview
  • 3. ● Circos is a softare package for visualizing data and information. It visualize data in circular layout. ● Circos is ideal for creating publication-quality infographics, richly layered data and pleasant symmetries. ● If you are a researcher, analyst,data geek or visual artist who is seeking to communicate a dataset, circos is for you. Circos plots
  • 4. ● The biological scientific community has adopted circos worldwide. Circos has appeared in many top scientific journals.
  • 6. D C B A Available Data Tracks/Display: D) Stacked Histogram C) Ideogram B) Highlights A) Links Circos-plot example 6
  • 7. Circos-plot example Available Data Tracks/Display: F) Ticks E) Text E F 7
  • 8. 8 Nature Communications 6, Article number: 5973 doi:10.1038/ncomms6973 Circos Plot showing the count of SNV variants and copy number changes.
  • 9. 9 1) A machine with installed perl and additional perl modules. 2) Karyotype file 3) Configuration file 4) Genomic data files. To generate Circos, We NEED
  • 11. Technical Requirements ● Platform: Windows/Linux/Mac OS ● Perl 5.8 or newer version and additional perl modules 11 ..continue
  • 12. 12 ..continue ● Config::General Font::TTF GD List::MoreUtils Math::Bezier Math::Vecstat Param::Validat e Readonly .............. And many more ● ● ● ● ● ● ● For more information regarding perl modules installation, please follow the link http://circos.ca/tutorials/lessons/configurat ion/perl_and_modules/ Additional Perl Modules
  • 13. ● To list all the required modules and check whether they are installed, use -modules. First open a new terminal & change directory to Circos folder ● Now check installed modules using: 13 perl circos -modules
  • 15. vs Traditional Way vs Circular way
  • 16. The chromosome is the entire sequence structure as defined in Karyotype file. Ideogram is the depiction of the chromosome, or region. Ideogram of hg 19 has been shown in the image. (22+X,Y chr) Format of hg19 Karyotype File: (available with circos package in Workshop/circos/data/karyotype) 16 chr - hs1 1 0 249250621 chr1 chr - hs2 2 0 243199373 chr2 chr - hs3 3 0 198022430 chr3 chr - hs4 4 0 191154276 chr4 chr - hs5 5 0 180915260 chr5 -------------------------------------- -------------------------------------- band hs1 p36.33 p36.33 0 2300000 gneg band hs1 p36.32 p36.32 2300000 5400000 gpos25 -------------------------------------- Example of Ideogram Chromosome & Ideogram
  • 18. ● All input files are text. ● Output image formats are .png and .svg and are static. ● It requires configuration file(s) to specify Circos layout and data tracks. ● It does not do any analysis,only for visualization. ● Comment lines starts with a hash “#” ● Run on command-line. Points To remember
  • 20. 20 ● The image generation process is driven by a central configuration file. ● Circos configuration (.conf) file is text file with some parameters that define the image, including input files, image size, formatting etc. ● This file usually imports other configuration files, such as global color and font settings. Circos configuration (.conf) file format
  • 21. 21 ..continue Configuration file syntax ➢ Blocks <ideogram> #start of block thickness = 30p fill = yes .......... </ideogram> #end of block Nested Blocks <plots> <plot> type=line ➢ file = data/file1.txt ......... </plot> <plot> type=line file = data/file2.txt .......... </plot> </plots> An example of Nested Block Under <plots> parameter there are 2 line plots, hence Nested plots. We can add more than 2 plots in a nested block. An example of a simple block This is an example of single block for an ideogram.
  • 22. Global vs Local parameters <plots> type color # start of plots block = line = grey <plot> # start of first plot file = data/file1.txt .......... </plot> <plot> # end of first plot # start of second plot file = data/file2.txt .......... </plot> # end of second plot <plot> type color file </plot> # start of third plot = scatter = blue = data/file3.txt # end of third plot Global parameters type and color. First two plots will be of grey color line plots. 22</plots> # end of plots block Local parameters type and color are specific to third plot only. So third plot will be of blue scatterd.
  • 24. Following directories comes with circos package. bin/ data/ etc/ ......... Circos executable Contains Karyotype files Necessary configuration files 24 Circos Distribution
  • 25. circos -conf <configFile> Configuration file Circos Circos.png Data track files 25 Circos.svg Command to run circos: Simple Circos Process
  • 26. circos -conf <configFile> Configuration file Additional conf files Circos Circos.svg Circos.png Data track files 26 Command to run circos:
  • 27. 27 ● User Should always import these additional conf files # colors, fonts and fill patterns <<include etc/colors_fonts_patterns.conf>> # system and debug parameters <<include etc/housekeeping.conf>> # image parameters <<include etc/image.conf>>
  • 28. Data format for circos input 28NIBMG Winter School 2017
  • 29. hs10 60001 134200 hs10 134201 234000 hs10 234001 317600 hs10 317601 445100 ................................... chr10 60001 134200 chr10 134201 234000 chr10 234001 317600 chr10 317601 445100 ................................... Chr Start Stop Values Data format for different tracks for Circos plot is 4 column data ie Chr Start Stop Values But before creating any plot “chr” needs to be change to “hs” and header should be removed as shown in the right hand side image. Data formats for Circos tracks ..continue
  • 30. hs10 hs10 hs10 hs10 50632 50632 CALY 134201 234000 ------ 219699 219699 UNC5B 317601 445100 ------ ................................... hs10 hs10 hs10 hs10 50632 50632 134201 234000 219699 219699 317601 445100 ................................... Chr Start Stop Values Data format for different tracks for Circos plot is 4 column data ie Chr Start Stop Values But before creating any plot “chr” needs to be change to “hs” and header should be removed as shown in the right hand side image. 30
  • 31. <ideogram> <spacing> default = 0.005r </spacing> radius = 0.90r thickness = 30p fill = yes #Radius of ideogram # thickness of ideogram # Color in ideogram show_label = yes label_font = default label_radius label_size label_parallel label_case label_center </ideogram> = dims(ideogram,radius)+50p = 50 = yes = upper = yes radius=0.90r 31 Parameter Explained for a simple ideogram