SlideShare a Scribd company logo
1 of 3
Variants Density along DNA Sequence
Y. Pageaud
27 septembre 2016
User Guide:
In this document you will find all needed informations to use the script ‘Variants_Density_Along_DNA_Sequence.R’.
This R script allow you to plot the density of variants contained in a VCF file along the DNA sequence you
have annotated (Chromosome, Scaffold, Contig, Whole Genome. . . ).
Compatibilities:
Ubuntu 16.04 LTS
R version 3.2.3 (2015-12-10) – “Wooden Christmas-Tree”
VCF File Format VCFv4.2
Prerequisite:
Install R version 3.2.3 or later version.
Optionally you can install RStudio after.
Install R packages ‘VariantAnnotation’ and ‘plyr’ from the website Bioconductor: https://www.bioconductor.org/
Package Loading:
library(VariantAnnotation)
library(plyr)
VCF File Example:
You can test the script on an example of a VCF file (‘Example.VCF’) in the archive .rar.
Set the Working Directory:
Put your working directory (where your VCF file is saved) between quotation marks:
setwd("/home/user/your_folder/")
Loads your VCF File:
vcf <- readVcf("yourfile.vcf","species")
1
DNA Sequence Variants Extraction:
Data_Frame_ranges<-as.data.frame(ranges(vcf))
sequence<-Data_Frame_ranges[grep("^sequence_name",Data_Frame_ranges$names),]
Variants Density Histograms Along DNA Sequence:
Function hist() parameters:
• Plot title: main = ‘string’
• Start position: start_position = int;
• End position: end_position = int
• Limits of the Y axis: ylim = c(float/int,float/int). Useful for thresholding the minimum density of
variants.
• Size of the sliding window: change the number of breakpoints of your histogram: histogram_breakpoints
= int ; to increase its size choose a smaller value, to decrease its size choose a higher value. the window
size depends on the length of the selected region of your sequence, and on the number of breakpoints
chosen.
Function axis() parameters:
• Start position: start_position = int;
• End position: end_position = int
• Accuracy or step value: change the graduations of the X axis: step_accuracy = int
• Rounding parameters: round_any(. . . ,f=floor/ceiling/trunc) ; select floor if you want to round to the
lower step value ; select ceiling if you want to round to the upper step value.
Default Parameters:
Matrix_Position_Sequence<-as.matrix(sequence[1])
start_position = head(sequence[[2]],n=1L)
end_position = tail(sequence[[2]],n=1L)
histogram_breakpoints = 347
window_size = round((end_position - start_position)/histogram_breakpoints)
hist(Matrix_Position_Sequence,
xlim=c(start_position,end_position),
breaks = histogram_breakpoints,
col='blue',
xaxt="n",
border='blue',
ylim=c(0,10),
main='Density of Variants along a DNA Sequence',
xlab='Positions on Sequence (bp)',
ylab=paste("Variants Density (Sliding Window of ~",window_size,"bp)",sep=" "))
2
step_accuracy=10**4
axis(1, at = seq(from = round_any(start_position,
step_accuracy,
f=floor),
to = round_any(end_position,
step_accuracy,
f=ceiling),
by = round_any(end_position,
step_accuracy,
f=ceiling)/
((1/step_accuracy)*
round_any(end_position,
step_accuracy,
f=ceiling))))
Density of Variants along a DNA Sequence
Positions on Sequence (bp)
VariantsDensity(SlidingWindowof~100bp)
0246810
10000 20000 30000 40000
For any question, feel free to ask me by E-mail: yoann.pageaud@gmail.com
References:
“The Variant Call Format (VCF) Version 4.2 Specification” January 26, 2015. https://github.
com/samtools/hts-specs
3

More Related Content

What's hot

Dynamic memory allocation in c
Dynamic memory allocation in cDynamic memory allocation in c
Dynamic memory allocation in clavanya marichamy
 
Ppt presentation of queues
Ppt presentation of queuesPpt presentation of queues
Ppt presentation of queuesBuxoo Abdullah
 
array, function, pointer, pattern matching
array, function, pointer, pattern matchingarray, function, pointer, pattern matching
array, function, pointer, pattern matchingShakila Mahjabin
 
Introduction to programming in MATLAB
Introduction to programming in MATLABIntroduction to programming in MATLAB
Introduction to programming in MATLABmustafa_92
 
Insertion sort and shell sort
Insertion sort and shell sortInsertion sort and shell sort
Insertion sort and shell sortPraveen Kumar
 
Memory allocation
Memory allocationMemory allocation
Memory allocationsanya6900
 

What's hot (8)

Dynamic memory allocation in c
Dynamic memory allocation in cDynamic memory allocation in c
Dynamic memory allocation in c
 
Ppt presentation of queues
Ppt presentation of queuesPpt presentation of queues
Ppt presentation of queues
 
array, function, pointer, pattern matching
array, function, pointer, pattern matchingarray, function, pointer, pattern matching
array, function, pointer, pattern matching
 
Introduction to programming in MATLAB
Introduction to programming in MATLABIntroduction to programming in MATLAB
Introduction to programming in MATLAB
 
Insertion sort and shell sort
Insertion sort and shell sortInsertion sort and shell sort
Insertion sort and shell sort
 
Memory allocation
Memory allocationMemory allocation
Memory allocation
 
Stack and queue
Stack and queueStack and queue
Stack and queue
 
Composite types
Composite typesComposite types
Composite types
 

Viewers also liked

Урок 16 для 2 класу - Пристрої різні, але всі корисні
 Урок 16 для 2 класу - Пристрої різні, але всі корисні Урок 16 для 2 класу - Пристрої різні, але всі корисні
Урок 16 для 2 класу - Пристрої різні, але всі корисніVsimPPT
 
Урок 18 для 6 класу - Поняття текстового документа
 Урок 18 для 6 класу -  Поняття текстового документа Урок 18 для 6 класу -  Поняття текстового документа
Урок 18 для 6 класу - Поняття текстового документаVsimPPT
 
Programación Xaruma 2015
Programación Xaruma 2015 Programación Xaruma 2015
Programación Xaruma 2015 AsocXaruma
 
Happy Mother's Day From Schuler Bauer Real Estate Services powered by ERA
Happy Mother's Day From Schuler Bauer Real Estate Services powered by ERAHappy Mother's Day From Schuler Bauer Real Estate Services powered by ERA
Happy Mother's Day From Schuler Bauer Real Estate Services powered by ERALauren Bauer
 
La educación y la sociedad del conocimiento
La educación y la sociedad  del conocimientoLa educación y la sociedad  del conocimiento
La educación y la sociedad del conocimientomariaga20
 
my new updated resume shivam(1)
my new updated resume shivam(1)my new updated resume shivam(1)
my new updated resume shivam(1)shivam singh
 
самоосвіта
самоосвітасамоосвіта
самоосвітаmrsvetlana01
 
Mehr Produktivität durch motivierte Post Editoren
Mehr Produktivität durch motivierte Post EditorenMehr Produktivität durch motivierte Post Editoren
Mehr Produktivität durch motivierte Post EditorenPatrick Bessler
 
인­터­넷­카­지­노­주소つづ「play5。co。kr 」ぴ인­터­넷­바­카­라­주소\\
인­터­넷­카­지­노­주소つづ「play5。co。kr 」ぴ인­터­넷­바­카­라­주소\\인­터­넷­카­지­노­주소つづ「play5。co。kr 」ぴ인­터­넷­바­카­라­주소\\
인­터­넷­카­지­노­주소つづ「play5。co。kr 」ぴ인­터­넷­바­카­라­주소\\gkstldn4561
 
6 клас (урок№7)Практична робота №2: «СКЛАДАННЯ АЛГОРИТМІВ ОПРАЦЮВАННЯ ПОДІЙ З...
6 клас (урок№7)Практична робота №2: «СКЛАДАННЯ АЛГОРИТМІВ ОПРАЦЮВАННЯ ПОДІЙ З...6 клас (урок№7)Практична робота №2: «СКЛАДАННЯ АЛГОРИТМІВ ОПРАЦЮВАННЯ ПОДІЙ З...
6 клас (урок№7)Практична робота №2: «СКЛАДАННЯ АЛГОРИТМІВ ОПРАЦЮВАННЯ ПОДІЙ З...Sanya Dzhedzhera
 
проект планета просить порятунку
проект планета просить порятункупроект планета просить порятунку
проект планета просить порятункуСофія Кріцак
 
6 Tendências de Marketing Digital para 2016
6 Tendências de Marketing Digital para 20166 Tendências de Marketing Digital para 2016
6 Tendências de Marketing Digital para 2016Mirago Marketing Digital
 
Adobe Animate CC: Introduction to Animation and Interactivity
Adobe Animate CC: Introduction to Animation and InteractivityAdobe Animate CC: Introduction to Animation and Interactivity
Adobe Animate CC: Introduction to Animation and InteractivityJoseph Labrecque
 
Plastico verde -_cana_de_açular_(1)
Plastico verde -_cana_de_açular_(1)Plastico verde -_cana_de_açular_(1)
Plastico verde -_cana_de_açular_(1)afermartins
 
Урок 19 для 10 класу - Форматування даних, комірок і діапазонів комірок. Прак...
Урок 19 для 10 класу - Форматування даних, комірок і діапазонів комірок. Прак...Урок 19 для 10 класу - Форматування даних, комірок і діапазонів комірок. Прак...
Урок 19 для 10 класу - Форматування даних, комірок і діапазонів комірок. Прак...VsimPPT
 

Viewers also liked (20)

Sociological
SociologicalSociological
Sociological
 
урок 15
урок 15урок 15
урок 15
 
Урок 16 для 2 класу - Пристрої різні, але всі корисні
 Урок 16 для 2 класу - Пристрої різні, але всі корисні Урок 16 для 2 класу - Пристрої різні, але всі корисні
Урок 16 для 2 класу - Пристрої різні, але всі корисні
 
Урок 18 для 6 класу - Поняття текстового документа
 Урок 18 для 6 класу -  Поняття текстового документа Урок 18 для 6 класу -  Поняття текстового документа
Урок 18 для 6 класу - Поняття текстового документа
 
Programación Xaruma 2015
Programación Xaruma 2015 Programación Xaruma 2015
Programación Xaruma 2015
 
Happy Mother's Day From Schuler Bauer Real Estate Services powered by ERA
Happy Mother's Day From Schuler Bauer Real Estate Services powered by ERAHappy Mother's Day From Schuler Bauer Real Estate Services powered by ERA
Happy Mother's Day From Schuler Bauer Real Estate Services powered by ERA
 
La educación y la sociedad del conocimiento
La educación y la sociedad  del conocimientoLa educación y la sociedad  del conocimiento
La educación y la sociedad del conocimiento
 
my new updated resume shivam(1)
my new updated resume shivam(1)my new updated resume shivam(1)
my new updated resume shivam(1)
 
самоосвіта
самоосвітасамоосвіта
самоосвіта
 
Mehr Produktivität durch motivierte Post Editoren
Mehr Produktivität durch motivierte Post EditorenMehr Produktivität durch motivierte Post Editoren
Mehr Produktivität durch motivierte Post Editoren
 
Урок 5
Урок 5Урок 5
Урок 5
 
인­터­넷­카­지­노­주소つづ「play5。co。kr 」ぴ인­터­넷­바­카­라­주소\\
인­터­넷­카­지­노­주소つづ「play5。co。kr 」ぴ인­터­넷­바­카­라­주소\\인­터­넷­카­지­노­주소つづ「play5。co。kr 」ぴ인­터­넷­바­카­라­주소\\
인­터­넷­카­지­노­주소つづ「play5。co。kr 」ぴ인­터­넷­바­카­라­주소\\
 
6 клас (урок№7)Практична робота №2: «СКЛАДАННЯ АЛГОРИТМІВ ОПРАЦЮВАННЯ ПОДІЙ З...
6 клас (урок№7)Практична робота №2: «СКЛАДАННЯ АЛГОРИТМІВ ОПРАЦЮВАННЯ ПОДІЙ З...6 клас (урок№7)Практична робота №2: «СКЛАДАННЯ АЛГОРИТМІВ ОПРАЦЮВАННЯ ПОДІЙ З...
6 клас (урок№7)Практична робота №2: «СКЛАДАННЯ АЛГОРИТМІВ ОПРАЦЮВАННЯ ПОДІЙ З...
 
проект планета просить порятунку
проект планета просить порятункупроект планета просить порятунку
проект планета просить порятунку
 
6 Tendências de Marketing Digital para 2016
6 Tendências de Marketing Digital para 20166 Tendências de Marketing Digital para 2016
6 Tendências de Marketing Digital para 2016
 
Comunicative competence
Comunicative competenceComunicative competence
Comunicative competence
 
Adobe Animate CC: Introduction to Animation and Interactivity
Adobe Animate CC: Introduction to Animation and InteractivityAdobe Animate CC: Introduction to Animation and Interactivity
Adobe Animate CC: Introduction to Animation and Interactivity
 
Plastico verde -_cana_de_açular_(1)
Plastico verde -_cana_de_açular_(1)Plastico verde -_cana_de_açular_(1)
Plastico verde -_cana_de_açular_(1)
 
Урок 19 для 10 класу - Форматування даних, комірок і діапазонів комірок. Прак...
Урок 19 для 10 класу - Форматування даних, комірок і діапазонів комірок. Прак...Урок 19 для 10 класу - Форматування даних, комірок і діапазонів комірок. Прак...
Урок 19 для 10 класу - Форматування даних, комірок і діапазонів комірок. Прак...
 
Bootstrap Fundamentals
Bootstrap FundamentalsBootstrap Fundamentals
Bootstrap Fundamentals
 

Similar to Variants Density along DNA Sequence

WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0WSO2
 
Modify your solution for PLP04 to allow the user to choose the shape.pdf
Modify your solution for PLP04 to allow the user to choose the shape.pdfModify your solution for PLP04 to allow the user to choose the shape.pdf
Modify your solution for PLP04 to allow the user to choose the shape.pdfhullibergerr25980
 
Summerization notes for descriptive statistics using r
Summerization notes for descriptive statistics using r Summerization notes for descriptive statistics using r
Summerization notes for descriptive statistics using r Ashwini Mathur
 
Input and output basic of c++ programming and escape sequences
Input and output basic of c++ programming and escape sequencesInput and output basic of c++ programming and escape sequences
Input and output basic of c++ programming and escape sequencesssuserf86fba
 
C programming language tutorial
C programming language tutorial C programming language tutorial
C programming language tutorial javaTpoint s
 
Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Maté Ongenaert
 
golang_getting_started.pptx
golang_getting_started.pptxgolang_getting_started.pptx
golang_getting_started.pptxGuy Komari
 
Go Programming Language (Golang)
Go Programming Language (Golang)Go Programming Language (Golang)
Go Programming Language (Golang)Ishin Vin
 
PPT ON VHDL subprogram,package,alias,use,generate and concurrent statments an...
PPT ON VHDL subprogram,package,alias,use,generate and concurrent statments an...PPT ON VHDL subprogram,package,alias,use,generate and concurrent statments an...
PPT ON VHDL subprogram,package,alias,use,generate and concurrent statments an...Khushboo Jain
 
An Overview of SystemVerilog for Design and Verification
An Overview of SystemVerilog  for Design and VerificationAn Overview of SystemVerilog  for Design and Verification
An Overview of SystemVerilog for Design and VerificationKapilRaghunandanTrip
 
Programming Without Coding Technology (PWCT) Environment
Programming Without Coding Technology (PWCT) EnvironmentProgramming Without Coding Technology (PWCT) Environment
Programming Without Coding Technology (PWCT) EnvironmentMahmoud Samir Fayed
 
Introduction to c
Introduction to cIntroduction to c
Introduction to camol_chavan
 
Stream Based Input Output
Stream Based Input OutputStream Based Input Output
Stream Based Input OutputBharat17485
 
Productionizing your Streaming Jobs
Productionizing your Streaming JobsProductionizing your Streaming Jobs
Productionizing your Streaming JobsDatabricks
 
Specification of the sequ commandCopyright © 2013 Bart Masse.docx
Specification of the sequ commandCopyright © 2013 Bart Masse.docxSpecification of the sequ commandCopyright © 2013 Bart Masse.docx
Specification of the sequ commandCopyright © 2013 Bart Masse.docxrafbolet0
 
C++ BinaryTree Help Creating main function for Trees...Here are .pdf
C++ BinaryTree Help  Creating main function for Trees...Here are .pdfC++ BinaryTree Help  Creating main function for Trees...Here are .pdf
C++ BinaryTree Help Creating main function for Trees...Here are .pdfforecastfashions
 
maxbox starter60 machine learning
maxbox starter60 machine learningmaxbox starter60 machine learning
maxbox starter60 machine learningMax Kleiner
 

Similar to Variants Density along DNA Sequence (20)

WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
WSO2 Product Release Webinar: WSO2 Complex Event Processor 4.0
 
Modify your solution for PLP04 to allow the user to choose the shape.pdf
Modify your solution for PLP04 to allow the user to choose the shape.pdfModify your solution for PLP04 to allow the user to choose the shape.pdf
Modify your solution for PLP04 to allow the user to choose the shape.pdf
 
Summerization notes for descriptive statistics using r
Summerization notes for descriptive statistics using r Summerization notes for descriptive statistics using r
Summerization notes for descriptive statistics using r
 
Input and output basic of c++ programming and escape sequences
Input and output basic of c++ programming and escape sequencesInput and output basic of c++ programming and escape sequences
Input and output basic of c++ programming and escape sequences
 
C programming language tutorial
C programming language tutorial C programming language tutorial
C programming language tutorial
 
Workshop NGS data analysis - 2
Workshop NGS data analysis - 2Workshop NGS data analysis - 2
Workshop NGS data analysis - 2
 
srgoc
srgocsrgoc
srgoc
 
golang_getting_started.pptx
golang_getting_started.pptxgolang_getting_started.pptx
golang_getting_started.pptx
 
Go Programming Language (Golang)
Go Programming Language (Golang)Go Programming Language (Golang)
Go Programming Language (Golang)
 
PPT ON VHDL subprogram,package,alias,use,generate and concurrent statments an...
PPT ON VHDL subprogram,package,alias,use,generate and concurrent statments an...PPT ON VHDL subprogram,package,alias,use,generate and concurrent statments an...
PPT ON VHDL subprogram,package,alias,use,generate and concurrent statments an...
 
An Overview of SystemVerilog for Design and Verification
An Overview of SystemVerilog  for Design and VerificationAn Overview of SystemVerilog  for Design and Verification
An Overview of SystemVerilog for Design and Verification
 
OpenGL ES 3.1 Reference Card
OpenGL ES 3.1 Reference CardOpenGL ES 3.1 Reference Card
OpenGL ES 3.1 Reference Card
 
Programming Without Coding Technology (PWCT) Environment
Programming Without Coding Technology (PWCT) EnvironmentProgramming Without Coding Technology (PWCT) Environment
Programming Without Coding Technology (PWCT) Environment
 
Theory1&amp;2
Theory1&amp;2Theory1&amp;2
Theory1&amp;2
 
Introduction to c
Introduction to cIntroduction to c
Introduction to c
 
Stream Based Input Output
Stream Based Input OutputStream Based Input Output
Stream Based Input Output
 
Productionizing your Streaming Jobs
Productionizing your Streaming JobsProductionizing your Streaming Jobs
Productionizing your Streaming Jobs
 
Specification of the sequ commandCopyright © 2013 Bart Masse.docx
Specification of the sequ commandCopyright © 2013 Bart Masse.docxSpecification of the sequ commandCopyright © 2013 Bart Masse.docx
Specification of the sequ commandCopyright © 2013 Bart Masse.docx
 
C++ BinaryTree Help Creating main function for Trees...Here are .pdf
C++ BinaryTree Help  Creating main function for Trees...Here are .pdfC++ BinaryTree Help  Creating main function for Trees...Here are .pdf
C++ BinaryTree Help Creating main function for Trees...Here are .pdf
 
maxbox starter60 machine learning
maxbox starter60 machine learningmaxbox starter60 machine learning
maxbox starter60 machine learning
 

More from Yoann Pageaud

Student Project - Accurate Computer-assisted Cell Culture using Ultrasounds
Student Project - Accurate Computer-assisted Cell Culture using UltrasoundsStudent Project - Accurate Computer-assisted Cell Culture using Ultrasounds
Student Project - Accurate Computer-assisted Cell Culture using UltrasoundsYoann Pageaud
 
Student Project - Accurate Computer-assisted Cell Culture using Ultrasounds
Student Project - Accurate Computer-assisted Cell Culture using UltrasoundsStudent Project - Accurate Computer-assisted Cell Culture using Ultrasounds
Student Project - Accurate Computer-assisted Cell Culture using UltrasoundsYoann Pageaud
 
Identification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesIdentification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesYoann Pageaud
 
CDS Filtering Program - User Manual
CDS Filtering Program - User ManualCDS Filtering Program - User Manual
CDS Filtering Program - User ManualYoann Pageaud
 
IGV manuel d'installation et démarrage sous windows 10 version française
IGV manuel d'installation et démarrage sous windows 10 version françaiseIGV manuel d'installation et démarrage sous windows 10 version française
IGV manuel d'installation et démarrage sous windows 10 version françaiseYoann Pageaud
 
How to manage your Experimental Protocol with Basic Statistics
How to manage your Experimental Protocol with Basic StatisticsHow to manage your Experimental Protocol with Basic Statistics
How to manage your Experimental Protocol with Basic StatisticsYoann Pageaud
 
3D Creations & Models update 25/07/2015
3D Creations & Models update 25/07/20153D Creations & Models update 25/07/2015
3D Creations & Models update 25/07/2015Yoann Pageaud
 
Introduction to the software Predicsites©
Introduction to the software Predicsites©Introduction to the software Predicsites©
Introduction to the software Predicsites©Yoann Pageaud
 
Présentation "De l'ADN au visage" - Colloque ESI 2015
Présentation "De l'ADN au visage" - Colloque ESI 2015Présentation "De l'ADN au visage" - Colloque ESI 2015
Présentation "De l'ADN au visage" - Colloque ESI 2015Yoann Pageaud
 
Creation of a synthetic organ - the skeletal striated muscle
Creation of a synthetic organ - the skeletal striated muscleCreation of a synthetic organ - the skeletal striated muscle
Creation of a synthetic organ - the skeletal striated muscleYoann Pageaud
 
Creation of synthetic organs - Ethical Issues
Creation of synthetic organs - Ethical IssuesCreation of synthetic organs - Ethical Issues
Creation of synthetic organs - Ethical IssuesYoann Pageaud
 

More from Yoann Pageaud (11)

Student Project - Accurate Computer-assisted Cell Culture using Ultrasounds
Student Project - Accurate Computer-assisted Cell Culture using UltrasoundsStudent Project - Accurate Computer-assisted Cell Culture using Ultrasounds
Student Project - Accurate Computer-assisted Cell Culture using Ultrasounds
 
Student Project - Accurate Computer-assisted Cell Culture using Ultrasounds
Student Project - Accurate Computer-assisted Cell Culture using UltrasoundsStudent Project - Accurate Computer-assisted Cell Culture using Ultrasounds
Student Project - Accurate Computer-assisted Cell Culture using Ultrasounds
 
Identification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesIdentification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databases
 
CDS Filtering Program - User Manual
CDS Filtering Program - User ManualCDS Filtering Program - User Manual
CDS Filtering Program - User Manual
 
IGV manuel d'installation et démarrage sous windows 10 version française
IGV manuel d'installation et démarrage sous windows 10 version françaiseIGV manuel d'installation et démarrage sous windows 10 version française
IGV manuel d'installation et démarrage sous windows 10 version française
 
How to manage your Experimental Protocol with Basic Statistics
How to manage your Experimental Protocol with Basic StatisticsHow to manage your Experimental Protocol with Basic Statistics
How to manage your Experimental Protocol with Basic Statistics
 
3D Creations & Models update 25/07/2015
3D Creations & Models update 25/07/20153D Creations & Models update 25/07/2015
3D Creations & Models update 25/07/2015
 
Introduction to the software Predicsites©
Introduction to the software Predicsites©Introduction to the software Predicsites©
Introduction to the software Predicsites©
 
Présentation "De l'ADN au visage" - Colloque ESI 2015
Présentation "De l'ADN au visage" - Colloque ESI 2015Présentation "De l'ADN au visage" - Colloque ESI 2015
Présentation "De l'ADN au visage" - Colloque ESI 2015
 
Creation of a synthetic organ - the skeletal striated muscle
Creation of a synthetic organ - the skeletal striated muscleCreation of a synthetic organ - the skeletal striated muscle
Creation of a synthetic organ - the skeletal striated muscle
 
Creation of synthetic organs - Ethical Issues
Creation of synthetic organs - Ethical IssuesCreation of synthetic organs - Ethical Issues
Creation of synthetic organs - Ethical Issues
 

Recently uploaded

Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2John Carlo Rollon
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantadityabhardwaj282
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxTwin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxEran Akiva Sinbar
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)DHURKADEVIBASKAR
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPirithiRaju
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555kikilily0909
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxMurugaveni B
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫qfactory1
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 

Recently uploaded (20)

Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2Evidences of Evolution General Biology 2
Evidences of Evolution General Biology 2
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are important
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxTwin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdfPests of soyabean_Binomics_IdentificationDr.UPR.pdf
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
 
‏‏VIRUS - 123455555555555555555555555555555555555555
‏‏VIRUS -  123455555555555555555555555555555555555555‏‏VIRUS -  123455555555555555555555555555555555555555
‏‏VIRUS - 123455555555555555555555555555555555555555
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫Manassas R - Parkside Middle School 🌎🏫
Manassas R - Parkside Middle School 🌎🏫
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 

Variants Density along DNA Sequence

  • 1. Variants Density along DNA Sequence Y. Pageaud 27 septembre 2016 User Guide: In this document you will find all needed informations to use the script ‘Variants_Density_Along_DNA_Sequence.R’. This R script allow you to plot the density of variants contained in a VCF file along the DNA sequence you have annotated (Chromosome, Scaffold, Contig, Whole Genome. . . ). Compatibilities: Ubuntu 16.04 LTS R version 3.2.3 (2015-12-10) – “Wooden Christmas-Tree” VCF File Format VCFv4.2 Prerequisite: Install R version 3.2.3 or later version. Optionally you can install RStudio after. Install R packages ‘VariantAnnotation’ and ‘plyr’ from the website Bioconductor: https://www.bioconductor.org/ Package Loading: library(VariantAnnotation) library(plyr) VCF File Example: You can test the script on an example of a VCF file (‘Example.VCF’) in the archive .rar. Set the Working Directory: Put your working directory (where your VCF file is saved) between quotation marks: setwd("/home/user/your_folder/") Loads your VCF File: vcf <- readVcf("yourfile.vcf","species") 1
  • 2. DNA Sequence Variants Extraction: Data_Frame_ranges<-as.data.frame(ranges(vcf)) sequence<-Data_Frame_ranges[grep("^sequence_name",Data_Frame_ranges$names),] Variants Density Histograms Along DNA Sequence: Function hist() parameters: • Plot title: main = ‘string’ • Start position: start_position = int; • End position: end_position = int • Limits of the Y axis: ylim = c(float/int,float/int). Useful for thresholding the minimum density of variants. • Size of the sliding window: change the number of breakpoints of your histogram: histogram_breakpoints = int ; to increase its size choose a smaller value, to decrease its size choose a higher value. the window size depends on the length of the selected region of your sequence, and on the number of breakpoints chosen. Function axis() parameters: • Start position: start_position = int; • End position: end_position = int • Accuracy or step value: change the graduations of the X axis: step_accuracy = int • Rounding parameters: round_any(. . . ,f=floor/ceiling/trunc) ; select floor if you want to round to the lower step value ; select ceiling if you want to round to the upper step value. Default Parameters: Matrix_Position_Sequence<-as.matrix(sequence[1]) start_position = head(sequence[[2]],n=1L) end_position = tail(sequence[[2]],n=1L) histogram_breakpoints = 347 window_size = round((end_position - start_position)/histogram_breakpoints) hist(Matrix_Position_Sequence, xlim=c(start_position,end_position), breaks = histogram_breakpoints, col='blue', xaxt="n", border='blue', ylim=c(0,10), main='Density of Variants along a DNA Sequence', xlab='Positions on Sequence (bp)', ylab=paste("Variants Density (Sliding Window of ~",window_size,"bp)",sep=" ")) 2
  • 3. step_accuracy=10**4 axis(1, at = seq(from = round_any(start_position, step_accuracy, f=floor), to = round_any(end_position, step_accuracy, f=ceiling), by = round_any(end_position, step_accuracy, f=ceiling)/ ((1/step_accuracy)* round_any(end_position, step_accuracy, f=ceiling)))) Density of Variants along a DNA Sequence Positions on Sequence (bp) VariantsDensity(SlidingWindowof~100bp) 0246810 10000 20000 30000 40000 For any question, feel free to ask me by E-mail: yoann.pageaud@gmail.com References: “The Variant Call Format (VCF) Version 4.2 Specification” January 26, 2015. https://github. com/samtools/hts-specs 3