This document provides instructions for using RawGeno, an R package for automating the scoring of amplified fragment length polymorphism (AFLP) datasets. It describes how to organize an AFLP project, install the necessary software including PeakScanner and RawGeno, and use RawGeno's functions to analyze electropherograms, score and bin AFLP peaks, quality check results, and export scored datasets for further analysis. The document provides step-by-step guidance on setting up and running an AFLP analysis with RawGeno.
This document provides an overview of the RAPTOR protein structure prediction software developed by Bioinformatics Solutions Inc. It describes the installation process, user interface, and how to perform a sample run of the software. Key information covered includes installing required files and registering the software, navigating the menu system and output panels, configuring a run using different threading methods and databases, and interpreting the output results including alignments and structural predictions.
This document provides instructions on how to install and run the R statistical software. It describes downloading and installing the latest version of R for Windows, Mac, or Linux computers. It also explains how to install additional R packages, including Bioconductor packages, which are useful for bioinformatics analysis. The document is intended as a simple introduction to getting started with the R programming environment for statistical analysis and time series work.
This document provides instructions on how to install and use the R statistical software package for time series analysis. It covers downloading and installing R on Windows, checking the latest R version, and installing additional R packages, including Bioconductor packages. The document is intended as a simple introduction to getting started with R for time series analysis.
DEF CON 27 - workshop - ISAAC EVANS - discover exploit and eradicate entire v...Felipe Prado
This document provides documentation for the r2c analysis platform and command line interface (CLI). It describes how to install and set up the r2c CLI, create an example analyzer, write analysis code using Python, run the analyzer locally on a test codebase, and publish the analyzer to the r2c platform to run at larger scale. The example analyzer counts the percentage of whitespace in JavaScript files to identify potentially minified code. The document guides the reader through each step of developing and testing an analyzer locally before publishing it for cloud-based analysis.
This document provides an introduction to using the statistical software R. It discusses using both the command line interface and RStudio integrated development environment. It covers getting help, setting the working directory, loading libraries, and the basic data structures in R including vectors, matrices, arrays, data frames and lists. Vectors are the most basic data structure and can contain elements of a single type such as numeric, character, logical, etc. The document provides examples of creating, assigning, and testing vector objects.
Description of the status of design and developing activities of the data analysis software of SuperAGILE instrument of the AGILE Space Mission, one year before the launch.
Shen Lu worked on analyzing next generation sequencing data over 14 weeks. The first few weeks were spent adding background knowledge, converting SRA files to FASTQ format using fastq-dump, and performing quality control checks on the raw data using FASTX toolkit and Trimmomatic. Later weeks involved mapping reads to the reference genome using TopHat, Bowtie and Samtools, estimating transcription levels with Cufflinks, using supercomputers to facilitate analysis, and configuring databases to store and visualize the results.
The document provides installation instructions for integrating the R extension with RapidMiner. It explains how to install R and the rJava package, configure environment variables to enable RapidMiner to access R libraries, and select the correct rJava library file during RapidMiner configuration. It also gives an overview of the key capabilities of the R perspective in RapidMiner like executing R code and scripts, saving variables to a repository, and plotting capabilities. The acknowledgments section recognizes contributors to the extension's development.
This document provides an overview of the RAPTOR protein structure prediction software developed by Bioinformatics Solutions Inc. It describes the installation process, user interface, and how to perform a sample run of the software. Key information covered includes installing required files and registering the software, navigating the menu system and output panels, configuring a run using different threading methods and databases, and interpreting the output results including alignments and structural predictions.
This document provides instructions on how to install and run the R statistical software. It describes downloading and installing the latest version of R for Windows, Mac, or Linux computers. It also explains how to install additional R packages, including Bioconductor packages, which are useful for bioinformatics analysis. The document is intended as a simple introduction to getting started with the R programming environment for statistical analysis and time series work.
This document provides instructions on how to install and use the R statistical software package for time series analysis. It covers downloading and installing R on Windows, checking the latest R version, and installing additional R packages, including Bioconductor packages. The document is intended as a simple introduction to getting started with R for time series analysis.
DEF CON 27 - workshop - ISAAC EVANS - discover exploit and eradicate entire v...Felipe Prado
This document provides documentation for the r2c analysis platform and command line interface (CLI). It describes how to install and set up the r2c CLI, create an example analyzer, write analysis code using Python, run the analyzer locally on a test codebase, and publish the analyzer to the r2c platform to run at larger scale. The example analyzer counts the percentage of whitespace in JavaScript files to identify potentially minified code. The document guides the reader through each step of developing and testing an analyzer locally before publishing it for cloud-based analysis.
This document provides an introduction to using the statistical software R. It discusses using both the command line interface and RStudio integrated development environment. It covers getting help, setting the working directory, loading libraries, and the basic data structures in R including vectors, matrices, arrays, data frames and lists. Vectors are the most basic data structure and can contain elements of a single type such as numeric, character, logical, etc. The document provides examples of creating, assigning, and testing vector objects.
Description of the status of design and developing activities of the data analysis software of SuperAGILE instrument of the AGILE Space Mission, one year before the launch.
Shen Lu worked on analyzing next generation sequencing data over 14 weeks. The first few weeks were spent adding background knowledge, converting SRA files to FASTQ format using fastq-dump, and performing quality control checks on the raw data using FASTX toolkit and Trimmomatic. Later weeks involved mapping reads to the reference genome using TopHat, Bowtie and Samtools, estimating transcription levels with Cufflinks, using supercomputers to facilitate analysis, and configuring databases to store and visualize the results.
The document provides installation instructions for integrating the R extension with RapidMiner. It explains how to install R and the rJava package, configure environment variables to enable RapidMiner to access R libraries, and select the correct rJava library file during RapidMiner configuration. It also gives an overview of the key capabilities of the R perspective in RapidMiner like executing R code and scripts, saving variables to a repository, and plotting capabilities. The acknowledgments section recognizes contributors to the extension's development.
This document discusses how to create a private version of CRAN packages behind a company firewall for internal use. It describes using the miniCRAN R package to selectively download and create a local mirror of specific CRAN packages and their dependencies. This allows controlling the packages available internally. It also discusses using MRAN and the RRT package to reproduce analyses and ensure scripts work across package and R version updates.
This document discusses using machine learning to classify malware into families based on the DREBIN dataset. It covers:
1. Preprocessing the dataset, including integer encoding and one-hot encoding to convert categorical data to numeric form for modeling.
2. Addressing overfitting by splitting the data into training and test sets and using cross-validation.
3. Using classifiers like Random Forest and SVM with strategies like one-vs-all and one-vs-one to perform multiclass classification of malware families.
4. The process of using binary classifiers for each family first, then combining the results to classify malware into the appropriate family.
This document provides instructions for installing and using PhyloPipe v1.1, a phylogenetic analysis software package. It describes how to install prerequisite programs like HaMStR, MAFFT, and RAxMLGUI. It explains the directory structure and how to run PhyloPipe directly or modify the project files. Usage instructions are provided for the main window and settings dialog where paths to programs can be configured. The document aims to help users get started with PhyloPipe for phylogenetic analysis.
Hb 1486-001 1074970 qsg-gene_readdataanalysis_1112Elsa von Licy
The document provides instructions for analyzing sequencing reads generated from a GeneRead DNAseq Gene Panel kit using targeted exon enrichment. It describes uploading reads to the analysis software, setting analysis parameters, how reads are mapped to the reference genome and variants identified, and definitions of terms in the variant report. Minimum system requirements are a web browser like Firefox or Chrome, as the analysis is web-based. The software maps reads, identifies variants, annotates variants, and outputs summary files and a VCF file containing called variants.
A Handbook Of Statistical Analyses Using RNicole Adams
This document provides an introduction to using R for statistical analysis. It discusses what R is, how to install R and packages, and basic data structures in R like vectors, factors, and data frames. The Forbes 2000 dataset is used as a running example to demonstrate working with data in R. It contains information on 2000 companies, including their ranking, name, country, business category, sales, profits, assets, and market value.
This document provides an introduction to the R programming language. It covers downloading and installing R, the basic syntax and functions of R like vectors and algebra, importing and exporting data, plotting graphs, and tips for writing good R code. The document is intended to teach readers enough R to complete most common data analysis tasks. It assumes no prior programming experience.
This document discusses Nmap and Zenmap, two network scanning tools. It provides an overview of Nmap's features for discovering hosts, services, and operating systems. It also describes Zenmap as the graphical user interface for Nmap, and how to install, use, save scans, and compare results with Zenmap. The document recommends ways to secure a network against these tools, such as closing unused ports and services, and masking the operating system.
CDS Filtering is a little python program allowing you to filter your predicted ORFs, CDSs, RNA, or any other type of DNA or RNA sequences.
Download Link for CDS Filtering Program : https://goo.gl/RiAtC0
This document provides an introduction to the Robot Operating System (ROS) by explaining its key components:
1) The ROS computation graph which divides robot control software into independent nodes that communicate via topics and services.
2) The ROS file system which organizes code into packages and manages dependencies between packages using tools like catkin.
3) It provides instructions for setting up a new ROS environment and navigating the ROS file system to understand packages.
NRPE - Nagios Remote Plugin Executor. NRPE plugin for Nagios Core 4 and others.Marc Trimble
The NRPE documentation provides instructions for installing and configuring the Nagios Remote Plugin Executor (NRPE) to allow the Nagios monitoring system to check system resources on remote Linux/Unix machines. It describes installing the NRPE daemon on the remote host, along with Nagios plugins. It also covers installing the check_nrpe plugin on the Nagios monitoring host and configuring it to communicate with the NRPE daemon. Host and service definitions are created to define the remote machine and the specific system metrics that will be monitored, such as CPU load, disk usage, and number of users.
This document provides an overview and instructions for installing and using Python on Arch Linux. It discusses the differences between Python 2 and Python 3, how to install each version, and how to deal with version conflicts. It also lists several integrated development environments (IDEs) for Python programming and describes various widget toolkits available for building graphical user interfaces.
[DOC] Java - Code Analysis using SonarQubeAngelin R
This document discusses how to set up and use SonarQube to analyze code quality for both Maven and non-Maven projects. It describes installing the SonarQube server and runner, configuring the server, and setting up projects for analysis by adding a sonar-project.properties file. The document then explains how to generate a SonarQube report by running the sonar-runner on a project and view results on the SonarQube server dashboard.
Java Source Code Analysis using SonarQubeAngelin R
This document discusses how to set up and use SonarQube to analyze code quality for both Maven and non-Maven projects. It describes installing the SonarQube server and runner, configuring the server, and setting up projects for analysis by adding a sonar-project.properties file. It then explains how to generate a SonarQube report by running the sonar-runner on a project and view results on the SonarQube server dashboard.
Language-agnostic data analysis workflows and reproducible researchAndrew Lowe
This was a talk that I gave at CERN at the Inter-experimental Machine Learning (IML) Working Group Meeting in April 2017 about language-agnostic (or polyglot) analysis workflows. I show how it is possible to work in multiple languages and switch between them without leaving the workflow you started. Additionally, I demonstrate how an entire workflow can be encapsulated in a markdown file that is rendered to a publishable paper with cross-references and a bibliography (and with raw LaTeX file produced as a by-product) in a simple process, making the whole analysis workflow reproducible. For experimental particle physics, ROOT is the ubiquitous data analysis tool, and has been for the last 20 years old, so I also talk about how to exchange data to and from ROOT.
This document provides an introduction to R, including what R is, how to install and use it, common mistakes, and data structures. It discusses that R was created by Ross Ihaka and Robert Gentleman and contains over 10,000 user-developed packages on topics like statistics, graphics, and data analysis. It also provides instructions on installing R from its homepage or a Italian download site, using the R console and R Studio interfaces, the workspace environment, and saving workspaces to preserve data between sessions.
The document is a user's guide for Revo Uninstaller software that helps uninstall programs and clean up leftover files. It provides an overview of the software's features, including uninstalling programs, scanning for leftover files, and additional tools for cleaning browsers, Windows, and removing private data. The guide outlines the main interface components and provides instructions for common tasks like uninstalling a program, scanning for leftover files, and deleting found items.
This document provides instructions for installing and configuring Snort 2.9.6 and DAQ 2.0 on CentOS 6.3/6.4 running in a VirtualBox virtual machine. It describes compiling and installing necessary libraries like libpcap and libdnet. It then provides commands for extracting, configuring, compiling and installing DAQ and Snort. Finally it discusses configuring Snort configuration files, adding the Snort user, and providing a script to start and stop Snort.
Title.JPG (502607) Essay Cover Page, Essay Title PagAlicia Buske
The document provides instructions for requesting writing assistance from HelpWriting.net. It outlines a 5-step process: 1) Create an account with an email and password. 2) Complete a order form with instructions, sources, and deadline. 3) Review bids from writers and select one. 4) Review the completed paper and authorize payment. 5) Request revisions until satisfied, with a refund option for plagiarism. The service utilizes a bidding system and promises original, high-quality content.
The document provides instructions for creating an account and submitting assignment requests on the HelpWriting.net website. It is a 5-step process: 1) Create an account with a password and email. 2) Complete a form with assignment details. 3) Review bids from writers and select one. 4) Review the completed paper and authorize payment. 5) Request revisions to ensure satisfaction, with refunds offered for plagiarized work.
More Related Content
Similar to Automatize Scoring Of AFLP Datasets With RawGeno A Free R CRAN Library
This document discusses how to create a private version of CRAN packages behind a company firewall for internal use. It describes using the miniCRAN R package to selectively download and create a local mirror of specific CRAN packages and their dependencies. This allows controlling the packages available internally. It also discusses using MRAN and the RRT package to reproduce analyses and ensure scripts work across package and R version updates.
This document discusses using machine learning to classify malware into families based on the DREBIN dataset. It covers:
1. Preprocessing the dataset, including integer encoding and one-hot encoding to convert categorical data to numeric form for modeling.
2. Addressing overfitting by splitting the data into training and test sets and using cross-validation.
3. Using classifiers like Random Forest and SVM with strategies like one-vs-all and one-vs-one to perform multiclass classification of malware families.
4. The process of using binary classifiers for each family first, then combining the results to classify malware into the appropriate family.
This document provides instructions for installing and using PhyloPipe v1.1, a phylogenetic analysis software package. It describes how to install prerequisite programs like HaMStR, MAFFT, and RAxMLGUI. It explains the directory structure and how to run PhyloPipe directly or modify the project files. Usage instructions are provided for the main window and settings dialog where paths to programs can be configured. The document aims to help users get started with PhyloPipe for phylogenetic analysis.
Hb 1486-001 1074970 qsg-gene_readdataanalysis_1112Elsa von Licy
The document provides instructions for analyzing sequencing reads generated from a GeneRead DNAseq Gene Panel kit using targeted exon enrichment. It describes uploading reads to the analysis software, setting analysis parameters, how reads are mapped to the reference genome and variants identified, and definitions of terms in the variant report. Minimum system requirements are a web browser like Firefox or Chrome, as the analysis is web-based. The software maps reads, identifies variants, annotates variants, and outputs summary files and a VCF file containing called variants.
A Handbook Of Statistical Analyses Using RNicole Adams
This document provides an introduction to using R for statistical analysis. It discusses what R is, how to install R and packages, and basic data structures in R like vectors, factors, and data frames. The Forbes 2000 dataset is used as a running example to demonstrate working with data in R. It contains information on 2000 companies, including their ranking, name, country, business category, sales, profits, assets, and market value.
This document provides an introduction to the R programming language. It covers downloading and installing R, the basic syntax and functions of R like vectors and algebra, importing and exporting data, plotting graphs, and tips for writing good R code. The document is intended to teach readers enough R to complete most common data analysis tasks. It assumes no prior programming experience.
This document discusses Nmap and Zenmap, two network scanning tools. It provides an overview of Nmap's features for discovering hosts, services, and operating systems. It also describes Zenmap as the graphical user interface for Nmap, and how to install, use, save scans, and compare results with Zenmap. The document recommends ways to secure a network against these tools, such as closing unused ports and services, and masking the operating system.
CDS Filtering is a little python program allowing you to filter your predicted ORFs, CDSs, RNA, or any other type of DNA or RNA sequences.
Download Link for CDS Filtering Program : https://goo.gl/RiAtC0
This document provides an introduction to the Robot Operating System (ROS) by explaining its key components:
1) The ROS computation graph which divides robot control software into independent nodes that communicate via topics and services.
2) The ROS file system which organizes code into packages and manages dependencies between packages using tools like catkin.
3) It provides instructions for setting up a new ROS environment and navigating the ROS file system to understand packages.
NRPE - Nagios Remote Plugin Executor. NRPE plugin for Nagios Core 4 and others.Marc Trimble
The NRPE documentation provides instructions for installing and configuring the Nagios Remote Plugin Executor (NRPE) to allow the Nagios monitoring system to check system resources on remote Linux/Unix machines. It describes installing the NRPE daemon on the remote host, along with Nagios plugins. It also covers installing the check_nrpe plugin on the Nagios monitoring host and configuring it to communicate with the NRPE daemon. Host and service definitions are created to define the remote machine and the specific system metrics that will be monitored, such as CPU load, disk usage, and number of users.
This document provides an overview and instructions for installing and using Python on Arch Linux. It discusses the differences between Python 2 and Python 3, how to install each version, and how to deal with version conflicts. It also lists several integrated development environments (IDEs) for Python programming and describes various widget toolkits available for building graphical user interfaces.
[DOC] Java - Code Analysis using SonarQubeAngelin R
This document discusses how to set up and use SonarQube to analyze code quality for both Maven and non-Maven projects. It describes installing the SonarQube server and runner, configuring the server, and setting up projects for analysis by adding a sonar-project.properties file. The document then explains how to generate a SonarQube report by running the sonar-runner on a project and view results on the SonarQube server dashboard.
Java Source Code Analysis using SonarQubeAngelin R
This document discusses how to set up and use SonarQube to analyze code quality for both Maven and non-Maven projects. It describes installing the SonarQube server and runner, configuring the server, and setting up projects for analysis by adding a sonar-project.properties file. It then explains how to generate a SonarQube report by running the sonar-runner on a project and view results on the SonarQube server dashboard.
Language-agnostic data analysis workflows and reproducible researchAndrew Lowe
This was a talk that I gave at CERN at the Inter-experimental Machine Learning (IML) Working Group Meeting in April 2017 about language-agnostic (or polyglot) analysis workflows. I show how it is possible to work in multiple languages and switch between them without leaving the workflow you started. Additionally, I demonstrate how an entire workflow can be encapsulated in a markdown file that is rendered to a publishable paper with cross-references and a bibliography (and with raw LaTeX file produced as a by-product) in a simple process, making the whole analysis workflow reproducible. For experimental particle physics, ROOT is the ubiquitous data analysis tool, and has been for the last 20 years old, so I also talk about how to exchange data to and from ROOT.
This document provides an introduction to R, including what R is, how to install and use it, common mistakes, and data structures. It discusses that R was created by Ross Ihaka and Robert Gentleman and contains over 10,000 user-developed packages on topics like statistics, graphics, and data analysis. It also provides instructions on installing R from its homepage or a Italian download site, using the R console and R Studio interfaces, the workspace environment, and saving workspaces to preserve data between sessions.
The document is a user's guide for Revo Uninstaller software that helps uninstall programs and clean up leftover files. It provides an overview of the software's features, including uninstalling programs, scanning for leftover files, and additional tools for cleaning browsers, Windows, and removing private data. The guide outlines the main interface components and provides instructions for common tasks like uninstalling a program, scanning for leftover files, and deleting found items.
This document provides instructions for installing and configuring Snort 2.9.6 and DAQ 2.0 on CentOS 6.3/6.4 running in a VirtualBox virtual machine. It describes compiling and installing necessary libraries like libpcap and libdnet. It then provides commands for extracting, configuring, compiling and installing DAQ and Snort. Finally it discusses configuring Snort configuration files, adding the Snort user, and providing a script to start and stop Snort.
Similar to Automatize Scoring Of AFLP Datasets With RawGeno A Free R CRAN Library (20)
Title.JPG (502607) Essay Cover Page, Essay Title PagAlicia Buske
The document provides instructions for requesting writing assistance from HelpWriting.net. It outlines a 5-step process: 1) Create an account with an email and password. 2) Complete a order form with instructions, sources, and deadline. 3) Review bids from writers and select one. 4) Review the completed paper and authorize payment. 5) Request revisions until satisfied, with a refund option for plagiarism. The service utilizes a bidding system and promises original, high-quality content.
The document provides instructions for creating an account and submitting assignment requests on the HelpWriting.net website. It is a 5-step process: 1) Create an account with a password and email. 2) Complete a form with assignment details. 3) Review bids from writers and select one. 4) Review the completed paper and authorize payment. 5) Request revisions to ensure satisfaction, with refunds offered for plagiarized work.
The document discusses the steps involved in requesting and receiving a custom paper writing service from the website HelpWriting.net. It outlines 5 steps: 1) Create an account with a password and email. 2) Complete a 10-minute order form providing instructions, sources, and deadline. 3) Review bids from writers and select one. 4) Review the completed paper and authorize payment if satisfied. 5) Request revisions to ensure satisfaction, and the company offers refunds for plagiarized work.
Expert Help With A Reaction Paper For YouAlicia Buske
The document discusses online sports gambling and daily fantasy sports sites like DraftKings and FanDuel. It notes that the New York attorney general issued a cease and desist order against the two companies, arguing their activities constitute illegal gambling under state law. The order aims to stop the companies from accepting bets the attorney general views as illegal. This action has made the attorney general unpopular with the multibillion dollar daily fantasy sports industry.
GRE Sample Essays To Nail The GRE Analytical Writing SectionAlicia Buske
Here are a few key reasons why people use natural gas:
- Cost. Natural gas is generally cheaper than other energy sources like electricity or gasoline, making it an affordable option for heating homes and powering appliances. As an abundant domestic resource, it helps control energy costs.
- Convenience. Natural gas infrastructure is widely available through pipelines, allowing homes and businesses direct access to receive reliable deliveries. It's a convenient fuel that doesn't require on-site storage tanks.
- Clean burning. When combusted, natural gas emits less carbon dioxide than coal or petroleum, producing fewer emissions. It burns very cleanly and is a lower-carbon energy source.
- Versatility. Natural gas is used for multiple
The document provides instructions for using a writing service called HelpWriting.net. It outlines 5 steps: 1) Create an account with a password and email. 2) Complete a 10-minute order form with instructions, sources, and deadline. 3) Review bids from writers and choose one. 4) Review the completed paper and authorize payment if pleased. 5) Papers can receive multiple revisions to ensure satisfaction. The service promises original, high-quality work with refunds for plagiarism.
How To Read A Scientific Paper Research4LifeAlicia Buske
The document discusses how to live a happy and simple life. It argues that true happiness does not require wealth, as one can be poor yet happy. Living simply requires patience and dedicating to reducing desires to only what is needed. While some claim money buys happiness, those who are happy with little disprove that claim. Social media can also impact happiness negatively if it increases depression from online bullying or knowing too much about events with no personal effect. Overall, reducing desires and focusing on inner contentment rather than material goods allows for a happy and simple life.
TRUESTORE A4 Designer Project Sheet Art And CrafAlicia Buske
The Communist Manifesto, written by Karl Marx and Friedrich Engels in the 1840s, aimed to end class struggle brought about by the Industrial Revolution. As industries grew, more workers became proletariats who had to leave their previous occupations to work in factories under unhealthy conditions. Their communities were disrupted as they moved to cities near industries or even other countries. Marx and Engels sought to address the plight of the working classes through their influential document.
How To Write An Effective Research Paper Abstract FAlicia Buske
The document discusses the steps to request and receive writing assistance from the HelpWriting.net service, including registering an account, completing an order form with instructions and deadline, and reviewing writer bids before authorizing payment upon receiving a satisfactory completed paper. The process involves writers bidding on requests and customers choosing a writer based on qualifications before a deposit starts the writing. Customers can also request revisions to ensure satisfaction.
FREE 22 Sample Essay Templates In MS Word GAlicia Buske
This document provides instructions for students to get writing assistance from HelpWriting.net. It involves a 5-step process: 1) Create an account with a password and email. 2) Complete a 10-minute order form providing instructions, sources, and deadline. 3) Review bids from writers and select one. 4) Review the completed paper and authorize payment. 5) Request revisions until satisfied. The site promises original, high-quality content or a full refund.
The document discusses developing effective marketing strategies for entering the Japanese market. It notes that Japan has the world's third largest economy, making it attractive to international exporters and investors. The paper will review literature on crafting marketing strategies for success in Japan.
Teaching Writing At KS1 Teaching ResourcesAlicia Buske
This document provides instructions for requesting writing assistance from HelpWriting.net. It outlines a 5-step process: 1) Create an account with a password and email. 2) Complete a 10-minute order form providing instructions, sources, and deadline. 3) Review bids from writers and choose one based on qualifications. 4) Review the completed paper and authorize payment if satisfied. 5) Request revisions to ensure satisfaction, with a full refund option for plagiarized work. The service aims to fully meet customer needs through an original, high-quality paper.
People In Prison Explain What Music Means To Them And HAlicia Buske
The document summarizes Barbara Ehrenreich's book "Bait and Switch" which examines the state of white-collar jobs in America. It discusses how Ehrenreich went undercover to experience the job search process firsthand. The document notes that according to Ehrenreich's research, job stability is declining as the average person born during the baby boom held around 10.5 jobs between 18-40, more than previous generations. Ehrenreich also found that the first job obtained after graduation is unlikely to be one's last job.
Want To Improve Your Writing Take These FREAlicia Buske
1. The document provides instructions for getting writing help from HelpWriting.net in 5 steps - registering an account, completing an order form, reviewing writer bids and choosing one, reviewing and authorizing payment for completed work, and requesting revisions if needed.
2. Writers on the platform use a bidding system, and customers can choose a writer based on qualifications, order history, and feedback to work on their assignment.
3. Customers can request revisions to ensure satisfaction with their completed assignment, and HelpWriting.net promises original, high-quality work or a full refund if plagiarized.
Reflective Essay On Personal Development PlaAlicia Buske
The document discusses the effects of new technologies that emerged during the Middle Ages and Renaissance, including the telescope and printed book. The telescope was invented by Galileo, who published his findings about other planets and galaxies in his influential book "The Starry Messenger". However, Galileo's findings contradicted the Catholic Church's beliefs and his book was banned. Overall, new technologies began changing religious and scientific understanding during this transitional period.
Conclusion Paragraph How To Write A Conclusion Paragraph WitAlicia Buske
This document discusses different types of followers and leadership styles. It describes four groups of ineffective followers: sheep who lack initiative, yes people who refuse disagreement, alienated followers who think independently but resent leadership, and survivors who prioritize self-preservation over all else. These types of followers can hinder good leadership. The document also discusses transformational, pseudotransformational, and transactional leadership styles and analyzes Jim Jones' leadership as an example of pseudotransformational leadership, which appears transformational but is actually self-serving and exploitative.
The document outlines the steps a customer would take to use an admission essay writing service, including creating an account, submitting a request form with instructions and sources, and choosing a writer to complete the assignment who would then submit a draft for the customer's review and approval before finalizing payment. The process aims to match customers with qualified writers based on bids to complete admission essay writing requests while allowing revisions to ensure customer satisfaction.
The document provides instructions for seeking analytical essay help from HelpWriting.net in 5 steps:
1. Create an account with a password and email.
2. Complete a 10-minute order form providing instructions, sources, and deadline.
3. Review bids from writers and choose one based on qualifications.
4. Review the completed paper and authorize payment if satisfied.
5. Request revisions to ensure satisfaction, with a full refund option for plagiarism.
African Cloth, Export Production, and Secondhand Clothing in Kenya.pdfAlicia Buske
This document summarizes and analyzes three intersecting clothing systems in Kenya: 1) African print cloth traditionally produced for domestic consumption but now in decline; 2) more recently established export-oriented mass-produced clothing primarily for Western markets; and 3) the importation and sale of secondhand clothing from Western markets. It discusses how trade agreements like AGOA have supported growth in Kenya's export apparel sector but reliance on Asian textile imports and competition from secondhand clothing contributed to the collapse of domestic cloth production. The interrelationships between these three clothing systems are reshaping textile production, trade, and consumption in Kenya.
A Paradigm Analysis of Arts-Based Research and Implications for Education.pdfAlicia Buske
This article analyzes arts-based research as a paradigm and discusses its implications for education. It explores how arts-based research uses artistic methods and expressions to understand phenomena in ways that may complement more traditional research approaches. The article suggests arts-based research could provide new insights for educational research by accessing forms of understanding not attainable through conventional methods alone.
How to Make a Field Mandatory in Odoo 17Celine George
In Odoo, making a field required can be done through both Python code and XML views. When you set the required attribute to True in Python code, it makes the field required across all views where it's used. Conversely, when you set the required attribute in XML views, it makes the field required only in the context of that particular view.
Chapter wise All Notes of First year Basic Civil Engineering.pptxDenish Jangid
Chapter wise All Notes of First year Basic Civil Engineering
Syllabus
Chapter-1
Introduction to objective, scope and outcome the subject
Chapter 2
Introduction: Scope and Specialization of Civil Engineering, Role of civil Engineer in Society, Impact of infrastructural development on economy of country.
Chapter 3
Surveying: Object Principles & Types of Surveying; Site Plans, Plans & Maps; Scales & Unit of different Measurements.
Linear Measurements: Instruments used. Linear Measurement by Tape, Ranging out Survey Lines and overcoming Obstructions; Measurements on sloping ground; Tape corrections, conventional symbols. Angular Measurements: Instruments used; Introduction to Compass Surveying, Bearings and Longitude & Latitude of a Line, Introduction to total station.
Levelling: Instrument used Object of levelling, Methods of levelling in brief, and Contour maps.
Chapter 4
Buildings: Selection of site for Buildings, Layout of Building Plan, Types of buildings, Plinth area, carpet area, floor space index, Introduction to building byelaws, concept of sun light & ventilation. Components of Buildings & their functions, Basic concept of R.C.C., Introduction to types of foundation
Chapter 5
Transportation: Introduction to Transportation Engineering; Traffic and Road Safety: Types and Characteristics of Various Modes of Transportation; Various Road Traffic Signs, Causes of Accidents and Road Safety Measures.
Chapter 6
Environmental Engineering: Environmental Pollution, Environmental Acts and Regulations, Functional Concepts of Ecology, Basics of Species, Biodiversity, Ecosystem, Hydrological Cycle; Chemical Cycles: Carbon, Nitrogen & Phosphorus; Energy Flow in Ecosystems.
Water Pollution: Water Quality standards, Introduction to Treatment & Disposal of Waste Water. Reuse and Saving of Water, Rain Water Harvesting. Solid Waste Management: Classification of Solid Waste, Collection, Transportation and Disposal of Solid. Recycling of Solid Waste: Energy Recovery, Sanitary Landfill, On-Site Sanitation. Air & Noise Pollution: Primary and Secondary air pollutants, Harmful effects of Air Pollution, Control of Air Pollution. . Noise Pollution Harmful Effects of noise pollution, control of noise pollution, Global warming & Climate Change, Ozone depletion, Greenhouse effect
Text Books:
1. Palancharmy, Basic Civil Engineering, McGraw Hill publishers.
2. Satheesh Gopi, Basic Civil Engineering, Pearson Publishers.
3. Ketki Rangwala Dalal, Essentials of Civil Engineering, Charotar Publishing House.
4. BCP, Surveying volume 1
বাংলাদেশের অর্থনৈতিক সমীক্ষা ২০২৪ [Bangladesh Economic Review 2024 Bangla.pdf] কম্পিউটার , ট্যাব ও স্মার্ট ফোন ভার্সন সহ সম্পূর্ণ বাংলা ই-বুক বা pdf বই " সুচিপত্র ...বুকমার্ক মেনু 🔖 ও হাইপার লিংক মেনু 📝👆 যুক্ত ..
আমাদের সবার জন্য খুব খুব গুরুত্বপূর্ণ একটি বই ..বিসিএস, ব্যাংক, ইউনিভার্সিটি ভর্তি ও যে কোন প্রতিযোগিতা মূলক পরীক্ষার জন্য এর খুব ইম্পরট্যান্ট একটি বিষয় ...তাছাড়া বাংলাদেশের সাম্প্রতিক যে কোন ডাটা বা তথ্য এই বইতে পাবেন ...
তাই একজন নাগরিক হিসাবে এই তথ্য গুলো আপনার জানা প্রয়োজন ...।
বিসিএস ও ব্যাংক এর লিখিত পরীক্ষা ...+এছাড়া মাধ্যমিক ও উচ্চমাধ্যমিকের স্টুডেন্টদের জন্য অনেক কাজে আসবে ...
This presentation includes basic of PCOS their pathology and treatment and also Ayurveda correlation of PCOS and Ayurvedic line of treatment mentioned in classics.
A review of the growth of the Israel Genealogy Research Association Database Collection for the last 12 months. Our collection is now passed the 3 million mark and still growing. See which archives have contributed the most. See the different types of records we have, and which years have had records added. You can also see what we have for the future.
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
How to Manage Your Lost Opportunities in Odoo 17 CRMCeline George
Odoo 17 CRM allows us to track why we lose sales opportunities with "Lost Reasons." This helps analyze our sales process and identify areas for improvement. Here's how to configure lost reasons in Odoo 17 CRM
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPRAHUL
This Dissertation explores the particular circumstances of Mirzapur, a region located in the
core of India. Mirzapur, with its varied terrains and abundant biodiversity, offers an optimal
environment for investigating the changes in vegetation cover dynamics. Our study utilizes
advanced technologies such as GIS (Geographic Information Systems) and Remote sensing to
analyze the transformations that have taken place over the course of a decade.
The complex relationship between human activities and the environment has been the focus
of extensive research and worry. As the global community grapples with swift urbanization,
population expansion, and economic progress, the effects on natural ecosystems are becoming
more evident. A crucial element of this impact is the alteration of vegetation cover, which plays a
significant role in maintaining the ecological equilibrium of our planet.Land serves as the foundation for all human activities and provides the necessary materials for
these activities. As the most crucial natural resource, its utilization by humans results in different
'Land uses,' which are determined by both human activities and the physical characteristics of the
land.
The utilization of land is impacted by human needs and environmental factors. In countries
like India, rapid population growth and the emphasis on extensive resource exploitation can lead
to significant land degradation, adversely affecting the region's land cover.
Therefore, human intervention has significantly influenced land use patterns over many
centuries, evolving its structure over time and space. In the present era, these changes have
accelerated due to factors such as agriculture and urbanization. Information regarding land use and
cover is essential for various planning and management tasks related to the Earth's surface,
providing crucial environmental data for scientific, resource management, policy purposes, and
diverse human activities.
Accurate understanding of land use and cover is imperative for the development planning
of any area. Consequently, a wide range of professionals, including earth system scientists, land
and water managers, and urban planners, are interested in obtaining data on land use and cover
changes, conversion trends, and other related patterns. The spatial dimensions of land use and
cover support policymakers and scientists in making well-informed decisions, as alterations in
these patterns indicate shifts in economic and social conditions. Monitoring such changes with the
help of Advanced technologies like Remote Sensing and Geographic Information Systems is
crucial for coordinated efforts across different administrative levels. Advanced technologies like
Remote Sensing and Geographic Information Systems
9
Changes in vegetation cover refer to variations in the distribution, composition, and overall
structure of plant communities across different temporal and spatial scales. These changes can
occur natural.
Liberal Approach to the Study of Indian Politics.pdf
Automatize Scoring Of AFLP Datasets With RawGeno A Free R CRAN Library
1. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 1
Automatize scoring of AFLP datasets with RawGeno: a free
R CRAN library.
Nils ARRIGO, Dorothée EHRICH and Nadir ALVAREZ.
1 Getting started.......................................................................................................................................................2
1.1 Overview of the analysis...............................................................................................................................2
1.2 Organizing your project ................................................................................................................................2
1.3 Installing the softwares.................................................................................................................................3
1.3.1 Installing PeakScanner ............................................................................................................................3
1.3.2 Installing RawGeno ..................................................................................................................................3
1.4 First steps with RawGeno.............................................................................................................................4
1.4.1 The Files menu.........................................................................................................................................4
1.4.2 The Scoring menu....................................................................................................................................4
1.4.3 The Quality check menu ..........................................................................................................................4
1.4.4 The Save menu........................................................................................................................................5
2 Analysing electropherograms with PeakScanner .................................................................................................6
3 Importing data within RawGeno............................................................................................................................7
3.1 Importation....................................................................................................................................................7
3.2 Filtering of low quality samples ....................................................................................................................7
4 Scoring ..................................................................................................................................................................8
4.1 Binning..........................................................................................................................................................8
4.2 Filtering.........................................................................................................................................................9
5 Quality checking..................................................................................................................................................10
5.1 Binning quality ............................................................................................................................................10
5.1.1 Manual reviewing ...................................................................................................................................10
5.1.2 Quality statistics .....................................................................................................................................10
5.2 Samples checking ......................................................................................................................................11
6 Exporting results..................................................................................................................................................12
7 Merging data from several AFLP markers ..........................................................................................................12
8 How to cite RawGeno? .......................................................................................................................................13
9 Acknowledgements .............................................................................................................................................13
10 References..........................................................................................................................................................13
2. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 2
1 Getting started
1.1 Overview of the analysis
Analysing AFLP electropherograms is achieved using two programs: PeakScanner and RawGeno. Whereas
PeakScanner detects AFLP peaks along electropherograms and calculates their intensity and size (by relying on an
internal size standard included in electropherosis), RawGeno proceeds to the binning and scoring of AFLP
electropherograms.
RawGeno includes several filters to assess the quality of electropherograms and checks the consistency of binning.
In addition, several preliminary analyses are available to the user for making biological inferences and/or remove
outlier samples. Finally, several functions allow exporting resulting data sets into properly formatted files for further
analyses.
1.2 Organizing your project
In RawGeno 2.0 the AFLP scoring project should be organized according to the following procedure.
1. Create a folder (hereafter “project folder”) from which the project will be managed.
2. In this folder, add a sub-directory including all electropherogram files (*.fsa). Also add an R shortcut
(Windows users) to conveniently launch RawGeno scoring sessions. Right-clicking on this R shortcut allows
defining the default working directory of R by specifying it into the “Start into” addressing field of the
shortcut. Copy-pasting the project folder address into this field will set-up the working directory of R
accordingly.
Create a text-tabulated table censing individuals included in
the project (hereafter referred to as “info table”). The info table
is optional as the minimal RawGeno analysis can proceed
without it. However, RawGeno includes several functions
relying on this table, for instance to label individuals during
preliminary analyses or facilitate the sorting and selection of
individuals (for example, according to populations or species)
during the production of exports.
Therefore, the info table should include any additional relevant
information that the user would like to consider. It must contain
at least the name of individuals (i.e. in a column named “Tag”)
and any supplementary information in extra columns.
Fig. 1 Example of info table. The Tag column is mandatory
(in red), Plate and Pos (in green, PCR plate and well name)
are optional. In black: any relevant data to be compared with
AFLP data. Pay attention to avoid weird characters (such as
#, @ and others).
Tag Plate Pos Species DNA_quality
BEv14-2 1 A1 BEv 0
BEv2-2 1 A2 BEv 0
BF29-2 1 A3 BF2 0
BF36-2 1 A4 BF3 Low
BF36-2b 1 A5 BF3 Low
BHa1-2 1 A6 BHa 0
BHa3-2 1 A7 BHa 0
… … … … …
3. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 3
1.3 Installing the softwares
RawGeno works in combination with PeakScanner, an
electropherogram analyser freely distributed by ABI.
1.3.1 Installing PeakScanner
PeakScanner is freely available from
http://marketing.appliedbiosystems.com/mk/get/PS1_login.
This is a windows distribution. Linux users might run it through
Wine (for the command version at least, refer to PeakScanner
documentation).
1.3.2 Installing RawGeno
RawGeno is a library of R CRAN, which is freely available from
http://cran.r-project.org. Of course, you must first install R CRAN before willing to use RawGeno.
RawGeno is freely available from http://sourceforge.net/projects/rawgeno as a zip file.
In windows, the installation is achieved either using the graphical user interface of R (menu “Packages/Install
package(s) from local zip files”) or the following command line in the R console:
utils:::menuInstallLocal()
Installing RawGeno with Linux requires to decompress RawGeno.zip into the library folder of R (see Note 1). Using
the shell command line, this is done as follows (linux command lines, starting from the folder where RawGeno.zip
was downloaded):
sudo mv RawGeno.zip /usr/lib64/R/library/RawGeno.zip
cd /usr/lib64/R/library/RawGeno.zip
sudo unzip RawGeno.zip
sudo rm RawGeno.zip
Finally, RawGeno requires the installation of two companion
packages: vegan and rpanel, that both are available from usual
R CRAN repositories. Their installation is achieved either using
the graphical user interface of R (menu “Packages/Install
package(s) from CRAN”) or with the following command lines
(prompted into the R console):
install.packages("vegan")
install.packages("rpanel")
Fig. 2 Installing RawGeno. Using the graphical interface of
R CRAN. RawGeno is provided as a zip file that has to be
unpacked into the “library” folder of R. This operation can
also be done manually with your files explorer. Similarly,
uninstalling RawGeno is done by removing it from the
“library” folder of R.
Note 1. Linux users might need to run R as “sudo” users
to properly install companion packages (i.e. vegan and
rpanel). In addition, troubles might arise because R
libraries are downloaded as source code and compiled
locally before being installed. This requires that all
compilers needed by R (such as gc, gcc, gcc-fortran and
others) have been installed locally, before attempting the
installation of external R packages. In OpenSUSE, the
necessary compilers can be obtained using YaST2 (into the
rpm groups dedicated to development tools). Ubuntu users
are more fortunate because Synaptic Manager can install
ready-to-use R libraries in addition to usual compilers
(refer to http://cran.r-
project.org/bin/linux/ubuntu/README for further details
regarding repository addresses).
4. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 4
1.4 First steps with RawGeno
RawGeno can be launched by copy-pasting the following
commands in the command line window of R:
require(RawGeno)
require(vegan)
RawGeno()
RawGeno is interfaced in a way to “guide” users from the
importation of data to the export of final results. The graphical
user interface (hereafter “GUI”) appears with four main
menus : 1. Files, 2. Scoring, 3. Quality check, 4. Save.
1.4.1 The Files menu
Gives access to all importation steps required for the analysis.
More specifically:
• Files / Electroph. / PeakScanner (*.txt): launches the
importation device of PeakScanner results. To be
used to analyse the electropherograms of interest.
• Files / Import / Single datasets (Binary Table): imports
datasets that were already scored (and stored as
simple binary tables, with lines and columns being
individuals and AFLP loci, respectively). This
importation function is helpful for users willing to
visualise their data within RawGeno or produce
exports towards external statistical programs.
• Files / Import / Merge several datasets (Binary
Tables): imports batches of datasets that were already
scored and merges them into a single binary matrix.
This is useful for merging results from several AFLP
primer pairs.
1.4.2 The Scoring menu
To be used once data from electropherograms have been
imported into RawGeno.
• Scoring / Scoring: for… scoring data from
electropherograms, once they have been successfully
imported within RawGeno. This launches a GUI where
parameters of the binning algorithm and the filtering of
bins can be managed.
1.4.3 The Quality check menu
To be used once electropherograms have been scored.
• Quality Check / Review Binning: because binning and
filtering are done by algorithms, one might want to
have a look at what happened to the data. This menu launches a GUI where users can see how bins were
defined along the electropherograms. This device also allows manual editing of the binning.
• Quality Check / Scoring Quality: provides statistics regarding the dataset and bins properties. More
specifically:
o Binning diagnostics: shows various statistics about the quality of bins, along the electropherograms or
in general (i.e. as scatterplots or distributions).
Fig. 3 RawGeno main interface. In addition, consider the R
CRAN console where messages / statistics are displayed
during the analysis.
5. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 5
o Scoring statistics: provides standard statistics regarding the dataset (i.e. number of bins that were
filtered during the scoring, etc.)
o Scoring parameters: a summary of parameters that were used during the scoring.
• Quality Check / Samples Checking: my preferred! It allows user to explore their results. More specifically:
o Visualise samples: launches a GUI where heatmap and principal coordinate analyses can be performed
on the AFLP dataset.
o Export Sample Diagnostic Values: produces a table where quality statistics regarding samples are
saved.
o PCR Plates Check: launches a GUI for visualizing how AFLP reactions performed across PCR plates.
This is especially useful during wet-lab optimizations.
1.4.4 The Save menu
To be used once electropherograms have been scored and quality checked
OR when willing to save results from datasets that were already scored.
• Plot PseudoGel (*.pdf): saves the complete scoring project as a “gel-like” image. Still experimental
• Save / Export Currently Scored Dataset: launches a GUI for managing the export of results towards various
formats.
6. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 6
2 Analysing electropherograms with PeakScanner
The analysis of AFLPs starts by using PeakScanner in order to detect peaks along
electropherograms and to calculate their size. The procedure is highly automated,
leaving the user to set peak detection parameters and check the quality of
electropherograms.
The peak detection parameters are set up using a so-called “Analysis Method”,
which is available from the graphical interface (menu “Resources/Manage Analysis
Methods”). Typically, a proper peak detection attempts to detect only peaks that
are biologically relevant and exclude peaks only reflecting technical background
noise.
We advise to set the “Analysis Method” using the following guidelines.
• Prior to the detection of peaks per se, a light smoothing of electropherograms might be desirable, in order to
eliminate small secondary peaks inherent to technical background noise.
• The detection of peaks is achieved through a “sliding window” analysis that inspects electropherograms
locally. Within the inspected region, PeakScanner first creates a modelled version of the electropherogram
by fitting a polynomial curve to the data. Peaks are detected according to this modelled signal, based on
their absolute width. Therefore, the detection sensitivity is adjusted by modifying the width of the sliding
window (i.e. in terms of data points, the smaller it is, the more sensitive the procedure becomes), the
goodness of fit reachable by the polynomial curve (again, increasing the polynomial degree of fitting
increases the detection sensitivity) and the minimal width above which a peak is recorded as present. We
advise to use default parameters as a starting point, as they have been shown to provide reliable results (4,
5): set 15 points for the sliding window width, use a third degree polynomial curve and consider peaks that
at least have two points of half-width.
• Downstream to peak detection, PeakScanner filters peaks according to their absolute fluorescence
intensity, i.e. the peak height, measured in relative fluorescent units (rfu). Visually checking
electropherograms obtained from blank samples generally helps to adjust the fluorescence threshold to the
upper limit of the technical background noise. While some applications might benefit from considering only
peaks with a strong fluorescence (e.g. greater than 150 rfu to provide conservative estimates for band
presence statistics), most users will prefer using a more permissive threshold at this stage and apply a
posteriori filtering strategies based on bin quality statistics (1 - 4). We advise to use 50 rfu as a minimal
fluorescence for considering individual AFLP peaks.
• Save the customized “Analysis Method” in order to use it during electropherogram analysis.
Once the “Analysis Method” is set up, import the electropherograms
(stored as *.fsa files) into PeakScanner using the “Add Files” button.
Define the size standard and the “Analysis Method” to be used for all
individuals included into the project (set this information for the first
individual, then select the columns “Size Standards” and “Analysis
Method” and use the “ctrl+D” keyboard shortcut to apply these settings to
the remaining individuals).
The detection and sizing of peaks is processed using the “Analysis” menu, from the graphical interface. Once
achieved, electropherograms can be visualized and compared among individuals. This might help identifying AFLP
reactions that were not successful (e.g. individuals with a systematically low fluorescence or showing abnormal
peaks). Removing such individuals prior to the RawGeno analysis will help to enhance the final quality of scoring.
The PeakScanner analysis ends with a simple export process in which the list of peaks detected throughout the
complete set of analyzed individuals is stored in a table. This is achieved using the “Export/Export Combined Table”
menu, producing a text-tabulated file containing the size, height, area and width of all detected peaks (this can be
checked using the “Edit Table Settings” menu).
7. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 7
Dye/Sample Peak Sample File Name Size Height Area in BP
B, 1 BEv14-2.fsa 50.9319 5943 4972
B, 2 BEv14-2.fsa 53.7099 9291 7676
B, 3 BEv14-2.fsa 56.0962 69 18
B, 4 BEv14-2.fsa 57.0999 249 91
B, 5 BEv14-2.fsa 61.7942 7334 4659
B, 6 BEv14-2.fsa 63.2218 353 259
B, 7 BEv14-2.fsa 64.6425 632 418
B, 8 BEv14-2.fsa 65.792 384 286
… … … … …
3 Importing data within RawGeno
3.1 Importation
Importing the PeakScanner text-tabulated file in RawGeno is done using the menu
Files/RawGeno/Electroph./PeakScanner (*.txt).
During importation, RawGeno handles a single dye color at a time, which is user-specified
and considers the “dye” parameter with the following values: "B" (blue; FAM), "G" (green;
HEX), "Y" (yellow; NED), “R” (red; ROX) or “O” (orange; LIZ). If electropherosis was
achieved using several dyes simultaneously (e.g. multiplexing of PCR products), each dye
must be analysed separately in RawGeno. Datasets obtained from several dyes can be
merged a posteriori in a final binary table (see below).
3.2 Filtering of low quality samples
Because the detection of AFLP peaks is based on a
defined threshold, it is not easy to handle reactions
showing electropherograms with varying intensities.
When improperly handled, such a situation leads to
the inclusion of samples characterised by many false-
absences in the final dataset.
Although the only way to correctly address this issue
is a robust wet-lab protocol, RawGeno still attempts
limiting the influence of low quality AFLPs on binning
and scoring, by filtering individuals that were
unsuccessful before proceeding to binning.
Here, the variability in the number of peaks detected
per individual is used as a proxy of AFLP reactions
quality. Empirical evidence shows that this statistic is
dependent of the specific dataset used and the
biological organism studied (see Note 3). The lower
bound of this distribution most generally includes
individuals with low AFLP intensities, characterized by
many AFLP peaks that remain undetected in the
electropherograms. Because such individuals usually
represent a small fraction of the complete project, we
advise removing them from the dataset. The upper
bound of the distribution can either reflect a
biologically relevant signal (e.g. hybridization) or a
technical bias (e.g. contamination, odd PCR reaction).
Such individuals should be either discarded or identified as outliers for proper interpretation in further analyses.
Note 2. Importing data from other sequencers. RawGeno device handles text-tabulated files produced by PeakScanner.
However, results from other genescan systems can be inputted within RawGeno by following an ad-hoc data preparation. More
specifically, RawGeno imports a table where all peaks detected in the dataset are stored with the following information:
• Dye/Sample Peak = color code (B, G, Y, O or R), Peak number
• Sample File Name = sample where the peak was observed (the *.fsa
is optional)
• Size = peak size (in bp)
• Height = peak fluorescence (as measured by the sequencer)
• Area in BP = peak area (optional, required by the binning edition
device of RawGeno)
Fig 4. Filtering low quality samples. Samples are sorted according to
their number of AFLP peaks. Click-and-dragging the green area allows
selecting samples to be conserved for further analyses.
Note 3. Datasets with multiple species. Because the number of
peaks per sample is species-dependent, users should expect a
multimodal curve if several species are present in the dataset.
8. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 8
4 Scoring
Scoring is launched using the menu
Scoring / Scoring
4.1 Binning
Building a presence/absence matrix requires recognizing which
AFLP peaks are homologous across individuals. This procedure
relies on the size of peaks along electropherograms and
assumes
homology for peaks sharing identical sizes. Because peak sizes
are determined empirically using electropherosis, measurements
generally include technical variations preventing the observation
of identical sizes across homologous AFLP peaks. Indeed,
Holland et al. (5) reported measurement variations ranging
between 0 and 0.66 bp (with 0.08 bp in average) for replicated
AFLP peaks. Therefore, properly recording the signals of AFLP
peaks asks for taking into account size variations by defining size
categories (i.e. “bins”) into which the presence / absence of
AFLP peaks are recorded. Bins are characterised by their
position along the electropherogram (i.e. the average size of
peaks they include) and their width (the size difference between
the largest and shortest peak included in the bin).
RawGeno uses a binning algorithm relying on the size of AFLP
peaks over all individuals included in the project and defines bins
in a way that respects the two following conditions.
• The first constrain is a maximal bin width. This prevents
the definition of too large bins that could lead to
homoplasy (i.e. erroneously assigning non-homologous
AFLP peaks within the same bin). This limit is set using
the MaxBin parameter. We advise to use MaxBin
values ranging between 1.5 and 2 bp (from our
experiments). Using small values (i.e. MaxBin < 0.5 bp)
should be avoided as this generally causes oversplitting,
a situation where the presence / absence of homologous
AFLP bands are coded using an exaggerated number of
bins.
• The second constrain prevents the assignment of more
than one peak from the same individual within the same
bin [i.e. “technical homoplasy” (1)]. If such a situation
occurs, RawGeno defines two separate bins in which the
two peaks are assigned. This constrain can be relaxed
increasing the “MinBin” parameter in order to include two
peaks of the same individual in the same bin (when not
exceeding the “MinBin” value in size difference). Such a
relaxing might be desirable, for instance when artefactual
peaks (i.e. shoulder, stutter or secondary peaks
bordering the authentic peak in a same individual) lead
to the definition of numerous extra-bins. In such a
situation, artefactual peaks can cause the local definition
of extra bins into which homologous peaks can be
inconsistently assigned. We advise to use MinBin
values ranging between 1 and 1.5 bp.
Fig 5. Scoring interface. (a) Binning algorithm settings (i.e.
define how bins are defined), (b) Scoring range (define the
electropherogram region over which the scoring is performed),
(c) Post-scoring filtering (removal of singletons and bins with
poor fluorescence or reproducibility).
Fig 6. Binning algorithm implemented in RawGeno. Left
panel: main steps followed by the algorithm to define bins; right
panel: illutration of binning with two samples (S1 and S2); the
bin widths (i.e. the difference in size between the largest and
the shortest amplicons included in the considered bin) and the
technical homoplasy rates (i.e. HR, the mean number of peaks
belonging to the same sample that are included in a same bin
and Width, the size difference between the shortest and largest
peak included into the bin).
(b)
(a)
(c)
9. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 9
4.2 Filtering
Once defined, bins can be filtered according to their properties and/or quality. Note that such filtering strategies
require analyzing AFLP reactions with a consistent quality across individuals. In its current version, RawGeno
includes three kinds of filters.
• The size filter restricts binning to a given portion of the electropherogram (i.e. the “scoring range”). We
advise to limit the binning to peaks included in the range of the size ladder, because their size is accurately
interpolated by PeakScanner (in contrast to larger peaks where the size is extrapolated). We recommend
discarding peaks with small sizes (i.e. smaller than 100 bp, RMIN=100) as they are more likely to be
homoplasic (6, 7).
• The second filter eliminates bins according to their average fluorescence. This filter assumes that bins with
a high average fluorescence retrieve a more consistent signal than bins with a low fluorescence. The
rationale for this strategy is the following. The fluorescence of an AFLP fragment largely determines its
detection probability during the PeakScanner analysis of electropherograms. Therefore, fragments that
systematically produce low fluorescences are more likely to be erroneously censed as absent from
electropherograms as they might pass the threshold in some reactions but not in others just by chance.
• The reproducibility filter evaluates bin quality according
to their robustness across AFLP reactions, by relying on
replicated samples. This filter assumes that replicated
individuals were selected randomly from the original
dataset, in a way to scan the genetic diversity at best.
Consider that RawGeno identifies replicated individuals
using their names. Replicates must be named using the
original individual name plus a suffix letter. The suffix is
matched using the “who” parameter of the filtering
algorithm. As an example, “mysample.fsa” and
“mysampleB.fsa” are a pair of original-replicated
samples, being identified with a “B” suffix (therefore, set
who = “B” when filtering). For each bin, RawGeno
compares original to replicated individuals and
calculates the percentage of original-replicated pairs for
which the AFLP signal is successfully reproduced. Bins
where reproducibility cannot reach a satisfactory rate are
eliminated from the final dataset.
Note 4. managing replicated samples. The replicated
sample must have the same file name as the original
sample, with a distinctive suffix character (the
“ReplicateID”).
Original samples: “E27-1.fsa”, “F26-1.fsa”, “F28-1.fsa”
Replicates : “E27-1x.fsa”, “F26-1x.fsa, “F28-1x.fsa”
ReplicateID : x
You might want using file-managing programs such as
“Ant renamer” http://www.antp.be/software/renamer.
IMPORTANT: not all bins can be checked for
reproducibility, if the dataset is not completely replicated.
In such a situation, users can choose to either eliminate the
untestable bins, or to keep them included in their final
dataset.
10. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 10
5 Quality checking
5.1 Binning quality
5.1.1 Manual reviewing
Binning is an automated and straightforward analysis
step that users might want to review interactively.
RawGeno includes a visualization device for manually
editing the binning by adding, removing or modifying the
width and position of bins. This device includes several
help-to-decision statistics such as the average size and
the number of presences associated to each bin.
5.1.2 Quality statistics
RawGeno provides statistics about the dataset quality,
from the viewpoint of bins.
• Bin width, the difference between the largest
and shortest AFLP bands included into the focal
bin.
• Technical homoplasy, the average number of
peaks belonging to same individual, included
into the same bin.
• Size homoplasy, according to Vekemans et al.
(6), size homoplasy can be detected by
measuring and testing the linear correlation
existing between the size of bins and their
frequency. This procedure is already
implemented in the program AFLPsurv. A
negative and significant correlation suggests the
occurrence of size homoplasy in the dataset.
• Peaks intensity, the fluorescence of AFLP
peaks.
These values are displayed either according to the bin
size (i.e. the position along the electropherogram) or as
distributions.
Fig 7. Binning review interface. Allowing users to shift, resize, add
or remove bins interactively, starting from bins that were initially
defined by the automated algorithm. Summary statistics (i.e. the
average size of bins and the number of present AFLP peaks per bin)
are provided as editing guidelines.
100 200 300 400 500
0.0
0.5
1.0
1.5
Aggregation pattern
Bin Sizes (bp)
Bin
Widths
(bp)
100 200 300 400 500
0.6
0.8
1.0
1.2
1.4
Technical Homoplasy
Bin Sizes (bp)
Peaks
from
same
individual
/
Bin
100 200 300 400 500
0.0
0.2
0.4
0.6
0.8
1.0
Size Homoplasy
R^2 = -0.153 P-value= 0
Bin size (bp)
Bin
Frequency
100 200 300 400 500
4
5
6
7
8
9
Peaks intensity
(only 1/50 of the w hole datast is show n)
Bin Sizes (bp)
Peaks
height
(
log(RFU)
)
0.0 0.5 1.0 1.5
0.0
0.2
0.4
0.6
0.8
1.0
Bin Widths
Mean(Widths) = 0.49 bp
Bin Widths (bp)
Density
0.7 0.8 0.9 1.0 1.1 1.2 1.3
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
Technical Homoplasy
Mean(homoplasy) = 1 peaks / Bin
Peaks fromsame individual / Bin
Density
Bin Frequency
In the w hole dataset
Bin size (bp)
Frequency
(counts)
0 100 200 300 400 500
0
20
40
60
80
100
4 5 6 7 8 9
0.00
0.05
0.10
0.15
0.20
0.25
Peaks intensity
Peaks height ( log(RFU) )
Density
min = 50 RFU
mean = 1377 RFU
max = 8684 RFU
Fig 8. Quality statistics. The width, technical homoplasy and
fluorescence of bins are displayed either according to the bin size
(i.e. the position along the electropherogram) or as distributions.
Statistics about size homoplasy are also provided.
11. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 11
5.2 Samples checking
RawGeno offers two displays for exploring scoring
results (menu “RawGeno/Quality Check/Samples
Checking”). The binary matrix can be directly visualized
using a heatmap, showing individuals sorted according
to their genetic relatedness. Alternatively, individuals can
be examined with a principal coordinates analysis.
Both displays allow plotting quality statistics or external
information (i.e. picked from the “info table” cited above)
against the AFLP results. RawGeno computes four
quality statistics regarding samples:
• The number of AFLP bands per sample,
• The outlier detection index, defined as the
average frequency of the observed genotype.
=
=
nBin
1
i
nBin
1
R i
P
Where nBin = number of AFLP bands in the sample,
Pi = Frequency of the bin i in the whole dataset, that
is present in the focal sample.
• The mean AFLP bands fluorescence per sample
• The standard deviation of AFLP bands
fluorescence per sample
Fig 10. Visualizations of samples. RawGeno includes basic vizualisation devices for performing preliminary data mining.
Specifically, results can be reviewed using (a) heatmaps of the binary matrix, where samples are sorted according to their genetic
similarity and (b) principal coordinates analysis of the corresponding matrix. Both devices can compare AFLP results with either quality
statistics (i.e. number of AFLP peaks per sample, mean and variance in fluorescence intensity and outlier detection index) or external
information provided by users (e.g. the population from where samples were collected). In addition, both devices are handled through a
graphical user interface for sorting and selecting samples to be visualized.
Fig 9. Visualization device. (a) Selection of the info table (in order
to compare external information with AFLP data), (b) selection and
sorting of samples according to external information from the info
table, (c) selection of quality statistics / external data to compare with
the AFLP data. This device can produce either heatmaps of the AFLP
binary matrix or principal coordinate analyses.
Note 5. Managing the dataset content. Users generally want to interactively manage their dataset along with visualizing results. Although the
GUI gives some flexibility to this respect, the removal of outlier samples remains an operation that is not interactively available. Removing (or
renaming) outlier samples within the info table provides a simple way for achieving this goal. RawGeno joins the info table with the scored
AFLP dataset, therefore samples that are missing from the info table will not be displayed in visualization devices. Note that this “trick” only
allows modifying results that are displayed as it does not affect the originally scored matrix nor the PeakScanner results. The production of the
definitive dataset should start by removing undesirable samples before performing the PeakScanner analysis.
(a)
(c)
(b)
12. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 12
6 Exporting results
RawGeno produces exports in various formats
(8). These are accessible from the menu
“4.Save / Export Currently Scored Datasets”.
7 Merging data from several AFLP markers
The strategy is the following :
- Analyse each AFLP primer pair independently, and save
it as a binary table (be consistent with the used data
format, e.g. always save scored results as a binary
table, with individuals as rows).
- Merge these binary tables into a single matrix using
RawGeno
(menu Files / Import / Merge several datasets (Binary
tables))
- Save this merged table using the usual saving GUI.
Adding sample
information
Selecting samples
to export
Selecting exporting
formats
Fig 11. Results export device.
AFLP marker 1 AFLP marker 2 AFLP marker 3
Merged binary
matrix
Merge the binary
matrices
Binary matrix 1
Scoring +
saving
Scoring +
saving
Binary matrix 3
Binary matrix 2
Scoring +
saving
Save the merged matrix
Exports
Exports
Exports
Exports
Fig 12. Merging several datasets. Strategy to merge
several datasets using RawGeno. Note that this procedure
requires to use the menu
“Files / Import / Merge several datasets (Binary tables)”
to proceed to merging, followed by the menu
“Save / Export Currently Scored Dataset”
to save results towards exports.
Note 6 All the datasets to be merged MUST have the same samples names. This
is especially obvious when markers were multiplexed and therefore share the
same *.fsa name, but this limitation must be checked carefully when merging
data from various provenances.
Missing samples or different ordering of the scored dataset are allowed. Samples
with missing genotypes are removed from the final dataset. This limitation can
be bypassed by using the command line version:
i. Select files to merge
list.merge=choose.files(caption='Choose Files to Merge')
For linux users, use instead
list.merge=tk_choose.files(caption='Choose Files to Merge')
ii. Proceed to merging
MERGING(transpose = "indRows", exclude = T, replacewith = NA)
The transpose parameter states whether the binary matrices store individuals as
lines (“indRows”) or as columns (“indColumns”), the exclude parameter defines
whether individuals that are not shared by all matrices will be removed from the
final merged dataset (exclude = “T”). If kept (exclude = “F”), individuals with
missing AFLP genotypes will be completed using NA values (replacewith =
NA) when no data is available. Then proceed to export as explained above.
13. RawGeno, user manual Automated AFLP scoring Version 2.0-0
Summer 2010 13
8 How to cite RawGeno?
Please cite RawGeno using the reference where it was first presented:
Arrigo N, Tuszynski JW, Ehrich D, Gerdes T, Alvarez N (2009) Evaluating the impact of scoring parameters on the
structure of intra-specific genetic variation using RawGeno, an R package for automating AFLP scoring. BMC
Bioinformatics doi:10.1186/1471-2105-10-33.
9 Acknowledgements
We are grateful to Anahi Espindola, Matthias Borer, Anouk Beguin, Gregor Kozlowski, Aurélien Labhardt, Felix
Gugerli, Sabine Brodbeck, Piya Kittipadakul, Julie B. Hébert, Julie Lee Yaw, Christian Parisod, Roland Dubillard and
Robin Arnoux provided helpful comments during the development of RawGeno. This project was supported by the
National Centre of Competence in Research (NCCR) Plant Survival and the PNR-59, two programs of the Swiss
National Science Fundation.
10 References
1. Arrigo N, Tuszynski JW, Ehrich D, Gerdes T, Alvarez N (2009) Evaluating the impact of scoring parameters on the
structure of intra-specific genetic variation using RawGeno, an R package for automating AFLP scoring. BMC
Bioinformatics doi:10.1186/1471-2105-10-33.
2. Arrigo N, Felber F, Parisod C, Buerki S, Alvarez N, David J, Guadagnuolo R (in press) Origin and expansion of the
allotetraploid Aegilops geniculata, a wild relative of wheat. New Phytol.
3. Whitlock R, Hipperson H, Mannarelli M, Butlin RK, Burke T (2008) An objective, rapid and reproducible method for
scoring AFLP peak-height data that minimizes genotyping error. Mol Ecol Res 8:725-735.
4. Herrmann D, Poncet BN, Manel S, Rioux D, Gielly L, Taberlet P, Gugerli F (2010) Selection criteria for scoring
amplified fragment length polymorphisms (AFLPs) positively affect the reliability of population genetic parameter
estimates. Genome 53(4):302-310.
5. Holland BR, Clarke AC, Meudt HM (2008) Optimizing automated AFLP scoring parameters to improve phylogenetic
resolution. Syst Biol 57(3):347-366.
6. Vekemans X, Beauwens T, Lemaire M, Roldan-Ruiz I (2002) Data from amplified fragment length polymorphism
(AFLP) markers show indication of size homoplasy and of a relationship between degree of homoplasy and fragment
size. Mol Ecol 11(1):139-151.
7. Paris M, Bonnes B, Ficetola GF, Poncet BN, Després L (2010) Amplified fragment length homoplasy: in silico
analysis for model and non-model species. BMC Genomics doi:10.1186/1471-2164-11-287.
8. Ehrich D (2006) AFLPdat: a collection of R functions for convenient handling of AFLP data. Mol Ecol Notes 6: 603-
604.