2015_CV_J_SHELTON_linked

JENNIFER M SHELTON Email: sheltonj@ksu.edu
Kansas State University
Division of Biology
116 Ackert Hall
Manhattan, KS 66506

Research products
Persistent digital research identifier
Open source code
Posters
STATEMENT OF INTEREST
I have chosen to focus on bioinformatics because it is clear that analysis of
large datasets is increasingly important to the biological sciences. I find this
quickly evolving field rich with opportunities to learn new skills and to further
develop those skills by training and collaborating with other scientists.
Working with data from existing and emerging technologies in an open and
collaborative environment is preferred. In my code, I focus on reproducibility by
sharing as much of my code and source data as is appropriate for a project.
I am interested in both optimizing heuristics for informative but speedy initial
data analysis and more open ended exploration of new kinds of data to identify
unanticipated sources of information.
I have worked primarily on de novo assembly and draft assembly
improvement projects for NGS and genome map data. However I am also very
interested to work on projects involving cancer genomics.
EDUCATION
2012 MS, Biology Kansas State University
2010 post-baccalaureate, science Brooklyn College
2009 post-baccalaureate, science Hunter College
2008 post-baccalaureate, Ecology,
Evolution & Environmental Biology Columbia University
2002 BFA, Cum Laude Maryland Institute, College of Art
POSITIONS/ WORK EXPERIENCE
IIIIIIIIIIIIIIIIII
IIIIIIIIIII
IIII
IIIIIIIIIII

2012-present Bioinformatics Core Outreach Coordinator, K-INBRE, KSU
2010-2012 Graduate Teaching Assistant, KSU
2009 Intern, New York Botanical Gardens
2008-2009 Intern, American Museum of Natural History
AWARDS
2011 James E. Ackert Award for Outstanding Presentation by a
Graduate Student, KSU
2001 Maryland Institute Achievement Award
1998-2002 Dean’s List, Maryland Institute, College of Art
SUPPORT
2010-2012 Graduate Teaching Assistantship, KSU $25,000/annum
2001 Maryland Institute Achievement Award $2,000
1998-02 Trustee Award Scholarship. $20,000
SKILLS
Software Carpentry trained instructor: SWC is an non-profit organization of
scientists train other scientists in basic coding skills. http://software-carpentry.org/
pages/team.html.
Bioinformatics tools: miraEST, Tophat2, CuffDiff2, Bowtie2, Cufflinks, Velvet,
Oases, Trinity, Prinseq, ABySS, BLAST, BioNano RefAligner, BioNano
Assembler, IrysView, MaSuRCA, Trimmomatic, BWA, Sam Tools.
Languages: BASH, Perl, R, SAS, Python, LaTeX
ALGORITHMS AND WORKFLOWS
2015
CleanIllumina: Readme Processes Illumina reads using Trimmomatic's custom
palindrome adapter cleaning and their max information adaptive quality trimmer.
Pipeline includes sample data and a lab tutorial. (Under development)
Sewing Machine: Readme Comprehensive, customizable and robust workflow
that starts with assembled BioNano genome maps, a draft genome FASTA and
in silico labeled maps created from the draft genome FASTA. The pipeline
iteratively runs alignments between the in silico maps and the genome maps
and runs Stitch with the resulting alignment. The pipeline tests both a default
and relaxed set of alignment parameters. Sewing Machine iterates Stitch until
no new super scaffolds are created and the summarizes the results of the

alignments and super scaffolding. Pipeline includes sample data and a lab
tutorial. doi
AssembleIrysXeonPhi de novo: Readme Comprehensive, customizable and
robust workflow that takes raw BioNano data (i.e. a Datasets directory with BNX
files from IrysView) and writes assembly scripts to test a range of parameters,
summarizes assembly results and finally organizes output and create summary
reports and graphs. Pipeline includes sample data and a lab tutorial. doi
AssembleIrysXeonPhi: Readme Comprehensive, customizable and robust
workflow that starts with raw BioNano data (i.e. a Datasets directory with BNX
files from IrysView) and a draft genome FASTA to prepare in silico labeled
maps, rescale BioNano molecule maps, write assembly scripts to test a range
of parameters, summarize assembly results, align best BioNano genome maps
to in silico maps and super scaffold the draft genome FASTA (with Sewing
Machine) and finally organize output and create summary reports and graphs.
Pipeline includes sample data and a lab tutorial. doi
2014
tBlastx: Code Annotates de novo transcriptomes with the NCBI nt database.
Pipeline output was customized for a sequencing facilities request. Specialty
scripts for PI Dr. Benjamin Hause (KSU Vet Med). doi
AssembleG: Readme Processes Illumina DNA reads from cleaning and multi-k
assemblies to produce de novo assembled genomes. Pipeline includes sample
data and a lab tutorial. doi
AssembleT: Readme Processes Illumina RNA reads from cleaning and multi-k
assemblies to clustering for de novo transcriptomes. Pipeline includes sample
data and a lab tutorial. doi
RNA-SeqAlign2Ref: Readme Processes Illumina RNA reads from cleaning and
aligning to differential expression profiling for the transcriptomes of annotated
genomes. Pipeline includes sample data and a lab tutorial. doi
RNA-SeqAlign: Readme Processes Illumina RNA reads from cleaning and
aligning to count summarizing for de novo transcriptomes. Pipeline includes
sample data and a lab tutorial. doi
AssembleIrysCluster: Readme Preps raw molecule maps for the BioNano Irys
System and customizes assembly scripts to test a range of assembly
parameters. doi
2013
Stitch: Readme Super scaffolds genomic sequence samples using alignments to
BioNano assembled genome maps. doi

Blastx: Readme Annotates de novo transcriptomes with the NCBI nr database.
Pipeline can be run recursively to recover annotations that timed out on a
cluster. Pipeline includes sample data and a lab tutorial. doi
Count_reads_denovo: Readme Summarizes read counts for projects with non-
model organisms where no reference genome or annotation files are available.
Also considers both pairs for paired end reads unlike HTSeq. doi
TEACHING EXPERIENCE
As the K-INBRE Bioinformatics Core Outreach Coordinator:
2015
Software Carpentry Workshop at Memorial Sloan Kettering (graduate
students and postdocs). Lead Data Carpentry instructor at workshop. Aug 24-25.
Workshop website
Software Carpentry Workshop at Stanford University (graduate students and
postdocs). Lead SWC instructor at workshop July 23-24. Workshop website
Software Carpentry Workshop at The Jackson Laboratory (graduate students
and postdocs). SWC instructor at workshop. Workshop website
Software Carpentry Workshop at University of Connecticut Storrs (graduate
students and postdocs). SWC instructor at workshop. Workshop website
Software Carpentry Workshop at University of Campinas (graduate students
and postdocs). SWC instructor at workshop. Workshop website
Software Carpentry Workshop at Weill Cornell Medical College (graduate
students and postdocs). SWC instructor at workshop. Workshop website
2014
Beocat with UNIX and Perl (graduate students, postdocs and faculty). I
developed this month long workshop, NGS Analysis on Beocat, in response to
student interest in coding for NGS data analysis on Beocat, our HPC cluster. It
quickly evolved into a collaboration between the Division of Biology and the KSU
Center for Scientific Supercomputing. I developed most of the exercises and
taught the UNIX/Cluster submission half of the workshop. Dr. Greg Wilson from
Software Carpentry (SWC) video conferenced with me at an early stage of
workshop development and I created the NGS Analysis on Beocat material to fit
the SWC training model. There were 60+ applicants for 30 seats.
Introduction to Bioinformatics (graduate students and postdocs). T.A., As my
analysis scripts developed into workflows I created tutorials to expedite student
training in the lab portion of an introductory Bioinformatics course.
2013

Informal Perl course (graduate students and PostDocs). T.A., Dr. Brad Olson
led an informal course to help develop instructional material to teach Perl to
Biologists.
2012
Transcriptomics and Phlyogenetics video lessons (undergraduate students).
Created videos to teach undergraduate students about two bioinformatics tools
with accompanied exercises.
As a GTA in the Division of Biology, KSU:
2010-2012
Plant Physiology Lab (undergraduate students). GT.A., Created “A. thaliana
heat shock mutant vs. WT” and “Plant growth: using imageJ to estimate leaf
area over a time course” labs as well as lectures. Organized and guided labs.
Principles of Biology: (undergraduate students). Created lab lectures and
assisted students with a studio model introductory biology course.
Organismic Biology Lab: (undergraduate students). Created lab lectures and
examine questions. Organized and guided labs.
RESEARCH AND OUTREACH COLLABORATIONS
2012-current
Applied Bioinformatics Journal Club I organized a journal club where methods
articles, selected by students who are using them, are discussed weekly by an
interdepartmental group of biologists and computer scientists from KSU, KUMC,
and KU-L Bioinformatics Core members via video conference link. I maintain an
archive of articles chosen for discussion in our website. The Bioinformatics Core
Applied Bioinformatics Journal Club and training blogs have been visited over
92,000 times to date.
2011
Grass Journal Club As a graduate student, I organized a graduate student/
postdoc only journal club to encourage student participation and leverage the full
range of ecological and agronomic research into grass physiology, phylogeny,
and functional genetics conducted at Kansas State’s Division of Biology and
Plant Pathology. website
RESEARCH EXPERIENCE
2014-present
Assemble de novo BioNano genome maps. Kansas State University, Division
of Biology.

Various Principal investigators. Assembled de novo assemblies of BioNano
molecule maps for bacteria and eukaryotes (including but not limited to Zea
mays, Corvus corone, Monachus schauinslandi).
2013-present
Assemble BioNano genome maps with a reference. Kansas State University,
Division of Biology.
Various Principal investigators. Assembled de novo assemblies of BioNano
molecule maps for bacteria and eukaryotes (including but not limited to
Arabidopsis thaliana, Oryza sativa ssp. japonica, Pan troglodytes, Homo sapien,
Microcebus murinus, Amaranthus hypochondriacus, Trypanosoma cruzi,
Medicago truncatula, Gossypium raimondii, Gallus gallus, Xanthomonas
axonopodis pv glycines, Tribolium castaneum, Gonium pectorale, Manduca
sexta, Danaus plexippus, Triticum aestivum, Nicrophorus vespilloides, Tribolium
madens, Drosophila miranda, Drosophila pseudoobscura, Escherichia coli,
Corvus corone, Zea mays, Electrophorus electricus and Acyrthosiphon pisum).
2012-2014
Assemble de novo plant transcriptomes. Kansas State University, Division of
Biology.
Principal investigators, Dr. Timothy Durrett and Dr. Loretta Johnson. Assembled
multi-k-mer de novo assemblies and count summaries for eight plant taxa.
2010-2012
Cuticular wax synthesis and deposition. Kansas State University, Division of
Biology.
Principal investigator, Dr. Loretta Johnson. Investigating the varied cuticular
waxes of locally adapted Andropogon gerardii. Using traditional and next
generation sequencing and quantification technology to study the Fatty Acid
Elongation (FAE) system and Fatty Acid Synthase (FAS) system while classifying
wax crystalloid structure, chemical composition, and barrier properties.
2008-2009
Function and evolution of AGL6 genes. New York Botanical Gardens.
Supervised by Dr. Amy Litt. Carried out studies on the evolution of the
AGAMOUS-LIKE6 (AGL6) gene to identify changes in number and sequence
that might have influenced flower evolution. Determined the sequence of the
AGL6 gene in non-model species of flowering plants. Generated construct and
over-expressed AGL6 via a CaMV-35S promoter, in Arabidopsis.
Flower Development Genes. New York Botanical Gardens.
Supervised by Dr. Abeer Mohamed. Investigated function of MADS-box
transcription factors that promote flower and fruit development. Examined
change in expression/phenotype when the APELATA1 (AP1) gene is

expressed by the FRUITFULL (FUL) promoter and vice versa, in mutant
Arabidopsis.
Population Study, Asian Cycads. Sackler Institute of Comparative Genomics,
American Museum of Natural History.
Supervised by Dr. Angelica Cibrian-Jaramillo. Distribution of Cycas
micronesica and sympatric cycad species in Southeast Asia.
Microsatellite Library, C. trifolia. Sackler Institute for Comparative Genomics,
American Museum of Natural History.
Assistant to Dr. Aswini Pai on project assessing (1) whether Coptis trifolia only
reproduces clonally, and (2) if sexual reproduction occurs, how much diversity
is in the gene pool.
Isolated microsatellites from tissue gathered in northern New York. Analyzed
results, designed primers from ﬂanking regions. Advised Dr. Pai, a visiting
professor, of the lab layout and protocols.
Genetic Variability in Arctic Mammals. Sackler Institute for Comparative
Genomics, American Museum of Natural History.
Supervised by Dr. Diana Weber on the project “Linkage Disequilibrium of MHC
Loci in Polar Mammals: Potential Maladaptive Consequences for a Warming
Globe.” Investigated genetic relationships in the major histocompatibility
complex (MHC) Class II region for DQB and DRB from arctic mammals, with
emphasis on the muskox and polar bear.
PUBLICATIONS
Shelton J., Coleman M. C., Herndon N., Lu N., Lam E.T., Anantharaman T. and
Brown S.J. Tools and pipelines for BioNano data: molecule assembly pipeline
and FASTA super scaffolding tool. 2015 bioRxiv doi: http://dx.doi.org/
10.1101/020966. (Paper revisions are currently under review by BMC
Genomics).
Weber D.S., Van Coeverden De Groot P.J., Peacock E., Schrenzel M.D., Perez
D.A., Thomas S., Shelton J., Else C.K., Darby L.L., Acosta L., Harris C.,
Youngblood J., Boag P. and Desalle R. 2013. Low MHC variation in the polar
bear: implications in the face of Arctic warming. Animal Conservation, 16: 671–
683. doi.
PRESENTATIONS
2015
*Cunningham C. B. , Ji L. , Shelton J., Schmitz R. J., Brown S. J., Moore A. J.
Methylation occurs in beetles: the genome of the subsocial beetle Nicrophorus

vespilloides (Coleoptera: Silphidae). Ninth Annual Arthropod Genomics
Symposium.
Rogers J., Larsen P.A., Raveendran M., Liu Y., English A., Han Y.., Vee V. ,
Campbell C.R., Shelton J., Brown S.J., Muzny D.M., Gibbs R.A., Yoder A.D.,
Worley K. Whole genome assembly of the gray mouse lemur (Microcebus
murinus) genome: integrating diverse platforms and data types. Biology of
Genomes. Cold Spring Harbor.
2014
Brown S.J. and Shelton J. BioNano Genomics Webinar: Using BioNano Maps to
Improve an Insect Genome Assembly. http://www.bionanogenomics.com/
bionano-community/webinars/.
*Shelton J., Herndon N., Gray M., Liang H., Durrett T., Johnson L., Akhunova A.,
Brown S.J. Multi-K-Mer de novo Transcriptome Assembly, Validation, and
Count. Plant and Animal Genomes XXII.
*Herndon N., Shelton J., Andrews W., Wang W., Brown S.J. Improving the
Tribolium draft Assembly with Physical Maps Based on Imaging Ultra-Long
Single DNA Molecules. Plant and Animal Genomes XXII.
*Gray M., Shelton J., Chellapilla S., Bello N., Akhunova A., Liang H., Garrett K.,
Akhunov E., Morgan T., Johnson L. Transcriptional Differences of Mesic and
Xeric Ecotypes of an Ecologically-Dominant Prairie Grass Andropogon gerardii
to Abiotic Stress. Plant and Animal Genomes XXII.
Johnson L., Brown S.J., Gray M., Shelton J., Baer S.G., Maricle B., Bello N.
Genetic Differentiation, Transcriptome Variation, and Local Adaptation of an
Ecologically Dominant Prairie and Bioenergy Grass Andropogon gerardii (big
bluestem) Occurring Along the Precipitation Gradient in Midwest U.S.
Grasslands. Plant and Animal Genomes XXII.
*Shelton J., Gray M., Brown S.J., Chellapilla S., Akhunova A., Akhunov E., Liang
X., Johnson L. De novo Transcriptome Proﬁling of Two Edaphically and
Phenotypically Divergent Grasses: Dominant Forage Grass Big Bluestem
Andropogon gerardii and Drought-Tolerant Sand Bluestem Andropogon
gerardii ssp. hallii. Plant and Animal Genomes XXI.
Shelton J. Micromorphology, Chemistry, and Physiology of epicuticular waxes in
sand bluestem, an edaphic ecotype of big bluestem. Ecological Genomics
Research Forum, Manhattan, KS.
*Shelton J., Johnson, L., Ravenek J., Jeannotte R., Song Z., Welti R. Bello N.,
Nippert J. Divergent epicuticular physiology and morphology in the ecological
dominant prairie grass big bluestem Andropogon gerardii, and its edaphic

variety sand bluestem A. gerardii var. hallii. American Society of Plant
Biologists Midwestern Sectional Meeting Lincoln, NE.
*Shelton J., Johnson, L., Ravenek J., Jeannotte R., Song Z., Welti R.
Divergence in leaf morphology and physiology in Andropogon gerardii var.
hallii (sand bluestem) a locally adapted variety of Andropogon gerardii (big
bluestem). ASPB annual meeting.
Shelton J., Johnson, L., Ravenek J., Jeannotte R., Song Z., Welti R. Phenotypic
divergence of cuticle chemistry and surface morphology in big and sand
bluestem. Biology Graduate Students Research Forum, Manhattan KS.
*Shelton J., Johnson, L., Ravenek J., Jeannotte R., Song Z., Welti R. Local
adaptation of wax biosynthesis in western Andropogon gerardii and
Andropogon gerardii (subsp. hallii). Ecological Genomics Symposium, Kansas
City, KS.
* indicates poster presentation
REFERENCES
* Public Web Version of CV: Contact Jennifer M Shelton directly for reference
contact details.
Susan J Brown
Director K-INBRE Bioinformatics Core
University Distinguished Professor
239A Chalmers Hall
Manhattan KS, 66506
Todd Dickinson
Former VP, Global Commercial Operations BioNano Genomics
CEO Dovetail Genomics, CEO IdentifyGenomics, CEO BigDataBio
Greg Wilson
Software Carpentry, Executive Director
http://www.software-carpentry.org/
Arvind Bharti PhD, PMP
Syngenta Crop Protection, LLC
3054 E Cornwallis Rd
Research Triangle Park, NC 27709
Jeremy Chien

Member of the Applied Bioinformatics Journal Club and Researcher in the K-
INBRE network
Assistant Professor, Cancer Biology
Assistant Director, Translational Genomics, University of Kansas Cancer Center
2020B Wahl Hall East, Mailstop 1027
3901 Rainbow Boulevard
Kansas City, KS 66160
Peter A. Larsen
I am collaborating with the the grey mouse lemur genome assembly project that
Peter Larsen is working on
Postdoctoral Associate, Yoder lab
Duke University
Box 90338
BioSci 130 Science Drive
Durham, NC 27708
PERSONAL SCHOLARLY DEVELOPMENT
As a graduate student I studied genomics applications, chromosome and
genome analysis, and bioinformatics. This prepared me for assembling my own
transcriptomes for two wild grasses in my master research project.
As the Bioinformatics Core Outreach Coordinator I learned to code in Perl and
BASH as well as work on an SGE Cluster by collaborating with bioinformatics
specialists and cluster administrators. As I accrued these skills I made completed
scripts available for use through Github as “kstatebioinfo". As these scripts
developed into workflows I created tutorials that I used to train students in the lab
portion of an introductory Bioinformatics course. Students in this course became
interested in learning how to code, prompting the creation of a month long coding
workshop. The workshop, NGS Analysis on Beocat, turned into a collaboration
between the Division of Biology and KSU Center for Scientific Supercomputing. I
developed most of the exercises and taught the first two weeks of the workshop.
Greg Wilson from Software Carpentry (SWC) met with us at an early stage of
workshop development and I adjusted the NGS Analysis on Beocat material to fit
the SWC training model.
In addition to coding, codesharing and training I ,created the Applied
Bioinformatics Journal Club (ABJC) as a forum to discuss workflows and projects
within and between Bioinformatics Cores. At the ABJC, methods articles are
discussed weekly by an interdepartmental group of biologists and computer
scientists from KSU, KUMC, and KU-L’s Bioinformatics Cores via video

conference. Usage of the ABJC’s archived topics has already exceeded 90,000
views.
I am also collaborating with scientists at BioNano Genomics to develop
workﬂows for BioNano molecule maps. These can be used to assemble physical
maps or order/orient sequence assemblies. Working in a beta test site for this
new technology and working with my Bioinformatics Core to start our certiﬁed
BioNano Pro service center brought together the dynamic and collaborative
aspects that draw me to bioinformatic analysis. This was perhaps the most
engaging collaborative experience I have had so far.

2015_CV_J_SHELTON_linked

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to 2015_CV_J_SHELTON_linked

Similar to 2015_CV_J_SHELTON_linked (20)

2015_CV_J_SHELTON_linked