This document provides information about genome annotation. It begins by describing how open reading frames (ORFs) are identified in genomes and how genomes are annotated. It discusses the types of databases used to classify genes, such as those involved in metabolism. It provides examples of how genes are categorized, including by enzyme commission numbers, FIGfams, Pfam, COGs, KEGG Orthology numbers, and metabolic pathways. It also discusses topics like pseudogenes, the origin of replication, ribosomal operons, GC skew, and central carbon metabolism pathways like glycolysis and the Entner-Doudoroff pathway.
This document provides an overview of intracellular receptors, enzymes, signals, transcription factors, structural proteins, and nucleic acids. It discusses in detail the structures and functions of DNA, RNA, nuclear receptors, IP3 receptors, intracellular enzymes, transcription factors, and structural proteins like collagen, keratin, myosin, and actin. The presentation was given by S. Dinakar from the Department of Pharmacology at PSG College of Pharmacy in Coimbatore, India.
This document summarizes bacterial and viral chromosomes. It discusses the structure of bacterial chromosomes, using E. coli as an example. It notes that E. coli has a single circular chromosome containing all of its genes. It describes the sequencing and mapping of the E. coli chromosome, including its size, gene content, and organization. It also discusses horizontal gene transfer in E. coli and the insertion of genetic elements into its chromosome. The document then summarizes viral genomes, noting that viruses have either DNA or RNA and can be single or double stranded. It provides details on the genome of bacteriophage T4.
1.introduction to genetic engineering and restriction enzymesGetachew Birhanu
An introduction to Genetic engineering
A short background and history of Genetic Engineering
Classification of DNA manipulating Enzymes, nomenclature
Restriction recognition sequences, the anatomy of a gene and the flow of genetic information
More emphasis is given for the essential DNA Manipulating Enzymes
Finally Restriction mapping (analysis)
Seminar /certified fixed orthodontic courses by Indian dental academy Indian dental academy
The document discusses gene mapping techniques to localize genes on chromosomes. It describes two main approaches: somatic cell hybridization and fluorescent in situ hybridization (FISH). Somatic cell hybridization involves fusing cells from two species and tracking which chromosomes are retained based on expression of phenotypes. FISH uses fluorescent probes that hybridize to complementary DNA sequences, allowing visualization of specific chromosomes or regions under UV light. These techniques have revolutionized chromosome analysis and gene mapping by precisely locating genes on human chromosomes.
Chromosomes /certified fixed orthodontic courses by Indian dental academy Indian dental academy
The Indian Dental Academy is the Leader in continuing dental education , training dentists in all aspects of dentistry and offering a wide range of dental certified courses in different formats.
Indian dental academy provides dental crown & Bridge,rotary endodontics,fixed orthodontics,
Dental implants courses.for details pls visit www.indiandentalacademy.com ,or call
0091-9248678078
Gene cloning allows making many copies of a gene or DNA fragment. There are two main approaches - cell-based cloning and PCR. Cell-based cloning involves isolating DNA, inserting it into a vector, and introducing the vector into a host cell. As the host cell divides, it makes many copies of the inserted DNA fragment. Common vectors used include bacterial plasmids and phages. Screening techniques are used to identify clones containing the desired gene, such as selecting for an antibiotic resistance marker or detecting expression of a protein.
Bioinformatics is the interdisciplinary study of biology and computer science. It involves developing tools to analyze large amounts of biological data, such as genetic sequences. There are two main building blocks studied in bioinformatics: nucleic acids like DNA and RNA, and proteins. DNA stores genetic information that is transcribed into RNA, which is then translated into proteins according to the genetic code. Technological advances have led to an explosion of biological data that requires bioinformatics approaches to analyze and interpret.
The sequence of nucleotides in deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) that determines the amino acid sequence of proteins. Though the linear sequence of nucleotides in DNA contains the information for protein sequences, proteins are not made directly from DNA. Instead, a messenger RNA (mRNA) molecule is synthesized from the DNA and directs the formation of the protein. RNA is composed of four nucleotides: adenine (A), guanine (G), cytosine (C), and uracil."(U)."
This document provides an overview of intracellular receptors, enzymes, signals, transcription factors, structural proteins, and nucleic acids. It discusses in detail the structures and functions of DNA, RNA, nuclear receptors, IP3 receptors, intracellular enzymes, transcription factors, and structural proteins like collagen, keratin, myosin, and actin. The presentation was given by S. Dinakar from the Department of Pharmacology at PSG College of Pharmacy in Coimbatore, India.
This document summarizes bacterial and viral chromosomes. It discusses the structure of bacterial chromosomes, using E. coli as an example. It notes that E. coli has a single circular chromosome containing all of its genes. It describes the sequencing and mapping of the E. coli chromosome, including its size, gene content, and organization. It also discusses horizontal gene transfer in E. coli and the insertion of genetic elements into its chromosome. The document then summarizes viral genomes, noting that viruses have either DNA or RNA and can be single or double stranded. It provides details on the genome of bacteriophage T4.
1.introduction to genetic engineering and restriction enzymesGetachew Birhanu
An introduction to Genetic engineering
A short background and history of Genetic Engineering
Classification of DNA manipulating Enzymes, nomenclature
Restriction recognition sequences, the anatomy of a gene and the flow of genetic information
More emphasis is given for the essential DNA Manipulating Enzymes
Finally Restriction mapping (analysis)
Seminar /certified fixed orthodontic courses by Indian dental academy Indian dental academy
The document discusses gene mapping techniques to localize genes on chromosomes. It describes two main approaches: somatic cell hybridization and fluorescent in situ hybridization (FISH). Somatic cell hybridization involves fusing cells from two species and tracking which chromosomes are retained based on expression of phenotypes. FISH uses fluorescent probes that hybridize to complementary DNA sequences, allowing visualization of specific chromosomes or regions under UV light. These techniques have revolutionized chromosome analysis and gene mapping by precisely locating genes on human chromosomes.
Chromosomes /certified fixed orthodontic courses by Indian dental academy Indian dental academy
The Indian Dental Academy is the Leader in continuing dental education , training dentists in all aspects of dentistry and offering a wide range of dental certified courses in different formats.
Indian dental academy provides dental crown & Bridge,rotary endodontics,fixed orthodontics,
Dental implants courses.for details pls visit www.indiandentalacademy.com ,or call
0091-9248678078
Gene cloning allows making many copies of a gene or DNA fragment. There are two main approaches - cell-based cloning and PCR. Cell-based cloning involves isolating DNA, inserting it into a vector, and introducing the vector into a host cell. As the host cell divides, it makes many copies of the inserted DNA fragment. Common vectors used include bacterial plasmids and phages. Screening techniques are used to identify clones containing the desired gene, such as selecting for an antibiotic resistance marker or detecting expression of a protein.
Bioinformatics is the interdisciplinary study of biology and computer science. It involves developing tools to analyze large amounts of biological data, such as genetic sequences. There are two main building blocks studied in bioinformatics: nucleic acids like DNA and RNA, and proteins. DNA stores genetic information that is transcribed into RNA, which is then translated into proteins according to the genetic code. Technological advances have led to an explosion of biological data that requires bioinformatics approaches to analyze and interpret.
The sequence of nucleotides in deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) that determines the amino acid sequence of proteins. Though the linear sequence of nucleotides in DNA contains the information for protein sequences, proteins are not made directly from DNA. Instead, a messenger RNA (mRNA) molecule is synthesized from the DNA and directs the formation of the protein. RNA is composed of four nucleotides: adenine (A), guanine (G), cytosine (C), and uracil."(U)."
D.N.A and genetics /certified fixed orthodontic courses by Indian dental acad...Indian dental academy
The Indian Dental Academy is the Leader in continuing dental education , training dentists in all aspects of dentistry and offering a wide range of dental certified courses in different formats.
Cloning vectors are small pieces of DNA that can be stably maintained in an organism and have foreign DNA inserted into them for cloning purposes. The most commonly used cloning vectors are genetically engineered plasmids. Plasmids are taken from bacteria and can replicate within bacterial cells. Other types of cloning vectors include bacteriophages, cosmids, yeast artificial chromosomes, and bacterial artificial chromosomes, which can accommodate larger DNA fragments. Restriction enzymes and DNA ligase are used to cut and join DNA fragments for cloning into vectors.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
Bioinformatics uses computers to store, organize, and analyze biological data, particularly DNA and protein sequences. Key data types include DNA, RNA, and protein sequences, as well as data from experiments like transcriptomics and proteomics. Common analyses include sequence comparisons and searches for coding regions. DNA contains genetic information encoded as sequences of nucleotides that are read from 5' to 3'. It is double-stranded and antiparallel. Genes encode proteins through transcription of DNA to mRNA and translation of mRNA to protein.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project on Eurytemora affinis
This lecture discusses biological data used in bioinformatics and provides an overview of major types of biological data including DNA, RNA, protein, and gene expression data. It also covers basic concepts in molecular biology such as DNA replication, transcription, and translation. The structures of DNA and RNA are described along with key components such as nucleotides, base pairing rules, and gene structure in eukaryotes. Regions of genes like promoters, enhancers, and terminators are also summarized.
The document discusses viruses with large and small DNA genomes as well as positive-strand and negative-strand RNA viruses. Herpesviruses have very large DNA genomes up to 235 kbp that encode many enzymes. Adenoviruses and phages like lambda have smaller genomes between 30-54 kbp. Animal viruses like parvoviruses and polyomaviruses have even smaller genomes around 5 kbp that tightly pack genes. Picornaviruses, togaviruses, and flaviviruses have single-stranded RNA genomes between 7-11 kbp. Coronaviruses have the largest RNA genomes around 30 kbp. Segmentation allows larger coding capacity for viruses like influenza and gemin
This document discusses restriction enzymes and their uses in molecular biology. Restriction enzymes recognize specific short nucleotide sequences in DNA and cut the DNA at those sites. They were originally discovered in bacteria as a defense mechanism against viruses. Restriction enzymes cut DNA into fragments that can then be analyzed on agarose gels or used in recombinant DNA techniques. The document provides examples of specific restriction enzymes, their recognition sequences, and how they are used to study genetic variations and diseases.
This document outlines a DNA barcoding protocol for Census of Marine Life (CoML) investigators to determine DNA barcodes from collected specimens. The protocol recommends preserving specimens in 95% ethanol, amplifying and sequencing the cytochrome c oxidase subunit I (COI) gene as the primary barcode marker, and submitting sequences to public databases linked to specimen data. Alternate targets may be needed for some taxa. The goal is to provide a uniform method for species identification that will aid CoML research and have broader scientific applications.
This is an introduction to conducting manual annotation efforts using Apollo. This webinar was offered to members of the i5K Research community on 2015-10-07.
The genetic code is the set of rules by which information encoded in DNA is translated into proteins by living cells. It specifies how sequences of nucleotides in mRNA are used to direct protein synthesis through codon-anticodon interactions between mRNA and tRNA. The genetic code is nearly universal, with some minor variations, and is written in the 5' to 3' direction on mRNA. It uses 64 possible codon combinations to specify 20 standard amino acids and 3 stop codons.
Introduction to Apollo: A webinar for the i5K Research CommunityMonica Munoz-Torres
This document provides an introduction and outline for a webinar on using the Apollo genome annotation editing tool. It was presented by Monica Munoz-Torres of BBOP to the i5K Research Community. The webinar aimed to help participants better understand genome curation in the context of automated and manual annotation. It also aimed to familiarize participants with Apollo's functionality and how to identify homologs of known genes, corroborate gene models using evidence, and modify automated annotations in Apollo. The document includes sections on genome sequencing projects, the objectives and uses of genome annotation, and a biological refresher on concepts relevant to manual annotation like genes, transcription, translation, and genome curation steps.
Enzymes that cut DNA at or near specific recognition nucleotide sequences known as restriction sites.
Especial class of enzymes that cleave (cut) DNA at a specific unique internal location along its length.
Often called restriction endonucleases (Because they cut within the molecule).
Discovered in the late 1970s by Werner Arber, Hamilton Smith, and Daniel Nathans.
Essential tools for recombinant DNA technology.
Naturally produced by bacteria that use them as a defense mechanism against viral infection.
Chop up the viral nucleic acids and protect a bacterial cell by hydrolyzing phage DNA.
• The genetic code is the set of rules by which information encoded in genetic material (DNA or RNA sequences) is translated into proteins (amino acid sequences) by living cells.
• The genetic code, once thought to be identical in all forms of life, has been found to diverge slightly in certain organisms and in the mitochondria of some eukaryotes.
• Nevertheless, these differences are rare, and the genetic code is identical in almost all species, with the same codons specifying the same amino acids.
This document discusses gene discovery and DNA structure. It defines key terms like genotype, phenotype, genes and genomes. It describes the double helix structure of DNA and how DNA is composed of nucleotides. Genes are located on chromosomes and encode proteins. DNA is transcribed into mRNA and then translated into proteins. Eukaryotic and prokaryotic genome annotation aims to find genes, exons, transcripts and predict function. DNA must replicate before cell division to produce double-stranded chromosomes from single-stranded ones.
This document discusses hypothesis testing in science. It explains that hypothesis testing is one method of scientific inquiry alongside other ways of knowing like traditional ecological knowledge. The document defines bias and lists some types of bias like survivorship bias that can influence scientific questions and results. It also outlines the characteristics of a good hypothesis, providing examples, and explains that the purpose of testing a hypothesis is to evaluate a proposed explanation for a phenomenon.
This document provides an overview of a microbial genomics course project involving bacterial isolates from a long-term soil warming experiment. The project aims to sequence genomes of isolates collected from control and warmed soil plots over the course of the experiment to investigate evidence of microbial adaptation to warming at the genomic level. Isolates representing different time points in the experiment may allow insights into evolutionary responses to warming over time. The document outlines the workflow for building a culture collection from the soil samples and selecting isolates for genome sequencing based on phylogenetic and physiological analyses.
D.N.A and genetics /certified fixed orthodontic courses by Indian dental acad...Indian dental academy
The Indian Dental Academy is the Leader in continuing dental education , training dentists in all aspects of dentistry and offering a wide range of dental certified courses in different formats.
Cloning vectors are small pieces of DNA that can be stably maintained in an organism and have foreign DNA inserted into them for cloning purposes. The most commonly used cloning vectors are genetically engineered plasmids. Plasmids are taken from bacteria and can replicate within bacterial cells. Other types of cloning vectors include bacteriophages, cosmids, yeast artificial chromosomes, and bacterial artificial chromosomes, which can accommodate larger DNA fragments. Restriction enzymes and DNA ligase are used to cut and join DNA fragments for cloning into vectors.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
Comparative genome analysis requires high quality annotations of all genomic elements. Today’s sequencing projects face numerous challenges including lower coverage, more frequent assembly errors, and the lack of closely related species with well-annotated genomes. Precise elucidation of the many different biological features encoded in any genome requires careful examination and review. We need genome annotation editing tools to modify and refine the location and structure of the genome elements that predictive algorithms cannot yet resolve automatically. During the manual annotation process, curators identify elements that best represent the underlying biology and eliminate elements that reflect systemic errors of automated analyses.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, analogous to Google Docs, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Researchers from nearly one hundred institutions worldwide are currently using Apollo for distributed curation efforts in over sixty genome projects across the tree of life: from plants to arthropods, to fungi, to species of fish and other vertebrates including human, cattle (bovine), and dog.
Bioinformatics uses computers to store, organize, and analyze biological data, particularly DNA and protein sequences. Key data types include DNA, RNA, and protein sequences, as well as data from experiments like transcriptomics and proteomics. Common analyses include sequence comparisons and searches for coding regions. DNA contains genetic information encoded as sequences of nucleotides that are read from 5' to 3'. It is double-stranded and antiparallel. Genes encode proteins through transcription of DNA to mRNA and translation of mRNA to protein.
Apollo is a web-based application that supports and enables collaborative genome curation in real time, allowing teams of curators to improve on existing automated gene models through an intuitive interface. Apollo allows researchers to break down large amounts of data into manageable portions to mobilize groups of researchers with shared interests.
The i5K, an initiative to sequence the genomes of 5,000 insect and related arthropod species, is a broad and inclusive effort that seeks to involve scientists from around the world in their genome curation process, and Apollo is serving as the platform to empower this community.
This presentation is an introduction to Apollo for the members of the i5K Pilot Project on Eurytemora affinis
This lecture discusses biological data used in bioinformatics and provides an overview of major types of biological data including DNA, RNA, protein, and gene expression data. It also covers basic concepts in molecular biology such as DNA replication, transcription, and translation. The structures of DNA and RNA are described along with key components such as nucleotides, base pairing rules, and gene structure in eukaryotes. Regions of genes like promoters, enhancers, and terminators are also summarized.
The document discusses viruses with large and small DNA genomes as well as positive-strand and negative-strand RNA viruses. Herpesviruses have very large DNA genomes up to 235 kbp that encode many enzymes. Adenoviruses and phages like lambda have smaller genomes between 30-54 kbp. Animal viruses like parvoviruses and polyomaviruses have even smaller genomes around 5 kbp that tightly pack genes. Picornaviruses, togaviruses, and flaviviruses have single-stranded RNA genomes between 7-11 kbp. Coronaviruses have the largest RNA genomes around 30 kbp. Segmentation allows larger coding capacity for viruses like influenza and gemin
This document discusses restriction enzymes and their uses in molecular biology. Restriction enzymes recognize specific short nucleotide sequences in DNA and cut the DNA at those sites. They were originally discovered in bacteria as a defense mechanism against viruses. Restriction enzymes cut DNA into fragments that can then be analyzed on agarose gels or used in recombinant DNA techniques. The document provides examples of specific restriction enzymes, their recognition sequences, and how they are used to study genetic variations and diseases.
This document outlines a DNA barcoding protocol for Census of Marine Life (CoML) investigators to determine DNA barcodes from collected specimens. The protocol recommends preserving specimens in 95% ethanol, amplifying and sequencing the cytochrome c oxidase subunit I (COI) gene as the primary barcode marker, and submitting sequences to public databases linked to specimen data. Alternate targets may be needed for some taxa. The goal is to provide a uniform method for species identification that will aid CoML research and have broader scientific applications.
This is an introduction to conducting manual annotation efforts using Apollo. This webinar was offered to members of the i5K Research community on 2015-10-07.
The genetic code is the set of rules by which information encoded in DNA is translated into proteins by living cells. It specifies how sequences of nucleotides in mRNA are used to direct protein synthesis through codon-anticodon interactions between mRNA and tRNA. The genetic code is nearly universal, with some minor variations, and is written in the 5' to 3' direction on mRNA. It uses 64 possible codon combinations to specify 20 standard amino acids and 3 stop codons.
Introduction to Apollo: A webinar for the i5K Research CommunityMonica Munoz-Torres
This document provides an introduction and outline for a webinar on using the Apollo genome annotation editing tool. It was presented by Monica Munoz-Torres of BBOP to the i5K Research Community. The webinar aimed to help participants better understand genome curation in the context of automated and manual annotation. It also aimed to familiarize participants with Apollo's functionality and how to identify homologs of known genes, corroborate gene models using evidence, and modify automated annotations in Apollo. The document includes sections on genome sequencing projects, the objectives and uses of genome annotation, and a biological refresher on concepts relevant to manual annotation like genes, transcription, translation, and genome curation steps.
Enzymes that cut DNA at or near specific recognition nucleotide sequences known as restriction sites.
Especial class of enzymes that cleave (cut) DNA at a specific unique internal location along its length.
Often called restriction endonucleases (Because they cut within the molecule).
Discovered in the late 1970s by Werner Arber, Hamilton Smith, and Daniel Nathans.
Essential tools for recombinant DNA technology.
Naturally produced by bacteria that use them as a defense mechanism against viral infection.
Chop up the viral nucleic acids and protect a bacterial cell by hydrolyzing phage DNA.
• The genetic code is the set of rules by which information encoded in genetic material (DNA or RNA sequences) is translated into proteins (amino acid sequences) by living cells.
• The genetic code, once thought to be identical in all forms of life, has been found to diverge slightly in certain organisms and in the mitochondria of some eukaryotes.
• Nevertheless, these differences are rare, and the genetic code is identical in almost all species, with the same codons specifying the same amino acids.
This document discusses gene discovery and DNA structure. It defines key terms like genotype, phenotype, genes and genomes. It describes the double helix structure of DNA and how DNA is composed of nucleotides. Genes are located on chromosomes and encode proteins. DNA is transcribed into mRNA and then translated into proteins. Eukaryotic and prokaryotic genome annotation aims to find genes, exons, transcripts and predict function. DNA must replicate before cell division to produce double-stranded chromosomes from single-stranded ones.
This document discusses hypothesis testing in science. It explains that hypothesis testing is one method of scientific inquiry alongside other ways of knowing like traditional ecological knowledge. The document defines bias and lists some types of bias like survivorship bias that can influence scientific questions and results. It also outlines the characteristics of a good hypothesis, providing examples, and explains that the purpose of testing a hypothesis is to evaluate a proposed explanation for a phenomenon.
This document provides an overview of a microbial genomics course project involving bacterial isolates from a long-term soil warming experiment. The project aims to sequence genomes of isolates collected from control and warmed soil plots over the course of the experiment to investigate evidence of microbial adaptation to warming at the genomic level. Isolates representing different time points in the experiment may allow insights into evolutionary responses to warming over time. The document outlines the workflow for building a culture collection from the soil samples and selecting isolates for genome sequencing based on phylogenetic and physiological analyses.
This document provides an overview of phylogeny and constructing phylogenetic trees. It defines phylogeny as models of evolutionary relationships among species based on sequence similarities, often illustrated as phylogenetic trees. It describes how to construct phylogenetic trees, including choosing marker genes, aligning sequences, calculating evolutionary distances, performing phylogenetic analysis, and dealing with complexities like long-branch attraction. It also discusses species definitions in microbes and operational species concepts based on metrics like 16S rRNA sequence identity.
This document discusses sequence alignment and its applications in bioinformatics. It begins by explaining the goals of learning about homology and how sequence alignment relates to function across organisms. It then describes different types of sequence alignment including global, local, Needleman-Wunsch, Smith-Waterman, and BLAST. It explains how to quantify alignment scores and perform statistical analysis of alignments. The document provides examples of alignment matrices and algorithms for finding the best alignment between sequences.
Measures of DNA sequence quality include chastity, low quality reads, adapter contamination, discordant read pairs, duplicate reads, biases, contamination, and complexity of genomes. Chastity measures the signal to noise ratio, while low quality reads have high incorrect base calling. Adapter contamination occurs when sequencing reads include adapter sequences. Discordant read pairs have the paired-end sequences out of order. Duplicate reads are more common than expected by chance. Biases can skew sequence composition. Contamination introduces undesired sequences. Complex genomes like those with repeats or heterozygosity challenge assembly. Ensuring high quality involves evaluating these measures and preprocessing like trimming.
This document discusses genome assembly from metagenomic sequencing data. It defines key terms like metagenome assembled genomes (MAGs) and describes how genome assembly works, including using de Bruijn graphs to assemble short sequencing reads into longer contigs and scaffolds. The document also outlines several measures used to assess genome assembly quality, such as coverage, contig length metrics like N50 and N75, and completeness and contamination measurements.
This document provides an overview of different DNA sequencing technologies, including:
- Sanger sequencing, the first generation method using chain termination.
- Next generation sequencing methods like Illumina that use sequencing by synthesis and massively parallel approaches.
- Third generation long-read sequencing methods like PacBio and Oxford Nanopore that sequence single native DNA molecules and can detect modifications but have lower throughput.
It describes the key innovations, working mechanisms, and tradeoffs of read length, output, and accuracy between Sanger, next generation, and long-read third generation sequencing technologies. It also highlights the portability of Oxford Nanopore sequencing with the MinION device.
This document provides an overview of the topics that will be covered in a lecture on bacterial genomics and molecular biology. The lecture will describe bacterial, eukaryotic, and endosymbiont genome structure and organization. It will explain how DNA replication, transcription, and translation are timed with the cell cycle. The lecture will also define the components of typical bacterial genes and how they are arranged in genomes. Finally, it will contrast the molecular structures of DNA and RNA.
This document provides an overview of a bioinformatics lab course at UMass Amherst taught by Professor Kristen DeAngelis in the fall of 2022. The goals of the course are to use genomics to understand ecosystems and microbial adaptation to climate change. Students will analyze bacterial genomes from the professor's research lab using bioinformatics tools on the MGHPCC cluster and KBase. The course will involve both guided and independent analysis of genomes. Students are expected to participate actively and work independently on a capstone project analyzing a newly sequenced bacterial genome.
Instructions and bracket to play Morrill Microbe Madness, a game to review representative organisms from the major phyla of the domain bacteria, part of MICROBIO 480 Microbial Diversity.
Unit 11: Viruses and Prions
LECTURE LEARNING GOALS
1. Define what is a virus, and describe the three theories on the origin of viruses.
2. Define and contrast prions and subviral agents. Explain how they are different from viruses.
3. Explain coronaviruses, the origin of SARS- CoV-2, how it infects cells, and the tools we use to fight the spread of COVID-19.
Unit 10: Diversity of Permafrost
LECTURE LEARNING GOALS
1. Describe permafrost, and the microbial diversity of permafrost. Explain how the greatest diversity of Archaea exist in cold environments.
2. Describe the two main Archaeal phyla, and describe example species.
3. Explain how climate change is affecting permafrost and microbial diversity.
Unit 9: Human Microbiome
LECTURE LEARNING GOALS
1. Describe the human microbiome: how many microbes there are, how you get your microbiome, who’s there, and how it changes over time and by region.
2. Describe the domain eukarya. List the five superkingdoms and a few notable species.
3. Explain how the human microbiome is related to health and disease.
Unit 8: Rare and Uncultured Microbes
LECTURE LEARNING GOALS
1. Describe the phyla containing rare bacteria: Deinococcus/Thermus, Chlamydia & Planctomycetes.
2. Describe the sequencing methods used to understand uncultured microbes. Explain the Eocyte hypothesis and how this model differs from the three domain tree of life.
3. For the cultured microbes, describe major characteristics for the 13 bacterial phyla, and explain why some microbe remain uncultivated.
6
Unit 7: Diversity of Soils & Sediments
LECTURE LEARNING GOALS
1. Define soils and sediment, and contrast the microbes living in each. Explain biogeochemical cycles.
2. Describe the diversity, metabolism & habitat of the five classes of the phylum Proteobacteria, including some common example species.
3. Describe the diversity, metabolism & habitat of the Gram-positive bacteria (phylua Firmicutes & Actinobacteria).
Unit 6: Diversity of Microbial Mats
LECTURE LEARNING GOALS
1. Definemicrobialmats.Describethe functional guilds of microbes in the different layers, and how they interact.
2. Foreachofthethreephylaof photosynthetic bacteria, contrast how each fixes C and gains energy and reducing equivalents from light.
3. Forthetwothermophilicbacterialphyla, describe their adaptations to life at high
temperature. Explain how they are primitive and deeply-branching.
Unit 5: Everything is everywhere?
LECTURE LEARNING GOALS
1. State the Baas Becking hypothesis, and describe the environmental traits are the strongest drivers of microbial community.
2. Explain how to measure community dissimilarity. Explain why the Baas Becking hypothesis continues to be tested today.
3. Describe methods to link taxonomic or community structure to function.
Unit 4: Biofilms & Motility
LECTURE LEARNING GOALS
• Describethethreetypesofbacterialbiofilm, and how each develop.
• Contrastthedifferentwaysthatmicrobes move using flagella. Explain the ways that bacterial and archaeal flagella are different. Describe non-flagellar movement.
• Giveexamplesofhowmicrobesmovefrom the phyla spirochetes and bacteroidetes.
Unit 3: Microbiology of Early Earth
LECTURE LEARNING GOALS
• Describe the early Earth environment, and prevailing theories for the origins of life.
• Describe the major events in the evolution of cellular life, and when they happened.
• Explain the lines of evidence that lead us to know when early life arose, and the scientific basis behind each line.
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...PsychoTech Services
A proprietary approach developed by bringing together the best of learning theories from Psychology, design principles from the world of visualization, and pedagogical methods from over a decade of training experience, that enables you to: Learn better, faster!
SDSS1335+0728: The awakening of a ∼ 106M⊙ black hole⋆Sérgio Sacani
Context. The early-type galaxy SDSS J133519.91+072807.4 (hereafter SDSS1335+0728), which had exhibited no prior optical variations during the preceding two decades, began showing significant nuclear variability in the Zwicky Transient Facility (ZTF) alert stream from December 2019 (as ZTF19acnskyy). This variability behaviour, coupled with the host-galaxy properties, suggests that SDSS1335+0728 hosts a ∼ 106M⊙ black hole (BH) that is currently in the process of ‘turning on’. Aims. We present a multi-wavelength photometric analysis and spectroscopic follow-up performed with the aim of better understanding the origin of the nuclear variations detected in SDSS1335+0728. Methods. We used archival photometry (from WISE, 2MASS, SDSS, GALEX, eROSITA) and spectroscopic data (from SDSS and LAMOST) to study the state of SDSS1335+0728 prior to December 2019, and new observations from Swift, SOAR/Goodman, VLT/X-shooter, and Keck/LRIS taken after its turn-on to characterise its current state. We analysed the variability of SDSS1335+0728 in the X-ray/UV/optical/mid-infrared range, modelled its spectral energy distribution prior to and after December 2019, and studied the evolution of its UV/optical spectra. Results. From our multi-wavelength photometric analysis, we find that: (a) since 2021, the UV flux (from Swift/UVOT observations) is four times brighter than the flux reported by GALEX in 2004; (b) since June 2022, the mid-infrared flux has risen more than two times, and the W1−W2 WISE colour has become redder; and (c) since February 2024, the source has begun showing X-ray emission. From our spectroscopic follow-up, we see that (i) the narrow emission line ratios are now consistent with a more energetic ionising continuum; (ii) broad emission lines are not detected; and (iii) the [OIII] line increased its flux ∼ 3.6 years after the first ZTF alert, which implies a relatively compact narrow-line-emitting region. Conclusions. We conclude that the variations observed in SDSS1335+0728 could be either explained by a ∼ 106M⊙ AGN that is just turning on or by an exotic tidal disruption event (TDE). If the former is true, SDSS1335+0728 is one of the strongest cases of an AGNobserved in the process of activating. If the latter were found to be the case, it would correspond to the longest and faintest TDE ever observed (or another class of still unknown nuclear transient). Future observations of SDSS1335+0728 are crucial to further understand its behaviour. Key words. galaxies: active– accretion, accretion discs– galaxies: individual: SDSS J133519.91+072807.4
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Sérgio Sacani
Context. The observation of several L-band emission sources in the S cluster has led to a rich discussion of their nature. However, a definitive answer to the classification of the dusty objects requires an explanation for the detection of compact Doppler-shifted Brγ emission. The ionized hydrogen in combination with the observation of mid-infrared L-band continuum emission suggests that most of these sources are embedded in a dusty envelope. These embedded sources are part of the S-cluster, and their relationship to the S-stars is still under debate. To date, the question of the origin of these two populations has been vague, although all explanations favor migration processes for the individual cluster members. Aims. This work revisits the S-cluster and its dusty members orbiting the supermassive black hole SgrA* on bound Keplerian orbits from a kinematic perspective. The aim is to explore the Keplerian parameters for patterns that might imply a nonrandom distribution of the sample. Additionally, various analytical aspects are considered to address the nature of the dusty sources. Methods. Based on the photometric analysis, we estimated the individual H−K and K−L colors for the source sample and compared the results to known cluster members. The classification revealed a noticeable contrast between the S-stars and the dusty sources. To fit the flux-density distribution, we utilized the radiative transfer code HYPERION and implemented a young stellar object Class I model. We obtained the position angle from the Keplerian fit results; additionally, we analyzed the distribution of the inclinations and the longitudes of the ascending node. Results. The colors of the dusty sources suggest a stellar nature consistent with the spectral energy distribution in the near and midinfrared domains. Furthermore, the evaporation timescales of dusty and gaseous clumps in the vicinity of SgrA* are much shorter ( 2yr) than the epochs covered by the observations (≈15yr). In addition to the strong evidence for the stellar classification of the D-sources, we also find a clear disk-like pattern following the arrangements of S-stars proposed in the literature. Furthermore, we find a global intrinsic inclination for all dusty sources of 60 ± 20◦, implying a common formation process. Conclusions. The pattern of the dusty sources manifested in the distribution of the position angles, inclinations, and longitudes of the ascending node strongly suggests two different scenarios: the main-sequence stars and the dusty stellar S-cluster sources share a common formation history or migrated with a similar formation channel in the vicinity of SgrA*. Alternatively, the gravitational influence of SgrA* in combination with a massive perturber, such as a putative intermediate mass black hole in the IRS 13 cluster, forces the dusty objects and S-stars to follow a particular orbital arrangement. Key words. stars: black holes– stars: formation– Galaxy: center– galaxies: star formation
Signatures of wave erosion in Titan’s coastsSérgio Sacani
The shorelines of Titan’s hydrocarbon seas trace flooded erosional landforms such as river valleys; however, it isunclear whether coastal erosion has subsequently altered these shorelines. Spacecraft observations and theo-retical models suggest that wind may cause waves to form on Titan’s seas, potentially driving coastal erosion,but the observational evidence of waves is indirect, and the processes affecting shoreline evolution on Titanremain unknown. No widely accepted framework exists for using shoreline morphology to quantitatively dis-cern coastal erosion mechanisms, even on Earth, where the dominant mechanisms are known. We combinelandscape evolution models with measurements of shoreline shape on Earth to characterize how differentcoastal erosion mechanisms affect shoreline morphology. Applying this framework to Titan, we find that theshorelines of Titan’s seas are most consistent with flooded landscapes that subsequently have been eroded bywaves, rather than a uniform erosional process or no coastal erosion, particularly if wave growth saturates atfetch lengths of tens of kilometers.
TOPIC OF DISCUSSION: CENTRIFUGATION SLIDESHARE.pptxshubhijain836
Centrifugation is a powerful technique used in laboratories to separate components of a heterogeneous mixture based on their density. This process utilizes centrifugal force to rapidly spin samples, causing denser particles to migrate outward more quickly than lighter ones. As a result, distinct layers form within the sample tube, allowing for easy isolation and purification of target substances.
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...Scintica Instrumentation
Targeting Hsp90 and its pathogen Orthologs with Tethered Inhibitors as a Diagnostic and Therapeutic Strategy for cancer and infectious diseases with Dr. Timothy Haystead.
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...Advanced-Concepts-Team
Presentation in the Science Coffee of the Advanced Concepts Team of the European Space Agency on the 07.06.2024.
Speaker: Diego Blas (IFAE/ICREA)
Title: Gravitational wave detection with orbital motion of Moon and artificial
Abstract:
In this talk I will describe some recent ideas to find gravitational waves from supermassive black holes or of primordial origin by studying their secular effect on the orbital motion of the Moon or satellites that are laser ranged.
2. Lecture Learning Goals
• Describe how genes are identified.
• Distinguish between an open reading frame, a genome feature, a
gene, and a protein coding region.
• Explain how genomes are annotated and the kinds of databases that
are used to classify genes.
• List the genes involved in cellular metabolism, for both energy
generation (catabolism) and cell growth (anabolism).
• Explain the idea behind metabolic models, and describe one
application.
2
4. Open Reading Frames
• Some ORFs are located one strand, and others, on the other strand- facing in the
opposite orientation. The strands are designated as + or – and ORFs are
diagrammed as located on the top (+) or bottom (-) strand template. The diagram
below shows that most ORFs are in the same orientation for S-TIM5
bacteriophage.
• For the ORFs located above the line, ‘upstream’, where the promoter is located
(the 5’ end of the ORF), is to the left.
• Open reading frames (ORFs) are sections of the genome that are flanked by start
and stop codons, and thus can be readily identified with computer algorithms.
Algorithm identify ORFs that may or may not be used by the cell to produce a
protein (termed CDS- coding sequence).
4
6. Origin of replication
• Models for bacterial (A) and eukaryotic
(B) DNA replication initiation.
• A) Circular bacterial chromosomes contain a
cis-acting element, the replicator, that is
located at or near replication origins.
• B) Linear eukaryotic chromosomes contain
many replication origins.
• Most bacterial chromosomes are
circular and contain a single origin of
chromosomal replication (oriC).
• Origins in bacteria contain three
functional elements that control origin
activity:
• conserved DNA repeats that are specifically
recognized by DnaA (called DnaA-boxes)
• an AT-rich DNA unwinding element (DUE)
• and binding sites for proteins that help
regulate replication initiation
6
7. Ribosomal operons tend to locate near the origin
of replication
• rRNA is the ribosomal RNA, a major constituent of the ribosome, accounting for about 2/3
of its mass
• A large number of ribosomes is required for growing cells
• Fast-growing cells have many copies of the ribosomal operon
7
http://book.bionumbers.org/how-many-ribosomal-rna-gene-copies-are-in-the-genome/
8. GC skew
• The leading (single) strand tends to
have more Gs than Cs, though the
number of each base are the same
when you examine all base pairs
(double stranded).
• The difference is referred to as GC
skew, which can be examined to
locate the origin of replication.
• When the G content exceed the C
content, this is considered a positive
skew and indicates a leading strand.
8
Billings et al., Standards in Genomic Sciences 2015
9. key elements to genome annotation
1. The program scans through the sequence to identify rRNA and tRNA genes.
• rRNA = ribosomal RNA genes, structural RNA in the ribosome with ribosomal proteins
• tRNA = transfer RNA genes, connects the amino acid to the mRNA for growing proteins
2. The program predicts gene-encoding regions (also known as Open Reading
Frames, or ORFs)
3. The program looks for other elements of interest (phages, CRISPR arrays, etc)
4. Compare the sequence of a feature (any of items 1-3) to a reference database
of sequences with known functions. If the sequence looks similar to what has
already been annotated in the database (hopefully based on experimental
evidence), then it assigns the same function to this sequence - whether or not
that is actually what it does! But it's the best we can do.
9
10. Ribosomes and non-coding RNA
• Ribosomes are mostly coded in operons
• Ribosome structure requires 3 types of structural RNA molecules: 5s, 16s and
23s rRNAs
• Ribosomes also require proteins; these are also good phylogenetic markers
• Unlinked rRNA genes are widespread among bacteria and archaea
10
Brewer et al., ISMEJ 2019
11. Annotate Genomes with Prokka
• Number of genes predicted
• aka total CDS
• aka total coding sequences
• Number of protein coding genes
• Number of genes with non-hypothetical
function
• Number of genes with EC number
• Total tRNAs
• Total rRNAs
11
Seemann, Bioinformatics 2014
12. How many ORFs are annotated?
• UP to half of all ORFs have no known homologs… !
• Orphan genes, or ORFans … usually considered unique to a very narrow taxon,
generally a species
• Orphans are a subset of taxonomically-restricted genes (TRGs), which are
unique to a specific taxonomic level (e.g. plant-specific)
• Non-homology based methods based on the context and the interactions of a
protein may help identify missing metabolic activities and functional
annotation
• Why?
• Some are sequencing errors
• Some may be derived from horizontal gene transfer, duplication and
divergence, or de novo origination
• Some could be non-coding RNAs
12
13. Pseudogenes
• Pseudogenes are nonfunctional segments of DNA that resemble
functional genes
• Most bacterial pseudogenes are found in non-free-living organisms,
like symbionts or obligate intracellular parasites
• These will (generally) not be included in genome annotations
13
14. Categorizing protein coding genes
• Many organizational schemes categorize protein coding genes
• Which one you choose depends upon which are available your goals
• Common options include:
• Enzyme (enzyme nomenclature) and EC numbers,
• FIGfams (functional homologs, part of SEED subsystems),
• Pfam and TIGRfam (curated protein families),
• COG (curated clusters of orthologous groups of proteins),
• KO (KEGG Orthology), KEGG (metabolic pathways and reactions),
• InterPro (protein families and domains),
• GO (gene ontologies),
• LIGAND (compounds), and
• MetaCyc (metabolic pathways)
14
https://img.jgi.doe.gov/datasource.html
15. Categorizing protein coding genes: EC number
• EC number stand for Enzyme Commission number
• EC numbers are assigned by the Nomenclature Committee of the
International Union of Biochemistry and Molecular Biology
15
16. Categorizing protein coding genes: EC number
• EC numbers have four positions which describe exactly what kind of
reaction the enzyme catalyzes
• An example is beta-glucosidase, the terminal exonuclease in the
depolymerization of cellulose to sugars
16
EC 3.2.1.21
general type of
reaction catalyzed
by the enzyme;
EC 3 group is
hydrolyase
https://www.qmul.ac.uk/sbcs/iubmb/enzyme/EC3/2/1/21.html
Subclass of the
top-level group;
EC 3.2 group is
glycosylases
Sub-subclass of the
top-level group;
EC 3.2.1 group is
Glycosidases, i.e.
enzymes hydrolysing
O- and S-glycosyl
compounds
serial number of the
enzyme in its sub-subclass;
β-glucosidase, Hydrolysis of
terminal, non-reducing β-
D-glucosyl residues with
release of β-D-glucose
17. Categorizing protein coding genes: FIGfams
• The original SEED Project was started in 2003 by the Fellowship for Interpretation
of Genomes (FIG) as an open source effort
• annotation is done by the
curation of subsystems across
many genomes, not on a gene-
by-gene basis
• From the curated subsystems we
extract a set of freely available
protein families (FIGfams)
• These FIGfams form the core
component of the RAST server
(RAST=Rapid Annotation using
Subsytems Technology)
17
https://www.theseed.org/wiki/Home_of_the_SEED
19. Categorizing protein coding genes: FIGfams
• Each FIGfam is a set of proteins that are believed to be isofunctional
homologs
• they all are believed to implement the same function,
• and they are believed to derive from a common ancestor because they
appear to be similar
19
20. Categorizing protein coding genes: pfams
• The Pfam database is a large collection of protein families, each
represented by multiple sequence alignments and hidden Markov
models (HMMs).
• Pfam 34.0 (March 2021, 19179 entries)
• The general purpose of the Pfam database is to provide a complete
and accurate classification of protein families and domains
20
http://pfam.xfam.org; Mistry et al., Nucleic Acids Research, 2020
21. Categorizing protein coding genes: pfams
• Proteins may have multiple pfams, since domains are characterized
21
Mistry et al., Nucleic Acids Research, 2020
• Newly revised the Pfam entries
that cover the SARS-CoV-2
proteome, with new entries for
regions not covered by Pfam.
• The structure of NSP15 from
Kim et al. shows the three new
Pfam domains,
• (i) CoV_NSP15_N Coronavirus
replicase domain in red,
• (ii) CoV_NSP15_M Coronavirus
replicase NSP15 domain in blue,
• (iii) CoV_NSP15_C Coronavirus
replicase NSP15, uridylate-specific
endoribonuclease in green.
22. Categorizing protein
coding genes: COGs
• Clusters of Orthologous Genes
(COGs)
• relatively small collection of fewer
than 5000 clusters of orthologous
proteins (COGs) consists of the
products of the most widespread
bacterial and archaeal genes
22
https://www.ncbi.nlm.nih.gov/research/COG
23. Categorizing protein coding genes: COGs
23
Shields et al., mSphere 2018
• An example of how COGs are used in analyzing change in relative
abundance of protein coding genes across treatments
24. Categorizing protein coding genes: KEGG and KO
• KEGG: Kyoto Encyclopedia of Genes and Genomes
• KEGG is a database resource for understanding high-level functions
and utilities of the biological system, such as the cell, the organism
and the ecosystem, from molecular-level information, especially
large-scale molecular datasets generated by genome sequencing and
other high-throughput experimental technologies.
24
https://www.genome.jp/kegg/
26. Categorizing protein coding genes: KEGG and KO
• KEGG consists of
eighteen original
databases in four
categories
26
Kanehisa et al., Nucleic Acids Research 2020
30. Categorizing protein coding genes: KEGG and KO
• The KO (KEGG Orthology) database is a database of molecular
functions represented in terms of functional orthologs.
• A functional ortholog is manually defined in the context of KEGG molecular
networks, namely, KEGG pathway maps, BRITE hierarchies and KEGG
modules.
• Each node of the network, such as a box in the KEGG pathway map, is given a
KO identifier (called K number) as a functional ortholog defined from
experimentally characterized genes and proteins in specific organisms, which
are then used to assign orthologous genes in other organisms based on
sequence similarity.
• The granularity of "function" is context-dependent, and the resulting KO
grouping may correspond to a group of highly similar sequences within a
limited organism group or it may be a more divergent group.
30
31. Categorizing protein coding genes: KEGG and KO
• The KO (KEGG Orthology) database
• KEGG pathway maps are drawn based on experimental evidence in
specific organisms but they are designed to be applicable to other
organisms as well, because different organisms, such as human and
mouse, often share identical pathways consisting of functionally
identical genes, called orthologous genes or orthologs
31
32. Metabolism
• All chemical reactions inside a cell
• Metabolic pathways are the stepwise reactions that generate energy
by breaking down larger molecules (catabolism) or that are
biosynthetic and require energy (anabolism)
32
https://openstax.org/books/microbiology/pages/8-1-energy-matter-and-enzymes
33. Metabolism
• The energy currency
of cells include ATP,
NAD+, NADP+, and
FAD
• Exergonic reactions
are coupled to
endergonic reactions
to make the
combinations
favorable
33
https://openstax.org/books/microbiology/pages/8-1-energy-matter-and-enzymes
34. Catabolism of carbohydrates: glycolysis
• the most common pathway for the metabolism of glucose
• Produces energy, reduced electron carriers, and precursor molecules
for anabolism
• Can be coupled to aerobic or anaerobic growth
• Glycolysis
• Embden-Meyerhof-Parnoff pathway, aka “glycolysis”
• Entner-Doudoroff pathway is an alternative glycolysis
• Pentose-phosphate pathway processes five-carbon sugars
34
35. Glycolysis, the “upper” half
• 2 ATPs are used to
phosphorylate
glucose, which is
then split into two
3-carbon molecules
35
https://openstax.org/books/microbiology/pages/c-metabolic-pathways
36. Glycolysis, the “lower” half
• Further phosphorylation
requires NAD+, producing
4 ATPs per glucose
• Net 2 ATP per glucose
36
https://openstax.org/books/microbiology/pages/c-metabolic-pathways
37. Substrate-level phosphorylation
• One of two enzymatic reactions in the energy payoff phase of
glycolysis generates ATP
37
https://openstax.org/books/microbiology/pages/8-2-catabolism-of-carbohydrates
38. Entner-Doudoroff
pathway
• to catabolize glucose to
pyruvate, ED uses the
unique enzymes
• 6-phosphogluconate
dehydratase aldolase
(EC 4.2.1.12) and
• 2-keto-deoxy-6-
phosphogluconate
aldolase (EC 4.2.1.14)
(KDPG)
38
https://openstax.org/books/microbiology/pages/c-metabolic-pathways
39. Entner-Doudoroff pathway
• EMP glycolysis generates
net 2 ATP per glucose
• ED glycolysis only generates
one ATP per glucose
39
Flamholz et al., PNAS 2013
40. Entner-Doudoroff pathway
• EMP glycolysis generates
net 2 ATP per glucose
• ED glycolysis only generates
one ATP per glucose
• Why?
40
Flamholz et al., PNAS 2013
41. Entner-Doudoroff pathway
• “ED pathway is expected to
require several-fold less
enzymatic protein to
achieve the same glucose
conversion rate as the EMP
pathway”
41
Flamholz et al., PNAS 2013
42. Entner-Doudoroff pathway
• “energy-deprived anaerobes
overwhelmingly rely upon
the higher ATP yield of the
EMP pathway, whereas the
ED pathway is common
among facultative
anaerobes and even more
common among aerobes”
42
Flamholz et al., PNAS 2013
43. Pentose-Phosphate pathway
• aka phosphogluconate pathway and the hexose monophosphate shunt
• Parallels glycolysis, generates NADPH and 5C sugars as well as ribose 5-
phosphate, a precursor for the synthesis of nucleotides from glucose
43
https://openstax.org/books/microbiology/pages/c-metabolic-pathways
44. The Transition Reaction
• Glycolysis produces pyruvate, which can be further oxidized to
generate more energy
• For this to happen, pyruvate must be decarboxylated (below, left)
• This is accomplished by the Coenyzyme-A (“CoA”, below, right)
44
https://openstax.org/books/microbiology/pages/c-metabolic-pathways
45. Tricarboxylic Acid (TCA) Cycle
• Closed loop pathway in 8 steps that capture the 2C acetyl group of
acetyl-CoA, producing 2 CO2, 1 ATP, 3 NADH and 1 FADH2
45
https://openstax.org/books/microbiology/pages/8-2-catabolism-of-carbohydrates
46. TCA cycle intersects anabolism and catabolism
• As well as generating energy,
intermediate compounds are
precursors for biosynthesis of
• amino acids,
• chlorophylls,
• fatty acids, and
• nucleotides
• TCA cycle is anabolic and
catabolic
46
https://openstax.org/books/microbiology/pages/8-2-catabolism-of-carbohydrates
48. Respiration
• Most cellular ATP is
generated by oxidative
phosphorylation
• As opposed to substrate-
level phosphorylation
• In oxidative
phosphorylation, ATP is
formed from the transfer
of electrons from NADH
or FADH2 to O2 by a
series of electron
carriers
• How much ATP depends
on the terminal electron
acceptor
• More ATP from O2 than
from NO3
-, SO4
2-, Fe3+,
CO2, other inorganics
48
https://openstax.org/books/microbiology/pages/8-3-cellular-respiration
49. Electron Transport Chain
• A series of electron
carriers and ion pumps
embedded in the cell
membrane that pump
protons (H+) across a
membrane
• Proton motive force is
generated by expelling
protons outside of the cell
• Protons then want to flow
across the membrane, but
must go through the ATP
synthase, which drives
ATP production
49
https://openstax.org/books/microbiology/pages/c-metabolic-pathways
50. Carbohydrate Active Enzymes (CAZy)
http://www.cazy.org
Modules that catalyze the breakdown, biosynthesis or modification of
carbohydrates and glycoconjugates :
• Glycoside Hydrolases (GHs) : hydrolysis and/or rearrangement of glycosidic bonds
• GlycosylTransferases (GTs) : formation of glycosidic bonds
• Polysaccharide Lyases (PLs) : non-hydrolytic cleavage of glycosidic bonds
• Carbohydrate Esterases (CEs) : hydrolysis of carbohydrate esters
• Auxiliary Activities (AAs) : redox enzymes that act in conjunction with CAZymes.
Associated Modules currently covered
• Carbohydrate-Binding Modules (CBMs) : adhesion to carbohydrates
50
51. Metabolic Modeling
• Combination of genome
sequence with physiology to
predict growth
• Mathematical network
model that represents the
systems biology of metabolic
pathways within an organism
51
Sertbas & Ulgen, Front. C.D.B, 2020
53. Metabolic models to identify novel antimicrobial
drug targets and develop new antibiotics
53
https://doi.org/10.1038/s41429-020-00366-2
54. Metabolic models improve food fermentation
• Lactic acid bacteria like Lactococcus
lactis make lactic acid from sugars in
foods like cheese, yogurt, wine, salami,
and sauerkraut
• They also make therapeutic proteins &
flavor ingredients
• By targeting the lac operon (below),
genetic engineers can tune metabolic
pathways and products (left)
54
https://doi.org/10.1016/j.tibtech.2003.11.011
55. Lecture Learning Goals
• Describe how genes are identified.
• Distinguish between an open reading frame, a genome feature, a
gene, and a protein coding region.
• Explain how genomes are annotated and the kinds of databases that
are used to classify genes.
• List the genes involved in cellular metabolism, for both energy
generation (catabolism) and cell growth (anabolism).
• Explain the idea behind metabolic models, and describe one
application.
55