Keynote presentation to the 2017 GARR conference, 17 November 2017, Venice, Italy. Introduction to natural history data types and analysis examples. Discussion of current practices in promoting reproducibility.
Species delimitation - species limits and character evolutionRutger Vos
Lecture slides for the program orientation Evolutionary Biology at the Institute of Biology Leiden, the Netherlands. Thursday, September 7th, 2017.
Lecture notes are here: https://docs.google.com/document/d/e/2PACX-1vRIv5mKK1fjBby--u97emC7hrqXUbxFQZe63P1FpguuhHLG6xykbwXKeKXCUE5W-LSpakXYCI621xCK/pub
Iczn(The International Commission on Zoological Nomenclature )Al Nahian Avro
The International Commission on Zoological Nomenclature (ICZN) acts as adviser and arbiter for the zoological community by generating and disseminating information on the correct use of the scientific names of animals. The ICZN is responsible for producing the International Code of Zoological Nomenclature - a set of rules for the naming of animals and the resolution of nomenclatural problems.
Species delimitation - species limits and character evolutionRutger Vos
Lecture slides for the program orientation Evolutionary Biology at the Institute of Biology Leiden, the Netherlands. Thursday, September 7th, 2017.
Lecture notes are here: https://docs.google.com/document/d/e/2PACX-1vRIv5mKK1fjBby--u97emC7hrqXUbxFQZe63P1FpguuhHLG6xykbwXKeKXCUE5W-LSpakXYCI621xCK/pub
Iczn(The International Commission on Zoological Nomenclature )Al Nahian Avro
The International Commission on Zoological Nomenclature (ICZN) acts as adviser and arbiter for the zoological community by generating and disseminating information on the correct use of the scientific names of animals. The ICZN is responsible for producing the International Code of Zoological Nomenclature - a set of rules for the naming of animals and the resolution of nomenclatural problems.
Modeling the biosphere: the natural historian's perspectiveRutger Vos
Natural history collections of specimens are a rich source of data for discovering the patterns of biodiversity in space and time and for furthering our understanding of the underlying processes that generate these patterns. Modeling the biosphere in this manner can help address global challenges in relation to climate change, food security, emerging disease and conservation. (Talk to the 3rd annual eScience symposium, 8 October 2015).
lecture for doctorate students while I was working as researcher assisstance about phylogenetic science, definition,
Understand the most basic concepts of phylogeny
Understand the difference between orthology, paralogy and xenology.
Be able to compute simple phylogenetic trees
Understand what bootstrapping means in phylogeny
Webs of Life and Data: Impacts of open and networked data on scientific pract...Sarah Anna Stewart
A presentation on my research context presented at the Oxford Internet Institute, University of Oxford as part of the DPhil Doctoral Seminar Series on Nov. 22, 2017. This early-stage presentation provides some background and context and introduces my research topic.
This presentation is licensed for re-use under Creative Commons CC-BY license. Please cite the following DOI:
http://doi.org/10.5281/zenodo.1066021
https://www.zenodo.org/record/1066021#.WhgWnVVl_cs
Modeling the biosphere: the natural historian's perspectiveRutger Vos
Natural history collections of specimens are a rich source of data for discovering the patterns of biodiversity in space and time and for furthering our understanding of the underlying processes that generate these patterns. Modeling the biosphere in this manner can help address global challenges in relation to climate change, food security, emerging disease and conservation. (Talk to the 3rd annual eScience symposium, 8 October 2015).
lecture for doctorate students while I was working as researcher assisstance about phylogenetic science, definition,
Understand the most basic concepts of phylogeny
Understand the difference between orthology, paralogy and xenology.
Be able to compute simple phylogenetic trees
Understand what bootstrapping means in phylogeny
Webs of Life and Data: Impacts of open and networked data on scientific pract...Sarah Anna Stewart
A presentation on my research context presented at the Oxford Internet Institute, University of Oxford as part of the DPhil Doctoral Seminar Series on Nov. 22, 2017. This early-stage presentation provides some background and context and introduces my research topic.
This presentation is licensed for re-use under Creative Commons CC-BY license. Please cite the following DOI:
http://doi.org/10.5281/zenodo.1066021
https://www.zenodo.org/record/1066021#.WhgWnVVl_cs
Finding the annotation needs of the botanical community in a digital libraryWilliam Ulate
The Center for Biodiversity Informatics at the Missouri Botanical Garden and Saint Louis University are analyzing the web annotation needs of the botanical community to develop a prototype of how those needs may be met within a digital library platform. We want to assess the practicality of existing tools to satisfy the technical, economic, and operational needs of botanical users to annotate. This will inform on requisites, best practices, and further developments for a research project to integrate an annotation tool within a virtual library. We surveyed 14 members of 10 different institutions in the botanical and scientific communities. We included both, those who currently annotate online as well as those who have only annotated offline (e.g. print or analog), in order to better understand the functionality needed to encourage and support online annotation activities. The answers to this survey were analyzed in the context of an annotation tool in a digital library and a prioritized list of annotation needs for users of a botanical virtual library was produced, taking into account the minimal and recommended functionality required to comply with the users requirements. Preliminary results from the report of the in-depth user assessments of annotation needs in the specific domain of botanists are shared with the attendees. Advances in the definition of a prototype are also shown.
Phylogenomic methods for comparative evolutionary biology - University Colleg...Joe Parker
Invited research seminar given to MSc students at University College Dublin on 24th October 2013.
I introduce the discipline of phylogenomics - comparative phylogenetic analyses of DNA sequences across genomes - and some of the applications and recent breakthroughs in the field.
As an in-depth case study I explain the methods and significance of our 2013 Nature paper on adaptive genotypic molecular convergence in echolocating mammals.
I then highlight some of the avenues of study on the frontiers of current research.
Interpreting ‘tree space’ in the context of very large empirical datasetsJoe Parker
Seminar presented at the Maths Department, University of Portsmouth, 19th November 2014
Evolutionary biologists represent actual or hypothesised evolutionary relations between living organisms using phylogenies, directed bifurcating graphs (trees) that describe evolutionary processes in terms of speciation or splitting events (nodes) and elapsed evolutionary time or distance (edges). Molecular evolution itself is largely dominated by mutations in DNA sequences, a stochastic process. Traditionally, probabilistic models of molecular evolution and phylogenies are fitted to DNA sequence data by maximum likelihood on the assumption that a single simple phylogeny will serve to approximate the evolution of a majority of DNA positions in the dataset. However modern studies now routinely sample several orders of magnitude more DNA positions, and this assumption no longer holds. Unfortunately, our conception of ‘tree space’ - a notional multidimensional surface containing all possible phylogenies - is extremely imprecise, and similarly techniques to model phylogeny model fitting in very large datasets are limited. I will show the background to this field and present some of the challenges arising from the present limited analytical framework.
This presentation starts with basic information about the Social Science Data Archives. Then it mostly introduces complexity and diversity of research data field. Participants can learn about Open Data project in Slovenia, about research lifecycle and research data lifecycle. And it concludes with roles and responsibilities in research data lifecycle.
Event was one of Foster Cessda trainings for doctoral students.
Videos: http://videolectures.net/adptecaj2015_ljubljana/
Related link: https://www.fosteropenscience.eu/event/cessda-research-data-management-open-data-doctoral-training-series-research-data-management
Frontiers of discovery with Encyclopedia of LifeCyndy Parr
Presented at the National Museum of Natural History, Smithsonian Institution 18 June 2014
Describes, among other things, development of the TraitBank repository of species attributes, and the use of EOL and TraitBank in scientific research.
Slicing and Dicing a Newspaper Corpus for Historical Ecology ResearchMarieke van Erp
Presented at EKAW 2018
Historical newspapers are a novel source of information for historical ecologists to study the interactions between humans and animals through time and space. Newspaper archives are particularly interesting to analyse because of their breadth and depth. However, the size and the occasional noisiness of such archives also brings difficulties, as manual analysis is impossible. In this paper, we present experiments and results on automatic query expansion and categorisation for the perception of animal species between 1800 and 1940. For query expansion and to the manual annotation process, we used lexicons. For the categorisation we trained a Support Vector Machine model. Our results indicate that we can distinguish newspaper articles that are about animal species from those that are not with an F 1 of 0.92 and the subcategorisation of the different types of newspapers on animals up to 0.84 F 1 .
Multiplying method: Ethnography and the reconceptualization of evaluation stu...Gemma Derrick
Discussion session hosted by Pau Wouters and Sarah de Rijcke at the Workshop exploring Qualitative and Mixed Methods in Research Evaluation and Policy 2015 (QMM2015)
This talk describes the potential semantic web technology has to make the practice of taxonomy easier. It was presented at the 2011 Phycological Society of America conference in Seattle, WA, USA.
From January-March 2016, inaugural EES Cairo Fellow, Melanie Pitkin, is presenting a series of workshops focused on 'Research Skills for Egyptology' primarily at the Greek Campus in Downtown Cairo, but also across other parts of Egypt. This is a copy of Melanie's presentation. To find out more upcoming workshops, or for any questions you might have related to the content, please contact Melanie: melanie.pitkin@ees.ac.uk.
Anthropology
REFERENCES:
Ember, C. (2007). Anthropology. Singapore: Pearson Educational South Asia.
Ember, C., Ember, M., & Peregrine, P. (2009). Human evolution and culture: Highlights of anthropology. (6th ed.). New Jersey: Pearson Prentice Hall, Inc.
Ervin, A. (2005). Applied anthropology: Tools and perspectives for contemporary practice. Boston: Pearson.
Kottak, C. (2011). Anthropology: Appreciating cultural diversity. New York: Mc Graw-Hill.
Kottak, C. (2008). Anthropology: The explanation of human diversity. Boston: Mc Graw-Hill.
Launda, R. (2010). Core concepts in cultural anthropology. Boston: McGraw-Hill Higher Education.
Nanda, S. (2007). Cultural anthropology. Belmont, California: Walsworth/Thomson Learning.
This topic explains genomics and proteomics and types of genomics and proteoics and explains about positional cloning,microsatellites,SNP,VNTRS,HUMAN GENOME PRPJECT
Similar to Natural history research as a replicable data science (20)
Onderzoek bio-informatica Naturalis. Raad voor Cultuur 2017.Rutger Vos
Presentatie voor leden van de Raad voor Cultuur, 27 juni 2017, Naturalis. Geeft een overzicht van de onderzoeksactiviteiten aan collectiemateriaal met een bio-informatische component.
Presentation about image recognition applied to digitized specimen of the Van Groenendael Krijger collection of Javanese Papilionid butterflies. Occasion: BrainFood, 12 April 2017, Naturalis, Leiden, the Netherlands.
Taxonomic classification of digitized specimens using machine learningRutger Vos
Progress in the development of neural networks that classify images of slipper orchids and Javanese butterflies. Talk to LEBEN at Leiden University's biology department, IBL, 20 September 2016.
Self-Updating Platform for the Estimation of Rates of Speciation, Migration A...Rutger Vos
Slides for my lightning talk on the SUPERSMART platform to the SSB/SSE/ASN annual meeting, Austin, TX, USA. SSB Spotlight Session: "Next generation phylogenetic inference 2". Monday, June 20th 2016, 3:20PM, Ballroom A.
Hoe leer je een robot soorten te herkennen?Rutger Vos
Guest lecture slides for the bioinformatics student union (Exon) at the university of applied sciences, Leiden, the Netherlands. In this lecture I present the results of a research project at Naturalis Biodiversity Center to identify slipper orchids using image recognition techniques.
Kunnen we een tomaat van 400 jaar oud proevenRutger Vos
Slides voor mijn college aan de Museum Jeugd Universiteit (http://museumjeugduniversiteit.nl) in Museum Boerhaave (http://www.museumboerhaave.nl), 19 october 2014.
PhyloTastic: names-based phyloinformatic data integrationRutger Vos
Lightning talk to the 2013 TDWG conference symposium on phyloinformatics, brief report on PhyloTastic with special attention to the taxonomic name reconciliation service TaxoSaurus.
NeXML is an exchange standard for representing phyloinformatic data — inspired by the commonly used NEXUS format, but more robust and easier to process.
Full title: "The tree of life as central unifying artefact for the integration of phylogenetic knowledge." This is a brief intro presentation for the 2011 BioHackathon in Kyoto, Japan. I describe a simple workflow built around semantic web services that add metadata to a backbone of the Tree of Life. The take home message is that such a structure can be a useful anchor to which knowledge can be attached, but that there are still issues with standards definition and adoption.
Course slides for computational phyloinformatics, an annual course organized by NESCent in collaboration with hosting organizations across the world. I am the teacher of the Perl section of the course, these are the slides I presented in 2010 at BGI, Shenzhen, PRC.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Sérgio Sacani
Since volcanic activity was first discovered on Io from Voyager images in 1979, changes
on Io’s surface have been monitored from both spacecraft and ground-based telescopes.
Here, we present the highest spatial resolution images of Io ever obtained from a groundbased telescope. These images, acquired by the SHARK-VIS instrument on the Large
Binocular Telescope, show evidence of a major resurfacing event on Io’s trailing hemisphere. When compared to the most recent spacecraft images, the SHARK-VIS images
show that a plume deposit from a powerful eruption at Pillan Patera has covered part
of the long-lived Pele plume deposit. Although this type of resurfacing event may be common on Io, few have been detected due to the rarity of spacecraft visits and the previously low spatial resolution available from Earth-based telescopes. The SHARK-VIS instrument ushers in a new era of high resolution imaging of Io’s surface using adaptive
optics at visible wavelengths.
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
Introduction:
RNA interference (RNAi) or Post-Transcriptional Gene Silencing (PTGS) is an important biological process for modulating eukaryotic gene expression.
It is highly conserved process of posttranscriptional gene silencing by which double stranded RNA (dsRNA) causes sequence-specific degradation of mRNA sequences.
dsRNA-induced gene silencing (RNAi) is reported in a wide range of eukaryotes ranging from worms, insects, mammals and plants.
This process mediates resistance to both endogenous parasitic and exogenous pathogenic nucleic acids, and regulates the expression of protein-coding genes.
What are small ncRNAs?
micro RNA (miRNA)
short interfering RNA (siRNA)
Properties of small non-coding RNA:
Involved in silencing mRNA transcripts.
Called “small” because they are usually only about 21-24 nucleotides long.
Synthesized by first cutting up longer precursor sequences (like the 61nt one that Lee discovered).
Silence an mRNA by base pairing with some sequence on the mRNA.
Discovery of siRNA?
The first small RNA:
In 1993 Rosalind Lee (Victor Ambros lab) was studying a non- coding gene in C. elegans, lin-4, that was involved in silencing of another gene, lin-14, at the appropriate time in the
development of the worm C. elegans.
Two small transcripts of lin-4 (22nt and 61nt) were found to be complementary to a sequence in the 3' UTR of lin-14.
Because lin-4 encoded no protein, she deduced that it must be these transcripts that are causing the silencing by RNA-RNA interactions.
Types of RNAi ( non coding RNA)
MiRNA
Length (23-25 nt)
Trans acting
Binds with target MRNA in mismatch
Translation inhibition
Si RNA
Length 21 nt.
Cis acting
Bind with target Mrna in perfect complementary sequence
Piwi-RNA
Length ; 25 to 36 nt.
Expressed in Germ Cells
Regulates trnasposomes activity
MECHANISM OF RNAI:
First the double-stranded RNA teams up with a protein complex named Dicer, which cuts the long RNA into short pieces.
Then another protein complex called RISC (RNA-induced silencing complex) discards one of the two RNA strands.
The RISC-docked, single-stranded RNA then pairs with the homologous mRNA and destroys it.
THE RISC COMPLEX:
RISC is large(>500kD) RNA multi- protein Binding complex which triggers MRNA degradation in response to MRNA
Unwinding of double stranded Si RNA by ATP independent Helicase
Active component of RISC is Ago proteins( ENDONUCLEASE) which cleave target MRNA.
DICER: endonuclease (RNase Family III)
Argonaute: Central Component of the RNA-Induced Silencing Complex (RISC)
One strand of the dsRNA produced by Dicer is retained in the RISC complex in association with Argonaute
ARGONAUTE PROTEIN :
1.PAZ(PIWI/Argonaute/ Zwille)- Recognition of target MRNA
2.PIWI (p-element induced wimpy Testis)- breaks Phosphodiester bond of mRNA.)RNAse H activity.
MiRNA:
The Double-stranded RNAs are naturally produced in eukaryotic cells during development, and they have a key role in regulating gene expression .
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani
The return of a sample of near-surface atmosphere from Mars would facilitate answers to several first-order science questions surrounding the formation and evolution of the planet. One of the important aspects of terrestrial planet formation in general is the role that primary atmospheres played in influencing the chemistry and structure of the planets and their antecedents. Studies of the martian atmosphere can be used to investigate the role of a primary atmosphere in its history. Atmosphere samples would also inform our understanding of the near-surface chemistry of the planet, and ultimately the prospects for life. High-precision isotopic analyses of constituent gases are needed to address these questions, requiring that the analyses are made on returned samples rather than in situ.
Multi-source connectivity as the driver of solar wind variability in the heli...Sérgio Sacani
The ambient solar wind that flls the heliosphere originates from multiple
sources in the solar corona and is highly structured. It is often described
as high-speed, relatively homogeneous, plasma streams from coronal
holes and slow-speed, highly variable, streams whose source regions are
under debate. A key goal of ESA/NASA’s Solar Orbiter mission is to identify
solar wind sources and understand what drives the complexity seen in the
heliosphere. By combining magnetic feld modelling and spectroscopic
techniques with high-resolution observations and measurements, we show
that the solar wind variability detected in situ by Solar Orbiter in March
2022 is driven by spatio-temporal changes in the magnetic connectivity to
multiple sources in the solar atmosphere. The magnetic feld footpoints
connected to the spacecraft moved from the boundaries of a coronal hole
to one active region (12961) and then across to another region (12957). This
is refected in the in situ measurements, which show the transition from fast
to highly Alfvénic then to slow solar wind that is disrupted by the arrival of
a coronal mass ejection. Our results describe solar wind variability at 0.5 au
but are applicable to near-Earth observatories.
This pdf is about the Schizophrenia.
For more details visit on YouTube; @SELF-EXPLANATORY;
https://www.youtube.com/channel/UCAiarMZDNhe1A3Rnpr_WkzA/videos
Thanks...!
Richard's entangled aventures in wonderlandRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Sérgio Sacani
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest
imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters
spanning 0.4−0.9µm) and novel JWST images with 14 filters spanning 0.8−5µm, including 7 mediumband filters, and reaching total exposure times of up to 46 hours per filter. We combine all our data
at > 2.3µm to construct an ultradeep image, reaching as deep as ≈ 31.4 AB mag in the stack and
30.3-31.0 AB mag (5σ, r = 0.1” circular aperture) in individual filters. We measure photometric
redshifts and use robust selection criteria to identify a sample of eight galaxy candidates at redshifts
z = 11.5 − 15. These objects show compact half-light radii of R1/2 ∼ 50 − 200pc, stellar masses of
M⋆ ∼ 107−108M⊙, and star-formation rates of SFR ∼ 0.1−1 M⊙ yr−1
. Our search finds no candidates
at 15 < z < 20, placing upper limits at these redshifts. We develop a forward modeling approach to
infer the properties of the evolving luminosity function without binning in redshift or luminosity that
marginalizes over the photometric redshift uncertainty of our candidate galaxies and incorporates the
impact of non-detections. We find a z = 12 luminosity function in good agreement with prior results,
and that the luminosity function normalization and UV luminosity density decline by a factor of ∼ 2.5
from z = 12 to z = 14. We discuss the possible implications of our results in the context of theoretical
models for evolution of the dark matter halo mass function.
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
2. Natural history museums
and collections
• Main goal is not to
exhibit but to collect and
curate specimens
• Usually multiple
specimens per species,
sometimes many more
• Specimens are research
and reference materials
3. Natural history research
To understand the patterns and processes of biodiversity
Biodiversity is expressed and studied in multiple ways:
- Species diversity, e.g.
counts of species, maybe
taking abundances into
account
- Phylogenetic diversity,
i.e. the evolutionary
distances between
species
- Functional diversity, i.e.
the ecological roles
species play, and the
characteristics associated
with that role
4. Natural history research
To understand the patterns and processes of biodiversity
The patterns and processes of biodiversity are systematized
as taking place:
- Within a given system (⍺
diversity), e.g. a biome
- Across systems (β
diversity, turnover)
- Among systems (ɣ
diversity, totality)
6. DNA barcoding
• Some genes are variable so that a
few hundred letters suffice to identify
species
• In addition, barcodes are useful for
studying evolution and phylogeny
• Taking the barcode of a specimen (by
Sanger seq) is part of the workflow of
indexing collection specimens
7. Barcoding example: species boundaries in beetles
Pentinsaari, Vos & Mutanen. 2016. Algorithmic single-locus species
delimitation: effects of sampling effort, variation and nonmonophyly in four
methods and 1870 species of beetles. Molecular Ecology Resources 17(3):
393-404
8. Metabarcoding
• The species contents of organic mixtures can also be identified using
identifiable marker genes
• This is typically done using multiplexed, high-throughput (“next
generation”) sequencing
• Consequently, data storage and processing requirements are higher
10. Species distribution modelling
• Collection specimens are (ideally) stored with their collection locality
recorded as lat/lon coordinates
• Based on the localities where specimens were found, and geospatial
data layers (climate, land use, soil, etc.) a correlative model of the
species affinities can be constructed
• With such a model, habitat suitability and predictive scenarios (e.g.
climate change) can be projected
13. Natural history data
• Highest data volumes are HTS,
3D scanning, images
• High dimensionality at multiple
scales
• Many biases in species/locality
sampling
• Many axes are messy:
- Species names have been
changing for centuries
- Likewise place names
- Trait descriptions are often
ambiguous 3d.naturalis.nl
14. The Reproducibility Crisis
• More than 70%
of researchers
(n=1576) have
tried and failed to
reproduce
another
scientist's
experiments
• More than half
have failed to
reproduce their
own experiments
15. Reproducible data science
and cultural change
1. “Data available from the author upon request”
No: data are open, as FAIR as possible
2. “Data were processed with custom scripts”
No: scripts/workflows are open source
3. “Data were analyzed on a Pentium III 450 MHz…”
No: the environment can be cloned as VM
16. 1. FAIR data management
Findable: increasing attention to
metadata, and discoverability
and indexing of data
Accessible: implementation of
resolvable identifiers, e.g.
PURLs and DOIs
Interoperable: increasing
attention for open community
standards (syntax) and
semantics
Re-usable: increasing attention
for data ownership and
licensing
17. 2. Open source
Analytical code is no longer a folder on a postdoc’s laptop, it’s a code
repository with specific versions, documentation, tests, and a license
18. 3. Virtualization
- Analyses are not run on dedicated hardware, e.g.
workstations, clusters, but in the (private) cloud
- Complex workflows are distributed as virtual machines,
docker containers, or deployed with devops tools