Concatenate hits to
Set up workflow, binaries, and reference /
Deploy to machines.
Protein-protein blast reads (from MG-
RAST repository, Bass Strait oil field)
against 458 core eukaryote genes from
CEGMA. Keep only top hits. Use max.
Append top hit sequences to CEGMA
Align in MUSCLE using default
Infer de novo phylogeny in RAxML
under Dayhoff, random starting tree
and max. PTHREADS.
Output and parse times.
Raspi in practice"
• ARM not x86
• 2 GB RAM… "
The cloud in practice"
• Fiddly setup, easy to
• Need a connection to
get data up there (and
• Pi opportunities but not there yet, also you’ll
still need a connection unless you’re very
• Installation in situ?"
• Consider cloud computing (connections can
• Portability of the workﬂow enhances
portability of the system!
– …which you should be embracing anyway for