SlideShare a Scribd company logo
1 of 25
Download to read offline
Scien&fic	
  Compu&ng	
  Services	
  at	
  Janelia	
  
	
  
Saul	
  A.	
  Kravitz,	
  PhD	
  
HHMI	
  Janelia	
  Research	
  Campus	
  
HHMI	
  Janelia	
  Research	
  Campus	
  
• 	
  	
  Opened	
  2007,	
  100%	
  Internally	
  Funded	
  
• 	
  	
  Fundamental	
  Neuroscience	
  and	
  Imaging	
  
• 	
  	
  Research	
  Staff	
  470	
  
– 	
  50+	
  Lab	
  Groups	
  with	
  1-­‐10	
  researchers	
  
– 	
  125	
  Shared	
  Resource	
  Staff	
  
• 	
  	
  Scien&fic	
  IT:	
  	
   	
  	
  	
  4.5	
  FTE	
  
• 	
  	
  Scien&fic	
  SoXware:	
  	
  27	
  FTE	
  
Janelia	
  Differen&ators	
  
• 	
  	
  Rich	
  Shared	
  Resources	
  (core	
  services)	
  
• 	
  	
  Laboratory	
  Support	
  Services	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  -­‐	
  Reed	
  George	
  
• 	
  	
  Scien&fic	
  Compu&ng	
  SoXware	
  and	
  IT	
  	
  -­‐	
  Saul	
  Kravitz	
  
• 	
  	
  Team	
  Projects	
  
• 	
  	
  Dedicated	
  staff	
  +	
  matrixed	
  soXware	
  staff	
  
• 	
  	
  Main	
  drivers	
  for	
  data/compute	
  growth	
  
• 	
  	
  Leverage	
  lab	
  technologies	
  
• 	
  	
  Feed	
  lab	
  research	
  
Shared	
  Resources	
  
Scien&fic	
  Compu&ng	
  Systems	
  
Goran Ceric
Scien&fic	
  IT	
  Principles	
  
• Be`er	
  infrastructure	
  than	
  any	
  non-­‐profit	
  life	
  
sciences	
  research	
  ins&tu&on	
  in	
  the	
  world	
  
• None	
  of	
  our	
  scien&sts	
  should	
  say:	
  
– 	
  “I	
  wish	
  I	
  was	
  back	
  at	
  ______”	
  
• We	
  don’t	
  lose	
  data	
  
• Minimize	
  data	
  movement	
  
• 365x24x7	
  Availability	
  (-­‐	
  8	
  hrs/year	
  down&me)	
  
• Compute	
  and	
  storage	
  cost	
  recovery	
  
Computa&onal	
  Environment	
  
• 	
  	
  Fiber-­‐connected	
  Data	
  Centers:	
  	
  Janelia	
  and	
  HQ	
  
•  High	
  Throughput	
  Compu&ng	
  
–  3	
  million	
  jobs	
  scheduled	
  per	
  month	
  
–  Heterogenous	
  job	
  mix,	
  many	
  interac&ve	
  jobs	
  
• 	
  	
  Hundreds	
  of	
  Users	
  
– 	
  Wide	
  range	
  of	
  sophis&ca&on	
  
•  Trends	
  
– 	
  Spark,	
  MongoDB,	
  Scality	
  Object	
  Store	
  
	
  
Computa&onal	
  Resources	
  
•  5120	
  Core	
  Compute	
  Cluster	
  (~50%	
  u&liza&on)	
  
–  4096	
  cores	
  2.7	
  GHz	
  Intel	
  Sandy	
  Bridge	
  (E5-­‐2680)	
  
–  1024	
  cores	
  2.3	
  GHz	
  Intel	
  Haswell	
  (E5-­‐2698)	
  
–  8	
  GB	
  memory/core	
  
–  10	
  GB/s	
  interconnect	
  
–  Univa	
  Grid	
  Engine	
  Scheduler	
  
•  Small	
  GPU	
  Cluster	
  (low	
  u&liza&on)	
  
•  200-­‐core	
  VMWare	
  Cluster	
  
•  Dedicated	
  Applica&on	
  Servers	
  (including	
  databases)	
  
Storage:	
  Match	
  storage	
  class	
  to	
  use	
  case	
  
	
  •  General	
  purpose,	
  remote	
  backup	
  -­‐	
  Isilon	
  -­‐	
  (2.5	
  PB)	
  
–  Home	
  directories,	
  most	
  lab	
  files	
  
–  Cluster	
  accessible	
  
•  High	
  performance	
  scratch	
  space	
  -­‐	
  DDN	
  GPFS	
  	
  	
  (1	
  PB)	
  
–  Designed	
  for	
  larger	
  (>40MB)	
  files	
  
–  Cluster	
  accessible	
  
•  Image	
  Data	
  Storage	
  -­‐	
  Scality	
  Object	
  Store	
  (1.4	
  PB)	
  
–  Replicated	
  (offsite)	
  and	
  Unreplicated	
  Rings	
  (0.7PB	
  each)	
  
–  Cluster	
  accessible	
  via	
  API	
  
•  Cold	
  Storage,	
  no	
  backup	
  -­‐	
  Isilon	
  (1	
  PB)	
  
–  low	
  performance	
  
–  NOT	
  cluster	
  accessible	
  
	
  
	
  
Scien&fic	
  Compu&ng	
  SoXware	
  
Sean Murphy Todd Safford Rob Svirskas
•  Perl, Python
•  Web-based GUI
•  Image Processing
•  Enterprise Java
•  Relational and NoSQL DB
•  3D Graphics
•  MatLab, C++, Java, Python
•  Neuroscience domain
•  Algorithms
Scien&fic	
  Compu&ng	
  SoXware	
  
Don’t	
  be	
  fooled!	
  
Saul’s
Presentation
Rest of Scientific
Computing Activities
SoXware-­‐Intensive	
  Team	
  Projects	
  
• Mouse	
  Light	
  
– Mouse	
  Projec&ons	
  and	
  Connec&ons	
  
• Fly	
  Light	
  
– Op&cal	
  Mapping	
  of	
  the	
  Fly	
  Nervous	
  System	
  
• FlyEM	
  
– Electron	
  Microscopic	
  Reconstruc&ng	
  of	
  the	
  
Drosophila	
  Nervous	
  System	
  
Mouse	
  Light	
  
MouseLight Project (JRC)Allen Mouse Brain
Connectivity Project (AIBS)
Jayaram Chandrashekar (MouseLight)
Mouse	
  Light	
  
•  3D Mosaic Imaging of the entire mouse brain
•  Custom built 2-photon microscope (N. Clack)
•  ~20,000 tiles (0.3um x 0.3um x 1.0um)
•  ~15TB/Chan
•  Stitched/rendered into seamless 3D volume
Jayaram Chandrashekar (MouseLight)
Layer 2/3 IT-type Layer 5a IT-type
Jayaram Chandrashekar (MouseLight)
Tools	
  for	
  Confocal	
  Imaging	
  of	
  Fly	
  Brains	
  
• 	
  	
  Confocal	
  imaging	
  of	
  Gal4	
  and	
  Split-­‐Gal4	
  Lines	
  	
  
– 	
  sparse	
  labeling	
  of	
  neurons	
  è	
  few	
  visible	
  per	
  image	
  
• 	
  	
  >100k	
  samples	
  imaged	
  
– 	
  aligned	
  to	
  common	
  template	
  
• 	
  Janelia	
  Worksta&on	
  
– 	
  Tools	
  to	
  manage,	
  browse	
  and	
  annotate	
  
– 	
  View	
  full	
  spa&al	
  resolu&on	
  3D	
  stack	
  in	
  2s	
  
S Murphy, T Safford, K Rokicki, C Bruns, Y Yu, L Foster, E
Trautman, D Olbris, P Davies
Alignment	
  Board	
  for	
  Separated	
  Neurons	
  
Saul’s
Presentation
Rest of Scientific
Computing Activities
Key Benefits:
•  Realizes the value of sample alignment
•  Reveals structure/function relationships
•  Gets max utility from collection of sparse samples
Credits:
•  Inspiration: A Nern, T Wolff, Y Aso
•  Conception: S Murphy
•  Design and Implementation: L Foster
•  Advanced 3D Consultant: C Bruns
Tools	
  for	
  EM	
  Connectomics	
  
• 	
  Two	
  EM	
  Projects	
  on	
  Fly	
  Brain	
  
– 	
  isotropic	
  FIB	
  SEM	
  	
  (Harald	
  Hass,	
  Steve	
  Plaza)	
  
– 	
  anisotropic	
  TEM	
  	
  	
  	
  (Davi	
  Bock,	
  Khaled	
  Khairy)	
  
• 	
  Mul&ple	
  tools	
  for	
  tracing	
  and	
  proofreading	
  
• 	
  Goal:	
  	
  mix	
  and	
  match	
  computa&on	
  and	
  GUI	
  tools	
  
• 	
  Key:	
  	
  	
  	
  shared	
  services-­‐based	
  storage	
  
• 	
  Full	
  rendered	
  CNS	
  40-­‐100TB	
  
 
DVID:	
  Services-­‐Based	
  SoXware	
  for	
  EM	
  
MongDB
Scality
Cloud
DVID: Bill Katz (FlyEM) Mongo/Scality: Tom Dolafi, Greg Pinero
DVID	
  Datatype	
  Examples	
  
Image tiles
(for low-latency
image browsing)
3D body volume
(validation / modifying
neuron shape)
Regions of interest
(define important
parts of the dataset)
Graph
(define overlap
between neurons)
L1
T4
Mi1Tm3
Segmentation
(validation/
modifying
segmentation)
DVID: Bill Katz (FlyEM)
Ques&ons	
  

More Related Content

Similar to CASC-Fall-2015-Kravitz

Dev nexus 2017
Dev nexus 2017Dev nexus 2017
Dev nexus 2017Roy Russo
 
Working with Instrument Data (GlobusWorld Tour - UMich)
Working with Instrument Data (GlobusWorld Tour - UMich)Working with Instrument Data (GlobusWorld Tour - UMich)
Working with Instrument Data (GlobusWorld Tour - UMich)Globus
 
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DCWhat's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DCLucidworks (Archived)
 
Prepare for the Worst: Reliable Data Protection with Oracle RMAN and Oracle D...
Prepare for the Worst: Reliable Data Protection with Oracle RMAN and Oracle D...Prepare for the Worst: Reliable Data Protection with Oracle RMAN and Oracle D...
Prepare for the Worst: Reliable Data Protection with Oracle RMAN and Oracle D...Szymon Skorupinski
 
Devnexus 2018
Devnexus 2018Devnexus 2018
Devnexus 2018Roy Russo
 
Real World Performance - Data Warehouses
Real World Performance - Data WarehousesReal World Performance - Data Warehouses
Real World Performance - Data WarehousesConnor McDonald
 
Atom: A cloud native deep learning platform at Supremind
Atom: A cloud native deep learning platform at SupremindAtom: A cloud native deep learning platform at Supremind
Atom: A cloud native deep learning platform at SupremindAlluxio, Inc.
 
Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...
Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...
Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...Databricks
 
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Community
 
Apache con 2020 use cases and optimizations of iotdb
Apache con 2020 use cases and optimizations of iotdbApache con 2020 use cases and optimizations of iotdb
Apache con 2020 use cases and optimizations of iotdbZhangZhengming
 
Real time analytics using Hadoop and Elasticsearch
Real time analytics using Hadoop and ElasticsearchReal time analytics using Hadoop and Elasticsearch
Real time analytics using Hadoop and ElasticsearchAbhishek Andhavarapu
 
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...Databricks
 
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah Bard
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah BardUsing Containers and HPC to Solve the Mysteries of the Universe by Deborah Bard
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah BardDocker, Inc.
 
Notes from 2016 bay area deep learning school
Notes from 2016 bay area deep learning school Notes from 2016 bay area deep learning school
Notes from 2016 bay area deep learning school Niketan Pansare
 
Talavant Data Lake Analytics
Talavant Data Lake Analytics Talavant Data Lake Analytics
Talavant Data Lake Analytics Sean Forgatch
 
Machine Learning for Data Extraction
Machine Learning for Data ExtractionMachine Learning for Data Extraction
Machine Learning for Data ExtractionDasha Herrmannova
 
Lucene BootCamp
Lucene BootCampLucene BootCamp
Lucene BootCampGokulD
 
Tableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
Tableau on Hadoop Meet Up: Advancing from Extracts to Live ConnectTableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
Tableau on Hadoop Meet Up: Advancing from Extracts to Live ConnectRemy Rosenbaum
 
Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...
Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...
Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...Lucidworks
 
Deep Learning Intro - Georgia Tech - CSE6242 - March 2015
Deep Learning Intro - Georgia Tech - CSE6242 - March 2015Deep Learning Intro - Georgia Tech - CSE6242 - March 2015
Deep Learning Intro - Georgia Tech - CSE6242 - March 2015Josh Patterson
 

Similar to CASC-Fall-2015-Kravitz (20)

Dev nexus 2017
Dev nexus 2017Dev nexus 2017
Dev nexus 2017
 
Working with Instrument Data (GlobusWorld Tour - UMich)
Working with Instrument Data (GlobusWorld Tour - UMich)Working with Instrument Data (GlobusWorld Tour - UMich)
Working with Instrument Data (GlobusWorld Tour - UMich)
 
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DCWhat's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
 
Prepare for the Worst: Reliable Data Protection with Oracle RMAN and Oracle D...
Prepare for the Worst: Reliable Data Protection with Oracle RMAN and Oracle D...Prepare for the Worst: Reliable Data Protection with Oracle RMAN and Oracle D...
Prepare for the Worst: Reliable Data Protection with Oracle RMAN and Oracle D...
 
Devnexus 2018
Devnexus 2018Devnexus 2018
Devnexus 2018
 
Real World Performance - Data Warehouses
Real World Performance - Data WarehousesReal World Performance - Data Warehouses
Real World Performance - Data Warehouses
 
Atom: A cloud native deep learning platform at Supremind
Atom: A cloud native deep learning platform at SupremindAtom: A cloud native deep learning platform at Supremind
Atom: A cloud native deep learning platform at Supremind
 
Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...
Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...
Deep Learning Pipelines for High Energy Physics using Apache Spark with Distr...
 
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
Ceph Day Melbourne - Scale and performance: Servicing the Fabric and the Work...
 
Apache con 2020 use cases and optimizations of iotdb
Apache con 2020 use cases and optimizations of iotdbApache con 2020 use cases and optimizations of iotdb
Apache con 2020 use cases and optimizations of iotdb
 
Real time analytics using Hadoop and Elasticsearch
Real time analytics using Hadoop and ElasticsearchReal time analytics using Hadoop and Elasticsearch
Real time analytics using Hadoop and Elasticsearch
 
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
Deep Learning on Apache Spark at CERN’s Large Hadron Collider with Intel Tech...
 
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah Bard
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah BardUsing Containers and HPC to Solve the Mysteries of the Universe by Deborah Bard
Using Containers and HPC to Solve the Mysteries of the Universe by Deborah Bard
 
Notes from 2016 bay area deep learning school
Notes from 2016 bay area deep learning school Notes from 2016 bay area deep learning school
Notes from 2016 bay area deep learning school
 
Talavant Data Lake Analytics
Talavant Data Lake Analytics Talavant Data Lake Analytics
Talavant Data Lake Analytics
 
Machine Learning for Data Extraction
Machine Learning for Data ExtractionMachine Learning for Data Extraction
Machine Learning for Data Extraction
 
Lucene BootCamp
Lucene BootCampLucene BootCamp
Lucene BootCamp
 
Tableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
Tableau on Hadoop Meet Up: Advancing from Extracts to Live ConnectTableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
Tableau on Hadoop Meet Up: Advancing from Extracts to Live Connect
 
Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...
Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...
Scaling SolrCloud to a Large Number of Collections: Presented by Shalin Shekh...
 
Deep Learning Intro - Georgia Tech - CSE6242 - March 2015
Deep Learning Intro - Georgia Tech - CSE6242 - March 2015Deep Learning Intro - Georgia Tech - CSE6242 - March 2015
Deep Learning Intro - Georgia Tech - CSE6242 - March 2015
 

CASC-Fall-2015-Kravitz

  • 1. Scien&fic  Compu&ng  Services  at  Janelia     Saul  A.  Kravitz,  PhD   HHMI  Janelia  Research  Campus  
  • 2. HHMI  Janelia  Research  Campus   •     Opened  2007,  100%  Internally  Funded   •     Fundamental  Neuroscience  and  Imaging   •     Research  Staff  470   –   50+  Lab  Groups  with  1-­‐10  researchers   –   125  Shared  Resource  Staff   •     Scien&fic  IT:          4.5  FTE   •     Scien&fic  SoXware:    27  FTE  
  • 3. Janelia  Differen&ators   •     Rich  Shared  Resources  (core  services)   •     Laboratory  Support  Services                                  -­‐  Reed  George   •     Scien&fic  Compu&ng  SoXware  and  IT    -­‐  Saul  Kravitz   •     Team  Projects   •     Dedicated  staff  +  matrixed  soXware  staff   •     Main  drivers  for  data/compute  growth   •     Leverage  lab  technologies   •     Feed  lab  research  
  • 6. Scien&fic  IT  Principles   • Be`er  infrastructure  than  any  non-­‐profit  life   sciences  research  ins&tu&on  in  the  world   • None  of  our  scien&sts  should  say:   –   “I  wish  I  was  back  at  ______”   • We  don’t  lose  data   • Minimize  data  movement   • 365x24x7  Availability  (-­‐  8  hrs/year  down&me)   • Compute  and  storage  cost  recovery  
  • 7. Computa&onal  Environment   •     Fiber-­‐connected  Data  Centers:    Janelia  and  HQ   •  High  Throughput  Compu&ng   –  3  million  jobs  scheduled  per  month   –  Heterogenous  job  mix,  many  interac&ve  jobs   •     Hundreds  of  Users   –   Wide  range  of  sophis&ca&on   •  Trends   –   Spark,  MongoDB,  Scality  Object  Store    
  • 8. Computa&onal  Resources   •  5120  Core  Compute  Cluster  (~50%  u&liza&on)   –  4096  cores  2.7  GHz  Intel  Sandy  Bridge  (E5-­‐2680)   –  1024  cores  2.3  GHz  Intel  Haswell  (E5-­‐2698)   –  8  GB  memory/core   –  10  GB/s  interconnect   –  Univa  Grid  Engine  Scheduler   •  Small  GPU  Cluster  (low  u&liza&on)   •  200-­‐core  VMWare  Cluster   •  Dedicated  Applica&on  Servers  (including  databases)  
  • 9. Storage:  Match  storage  class  to  use  case    •  General  purpose,  remote  backup  -­‐  Isilon  -­‐  (2.5  PB)   –  Home  directories,  most  lab  files   –  Cluster  accessible   •  High  performance  scratch  space  -­‐  DDN  GPFS      (1  PB)   –  Designed  for  larger  (>40MB)  files   –  Cluster  accessible   •  Image  Data  Storage  -­‐  Scality  Object  Store  (1.4  PB)   –  Replicated  (offsite)  and  Unreplicated  Rings  (0.7PB  each)   –  Cluster  accessible  via  API   •  Cold  Storage,  no  backup  -­‐  Isilon  (1  PB)   –  low  performance   –  NOT  cluster  accessible      
  • 10.
  • 11. Scien&fic  Compu&ng  SoXware   Sean Murphy Todd Safford Rob Svirskas •  Perl, Python •  Web-based GUI •  Image Processing •  Enterprise Java •  Relational and NoSQL DB •  3D Graphics •  MatLab, C++, Java, Python •  Neuroscience domain •  Algorithms
  • 12. Scien&fic  Compu&ng  SoXware   Don’t  be  fooled!   Saul’s Presentation Rest of Scientific Computing Activities
  • 13. SoXware-­‐Intensive  Team  Projects   • Mouse  Light   – Mouse  Projec&ons  and  Connec&ons   • Fly  Light   – Op&cal  Mapping  of  the  Fly  Nervous  System   • FlyEM   – Electron  Microscopic  Reconstruc&ng  of  the   Drosophila  Nervous  System  
  • 14. Mouse  Light   MouseLight Project (JRC)Allen Mouse Brain Connectivity Project (AIBS) Jayaram Chandrashekar (MouseLight)
  • 15. Mouse  Light   •  3D Mosaic Imaging of the entire mouse brain •  Custom built 2-photon microscope (N. Clack) •  ~20,000 tiles (0.3um x 0.3um x 1.0um) •  ~15TB/Chan •  Stitched/rendered into seamless 3D volume Jayaram Chandrashekar (MouseLight)
  • 16.
  • 17. Layer 2/3 IT-type Layer 5a IT-type Jayaram Chandrashekar (MouseLight)
  • 18. Tools  for  Confocal  Imaging  of  Fly  Brains   •     Confocal  imaging  of  Gal4  and  Split-­‐Gal4  Lines     –   sparse  labeling  of  neurons  è  few  visible  per  image   •     >100k  samples  imaged   –   aligned  to  common  template   •   Janelia  Worksta&on   –   Tools  to  manage,  browse  and  annotate   –   View  full  spa&al  resolu&on  3D  stack  in  2s   S Murphy, T Safford, K Rokicki, C Bruns, Y Yu, L Foster, E Trautman, D Olbris, P Davies
  • 19. Alignment  Board  for  Separated  Neurons   Saul’s Presentation Rest of Scientific Computing Activities Key Benefits: •  Realizes the value of sample alignment •  Reveals structure/function relationships •  Gets max utility from collection of sparse samples Credits: •  Inspiration: A Nern, T Wolff, Y Aso •  Conception: S Murphy •  Design and Implementation: L Foster •  Advanced 3D Consultant: C Bruns
  • 20.
  • 21.
  • 22. Tools  for  EM  Connectomics   •   Two  EM  Projects  on  Fly  Brain   –   isotropic  FIB  SEM    (Harald  Hass,  Steve  Plaza)   –   anisotropic  TEM        (Davi  Bock,  Khaled  Khairy)   •   Mul&ple  tools  for  tracing  and  proofreading   •   Goal:    mix  and  match  computa&on  and  GUI  tools   •   Key:        shared  services-­‐based  storage   •   Full  rendered  CNS  40-­‐100TB  
  • 23.   DVID:  Services-­‐Based  SoXware  for  EM   MongDB Scality Cloud DVID: Bill Katz (FlyEM) Mongo/Scality: Tom Dolafi, Greg Pinero
  • 24. DVID  Datatype  Examples   Image tiles (for low-latency image browsing) 3D body volume (validation / modifying neuron shape) Regions of interest (define important parts of the dataset) Graph (define overlap between neurons) L1 T4 Mi1Tm3 Segmentation (validation/ modifying segmentation) DVID: Bill Katz (FlyEM)