SlideShare a Scribd company logo
1 of 28
Scientific Data Processing in
Amazon EC2
    Amazon Cloud for the Enterprise Event
    London, Tuesday November 3rd, 2009

    Paul Parsons,
    CTO, The Server Labs




 November 09                                © The Server Labs S.L. 2009
About the Presenter




               Paul Parsons

                   Founder & CTO of The Server Labs
                   Over 20 years experience in many fields of IT




11/5/2009                            © The Server Labs S.L. 2009    2
About The Server Labs




     100% privately funded limited company created in 2004 in Spain
       and in 2007 in the UK.

                                       • IT architects

                                       • Extensive experience

                                       • Specialised, niche consultancy

                                       • Hands-on

                                       • Agile execution

                                       • Passion for technology



11/5/2009                       © The Server Labs S.L. 2009               3
EC2 Feasibility Study



               Feasibility Study to investigate running part of
                ESA’s Gaia data processing in the cloud

               Two main objectives

                   Evaluate the Tecnical Feasibility of using Amazon EC2
                    to run scientific data processing applications for Gaia

                   Evaluate the Financial Viability of using pay on
                    demand compute power vs. traditional in-house data
                    processing




11/5/2009                            © The Server Labs S.L. 2009              4
A Stereoscopic Census of our Galaxy




      (based on slides from Jos de Bruijne and William O’Mullane)




11/5/2009                                 © The Server Labs S.L. 2009   5
The Gaia Mission



               Primary goal of the Gaia mission is to create an
                astrometric catalogue of 1 billion stars (approx
                1% of our Galaxy) with micro arc second
                precision.

               Gaia satellite to be launched in 2011.

               Observations done until 2017.

               Catalogue ready around 2019.




11/5/2009                         © The Server Labs S.L. 2009      6
The Gaia Mission




    Credit: Images ESA


11/5/2009                            © The Server Labs S.L. 2009   7
Data Downlink



               Using Cebreros, Spain (35M)
                   3-8Mb/s downlink
                     • depends on encoding
                     • which depends on weather !                     Hopefully Spain
                   ~ 40GB/day -> ~100TB                             remains sunny !
               Occasionally New Norcia, Australia
                   during Galactic plane scans
                   data accumulated onboard downlinked later
               Data is compressed encoded and requires a lot
                of processing (~10^21 FLOP)




11/5/2009                              © The Server Labs S.L. 2009                      9
If it took 1 millisecond to process one image,
             the processing time for just one pass through the
                                      data
                (on a single processor) would take 30 years.

            Obviously the adopted solution is much faster …
                       …. distributed/parallel processing.




11/5/2009                        © The Server Labs S.L. 2009     10
AGIS: Astrometric Global Iterative
            Solution




                                                                   Scan width: 0.7




                                             1. Objects are matched in successive scans
                                             2. Attitude and calibrations are updated
                                             3. Objects positions etc. are solved
                                             4. Higher-order terms are solved
                                             5. More scans are added
                                             6. Whole system is iterated

11/5/2009                © The Server Labs S.L. 2009                                 11
AGIS Architecture: grid based


        Calibration    Global      Attitude              Source


    Datatrains drive through AGIS Database passing observations to algorithms.
    There can be as many Datatrains in parallel as we wish




                                                                                AstroElementary
        Elementary Takers

                                Optimised AGIS                      Data Access Layer
Algorithm does not
                                   Database
access data directly

11/5/2009                             © The Server Labs S.L. 2009                             12
AGIS Architecture - detailed

Convergence
  Monitor                                                  Elementary
                                                                                  Source Collector
                       AstroElementary                      DataTrain


                                                                                 Attitucde Collector

                       Object Factory
RunManager                                                                      Calibration Collector
                                            Request
                                            AstroElementaries
                                            between a range
                                            (x,y)                                 Global Collector
                         GaiaTable



                           Store




                                                Attitude Update         Global Update
              Oracle     AGIS DB                     Server                Server




 11/5/2009                               © The Server Labs S.L. 2009                                   13
Scheduling



               Very simple ..
                   Keep all machines busy all the time!
                     • Busy = CPU ~90%
                   Post jobs on whiteboard
       Trains/Workers Mark Jobs – and do them
       Mark finished – repeat until done

                 Previous attempt had much more general
     Job 1       scheduling
     Job 2       It was also ~1000 times slower.
     Job 3
       …
     Job N


11/5/2009                            © The Server Labs S.L. 2009   14
AGIS is not 24x7


           Iterative processing – 6 month Data Reduction Cycles
           At current estimates AGIS will run 2 weeks every 6 months
           Amount of data increases over the 6 year mission


                                                    AGIS Peak Processing (Hours)

                          2500                                                                Cap Ex
                          2000

                          1500
                  Hours
                          1000
                                                                                               AGIS 6 monthly processing
                           500

                                  0
                               v-11      v-1
                                             2
                                                  v-1
                                                      3
                                                           v-1
                                                               4
                                                                    v-1
                                                                        5
                                                                             v-1
                                                                                 6
                                                                                      v-1
                                                                                          7
                          No          No       No       No       No       No       No
                                                            Date




11/5/2009                                                       © The Server Labs S.L. 2009                                15
The Study: Running AGIS in Amazon
                    EC2


               Technical Feasibility:
                   Can AGIS run in the cloud?
                   What are the restrictions?
                   What modifications do we have to make?


               Financial Viability
                   What would be the cost of using EC2 for AGIS?
                   Can we do a hybrid solution using a local data centre
                    followed by a mix of local/EC2?




11/5/2009                             © The Server Labs S.L. 2009           16
Why we chose Amazon



               64 bit EC2 images
                   Large, Extra Large and High CPU Large


               Oracle ASM Image based on Oracle Database
                11g Release 1 Enterprise Edition - 64 Bit (Large
                instance) -ami-7ecb2f17

               AGIS Self configuring Image based on Ubuntu
                8.04 LTS Hardy Server 64-Bit (Large, Extra
                Large and High CPU Large Instances) - ami-
                e257b08b



11/5/2009                           © The Server Labs S.L. 2009    17
Architecture in the Cloud
   1x Large Instance
   AGIS AMI
   Elastic IP

                                                                        Elementary
      Convergence                         AstroElementary               DataTrain       Source Collector

        Monitor
                                                                                       Attitucde Collector

                                          Object Factory

                                                                Request               Calibration Collector

     RunManager
                                                                AstroElementari
                                                                es between a
                                            GaiaTable           range (x,y)             Global Collector




                                              Store
                                                                             Data Trains

                                           <n> x Extra Large or High CPU Large instances
                  1x Large instance
                                           AGIS AMI
                  Oracle AMI
                  Elastic IP


                  Oracle                                     Attitude Update
                                AGIS DB                           Server
                                                                                     3 x Extra Large
                                                                                     instances
                                                                                     AGIS AMI

11/5/2009                                             © The Server Labs S.L. 2009                             18
Oracle Image



               EC2 Large instances (m1.large & m1.xlarge)
               Oracle Enterprise Edition 11g 64 bit (11.0.6)
               Oracle ASM
               Elastic Block Storage
                     5 x 100GB disks /dev/sdg - /dev/sdk

                                                         /dev/sdg

                            ORACLE       ASM             /dev/sdh
            /   /dev/sda1
                                         Disk
                                                          /dev/sdi
            /mnt /dev/sdb   AGIS DB
                                         Group
                                         (EBS)            /dev/sdj

                                                         /dev/sdk




11/5/2009                              © The Server Labs S.L. 2009   19
AGIS Image



               Instances (m1.large and c1.xlarge).
               Java version 1.6.0_13
                   Java(TM) SE Runtime Environment (build 1.6.0_13-
                    b03)
                   Java HotSpot(TM) 64-Bit Server VM (build 11.3-b02,
                    mixed mode)
               Apache Tomcat 6.0.14
               AGIS software.
               Creation of agis user acoount
               rc.local script modified to run the AGIS process
                   Self configuring using user-data




11/5/2009                            © The Server Labs S.L. 2009         20
Phase 2



               Currently running a second feasibility study

                   New data set, 60 million primary stars (1/3 final
                    primaries). Approx 1.5 TB
                   Idea is to see how scalable the Gaia DataTrain grid is.
                    Try and scale to 1000 8CPU EC2 instances
                   Compare S3 performance with Oracle
                   Using Rightscale technology




11/5/2009                             © The Server Labs S.L. 2009             21
Lessons Learned


               Creating new images takes a long time so make sure you
                get it right 

               Oracle is very fussy about it’s IP address, hence the
                Elastic IP
                    Oracle instance hostname changed to be the public DNS
                     name

               Some work still needed on the startup script to make sure
                the ASM boots first time
                    Fixed with new Oracle AMIs

               The Attitude Servers took a long time to start up (20 mins)
                    This was due to a race condition caused by spin locks in the
                     type of java Thread Pool we were using.


11/5/2009                               © The Server Labs S.L. 2009                 22
Timescales and effort


               Took a team of 2 less than 20 man days to get running
                (Parsons,Olias).

               Just 4 lines of code changed!




11/5/2009                            © The Server Labs S.L. 2009        23
Cost effectiveness of E2C


             AGIS runs intermittently with growing Data volume.
             Estimate 2015 ~1.1MEuro (machine) + 1Meuro
              (energy bill less ?) = ~2Meuro
                In fact staggered spending for machines
                buy machines as data volume increase
             Estimate on Amazon at today prices with 10
              intermittent runs ~400Keuro
                Possibility to use more nodes and finish faster !
             Reckon you still need in house machine to avoid
              wasting time testing on E2C
             Old nut, Vendor lock-in ? Need standards



11/5/2009                          © The Server Labs S.L. 2009       24
Conclusions



               AGIS can be run in the cloud!

               Running with 50 Data train nodes gave us new
                scalability problems we hadn’t seen before!

               The Economics work out:
                   To do the same amount of processing in the same
                    time, EC2 works out cheaper for AGIS!
                   With the knowledge that EC2 is an option we can
                    delay buying more machines until the middle of the
                    mission (2014) and decide then.




11/5/2009                            © The Server Labs S.L. 2009         25
Demo




               Let’s see it in action!




11/5/2009                           © The Server Labs S.L. 2009   26
Thank you for listening




            Thank you for listening
            http://blog.theserverlabs.com
            http://www.theserverlabs.com

            mailto:pparsons@theserverlabs.com


11/5/2009                   © The Server Labs S.L. 2009   27
Contact us!

            Dolores Saiz, CEO
            Paul Parsons, CTO
            Website, e-mail
                       http://www.theserverlabs.com
                       dsaiz@theserverlabs.com
                       pparsons@theserverlabs.com
                       dgarcia@theserverlabs.com
    Spain                      UK                                         Germany
    The Server Labs S.L.       The Server Labs Ltd.                       Trianon, Mainzer Landstraße 16
    C/Pinar, 5                 Aston Court, Kingsmead Business Park       60325 Frankfurt
    28006 Madrid, Spain        Frederick Place High Wycombe               Tel: (+49) (0) 69 971 68 428
    Tel: (+34) 91 745 68 77    HP11 1LA
                               Tel: (+44) 20 8133 1620




11/5/2009                                   © The Server Labs S.L. 2009                                    28
ESA Copyright Notice

                  This presentation contains images and videos which have been released publicly from ESA.
                         You may use ESA images or videos for educational or informational purposes.
                 The publicly released ESA images may be reproduced without fee, on the following conditions:

                                              * Credit ESA as the source of the images:
                         Examples: Photo: ESA; Photo: ESA/Cluster; Image: ESA/NASA - SOHO/LASCO
    * ESA images may not be used to state or imply the endorsement by ESA or any ESA employee of a commercial product,
                               process or service, or used in any other manner that might mislead.
    * If an image includes an identifiable person, using that image for commercial purposes may infringe that person‘s right of
                            privacy, and separate permission should be obtained from the individual.

    If these images are to be used in advertising or any commercial promotion, layout and copy must be submitted to ESA
                                                  beforehand for approval to:

                                                       ESA Multimedia
                                                      multimedia@esa.int

Some images contained in this presentation have come from other sources, and this is indicated in the Copyright notice. For re-
                                  use of non-ESA images contact the designated authority.

                                                      Use of ESA videos

 The use of ESA video images in streaming and downloadable format is limited to direct viewing and/or file storage on a single
   computer per stream and/or download. Forwarding of files or streams to other computers, or use on any non-ESA Web is
                             prohibited. For the authorisation of any such use, please contact:

                                                       ESA Multimedia
                                                      multimedia@esa.int

       Seminar, IAC, 30th April 2008                 Dr. Ralf Kohley, European Space Astronomy Centre            29
 11/5/2009                                              © The Server Labs S.L. 2009                                          29

More Related Content

Viewers also liked

AdTech & MarTech Barometer - Q1 2015 Market Review
AdTech & MarTech Barometer - Q1 2015 Market ReviewAdTech & MarTech Barometer - Q1 2015 Market Review
AdTech & MarTech Barometer - Q1 2015 Market Reviewresultsig
 
Digital Advertising on AWS - Pop-up Loft Tel Aviv
Digital Advertising on AWS - Pop-up Loft Tel AvivDigital Advertising on AWS - Pop-up Loft Tel Aviv
Digital Advertising on AWS - Pop-up Loft Tel AvivAmazon Web Services
 
Real Time Bidding on AWS - Pop-up Loft Tel Aviv
Real Time Bidding on AWS - Pop-up Loft Tel AvivReal Time Bidding on AWS - Pop-up Loft Tel Aviv
Real Time Bidding on AWS - Pop-up Loft Tel AvivAmazon Web Services
 
Get the Most Out of Amazon EC2: A Deep Dive on Reserved, On-Demand, and Spot ...
Get the Most Out of Amazon EC2: A Deep Dive on Reserved, On-Demand, and Spot ...Get the Most Out of Amazon EC2: A Deep Dive on Reserved, On-Demand, and Spot ...
Get the Most Out of Amazon EC2: A Deep Dive on Reserved, On-Demand, and Spot ...Amazon Web Services
 
13 tips for a highly engaging feed
13 tips for a highly engaging feed13 tips for a highly engaging feed
13 tips for a highly engaging feedThierry Schellenbach
 
A summary of software architecture guide
A summary of software architecture guideA summary of software architecture guide
A summary of software architecture guideTriet Ho
 
The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1Hassy Veldstra
 
Google cluster architecture
Google cluster architecture Google cluster architecture
Google cluster architecture Abhijeet Desai
 
AWS Architecture Case Study: Real-Time Bidding
AWS Architecture Case Study: Real-Time BiddingAWS Architecture Case Study: Real-Time Bidding
AWS Architecture Case Study: Real-Time BiddingAmazon Web Services
 
Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...
Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...
Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...DATAVERSITY
 
Software Architecture: Introduction
Software Architecture: IntroductionSoftware Architecture: Introduction
Software Architecture: IntroductionHenry Muccini
 
Google Architecture - Breaking it Open
Google Architecture - Breaking it OpenGoogle Architecture - Breaking it Open
Google Architecture - Breaking it OpenHARMAN Services
 
Software Architecture Document Final
Software Architecture Document FinalSoftware Architecture Document Final
Software Architecture Document FinalAli Ahmed
 
Etsy Activity Feeds Architecture
Etsy Activity Feeds ArchitectureEtsy Activity Feeds Architecture
Etsy Activity Feeds ArchitectureDan McKinley
 
facebook architecture for 600M users
facebook architecture for 600M usersfacebook architecture for 600M users
facebook architecture for 600M usersJongyoon Choi
 

Viewers also liked (16)

AdTech & MarTech Barometer - Q1 2015 Market Review
AdTech & MarTech Barometer - Q1 2015 Market ReviewAdTech & MarTech Barometer - Q1 2015 Market Review
AdTech & MarTech Barometer - Q1 2015 Market Review
 
Digital Advertising on AWS - Pop-up Loft Tel Aviv
Digital Advertising on AWS - Pop-up Loft Tel AvivDigital Advertising on AWS - Pop-up Loft Tel Aviv
Digital Advertising on AWS - Pop-up Loft Tel Aviv
 
Real Time Bidding on AWS - Pop-up Loft Tel Aviv
Real Time Bidding on AWS - Pop-up Loft Tel AvivReal Time Bidding on AWS - Pop-up Loft Tel Aviv
Real Time Bidding on AWS - Pop-up Loft Tel Aviv
 
Get the Most Out of Amazon EC2: A Deep Dive on Reserved, On-Demand, and Spot ...
Get the Most Out of Amazon EC2: A Deep Dive on Reserved, On-Demand, and Spot ...Get the Most Out of Amazon EC2: A Deep Dive on Reserved, On-Demand, and Spot ...
Get the Most Out of Amazon EC2: A Deep Dive on Reserved, On-Demand, and Spot ...
 
13 tips for a highly engaging feed
13 tips for a highly engaging feed13 tips for a highly engaging feed
13 tips for a highly engaging feed
 
A summary of software architecture guide
A summary of software architecture guideA summary of software architecture guide
A summary of software architecture guide
 
The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1The Anatomy Of The Google Architecture Fina Lv1.1
The Anatomy Of The Google Architecture Fina Lv1.1
 
Google cluster architecture
Google cluster architecture Google cluster architecture
Google cluster architecture
 
AWS Architecture Case Study: Real-Time Bidding
AWS Architecture Case Study: Real-Time BiddingAWS Architecture Case Study: Real-Time Bidding
AWS Architecture Case Study: Real-Time Bidding
 
Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...
Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...
Webinar: Data Modeling and Shortcuts to Success in Scaling Time Series Applic...
 
Software Architecture: Introduction
Software Architecture: IntroductionSoftware Architecture: Introduction
Software Architecture: Introduction
 
Google Architecture - Breaking it Open
Google Architecture - Breaking it OpenGoogle Architecture - Breaking it Open
Google Architecture - Breaking it Open
 
Introduction to Amazon EC2
Introduction to Amazon EC2Introduction to Amazon EC2
Introduction to Amazon EC2
 
Software Architecture Document Final
Software Architecture Document FinalSoftware Architecture Document Final
Software Architecture Document Final
 
Etsy Activity Feeds Architecture
Etsy Activity Feeds ArchitectureEtsy Activity Feeds Architecture
Etsy Activity Feeds Architecture
 
facebook architecture for 600M users
facebook architecture for 600M usersfacebook architecture for 600M users
facebook architecture for 600M users
 

Similar to AWS Customer Presentation - The Server Labs

Virtual Appliances, Cloud Computing, and Reproducible Research
Virtual Appliances, Cloud Computing, and Reproducible ResearchVirtual Appliances, Cloud Computing, and Reproducible Research
Virtual Appliances, Cloud Computing, and Reproducible ResearchUniversity of Washington
 
Tricks And Tradeoffs Of Deploying My Sql Clusters In The Cloud
Tricks And Tradeoffs Of Deploying My Sql Clusters In The CloudTricks And Tradeoffs Of Deploying My Sql Clusters In The Cloud
Tricks And Tradeoffs Of Deploying My Sql Clusters In The CloudMySQLConference
 
Accelerating Analytics for the Future of Genomics
Accelerating Analytics for the Future of GenomicsAccelerating Analytics for the Future of Genomics
Accelerating Analytics for the Future of GenomicsAmazon Web Services
 
Using Storlets/Docker For Large Scale Image Processing
Using Storlets/Docker For Large Scale Image ProcessingUsing Storlets/Docker For Large Scale Image Processing
Using Storlets/Docker For Large Scale Image ProcessingKota Tsuyuzaki
 
The Synergy Between the Object Database, Graph Database, Cloud Computing and ...
The Synergy Between the Object Database, Graph Database, Cloud Computing and ...The Synergy Between the Object Database, Graph Database, Cloud Computing and ...
The Synergy Between the Object Database, Graph Database, Cloud Computing and ...InfiniteGraph
 
Good Data: Collaborative Analytics On Demand
Good Data: Collaborative Analytics On DemandGood Data: Collaborative Analytics On Demand
Good Data: Collaborative Analytics On Demandzsvoboda
 
Reducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationReducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationAmazon Web Services
 
Scality, Cloud Storage pour Zimbra
Scality, Cloud Storage pour ZimbraScality, Cloud Storage pour Zimbra
Scality, Cloud Storage pour ZimbraAntony Barroux
 
Implementing the Future of PostgreSQL Clustering with Tungsten
Implementing the Future of PostgreSQL Clustering with TungstenImplementing the Future of PostgreSQL Clustering with Tungsten
Implementing the Future of PostgreSQL Clustering with TungstenCommand Prompt., Inc
 
Running your Java EE applications in the Cloud
Running your Java EE applications in the CloudRunning your Java EE applications in the Cloud
Running your Java EE applications in the CloudArun Gupta
 
CMPE 297 Lecture: Building Infrastructure Clouds with OpenStack
CMPE 297 Lecture: Building Infrastructure Clouds with OpenStackCMPE 297 Lecture: Building Infrastructure Clouds with OpenStack
CMPE 297 Lecture: Building Infrastructure Clouds with OpenStackJoe Arnold
 
AWS Storage Tiering for Enterprise Workloads
AWS Storage Tiering for Enterprise WorkloadsAWS Storage Tiering for Enterprise Workloads
AWS Storage Tiering for Enterprise WorkloadsTom Laszewski
 
ENT101 Embracing the Cloud - AWS re: Invent 2012
ENT101 Embracing the Cloud - AWS re: Invent 2012ENT101 Embracing the Cloud - AWS re: Invent 2012
ENT101 Embracing the Cloud - AWS re: Invent 2012Amazon Web Services
 
Netflix Story of Embracing the Cloud
Netflix Story of Embracing the CloudNetflix Story of Embracing the Cloud
Netflix Story of Embracing the CloudKate Karniouchina
 
2012 re:Invent Netflix: embracing the cloud final
2012 re:Invent Netflix: embracing the cloud final2012 re:Invent Netflix: embracing the cloud final
2012 re:Invent Netflix: embracing the cloud finalYuryIzrailevsky
 
How to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesHow to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesVladimir Simek
 
A Few of My Favorite Things: The Magic that FME Brings to My Life
A Few of My Favorite Things:The Magic that FME Brings to My LifeA Few of My Favorite Things:The Magic that FME Brings to My Life
A Few of My Favorite Things: The Magic that FME Brings to My LifeSafe Software
 
Aspirus Enterprise Backup Assessment And Implementation Of Avamar
Aspirus Enterprise Backup Assessment And Implementation Of AvamarAspirus Enterprise Backup Assessment And Implementation Of Avamar
Aspirus Enterprise Backup Assessment And Implementation Of Avamartomwhalen
 

Similar to AWS Customer Presentation - The Server Labs (20)

Cloud Science
Cloud ScienceCloud Science
Cloud Science
 
Virtual Appliances, Cloud Computing, and Reproducible Research
Virtual Appliances, Cloud Computing, and Reproducible ResearchVirtual Appliances, Cloud Computing, and Reproducible Research
Virtual Appliances, Cloud Computing, and Reproducible Research
 
Tricks And Tradeoffs Of Deploying My Sql Clusters In The Cloud
Tricks And Tradeoffs Of Deploying My Sql Clusters In The CloudTricks And Tradeoffs Of Deploying My Sql Clusters In The Cloud
Tricks And Tradeoffs Of Deploying My Sql Clusters In The Cloud
 
Accelerating Analytics for the Future of Genomics
Accelerating Analytics for the Future of GenomicsAccelerating Analytics for the Future of Genomics
Accelerating Analytics for the Future of Genomics
 
Using Storlets/Docker For Large Scale Image Processing
Using Storlets/Docker For Large Scale Image ProcessingUsing Storlets/Docker For Large Scale Image Processing
Using Storlets/Docker For Large Scale Image Processing
 
The Synergy Between the Object Database, Graph Database, Cloud Computing and ...
The Synergy Between the Object Database, Graph Database, Cloud Computing and ...The Synergy Between the Object Database, Graph Database, Cloud Computing and ...
The Synergy Between the Object Database, Graph Database, Cloud Computing and ...
 
Good Data: Collaborative Analytics On Demand
Good Data: Collaborative Analytics On DemandGood Data: Collaborative Analytics On Demand
Good Data: Collaborative Analytics On Demand
 
Reducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard ConsolidationReducing Database Costs via Shard Consolidation
Reducing Database Costs via Shard Consolidation
 
Scality, Cloud Storage pour Zimbra
Scality, Cloud Storage pour ZimbraScality, Cloud Storage pour Zimbra
Scality, Cloud Storage pour Zimbra
 
Implementing the Future of PostgreSQL Clustering with Tungsten
Implementing the Future of PostgreSQL Clustering with TungstenImplementing the Future of PostgreSQL Clustering with Tungsten
Implementing the Future of PostgreSQL Clustering with Tungsten
 
Running your Java EE applications in the Cloud
Running your Java EE applications in the CloudRunning your Java EE applications in the Cloud
Running your Java EE applications in the Cloud
 
EMC config Hadoop
EMC config HadoopEMC config Hadoop
EMC config Hadoop
 
CMPE 297 Lecture: Building Infrastructure Clouds with OpenStack
CMPE 297 Lecture: Building Infrastructure Clouds with OpenStackCMPE 297 Lecture: Building Infrastructure Clouds with OpenStack
CMPE 297 Lecture: Building Infrastructure Clouds with OpenStack
 
AWS Storage Tiering for Enterprise Workloads
AWS Storage Tiering for Enterprise WorkloadsAWS Storage Tiering for Enterprise Workloads
AWS Storage Tiering for Enterprise Workloads
 
ENT101 Embracing the Cloud - AWS re: Invent 2012
ENT101 Embracing the Cloud - AWS re: Invent 2012ENT101 Embracing the Cloud - AWS re: Invent 2012
ENT101 Embracing the Cloud - AWS re: Invent 2012
 
Netflix Story of Embracing the Cloud
Netflix Story of Embracing the CloudNetflix Story of Embracing the Cloud
Netflix Story of Embracing the Cloud
 
2012 re:Invent Netflix: embracing the cloud final
2012 re:Invent Netflix: embracing the cloud final2012 re:Invent Netflix: embracing the cloud final
2012 re:Invent Netflix: embracing the cloud final
 
How to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesHow to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutes
 
A Few of My Favorite Things: The Magic that FME Brings to My Life
A Few of My Favorite Things:The Magic that FME Brings to My LifeA Few of My Favorite Things:The Magic that FME Brings to My Life
A Few of My Favorite Things: The Magic that FME Brings to My Life
 
Aspirus Enterprise Backup Assessment And Implementation Of Avamar
Aspirus Enterprise Backup Assessment And Implementation Of AvamarAspirus Enterprise Backup Assessment And Implementation Of Avamar
Aspirus Enterprise Backup Assessment And Implementation Of Avamar
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Recently uploaded

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Recently uploaded (20)

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

AWS Customer Presentation - The Server Labs

  • 1. Scientific Data Processing in Amazon EC2 Amazon Cloud for the Enterprise Event London, Tuesday November 3rd, 2009 Paul Parsons, CTO, The Server Labs November 09 © The Server Labs S.L. 2009
  • 2. About the Presenter  Paul Parsons  Founder & CTO of The Server Labs  Over 20 years experience in many fields of IT 11/5/2009 © The Server Labs S.L. 2009 2
  • 3. About The Server Labs 100% privately funded limited company created in 2004 in Spain and in 2007 in the UK. • IT architects • Extensive experience • Specialised, niche consultancy • Hands-on • Agile execution • Passion for technology 11/5/2009 © The Server Labs S.L. 2009 3
  • 4. EC2 Feasibility Study  Feasibility Study to investigate running part of ESA’s Gaia data processing in the cloud  Two main objectives  Evaluate the Tecnical Feasibility of using Amazon EC2 to run scientific data processing applications for Gaia  Evaluate the Financial Viability of using pay on demand compute power vs. traditional in-house data processing 11/5/2009 © The Server Labs S.L. 2009 4
  • 5. A Stereoscopic Census of our Galaxy (based on slides from Jos de Bruijne and William O’Mullane) 11/5/2009 © The Server Labs S.L. 2009 5
  • 6. The Gaia Mission  Primary goal of the Gaia mission is to create an astrometric catalogue of 1 billion stars (approx 1% of our Galaxy) with micro arc second precision.  Gaia satellite to be launched in 2011.  Observations done until 2017.  Catalogue ready around 2019. 11/5/2009 © The Server Labs S.L. 2009 6
  • 7. The Gaia Mission Credit: Images ESA 11/5/2009 © The Server Labs S.L. 2009 7
  • 8. Data Downlink  Using Cebreros, Spain (35M)  3-8Mb/s downlink • depends on encoding • which depends on weather ! Hopefully Spain  ~ 40GB/day -> ~100TB remains sunny !  Occasionally New Norcia, Australia  during Galactic plane scans  data accumulated onboard downlinked later  Data is compressed encoded and requires a lot of processing (~10^21 FLOP) 11/5/2009 © The Server Labs S.L. 2009 9
  • 9. If it took 1 millisecond to process one image, the processing time for just one pass through the data (on a single processor) would take 30 years. Obviously the adopted solution is much faster … …. distributed/parallel processing. 11/5/2009 © The Server Labs S.L. 2009 10
  • 10. AGIS: Astrometric Global Iterative Solution Scan width: 0.7 1. Objects are matched in successive scans 2. Attitude and calibrations are updated 3. Objects positions etc. are solved 4. Higher-order terms are solved 5. More scans are added 6. Whole system is iterated 11/5/2009 © The Server Labs S.L. 2009 11
  • 11. AGIS Architecture: grid based Calibration Global Attitude Source Datatrains drive through AGIS Database passing observations to algorithms. There can be as many Datatrains in parallel as we wish AstroElementary Elementary Takers Optimised AGIS Data Access Layer Algorithm does not Database access data directly 11/5/2009 © The Server Labs S.L. 2009 12
  • 12. AGIS Architecture - detailed Convergence Monitor Elementary Source Collector AstroElementary DataTrain Attitucde Collector Object Factory RunManager Calibration Collector Request AstroElementaries between a range (x,y) Global Collector GaiaTable Store Attitude Update Global Update Oracle AGIS DB Server Server 11/5/2009 © The Server Labs S.L. 2009 13
  • 13. Scheduling  Very simple ..  Keep all machines busy all the time! • Busy = CPU ~90%  Post jobs on whiteboard Trains/Workers Mark Jobs – and do them Mark finished – repeat until done Previous attempt had much more general Job 1 scheduling Job 2 It was also ~1000 times slower. Job 3 … Job N 11/5/2009 © The Server Labs S.L. 2009 14
  • 14. AGIS is not 24x7  Iterative processing – 6 month Data Reduction Cycles  At current estimates AGIS will run 2 weeks every 6 months  Amount of data increases over the 6 year mission AGIS Peak Processing (Hours) 2500 Cap Ex 2000 1500 Hours 1000 AGIS 6 monthly processing 500 0 v-11 v-1 2 v-1 3 v-1 4 v-1 5 v-1 6 v-1 7 No No No No No No No Date 11/5/2009 © The Server Labs S.L. 2009 15
  • 15. The Study: Running AGIS in Amazon EC2  Technical Feasibility:  Can AGIS run in the cloud?  What are the restrictions?  What modifications do we have to make?  Financial Viability  What would be the cost of using EC2 for AGIS?  Can we do a hybrid solution using a local data centre followed by a mix of local/EC2? 11/5/2009 © The Server Labs S.L. 2009 16
  • 16. Why we chose Amazon  64 bit EC2 images  Large, Extra Large and High CPU Large  Oracle ASM Image based on Oracle Database 11g Release 1 Enterprise Edition - 64 Bit (Large instance) -ami-7ecb2f17  AGIS Self configuring Image based on Ubuntu 8.04 LTS Hardy Server 64-Bit (Large, Extra Large and High CPU Large Instances) - ami- e257b08b 11/5/2009 © The Server Labs S.L. 2009 17
  • 17. Architecture in the Cloud 1x Large Instance AGIS AMI Elastic IP Elementary Convergence AstroElementary DataTrain Source Collector Monitor Attitucde Collector Object Factory Request Calibration Collector RunManager AstroElementari es between a GaiaTable range (x,y) Global Collector Store Data Trains <n> x Extra Large or High CPU Large instances 1x Large instance AGIS AMI Oracle AMI Elastic IP Oracle Attitude Update AGIS DB Server 3 x Extra Large instances AGIS AMI 11/5/2009 © The Server Labs S.L. 2009 18
  • 18. Oracle Image  EC2 Large instances (m1.large & m1.xlarge)  Oracle Enterprise Edition 11g 64 bit (11.0.6)  Oracle ASM  Elastic Block Storage  5 x 100GB disks /dev/sdg - /dev/sdk /dev/sdg ORACLE ASM /dev/sdh / /dev/sda1 Disk /dev/sdi /mnt /dev/sdb AGIS DB Group (EBS) /dev/sdj /dev/sdk 11/5/2009 © The Server Labs S.L. 2009 19
  • 19. AGIS Image  Instances (m1.large and c1.xlarge).  Java version 1.6.0_13  Java(TM) SE Runtime Environment (build 1.6.0_13- b03)  Java HotSpot(TM) 64-Bit Server VM (build 11.3-b02, mixed mode)  Apache Tomcat 6.0.14  AGIS software.  Creation of agis user acoount  rc.local script modified to run the AGIS process  Self configuring using user-data 11/5/2009 © The Server Labs S.L. 2009 20
  • 20. Phase 2  Currently running a second feasibility study  New data set, 60 million primary stars (1/3 final primaries). Approx 1.5 TB  Idea is to see how scalable the Gaia DataTrain grid is. Try and scale to 1000 8CPU EC2 instances  Compare S3 performance with Oracle  Using Rightscale technology 11/5/2009 © The Server Labs S.L. 2009 21
  • 21. Lessons Learned  Creating new images takes a long time so make sure you get it right   Oracle is very fussy about it’s IP address, hence the Elastic IP  Oracle instance hostname changed to be the public DNS name  Some work still needed on the startup script to make sure the ASM boots first time  Fixed with new Oracle AMIs  The Attitude Servers took a long time to start up (20 mins)  This was due to a race condition caused by spin locks in the type of java Thread Pool we were using. 11/5/2009 © The Server Labs S.L. 2009 22
  • 22. Timescales and effort  Took a team of 2 less than 20 man days to get running (Parsons,Olias).  Just 4 lines of code changed! 11/5/2009 © The Server Labs S.L. 2009 23
  • 23. Cost effectiveness of E2C  AGIS runs intermittently with growing Data volume.  Estimate 2015 ~1.1MEuro (machine) + 1Meuro (energy bill less ?) = ~2Meuro  In fact staggered spending for machines  buy machines as data volume increase  Estimate on Amazon at today prices with 10 intermittent runs ~400Keuro  Possibility to use more nodes and finish faster !  Reckon you still need in house machine to avoid wasting time testing on E2C  Old nut, Vendor lock-in ? Need standards 11/5/2009 © The Server Labs S.L. 2009 24
  • 24. Conclusions  AGIS can be run in the cloud!  Running with 50 Data train nodes gave us new scalability problems we hadn’t seen before!  The Economics work out:  To do the same amount of processing in the same time, EC2 works out cheaper for AGIS!  With the knowledge that EC2 is an option we can delay buying more machines until the middle of the mission (2014) and decide then. 11/5/2009 © The Server Labs S.L. 2009 25
  • 25. Demo  Let’s see it in action! 11/5/2009 © The Server Labs S.L. 2009 26
  • 26. Thank you for listening Thank you for listening http://blog.theserverlabs.com http://www.theserverlabs.com mailto:pparsons@theserverlabs.com 11/5/2009 © The Server Labs S.L. 2009 27
  • 27. Contact us! Dolores Saiz, CEO Paul Parsons, CTO Website, e-mail http://www.theserverlabs.com dsaiz@theserverlabs.com pparsons@theserverlabs.com dgarcia@theserverlabs.com Spain UK Germany The Server Labs S.L. The Server Labs Ltd. Trianon, Mainzer Landstraße 16 C/Pinar, 5 Aston Court, Kingsmead Business Park 60325 Frankfurt 28006 Madrid, Spain Frederick Place High Wycombe Tel: (+49) (0) 69 971 68 428 Tel: (+34) 91 745 68 77 HP11 1LA Tel: (+44) 20 8133 1620 11/5/2009 © The Server Labs S.L. 2009 28
  • 28. ESA Copyright Notice This presentation contains images and videos which have been released publicly from ESA. You may use ESA images or videos for educational or informational purposes. The publicly released ESA images may be reproduced without fee, on the following conditions: * Credit ESA as the source of the images: Examples: Photo: ESA; Photo: ESA/Cluster; Image: ESA/NASA - SOHO/LASCO * ESA images may not be used to state or imply the endorsement by ESA or any ESA employee of a commercial product, process or service, or used in any other manner that might mislead. * If an image includes an identifiable person, using that image for commercial purposes may infringe that person‘s right of privacy, and separate permission should be obtained from the individual. If these images are to be used in advertising or any commercial promotion, layout and copy must be submitted to ESA beforehand for approval to: ESA Multimedia multimedia@esa.int Some images contained in this presentation have come from other sources, and this is indicated in the Copyright notice. For re- use of non-ESA images contact the designated authority. Use of ESA videos The use of ESA video images in streaming and downloadable format is limited to direct viewing and/or file storage on a single computer per stream and/or download. Forwarding of files or streams to other computers, or use on any non-ESA Web is prohibited. For the authorisation of any such use, please contact: ESA Multimedia multimedia@esa.int Seminar, IAC, 30th April 2008 Dr. Ralf Kohley, European Space Astronomy Centre 29 11/5/2009 © The Server Labs S.L. 2009 29