SlideShare a Scribd company logo
A Model for the Systems
                             Architecture of the Future



         Prof. Paul A. Strassmann
         George Mason University, December 5, 2005
                                                                          1
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Data-Centric Era; IBM Dominates




                Hundred Sources

                   1950-1980




                Months⇒Weeks


                                                                          2
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup-Centric Era; Microsoft, INTEL Dominate




                                                        Million Sources

               Hundred Sources

                   1950-1980                             1980-2010


                                                        Weeks⇒Days

                Months⇒Weeks


                                                                          3
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Network-Centric Era; Google and Cisco?
                                                                          +Multi-Media
                                                              +Text       Billions Sources
                        Data
                                                        Million Sources

               Hundred Sources

                   1950-1980                             1980-2010         2010-

                                                                          Days⇒Real-Time

                                                      Weeks⇒Days

                Months⇒Weeks


                                                                                             4
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Example of a Network-Centric System




                                                                          5
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Network-Centric Requirements (2010)

        •     Downtime (< 5 min/yr);
        •     Display (200 Billion ops/sec);
        •     Connectivity (> 1 Gigabyte/sec);
        •     Access (< 0.25 sec);
        •     Innovation (< 1 day);
        •     Security (> 8 sigma).



                                                                          6
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Performance (2005)


         •      Infrastructure = > 50% of spending;
         •      Security = ?;
         •      Integration = > 50% of applications;
         •      Network downtime = > 1 hour/year;
         •      Innovation = > 1 year.




                                                                          7
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Conclusion




         • Network-Centric systems cannot be
           built on Workgroup-Centric architecture.




                                                                          8
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Network-Centric Principles (Google)

1. Build & operate protected information
   network;
2. Offer universal connectivity for:
   – Collection, processing and storing of
      information;
   – Provide secured communications.
3. Maintain shared data models;
4. Require continued upgrading & innovation.


                                               9
Google Principle #1




Build & operate protected information network




                                          10
Standard Google Clusters Operate Net

                •         359 racks
                •         31,654 machines
                •         63,184 CPUs
                •         126,368 Ghz of processing power
                •         63,184 Gb of RAM
                •         2,527 Tb of Hard Drive space

         • Appx. 40 million searches/day

                                                                          11
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Clusters Have Identical Architecture



                                 Index
                                Servers

   Web             Web
  Switch          Servers



                                Document
                                 Servers


                                           12
Google Cluster Set-Up = Three Days




                                     13
Google Infrastructure: Key to Growth (1)
        • >200,000 custom-built commodity servers;
        •       Acting as one parallel supercomputer;
        • Fault tolerant hardware.
        • Storage capacity >5 petabytes; low response latency
          (0.2 sec); >80GB per server, distributed;
        • Indexed >8 billion web pages; Indexing is
          computationally complex (>500M * > 2B matrix)
        • Capital and operating costs at fraction of large scale
          commercial servers; traffic growth 20-30%/month; data
          centers (>12); in US, Europe and Asia.


                                                                          14
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Infrastructure: Key to Growth (2)
        •     >200,000 commodity Linux servers built to custom
              specifications; distributed cluster architecture; acting as one
              parallel supercomputer; scaleable;
        •     >50,000 requests/sec; fault tolerant (no single point of failure);
              diverse hardware; stripped version of Red Hat;
        •     Storage capacity >5 petabytes;
        •     >80GB per server;
        •     Indexed >8 billion web pages;
        •     Indexing is complex (500M x 2B matrix)
        •     Capital and operating costs at fraction of large scale commercial
              servers; traffic growth 20-30%/month; data centers (>12); in US,
              Europe and Asia.



                                                                          15
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Infrastructure: Key to Growth (3)
        •     >200,000 commodity Linux servers built to custom
              specifications; distributed cluster architecture; acting as one
              parallel supercomputer; scaleable;
        •     >50,000 requests/sec; fault tolerant (no single point of failure);
              diverse hardware; stripped version of Red Hat;
        •     Storage capacity >5 petabytes; low response latency (0.2 sec);
              >80GB per server, distributed;
        •     Indexed >8 billion web pages; Indexing is computationally
              complex (>500M * > 2B matrix)
        •     Capital and operating costs a fraction of commercial servers;
        •     Traffic growth 20-30%/month;
        •     Data centers (>20), in US, Europe and Asia.



                                                                          16
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Architecture for Reliability


  • Replication (3x+) for redundancy;
  • Replication for proximity and response;
  • Reliability with software and architecture,
    not with hardware.




                                                                          17
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Indexing for Response


         •      Dynamic indexing of 8B+ pages;
         •      Dynamic indexing of 1B+ images;
         •      Indexing of 1B+ messages;
         •      Index broken into “shards” and
                distributed across data centers.




                                                                          18
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Query Serving Infrastructure

     • Processing a single query may involve
       1000+ servers;
     • Index Servers access Index Shards;
     • Document Servers access Doc Shards;
     • Response times monitored to assure
       <0.25 sec latency.



                                                                          19
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google MapReduce System (1)

 •     Coordinates servers in real-time;
 •     Automates distribution of workload;
 •     Fault tolerance and service reconstitution;
 •     Systems-wide I/O cluster scheduling;
 •     Status and performance monitoring.




                                                                          20
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google MapReduce System (2)


 •     Coordination of servers in real-time;
 •     Automates distribution of workload;
 •     Fault tolerance & service reconstitution;
 •     Systems-wide cluster scheduling;
 •     Status and performance monitoring.




                                                                          21
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Principle #2




      Universal connectivity




                               22
Multi-Lingual Services




                                                                          23
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Search in Arabic Media




                                                                          24
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Video Searches




                                                                          25
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Base - Connecting Diverse Sources

                                                                           Locate events within
                                                                          45 miles of New York in
                                                                             November, 2005




                                                                                                    26
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Semantic Parsing


  • Tools parse millions of documents;
  • Automated learning for related information.
           – Query: “Bay Area Cooking Classes”

           – Finds: “San Francisco College Classes”;
                    “The Magic of Thai Cuisine”




                                                                          27
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Principle #3




         Shared data models




                              28
Data Engineering

    • Standard file management: The Google
      File System (GFS);
    • Standard job scheduling: The Global Work
      Queue (GWQ);
    • Standard network management: The
      Google MapReduce system.



                                                                          29
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google File System (GFS)

        • Replicated Masters manage MetaData
          directories;
        • Data transfers directly at the machine
          level within 2,000+ clusters;
        • File broken into 64 MB chunks for
          2000+ MB/second read/write load;
        • All file chunks at least triplicate for
          safety.

                                                                          30
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Data Dictionary for Interoperability




                                                                          31
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Application Interfaces




                                                                          32
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Google Principle #4




        Upgrading & Innovation




                                 33
Deliver On-Line Services




                                                                          34
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Shopping Services




                                                                          35
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Environment for Rapid Innovation




                                                                          36
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
A New Application Launched in 15 Minutes




                                                                          37
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Occupy the Desktop




                                                                          38
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Multimedia Services




                                                                          39
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures




    Comparison Summaries




                                      40
Workgroup Computing Today:
        Millions of Local Applications+Local Data




                                                                          41
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Work-Groups Vulnerable Today




                                                                          42
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
New Internet: Billions of Browsers,
        Secure Shared Applications+Data




     Browsers




                     Application
                     Application
                     Application                                          43
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (1)

  Workgroup Centric                                                       Network Centric

 Strategy: Capture Desktop                                                Strategy: Occupy Internet
 Customer’s labor and capital                                             Labor and capital in network
 User-specific infrastructures                                            Infrastructure is universal
 Systems controls by user                                                 Network controls in network
 Operating system dependency                                              Open source browsers
 License Software                                                         Pay for Use




                                                                                             44
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (2)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network
Operating system dependency                                               Open source browsers
License Software                                                          Pay for Use




                                                                                            45
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (3)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network
Operating system dependency                                               Open source browsers
License Software                                                          Pay for Use




                                                                                              46
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (4)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network
Operating system dependency                                               Open source browsers
License Software                                                          Pay for Use




                                                                                            47
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (5)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network
Operating system dependent                                                Open source browsers
License Software                                                          Pay for Use




                                                                                              48
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (6)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network

License Software                                                          Pay for Use




                                                                                              49
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Workgroup vs. Network Architectures (7)

Workgroup Centric                                                         Network Centric

Strategy: Capture Desktop                                                 Strategy: Occupy Internet
Customer’s labor and capital                                              Labor and capital in network
User-specific infrastructures                                             Infrastructure is universal
Systems controls by user                                                  Network controls in network

License Software                                                          Pay for Use
Data read from files                                                      Data assembled in context



                                                                                              50
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
The Future

  Technology
  • All electronic devices on Internet.
  • Data, voice, video, sensor inputs accessible.
  • Phone, TV and print media displaced.

  Services
  • Systems respond to questions.
  • Information is displayed in context.
  • Applications for decision-making.

                                                                          51
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
Relevance for National Security Systems

       • Workgroups to Network-Centric services.
       • Migrate through displacement.
       • Invest savings in innovation.




                                                                          52
Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY

More Related Content

Similar to Lecture v4

The future of tape
The future of tapeThe future of tape
The future of tape
Josef Weingand
 
IBM Tape the future of tape
IBM Tape the future of tapeIBM Tape the future of tape
IBM Tape the future of tape
Josef Weingand
 
Vortrag ralph behrens_ibm-data
Vortrag ralph behrens_ibm-dataVortrag ralph behrens_ibm-data
Vortrag ralph behrens_ibm-data
Aravindharamanan S
 
RAMCloud: Scalable Datacenter Storage Entirely in DRAM
RAMCloud: Scalable  Datacenter Storage  Entirely in DRAMRAMCloud: Scalable  Datacenter Storage  Entirely in DRAM
RAMCloud: Scalable Datacenter Storage Entirely in DRAM
George Ang
 
Horizon 20110928
Horizon 20110928Horizon 20110928
Horizon 20110928
Mike Miller
 
Why z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsWhy z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIs
Teodoro Cipresso
 
KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.
Kyong-Ha Lee
 
S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2
Tony Pearson
 
Big Data and OSS at IBM
Big Data and OSS at IBMBig Data and OSS at IBM
Big Data and OSS at IBM
Boulder Java User's Group
 
Flash Ahead: IBM Flash System Selling Point
Flash Ahead: IBM Flash System Selling PointFlash Ahead: IBM Flash System Selling Point
Flash Ahead: IBM Flash System Selling Point
CTI Group
 
USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)
Ryousei Takano
 
Webinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be BackupsWebinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be Backups
Storage Switzerland
 
Compare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL ServerCompare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL Server
AlexDepo
 
Storage overview
Storage overviewStorage overview
Storage overview
Christalin Nelson
 
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
Paul Hofmann
 
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
javier ramirez
 
Research and technology explosion in scale-out storage
Research and technology explosion in scale-out storageResearch and technology explosion in scale-out storage
Research and technology explosion in scale-out storage
Jeff Spencer
 
Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
Alexey Grishchenko
 
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data EngineNZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
IBM z Systems Software - IT Service Management
 
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Aerospike
 

Similar to Lecture v4 (20)

The future of tape
The future of tapeThe future of tape
The future of tape
 
IBM Tape the future of tape
IBM Tape the future of tapeIBM Tape the future of tape
IBM Tape the future of tape
 
Vortrag ralph behrens_ibm-data
Vortrag ralph behrens_ibm-dataVortrag ralph behrens_ibm-data
Vortrag ralph behrens_ibm-data
 
RAMCloud: Scalable Datacenter Storage Entirely in DRAM
RAMCloud: Scalable  Datacenter Storage  Entirely in DRAMRAMCloud: Scalable  Datacenter Storage  Entirely in DRAM
RAMCloud: Scalable Datacenter Storage Entirely in DRAM
 
Horizon 20110928
Horizon 20110928Horizon 20110928
Horizon 20110928
 
Why z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIsWhy z/OS is a Great Platform for Developing and Hosting APIs
Why z/OS is a Great Platform for Developing and Hosting APIs
 
KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.KIISE:SIGDB Workshop presentation.
KIISE:SIGDB Workshop presentation.
 
S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2
 
Big Data and OSS at IBM
Big Data and OSS at IBMBig Data and OSS at IBM
Big Data and OSS at IBM
 
Flash Ahead: IBM Flash System Selling Point
Flash Ahead: IBM Flash System Selling PointFlash Ahead: IBM Flash System Selling Point
Flash Ahead: IBM Flash System Selling Point
 
USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)
 
Webinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be BackupsWebinar: How Snapshots CAN be Backups
Webinar: How Snapshots CAN be Backups
 
Compare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL ServerCompare Clustering Methods for MS SQL Server
Compare Clustering Methods for MS SQL Server
 
Storage overview
Storage overviewStorage overview
Storage overview
 
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
New Business Applications Powered by In-Memory Technology @MIT Forum for Supp...
 
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
Scalable Relational Databases with Amazon Aurora. Madrid Summit 2019
 
Research and technology explosion in scale-out storage
Research and technology explosion in scale-out storageResearch and technology explosion in scale-out storage
Research and technology explosion in scale-out storage
 
Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
 
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data EngineNZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
NZS-4532 - Bringing Historical Data to Life with IBMs SMF Data Engine
 
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
Theresa Melvin, HP Enterprise - IOT/AI/ML at Hyperscale - how to go faster wi...
 

Recently uploaded

Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Neo4j
 
Must Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during MigrationMust Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during Migration
Mydbops
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptxAI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
Sunil Jagani
 
Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!
Ortus Solutions, Corp
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
Fwdays
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
LizaNolte
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
christinelarrosa
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
AlexanderRichford
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
leebarnesutopia
 
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
manji sharman06
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
zjhamm304
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
Sease
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 

Recently uploaded (20)

Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid ResearchHarnessing the Power of NLP and Knowledge Graphs for Opioid Research
Harnessing the Power of NLP and Knowledge Graphs for Opioid Research
 
Must Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during MigrationMust Know Postgres Extension for DBA and Developer during Migration
Must Know Postgres Extension for DBA and Developer during Migration
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptxAI in the Workplace Reskilling, Upskilling, and Future Work.pptx
AI in the Workplace Reskilling, Upskilling, and Future Work.pptx
 
Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!Introducing BoxLang : A new JVM language for productivity and modularity!
Introducing BoxLang : A new JVM language for productivity and modularity!
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillinQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
inQuba Webinar Mastering Customer Journey Management with Dr Graham Hill
 
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptxPRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
PRODUCT LISTING OPTIMIZATION PRESENTATION.pptx
 
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
QR Secure: A Hybrid Approach Using Machine Learning and Security Validation F...
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
 
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
Call Girls Chandigarh🔥7023059433🔥Agency Profile Escorts in Chandigarh Availab...
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...QA or the Highway - Component Testing: Bridging the gap between frontend appl...
QA or the Highway - Component Testing: Bridging the gap between frontend appl...
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 

Lecture v4

  • 1. A Model for the Systems Architecture of the Future Prof. Paul A. Strassmann George Mason University, December 5, 2005 1 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 2. Data-Centric Era; IBM Dominates Hundred Sources 1950-1980 Months⇒Weeks 2 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 3. Workgroup-Centric Era; Microsoft, INTEL Dominate Million Sources Hundred Sources 1950-1980 1980-2010 Weeks⇒Days Months⇒Weeks 3 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 4. Network-Centric Era; Google and Cisco? +Multi-Media +Text Billions Sources Data Million Sources Hundred Sources 1950-1980 1980-2010 2010- Days⇒Real-Time Weeks⇒Days Months⇒Weeks 4 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 5. Example of a Network-Centric System 5 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 6. Network-Centric Requirements (2010) • Downtime (< 5 min/yr); • Display (200 Billion ops/sec); • Connectivity (> 1 Gigabyte/sec); • Access (< 0.25 sec); • Innovation (< 1 day); • Security (> 8 sigma). 6 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 7. Performance (2005) • Infrastructure = > 50% of spending; • Security = ?; • Integration = > 50% of applications; • Network downtime = > 1 hour/year; • Innovation = > 1 year. 7 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 8. Conclusion • Network-Centric systems cannot be built on Workgroup-Centric architecture. 8 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 9. Network-Centric Principles (Google) 1. Build & operate protected information network; 2. Offer universal connectivity for: – Collection, processing and storing of information; – Provide secured communications. 3. Maintain shared data models; 4. Require continued upgrading & innovation. 9
  • 10. Google Principle #1 Build & operate protected information network 10
  • 11. Standard Google Clusters Operate Net • 359 racks • 31,654 machines • 63,184 CPUs • 126,368 Ghz of processing power • 63,184 Gb of RAM • 2,527 Tb of Hard Drive space • Appx. 40 million searches/day 11 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 12. Clusters Have Identical Architecture Index Servers Web Web Switch Servers Document Servers 12
  • 13. Google Cluster Set-Up = Three Days 13
  • 14. Google Infrastructure: Key to Growth (1) • >200,000 custom-built commodity servers; • Acting as one parallel supercomputer; • Fault tolerant hardware. • Storage capacity >5 petabytes; low response latency (0.2 sec); >80GB per server, distributed; • Indexed >8 billion web pages; Indexing is computationally complex (>500M * > 2B matrix) • Capital and operating costs at fraction of large scale commercial servers; traffic growth 20-30%/month; data centers (>12); in US, Europe and Asia. 14 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 15. Google Infrastructure: Key to Growth (2) • >200,000 commodity Linux servers built to custom specifications; distributed cluster architecture; acting as one parallel supercomputer; scaleable; • >50,000 requests/sec; fault tolerant (no single point of failure); diverse hardware; stripped version of Red Hat; • Storage capacity >5 petabytes; • >80GB per server; • Indexed >8 billion web pages; • Indexing is complex (500M x 2B matrix) • Capital and operating costs at fraction of large scale commercial servers; traffic growth 20-30%/month; data centers (>12); in US, Europe and Asia. 15 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 16. Google Infrastructure: Key to Growth (3) • >200,000 commodity Linux servers built to custom specifications; distributed cluster architecture; acting as one parallel supercomputer; scaleable; • >50,000 requests/sec; fault tolerant (no single point of failure); diverse hardware; stripped version of Red Hat; • Storage capacity >5 petabytes; low response latency (0.2 sec); >80GB per server, distributed; • Indexed >8 billion web pages; Indexing is computationally complex (>500M * > 2B matrix) • Capital and operating costs a fraction of commercial servers; • Traffic growth 20-30%/month; • Data centers (>20), in US, Europe and Asia. 16 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 17. Architecture for Reliability • Replication (3x+) for redundancy; • Replication for proximity and response; • Reliability with software and architecture, not with hardware. 17 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 18. Indexing for Response • Dynamic indexing of 8B+ pages; • Dynamic indexing of 1B+ images; • Indexing of 1B+ messages; • Index broken into “shards” and distributed across data centers. 18 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 19. Query Serving Infrastructure • Processing a single query may involve 1000+ servers; • Index Servers access Index Shards; • Document Servers access Doc Shards; • Response times monitored to assure <0.25 sec latency. 19 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 20. Google MapReduce System (1) • Coordinates servers in real-time; • Automates distribution of workload; • Fault tolerance and service reconstitution; • Systems-wide I/O cluster scheduling; • Status and performance monitoring. 20 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 21. Google MapReduce System (2) • Coordination of servers in real-time; • Automates distribution of workload; • Fault tolerance & service reconstitution; • Systems-wide cluster scheduling; • Status and performance monitoring. 21 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 22. Google Principle #2 Universal connectivity 22
  • 23. Multi-Lingual Services 23 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 24. Search in Arabic Media 24 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 25. Video Searches 25 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 26. Google Base - Connecting Diverse Sources Locate events within 45 miles of New York in November, 2005 26 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 27. Semantic Parsing • Tools parse millions of documents; • Automated learning for related information. – Query: “Bay Area Cooking Classes” – Finds: “San Francisco College Classes”; “The Magic of Thai Cuisine” 27 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 28. Google Principle #3 Shared data models 28
  • 29. Data Engineering • Standard file management: The Google File System (GFS); • Standard job scheduling: The Global Work Queue (GWQ); • Standard network management: The Google MapReduce system. 29 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 30. Google File System (GFS) • Replicated Masters manage MetaData directories; • Data transfers directly at the machine level within 2,000+ clusters; • File broken into 64 MB chunks for 2000+ MB/second read/write load; • All file chunks at least triplicate for safety. 30 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 31. Data Dictionary for Interoperability 31 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 32. Application Interfaces 32 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 33. Google Principle #4 Upgrading & Innovation 33
  • 34. Deliver On-Line Services 34 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 35. Shopping Services 35 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 36. Environment for Rapid Innovation 36 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 37. A New Application Launched in 15 Minutes 37 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 38. Occupy the Desktop 38 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 39. Multimedia Services 39 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 40. Workgroup vs. Network Architectures Comparison Summaries 40
  • 41. Workgroup Computing Today: Millions of Local Applications+Local Data 41 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 42. Work-Groups Vulnerable Today 42 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 43. New Internet: Billions of Browsers, Secure Shared Applications+Data Browsers Application Application Application 43 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 44. Workgroup vs. Network Architectures (1) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependency Open source browsers License Software Pay for Use 44 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 45. Workgroup vs. Network Architectures (2) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependency Open source browsers License Software Pay for Use 45 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 46. Workgroup vs. Network Architectures (3) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependency Open source browsers License Software Pay for Use 46 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 47. Workgroup vs. Network Architectures (4) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependency Open source browsers License Software Pay for Use 47 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 48. Workgroup vs. Network Architectures (5) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network Operating system dependent Open source browsers License Software Pay for Use 48 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 49. Workgroup vs. Network Architectures (6) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network License Software Pay for Use 49 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 50. Workgroup vs. Network Architectures (7) Workgroup Centric Network Centric Strategy: Capture Desktop Strategy: Occupy Internet Customer’s labor and capital Labor and capital in network User-specific infrastructures Infrastructure is universal Systems controls by user Network controls in network License Software Pay for Use Data read from files Data assembled in context 50 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 51. The Future Technology • All electronic devices on Internet. • Data, voice, video, sensor inputs accessible. • Phone, TV and print media displaced. Services • Systems respond to questions. • Information is displayed in context. • Applications for decision-making. 51 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY
  • 52. Relevance for National Security Systems • Workgroups to Network-Centric services. • Migrate through displacement. • Invest savings in innovation. 52 Prof. Strassmann, GMU Lecture, 12/05/05 - REPRODUCED BY PERMISSION ONLY