SlideShare a Scribd company logo
1 of 18
High Performance
Computing for LiDAR
  Data Production




                      March 3, 2010
Processor Technologies and Speeds




PRE0184
64-bit Computing

           Requirements
               64-bit compatible hardware
               64-bit operating system
               64-bit designed software (not the same as 32-bit software that will run
                on a 64-bit computer)
           Benefits
               Access to larger amounts of memory, 32-bit OS can only read 4 GB of
                RAM (maximum accessible RAM is closer to 3.4 GB due to other
                hardware limitations)
                   More data can be read in at once
                   More RAM space available for multiple processes requiring memory space
               More stable
                   Less memory management needed to prevent out of memory errors
                   32-bit operating systems limit the kernel-mode virtual address space to 2
                    GB, the 64-bit limit is 8 TB


PRE0184
CPU Trends




PRE0184
Use of Multiple Processors and Large Amounts of RAM




PRE0184
Multi-Core / Multi-Threaded Processing

                                 MARS® Auto-Filter Benchmark Results
                                                                              Multiprocessing with
                                                                             16 processors yielding
                                                                              a 91% time savings

                       4:48:00

                       4:19:12

                       3:50:24

                       3:21:36
            HH:MM:SS




                       2:52:48
                                            4:31:59
                       2:24:00

                       1:55:12

                       1:26:24

                       0:57:36

                       0:28:48                                   0:25:14

                       0:00:00

                                        Single Processor   Multi Processor




PRE0184
Multi-Processing Results (single large file vs. small tiles)


                               MARS® Grid Export Multi-Processing Benchmark Results
                                       Multiprocessing with 16 processors yielding   Multiprocessing with 16 processors yielding
                                        a 79% time savings for single file export      an 81% time savings for tiled export


                          0:23:02


                          0:20:10


                          0:17:17


                          0:14:24
               HH:MM:SS




                                               0:21:33

                          0:11:31

                                                                                       0:15:04
                          0:08:38


                          0:05:46

                                                                     0:04:27
                          0:02:53                                                                          0:02:49


                          0:00:00

                                     Single File Export Single Processor              Single File Export Multi Processor
                                     Tiled Export Single Processor                    Tiled Export Multi Processor



PRE0184
Multi-Processing # of CPUs Performance Curve
                                   MARS® Multi-Processing Times (Runtime per # of CPUs)
                     1:55:12


                                                                                      Processor Intensive = High
                     1:40:48

                                                                                      Processor Intensive = Medium
                     1:26:24



                     1:12:00
          HH:MM:SS




                     0:57:36



                     0:43:12



                     0:28:48



                     0:14:24



                     0:00:00
                               1    2   3    4   5   6   7      8     9     10   11   12    13      14     15        16
                                                          Number of Processors




PRE0184
General-Purpose Graphics Processing Unit (GP-GPU)

    CPUs are designed to handle a variety of different
     applications and operating system needs (currently
     maxed at 16 cores)
    GPUs were originally designed to handle video
     rendering and screen display
    GP-GPUs can be used for massively parallel
     processing with the use of SDKs like NVIDIA’s
     CUDA (Compute Unified Device Architecture)
    GP-GPU processing system enhancements can be
     accomplished by the use of one or more graphics
     cards in a workstation or rack mounted GPU server
     clusters (thousands of processor cores)




PRE0184
CPU vs. GPU Speeds




      August 24, 2009 - Nvidia’s CEO predicted that GPU computing will experience a rapid performance boost over the next six
      years. According to Jen-Hsun Huang, GPU computing is likely to increase its current capabilities by 570x, while 'pure'
      CPU performance will progress by a limited 3x. This would require tripling the speed of the GPU every year.


    Source: NVIDIA webinar: http://www.nvidia.com/content/webinar/Tesla_Fermi_Webinar_Jan13_10_v1_1.pdf
PRE0184
Internal Computer Hard Drives
             Technologies
                    Solid state (no moving parts)
                         Pros
                                Faster start up
                                Faster read/write
                                Fragmentation has little effect
                                Silent operations
                                More reliable
                                Can endure shock, high altitude, extreme temperatures
                         Cons (currently)
                                Lower capacities
                                More expensive
                    Spinning (most widely used)
             Typical types
                    SAS (Serial Attached SCSI) – successor to SCSI
                    SATA (Serial AT Attachment) – successor to ATA
             Performance
                         SAS – faster 15,000 RPMs
                         SATA – slower 9,200 RPMs
             Size
                    SAS – up to 500 GB
                    SATA – up to 2 TB
             RAID levels
                    10 (RAID 1+0) – highest performance
                    50 (RAID 5+0) - larger space, more redundancy
             Connector types
                    iSCSI – quick install, less hardware
                    Fiber – complex install, more hardware

PRE0184
Writing Data to Temporary Local Drive Space
                               MARS® Grid Export Testing Network and Local Drive I/O
                                                                                                                        80% export time savings
                                                                                                                        writing to local drive then
                                                                                                                            moving product to
                                                                                                                              network drive

                     4:48:00

                     4:19:12

                     3:50:24

                     3:21:36

                     2:52:48
          HH:MM:SS




                                                       4:22:59
                     2:24:00

                     1:55:12

                     1:26:24
                                                                                  0:55:27
                     0:57:36                                                                              0:47:47

                     0:28:48

                     0:00:00
                               Export to grid across network (input and output on network)
                               Export to grid across network (input on network, output to local drive temp space and then moved to network)
                               Export to grid with system drive (input and output on local drive)




PRE0184
High Speed Local Area Networks (LAN)
                                Gigabit vs. 10 Gigabit Ethernet Network File Copy Times

                                                                               39% disc I/O time savings
                                                                                   using 10 Gbps as
                                                                                 compared to 1 Gbps

                                                                               22.66 GB of varying files sized
                                                                               from 6 kb to 18 GB (files read
                                                                                 and written using Windows
                      0:03:36                                                         Server 2008 R2)




                      0:02:53
           HH:MM:SS




                                                  0:03:30
                      0:02:10



                                                                    0:02:08
                      0:01:26




                      0:00:43




                      0:00:00

                                                   Gigabit        10 Gigabit



PRE0184
Disc I/O Improvements in Windows Operating Systems


                               File Copy Times For Windows Operating System
                                                                                                     70% disc I/O time savings
                                                                                                       using Windows 7 as
                                                                                                     compared to Windows XP
                          0:05:46
                                                                                                      4.66 GB of varying files
                                                                                                     sized from 6 kb to 1.2 GB
                          0:05:02


                          0:04:19


                          0:03:36
               HH:MM:SS




                                                         0:05:17
                          0:02:53


                          0:02:10


                          0:01:26                                         0:01:56
                                                                                           0:01:36

                          0:00:43


                          0:00:00
                                    Windows XP (Windows 2003)      Windows Vista (Windows 2008)      Windows 7 (Windows 2008 R2)




PRE0184
Server-side Processing
     Network – 10GE                 File server(s)
     Processing server(s)               Scalable
         Fast processor                 Tiered storage for best
          technology – Nehalem            performance
          microarchitecture
                                             SSD – for temporary,
         Multi-processor CPUs,               unfragmented files
          8 or more cores                    SAS – for fast
                                              processing
         Fast local HDD (does
          not have to be huge)               SATA – for large
                                              storage
         GP-GPU cluster




PRE0184
Clustering
             Distributed Processing
                 Typically designed to work within a LAN environment
                 Highly scalable
                 Scheduled processes
                 Harvest free CPU clock cycles
                 Very configurable
                     Resource limits
                     Priorities
                     Time limits



             Cloud Computing
                 Shared computer processing resources via the Internet by
                  renting usage from a third-party provider
                 Data and software is usually stored on remote servers
                 Key features
                     Agility                        Scalability
                     Cost                           Security
                     Device and location            Maintenance
                      independence
                                                     Metering
                     Multi-tenancy
                                                     High performance
                     Reliability

                                                      Source: Wikipedia http://en.wikipedia.org/wiki/Cloud_computing
PRE0184
Summary
                             Technologies                         Recommendation
          Processor microarchitecture                                    Nehalem

          32/64 bit                                                        64-bit*

          Multi-processing                                       Multi-Core / Multi-Thread*

          Number of CPUs                                                 8 or more*

          Amount of RAM                                                8GB or more*
                                                                   GP-GPU processing*
          Beyond CPU processing
                                                        (higher end Nvidia card is worth the money)
                                                              SAS with iSCSI connection
          Internal Hard Drives
                                                        (SSD when price drops and size increases)
          Read/Write “trick”                           File server → local internal HDD → file server*

          LAN                                                   10 Gigabit Ethernet (10GE)

          Operating System (if using Windows)                  Windows 7 / Server 2008 R2*

          Processing architecture                                       Server-side

          Clustering                                    Distributed processing or Cloud computing*

           * If software supports this functionality
PRE0184
Thank you!


              Any questions?


             Matthew Bethel, GISP
          Manager of Systems Engineering
          Email: matt.bethel@merrick.com

PRE0184

More Related Content

Viewers also liked

Internet Of Things - The value beyond hypes
Internet Of Things - The value beyond hypesInternet Of Things - The value beyond hypes
Internet Of Things - The value beyond hypesRenjith Ramachandran
 
Ilmf2010 breaklines
Ilmf2010 breaklinesIlmf2010 breaklines
Ilmf2010 breaklinesraj.m.rao
 
Apresentação do catálogo 16 oriflame
Apresentação do catálogo 16 oriflameApresentação do catálogo 16 oriflame
Apresentação do catálogo 16 oriflamenmicaelo
 
STOP's Corporate Presentation
STOP's Corporate PresentationSTOP's Corporate Presentation
STOP's Corporate Presentationstoptheft
 
Campus Laptop Security 2010
Campus Laptop Security 2010Campus Laptop Security 2010
Campus Laptop Security 2010stoptheft
 
Oriflame Catálogo 7 2016
Oriflame Catálogo 7 2016Oriflame Catálogo 7 2016
Oriflame Catálogo 7 2016nmicaelo
 
The 411 on Facebook: An FYI for Teachers
The 411 on Facebook: An FYI for TeachersThe 411 on Facebook: An FYI for Teachers
The 411 on Facebook: An FYI for TeachersAPatterson79
 
Lidar hsi datafusion ilmf 2010
Lidar hsi datafusion ilmf 2010Lidar hsi datafusion ilmf 2010
Lidar hsi datafusion ilmf 2010raj.m.rao
 
Behaviour - Biological bases
  Behaviour - Biological bases  Behaviour - Biological bases
Behaviour - Biological basesROY AUGUSTINE
 
Novidades do catálogo 14 oriflame
Novidades do catálogo 14 oriflameNovidades do catálogo 14 oriflame
Novidades do catálogo 14 oriflamenmicaelo
 
Assistive Technology Presentation
Assistive Technology PresentationAssistive Technology Presentation
Assistive Technology PresentationAPatterson79
 
Flickr presentation
Flickr presentationFlickr presentation
Flickr presentationAPatterson79
 

Viewers also liked (14)

Internet Of Things - The value beyond hypes
Internet Of Things - The value beyond hypesInternet Of Things - The value beyond hypes
Internet Of Things - The value beyond hypes
 
Ilmf2010 breaklines
Ilmf2010 breaklinesIlmf2010 breaklines
Ilmf2010 breaklines
 
Apresentação do catálogo 16 oriflame
Apresentação do catálogo 16 oriflameApresentação do catálogo 16 oriflame
Apresentação do catálogo 16 oriflame
 
STOP's Corporate Presentation
STOP's Corporate PresentationSTOP's Corporate Presentation
STOP's Corporate Presentation
 
Campus Laptop Security 2010
Campus Laptop Security 2010Campus Laptop Security 2010
Campus Laptop Security 2010
 
Oriflame Catálogo 7 2016
Oriflame Catálogo 7 2016Oriflame Catálogo 7 2016
Oriflame Catálogo 7 2016
 
The 411 on Facebook: An FYI for Teachers
The 411 on Facebook: An FYI for TeachersThe 411 on Facebook: An FYI for Teachers
The 411 on Facebook: An FYI for Teachers
 
Lidar hsi datafusion ilmf 2010
Lidar hsi datafusion ilmf 2010Lidar hsi datafusion ilmf 2010
Lidar hsi datafusion ilmf 2010
 
Behaviour - Biological bases
  Behaviour - Biological bases  Behaviour - Biological bases
Behaviour - Biological bases
 
Novidades do catálogo 14 oriflame
Novidades do catálogo 14 oriflameNovidades do catálogo 14 oriflame
Novidades do catálogo 14 oriflame
 
Tissues
TissuesTissues
Tissues
 
SMAC Stack - A Quick Intro
SMAC Stack - A Quick IntroSMAC Stack - A Quick Intro
SMAC Stack - A Quick Intro
 
Assistive Technology Presentation
Assistive Technology PresentationAssistive Technology Presentation
Assistive Technology Presentation
 
Flickr presentation
Flickr presentationFlickr presentation
Flickr presentation
 

Similar to 2010 ilmf asprs hot topics session

High Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data ProductionHigh Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data ProductionMattBethel1
 
Cracking the nut, solving edge ai with apache tools and frameworks
Cracking the nut, solving edge ai with apache tools and frameworksCracking the nut, solving edge ai with apache tools and frameworks
Cracking the nut, solving edge ai with apache tools and frameworksTimothy Spann
 
Frokost seminar windows server 2012
Frokost seminar   windows server 2012Frokost seminar   windows server 2012
Frokost seminar windows server 2012Olav Tvedt
 
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, ClouderaHadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, ClouderaCloudera, Inc.
 
S3332 peter bakkum
S3332 peter bakkumS3332 peter bakkum
S3332 peter bakkumPeter Bakkum
 
M&t presentation
M&t presentationM&t presentation
M&t presentationcivcimix
 
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...Infoshare
 
Servers Technologies and Enterprise Data Center Trends 2014 - Thailand
Servers Technologies and Enterprise Data Center Trends 2014 - ThailandServers Technologies and Enterprise Data Center Trends 2014 - Thailand
Servers Technologies and Enterprise Data Center Trends 2014 - ThailandAruj Thirawat
 
A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networksinside-BigData.com
 
Super scaling singleton inserts
Super scaling singleton insertsSuper scaling singleton inserts
Super scaling singleton insertsChris Adkin
 
PC = Personal Cloud (or how to use your development machine with Vagrant and ...
PC = Personal Cloud (or how to use your development machine with Vagrant and ...PC = Personal Cloud (or how to use your development machine with Vagrant and ...
PC = Personal Cloud (or how to use your development machine with Vagrant and ...Codemotion
 
Loadays managing my sql with percona toolkit
Loadays managing my sql with percona toolkitLoadays managing my sql with percona toolkit
Loadays managing my sql with percona toolkitFrederic Descamps
 
Advanced Windows Debugging
Advanced Windows DebuggingAdvanced Windows Debugging
Advanced Windows DebuggingBala Subra
 
Hadoop on a personal supercomputer
Hadoop on a personal supercomputerHadoop on a personal supercomputer
Hadoop on a personal supercomputerPaul Dingman
 
How to Make Norikra Perfect
How to Make Norikra PerfectHow to Make Norikra Perfect
How to Make Norikra PerfectSATOSHI TAGOMORI
 
Hadoop - Disk Fail In Place (DFIP)
Hadoop - Disk Fail In Place (DFIP)Hadoop - Disk Fail In Place (DFIP)
Hadoop - Disk Fail In Place (DFIP)mundlapudi
 
Critical Attributes for a High-Performance, Low-Latency Database
Critical Attributes for a High-Performance, Low-Latency DatabaseCritical Attributes for a High-Performance, Low-Latency Database
Critical Attributes for a High-Performance, Low-Latency DatabaseScyllaDB
 

Similar to 2010 ilmf asprs hot topics session (20)

High Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data ProductionHigh Performance Computing for LiDAR Data Production
High Performance Computing for LiDAR Data Production
 
Cracking the nut, solving edge ai with apache tools and frameworks
Cracking the nut, solving edge ai with apache tools and frameworksCracking the nut, solving edge ai with apache tools and frameworks
Cracking the nut, solving edge ai with apache tools and frameworks
 
Frokost seminar windows server 2012
Frokost seminar   windows server 2012Frokost seminar   windows server 2012
Frokost seminar windows server 2012
 
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, ClouderaHadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
Hadoop World 2011: Hadoop and Performance - Todd Lipcon & Yanpei Chen, Cloudera
 
S3332 peter bakkum
S3332 peter bakkumS3332 peter bakkum
S3332 peter bakkum
 
M&t presentation
M&t presentationM&t presentation
M&t presentation
 
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
 
Servers Technologies and Enterprise Data Center Trends 2014 - Thailand
Servers Technologies and Enterprise Data Center Trends 2014 - ThailandServers Technologies and Enterprise Data Center Trends 2014 - Thailand
Servers Technologies and Enterprise Data Center Trends 2014 - Thailand
 
A Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural NetworksA Dataflow Processing Chip for Training Deep Neural Networks
A Dataflow Processing Chip for Training Deep Neural Networks
 
Super scaling singleton inserts
Super scaling singleton insertsSuper scaling singleton inserts
Super scaling singleton inserts
 
Current Trends in HPC
Current Trends in HPCCurrent Trends in HPC
Current Trends in HPC
 
PC = Personal Cloud (or how to use your development machine with Vagrant and ...
PC = Personal Cloud (or how to use your development machine with Vagrant and ...PC = Personal Cloud (or how to use your development machine with Vagrant and ...
PC = Personal Cloud (or how to use your development machine with Vagrant and ...
 
Loadays managing my sql with percona toolkit
Loadays managing my sql with percona toolkitLoadays managing my sql with percona toolkit
Loadays managing my sql with percona toolkit
 
Advanced Windows Debugging
Advanced Windows DebuggingAdvanced Windows Debugging
Advanced Windows Debugging
 
Hadoop on a personal supercomputer
Hadoop on a personal supercomputerHadoop on a personal supercomputer
Hadoop on a personal supercomputer
 
How to Make Norikra Perfect
How to Make Norikra PerfectHow to Make Norikra Perfect
How to Make Norikra Perfect
 
Hadoop - Disk Fail In Place (DFIP)
Hadoop - Disk Fail In Place (DFIP)Hadoop - Disk Fail In Place (DFIP)
Hadoop - Disk Fail In Place (DFIP)
 
Vigor Ex
Vigor ExVigor Ex
Vigor Ex
 
Loom promises: be there!
Loom promises: be there!Loom promises: be there!
Loom promises: be there!
 
Critical Attributes for a High-Performance, Low-Latency Database
Critical Attributes for a High-Performance, Low-Latency DatabaseCritical Attributes for a High-Performance, Low-Latency Database
Critical Attributes for a High-Performance, Low-Latency Database
 

Recently uploaded

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 

Recently uploaded (20)

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 

2010 ilmf asprs hot topics session

  • 1. High Performance Computing for LiDAR Data Production March 3, 2010
  • 2. Processor Technologies and Speeds PRE0184
  • 3. 64-bit Computing  Requirements  64-bit compatible hardware  64-bit operating system  64-bit designed software (not the same as 32-bit software that will run on a 64-bit computer)  Benefits  Access to larger amounts of memory, 32-bit OS can only read 4 GB of RAM (maximum accessible RAM is closer to 3.4 GB due to other hardware limitations)  More data can be read in at once  More RAM space available for multiple processes requiring memory space  More stable  Less memory management needed to prevent out of memory errors  32-bit operating systems limit the kernel-mode virtual address space to 2 GB, the 64-bit limit is 8 TB PRE0184
  • 5. Use of Multiple Processors and Large Amounts of RAM PRE0184
  • 6. Multi-Core / Multi-Threaded Processing MARS® Auto-Filter Benchmark Results Multiprocessing with 16 processors yielding a 91% time savings 4:48:00 4:19:12 3:50:24 3:21:36 HH:MM:SS 2:52:48 4:31:59 2:24:00 1:55:12 1:26:24 0:57:36 0:28:48 0:25:14 0:00:00 Single Processor Multi Processor PRE0184
  • 7. Multi-Processing Results (single large file vs. small tiles) MARS® Grid Export Multi-Processing Benchmark Results Multiprocessing with 16 processors yielding Multiprocessing with 16 processors yielding a 79% time savings for single file export an 81% time savings for tiled export 0:23:02 0:20:10 0:17:17 0:14:24 HH:MM:SS 0:21:33 0:11:31 0:15:04 0:08:38 0:05:46 0:04:27 0:02:53 0:02:49 0:00:00 Single File Export Single Processor Single File Export Multi Processor Tiled Export Single Processor Tiled Export Multi Processor PRE0184
  • 8. Multi-Processing # of CPUs Performance Curve MARS® Multi-Processing Times (Runtime per # of CPUs) 1:55:12 Processor Intensive = High 1:40:48 Processor Intensive = Medium 1:26:24 1:12:00 HH:MM:SS 0:57:36 0:43:12 0:28:48 0:14:24 0:00:00 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Number of Processors PRE0184
  • 9. General-Purpose Graphics Processing Unit (GP-GPU)  CPUs are designed to handle a variety of different applications and operating system needs (currently maxed at 16 cores)  GPUs were originally designed to handle video rendering and screen display  GP-GPUs can be used for massively parallel processing with the use of SDKs like NVIDIA’s CUDA (Compute Unified Device Architecture)  GP-GPU processing system enhancements can be accomplished by the use of one or more graphics cards in a workstation or rack mounted GPU server clusters (thousands of processor cores) PRE0184
  • 10. CPU vs. GPU Speeds August 24, 2009 - Nvidia’s CEO predicted that GPU computing will experience a rapid performance boost over the next six years. According to Jen-Hsun Huang, GPU computing is likely to increase its current capabilities by 570x, while 'pure' CPU performance will progress by a limited 3x. This would require tripling the speed of the GPU every year. Source: NVIDIA webinar: http://www.nvidia.com/content/webinar/Tesla_Fermi_Webinar_Jan13_10_v1_1.pdf PRE0184
  • 11. Internal Computer Hard Drives  Technologies  Solid state (no moving parts)  Pros  Faster start up  Faster read/write  Fragmentation has little effect  Silent operations  More reliable  Can endure shock, high altitude, extreme temperatures  Cons (currently)  Lower capacities  More expensive  Spinning (most widely used)  Typical types  SAS (Serial Attached SCSI) – successor to SCSI  SATA (Serial AT Attachment) – successor to ATA  Performance  SAS – faster 15,000 RPMs  SATA – slower 9,200 RPMs  Size  SAS – up to 500 GB  SATA – up to 2 TB  RAID levels  10 (RAID 1+0) – highest performance  50 (RAID 5+0) - larger space, more redundancy  Connector types  iSCSI – quick install, less hardware  Fiber – complex install, more hardware PRE0184
  • 12. Writing Data to Temporary Local Drive Space MARS® Grid Export Testing Network and Local Drive I/O 80% export time savings writing to local drive then moving product to network drive 4:48:00 4:19:12 3:50:24 3:21:36 2:52:48 HH:MM:SS 4:22:59 2:24:00 1:55:12 1:26:24 0:55:27 0:57:36 0:47:47 0:28:48 0:00:00 Export to grid across network (input and output on network) Export to grid across network (input on network, output to local drive temp space and then moved to network) Export to grid with system drive (input and output on local drive) PRE0184
  • 13. High Speed Local Area Networks (LAN) Gigabit vs. 10 Gigabit Ethernet Network File Copy Times 39% disc I/O time savings using 10 Gbps as compared to 1 Gbps 22.66 GB of varying files sized from 6 kb to 18 GB (files read and written using Windows 0:03:36 Server 2008 R2) 0:02:53 HH:MM:SS 0:03:30 0:02:10 0:02:08 0:01:26 0:00:43 0:00:00 Gigabit 10 Gigabit PRE0184
  • 14. Disc I/O Improvements in Windows Operating Systems File Copy Times For Windows Operating System 70% disc I/O time savings using Windows 7 as compared to Windows XP 0:05:46 4.66 GB of varying files sized from 6 kb to 1.2 GB 0:05:02 0:04:19 0:03:36 HH:MM:SS 0:05:17 0:02:53 0:02:10 0:01:26 0:01:56 0:01:36 0:00:43 0:00:00 Windows XP (Windows 2003) Windows Vista (Windows 2008) Windows 7 (Windows 2008 R2) PRE0184
  • 15. Server-side Processing  Network – 10GE  File server(s)  Processing server(s)  Scalable  Fast processor  Tiered storage for best technology – Nehalem performance microarchitecture  SSD – for temporary,  Multi-processor CPUs, unfragmented files 8 or more cores  SAS – for fast processing  Fast local HDD (does not have to be huge)  SATA – for large storage  GP-GPU cluster PRE0184
  • 16. Clustering  Distributed Processing  Typically designed to work within a LAN environment  Highly scalable  Scheduled processes  Harvest free CPU clock cycles  Very configurable  Resource limits  Priorities  Time limits  Cloud Computing  Shared computer processing resources via the Internet by renting usage from a third-party provider  Data and software is usually stored on remote servers  Key features  Agility  Scalability  Cost  Security  Device and location  Maintenance independence  Metering  Multi-tenancy  High performance  Reliability Source: Wikipedia http://en.wikipedia.org/wiki/Cloud_computing PRE0184
  • 17. Summary Technologies Recommendation Processor microarchitecture Nehalem 32/64 bit 64-bit* Multi-processing Multi-Core / Multi-Thread* Number of CPUs 8 or more* Amount of RAM 8GB or more* GP-GPU processing* Beyond CPU processing (higher end Nvidia card is worth the money) SAS with iSCSI connection Internal Hard Drives (SSD when price drops and size increases) Read/Write “trick” File server → local internal HDD → file server* LAN 10 Gigabit Ethernet (10GE) Operating System (if using Windows) Windows 7 / Server 2008 R2* Processing architecture Server-side Clustering Distributed processing or Cloud computing* * If software supports this functionality PRE0184
  • 18. Thank you! Any questions? Matthew Bethel, GISP Manager of Systems Engineering Email: matt.bethel@merrick.com PRE0184