SlideShare a Scribd company logo
1 of 39
Utilizing Teradata Express for
Development and Sandboxing


Cameron Monson, Database Administrator
O.co
Agenda
•   Company Background and EDW Configuration

•   Why Virtualize?

•   Virtual Environment Details

•   Benefits

•   Challenges

•   Results

•   Next Steps

•   Questions
About O.co

•   Initial focus on liquidating excess inventory and currently
    focus on offering brand-name merchandise at discount prices
•   Products offering has grown from less than 100 in 1999 to
    over 700,000 currently
•   Customer Service – Currently ranked #4 by National Retail
    Federation and American Express
•   O.co, also known as Overstock.com, is Your Savings Engine offering
    brand-name products. The company offers its customers an
    opportunity to shop for bargains conveniently, while offering its
    suppliers an alternative inventory distribution channel. O.co,
    headquartered in Salt Lake City, is a publicly traded company listed on
    the NASDAQ Global Market System and can be found online at
    www.overstock.com and www.o.co.
Teradata at O.co
•   2005 – 8 Node (8+0) 5400
•   2006
     • 30 Node (24+6) 5450 (Production)
     • 6 Node (4+2) 5450 (Development/Test)
•   2010
     • 9 Node (6+3) 5600 (Production)
     • 2 Node (1+1) 5600 (Development/Test)
•   Supports mixed workload environment
     • TRM, DCM, SAS
     • Microstrategy
     • Ad-hoc Analytical and BI workload
     • ELT
        • GoldenGate
        • ODI
        • DMExpress
Why Virtualize?
Abstract & Motives
•   Test system currently functions as both development and test
    environments

     • Creates inconsistent object names between environments

•   Evaluating the limits and boundaries of new technology
    offerings

•   Enable Test system to more closely resemble production

•   Functional testing and development
Other Considerations for Virtualization

• Performance

   • Decreased load and demands on EDW systems

• Availability

   • Additional flexibility for scheduling system maintenance

• Isolation & Sandbox

• Budget

   • Commodity hardware
Virtual Environment
VMware ESXi

•   Installs directly on physical servers and partitions into
    multiple virtual machines that can be run simultaneously

•   Currently utilizing ESXi 4.0 cluster
     • Can clone and deploy additional images from master
       templates
     • Requires administrator interaction to clone and perform
       basic configuration steps
     • Provides workload management and automatic migration
       between hosts on cluster
     • “Plug & Play” install of new TD Express images

•   SAN Storage
     • Tier 2 SAN Storage (10K RPM SAS)
TD Express
•   Free developer version of the Teradata Database

•   1 TB Teradata Express 13.10 for VMware Player

•   Provided configuration includes

    • SLES 10 64-bit Linux

    • Teradata Database

    • Teradata Tools and Utilities (TTU)

    • Load & Unload Tools

        • Teradata’s EasyLoader tool

        • SQL Assistant Java Edition

•   Provided AS IS (Unsupported)
Benefits
Benefits
•   Secure

    • Data never outside corporate datacenters

•   Testing

    • Uncertified software ahead of upgrades

    • New features

    • BAR/Disaster Recovery

•   Able to connect externally using existing TTU

•   Developer Isolation & Sandboxes
Additional Benefits

•   Educational playground

    • Additional exposure to internal processes and programs

    • New features & functionality

•   Growth & capacity planning

    • Additional hardware (Nodes)

    • Additional storage
Environment Isolation
•   Allows for greater flexibility in testing and development

•   Reduces contention on frequently updated and accessed
    objects

•   Experimentation

     • New processes, programs & functionality

     • UDF’s

     • Safe

•   Snapshot functionality

     • Regression testing
Challenges
Database Administration Challenges

•   Synchronizing and Migrating DDL
     • TSET
•   Data Movement
     • BAR (Arcmain)
     • Data Mover
     • ETL (Fastload, TPT, Multiload, etc..)
•   Performance
     • Limited to 2 AMP’s
     • Hardware configuration
     • Shared hardware
•   Data Distribution/Skew
Security & Access

•   Maintain an environment representative of the enterprise
    Teradata systems

•   Simplify roles and rights to limit maintenance overhead on
    multiple virtual machines

•   Consolidate users on virtual machines

•   Corporate security considerations (SSO, etc…)
1 TB Perm Space Limitation

•   Try to minimize base image size to limit disk space overhead

•   Allow developers to load larger datasets as needed

•   Varying projects require vastly different data sets
Perm Space Limitation – Solution 1
Limited Data Set

   •   Seed smaller base objects (Lookup/Dimensions) in full

   •   Larger tables can be seeded partially or left empty (DDL only)

   •   Maintaining referential integrity can be challenging

   •   Still may not be able to provide enough data for functional
       testing in very large environments

   •   May enable limited integration testing in DEV environment

   •   Larger storage footprint
Perm Space Limitation – Solution 2
Break Virtual Images into Subject Areas

    •   Can be difficult to identify and maintain objects

    •   Duplicate Objects

         • Same objects required across multiple subject areas (3NF)

    •   Not feasible for all subject areas

    •   Referential integrity across subject areas

    •   Merging DDL changes back to trunk

    •   Integration testing may not be possible
Perm Space Limitation – Solutions 3 & 4
Ad hoc – Self Service

    •   Allows for greater flexibility in data size

    •   Requires self service options to limit administrative overhead

    •   Additional work involved for developers to prepare environment

    •   Unit testing only - Integration testing not possible

 Materialize Queries – BI/Reporting Only

    •   Limited to report development only

    •   Better performance for multiple query executions during
        reporting development (Cube, Grid & Report refreshes)

    •   SQL would not be consistent between environments

    •   Requires self service options to be implemented
Other Challenges
•   Development & Deployment Lifecycle

•   Image Management

    • Version Control

    • Library (Check in/out)

    • Refresh Interval

•   Migration and Project Planning
Support Considerations
•   DBA

•   Architecture

•   Self Service

•   External Groups

     • VMware systems

     • Storage

     • Network
Results
Achievements
•   Testing new features and functionality ahead of 13.10 upgrade

•   Conduct proof of concept testing with minimal impact to
    enterprise systems

•   Ability to perform preliminary process validation

•   Currently rolling out to ETL Developers

•   Endpoint Testing

     • Integration into ETL infrastructure

     • TTU Tools Validation
Successful Projects

•   Oracle Data Integrator (ODI/Sunopsis )

•   Hadoop UDFs – Full Development Lifecycle

    • Proof of Concept

    • Development

    • Testing & Validation

•   GoldenGate Testing & Validation
Lessons Learned and Takeaways

•   Security & Access Controls
     • Decide early on how to simplify roles and consolidate users
•   Mixed response from end users and developers
     • Perform limited Beta testing with mixed selection of users
•   Data seeding challenges
     • Perm Space Limitations
     • Refresh Method
     • Throughput
     • Refresh Intervals
•   System Requirements
     • IP Addresses
     • Hosts file entries
•   Keep it simple!
Hardware Resource Allocation
•   2 Core Minimum (x64 CPU)
•   2 GB Memory Minimum
     • 4 GB suggested for acceptable performance
•   Static IP Addresses
     • Encountered issues with VMware dynamically assigning IP
       addresses as images migrated between hosts
•   Physical Network Connections
     • Ensure adequate bandwidth for all Virtual Machines on the
       physical host
•   Storage
     • Tier 2 SAN (10K Serial Attached SCSI)
•   Workload Management
     • Disk I/O
     • Concurrent Users
•   Dedicated Virtual Environment
End User/Developer Considerations

•   Deployment procedures and processes
     • Well defined image life cycle & refresh interval
•   Training
     • Best Practices Guide
•   Constraints
     • Perm space
         • Limited Data Set
         • Storage
     • Performance
         • 2 Amps
         • Shared Hardware
     • Skew
•   Unit Testing Only
Development Lifecycle
Data Discovery
                     (Source)



   Refresh
                                    Checkout and
 Development
                                      Configure
Images (Virtual
                                   Virtual Machine
  Machine)




        Development Lifecycle

 Production                        Development &
 Deployment                         Unit Testing
 (Enterprise                          (Virtual
  Systems)                           Machine)


                    Integration
                   Testing & QA
                    (Enterprise
                     Systems)
Next Steps
Next Steps
•   Developer and End user Access Levels

    • System Level

    • Database Level

•   Automation

    • Master Image Updates

•   Viewpoint

    • Data Mover -> Self Service

•   Full Deployment

    • Decommission existing DEV environment
Next Steps – VMware Environment

•   SAN Storage
     • NetApp Storage Solution
         • Enables the ability to quickly clone Virtual Machines
           without consuming additional space
     • Capacity Planning
         • Thin Provisioning
•   VMware Lab Manager
     • Reduced administration
     • Self service portal
     • Better support for high turnover machines
     • Linked machines (de-duplication)
Self Service?
Virtual Self Service Environment
                                                          Developers
    Image 1              Users select an available
                         image from library
    Image 2
    Image 3
    Image 4
    Image 5               Image is copied from
    Image 6               library and place in
                                                               Image is started in
                          an available slot on
                                                               virtual environment
                          VMware Cluster
                                                               and ready for use

  Image Library




                                            Image 1      Image 2        Image 4
                                            Image 3      Image 1        Image 1
       Administrator                        Image 2                     Image 6
                                                         Image 5
Creates and Maintains Library
                                                         Snapshot
                                                      VMware Cluster
Summary

•   Provides isolated sandbox for testing and development
•   Safe and secure environment
•   Provided by Teradata
     • Supports Teradata Tools and Utilities (TTU)
     • Works with existing Teradata infrastructure
•   Scales easily with additional users & projects
•   1 TB perm space constraint can be challenging
•   Cannot be used for performance testing or tuning
•   Decreases loads on enterprise systems
•   Budget considerations
     • Runs on commodity hardware & disk
Questions?

cmonson@o.co
Resources

•   VMware vSphere TM -
    http://www.vmware.com/products/vsphere/overview.html

•   TD Express (Teradata Developer Exchange) -
    http://downloads.teradata.com/download/database/teradata-
    express/vmware
     • Requires Teradata Developer Exchange Login
     • Link to download TD Express software must be requested
        from Teradata Developer Exchange

More Related Content

What's hot

Microsoft Offical Course 20410C_12
Microsoft Offical Course 20410C_12Microsoft Offical Course 20410C_12
Microsoft Offical Course 20410C_12
gameaxt
 
VMWARE Professionals - Storage and Resources
VMWARE Professionals -  Storage and ResourcesVMWARE Professionals -  Storage and Resources
VMWARE Professionals - Storage and Resources
Paulo Freitas
 
Planning For Catastrophe with IBM WAS and IBM BPM
Planning For Catastrophe with IBM WAS and IBM BPMPlanning For Catastrophe with IBM WAS and IBM BPM
Planning For Catastrophe with IBM WAS and IBM BPM
WASdev Community
 

What's hot (20)

Microsoft Offical Course 20410C_12
Microsoft Offical Course 20410C_12Microsoft Offical Course 20410C_12
Microsoft Offical Course 20410C_12
 
VMworld 2013: IBM Solutions for VMware Virtual SAN
VMworld 2013: IBM Solutions for VMware Virtual SAN VMworld 2013: IBM Solutions for VMware Virtual SAN
VMworld 2013: IBM Solutions for VMware Virtual SAN
 
Skylark: Easy Cloud Computing
Skylark: Easy Cloud ComputingSkylark: Easy Cloud Computing
Skylark: Easy Cloud Computing
 
SQL Azure for ITPros
SQL Azure for ITProsSQL Azure for ITPros
SQL Azure for ITPros
 
Backup & recovery for exadata
Backup & recovery for exadataBackup & recovery for exadata
Backup & recovery for exadata
 
Windows 7 Feature Overview It Academic Day 2009
Windows 7 Feature Overview   It Academic Day 2009Windows 7 Feature Overview   It Academic Day 2009
Windows 7 Feature Overview It Academic Day 2009
 
VMworld 2013: Virtualizing Highly Available SQL Servers
VMworld 2013: Virtualizing Highly Available SQL Servers VMworld 2013: Virtualizing Highly Available SQL Servers
VMworld 2013: Virtualizing Highly Available SQL Servers
 
VMworld 2013: Automated Management of Tier-1 Applications on VMware
VMworld 2013: Automated Management of Tier-1 Applications on VMware VMworld 2013: Automated Management of Tier-1 Applications on VMware
VMworld 2013: Automated Management of Tier-1 Applications on VMware
 
DBA Basics guide
DBA Basics guideDBA Basics guide
DBA Basics guide
 
44spotkaniePLSSUGWRO_CoNowegowKrainieChmur
44spotkaniePLSSUGWRO_CoNowegowKrainieChmur44spotkaniePLSSUGWRO_CoNowegowKrainieChmur
44spotkaniePLSSUGWRO_CoNowegowKrainieChmur
 
Solving the DB2 LUW Administration Dilemma
Solving the DB2 LUW Administration DilemmaSolving the DB2 LUW Administration Dilemma
Solving the DB2 LUW Administration Dilemma
 
KoprowskiT_SPBizConference_2AMaDisasterJustBegan
KoprowskiT_SPBizConference_2AMaDisasterJustBeganKoprowskiT_SPBizConference_2AMaDisasterJustBegan
KoprowskiT_SPBizConference_2AMaDisasterJustBegan
 
VMWARE Professionals - Storage and Resources
VMWARE Professionals -  Storage and ResourcesVMWARE Professionals -  Storage and Resources
VMWARE Professionals - Storage and Resources
 
Planning For Catastrophe with IBM WAS and IBM BPM
Planning For Catastrophe with IBM WAS and IBM BPMPlanning For Catastrophe with IBM WAS and IBM BPM
Planning For Catastrophe with IBM WAS and IBM BPM
 
VMworld 2013: Demystifying VMware Mirage: Tips and Tricks for Success
VMworld 2013: Demystifying VMware Mirage: Tips and Tricks for Success VMworld 2013: Demystifying VMware Mirage: Tips and Tricks for Success
VMworld 2013: Demystifying VMware Mirage: Tips and Tricks for Success
 
Citrix CloudGateway 2.5 Technical Overview and Troubleshooting
Citrix CloudGateway 2.5 Technical Overview and TroubleshootingCitrix CloudGateway 2.5 Technical Overview and Troubleshooting
Citrix CloudGateway 2.5 Technical Overview and Troubleshooting
 
Windows 2012 R2 Multi Server Management
Windows 2012 R2 Multi Server ManagementWindows 2012 R2 Multi Server Management
Windows 2012 R2 Multi Server Management
 
What You Should Know About WebLogic Server 12c (12.2.1.2) #oow2015 #otntour2...
What You Should Know About WebLogic Server 12c (12.2.1.2)  #oow2015 #otntour2...What You Should Know About WebLogic Server 12c (12.2.1.2)  #oow2015 #otntour2...
What You Should Know About WebLogic Server 12c (12.2.1.2) #oow2015 #otntour2...
 
Living with the Oracle Database Appliance
Living with the Oracle Database ApplianceLiving with the Oracle Database Appliance
Living with the Oracle Database Appliance
 
Sql Server High Availability & DR Technologies
Sql Server High Availability & DR TechnologiesSql Server High Availability & DR Technologies
Sql Server High Availability & DR Technologies
 

Similar to Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandboxing

Cloud Hosting for Government Agencies: Drupal Platform as a Service
Cloud Hosting for Government Agencies: Drupal Platform as a ServiceCloud Hosting for Government Agencies: Drupal Platform as a Service
Cloud Hosting for Government Agencies: Drupal Platform as a Service
Acquia
 
Oracle Cloud DBaaS
Oracle Cloud DBaaSOracle Cloud DBaaS
Oracle Cloud DBaaS
Arush Jain
 
Vizuri Exadata East Coast Users Conference
Vizuri Exadata East Coast Users ConferenceVizuri Exadata East Coast Users Conference
Vizuri Exadata East Coast Users Conference
Isaac Christoffersen
 

Similar to Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandboxing (20)

Stay productive_while_slicing_up_the_monolith
Stay productive_while_slicing_up_the_monolithStay productive_while_slicing_up_the_monolith
Stay productive_while_slicing_up_the_monolith
 
VMworld 2013: Virtualizing Databases: Doing IT Right
VMworld 2013: Virtualizing Databases: Doing IT Right VMworld 2013: Virtualizing Databases: Doing IT Right
VMworld 2013: Virtualizing Databases: Doing IT Right
 
Cloud Hosting for Government Agencies: Drupal Platform as a Service
Cloud Hosting for Government Agencies: Drupal Platform as a ServiceCloud Hosting for Government Agencies: Drupal Platform as a Service
Cloud Hosting for Government Agencies: Drupal Platform as a Service
 
Best practices in Deploying SUSE CaaS Platform v3
Best practices in Deploying SUSE CaaS Platform v3Best practices in Deploying SUSE CaaS Platform v3
Best practices in Deploying SUSE CaaS Platform v3
 
VMworld Europe 2014: Virtualizing Databases Doing IT Right – The Sequel
VMworld Europe 2014: Virtualizing Databases Doing IT Right – The SequelVMworld Europe 2014: Virtualizing Databases Doing IT Right – The Sequel
VMworld Europe 2014: Virtualizing Databases Doing IT Right – The Sequel
 
John adams talk cloudy
John adams   talk cloudyJohn adams   talk cloudy
John adams talk cloudy
 
Docker for the enterprise
Docker for the enterpriseDocker for the enterprise
Docker for the enterprise
 
What's New for the Windows Azure Developer? Lots!!
What's New for the Windows Azure Developer?  Lots!!What's New for the Windows Azure Developer?  Lots!!
What's New for the Windows Azure Developer? Lots!!
 
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...
VMworld 2013: Three Advantages of Running Cloud Foundry in a VMware Private C...
 
How does Apache Pegasus (incubating) community develop at SensorsData
How does Apache Pegasus (incubating) community develop at SensorsDataHow does Apache Pegasus (incubating) community develop at SensorsData
How does Apache Pegasus (incubating) community develop at SensorsData
 
Data Lake and the rise of the microservices
Data Lake and the rise of the microservicesData Lake and the rise of the microservices
Data Lake and the rise of the microservices
 
What's new in JBoss ON 3.2
What's new in JBoss ON 3.2What's new in JBoss ON 3.2
What's new in JBoss ON 3.2
 
Oracle Cloud DBaaS
Oracle Cloud DBaaSOracle Cloud DBaaS
Oracle Cloud DBaaS
 
Running database infrastructure on containers
Running database infrastructure on containersRunning database infrastructure on containers
Running database infrastructure on containers
 
Citrix Synergy 2014: Going the CloudPlatform Way
Citrix Synergy 2014: Going the CloudPlatform WayCitrix Synergy 2014: Going the CloudPlatform Way
Citrix Synergy 2014: Going the CloudPlatform Way
 
Adding Value in the Cloud with Performance Test
Adding Value in the Cloud with Performance TestAdding Value in the Cloud with Performance Test
Adding Value in the Cloud with Performance Test
 
"Clouds on the Horizon Get Ready for Drizzle" by David Axmark @ eLiberatica 2009
"Clouds on the Horizon Get Ready for Drizzle" by David Axmark @ eLiberatica 2009"Clouds on the Horizon Get Ready for Drizzle" by David Axmark @ eLiberatica 2009
"Clouds on the Horizon Get Ready for Drizzle" by David Axmark @ eLiberatica 2009
 
Vizuri Exadata East Coast Users Conference
Vizuri Exadata East Coast Users ConferenceVizuri Exadata East Coast Users Conference
Vizuri Exadata East Coast Users Conference
 
Sql Start! 2020 - SQL Server Lift & Shift su Azure
Sql Start! 2020 - SQL Server Lift & Shift su AzureSql Start! 2020 - SQL Server Lift & Shift su Azure
Sql Start! 2020 - SQL Server Lift & Shift su Azure
 
Advanced Discussion on Cloud Formation
Advanced Discussion on Cloud FormationAdvanced Discussion on Cloud Formation
Advanced Discussion on Cloud Formation
 

Teradata Partners 2011 - Utilizing Teradata Express For Development And Sandboxing

  • 1. Utilizing Teradata Express for Development and Sandboxing Cameron Monson, Database Administrator O.co
  • 2. Agenda • Company Background and EDW Configuration • Why Virtualize? • Virtual Environment Details • Benefits • Challenges • Results • Next Steps • Questions
  • 3. About O.co • Initial focus on liquidating excess inventory and currently focus on offering brand-name merchandise at discount prices • Products offering has grown from less than 100 in 1999 to over 700,000 currently • Customer Service – Currently ranked #4 by National Retail Federation and American Express • O.co, also known as Overstock.com, is Your Savings Engine offering brand-name products. The company offers its customers an opportunity to shop for bargains conveniently, while offering its suppliers an alternative inventory distribution channel. O.co, headquartered in Salt Lake City, is a publicly traded company listed on the NASDAQ Global Market System and can be found online at www.overstock.com and www.o.co.
  • 4. Teradata at O.co • 2005 – 8 Node (8+0) 5400 • 2006 • 30 Node (24+6) 5450 (Production) • 6 Node (4+2) 5450 (Development/Test) • 2010 • 9 Node (6+3) 5600 (Production) • 2 Node (1+1) 5600 (Development/Test) • Supports mixed workload environment • TRM, DCM, SAS • Microstrategy • Ad-hoc Analytical and BI workload • ELT • GoldenGate • ODI • DMExpress
  • 6. Abstract & Motives • Test system currently functions as both development and test environments • Creates inconsistent object names between environments • Evaluating the limits and boundaries of new technology offerings • Enable Test system to more closely resemble production • Functional testing and development
  • 7. Other Considerations for Virtualization • Performance • Decreased load and demands on EDW systems • Availability • Additional flexibility for scheduling system maintenance • Isolation & Sandbox • Budget • Commodity hardware
  • 9. VMware ESXi • Installs directly on physical servers and partitions into multiple virtual machines that can be run simultaneously • Currently utilizing ESXi 4.0 cluster • Can clone and deploy additional images from master templates • Requires administrator interaction to clone and perform basic configuration steps • Provides workload management and automatic migration between hosts on cluster • “Plug & Play” install of new TD Express images • SAN Storage • Tier 2 SAN Storage (10K RPM SAS)
  • 10. TD Express • Free developer version of the Teradata Database • 1 TB Teradata Express 13.10 for VMware Player • Provided configuration includes • SLES 10 64-bit Linux • Teradata Database • Teradata Tools and Utilities (TTU) • Load & Unload Tools • Teradata’s EasyLoader tool • SQL Assistant Java Edition • Provided AS IS (Unsupported)
  • 12. Benefits • Secure • Data never outside corporate datacenters • Testing • Uncertified software ahead of upgrades • New features • BAR/Disaster Recovery • Able to connect externally using existing TTU • Developer Isolation & Sandboxes
  • 13. Additional Benefits • Educational playground • Additional exposure to internal processes and programs • New features & functionality • Growth & capacity planning • Additional hardware (Nodes) • Additional storage
  • 14. Environment Isolation • Allows for greater flexibility in testing and development • Reduces contention on frequently updated and accessed objects • Experimentation • New processes, programs & functionality • UDF’s • Safe • Snapshot functionality • Regression testing
  • 16. Database Administration Challenges • Synchronizing and Migrating DDL • TSET • Data Movement • BAR (Arcmain) • Data Mover • ETL (Fastload, TPT, Multiload, etc..) • Performance • Limited to 2 AMP’s • Hardware configuration • Shared hardware • Data Distribution/Skew
  • 17. Security & Access • Maintain an environment representative of the enterprise Teradata systems • Simplify roles and rights to limit maintenance overhead on multiple virtual machines • Consolidate users on virtual machines • Corporate security considerations (SSO, etc…)
  • 18. 1 TB Perm Space Limitation • Try to minimize base image size to limit disk space overhead • Allow developers to load larger datasets as needed • Varying projects require vastly different data sets
  • 19. Perm Space Limitation – Solution 1 Limited Data Set • Seed smaller base objects (Lookup/Dimensions) in full • Larger tables can be seeded partially or left empty (DDL only) • Maintaining referential integrity can be challenging • Still may not be able to provide enough data for functional testing in very large environments • May enable limited integration testing in DEV environment • Larger storage footprint
  • 20. Perm Space Limitation – Solution 2 Break Virtual Images into Subject Areas • Can be difficult to identify and maintain objects • Duplicate Objects • Same objects required across multiple subject areas (3NF) • Not feasible for all subject areas • Referential integrity across subject areas • Merging DDL changes back to trunk • Integration testing may not be possible
  • 21. Perm Space Limitation – Solutions 3 & 4 Ad hoc – Self Service • Allows for greater flexibility in data size • Requires self service options to limit administrative overhead • Additional work involved for developers to prepare environment • Unit testing only - Integration testing not possible Materialize Queries – BI/Reporting Only • Limited to report development only • Better performance for multiple query executions during reporting development (Cube, Grid & Report refreshes) • SQL would not be consistent between environments • Requires self service options to be implemented
  • 22. Other Challenges • Development & Deployment Lifecycle • Image Management • Version Control • Library (Check in/out) • Refresh Interval • Migration and Project Planning
  • 23. Support Considerations • DBA • Architecture • Self Service • External Groups • VMware systems • Storage • Network
  • 25. Achievements • Testing new features and functionality ahead of 13.10 upgrade • Conduct proof of concept testing with minimal impact to enterprise systems • Ability to perform preliminary process validation • Currently rolling out to ETL Developers • Endpoint Testing • Integration into ETL infrastructure • TTU Tools Validation
  • 26. Successful Projects • Oracle Data Integrator (ODI/Sunopsis ) • Hadoop UDFs – Full Development Lifecycle • Proof of Concept • Development • Testing & Validation • GoldenGate Testing & Validation
  • 27. Lessons Learned and Takeaways • Security & Access Controls • Decide early on how to simplify roles and consolidate users • Mixed response from end users and developers • Perform limited Beta testing with mixed selection of users • Data seeding challenges • Perm Space Limitations • Refresh Method • Throughput • Refresh Intervals • System Requirements • IP Addresses • Hosts file entries • Keep it simple!
  • 28. Hardware Resource Allocation • 2 Core Minimum (x64 CPU) • 2 GB Memory Minimum • 4 GB suggested for acceptable performance • Static IP Addresses • Encountered issues with VMware dynamically assigning IP addresses as images migrated between hosts • Physical Network Connections • Ensure adequate bandwidth for all Virtual Machines on the physical host • Storage • Tier 2 SAN (10K Serial Attached SCSI) • Workload Management • Disk I/O • Concurrent Users • Dedicated Virtual Environment
  • 29. End User/Developer Considerations • Deployment procedures and processes • Well defined image life cycle & refresh interval • Training • Best Practices Guide • Constraints • Perm space • Limited Data Set • Storage • Performance • 2 Amps • Shared Hardware • Skew • Unit Testing Only
  • 31. Data Discovery (Source) Refresh Checkout and Development Configure Images (Virtual Virtual Machine Machine) Development Lifecycle Production Development & Deployment Unit Testing (Enterprise (Virtual Systems) Machine) Integration Testing & QA (Enterprise Systems)
  • 33. Next Steps • Developer and End user Access Levels • System Level • Database Level • Automation • Master Image Updates • Viewpoint • Data Mover -> Self Service • Full Deployment • Decommission existing DEV environment
  • 34. Next Steps – VMware Environment • SAN Storage • NetApp Storage Solution • Enables the ability to quickly clone Virtual Machines without consuming additional space • Capacity Planning • Thin Provisioning • VMware Lab Manager • Reduced administration • Self service portal • Better support for high turnover machines • Linked machines (de-duplication)
  • 36. Virtual Self Service Environment Developers Image 1 Users select an available image from library Image 2 Image 3 Image 4 Image 5 Image is copied from Image 6 library and place in Image is started in an available slot on virtual environment VMware Cluster and ready for use Image Library Image 1 Image 2 Image 4 Image 3 Image 1 Image 1 Administrator Image 2 Image 6 Image 5 Creates and Maintains Library Snapshot VMware Cluster
  • 37. Summary • Provides isolated sandbox for testing and development • Safe and secure environment • Provided by Teradata • Supports Teradata Tools and Utilities (TTU) • Works with existing Teradata infrastructure • Scales easily with additional users & projects • 1 TB perm space constraint can be challenging • Cannot be used for performance testing or tuning • Decreases loads on enterprise systems • Budget considerations • Runs on commodity hardware & disk
  • 39. Resources • VMware vSphere TM - http://www.vmware.com/products/vsphere/overview.html • TD Express (Teradata Developer Exchange) - http://downloads.teradata.com/download/database/teradata- express/vmware • Requires Teradata Developer Exchange Login • Link to download TD Express software must be requested from Teradata Developer Exchange

Editor's Notes

  1. Talk about the process of Why O.co decided to virtualize and the process of getting to where we are.
  2. Talk about the process of Why O.co decided to virtualize and the process of getting to where we are.